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NOVEL NUCLEIC ACmS AND POLYPEPTmES 



1. TECHNICAL FIELD 

4 The present invention provides novel polynucleotides and proteins encoded by such 

polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

8 2. BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such as 
lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past 
decade. The now routine hybridization cloning and expression cloning techniques clone novel 

12 polynucleotides "directly" in the sense that they rely on information directly related to the 
discovered protein (i.e., partial DNA/amino acid sequence of the protein in the case of 
hybridization cloning; activity of the protein in the case of expression cloning). More recent 
"indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences 

1 6 based on the presence of a now well-recognized secretory leader sequence motif, as well as 

various PCR-based or low stringency hybridization-based cloning techniques, have advanced the 
state of the art by making available large numbers of DNA/amino acid sequences for proteins 
that are known to have biological activity, for example, by virtue of their secreted nature in the 

20 case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 
techniques, or by virtue of structural similarity to other genes of known biological activity. 

Identified polynucleotide and poljrpeptide sequences have numerous applications in, for 
example, diagnostics, forensics, gene mapping; identification of mutations responsible for 

24 genetic disorders or other traits, to assess biodiversity, and to produce many other types of data 
and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

28 The compositions of the present invention include novel isolated polypeptides, novel 

isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, 
cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic 
variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more 

32 epitopes present on such polypeptides, as well as hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including expression 
vectors, containing the polynucleotides of the invention, cells genetically engineei-ed to contain such 
polynucleotides and cells genetically engineered to express such polynucleotides. 
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The present invention relates to a collection or library of at least one novel nucleic acid 
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
hybridization (SBH), and in some cases, sequences obtained from one or more pubhc databases. 
4 The invention relates also to tlie proteins encoded by such polynucleotides, along with therapeutic, 
diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid 
sequencesare designatedas SEQ ID NOS: 1-13901. The polypeptides sequences are designated 
SEQIDNOS: 13902-27802. The nucleic acids and polypeptides are provided in tlie Sequence 
8 Listing. In the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is 
guanine; T is thymine; and N is any of the four bases. In the amino acids provided in the Sequence 
Listing, * corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences that 
12 hybridize to the complement of SEQ ID NO: 1-13901 under stringenthybridizationconditions; 
nucleic acid sequences which are allelic variants or species homologues of any of the nucleic add 
sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific 
domain or truncation ofthe peptides encoded by SEQIDNOS: 1-13901. A polynucleotide 
1 6 comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ 
ID NO: 1-13901 or a degenerate variant or firagment thereof. The identifying sequence can be 100 
base pairs in length. 

The nucleic acid sequences of the present invention also include the sequence information 

20 firom the nucleic acid sequences of SEQ ID NO: 1 - 13901 . The sequence information can be a 
segment of any one of SEQ ID NO: 1 - 13 90 1 that uniquely identifies or represents the sequence 
inforraationofSEQIDNOS: 1-13901. 

A collection as used in this application can be a collection of only one polynucleotide. The 

24 collection of sequence information or identifying information of each sequence can be provided on 
a nucleic add array. In one embodiment, segments of sequence information is provided on a 
nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed 
to detect full-match or mismatch to the polynucleotide that contains the segment. The collection 

28 can also be provided in a computer-readable format 

This invention also includes the reverse or direct complement of any of the nucleic add 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and 
host cells or organisms transformed with these expression vectors. Nucldc add sequences (or their 

32 reverse or direct complements) according to the invention have numerous applications in a variety 
of techniques known to those skilled in the art of molecular biology, such as use as hybridization 
probes, use as primers for PGR, use in an array, use in computer-readablemedia, use in sequencing 

2 
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full-length genes, use for chromosome and gene mapping, use in the recombinant production of 
protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like. 
In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1 -13901 or novel 

4 segments or parts of the nucleic acids of the invention are used as primers in expression assays that 
are well known in the art. In a pailicularly preferred embodiment, the nucleic acid sequences of 
SEQ ID NO: 1-13901 or novel segments or parts of the nucleic acids provided herein are used in 
diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath 

8 et al., Science 258:52-59 (1 992), as expressed sequence tags for physical mapping of the human 
genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-13901; a 

12 polynucleotide comprising any of the ftill length protein coding sequences of SEQ ID NOS: 1- 
13901; and a polynucleotide comprising any of the nucleotide sequences of the mature protein 
coding sequences of SEQ ID NOS : 1 -1 3901 . The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that hybridizes under stringent hybridization 

1 6 conditions to (a) the complement of any one of the nucleotide sequences set forth in SEQ ID NO : 1 - 
1 3 90 1 ; (b) a nucleotide sequence encoding any one of the amino acid sequences set forth in the 
Sequence Listing (e.g., SEQ ID NOS: 13902-27802); (c) a polynucleotide which is an allelic variant 
of any polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g. 

20 orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of any of the pol3rpeptides comprising an amino acid 
sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 

24 comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding 
fiill length or mature protein. Polypeptides of the invention also include polypeptides with 
biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence 
set forth in SEQ ID NO: 1- 1390 1 ; or (b) polynucleotidesthat hybridize to the complement of the 

28 polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically 
active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial 
equivalents" thereof {e.g , with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% 
amino acid sequence identity) tiiat preferably retain biological activity are also contemplated. The 

32 polypeptides of the invention may be wholly or partially chemically synthesized but are preferably 
produced by recombinant means using the genetically engineered cells (e.g. host cells) of the 
invention. 

3 
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The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of tlie invention may further comprise an acceptable carrier, such as a 
hydrophilic, e.g., pharmaceutically acceptable, carrier. 
4 The invention also provides host ceils transformed or transfected with a polynucleotide of 

the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
8 xmder conditions permitting expression of the desired polypeptide, and purifying the polypeptide 
from the culture or from the host cells. Preferred embodiments include those in which the 
protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
12 techniques known to those skilled in the art of molecular biology. These techniques include use 
as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene 
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA 
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is 
1 6 largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used 
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample 
using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
20 expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et al.. Science 258:52-59 (1992), as expressed sequence tags for 
physical mappiag of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional 
24 procedures and methods that are currently applied to other proteins. For example, a polypeptide 
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such 
antibodies, particularly monoclonal antibodies, are useful for detecting or quantitatiiig the 
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight 
28 markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical condition 
which comprises the step of administering to a mammalian subject a therapeutically effective 
amount of a composition comprising a polypeptide of the present invention and a 
32 pharmaceutically acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be utilized, for 
example, in methods for the prevention and/or treatment of disorders involving aberrant protein 
ejqpression or biological activity. 

4 
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The present invention further relates to methods for detecting the presence of tlie 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be 
utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the 

4 identification of subjects exhibiting a predisposition to such conditions. The invention provides 
a method for detecting the polynucleotides of the invention in a sample, comprising contacting 
the sample with a compoimd tliat binds to and forms a complex with the polynucleotide of 
interest for a period sufficient to form the complex and under conditions sufScient to form a 

8 complex and detecting the complex such that if a complex is detected, the polynucleotide of 
interest is detected. The invention also provides a method for detecting the polj^eptides of the 
invention in a sample comprising contacting the sample with a compound that binds to and forms 
a complex with tlie polypeptide under conditions and for a period sufficient to form the complex 
12 and detecting the formation of tlie complex such that if a complex is formed, the poljrpeptide is 
detected. 

The invention also provides kits comprising polynucleotide probes and/or monoclonal 
antibodies, and optionally quantitative standards, for carrying out methods of the invention. 
16 Furthermore, the invention provides methods for evaluating the efficacy of drugs, and 

monitoring the progress of patients, involved in clinical trials for the treatment of disorders as 
recited above. 

The invention also provides methods for the identification of compotmds that modiilate 

20 (?. e. , increase or decrease) the expression or activity of the polynucleotides and/or polypeptides 
of the invention. Such methods can be utilized, for example, for the identification of compotmds 
that can ameliorate s3anptoms of disorders as recited herein. Such methods can include, but are 
not limited to, assays for identijfying compotmds and other substances that interact vidth {e.g., 

24 bind to) the polypeptides of the invention. The invention provides a method for identifying a 
compound that binds to the polypeptides of the invention comprising contacting the compound 
with a polypeptide of the invention in a cell for a time sufficient to form a 
polypeptide/compound complex, wherein the complex drives expression of a reporter gene 

28 sequence in the cell; and detecting the complex by detecting the reporter gene sequence 

expression such that if expression of the reporter gene is detected the compound the binds to a 
polypeptide of the invention is identified. 

The methods of the invention also provides methods for treatment which involve the 

32 administration of the polynucleotides or polypeptides of the invention to individuals exhibiting 
symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or 
disorders as recited herein comprising administering compotmds and other substances that 
modulate the overall activity of the target gene products. Compounds and other substances can 
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effect such modulation either on the level of target gene/protein e)q)ression or target protein 
activity. 

The polj^eptides of the present invention and the polynucleotides encoding them are also 
4 use&l for the same functions known to one of skill in the art as the polypeptides and 

polynucleotides to which they have homology (set forth in Table 2). If no homology is set forth 
for a sequence, then the polypeptides and polynucleotides of the present invention are useful for 
a variety of applications, as described herein, including use in arrays for detection. 

8 

4. DETAILED DESCRIPTION OF THE INVENTION 

12 4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms "a", 
"an" and "the" include plural references unless the context clearly dictates otherwise. 

The term "active" refers to those forms of the polypeptide which retain the biologic 

16 and/or immunologic activities of any naturally occurring polypeptide. According to the 

invention, the terms "biologically active" or "biological activity" refer to a protein or peptide 
having stnictural, regulatory or biochemical functions of a naturally occurring molecule. 
Likewise "immunologically active" or "immunological activity" refers to the capability of the 

20 natural, recombinant or synthetic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are engaged in 
extracellular or intracellular membrane trafficking, including the export of secretory or 

24 enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5'. Complementarity between two single-stranded molecules 

28 may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between the nucleic add strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. 

32 The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 

differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line 
stem cells (GSCs)" refers to stem cells derived firom primordial stem cells that provide a steady 
and continuous source of germ cells for the production of gametes. The term "primordial germ 

6 
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cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly 
from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 
differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells 
4 are derived The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells 
not only populate the germ Hne and give rise to a plurality of terminally differentiated cells that 
comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides which 
8 modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs 
include, but are not limited to, promoters, and promoter modulating sequences (inducible 

12 elements). One class of EMFs are nucleic acid fragments which induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
"oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 

16 sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense sti and, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the 
sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A, C, G or T 

20 (U). It is contemplated that where the polynucleotide is RNA, the T (thjTuine) in the sequences 
provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this 
invention may be assembled from fragments of the genome and short oligonucleotide linkers, or 
from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic 

24 acid which is capable of being expressed in a recombinant transcriptional unit comprising 
regulatory elements derived firom a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
"segment" or "probe" or "prunei" are used interchangeably and refer to a sequence of nucleotide 

28 residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, 
more preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and 
most preferably at least about 17 nucleotides. The fragment is preferably less than about 500 
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100 

32 nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides, 
preferably from about 15 to about 50 nucleotides, more preferably from about 17 to 30 
nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can 
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be used in polymerase chain reaction (PGR), various hybridization procedures or microarray 
procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A 
fragment or segment may uniquely identify each polynucleotide sequence of the present 
4 invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ 
IDNOs: 1-13901 . 

Probes may, for example, be used to determine whether specific mRNA molecules are 
present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as 
8 described by Walsh et al. (Walsh, P.S. et al., 1992, PGR Methods Appl 1 :241-250). They may 
be labeled by nick translation, Klenow fill-in reaction, PGR, or other methods well known in the 
art. Probes of the present invention, their preparation and/or labeling are elaborated in 
Sambrook, J. et al., 1 989, Molecular- Cloning: A Laboratory Manual, Cold Spring Harbor 
12 Laboratory, NY; or Ausubel, F.M. et al., 1989, Cunent Protocols in Molecular Biology, John 
Wiley & Sons, New York NY, both of which are incorporated herein by reference in their 
entirety. 

The nucleic acid sequences of the present invention also include the sequence 

1 6 information from the nucleic acid sequences of SEQ ID NOS : 1 - 1 390 1 . The sequence 

information can be a segment of any one of SEQ ID NO: 1-13901 that uniquely identifies or 
represents the sequence information of that sequence of SEQ ID NOS: 1 -1 3901 . One such 
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty-mer is 

20 fiilly matched in the human genome is 1 in 300. In the human genome, there are three biUion 
base pairs in caie set of chromosomes. Because 4^° possible twenty-mers exist, there are 300 
times more twenty-mers than there are base pairs in a set of human chromosomes. Using the 
same analysis, the probability for a seventeen-mer to be ftilly matched in the human genome is 

24 approximately 1 in 5. When these segments are used in arrays for expression studies, fifteen- 
mer segments can be used. The probability that the fifteen-mer is fiilly matched in the expressed 
sequences is also approximately one in five because expressed sequences comprise less than 
approximately 5% of the entire genome sequence. 

28 Similarly, when using sequence information for detecting a single mismatch, a segment can 

be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome 
with a single mismatch is calculated by multiplying the probability for a fiill match (1 ^^^) times the 
increased probability for mismatch at each nucleotide position (3 x 25). The probabilitylhat an 

32 eighteen mer with a single mismatch can be detected in an array for expression studies is 
approxunately one in five . The probability that a twenty-mer with a single mismatch can be 
detected in a human genome is approximately one in five. 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 



PCT/USOl/04927 



The term "open reading frame," ORF, means a series of nucleotide triplets coding for 
amino acids.without any termination codons and is a sequence translatable into protein. 

The terms "operably linked" or "operably associated" refer to functionally related nucleic 
4 acid sequences. For example, a promoter is operably associated or operably linked with a coding 
sequence if the promoter controls the transcription of the coding sequence. While operably 
linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic 
elements e.g. repressor genes are not contiguously linked to the coding sequence but still control 
8 transcription/translation of the coding sequence. 

The term "pluripotenf refers to the capability of a cell to differentiate into a number of 
differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its 
differentiation capability in comparison to a totipotent cell. 
12 The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, 

peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or 
synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino 
acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more 
1 6 preferably at least about 9 amino acids and most preferably at least about 1 7 or more amino 
acids. The peptide preferably is not greater than about 200 amino acids, more preferably less 
than 1 50 amino acids and most preferably less than 100 amino acids. Preferably the peptide is 
from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient 
20 length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that 
have not been genetically engineered and specifically contemplates various polypeptides arising 
firom post-translational modifications of the polypeptide including, but not limited to, 
24 acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

The term "translated protein coding portion" means a sequence which encodes for the fiill 
length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a peptide 
28 or protein without a signal or leader sequence. The "mature protein portion" means that portion 
of the protein which does not include a signal or leader sequence. The peptide may have been 
produced by processing in the cell which removes any leader/signal sequence. The mature 
protein portion may or may not include the initial methionine residue. The methionine residue 
32 may be removed from the protein during processing in the cell. The peptide may be produced 
synthetically or the protein may have been produced using a polynucleotide only encodmg for 
the mature protein coding sequence. 
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The term "derivative" refers to polypeptides chemically modified by such techniques as 
ubiquitination, labeling {e.g., with radionuclides or various en2ymes), covalent polymer 
attachment such as pegylation (derivatization vnth polyethylene glycol) and insertion or 
4 substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur 
in human proteins. 

The term "variant"(or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., 
8 recombinant DNA techniques. Guidance in determining which amino acid residues may be 
replaced, added or deleted without abolishing activities of interest, may be found by comparing 
the sequence of the particular polypeptide with that of homologous peptides and minimizing the 
number of amino acid sequence changes made in regions of high homology (conserved regions) 

12 or by replacing amino acids with consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides may be 
synthesized or selected by making use of the "redundancy" in the genetic code. Various codon 
substitutions, such as the silent changes which produce various restriction sites, may be 

16 introduced to optimize cloning into a plasmid or viral vector or expression in a particular 

prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in 
the polypeptide or domains of other peptides added to the polypeptide to modify the properties of 
any part of the polypeptide, to change characteristics such as ligand-bindmg aflEinities, interchain 

20 affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are tiie result of replacing one amino acid with 
another amino acid having sinailar structural and/or chemical properties, i.e., conservative amino 
acid replacements. "Conservative" amino acid substitutions may be made on the basis of 

24 similarity in polarity, charge, solubiUty, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar 
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

28 glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and 
negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or 
"deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 
amino acids. The variation allowed may be experimentally determined by systematically making 

32 insertions, deletions, or substitutions of amino acids in a polypeptide molecule using 

recombinant DNA techniques and assaying the resulting recombinant variants for activity. 

Alternatively, where alteration of ftmction is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such alterations 
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can, for example, alter one or more of the biological functions or biochemical characteristics of 
the polypeptides of the invention. For example, such alterations may change polypeptide 
characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover 
4 rate. Further, such alterations can be selected so as to generate polypeptides that are better suited 
for expression, scale up and the like m the host cells chosen for expression. For example, 
cysteine residues can be deleted or substituted with another amino acid residue in order to 
eliminate disulfide bridges. 
8 The terms "purified" or "substantially purified" as used herein denotes that the indicated • 

nucleic acid or polypeptide is present in the substantial absence of other biological 
macromolecules, e.g^., polynucleotides, proteins, and the like. In one embodiment, the 
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more 

12 preferably at least 99% by weight, of ttie indicated biological macromolecules present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight of less than 
1 000 daltons, can be present). 

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from 

16 at least one other component (e. g. , nucleic acid or polypeptide) present with the nucleic acid or 
jiolypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in 
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a 
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or 

20 polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian) 
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in 

24 bacterial or fimgal (e.g. , yeast) expression systems. As a product, "recombinant microbial" 
defines a polypeptide or protein essentially free of native endogenous substances and 
unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most 
bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or 

28 proteins expressed in yeast will have a glycosylation pattern in general dijfferent from those 
expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus 
or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can 

32 comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements 

having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural 
or coding sequence which is transcribed into mRNA and translated into protein, and (3) 
appropriate transcription initiation and termination sequences. Structural units intended for use 
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in yeast or eukat>'otic expression systems preferably include a leader sequence enabling 
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant 
protein is expressed without a leader or transport sequence, it may include an amino terminal 
4 methionine residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final product. 

The term "recombinant expression system" means host cells which have stably integrated 
a recombinant transcriptional unit into chromosomal DNA or cany the recombinant 
8 transcriptional unit extrachiomosomally. Recombinant expression systems as defined herein will 
express heterologous polypeptides or proteins upon induction of the regulatory elements linked 
to the DNA segment or synthetic gene to be expressed. This term also means host cells which 
have stably integrated a recombinant genetic element or elements having a regulatory role in 

12 gene expression, for example, promoters or enhancers. Recombinant expression systems as 

defined herein will express polypeptides or proteins endogenous to the cell upon induction of the 
regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells 
can be prokaryotic or eukaryotic. 

16 The term "secreted" includes a protein that is transported across or through a membrane, 

including transport as a result of signal sequences in its amino acid sequence v^iien it is 
expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted 
wholly (e.g., soluble proteins) or partially (e.g. , receptors) from the cell in which they are 

20 expressed. "Secreted" proteins also include without limitation proteins that are transported 
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to 
include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, 
P.A. and Young, P.R. (1992) Cytokine 4(2):134 -143) and factors released from damaged cells 

24 (e.g. Interleukia-1 Receptor Antagonist, see Arend, W.P. et. al. (1998) Aimu. Rev. Immunol. 
16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence 

28 may be naturally present on the polypeptides of the present invention or provided from 
heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood in the 
art as stringent. Stringent conditions can include highly stringent conditions (i.e., hybridization 

32 to fflter-bound DNA in 0.5 M NaHP04, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 
65°C, and washing in O.IX SSC/0.1% SDS at 68''C), and moderately stringent conditions (i.e., 
washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization conditions are 
described herein in the examples. 
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In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent 
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for 
14-base oligonucleotides), 48°C (for 17-base oligos), 55°C (for20-base oligonucleotides), and 
4 60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" can refer both to nucleotide and amino acid 
sequences, for example a mutant sequence, that varies from a reference sequence by one or more 
substitutions, deletions, or additions, the net effect of which does not result in an adverse 
8 functional dissimilarity between the reference and subject sequences. Typically, such a 

substantially equivalent sequence varies from one of those listed herein by no more than about 
35% (i.e., the number of individxial residue substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the corresponding reference sequence, divided 

12 by the total number of residues in the substantially equivalent sequence is about 0.35 or less). 
Such a sequence is said to have 65% sequence identity to the listed sequence. . In one 
embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a 
listed sequence by no more than 30% (70% sequence identity); in a variation of this 

16 embodiment, by no more than 25% (75% sequence identity); and in a further variation of tliis 
embodiment, by no more than 20% (80% sequence identity) and in a flirther variation of this 
embodiment, by no more than 10% (90% sequence identity) and in a further variation of this 
embodiment, by no more that 5% (95% sequence identity). Substantially equivalent, e.g., 

20 mutant, amino acid sequences according to the invention preferably have at least 80% sequence 
identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more 
preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably 
at least 98% identity, and most prefei-ably at least 99% identity. Substantially equivalent 

24 nucleotide sequences of the invention can have lower percent sequence identities, taking into 
accoimt, for example, the redundancy or degeneracy of the genetic code. Preferably, nucleotide 
sequence has at least about 65% identity, more preferably at least about 75% identity, more 
preferably at least about 80% sequence identity, more preferably at least about 85% sequence 

28 identity, more preferably at least about 90% sequence identity, and most preferably at least about 
95% identity, more preferably at least about 98% sequence identity, and most preferably at least 
about 99% sequence identity. For the purposes of the present invention, sequence;s having 
substantially equivalent biological activity and substantially equivalent expression characteristics 

32 are considered substantially equivalent. For the purposes of determining equivalence, truncation 
of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be 
disregarded. Sequence identity may be determined, e.g. , using the Jotun Hein method (Hein, J. 
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(1990) Methods Enzymol. 183:626-645). Identity between sequences can also be detemiined by 
other methods known in the art, e.g. by varying hybridization conditions. 

The term ''totipotent" refers to the capability of a cell to differentiate into all of the cell 

4 types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that the 
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The 
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether 
8 or not any coding sequences are in fact expressed. The term "infection" refers to the introduction 
of nucleic acids into a suitable host cell by use of a virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides 
which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified 
12 using known UMFs as a target sequence or target motif with the computer-based systems 
described below. The presence and activity of a UMF can be confirmed by attaching the 
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated 
with an appropriate host under appropriate conditions and the uptake of the marker sequence is 
1 6 determined. As described above, a UMF will increase the frequency of uptake of a linked 
marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless the 
context dictates otherwise. 

20 

4.2 NUCLEIC ACmS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

The isolated pol3mucleotides of the invention include a polynucleotide comprising the 

24 nucleotide sequences of SEQ ID NO: 1-13901 ; a polynucleotide encoding any one of the 
peptide sequences of SEQ ID NOS: 13902-27802; and a polynucleotide comprising the 
nucleotide sequence encoding the mature protein coding sequence of the polypeptides of any one 
of SEQ ID NOS : 1 3902-27802. The polynucleotides of the present invention also include, but 

28 are not limited to, a polynucleotide that hybridizes under stringent conditions to (a) the 
complement of any of the nucleotides sequences of SEQ ID NO: 1-13901; (b) nucleotide 
sequences encoding any one of the amino acid sequences set forth in tihe Sequence Listing; (c) a 
polynucleotide which is an alleHc variant of any polynucleotide recited above; (d) a 

32 pol3mucleotide which encodes a species homolog of any of the proteins recited above; or (e) a 
polynucleotide that encodes a polypeptide comprising a specific domain or truncation of the 
polypeptides of SEQ ID NOS: 13902-27802. Domains of interest may dqpend on tiie nature of 
the encoded polypeptide; e.g., domains in receptor-like polypeptides include ligand-binding. 
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extracellular, transmembrane, or cytoplasmic domains, or combinations thereof; domains in 
immunoglobulin-like proteins include tlie variable immunoglobulin-like domains; domains in 
enzyme-like polypeptides include catalytic and substrate binding domains; and domains ia ligand 
4 polypeptides include receptor-binding domains. 

The polynucleotides of the invention include naturally occurring or wholly or partially 
synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 
8 region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences disclosed 
herein. The corresponding genes can be isolated in accordance with known methods using the 
sequence information disclosed herein. Such methods include the preparation of probes or primers 

1 2 from the disclosed sequence information for identification and/or amplification of genes in 

appropriate genomic libraries or other sources of genomic materials. Further 5' and 3' sequence can 
be obtained using methods known in the art. For example, fall length cDNA or genomic DNA that 
corresponds to any of the polynucleotides of SEQ ID NO: 1-13901 can be obtained by screening 

1 6 appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of 
the polynucleotides of SEQ ID NO: 1-13901 or a portion thereof as a probe. Alternatively, the 
polynucleotides of SEQ ID NO: 1-13901 may be used as the basis for suitable primer(s) that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

20 The nucleic acid sequences of the invention can be assembled from ESTs and sequences 

(including cDNA and genomic sequences) obtained from one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, 
representative fragment or segment information, or novel segment mformationfor the full-length 

24 gene. 

The pol3mucleotides of the invention also provide polynucleotides including nucleotide 
sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides 
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 

28 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 
88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at 
least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above. 

Included within the scope of the nucleic acid sequences of the invention are nucleic acid 

32 sequence fragments that hybridize imder stringent conditions to any of the nucleotide sequences 
of SEQ ID NO: 1-13901, or complements thereof, which fragment is greater than about 5 
nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most 
preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides or more that 
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are selective for {i.e. specifically hybridize to any one of the polynucleotides of the invention) 
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can 
differentiate polynucleotide sequences of the invention from other polynucleotide sequences in 
4 the same family of genes or can differentiate human genes from genes of other species, and are 
preferably based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to these 
specific sequences, but also include allelic and species variations thereof. AUeHc and species 
8 variations can be routinely determined by comparing the sequence provided SEQ ID NO: 1-13901, 
a representative fiagment thereof, or a nucleotide sequence at least 90% identical, preferably 95% 
identical, to SEQ ID NO: 1-13901 with a sequence from another isolate of the same species. 
Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules 

1 2 coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words, 
in the coding region of an ORF , substitution of one codon for another codon that encodes the same 
amino acid is expressly contemplated. 

The nearest neighbor or homology result for the nucleic acids of the present invention, 

16 including SEQ ID NO: 1-13901 can be obtained by searcliing a database using an algorithm or a 
program. Preferably, a BLAST which stands for Basic Local Alignment Search Tool is used to 
search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1 993) and Altschul 
S.F. et al. J. Mol. BioL 21 :403-410 (1990)). Alternatively a FAS TA version 3 search against 

20 Genpept, using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also 
provided by the present invention. Species homologs may be isolated and identified by making 
suitable probes or primers from the sequences provided herein and screening a suitable nucleic 

24 acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which also 
encode proteins which are identical, homologous or related to that encoded by the 

28 polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences which 
encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducirg appropriate nucleotide changes into a 

32 native or variant poljmucleotide. There are two variables in the construction of amino acid 
sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids 
encoding the amino acid sequence variants are preferably constructed by mutating the 
polynucleotide to encode an amino acid sequence that does not occvir in nature. These nucleic 
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acid alterations can be made at sites that differ in the nucleic acids from different species 
(variable positions) or in highly conserved regions (constant regions). Sites at such locations 
will typically be modified in series, e.g., by substituting first with conservative choices {e.g., 
4 hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant 

choices [e.g., hydrophobic amino acid to a charged amino acid), and then deletions or insertions 
may be made at the target site. Amino acid sequence deletions generally range from about 1 to 
30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid 
8 insertions include amino- and/or carboxyl-terminal frisions ranging in length from one to one 
hundred or more residues, as well as intrasequence insertions of single or multiple amino acid 
residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, 
preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal 

12 sequences necessary for secretion or for intracellular targeting in different host cells and 

sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein. 

In a prefened method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method uses' oligonucleotide sequences to alter a 

16 polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 

nucleotides on botli sides of the changed amino acid to form a stable duplex on either side of the 
site of being changed. In general, the techniques of site-directed mutagenesis are well known to 
those of skill in the art and this technique is exemplified by pubhcations such as, Edehnan et al., 

20 DNA 2:183 (1983). A versatile and efEcient method for producing site-specific changes in a 
polynucleotide sequence was published by ZoUer and Smith, Nucleic Acids Res. 10:6487-6500 
(1982). PGR may also be tised to create amino acid sequence variants of the novel nucleic acids. 
When smaU amounts of template DNA are used as starting material, primer(s) that differs 

24 slightly in sequence firom the corresponding region in the template DNA can generate the desired 
amino acid variant. PGR amplification results in a population of product DNA fragments that 
differ from the polynucleotide template encoding the polypeptide at the position specified by the 
primer. The product DNA fragments replace the corresponding region in the plasmid and this 

28 gives a polynucleotide encoding the desired amino acid variant. 

A fijrther technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et al.. Gene 34:315 (1985); and other mutagenesis techniques well 
known in the art, such as, for example, the techniques m Sambrook et al., supra, and Current 

32 Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic 
code, other DNA sequences which encode substantially the same or a fLmctionally equivalent 
amino acid sequence may be used in the practice of the invention for the cloning and e5q)ression 
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of these novel nucleic acids. Such DNA sequences include those which are capable of 
hybridizing to the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide trimcations of the invention can be used 
4 to generate polynucleotides encoding chimeric or fusion proteins comprising one or more 
domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of the 
polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or 
8 synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known 
to those of skill in the art and can include, for example, methods for determining hybridization 
conditions that can routinely isolate polynucleotides of the desired sequence identities. 

In accordance with the invention, poljmucleotide sequences comprising the mature 

12 protein coding sequences corresponding to any one of SEQ ID NO: 1-13901 , or functional 
equivalents thereof, may be used to generate recombinant DNA molecules that direct the 
expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also 
included are the cDNA inserts of any of the clones identified herein. 

16 A polynucleotide according to the invention can be joined to any of a variety of other 

nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Use&l 
nucleotide sequences for joiioing to poljoiucleotides include an assortment of vectors, e.g., 

20 plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the 
art. Accordingly, the invention also provides a vector including a polynucleotide of the 
invention and a host cell containing the polynucleotide. In general, the vector contains an origin 
of replication functional in at least one organism, convenient restriction endonuclease sites, and a 

24 selectable marker for the host cell. Vectors according to the invention include expression 
vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell 
according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular 
organism or part of a multicellular organism. 

28 The present invention further provides recombinant constructs comprising a nucleic acid 

having any of the nucleotide sequences of SEQ ID NO: 1- 13901 or a firagment thereof or any 
other polynucleotides of the invention. In one embodiment, the recombinant constructs of the 
present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid 

32 having any of the nucleotide sequences of SEQ ID NO: 1-13901 or a fragment thereof is 

inserted, in a forward or reverse orientation. In the case- of a vector comprising one of the ORFs 
of the present invention, the vector may further comprise regulatory sequences, including for 
example, a promoter, operably linked to the ORF. Large numbers of suitable vectors and 
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promoters are known to those of skill in the art and are commercially available for generating the 
recombinant constructs of the present invention. The following vectors are provided by way of 
example. Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, 
4 pNHl 8a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). 
Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVKS, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
8 control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al, 
Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. Many 
suitable expression control sequences are known in the art. General methods of expressing 
recombinant proteins are also knovm and are exemplified in R. Kaufman, Methods in 

12 Enzymology 1 85, 537-566 (1990). As defined herein "operably linked" means that the isolated 
polynucleotide of the invention and an expression control sequence are situated within a vector 
or cell in such a way that the protein is expressed by a host cell which has been transformed 
(transfected) with the ligated polynucleotide/expression control sequence. 

16 Promoter regions can be selected firom any desired gene using CAT (chloramphenicol 

transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, 
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine 

20 kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of 
the appropriate vector and promoter is well within the level of ordinary skill in the art. 
Generally, recombinant expression vectors will include origins of replication and selectable 
markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli 

24 and 5. cereviside TRPl gene, and a promoter derived ftom a highly-expressed gene to direct 

transcription of a downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3-phDsphoglycerate kinase (PGK), a-factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 

28 assembled in appropriate phase with translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing secretion of translated protein into the 
periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desii^d 

32 characteristics, e.g. , stabilization or simplified purification of expressed recombinant product 
Usefial expression vectors for bacterial use are constructed by inserting a structural DNA 
sequence encoding a desired protein together with suitable translation initiation and termination 
signals in operable reading phase with a fimctional promoter. The vector will comprise one or 
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more phenotypic selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coU, Bacillus subtilis. Salmonella typhimurium and various species 
4 witliin the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be 
employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived from commercially 
8 available plasmids comprising genetic elements of the well known cloning vector pBR322 
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural 

12 sequence to be expressed. Follovnng transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected promoter is induced or derepressed by 
appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an 
additional period. Cells are typically harvested by centrifugation, disrupted by physical or 

1 6 chemical means, and the resulting crude extract retained for further purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et aL, Nat. Biotech. 17:870-872 (1999), incorporated herein by 
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 

20 against the encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intramuscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form of 
naked DNA. 

24 

4.3 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 

28 sequence of SEQ ID NO: 1-13901 , or fragments, analogs or derivatives thereof. An "antisense" 
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid 

32 molecules are provided that comprise a sequence complementary to at least about 1 0, 25, 50, 

100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid 
molecules encodirg fragments, homologs, derivatives and analogs of a protein of any of SEQ ID 
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NOS: 13902-27802 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ 
ID NO: 1-13901 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
4 of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers 
to the region of the nucleotide sequence comprising codons which are translated into amino acid 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence of the invention. The term 
8 "noncoding region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5' and 3' imtranslated regions). 

Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., SEQ 
ID NO: 1-13901 ), antisense nucleic acids of the invention can be designed according to tlie rules 

12 of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 

complementary to the entire coding region of a mRNA, but more preferably is an oligonucleotide 
that is antisense to only a portion of the coding or noncoding region of a mRNA. For example, 
the antisense oligonucleotide can be complementary to the region sunounding the translation 

16 start site of a mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 
30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis or enzymatic ligation reactions using procedures known in 
the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be 

20 chemically synthesized using naturally occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 

24 Examples of modified nucleotides that can be used to generate the antisense nucleic acid 

include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, S-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 

28 inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosme, 2,2-dimethylguamne, 

2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 

32 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyI-2-thiouracil, 2-thiouracil, 4-thiouraciI, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 

3- (3-aniino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
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antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (Le., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
4 described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such tliat they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
8 protein, e.g., by inhibiting franscription and/or translation. The hybridization can be by 

conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of 
an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 

12 nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., 

16 by Hnking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the 

20 control of a strong pol II or pol IE promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
-a n omeric nucleic acid molecule. An -a nomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual -uni ts, the 

24 strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res\5: 6625-6641). The 
antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al. 
(1987) Nucleic Acids Res 15: 6131-6148) or achimeric RNA -DNA analogue (Inoue et al. (1987) 
FEES Lett 215: 327-330). 

28 

4.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catals^tic RNA molecules with ribonuclease activity that are capable of cleaving a 
32 single-stranded nucleic acid, such as a mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave a mRNA transcripts to thereby inhibit 
translation of a roKNA. A ribozyme having specificity for a nucleic acid of the invention can be 
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designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO: 1- 
13901 ). For example, a derivative of a Tetrahjrmena L-19 IVS RNA can be constructed in 
which the nucleotide sequence of the active site is complementary to the nucleotide sequence to 
4 be cleaved in a SECX-encoding mRNA. See, e.g., Cech et al. U.S. Pat. No. 4,987,071; and 
Cech et al. U.S. Pat No. 5,1 16,742. Alternatively, SECX mRNA can be used to select a 
cataljrtic RNA having a specific ribonuc lease activity from a pool of RNA molecules. See, e.g. , 
Bartel et al, (1993) Science 261:1411-1418. 
8 Alternatively, gene expression can be inhibited by targeting nucleotide sequences 

complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally, Helene. (1991) 
Anticancer Drug Des. 6: 569-84; Helene. et al. (_1992) Ann. N.Y. Acad. Sci. 660:27-36; and 

12 Maher (1992) Bioassays 14: 807-15. 

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or 
solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic 

16 acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) Bioorg Med 

Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g. , DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 

20 backbone of PNAs has been shown to allow for specific hybridi2ation to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above; 
Perry-O'Keefee/flr/. (1996)PiV^^593: 14670-675. 

24 PNAs of the invention can be used in therapeutic and diagnostic applications. For 

example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a 

28 gene by, e.g., PNA directed PGR clamping; as artificial restriction ensymes when used in 

combination with other en2ymes, e.g., SI nucleases (Hyrup B. (1996) above); or as probes or 
primers for DNA sequence and hybridization (Hyrup etal. (1996), above; Peiry-O'Keefe (1996), 
above). 

32 In another embodiment, PNAs of the invention can be modified, e.g. , to enhance their 

stabiUty or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery knovwi in the art. For example, PNA-DNA chimeras can be generated that may 
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combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA 
portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked 

4 using linkers of appropriate lengths selected in terms of base stacking, number of bonds between 
the nucleobases, and orientation (Hymp (1996) above). The synthesis of PNA-DNA chimeras 
can be performed as described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24: 
3357-63. For example, a DNA chain can be synthesized on a solid support using standard 

8 phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g. , 

5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA 
and the 5' end of DNA (Mag et al. (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then 
coupled in a stepmse manner to produce a chimeric molecule with a 5' PNA segment and a 3' 
12 DNA segment (Finn et al. (1996) above). Alternatively, chimeric molecules can be S5aithesized 
with a 5' DNA segment and a 3' PNA segment. See, Petersen et al (1975) BioorgMed Chem 
Lett 5: 1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such as 
16 peptides {e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the 
cell membrane (see, e.g, Letsinger et at, 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; 
Lemaitre et al, 1 987, Proc. Natl. Acad. Sci. 84:648-652; PCX Publication No. W088/09810) or 
the blood-brain barrier (see, e.g., PCX PublicationNo. W089/10134). In addition, 
20 oUgonucleotides can be modified with hybridization triggered cleavage agents (See, e.g. , Krol et 
al, 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res. 
5: 539-549). Xo this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
24 cleavage agent, etc. 

4.5 HOSTS 

The present invention fiirther provides host cells genetically engineered to contain the 
28 polynucleotides of the invention. For example, such host cells may contain nucleic acids of the 
invention introduced into the host cell using known transformation, transfection or infection 
methods. The present invention still fijrther provides host cells genetically engineered to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 
32 with a regulatory sequence heterologous to the host cell which drives expression of the 
polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous 



24 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 



PCT/USOl/04927 



recombination) to provide increased polypeptide expression by replacing, in whole or in part, the 
naturally occurring promoter with all or part of a heterologous promoter so that the cells express 
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it 
4 is operatively linked to the encoding sequences. See, for example, PCT International Publication 
No. WO94/I2650, PCT International Publication No. WO92/20808, and PCT International 
Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter 
DNA, ampUfiable marker DNA (e.g. , ada, dhfr, and the multifuiictional CAD gene which 
8 encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding 
sequence, amplification of the marker DNA by standard selection methods resuhs in co- 
amplification of the desired protein coding sequences in the cells. 

12 The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 

eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis, 

16 L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the 
polynucleotides of the invention, can be used in conventional manners to produce the gene 
product encoded by the isolated firagment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 

20 Any host/vector system can be used to express one or more of the ORFs of the present 

invention. These include, but are not limited to, etikaryotic hosts such as HeLa cells, Cv-1 cell, 
COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli md B. subtilis. 
The most preferred cells are those which do not normally express the particular polypeptide or 

24 protein or which expresses the polypeptide or protein at low natural level. Mature proteins can 
be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins using • 
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and 

28 expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et 
al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New 
York (1989), the disclosure of which is hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express recombmant 

32 protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney 
fibroblasts, described by Gluzmah, Cell 23:175 (1981). Other cell lines capable of expressing a 
compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary 
(CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 
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cells, CV-1 cells, other transfonned primate cell lines, normal diploid cells, cell strains derived 
fi-om in vitro culture of primarj' tissue, primary explants, HeLa cells, mouse L cells, BHK, 
HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of 

4 replication, a suitable promoter and also any necessaiy ribosome binding sites, polyadenylation 
site, splice donor and acceptor sites, transcriptional teniiination sequences, and 5' flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide 

8 the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced 
in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or 
more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature protein. 
12 Finally, high performance liquid chromatography (HPLC) can be employed for final purification 
steps. Microbial cells employed in expression of proteins can be disrupted by any convenient 
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 
agents. 

1 6 Alternatively, it may be possible to produce the protein in lower eukatyotes such as yeast 

or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Klityveromyces strains, Candida, or 
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 

20 strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 

strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it 
may be necessary to modify the protein produced therein, for example by phosphorylation or 
glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent 

24 attachments may be accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, ia which case the regulatory sequences of the endogenous gene 

28 may be replaced by homologous recombination. As described herein, gene targeting can be used 
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different 
gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 
regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, 

32 negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences which affect the structure or stability 
of the RNA or protein produced may be replaced, removed, added, or otherwise modified by 
targeting. These sequence iiiclude polyadenylation signals, mRNA stability elements, splice 
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sites, leader sequences for enhancing or modifying transport or secretion properties of the 
protein, or other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

,4 The targeting event may be a simple insertion of the regulatory sequence, placing the 

gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion 
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. 
8 Alternatively, the targeting event may replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than 
the naturally occurring elements. Here, the naturally occurring sequences are deleted and new 
sequences are added. In all cases, the identification of the targeting event may be facilitated by 

12 the use of one or more selectable marker genes that are contiguous with the targeting DNA, 

allowing for the selection of cells in which the exogenous DNA has integrated into the host cell 
genome. The identification of the targeting event may also be facilitated by the use of one or 
more marker genes exhibiting the property of negative selection, such that the negatively 

16 selectable marker is linked to the exogenous DNA, but configured such that the negatively 
selectable marker flanks tlie targeting sequence, and such that a correct homologous 
recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers usefial for this purpose include the 

20 Herpes Simplex Virus thsmaidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 

24 Chappel; U.S. Patent No. 5,578,46 1 to Sherwin et al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al.; and International AppUcationNo. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference 
herein in its entirely. 

28 

4.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising: the amino acid sequences set forth as any one of SEQ ID NOS: 13902-278Q2 or an 
32 ammo acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1 -13901 or 
the corresponding full length or mature protein. Polypeptides of the invention also include 
polypeptides preferably with, biological or immunological activity that are encoded by: (a) a 
polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO: 1-13901 or 
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(b) polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NOS: 
13902-27802 or (c) polynucleotides that hybridize to the complement of the polynucleotides of 
either (a) or (b) under stringent hybridization conditions. Hie invention also provides 
4 biologically active or immunologically active variants of any of the amino acid sequences set 
forth as SEQ ID NOS: 13902-27802 or the corresponding full length or mature protein; and 
"substantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least about 
75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at least about 90%, 91%, 
8 92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically at least about 98%, or 
most typically at least about 99% amino acid identity) that retain biological activity. 
Polypeptides encoded by allelic variants may have a similar, increased, or decreased activity 
compai-ed to polypeptides comprising SEQ ID NOS: 13902-27802. 

12 Fragments of the proteins of the present invention which are capable of exhibiting 

biological activity are also encompassed by the present invention. Fragments of the protein may 
be in linear form or they may be cyclized using l<nown methods, for example, as described in H. 
U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and m R. S. McDowell, et al., J. Amer. 

16 Chem. Soc. 114, 9245-9253 (1 992), both of which are Incorporated herein by reference. Such 
fragments may be fused to carrier molecules such as immunoglobulins for many purposes, 
including increasing the valency of protein binding sites. 

The present invention also provides both fiill-length and mature forms (for example, 

20 without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding 
sequence is identified in the sequence listing by translation of the disclosed nucleotide 
sequences. The mature form of such protein may be obtained by expression of a fiill-length 
polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form 

24 of the protein is also determinable from the amino acid sequence of the full-length form. Where 
proteins of the present invention are membrane boimd, soluble forms of the proteins are also 
provided. In such forms, part or all of the regions causing the proteins to be membrane bound 
are deleted so that the proteins are fully secreted firom the cell in which they are expressed. 

28 Protein compositions of the present invention may fiirther comprise an acceptable canier, 

such as a hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the nucleic acid 
fragments of the present invention or by degenerate variants of the nucleic acid fragments of the 

32 present invention. By "degenerate variant" is intended nucleotide fragments which differ from a 
nucleic acid fiagment of the present invention (e.g., an ORF) by nucleotide sequence but, due to 
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic 
acid fragments of the present invention are the ORFs that encode proteins. 
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A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid 
sequence can be synthesized using commercially available peptide synthesizers. The 
4 synthetically-constructed protein sequences, by virtue of sharing primary, secondar>' or tertiary 
structural and/or conformational characteristics with proteins may possess biological properties 
in common therewith, including protein activity. This technique is particularly useful in 
producing small peptides and fragments of larger polypeptides. Fragments are viseful, for 
8 example, in generating antibodies against the native polypeptide. Thus, they may be employed 
as biologically active or immunological substitutes for natural, purified proteins in screening of 
therapeutic compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified from 

12 cells which have been altered to express the desired polypeptide or protein. As used herein, a 

cell is said to be altered to express a desired polypeptide or protein when the cell, tlirough genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or 
wliich the cell normally produces at a lower level. One sldlled in the art can readily adapt 

16 procedures for introducing and expressing either recombinant or synthetic sequences into 

eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides 
or proteins of the present invention. 

The invention also relates to methods for producing a polypeptide comprising growing a 

20 culture of host cells of the invention in a suitable culture medium, and purifying the protein from 
the cells or the culture in which the cells are grown. For example, the methods of the invention 
include a process for producing a polypeptide in which a host cell containing a suitable 
expression vector that includes a polynucleotide of the invention is cultured under conditions that 

24 allow expression of the encoded polypeptide. The polypeptide can be recovered from tbe 

culture, conveniently from the culture medium, or from a lysate prepared from the host cells and 
fiirther pxnified. Preferred embodiments include those in which the protein produced by such 
process is a fuU length or mature form of the proteia 

28 In an alternative method, the polypeptide or protein is purified from bacterial cells which 

naturally produce the polypeptide or protein. One skilled in the art can readily follow known 
methods for isolating polypeptides and proteins in order to obtain one of the isolated 
pol5^eptides or proteins of the present invention. These include, but are not limited to, 

32 inununochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, 
and immuno-affinity chromatography. See, e.g.. Scopes, Protein Purification: Principles and 
Practice, Springer- Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory 
Manual; Ausubel et al.. Current Protocols in Molecular Biology. Polypeptide fragments that 
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retain bioiogical/immunological activity include fragments comprising greater than about 100 
amino acids, or greater than about 200 amino acids, and fi-agments that encode specific protein 
domains. 

4 The purified polypeptides can be used in in vitr-o binding assays which are well known in 

the art to identify molecules which bind to the polypeptides. These molecules include but ai-e not 
limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other 
proteins. The molecules identified in the binding assay are then tested for antagonist or agonist 
8 activity in in vivo tissue culture or animal models that are well known in the art. In brief, the 
molecules are titrated into a plurality of cell cultures or animals and then tested for either 
cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the peptides 

12 may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to 
cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the 
specificity of the binding molecule for SEQ IDNOS: 13902-27802. 

The protein of the invention may also be expressed as a product of transgenic animals, 

16 e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized 
by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid sequences 
similar to those of purified proteins but into which modification are naturally provided or 

20 deliberately engineered. For example, modifications, in the peptide or DNA sequence, can be 
made by those skilled in Ihe art using known techniques. Modifications of interest in the protein 
sequences may include the alteration, substitution, replacement, insertion or deletion of a 
selected amino acid residue in the coding sequence. For example, one or more of the cysteine 

24 residues may be deleted or replaced with another amino acid to alter the conformation of the 
molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are 
well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such 
alteration, substitution, replacement, insertion or deletion retains the desired activity of the 

28 protein. Regions of the protein that are important for the protein function can be determined by 
various methods known in the art including the alanine-scanning method which involved 
systematic substitution of single or strings of amino acids with alanine, followed by testing the 
resulting alanine-containing variant for biological activity. This type of analysis determines the 

32 importance of the substituted amino acid(s) in biological activity. Regions of the protein that are 
important for protein fimction may be determined by the eMATRIX program. 

Other firagments and derivatives of the sequences of proteins which would be expected to 
retain protein activity in whole or in part and are useful for screening or other immimological 
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methodologies may also be easily made by those skilled in the art given the disclosures herein. 
Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 
4 invention to suitable control sequences in one or more insect expression vectors, and employing 
an insect expression system. Materials and methods for baculovirus/insect cell expression 
systems are commercially available in kit form from, e.g. , Invitrogen, San Diego, Calif., U.S.A. 
(the MaxBatTt** kit), and such methods are well known in the art, as described in Summers and 
8 Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by 
reference. As used herein, an insect cell capable of expressing a polynucleotide of the present 
invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells under 

12 culture conditions suitable to express the recombinant protein. The resulting expressed protein 
may then be purified from such culture (i.e., from culture medium or cell extracts) using known 
purification processes, such as gel filtration and ion exchange chromatography. The purification 
of the protein may also include an affinity column containing agents which will bind to the 

16 protein; one or more column steps over such affinity resins as concanavalin A-agarose, 
heparin-toyopearl™ or Cibacrom blue 3GA SepharoseTM; one or more steps involving 
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoafSnily chromatography. 

20 Alternatively, the protein of the invention may also be expressed in a form which will 

facilitate purification. For example, it may be expressed as a fiision protein, such as those of 
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a 
His tag. Kits for expression md purification of such fusion proteins are coimnercially available 

24 firom New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, 

respectively. The protein can also be tagged with an epitope and subsequentiy purified by using 
a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially 
available from Kodak (New Haven, Conn.). 

28 Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) 

steps employing hydrophobic RP-HPLC media, e.g. , silica gel having pendant methyl or other 
aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing 
purification steps, in various combinations, can also be employed to provide a substantially 

32 homogeneous isolated recombinant protein. The protein thus purified is substantially fiee of 

other mammalian proteins and is defined in accordance with the present invention as an "isolated 
protein." 
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The polypeptides of the invention include analogs (variants). This embraces fragments, 
as well as peptides in which one or more amino acids has been deleted, inserted, or substituted. 
Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or 

4 modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to 
another moiety or moieties, e.g. , targeting moiety or another therapeutic agent. Such analogs 
may exhibh improved properties such as activity and/or stability. Examples of moieties which 
may be fused to the polypeptide or an analog include, for example, targeting moieties which 

8 provide for the delivery of polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells, 
antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well 
as receptor and ligands expressed on pancreatic or immune cells. Other moieties which may be 
fused to the polypeptide include therapeutic agents which are used for treatment, for example, 
12 immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and 

steroids. Also, polypeptides may be fused to immtine modulators, and other cytokines such as 
alpha or beta interferon. 

1 6 4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 

AND SIMILARITY 

Preferred identity and/or similarity are designed to give the largest match between the 
sequences tested. Methods to determine identity and similarity are codified in computer 

20 programs including, but are not limited to, the GCG program package, including GAP 

(Devereux, J., et al.. Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F. 
et al., J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al., Nucleic Acids Res. 

24 vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp. 
Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software (Nevill- 
Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software 
(Sonnhammer et al.. Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by 

28 reference) and the Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 
105-3 1 (1982), incorporated herein by reference). The BLAST programs are publicly available 
from the National Center for Biotechnology Information (NCBI) and other sources (BLAST 
Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. 

32 Biol. 215:403-410 (1990). 

4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 
protein" or "fusion protein" comprises a polj^ptide of the invention operatively linked to 
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another polypeptide. Within a ftision protein the polypeptide according to the invention can 
correspond to all or a portion of a protein according to the invention. In one embodiment, a 
fUsion protein comprises at least one biologically active portion of a protein according to the 

4 invention. In another embodiment, a fusion protein comprises at least two biologically active 
portions of a protein according to the invention. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the polirpeptide according to the invention and the other 
polypeptide are fused in-&ame to each other. Tlie polypeptide can be fUsed to the N-terminus or 

8 C-terminus. 

For example, in one embodiment a fiision protein comprises a polypeptide according to 
the invention operably linked to the extracellular domain of a second protein. 

In another embodiment, the fusion protein is a GST-fusion protein in which the 

12 polypeptide sequences of the invention are fused to the C-terminus of the GST (z. e. , glutathione 
S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin Hision protein in which 
the polypeptide sequences according to the invention comprises one or more domains are fused 

16 to sequences derived from a member of the immunoglobulin' protein family. The 

immunoglobulin fusion proteins of the invention can be incorporated into phai-maceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo. 

20 The immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand. 
Inhibition of the ligand/protein interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, e,g., cancer as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, Hie iimnunoglobulin fusion proteins of the invention can be 

24 xised as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays 
to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand. 

A chimeric or fijsion protein of the invention can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide sequences 

28 are ligated together in-frame in accordance with conventional techniques, e.g. , by employing 
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the fiision gene can 

32 be synthesized by conventional techniques including automated DNA synthesizers. 

Alternatively, PGR amplification of gene fragments can be carried out using anchor primers that 
give rise to complementary overhangs between two consecutive gene fragments that can 
subsequently be annealed and reampUfied to generate a chimeric gene sequence (see, for 
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example, Ausubel et al. (eds.) Current Protocols in Molecular Biology, John Wiley & 
Sons, 1992). Moreover, many expression vectors are commercially available that already encode 
a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the 
4 invention can be cloned into such an expression vector such that the fusion moiety is linked 
in-firame to the protein of the invention. 

4.8 GENETHEEIAPY 

8 Mutations in the polynucleotides of the invention gene may result in loss of normal 

function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides of 
the invention. Delivery of a functional gene encoding polypeptides of the invention to 

.12 appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly 
viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of 
physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, 
Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). ¥ot additional reviews of 

16 gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989), Verma, ScientijGc 
American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of 
the nucleotides of the present invention or a gene encodmg the polypeptides of the present 
invention can also be accomplished with extracliromosomal substrates (transient expression) or 

20 artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be intioduced in vivo for tlierapeutic piuposes. 
Alternatively, it is contemplated that in other human disease states, preventing the expression of 

24 or inhibiting the activity of polypeptides of the invention will be useful in treating the disease 
states. It is contemplated tliat antisense therapy or gene therapy could be applied to negatively 
regulate the expression of polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of antisense 

28 molecules to the nucleic acids of the present invention, their complements, or their translated RNA 
sequences, by metliods known in the art. Further, the polypeptides of the present invention can be 
inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such 
as a silencer, which is tissue specific. 

32 The present invention still fiarther provides cells genetically engineered in vivo to express the 

polynucleotides of the invention, wherein such polynucleotides are in operative association with a 
regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in 
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the cell. These methods can be used to increase or decrease the expression of the polynucleotides of 

the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of cells to 
4 permit, iacrease, or decrease, expression of endogenous polypeptide. Cells can be modified {e.g. , by 
homologous recombination) to provide increased polypeptide expression by replacing, in whole or 
in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells 
express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is 
8 operatively linked to the desired protein encoding sequences. See, for example, PCT International 
PubhcationNo. WO 94/12650, PCT IntemationalPublicationNo. WO 92/20808, and PCT 
IntemationalPublicationNo. WO 91/09955. It is also contemplated that, in addition to heterologous 
promoter DNA, amplifiable marker DNA (e.g. , ada, dhfir, and the multiflinctional CAD gene which 

12 encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired 
protein coding sequence, amplification of the marker DNA by standard selection methods results in 
co-amplification of the desired protein coding sequences in the cells. 

16 In another embodiment of the present invention, cells and tissues may be engineered to 

express an endogenous gene comprising the polynucleotides of the invention imder the control of 
inducible regulatory elements, in which case the regiJatory sequences of the endogenous gene may 
be replaced by homologous recombination. As described herein, gene targeting can be used to 

20 replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene 
or a novel regulatory sequence synthesized by genetic engineeringmelhods. Such regulatory 
sequences may be comprised of promoters, enhancers, scafifold-attachmentregions, negative 
regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations 

24 of said sequences. Alternatively, sequences ^^4lich affect the structure or stability of the RNA or 
protein produced may be replaced, removed, added, or otherwise modified by targeting. These 
sequences include polyadenylationsignals, mRNA stability elements, splice sites, leader sequences 
for enhancing or modifying transport or secretion properties of the protein, or other sequences 

28 which alter or improve the firaction or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the gene 
under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both 
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory 

32 element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the 
targeting event may replace an existing element; for example, a tissue-specific dancer can be 
replaced by an enhancer that has broader or different cell-type specificity than tiie naturally 
occurring elements. Here, the naturally occurring sequences are deleted and new sequences are 
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added. In all cases, the identification of the targeting event may be facilitated by the use of one or 
more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection 
of cells in which the exogenous DNA has integrated into the ceU genome. The identification of the 
4 targeting event may also be facilitated by the use of one or more marker genes exhibiting the 

property of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and 
such that a correct homologous recombination event with sequences in the host cell genome does 
8 not result in the stable integration of the negatively selectable marker. Markers useful for this 
purpose include the Hei-pes Simplex Vims thymidine kinase (TK) gene or the bacterial 
xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with this 
12 aspect of the invention are more particularly described in U.S. PatentNo. 5,272,071 to Chappel; 
U.S. PatentNo. 5,578,461 to Sherwin et al; International AppHcationNo. PCT/US92/09627 
(WO93/09222)by Seldenet al.; and International Application No. PCT/US 90/0643 6 
(W09 1/06667) by Skoultchi et al., each of which is incorporated by referencehereinin its entirety. 

16 

4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 

20 inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)J. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenoiK promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination are 

24 referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 
animals are usefiil to determine the roles polj^eptides of the invention play in biological 
processes, and preferably in disease states. Transgenic animals are useful as model systems to 

28 identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 

32 polynucleotides of the invention is either activated or inactivated to alter the level of expression 
of the polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or even 
replacing the homologous promoter to provide for increased protein expression. The homologous 
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promoter can be supplemented by insertion of one or more heterologous enhancer elements 
known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development, 
4 through, e.g. , homologous recombination or knock out strategies, of animals that fail to express 
polypeptides of the invention or that express a variant polypeptide. Such animals are useful as 
models for studying the in vivo activities of polypeptide as well as for studying modulators of the 
polypeptides of the invention. 
8 In preferred methods to determine biological functions of the polypeptides of the 

invention in vivo, one or more genes provided by the invention are either over expressed or 
inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 

12 control of exogenous or endogenous promoter elements, are Icnown as transgenic animals. 

Animals in which an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 

1 6 animals are useful to determine the roles polypeptides of the invention play in biological 

processes, and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 andPCT 

20 Publication No. W094/28 122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of the 
invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous recombination 

24 methods described above. Activation can be achieved by supplementing or even replacing the 
homologous promoter to provide for increased protein expression. The homologous promoter 
can be supplemented by insertion of one or more heterologous enhancer elements known to 
confer promoter activation in a particular tissue. 

28 

4.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one or 
more of the uses or biological activities (including those associated with assays cited herein) 
32 identified herein. Uses or activities described for proteins of the present invention may be 

provided by administration or use of such proteins or of polynucleotides encoding such proteins 
(such, as, for example, in gene therapies or vectors suitable for introduction of DNA). The 
mechanism underlying the particular condition or pathology will dictate whether the 
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polypeptides of the invention, the polynucleotides of the invention or modulators (activators or 
inhibitors) thereof would be beneficial to the subject in need of treatment Thus, "therapeutic 
compositions of the invention" include compositions comprising isolated polynucleotides 

4 (including recombinant DNA molecules, cloned genes and degenerate variants thereof) or 
polypeptides of the invention (including full length protein, mature protein and truncations or 
domains thereof), or compounds and other substances that modulate the overall activity of the 
target gene products, either at the level of target gene/protein expression or target protein 
8 activity. Such modulators include polypeptides, analogs, (variants), including firagments and 
fusion proteins, antibodies and other binding proteins; chemical compounds that directly or 
indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening 
assays as described herein); antisense polynucleotides and polynucleotides suitable for tiiple 

12 helix formation; and in particular antibodies or other binding partners that specifically recognize 
one or more epitopes of tlie polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular activation 
or in one of the other physiological pathways described herein. 

16 

4.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express recombinant 

20 protein for analysis, characterization or tiierapeutic use; as markers for tissues in which the 

corresponding protein is preferentially e^qjressed (eitiier constitutively or at a particular stage of 
tissue differentiation or development or in disease states); as molecular weight markers on gels; 
as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene 

24 positions; to compare with endogenous DNA sequences in patients to identify potential genetic 
disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PGR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; for selecting and making 

28 oligomers for attachment to a "gene chip" or other support, including for examination of 

expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as 
an antigen to raise anti-DNA antibodies or elicit another immune response. Where the 
polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for 

32 example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap 
assays (such as, for example, that described in Gyuris et al.. Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of 
the binding interaction. 
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The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immime response; as a reagent (including the 
4 labeled reagent) in assays designed to quantitatively determine levels of the protein (or its 
receptor) in biologfcal fluids; as markers for tissues in wliich the corresponding polypeptide is 
preferentially expressed (either constitutively or at a particular stage of tissue differentiation or 
development or in a disease state); and, of course, to isolate correlative receptors or ligands. 
8 Proteins involved in these binding interactions can also be used to screen for peptide or small 
molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or 
kit fomat for commercialization as research products. 
12 Methods for performing the uses listed above are well Icnown to those skilled in the art. 

References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed.. Cold Spring Harbor Laboratory Press, Sambrook, X, E. F. Fritsch 
and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning 
16 Techniques", Academic Press, Berger, S. L. and A. R, Kimmel eds., 1987. 

4.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as nutritional 
20 sources or supplements. Such uses include wilhoutlirnitation use as a protein or amino acid 

supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In 
such cases the polypeptide or polynucleotide of the invention can be added to the feed of a 
particular organism or can be administered as a separate solid or Uquid preparation, such as in the 
24 form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the 
polypeptide or polynucleotide of the invention can be added to the medium in or on which the 
microorganism is cultured. 

28 4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 

ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) 
32 activity or may induce production of other cytokines in certain cell populations. A 

polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many 
protein factors discovered to date, including all known cytokines, have exhibited activity in one 
or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 



PCT/USOl/04927 



confirmation of c5l:okine activity. The activity of therapeutic compositions of the present 
invention is evidenced by any one of a number of routine factor dependent cell proliferation 
assays for cell lines including, without Umitation, 32D, DA2, DAIG, TIO, B9, B9/11, BaF3, 
4 MC9/G, M+(preB M+), 2E8, RB5, DAI , 123, Tl 165, HT2, CTLL2, TF-1, Mo7e, CMK, 
HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: 

Assays for T-cell or thymocyte proliferation include without limitation those described 
in: Current Protocols in Immmology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
8 M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al, J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 
145:1706-1712, 1990; Bertagnolli etal., Cellular Immunology 133:327-341, 1991; Bertagnolli, 

12 etal., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or 
thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 
Kruisbeek, A. M. and Shevach, E. M. In Cun-ent Protocols in Immunology. J. E. e.a. Coligan 

16 eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse 
and human interleuldn-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 

20 include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 
and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; 
deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988; 

24 Greenberger et al., Proc. Nati. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse 
and human interleukin 6~Nordan, R. In Current Protocols in Immtmology. J. E. Coligan eds. Vol 
1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Nati. Aced. Sci. 
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 11 -Bennett, F., Giannotti, J., 

28 Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Cohgan eds. Vol 1 pp. 
6.15.1 John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 
9-Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. 
J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

32 Assays for T-cell clone responses to antigens (which will identify, among others, proteins 

that affect APC-T cell interactions as well as direct T-cell ejBfects by measuring proliferation and 
cjrtokine production) include, without limitation, those described in: Current Protocols in 
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, 
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Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, 
Immunologic studies in Humans); Weinberger et al, Proc. Natl. Acad. Sci. USA 77:6091-6095, 
4 1 980; Weinberger et al., Eur. J. Immun. 1 1 :405-4 1 1 , 1 98 1 ; Takai et al., J. Immunol. 
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988. 

4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 
8 A polypeptide of the present invention may exhibit stem cell groAvth factor activity and 

be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem 
cells including primordial germ cells, embryonic stem cells, hematopoietic stem ceUs and/or 
germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or 

12 ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential 
state which would be useful for re-engineering damaged or diseased tissues, transplantation, 
manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce 
large quantities of human cells has important working applications for the production of himian 

1 6 proteins which currently must be obtained from non-human sources or donors, implantation of 
cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; 
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including 
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs 

20 for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines may 
be administered in combination with the polypeptide of the invention to achieve the desired 
effect, including any of the growth factors hsted herein, other stem cell maintenance factors, and 

24 specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Flt- 
3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage 
inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet 
factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic 

28 fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of 
these cells in culture will facilitate the production of large quantities of mature cells. Techniques 
for culturing stem cells are known in the art and administration of polypeptides of the invention, 

32 optionally with other growth factors and/or cj^okines, is expected to enhance the survival and 
proliferation of the stem cell populations. This can be accompHshed by direct administration of 
the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected 
with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder 
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layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers 
may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 
4 Stem cells themselves can be transfected with a polynucleotide of the invention to induce 

autocrine expression of the polypeptide of the invention. This will allow for generation of 
undifferentiated totipotential/pluripotential stem cell lines that are useful as is or tliat can then be 
differentiated into the desired matui-e cell types. These stable cell lines can also serve as a source 
8 of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for 
polymerase chain reaction experiments. These studies would allow for the isolation and 
identification of differentially expressed genes in stem cell populations that regulate stem cell 
proliferation and/or maintenance. 

12 Expansion and maintenance of totipotent stem cell populations will be useftil in the 

treatment of many pathological conditions. For example, polypeptides of the present invention 
may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be 
used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or 

1 6 genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation 
of neural cells and for the regeneration of nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic 
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, 

20 the expanded stem cell populations can also be genetically altered for gene therapy purposes and 
to decrease host rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its efifect on stem cells can also be 
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell 

24 types. A broadly applicable method of obtaining pure populations of a specific differentiated 
cell type from undifferentiated stem cell populations involves the use of a cell-type specific 
promoter driving a selectable marker. The selectable marker allows only cells of the desired type 
to survive. For example, stem cells can be induced to differentiate into cardiomyocj^es (Wobus 

28 et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) 
or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lama, et al.. 
Academic Press (1997)). Alternatively, directed differentiation of stem cells can be 
accomplished by culturing the stem cells in the presence of a differentiation factor such as 

32 retinoic acid and an antagonist of the polypeptide of the invention which would mhibit the 
effects of endogenous stem cell factor activity and allow differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the invention 
exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell 
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sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder 
layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in 
the presence of the polypeptide of the invention alone or in combination with other growth 
factors or cytokines. The ability of the polypeptide of the invention to induce stem cells 
proliferation is determined by colony formation on semi-solid support e.g. as described by 
Bernstein et al.. Blood, 77: 23 16-2321 (1991). 

4.10.5 HEMATOPOIESIS REGI^LATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal 
biological activity in support of colony fonning cells or of factor-dependent cell lines indicates 
involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of 
erythi-oid progenitor cells alone or in combination with other cytokines, thereby indicating 
utility, for example, in treating various anemias or for use in conjunction with 
irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid 
cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and 
monocytes/macrophages traditional CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or 
treatment of various platelet disorders such as thrombocytopenia, and generally for use in place 
of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned 
hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as 
those usually treated with transplantation, including, without limitation, aplastic anemia and 
paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment 
post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) 
as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines are 
cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al.. Molecular 
and Cellular Biology 13 :473-486, 1993; McClanahan et al.. Blood 81 :2903-2915, 1993. 
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Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. 
4 Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N. Y. 1 994; Hirayama et al, 
Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic colony forming cells 
with high proHferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic 
Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et 
8 al.. Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 

Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, 
Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of 
stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
12 Freshney, et al. eds. Vol pp. 1 63-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture 
initiating cell assay, Sutherland, H. J. In Cultiu e of Hematopoietic Cells. R. I. Freshney, et al. 
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 



16 4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, tendon, 
ligament and/or nerve tissue grovWh or regeneration, as well as in wound healing and tissue 
repair and replacement, and in healing of bums, incisions and ulcers. 

20 A polypeptide of the present invention which induces cartilage and/or bone growth in 

circumstances where bone is not normally formed, has application in the heahng of bone 
ftactures and cartilage damage or defects in humans and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of Ihe invention may have 

24 prophylactic use in closed as well as open fracture reduction and also in the improved fixation of 
artificial joints. De novo bone formation induced by an osteogenic agent contributes to the 
repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and 
also is useful in cosmetic plastic surgery. 

28 A polypeptide of this invention may also be involved in attracting bone-forming cells, 

stunulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 

32 inflammation or processes of tissue destruction (coUagenase activity, osteoclast activity, etc.) 
mediated by inflammatory processes may also be possible using the composition of the 
invention. 
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Another category of tissue regeneration activity that may involve the polypeptide of the 
present invention is tendon/ligament formation. Induction of tendonyhgament-hke tissue or 
other tissue formation in circumstances where such tissue is not normally formed, has 
4 apphcation in the healing of tendon or ligament tears, deformities and other tendon or ligament 
defects in humans and other animals. Such a preparation employing a tendon/ligament-like 
tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament 
tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
8 in repairing defects to tendon or ligament tissue. De novo tendon/Ugament-like tissue formation 
induced by a composition of the present invention contributes to the repair of congenital, trauma 
induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic 
surgery for attachment or repair of tendons or ligaments. The compositions of the present 

12 invention may provide envurorment to attract tendon- or ligament-forming cells, stunulate 

growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or 
ligament-forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for 
return in vivo to effect tissue repair. The compositions of the invention may also be useful in the 

16 treatment of tendinitis, carpal timnel syndrome and other tendon or ligament defects. The 

compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is 
well known in the art. 

The compositions of the present invention may also be usefiil for proliferation of neural 

20 cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a 
composition may be used in the treatment of diseases of the peripheral nervous system, such as 

24 peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous 
system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in 
accordance with the present invention include mecjianical and traumatic disorders, such as spinal 

28 cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies 
resulting from chemotherapy or other medical therapies may also be treatable using a 
composition of the invention. 

Compositions of the invention may also be usefiol to promote better or faster closure of 

32 non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular 
insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine. 
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kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular 
endothelium) tissue, or for promoting the grovv^h of cells comprising such tissues. Part of the 
desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue 
4 to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. 
A composition of tlie present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 
8 A composition of the present invention may also be useful for promoting or inhibiting 

differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 
12 Assays for tissue generation activity include, without limitation, those described in: 

International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
WO91/07491 (skin, endothelium). 
1 6 Assays for wound healing activity include, without limitation, those described in: Winter, 

Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.). Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 
71:382-84 (1978). 

20 

4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or immune 
suppressing activity, including without limitation the activities for which assays are described 

24 herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A 
protein may be useful in the treatment of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCDD)), e.g., in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as ejBFecting the cytolytic activity of NK cells 

28 and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g., 
HIV) as well as bacterial or Hmgal infections, or may result from autoimmune disorders. More 
specifically, infectious diseases causes by viral, bacterial, fiingal or other infection may be 
treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, 

32 herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fimgal infections such 
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful 
where a boost to the immune system generally may be desirable, i.e., in the treatment of cancer. 
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Autoimnuine disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, 
rheumatoid arthritis, autoimmune pulmonary' inflammation, Guillain-Barre syndrome, 
4 autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host 
disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof, 
including antibodies) of the present invention may also to be usefiil in the treatment of allergic 
reactions and conditions {e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect 
8 venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, 
angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, 
Stevens-Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
. 12 (particularly allergic asthma) or other respiratory problems. Other conditions, in which immime 
suppression is desired (including, for example, organ transplantation), may also be treatable 
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the 
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals 

16 models such as the cimiulative contact enhancement test (Lastbom et al.. Toxicology 125; 59-66, 
1998), skin prick test (Hoffinaim et al.. Allergy 54: 446-54, 1999), guinea pig skin sensitization 
test (Vohr et al.. Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., 
J. Toxicol. Environ. Health 53: 563-79). 

20 Using the proteins of the invention it may also be possible to modulate immune 

responses, in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve preventing the induction of an immime 
response. The fimctions of activated T cells may be inhibited by suppressing T cell responses or 

24 by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is 
generally an active, non-antigen-specific, process which requires continuous exposure of the T 
cells to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy 
in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and 

28 persists after exposure to the toleriziag agent has ceased. Operationally, tolerance can be 

demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence 
of the tolerizing agent. 

Down regulating or preventing one or more antigen fiinctions (including without 

32 limitation B lymphocyte antigen fimctions (such as, for example, B7)), e.g. , preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and 
organ transplantation and in grafl-versus-host disease (GVHD). For example, blockage of T cell 
ftmction should result in reduced tissue destruction in tissue transplantation. Typically, in tissue 
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transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, 
followed by an immune reaction that destroys the transplant The administration of a therapeutic 
composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, 
4 and thus acts as an immunosuppressant. Moreover, a lack of costimulation may also be sufficient 
to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance 
by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration 
of these blocking reagents. To acWeve sufficient inmfiunosuppression or tolerance in a subject, it 
8 may also be necessary to block the ftinction of a combination of B lymphocyte antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in 

12 rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine 
the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et 
al.. Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:11102-11 105 
(1992). In addition, murine models of GVHD (see Paul ed.. Fundamental Immunology, Raven 

1 6 Press, New York, 1 989, pp. 846-847) can be used to determine the effect of therapeutic 
compositions of the invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are 

20 reactive against self tissue and which promote the production of cytokines and autoantibodies 
involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may 
reduce or ehminate disease symptoms. Administration of reagents which block stimulation of T 
cells can be used to inhibit T cell activation and prevent production of autoantibodies or T 

24 cell-derived cytokines which may be involved in the disease process. Additionally, blocking 
reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to 
long-term relief fiom the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal models of 

28 human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, 
systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune 
collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine e^qserimental 
myasthenia gravis (see Paul ed.. Fundamental Inununology, Raven Press, New York, 1989, pp. 

32 840-856). 

Upregulation of an antigen function {e.g., a B lymphocyte antigen fimction), as a means 
of up regulating immune responses, may also be useful in therapy. Upregulation of immime 
responses may be in the form of enhancing an existing immtme response or ehciting an initial 
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immune response. For example, enhancing an immune response may be useful in cases of viral 
infection, including systemic viral diseases such as influenza, the common cold, and 
encephalitis. 

4 Alternatively, anti-viral immune responses may be enhanced in an infected patient by 

removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory form of 
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the 
8 patient. Another method of enhancing anti-viral immune responses would be to isolate infected 
cells from a patient, transfect tliem with a nucleic acid encoding a protein of the present 
invention as described herein such that the cells express all or a portion of the protein on their 
surface, and reintroduce the transfected cells into the patient. The infected cells would now be 

12 capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal to T 
cells to induce a T cell mediated immune response against the transfected tumor cells. In 
addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 

16 reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with 
nucleic acid encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) of an 
MHC class I alpha chain protein and p2 microglobulin protein or an MHC class II alpha chain 
protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II 

20 proteins on the cell surfece. Expression of the ^propriate class I or class II MHC in conjunction 
with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T 
cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding 
an antisense construct which blocks expression of an MHC class H associated protein, such as 

24 the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity 
of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce 
tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human 
subject may be sufficient to overcome tumor-specific tolerance in the subject. 

28 The activity of a protein of the invention may, among other means, be measured by the 

following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 
those described in: Current Protocols ui Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 
32 H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 

Wiley-Intersdence (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
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Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 137:3494-3500, 1986; Takai et al, J. 
Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; BertagnoUi et al.. 
Cellular Immunology 133:327-341, 1991;BroTOi et al., J. Immunol. 153:3079-3092, 1994. 
4 Assays for T-cell-dependent immunoglobulin responses and isotype switching (which 

will identify, among others, proteins that modulate T-cell dependent antibody responses and that 
affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. 
Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, 
8 Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 
pp. 3.8.1-3.8.16, Jolm Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins 
that generate predominantly Thl and CTL responses) include, without Umitation, those described 

12 in: Current Protocols in Iminunology, Ed by J. E. Coligan, A. M, Kruisbeek, D. H. Margulies, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocj^e Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 

16 1988; BertagnoUi et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins expressed 
by dendritic cells that activate naive T-cells) include, without limitation, those described in: 
Gnery et al., J. Immunol. 134:536-544, 1995; Inaba et al.. Journal of Experimental Medicine 

20 173:549-559, 1991; Macatonia et al.. Journal of Immunology 154:5071-5079, 1995; Porgador et 
al., Journal of Experimental Medicine 182:255-260, 1995; Nair et al.. Journal of Virology 
67:4062-4069, 1993; Huang et al.. Science 264:961-965, 1994; Macatonia et al.. Journal of 
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al.. Journal of Clinical Investigation 

24 94:797-807, 1994; and Inaba et al.. Journal of Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins 
that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: DarzynkievAcz et al., Cytometry 

28 13:795-808, 1992; Gorczyca et al.. Leukemia 7:659-670, 1993; Gorczyca et al.. Cancer Research 
53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology 
145:4037-4045, 1990; Zamai et al.. Cytometry 14:891-897, 1993; Gorc2yca et al., International 
Journal of Oncology 1:639-648, 1992. 

32 Assays for proteins that influence early steps of T-cell commitment and development 

include, without limitation, those described in: Antica et al.. Blood 84: 1 1 1 -1 1 7, 1 994; Fine et al.. 
Cellular Immunology 155:111-122, 1994; Galy et al.. Blood 85:2770-2778, 1995; Toki et al., 
Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 
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4.10.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
4 activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stirnulating hormone (FSH), while activins and are characterized by their ability to stimulate the 
release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention, 
8 alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive 
based on the ability of inhibins to decrease fertility in female mammals and decrease 
spermatogenesis in male mammals. Administration of sufficient amoimts of other inhibins can 
induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a 
12 homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as 
a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH 
release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A 
polypeptide of the invention may also be useflil for advancement of the onset of fertility in 
16 sexually immature mammals, so as to increase the Ufetime reproductive performance of domestic 
animals such as, but not limited to, cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods. 

20 Assays for activin/inhibin activity include, without limitation, those described in: Vale et 

al.. Endocrinology 91:562-572, 1972; Ling et aL, Nature 321:779-782, 1986; Vale et al.. Nature 
321:776-779, 1986; Mason et al.. Nature 3 18:659-663, 1985; Forage et al., Pioc. Natl. Acad. Sci. 
USA 83:3091-3095,1986. 

24 

4.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or chemokinetic 
activity for mammalian cells, including, for example, monocytes, jSbroblasts, neutrophils, 

28 T-cells, mast cells, eosinophils, epitheUal and/or endothelial cells. A polynucleotide of the 
invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic 
receptor activation can be used to mobiUze or attract a desired cell population to a desired site of 
action. Chemotactic or chemokinetic compositions {e.g. proteins, antibodies, binding partners, or 

32 modulators of the invention) provide particular advantages in treatment of wounds and other 
trauma to tissues, as well as in treatment of localized infections. For example, attraction of 
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved 
immune responses against the tumor or mfecting agent. 
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A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
Preferably, the protein or peptide has the ability to directly stimxalate directed movement of cells. 
4 Whether a particulai- protein has chemotactic activity for a population of cells can be readily 
determined by employing such protein or peptide in any known assay for cell chemotaxis. 
Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or prevent 
8 chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. 
12 M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates 
and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines 
6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind at al. APMIS 103:140-146, 
1995; Mullet et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 
16 1994; Johnston et al. J. of Immunol. 153:1762-1768, 1994. 

4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
20 thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 

attributes. Compositions may be useful in treatment of various coagulation disorders (including 

hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 

in treating wounds resulting from trauma, surgery or other causes. A composition of the 
24 invention may also be usefiil for dissolvmg or inhibiting formation of thromboses and for 

treatment and prevention of conditions resulting therefrom (such as, for example, infarction of 

cardiac and central nervous system vessels (e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 
28 Assay for hemostatic and thrombolytic activity include, without limitation, those 

described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 

45:413-419, 1987; Humphrey et al.. Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 

35:467-474, 1988. 

32 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation or 
metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 
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invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For 
example, the presence or increased expression of a polynucleotide/polypeptide of the invention 
may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
4 Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer 
condition. Identification of single nucleotide polymorphisms associated with cancer or a 
predisposition to cancer may also be useflil for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
8 inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) 
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic 
compositions of the invention may be effective in adult and pediatric oncology including in solid 
phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic 

1 2 cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 
acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, 
larynx cancer and thyroid cancer, limg cancers including small cell carcinoma and non-small cell 
cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal 

16 cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers,- liver cancer, urologic cancers including 
bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian 
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, 

20 kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
nexiroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, 
tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 

24 hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or mod\ilators of polypeptides of the invention (including 
inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 

28 effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 

chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 
effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

32 The composition can also be administered in therapeutically effective amounts as a 

portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of the invention with one or more anti-cancer drugs in addition to a phannaceutically 
acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. 
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Anti-cancer drugs that are well known in the art and can be used as a treatment in combination 
with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, 
Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis- 
4 DDP), Cyclophosphamide, Cytarabine HCl (Cjtosine arabinoside), Dacarbazine, Dactinomycin, 
Dauaombicin HCl, Doxorubicin HCl, Estramustine phosphate sodium, Etoposide (V16-213), 
Floxuridine, 5-Fluoroiiracil (5-Fu), Flutamide, Hydroxyurea (hydroxy carbamide), Ifosfamide, 
Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 
8 Lomustine, Mechlorethamine HCl (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, 

Methotrexate (MTX), Mitomycin, Mitoxantrone HCl, Octreotide, Plicamycin, Procarbazine HCl, 
Streptozocin, Tamoxifen citrate, Tliioguanine, Thiotepa, Vinblastine s\alfate. Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 

12 Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
treatment of cancer. There are hereditary conditions and/or enviromnental situations {e.g. 
exposure to carcinogens) known in the art that predispose an individual to developing cancers. 

1 6 Under these circumstances, it may be beneficial to treat these individuals with therapeutically 
effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays of 

20 cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1 987) Culture of 
Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 18 and Ch 21), 
tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst, 52: 921-30 
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in 

24 Pilkington et al.. Anticancer Res., 17: 41 07-9 (1 997), and angiogenesis assays such as induction 
of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial 
cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1 189-97 (1999) and Li et al., 
CliiL Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, 

28 e.g. from American Type Tissue Culture Collection catalogs. 

4.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
32 receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors 
and ligands include, without limitation, cj^okine receptors and their ligands, receptor kinases and 
their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions 
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and their ligands (including without limitation, cellular adhesion molecules (such as selectins, 
integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses. Receptors and ligands 
4 are also useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/Ugand interaction. A protein of the present invention (including, without limitation, 
fragments of receptors and Hgands) may themselves be useful as inhibitors of receptor/ligand 
interactions. 

8 The activity of a polypeptide of the invention may, among other means, be measured by 

the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described in: 

Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Knnsbeek, D. H. Margulies, E. M. 
12 Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, 

Meastirement of Cellular Adhesion under static conditions 7.28.1- 7.28.22), Talcai at al., Proc. 

Nati. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145-1156, 1988; 

Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 
16 175:59-68, 1994; Stitt et al.. Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 

ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified 

through binding assays, affmity chromatography, dihybrid screening assays, BIAcore assays, gel 
20 overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a 

partial antagonist require the use of other proteins as competing ligands. The polypeptides of the 

present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 
24 colorimetric molecules or a toxin molecules by conventional methods. ("Guide to Protein 

Purification" Murray P. Deutscher (ed) Methods in Enzsmiology Vol. 1 82 (1990) Academic 

Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and 

carbon-14 . Examples of colorimetric molecules include, but are not limited to, fluorescent 
28 molecules such as fluorescamine, or rhodannine or other colorimetric molecules. Examples of 

toxins include, but are not limited, to ricin. 

4.10.13 DRUG SCREENING 

32 This invention is particularly useftil for screening chemical compounds by using the 

novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. 
The polypeptides or fragments employed in such a test may either be free in solution, affixed to a 
solid support, borne on a cell surface or located intracellularly. One method of drug screening 
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utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the polypeptide or a fragment thereof Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can 
4 be used for standard binding assays. One may measure, for example, the formation of 

complexes between polypeptides of the invention or fragments and the agent being tested or 
examine the diminution in complex formation between the novel polypeptides and an 
appropriate cell line, which are well known in the art. 
8 Sources for test compounds that may be screened for ability to bind to or modulate (i.e., 

increase or decrease) the activity of polypeptides of the invention include (1) inorganic and 
organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

12 Chemical libraries may be readily synthesized or purchased from a number of 

commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and 

16 fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 

screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine 
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include 
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a 

20 review, see Science 252:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or 
organic compounds and can be readily prepared by traditional automated synthesis methods, 
PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and 

24 oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, 
peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. 
For a review of combinatorial chemistry and libraries created therefirom, see Myers, Curr. Opin. 
Btotechnol. 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see 

28 Al-Obeidi et al., Mol. Btotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol, 

1(1):114-19 (1997); Domer et al., BioorgMed Chem, 4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein permits 
modification of the candidate "hif ' (or "lead") to optimize the capacity of the "hit" to bind a 

32 polypeptide of the invention. The molecules identified in the binding assay are then tested for 
antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the 
art. In brief, the.molecules are titrated into a plurality of cell cultures or animals and then tested 
for either cell/animal death or prolonged survival of the animaiycells. 
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The binding molecules thios identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the specificity of the binding 
4 molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

8 The invention also provides methods to detect specific binding of a polypeptide e.g. a 

hgand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknovm binding partners for receptor polypeptides of the invention. For example, 
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used 

12 to identify polynucleotides encoding binding partners. As another example, affinity 

chromatography with the appropriate immobilized polypeptide of Ihe invention can be used to 
isolate polypeptides that recognize and bind polypeptides of the invention. There are a number 
of different Hbraries used for the identification of compounds, and in particular small molecules, 

1 6 that modulate (i.e., increase or decrease) biological acti\'ity of a polypeptide of the invention. 
Ligands for receptor polypeptides of the invention can also be identified by adding exogenous 
ligands, or cocktails of ligands to two cells populations that are genetically identical except for 
the expression of the receptor of the invention: one cell population expresses the receptor of the 

20 invention whereas the other does not. The response of the two cell populations to the addition of 
ligands(s) are then compared. Alternatively, an expression library can be co-expressed with the 
polypeptide of the invention in cells and assayed for an autocrine response to identify potential 
ligand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known 

24 in the art can be used to identify binding partner polypeptides, including, (1) organic and 
inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of random peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade of the 

28 polypeptide of the invention can be determined. For example, a chimeric protein in which the 
cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a 
protein, whose ligand lias been identified, is produced in a host cell. The cell is then incubated 
with the ligand specific for the extracellular portion of the chimeric protein, thereby activating 

32 the chimeric receptor. Known downstream proteins involved in intracellular signaling can then 
be assayed for expected modifications i.e. phosphorylation. Other methods known to those in the 
art can also be used to identify signaling molecules involved in receptor activity. 
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4.10.15 ANTI-INFLAMIVIATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the 
4 inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, 
cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an inflammatory response. Compositions 
8 with such activities can be used to treat inflammatory conditions including chronic or acute 
conditions), including without limitation intimation associated with infection (such as septic 
shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfiision injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 

12 chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from 
over production of cj^okines such as TNF or IL-1 . Compositions of the invention may also be 
useftil to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 
Compositions of this invention may be utilized to prevent or treat conditions such as, but not 

1 6 limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 
arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, 
graft versus host disease, iaflammatory bowel disease, inflanaation associated with pulmonary 
disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 

20 acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 



4.10.16 LEUKEMIAS 

24 Leukemias and related disorders may be treated or prevented by axiministration of a 

therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders include but are not limited to acute leukemia, 
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, promyelocytic, 

28 myelomonocjftic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic 

(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 



32 4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention vnth compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
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therapeutic utility, include but are not limited to ner\'ous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or'degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient (including 
4 human and non-human mammalian patients) according to the invention include but are not 
Hmited to the following lesions of either the central (including spinal cord, brain) or peripheral 
nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated with 
8 surgery, for example, lesions which sever a portion of the nervous system, or compression 

injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 

1 2 infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by human 
immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 

16 tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration associated 
with Parldnson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 

20 sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of the 
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B 12 deficiency, folic acid deficiency, Wernicke disease, 

24 tobacco-alcohol amblyopia, Marchiafeva-Bignami disease (primary degeneration of the corpus 
callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not limited to 
diabetes (diabetic neuropathy. Bell's palsy), systemic lupus erjrthematosus, carcinoma, or 

28 sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions hi which a portion of the nervous system is destroyed or 
32 injured by a demyelinating disease including but not limited to multiple sclerosis, human 

immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 
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Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 
4 any of the following effects may be usefiil according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g., 
8 choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method set 

12 forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may 
be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. 
(1981, Arm. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may 
be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., 

1 6 depending on the molecule to be measured; and motor neuron dysfimction may be measured by 
assessing the physical manifestation of motor neuron disorder, e.g., vreakness, motor neuron 
conduction velocity, or fiinctional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 

20 invention include but are not limited to disorders such as infarction, infection, exposure to toxin, 
trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as 
well as other components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal 

24 muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 
muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), 
poliomyelitis and the post polio syndrome, and Hereditary Motorsen&ory Neuropathy 
(Charcot-Marie-Tooth Disease). 

28 

4.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or fimction of, or killing, infectious agents, 
32 including, without limitation, bacteria, viruses, fimgi and other parasites; effecting (suppressing 
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye 
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape 
(such as, for example, breast augmentation or diminution, change in bone form or shape); 
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effecting biorhj-thms or circadian cycles or rhythms; effecting the fertility of male or female 
subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or 
elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other 

4 nutritional factors or component(s); effecting behavioral characteristics, including, without 
limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 
(including depressive disorders) and violent behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other 

8 than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such 
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen 
12 in a vaccine composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

4.10.19 IDENTIFICATION OF POLYMORPHISMS 

1 6 The demonstration of poljanorphisms makes possible the identification of such 

polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis 
and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or 
susceptibility to various disease states (such as disorders involving inflammation or immune 

20 response) or a differential response to drug administration, and this genetic information can be 
used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
poljTtnorphism associated with a predisposition to inflammation or autoimmune disease makes 
possible the diagnosis of this condition in humans by identifying the presence of the 

24 polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample irom a patient, analyzing DNA from the sample, optionally 
involvmg isolation or amplification of the DNA, and identifying the presence of the 

28 poljonorphism in the DNA. For example, PCR may be used to amplify an appropriate fi-agment 
of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected to 
allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are 
hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a 

32 single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately 

adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). 
In addition, traditional restriction fragment length polymorphism analysis (using restriction 
enzjmes that provide differential digestion of the genomic DNA depending on the presence or 
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absence of the polymorphism) may be perfomed. Arrays with nucleotide sequences of the 
present invention can be used to detect polymorphisms. The array can comprise modified 
nucleotide sequences of the present invention in order to detect the nucleotide sequences of the 

4 present invention. In the alternative, any one of the nucleotide sequences of the present 
invention can be placed on the array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino add sequence of the protein, e.g., 

8 by an antibody specilBc to the variant sequence. 

4.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 

12 arthritis is determined in an experimental animal model system. The experimental model system 
is adjuvant induced arthritis ui rats, and the protocol is described by J. Holoshitz, et at., 1 983, 
Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol., 23:129. 
Induction of the disease can be caused by a single injection, generally intradermally, of a 

16 suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The 
route of injection can vary, but rats may be injected at the base of the tail with an adjuvant 
roixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 
1-5 mg/kg. The control consists of administering PBS only. 

20 The procedure for testing the effects of the test compound would consist of intradermally 

injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the 
test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 
24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as 

24 described by J. Holoskitz above. An analysis of the data would reveal that the test compound 
would have a dramatic affect on the swelling of the joints as measured by a decrease of the 
arthritis score. 

28 4.11 THERAPEUTIC METHODS 

The compositions (including polji^jeptide fragments, analogs, variants and antibodies or 
other binding partners or modulators including antisense polynucleotides) of the invention have 
numerous applications in a variety of therapeutic methods. Examples of therapeutic applications 
32 include, but are not limited to, those exemplified herein. 

4.I1.I EXAMPLE 
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One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. ^\''hile the mode of 
4 administration is not particularly important, parenteral administration is preferred. An 
exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will nomially be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, weight, 
8 concJition and response of the individual patient. Typically, the amount of polypeptide 

administered per dose will be in the range of about 0.01|j,g/kg to 100 mg/kg of body weight, with 
the preferred dose being about 0. 1 |J.g/kg to 1 0 mg/kg of patient body weight. For parenteral 
administration, polypeptides of the invention will be formulated m an injectable form combined 

12 with a pharmaceutically acceptable parenteral vehicle. Such veliicles are well known in the art 
and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting 
of small amounts of tlie human serum albumin. The vehicle may contain minor amounts of 
additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. 

1 6 The preparation of such solutions is within the skill of the art. 



4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

20 A protein or other composition of the present invention (from whatever source derived, 

includmg without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered 
to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable 

24 carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 
may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 

28 effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of the 
invention may also contain cytokines, lymphokines, or other hematopoietic factors such as 
M-CSF, GM-CSF, TN¥, IL-l, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, 

32 IL-13, IL-14, IL-15, IFN, TNFO, TNFl, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 
factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
Avith other agents beneficial to the treatment of the disease or disorder in question. These agents 
include various growth factors such as epidermal growth factor (EGF), platelet-derived growth 
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factor (PDGF), transforming gro\vth factors (TGF-a and TGF-P), insulin-like growth factor 
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which either enhance 
4 the activity of the protein or other active ingredient or complement its activity or use in 
treatment. Such additional factors and/or agents may be included in the pharmaceutical 
composition to produce a synergistic effect with protein or other active ingredient of the 
invention, or to minimize side effects. Conversely, protein or other active ingredient of the 
8 present invention may be included in formulations of the particular clotting fector, cytokine, 
lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti- 
inflammatory agent to mimmi2B side effects of the clotting factor, cytokine, lymphokine, other 
hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as 

12 IL-lRa, IL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein 
of the present invention may be active in multimers (e.g., heterodimers or homodimers) or 
complexes with itself or other proteins. As a result, pharmaceutical compositions of the 
invention may comprise a protein of the invention in such multimeric or complexed form. 

16 As an alternative to being included in a pharmaceutical composition of the invention 

including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein {e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 

20 Techniques for formiilation and administration of tihe compounds of the instant application may 
be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition. A therapeutically effective dose fitrther refers to that amount of the compound sufficient 
to result in amelioration of symptoms, e.g. , treatment, healing, prevention or amelioration of the 

24 relevant medical condition, or an increase in rate of treatment, healing, prevention or 

amelioration of such conditions. When applied to an individual active ingredient, administered 
alone, a therapeutically effective dose refers to that ingredient alone. When applied to a 
combination, a therapeutically effective dose refers to combined amounts of the active 

28 ingredients that result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 

In practicing the method of treatment or use of the present invention, a therapeutically 
effective amount of protein or other active ingredient of the present invention is administered to 
32 a mammal having a condition to be treated. Protein or other active ingredient of the present 

invention may be administered in accordance with the method of the invention either alone or in 
combination with other tlierapies such as treatments employing cytokines, lymphokines or other 
hematopoietic factors. When co- administered with one or more cytoldnes, lymphokines or other 
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hematopoietic factors, protein or other active ingredient of the present invention may be 
administered either simultaneously -with the cytokine(s), lymphokine(s), other hematopoietic 
factor(s), tlirombo lytic or anti-thrombotic factors, or sequentially. If administered sequentially, 
4 the attending physician will decide on the appropriate sequence of administering protein or other 
active ingredient of the present invention in combination with cytoldne(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors. 

8 4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, transmucosal, or 
intestinal administration; paienteral deliveiy, including intramuscular, subcutaneous, 
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, 

12 intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active 
ingredient of the present invention used in the pharmaceutical composition or to practice the 
method of the present invention can be carried out in a variety of conventional ways, such as oral 
ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral 

16 or intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic manner, for 
example, via injection of the compound directly into a arthritic joints or in fibrotic tissue, often 
in a depot or sustained release formulation. In order to prevent the scarring process frequently 

20 occurring as complication of glaucoma surgery, the compounds may be administered topically, 
, for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery 
system, for example, in a liposome coated with a specific antibody, targeting, for example, 
arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the 

24 afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an effective 
dosage to the desired site of action. The determination of a suitable route of administiation and 
an effective dosage for a particular indication is within the level of skill in the art. Preferably for 
28 wound treatment, one administers the therapeutic compound direcfly to the site. Suitable dosage 
ranges for the polypeptides of the invention can be extrapolated fi-om these dosages or from 
similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the 
clinician to provide maximal therapeutic benefit 

32 

4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus may 
be formulated in a conventional manner using one or more physiologically acceptable carriers 
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comprising excipients and auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutically. These pharmaceutical compositions may be 
manufactured in a manner that is itself loiown, e.g. , by means of conventional mixing, 
4 dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 
lyophilizing processes. Proper formulation is dependent upon the route of administration 
chosen. When a therapeutically effective amount of protein or other active ingredient of the 
present invention is administered orally, protein or other active ingredient of the present 
8 invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered 
in tablet form, the pharmaceutical composition of the invention may additionally contain a solid 
carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 
95% protein or other active ingredient of the present invention, and preferably from about 25 to 

12 90% protein or other active ingredient of the present invention. When administered in liquid 
form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, 
mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the 
pharmaceutical composition may further contain physiological saline solutLon, dextrose or other 

16 saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. 
When administered in liquid form, the pharmaceutical composition contains from about 0.5 to 
90% by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

20 When a therapeutically effective amount of protein or other active ingredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or 
other active ingredient of the present invention will be in the form of apyrogen-fiee, parenterally 
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other 

24 active ingredient solutions, having due regard to pH, isotonicity, stability, and the Uke, is within 
the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or 
subcutaneous injection should contain, in addition to protein or other active iagredient of the 
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, 

28 Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or 

other vehicle as known in the art. The pharmaceutical composition of the present invention may 
also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of 
skill in the art. For injection, the agents of the invention may be formulated in aqueous 

32 solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's 
solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate 
to the barrier to be permeated are used in the formulation. Such penetrants are generally known 
in the art. 
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For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 
4 hquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 
treated. Pharmaceutical preparations for oral use can be obtained from a sohd excipient, 
optionally grinding a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in 
8 particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, 
gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodiiim 
carbox3fmethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents 

12 may be added, such as the cross-linked polyvinyl pyrrolidone, ^ar, or alginic acid or a salt 
thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this 
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 

16 solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be 

added to the tablets or dragee coatings for identification or to characterize different combinations 
of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 

20 gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 

sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 

24 suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, tiie compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

28 For administration by inhalation, the compounds for use according to the present 

invention are conveniently delivered in the form of an aerosol spray presentation fiom 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g. , 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or 

32 other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by 
providing a valve to deliver a metered amount. Capsules and cartridges o^ e.g., gelatin for use 
in an inhaler or insufflator may be formulated containing a powder mix of the compoimd and a 
suitable powder base such as lactose or starch. The compounds may be formulated for parenteral 
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administration by injection, e.g., by bolus injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with 
an added preservative. The compositions may take such forms as suspensions, solutions or 
4 emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, 
stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
the active compounds in water-soluble form. Additionally, suspensions of the active compounds 
8 may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or 

vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or 
triglycerides, or liposomes. Aqueous injection suspensions may contain substances which 
increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 

12 dextran. Optionally, the suspension may also contain suitable stabilizers or agents which 
increase the solubility of the compounds to allow for the preparation of highly concentrated 
solutions. Alternatively, the active ingredient may be in powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-fiee water, before use. 

1 6 The compounds may also be formulated in rectal compositions such as suppositories or 

retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 
glycerides. In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 

20 implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt 

24 A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent 

system comprising benzyl alcohol, a nonpolar surfectant, a water-miscible organic polymer, and 
an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution 
of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v 

28 polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system 
(VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water solution. This co-solvent 
system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic 
administration. Naturally, the proportions of a co-solvent system may be varied considerably 

32 without destroying its solubility and toxicity characteristics. Furthermore, the identity of the 

co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may 
be used instead of polysorbate 80; the fiaction size of polyethylene glycol may be varied; other 
biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other 
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siigars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for 
hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well 
known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents 
4 such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. 
Various types of sustained-release materials have been established and are well known by those 
8 skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and tlie 
biological stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 

12 The pharmaceutical compositions also may comprise suitable solid or gel phase carriers 

or excipients. Examples of such carriers or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. Many of the active ingredients of the invention may be 

16 provided as salts with phamaaceutically compatible counter ions. Such pharmaceutically 
acceptable base addition salts are those salts which retain the biological effectiveness and 
properties of the fiee acids and which are obtained by reaction with inorganic or organic bases 
such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, diaUcylamine, 

20 monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and 
the like. 

The pharmaceutical composition of the invention may be in the form of a complex of the 
protein(s) or other active ingredient(s) of present invention along with protein or peptide 

24 antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins including 

28 those encoded by class I and class 11 MHC genes on host cells will serve to present the peptide 
antigen(s) to T lymphocytes. The antigen components could also be supplied as purified 
MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells. 
Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as 

32 well as antibodies able to bind the TCR and other molecules on T cells can be combined with the 
pharmaceutical composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a liposome in 
which protein of the present invention is combined, in addition to other pharmaceutically 
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acceptable carriers, with amphipatliic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable 
Lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, 
4 sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the hke. Preparation of such 
Hposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. 
Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated 
herein by reference. 

8 The amount of protein or other active ingredient of the present invention in the 

pharmaceutical composition of tlie present invention will depend upon the nature and severity of 
the condition being treated, and on the nature of prior treatments which the patient has 
undergone. Ultimately, the attending physician will decide the amoimt of protein or other active 

12 ingredient of the present invention with which to treat each individual patient. Initially, the 

attending physician will administer low doses of protein or other active ingredient of the present 
invention and observe the patient's response. Larger doses of protein or other active ingredient 
of the present invention may be administered until the optimal therapeutic effect is obtained for 

16 the patient, and at that point the dosage is not increased further. It is contemplated that the 

various pharmaceutical compositions used to practice the method of the present invention should 
contain about 0.01 (ig to about 100 mg (preferably about 0.1 fig to about 10 mg, more preferably 
about 0.1 ng to about 1 mg) of protein or other active ingredient of the present invention per kg 

20 body weight. For compositions of the present invention which are useful for bone, cartilage, 
tendon or hgament regeneration, the therapeutic method includes administering the composition 
topically, systematically, or locally as an implant or device. When administered, the therapeutic 
composition for use in this invention is, of cohrse, in a pyrogen-free, physiologically acceptable 

24 form. Further, the composition may desirably be encapsulated or injected in a viscous form for 
delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable 
for wound healing and tissue repair. Therapeutically useful agents other than a protein or other 
active ingredient of the invention which may also optionally be included in the composition as 

28 described above, may alternatively or additionally, be administered simultaneously or 

sequentially with the composition in the methods of the invention. Preferably for bone and/or 
cartilage formation, the composition would include a matrix capable of deUvering the 
protein-containing or other active ingredient-containing composition to the site of bone and/or 

32 cartilage damage, providing a structure for the developing bone and cartilage and optimally 

capable of being resorbed into Ihe body. Such matrices may be formed of materials presently in 
use for other implanted medical applications. 



70 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 

The choice of matrix material is based on biocompatibility, biodegradability, mechanical 
properties, cosmetic appearance and interface properties. The particular application of the 
compositions will define the appropriate formulation. Potential matrices for the compositioiis 

4 may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, 

hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials 
are biodegradable and biologically well-defmed, such as bone or dermal collagen. Further 
matrices are comprised of pure proteins or extracellular matrix components. Other potential 

8 matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above 
mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalcium phosphate. The bioceramics may be altered in composition, such as in 
12 calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and 
glycolic acid in the form of porous particles having diameters ranging from 1 50 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl 
16 cellulose or autologous blood clot, to prevent the protein compositions from disassociating from 
the matrix. 

A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses 
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, 

20 hydroxyethylcellulose, hydroxsrpropylcellulose, hydroxypropyl-methylcellulose, and 

carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose 
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, 
poly(ethylene glycol), polyoxyethylene oxide, carboxj^vinyl polymer and poly(vinyl alcohol). 

24 The amount of sequestering agent usefiil herein is 0.5-20 wt %, preferably 1-10 wt % based on 
total formulation weight, which represents the amount necessary to prevent desorption of the 
protein from the polymer matrix and to provide appropriate handling of the composition, yet not 
so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the 

28 protein the opportunity to assist the osteogenic activity of the progenitor cells. In fiirther 

compositions, proteins or other active ingredients of the invention may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth fector (EGF), 

32 platelet derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-P), and 
insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinarj'^ applications. 
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
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patients for such treatment with proteins or other active ingredients of the present invention. The 
dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 
regeneration will be determined by the attending physician considering various factors which 
4 modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of 
damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., 
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution 
8 and with inclusion of other proteins in the pharmaceutical composition. For example, the 

addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final 
composition, may also effect the dosage. Progress can be monitored by periodic assessment of 
tissue/bone growth and/or repair, for example. X-rays, histomorphometric determinations and 

12 tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression m a 
mammaUan subject. Polynucleotides of the invention may also be administered by other known 

1 6 methods for introduction of nucleic acid into a cell or organism (including, vsdthout limitation, in 
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 

20 

4.12.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amoimt to achieve its 

24 intended purpose. More specifically, a therapeutically effective amount means an amount 
effective to prevent development of or to alleviate the existing symptoms of the subject being 
treated. Determination of the effective amount is well within the capability of those skilled in 
the art, especially in hght of the detailed disclosure provided herein. For any compound used in 

28 the method of the invention, the therapeutically effective dose can be estimated initially from 
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a 
circulating concentration range that can be used to more accurately determine useful doses in 
humans. For example, a dose can be formulated in animal models to achieve a circulating 

32 concentration range that includes tfcie IC50 as determined m cell culture (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of the protein's biological activity). 
Such information can be used to more accurately determine usefiil doses in humans. 
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A therapeutically effective dose refers to that anaount of the compound that results in 
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical procedures in cell 
4 cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the 
population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose 
ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the 
ratio between LD50 and ED50. Compounds which exhibit high therapeutic indices are preferred. 
8 The data obtained from these cell culture assays and animal studies can be used in formulating a 
range of dosage for use in human. The dosage of such compounds lies preferably within a range 
of circulating concentrations that include the ED50 with little or no toxicity. The dosage may 
vary within this range depending upon the dosage form employed and the route of administration 

1 2 utilized. The exact formulation, route of administration and dosage can be chosen by the 
individual physician in view of the patient's condition. See, e.g., Fingl at al., 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1 p.l. Dosage amoimt and interval may be adjiisted 
individually to provide plasma levels of the active moiety which are sufficient to maintain the 

1 6 desired effects, or minimal effective concentration (MEC). The MEC will vary for each 

compoimd but can be estimated from in vitro data. Dosages necessary to achieve the MEC will 
depend on individual characteristics and route of administration. However, HPLC assays or 
bioassays can be used to determine plasma concentrations. 

20 Dosage intervals can also be determined using ^^EEC value. Compounds should be 

administered using a regimen which maintains plasma levels above the MEC for 10-90% of the 
time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 

24 related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the invention 
will be in the range of about 0.01 \Lg/kg to 100 mg/kg of body weight daily, with the preferred 
dose being about 0.1 \ig/kg to 25 mg/kg of patient body weight daily, varying in adults and 

28 children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter 
intervals. 

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
32 administration and thejudgmentofthe prescribing physician. 

4.12.4 PACKAGING 
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The compositions may, if desired, be presented in a pack or dispenser device which may 
contain one or more unit dosage forms containing the active ingredient. The paclc may, for 
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
4 be accompanied by instructions for administration. Compositions comprising a compound of the 
invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition. 



8 4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the 
invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain 

12 an antigen binding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain. Fab, Fab' and F(ab')2 
fragments, and an Fab expression library. In general, an antibody molecule obtained from 
humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another 

16 by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, 
such as IgGi, IgG2, and otliers. Furthermore, in humans, the light chain may be a kappa chain or 
a lambda chain. Reference herein to antibodies includes a reference to all such classes, 
subclasses and types of human antibody species. 

20 An isolated related protein of the invention may be intended to serve as an antigen, or a 

portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecificaUy bind the antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The fiill-length protein can be used or, alternatively, the 

24 invention provides antigenic peptide fragments of the antigen for use as immunogens. An 

antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence 
of the fiill length protein, (for example the amino acid sequence shown in SEQ ID NO: 1351), 
and encompasses an epitope thereof such that an antibody raised against the peptide forms a 

28 specific immune complex with the fiill length protein or with any fragment that contains the 
epitope. Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 
1 5 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 

32 located on its surface; commonly these are hydrophilic regions. 

In certain embodimeats of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of -related protein that is located on the surface of the protein, e.g., a 
hydrophihc region. A hydrophobicity analysis of the human related protein sequence will 
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indicate which regions of a related protein are particularly hydrophilic and, therefore, axe likely 
to encode surface residues useful for targeting antibody production. As a means for targeting 
antibody production, hydropathy plots showing regions of hydrophilicity and hydrophobicity 
4 may be generated by any method well known in the art, including, for example, the Kyte 

Doolittle or the Hopp Woods methods, eitlier with or without Fourier transformation. See, e.g., 
Hopp and Woods, 1981, Proc. Nat. Acad. Set USA 78: 3824-3828; Kyte and Doolittle 1982, J. 
Mol. Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 
8 Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, 
firagments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
12 immimospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal or 
monoclonal antibodies directed against a protein of the invention, or against derivatives, 
fragments, analogs homologs or orfcologs thereof (see, for example. Antibodies: A Laboratory 
16 Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 

5.13.1 Poljxlonal Antibodies 

20 For the production of polyclonal antibodies, various suitable host animals {e.g. , rabbit, 

goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 

24 protein, a chemically synthesized polypeptide representiag the immunogenic protein, or a 

recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated to 
a second protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 

28 bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can fiorther include an 
adjuvant. Various adjuvants used to increase the immunological response include, but are not 
limited to, Freund's (complete and incomplete), mineral gels ie.g., aluminum hydroxide), surface 
active substances {e.g. , lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 

32 dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and 

Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of 
adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, 
synthetic trehalose dicorynomycolate). 
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The polyclonal antibody molecules directed against the iromunogenic protein can be 
isolated from the mammal {e.g. , from tlie blood) and further purified by well known techniques, 
such as affmity chromatography using protein A or protein G, which provide primarily the IgG 

4 fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the 

target of fbe immunoglobulin sought, or an epitope thereof, may be immobilized on a coliimn to 
purify the immune specific antibody by immunoaffinity chromatography. Purification of 
immunoglobuHns is discussed, for example, by D. Wilkinson (The Scientist, published by The 

8 Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 

5.13.2 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 

12 herein, refers to a population of antibody molecules that contain only one molecular species of 
antibody molecule consisting of a unique Ught chain gene product and a unique heavy chain 
gene product. In particular, the complementarity determining regions (CDRs) of the monoclonal 
antibody are identical in all the molecules of the population. MAbs thus contain an antigen 

1 6 binding site capable of immimoreacting with a particular epitope of the antigen characterized by 
a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature. 256:495 (1975). In a hybridoma method, a mouse, 

20 hamster, or other appropriate host animal, is typically immuni2Bd with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizirg agent will typically include the protein antigen, a fragment thereof or a 

24 fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal 

28 Antibodies: Principles and Practice. Academic Press, (1986) pp. 59-103). Immortalized cell 

lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can 
be cultured in a suitable culture medium that preferably contains one or more substances that 

32 inhibit the growth or survival of the unfiised, immortalized cells. For example, if the parental 
cells lack the en2yme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and 
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 
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Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody -producing cells, and are sensitive to a medium 
such as HAT medium. More prefen-ed immortalized cell lines are murine myeloma lines, which 

4 can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 

California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies (Kozbor, J. Immunol. . 133:3001 (1984); Brodeur et al.. Monoclonal 

8 Antibody Production Tecbndques and Applications . Marcel Dekker, Inc., New York, (1987) pp. 
51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed for 
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 

1 2 specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the 
art. The binding affinity of the monoclonal antibody can, for example, be determined by the 

16 Scatchard analysis of Munson and Pollard. Anal. Biochem.. 107:220 (1980). Preferably, 

antibodies having a high degree of specificity and a high binding affinity for the target antigen 
are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 

20 dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified fi'om the 

24 culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 

28 those described in U.S. Patent No. 4,816,567. DNA encodirg the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures {e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 

32 source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
monoclonal antibodies in Hie recombinant host cells. The DNA also can be modified, for 
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example, by substituting the coding sequence for human heavy and Hght chain constant domains 
in place of the homologous murine sequences (U. S. Patent No. 4,81 6,567; Morrison, Nature 368 . 
812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
4 coding sequence for a non-immunoglobulin polypeptide. Such a non-immujioglobuHn 

polypeptide can be substituted for the constant domains of an antibody of the invention, or can 
be substituted for the variable domains of one antigen-combinlBg site of an antibody of the 
invention to create a chimeric bivalent antibody. 



5.13.2 Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further comprise 
humanized antibodies or human antibodies. These antibodies are suitable for administration to 

12 humans without engendering an immune response by the human against the administered 
immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or j&agmeiits thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 

1 6 immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones at al.. 
Nature. 321:522-525 (1986); Riechmann et al.. Nature. 332:323-327 (1988); Verhoeyen et al.. 
Science. 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 

20 correspondiog sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
instances, Fv framework residues of the human immunoglobulin are replaced by coriresponding 
non-human residues. Humanized antibodies can also comprise residues which are found neither 
in the recipient antibody nor in the imported CDR or framework sequences. In general, the 

24 humanized antibody will comprise substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 

28 least a portion of an immunoglobulia constant region (Fc), typically that of a human 

immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol.. 




Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fiilly human antibodies" herein. 



8 




2:593-596 (1992)). 



32 
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Human monoclonal antibodies can be prepared by the trioma technique; tiie human B-cell 
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1 985 In: Monoclonal 
4 Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 

antibodies may be utilized in the practice of the present invention and may be produced by using 
human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by 
transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: 
8 Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol. , 227 :381 (1991); 
Marks et al., J. Mol. Biol.. 222:581 (1991)). Similarly, human antibodies can be made by 

1 2 introducing human immunoglobulin loci into transgenic animals, e.g. , mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans 
in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach 

16 is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 

5,633,425; 5,661,016, and in Marks et al. (Bio/Technology. 10. 779-783 (1992)); Lonberg et al. 
(Natiffe 368 856-859 (1994)); Morrison ( Nature 368. 812-13 (1 994)); Fishwild et al,( Natoe 
Biotechnology 14, 845-51 (1996)); Neuberger (Nature Biotechnology 14, 826 (1996)); and 

20 Lonberg and Huszar (Intern. Rev. Immunol. 13 65-93 (1 995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal' s endogenous 
antibodies in response to challenge by an antigen. (See POT publication WO94/02602). The 

24 endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host 
have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins 
are inserted into the host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing flie requisite human DNA segments. An animal which 

28 provides all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the Ml complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse™ 
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B 

32 cells which secrete fiilly human immunoglobulins. The antibodies can be obtained directly from 
the animal after immunization with an inmiunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
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immunoglobulins with human variable regions can be recovered and expressed to obtain the 

antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 

example, single chain Fv molecules. 
4 An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 

expression of an endogenous imniunoglobulin heavy chain is disclosed in U.S. Patent No. 

5,939,598. It can be obtained by a method including deleting the J segment genes from at least 

one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 
8 locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy'chain locus, 

the deletion being effected by a targeting vector containing a gene encoding a selectable marker; 

and producing jfrom the embryonic stem cell a transgenic mouse whose somatic and germ cells 

contain the gene encoding the selectable marker. 
12 A method for producing an antibody of interest, such as a human antibody, is disclosed in 

U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a 

nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing 

an expression vector containing a nucleotide sequence encoding a light chain into another 
16 mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an 

antibody contedning the heavy chain and the light chain. 

In a furtiier improvement on this procedure, a method for identifying a clinically relevant 

epitope on an immunogen, and a correlative method for selecting an antibody that binds 
20 immunospecifically to the relevant epitope with high affinity, are disclosed in PCX publication 

WO 99/53049. 

5.13.4 Fab Fragments and Single Chain Antibodies 

24 According to the invention, techniques can be adapted for the production of single-chain 

antibodies specific to an antigenic protein of the invention (see e.g. , U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of Fab expression libraries (see e.g., 
Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of 

28 monoclonal Fab fiagments with the desired specificity for a protein or derivatives, fragments, 
analogs or homologs thereof Antibody fragments that contain the idiotypes to a protein antigen 
may be produced by techniques known in the art including, but not limited to; (i) an F(ab')2 
firagment produced by pepsin digestion of an antibody molecule; (ii) an Fab fragment generated 

32 by reducing the disulfide bridges of an ¥(ab'y2 firagment; (iii) an Fab firagment generated by the 
treatment of tiie antibody molecule with papain and a reducing agent and (iv) Fy fragments. 
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Bispecific antibodies are monocloxial, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is any 
4 other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 

Methods for raalcing bispecific antibodies are Icno-WTi in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
8 specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random 

assortment of immunoglobulin heavj' and light chains, these hybridomas (quadromas) produce a 
potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by affinity 

12 chromatography steps. Similar procedures are disclosed in WO 93/08829, pubHshed 13 May 
1993, andinTraunecker era/., 1991 EMBOJ., 10:3655-3659. 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fiised to immunoglobulin constant domain sequences. The fusion 

16 preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain flisions and, if desired, the immunoglobulin 

20 light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et 
al, Methods in Enzvmology. 121:210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a pak 

24 of antibody molecules can be engineered to maximize the percentage of heterodimers which are 
recovered fi:om recombinant cell culture. The preferred interface comprises at least a part of the 
CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains ftom the interface of the first antibody molecule are replaced with larger side chains (e.g. 

28 tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on tiie interface of the second antibody molecule by replacing large amino 
acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 
increasing Ihe yield of the heterodimer over other unwanted end-products such as homodimers. 

32 Bispecific antibodies can be prepared as fiill length antibodies or antibody firagments (e.g. 

F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies firom antibody 
fi"agments have been described in the literature. For example, bispecific antibodies can be 
prepared using chemical linkage. Brennan et al.. Science 229:81 (1985) describe a procedure 
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wherein intact antibodies ai-e proteolytically cleaved to generate F(ab')2 firagments. These 
fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabihze vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
4 generated ai-e then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconverted to the Fab' -thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
8 immobiHzation of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab' fragment 

12 was separately secreted from E. coli and subjected to directed chemical coupling in vifro to form 
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity 
of human cytotoxic lymphocj^tes against human breast tumor targets. 

1 6 Various techniques for making and isolating bispecific antibody fragments directly from 

recombinant cell culture have also been described. For example, bispecific antibodies have been 
produced using leucine zippers. Kostehiy et al., J. Immunol. 148(5):1547-1553 (1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 

20 different antibodies by gene fiision. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can 
also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 

24 alternative mechanism for making bispecific antibody fragments. The fragments comprise a 
heavy-chain variable domain (Vh) connected to a light-chain variable domain (Vl) by a linker 
which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the Vh and Vl domains of one fragment are forced to pair with the complementary Vl and Vh 

28 domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 

32 antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an 
immunoglobulin molecule can be combmed vAih an arm which binds to a triggering molecule on 
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a leukocyte such as a T-cell receptor molecule (e.g. CD2, CDS, CD28, or B7), or Fc receptors for 
IgG(Fc R), suchasFc RI (CD64), Fc RII (CD32) and Fc RIII (CD 16) so as to focus 
cellular defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies 
4 can also be used to direct cytotoxic agents to cells which express a particular antigen. These 
antibodies possess an antigen-binding arm and an arm which binds a cytotoxic agent or a 
radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody 
of interest binds the protein antigen described herein and fiirther binds tissue factor (TF). 

8 

5.13.6 Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within tlie scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 

12 have, for example, been proposed to target immune system cells to rmwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of fflV infection (WO 91/00360; WO 92/200373; BP 03089). 
It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic 
protein chemistry, including those involving crosslinking agents. For example, immunotoxins 

16 can be constructed using a disulfide exchange reaction or by formmg a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed,' for example, in U.S. Patent No. 4,676,980. 

20 5.13.7 Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector fiinction, so as 
to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine 
residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide bond 

24 formation m this region. The homodimeric antibody thus generated can have improved 
internalization capabihty and/or increased complement-mediated cell killing and antibody- 
dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med., 176: 1191-1195 (1992) 
and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti- 

28 tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff 
et al. Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that 
has dual Fc regions and can thereby have enhanced complement lysis and ADCC capabilities. 
See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 



5.13.8 Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a 
cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of 
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bacterial, fbngal, plant, or animal origin, or fragments thereof), or a radioactive isotope {le., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been 
described above. Enzymaticaily active toxins and fragments thereof that can be used include 
diphtheria A chain, nonbinding active fi-agments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 
PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogelHn, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples include 
^'^Bi, '^'I, '^'ln,^«Y,and'«^Re. 

Conjugates of the antibody and cj^otoxic agent are made using a variety of bifunctional 
protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), 
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), 
active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido 
compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as 
bis-(p-diazomumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), 
and bis-active fluorine compounds (such as 1 ,5-difluoro-2,4-dinitrobenzene). For example, a 
ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987). 
Carbon- 14-labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX- 
DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g. , avidin) that is in turn 
conjugated to a cytotoxic agent. 

4.14 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention can 
be recorded on computer readable media. As used herein, "computer readable media" refers to 
any medium which can be read and accessed directly by a computer. Such media include, but 
are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled 
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artisan can readily appreciate how any of the presently known computer readable mediums can 
be used to create a manufacture comprising computer readable medium having recorded thereon 
a nucleotide sequence of tlie present invention. As used herein, "recorded" refers to a process for 
4 storing information on computer readable medium. A skilled artisan can readily adopt any of the 
presently known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present invention. 
A variety of data storage structures are available to a skilled artisan for creating a 
8 computer readable mediiun having recorded thereon a nucleotide sequence of the present 

invention. The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs and formats 
can be used to store the nucleotide sequence information of the present invention on computer 

12 readable medium. The sequence information can be represented in a word processing text file, 
formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, 
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring 

16 formats (e.g. tejct file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present invention. 

By providing any of the nucleotide sequences SEQ ID NO : 1 - 1 3 901 or a representative 
fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 

20 sequences of SEQ ID NO: 1-13901 in computer readable form, a skilled artisan can routinely 
access the sequence information for a variety of purposes. Computer software is publicly 
available which allows a skilled artisan to access sequence information provided in a computer 
readable medium. The examples which follow demonstrate how software which implements the 

24 BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. 
Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading 
frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments 
and may be useftil in producing commercially important proteins such as enzymes used in 

28 fermentation reactions and in ±e production of commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the present 

32 invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of the currently available 
computer-based systems are suitable for use in the pBsent invention. As stated above, the 
computer-based systems of the present invention comprise a data storage means having stored 
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therein a nucleotide sequence of the present invention and the necessary hardware means and 
software means for supporting and implementing a search means. As used herein, "data storage 
means" refers to memory which can store nucleotide sequence information of the present 
4 invention, or a memory access means which can access manufactures having recorded thereon 
the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are implemented 
on the computer-based system to compare a target sequence or target structural motif with the 
8 sequence information stored within the data storage means. Search means are used to identify 
fragments or regions of a known sequence which match a particular target sequence or target 
motif A variety of known algorithms are disclosed pubUcly and a variety of commercially 
available software for conducting search means are and can be used in the computer-based 

12 systems of the present invention. Examples of such software includes, but is not limited to, 
Smith-Waterman, MacPattem (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA). A 
skilled artisan can readily recognize that any one of the available algorithms or implementing 
software packages for conducting homology searches can be adapted for use in the present 

16 computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino 
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will be 
present as a random occurrence in the database. The most preferred sequence length of a target 

20 sequence is ftom about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide 
residues. However, it is well recognized that searches for commercially important fragments, 
such as sequence fragments involved in gene expression and protein processing, may be of 
shorter length. 

24 As used herein, "a target structural motif," or "target motif," refers to any rationally 

selected sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif There are 
a variety of target motifs known in the art. Protein target motifi mclude, but are not limited to, 

28 enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, promoter sequences, hairpin structures and inducible expression elements (protein binding 
sequences). 



32 4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to 
control gene expression through triple helix formation or antisense DNA or RNA, both of which 
methods are based on tibie binding of a polynucleotide sequence to DNA or RNA. 
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Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
Lee et al, Nucl. Acids Res. 6:3073 (1979); Cooney et al. Science 15241:456 (1988); and Dervan 

4 et al.. Science 251:1360 (1991)) or to the mRNA itself (antisense - Olnmo, J. Neurochem. 

56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA 
transcription from DNA, wliile antisense RNA hybridization blocks translation of an mRNA 

8 molecule into polypeptide. Both techniques have been demonstrated to be effective in model 
systems. Information contained in the sequences of the present invention is necessary for the 
design of an antisense or triple helix oligonucleotide. 

12 4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of 
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic 
acid probe or antibodies of the present invention, optionally conjugated or otherwise associated 

1 6 with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the polynucleotide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 

20 detected, a polynucleotide of the invention is detected in the sample. Such methods can also 

comprise contacting a sample under stringent hybridization conditions with nucleic acid primers 
that aimeal to a polynucleotide of the invention under such conditions, and amplifying annealed 
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is 

24 detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise contacting 
a sample with a compound that binds to and forms a complex with the polypeptide for a period 
sufficient to form the complex, and detecting the complex, so that if a complex is detected, a 

28 polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucleic acid probes or antibodies to components within the test sample. 

32 Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 

Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One 
skilled in the art will recognize that any one of the conmionly available hybridization. 
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amplification or immunological assay formats can readily be adapted to employ the nucleic acid 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, 
T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
4 Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, 
Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol, 3 (1985); Tijssen, P., Practice 
and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the 
8 present invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described method 
will vary based on the assay format, nature of the detection method and the tissues, cells or 
extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane 

12 extracts of cells are well known in the art and can be readily be adapted in order to obtain a 
sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. Specifically, the invention 

16 provides a compartment kit to receive, in close coofinement, one or more containers which 
comprises: (a) a first container comprising one of the probes or antibodies of the present 
invention; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of abound probe or antibody. 

20 In detail, a compartment kit includes any kit in which reagents are contained in separate 

containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allows one to efiiciently transfer reagents from one compartment to 
another compartment such that the samples and reagents are not cross-contaminated, and the 

24 agents or solutions of each container can be added in a quantitative fashion from one 

compartment to another. Such containers will include a container which will accept the test 
sample, a container which contains the antibodies used in the assay, containers which contain 
wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which 

28 contain the reagents used to detect the bound antibody or probe. Types of detection reagents 
include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the 
primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of 
reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed 

32 probes and antibodies of the present invention can be readily incorporated into one of the 
established kit formats which are well known in the art. 

4.17 MEDICAL IMAGESTG 
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The novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention {e.g., where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
4 See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778. Such methods involve chemical attachment of 
a labeling or imaging agent, administration of the labeled polypeptide to a subject in a 
pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target 
site. 

8 

4.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present invention 
further provides methods of obtaining and identifying agents wliich bind to a polypeptide 
12 encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NO: 
1-13901 , or bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, 
said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the present 
16 invention, or nucleic acid of the invention; and 

(b) detennining whether tiie agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 

polynucleotide of the invention can comprise contacting a compound with a polynucleotide of 
20 the invention for a time sufficient to form a polynucleotide/compound complex, and detecting 

the complex, so that if a polynucleotide/compound complex is detected, a compound that binds 

to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to a 
24 polypeptide of the invention can comprise contacting a compound with a polypeptide of the 

invention for a time sufficient to form a polypeptide/compound complex, and detecting the 

complex, so that if a polypeptide/compound complex is detected, a compound that binds to a 

polynucleotide of the invention is identified. 
28 Methods for identifying compounds that bind to a polypeptide of the invention can also 

comprise contacting a compound with a polypeptide of the invention in a ceU for a time 

sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a 

receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 
32 sequence expression, so that if a polj^eptide/compound complex is detected, a compound that 

binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 

activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 
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activity observed in the absence of the compound). Alternatively, compounds identified via such 
methods can include compounds which modulate the expression of a polynucleotide of the 
invention (that is, increase or decrease expression relative to expression levels observed in the 
4 absence of the compound). Compounds, such as compounds identified via the methods of the 
invention, can be tested using standard assays well known to those of skill in the art for their 
ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
8 carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 
and screened at random or rationally selected or designed using protein modeling techniques. 

For random screening, agents such as peptides, carbohydrates, phamiaceutical agents and 
the like are selected at random and are assayed for their ability to bind to the protein encoded by 

12 the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen 
based on the configuration of the particular protein. For example, one skilled in the art can 
readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 

16 like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et al.. Application of Synthetic Peptides: Antisense 
Peptides," In Syntiietic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et al.. Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

20 In addition to the foregoing, one class of agents of the present invention, as broadly 

described, can be used to control gene expression through binding to one of the ORFs or EMFs 
of the present invention. As described above, such agents can be randomly screened or 
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 

24 sequence specific or element specific agents, modulating the expression of either a single ORF or 
multiple ORFs which rely on the same EMF for expression control. One class of DNA binding 
agents are agents which contain base residues which hybridize or form a triple helix formation 
by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, 

28 ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have 
base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 

32 Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241 :456 (1988); and Dervan et 
al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription 
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from DNA, while antisense RNA hybridization blocks translation of an inRNA molecule into 
polypeptide. Both techniques have been demonstrated to be effective in model systems. 
Information contained in the sequences of the present invention is necessary for the design of an 

4 antisense or triple helix oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the present invention can 
be used as a diagnostic agent. Agents which bmd to a protein encoded by one of the ORFs of the 
present invention can be formulated using known techniques to generate a pharmaceutical 

8 composition. 

4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid 

12 hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The 
hybridization probes of the subject invention may be derived from any of the nucleotide 
sequences SEQ ID NOS: 1-13901 . Because the corresponding gene is only expressed in a 
limited number of tissues, a hybridization probe derived from of any of the nucleotide sequences 

1 6 SEQ ID NO : 1 -1 390 1 can be used as an indicator of the presence of RNA of cell type of such a 
tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PGR as described in US Patents Nos. 4,683,195 and 4,965,1 88 provides 

20 additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in 
PGR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The 
probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a 
degenerate pool of possible sequences for identification of closely related genomic sequences. 

24 Other means for producing specific hybridization probes for nucleic acids include the 

cloning of nucleic acid sequences into vectors for the production of roRNA probes. Such vectors 
are known in the art and are commercially available and may be used to sjmthesize RNA probes 
in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA 

28 polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may 
be used to construct hybridization probes for mapping their respective genomic sequences. The 
nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a 
chromosome using well known genetic and/or chromosomal mapping techniques. These 

32 techniques include in situ hybridization, linkage analysis against known chromosomal markers, 
hybridization screening with libraries or flow-sorted chromosomal preparations specific to 
known chromosomes, and the like. The technique of fluorescent in situ hybridization of 
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chromosome spreads has been described, among other places, in Verma et al (1988) Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 

4 chromosome mapping techniques may be correlated with additional genetic map data. Examples 
of genetic map data can be found in the 1994 Genome Issue of Science (265:1 98 If). Correlation 
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or 
predisposition to a specific disease) may help delimit the region of DNA associated with that 

8 genetic disease. The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected individuals. 

4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 
Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for 
12 example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced 
using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be pi-epai-ed by any of the methods known to those of 
skiU in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to 
16 precisely spot oligonucleotides synthesizedby standard synthesizers. Immobilization can be 
achieved using passive adsorption (Inouye& Hondo, (1990) J. Clin. Microbiol. 28(6) 1469-72); 
using UV light (Nagatae^ al, 1985; Dahlen et al, 1987; Morrissey & Collins, (1989) Mol. Cell 
Probes3(2) 189-207) or by covalent binding ofbasemodifiedDNA(Kellere/-c//., 1988; 1989); all 
20 references being specifically incorporated herein. 

Another strategy that may be employed is the use of the stiong biotin-stieptavidin 
interaction as a linker. For example, Broude et al. (1994) Proc. Natl. Acad, Sci, USA 91(8) 3072-6, 
describe the use of biotinylated probes, although these are duplex probes, that are immobilized on 
24 streptavidin-coated magnetic beads. Streptavi din-coated beads may be purchased from Dynal, 

Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin. 
Biotinylated probes may be purchased from various sources, such as, e.g. , Operon Technologies 
(Alameda, CA). 

28 Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nimc 

Laboratories have developed a method by which DNA can be covalently bound to the microwell 
surface termed Covalink NH. CovaLinkNH is a polystyrene surface grafted with secondary amino 
groups (>NH) that serve as bridge-heads for further covalent coupUng. CovaLink Modules may be 

32 purchased from Nunc Laboratories. DNA molecules may be bound to CovaLinlc exclusively at the 
5'-end by a phosphoramidate bond, allowing immobilization of more than 1 pmol of DNA 
(Rasmussengfa/., (1 991) Anal. Biochem. 198(1) 138-42). 
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The use of CovaLinlc NH strips for covalent binding of DNA molecules at the 5'-end has 
been described (Rasmussen et al., (1 991 ). In this technology, a phosphoramidatebond is employed 
(Chu et al., (1983) Nucleic Acids Res. 1 1(8) 6513-29). This is beneficial as immobilization using 
4 only a single covalent bond is preferred. The phosphoramidatebond joins the DNA to the 
CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently 
grafted onto the polystyrene surface through a 2 nm long spacer arm. To Unlc an oligonucleotide to 
CovaLink NH via an phosphoramidate bond, the oligonucleotide terminus must have a 5'-end 
8 phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and 
then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and 
denatuimg for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M l-rnethylimidazole, 
12 pH7.0(l-MeIm7),isthenaddedtoafinaIconcentrationof lOmM l-Melmy. A ss DNA solution is 
then dispensed into CovaLink NH strips (75 ul/well) standing on ice. 

Carbodiimide 0.2 M l-ethyl-3-(3-dimethylaniinopropyl)-carbodiimide (EDC), dissolved in 
10 raM l-Melm?, is made firesh and 25 ul added per v^rell. The strips are incubated for 5 hours at 
16 50°C. After incubation the strips are washed using, e.g. , Nunc-Immuno Wash; first the wells are 
washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 
3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is that 
20 describedin PCT Patent Application WO 90/03382 (Southern & Maskos), incorporatedherein by 
reference. This method of preparing an oligonucleotide bound to a support involves attaching a 
nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic 
hydroxyl groups carried by the support. The oUgonucleotide is then synthesized on the supported 
24 nucleoside and protecting groups removed from the synthetic oUgonucleotide chain under standard 
conditions that do not cleave the oligonucleotide from the support. Suitable reagents include 
nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparationof DNA probe 
28 arrays may be employed. For example, addressable laser-activated photodeprotection may be 

employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by 
Fodor et al. (1991) Science 25 1(4995) 767-73, incorporatedherein by reference. Probes may also 
be immobilized on nylon supports as described by Van Ness et al.(l99l) Nucleic Acids Res. 
32 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1 988) Anal. Biochem. 
169(1) 104-8; all references being specifically incorporatedherein. 
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To link an oligonucleotide to a nylon support, as describedby VanNesse^a/. (1991), 
requires activation of the nylon surface via alkylation and selective activation of the 5'-amine of 
oligonucleotides with cyanuric chloride. 

4 One particular way to prepare support bound oHgonucleotides is to utilize the 

light-generated synthesis described by Pease et al, (1994) PNAS USA 91(1 1) 5022-6, incorporated 
herein by reference) . These authors used current photolithographic techniques to generate arrays of 
immobilized oligonucleotide probes (DNA chips). These methods, in which Uglit is used to direct 

8 the sjmthesis of oligonucleotide probes in high-density, miniaturized arrays, utihze photolabile 
5'-protected7V-acyl-deoxyiiucleosidephosphoramidites, surface linker chemistry and versatile 
combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be 
generated in this manner. 

12 421 PREPARATION OF NUCLEIC ACro FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic 
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or Y AC inserts, and R]\A, 
including roRNA without any amplification steps. For example, Sambrook a/. ( 1 989) descri bes 
16 three protocols for the isolation of high molecular weight DNA from mammalian cells (p. 
9.14-9.23). 

DNA fragments may be prepared as clones in M 1 3, plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PGR or other amplificationmethods. Samples 
20 may be prepared or dispensed in multiwell plates. About 100-1000 ng of DNA samples may be 
prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of skiU 
in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et 
24 dl. ( 1 989), sheai-uig by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al. (1 990) Nucleic 
Acids Res. 1 8(24) 7455-6, incorporated herein by reference). In this method, DNA samples are 
passed through a small French pressure cell at a variety of low to intermediate pressures. A lever 
28 device allows controlled application of low to intermediate pressures to the cell. The results of 

these studies indicate that low-pressure shearuig is a useful alternative to sonic and enzymatic DNA 
fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the two 
32 base recognition endonuclease, CvzJI, described by Fitzgerald et d. (1992) Nucleic Acids Res. 

20(14) 3753-62. These authors described an approach for the rapid fragmentation and fractionation 
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of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and 

sequencing. 

The restriction endonuclease CviJl normally cleaves the recognition sequence PuGCPy 
4 between the G and C to leave blunt ends. Atypical reaction conditions, v^hich alter the specificity of 
this enzyme (CvifJI* *), yield a quasi-random distribution of DNA fragments form the small 
moleculepUC19 (2688 base pairs). Fitzgerald etal. (1992) quantitatively evaluated the 
randomness of this fragmentation strategy, using a Cv/JI** digest of pUC19 that w^as size 
8 fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus 
M 1 3 cloning vector. Sequence analysis of 76 clones showed that CvUl* * restricts pyGCPy and 
PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate 
consistent with random fragnientation. 

12 As reported in tiie literature, advantages of this approach compared to sonication and 

agarose gel j&actionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 
ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel 
electrophoresis and dution are needed 

1 6 Irrespective of the maimer in which the nucleic acid fragments are obtained or prepared, it is 

important to denature tlie DNA to give single stranded pieces available for hybridization. This is 
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled 
quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the 

20 chip. Phosphate groups must also be removed from genomic DNA by methods known in the art. 

4.22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. 
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an 

24 array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a DNA solution to a 
nylon membrane. By offset printing, a density of dots higher than tlie density of the wells is 
achieved. One to 25 dots may be accommodated in 1 mm^, depending on the type of label used. By 
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) 

28 may be formed. Samples m one subarray may be the same genomic segment of DNA (or tlie same 
gene) from different individuals, or may be different, overlapped genomic clones. Each of the 
subarrays may represent replica spotting of the same samples. In one example, a selected gene 
segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in 

32 one 96-well plate (a1 1 96 weUs containing the same sample). A plate for each of the 64 patients is 
prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane. 
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Subarraysmay contain 64 samples, one from each patient. Where the 96 subarraysare identical, the 
dot span may be 1 nrni^ and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois) 
4 which may be partitioned by physical spacers e. g. a plastic grid molded over the membrane, the grid 
being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic 
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage 
screens or x-ray films. 

8 The present invention is illustrated in tlie following examples. Upon consideration of the 

present disclosure, one of skill in the art will appreciate that many other embodiments and variations 
may be made in the scope of the present invention. Accordingly, it is intended that the broader 
aspects of the present invention not be limited to the disclosure of the following examples. The 

12 present invention is not to be limited in scope by the exemplified embodiments wlaich are intended 
as illustations of single aspects of the invention, and compositions and methods which are 
flmctionally equivalent are within the scope of the invention. Indeed, numerous modifications and 
variations in the practice of the invention ai^e expected to occur to those skilled in the art upon 

16 consideration of the present preferred embodiments. Consequently, the only limitations which 
should be placed upon tlie scope of the invention are those which appear in the appended claims. 

All references cited within the body of the instant specificationare hereby incorporated by 
reference in their entirety. 

20 5.0 EXAMPLES 

5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained fi:t)m cDNA libraries prepared from various 
24 ' human tissues and in some cases isolated firom a genomic library derived from human chromosome 
using standard PGR, SBH sequence signature analysis and Sanger sequencing techniques. The 
inserts of the library were amplified with PGR using primers specific for the vector sequences 
which flank tihe inserts. Clones firom cDNA libraries were spotted on nylon membrane filters and 
28 screenedwithoIigonucleotideprobes(e.g., 7-mers) to obtain signature sequences. The clones were 
clustered into groups of similar or identical sequences. Representative clones were selected for 
sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a typical 
32 Sanger sequencing protocol. PGR products were purified and subjected to fluorescent dye 

termuiator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems 
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(ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Random 
Amplification of cDNA Ends) was performed to further extend the sequence in the 5 ' direction. 

4 5.2 EXAMPLE 2 

Novel Contigs 

The novel contigs of the invention were assembled from sequences that were obtained from 
a cDNA library by methods described in Example 1 above, and in some cases sequences obtained 
8 from one or more public databases. The sequences for the resulting nucleic acid contigs are 

designated as SEQ ID NO: 1-13901 and are provided in the attached Sequence Listing. The contigs 
were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend 
the seed EST into an extended assemblage, by pulling additional sequences from different databases 

12 (i.e. , Hyseq's database containing EST sequences, dbEST version 1 14, gb pri 1 1 4, and UniGene 
version 101) that belong to this assemblage. The algorithm terminated when there was no 
additional sequences from the above databases that would extend the assemblage. Inclusion of 
component sequences into the assemblage was based on a BLASTN hit to the extending 

16 assemblage with BLAST score greater than 300 and percent identity greater than 95%. 

Table 3 sets forth the novel predicted polypeptides (including proteins) encoded by the 
novel polynucleotides (SEQ ID NO : 1 - 1 3 90 1 ) of the present invention, and their coixesponding 
nucleotide locations to each of SEQ ID NO: : 1-13901. Table3 also indicates the method by which 

20 the polypeptide was predicted. Method A refers to a polypeptide obtained by using a software 

program called FAST Y (available from http://fasta.bioch.virginia.edu ') which selects a polypeptide 
based on a comparison of the translated novel polynucleotide to known polynucleotides (W.R. 
Pearson, Methods in Enzymology, 183:63-98 (1990), herein incorporated by reference). MethodB 

24 refers to a polypeptide obtained by using a software program called GenScan for human/vertebrate 
sequences (available from Stanford University, Office of Technology Licensing) that predicts the 
polypeptide based on a probabilistic model of gene structure/compositional properties (C. Burge 
and S. Karlin, J. Mol. Biol., 268:78-94 (1997), incorporated herein by reference). Method C refers 

28 to a polypeptide obtained by using a Hyseq proprietary software program that translates the novel 
polynucleotide and its complementary strand into six possible amino acid sequences (forward and 
reverse frames) and chooses the polypeptide with the longest open reading frame. 

The nearest neighbor results for SEQ ID NO: 1-13901 were obtained by a BLASTP 

32 version 2.0al 19MP-WashU search against Genpept release 120 and Geneseq database release 
200101 (Derwent), using BLAST algorithm. The nearest neighbor result showed the closest 
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homologue for SEQ ID NOS: 1-13901. The nearest neighbor results for SEQ ID NO: 1-13901 

are shown in Table 2 below. 

Tables 1, 2 and 3 follow. Table 1 shows the various tissue sources of SEQ ID NOS: 1- 
1 3901 . Table 2 shows the nearest neighbor result for the assembled contig. The nearest neighbor 
resuh shows the closest homolog with an identifiable fiinction for each assemblage. Table 3 
contains the start and stop nucleotides for the translated amino acid sequence for which each 
assemblage encodes. Table 3 also provides a correlation between the amino acid sequences set forth 
in the Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID 
NO. in USSN 09/515,126 
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TABLE 1 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 


adult brain 


GIBCO 


AB3001 


83 544 597-598 600-607 616 841 1004 1148 1346 1493 1974 
2138 2141 2143 2161 2266 2345 2363 251 1 2569 2876 2880 
300 1 3099-3 1 0 1 3 1 05-3 1 06 3 1 1 0-3 1 1 1 3 1 1 5-3 1 1 7 3 1 99 3272 
3282 3284 3356 3425 3537 3634 3689 3709 3797 3810 3839 
3899 4006 4021-4022 4025 4043 4194 4201 4253 4277 4297 
4388 4399 4410 4667 4671 4722 4747-4748 4750 4755 4767 
4845 4865 4940 5037 5075 5093 5118 5163 5171-5172 5268 
5481 5523 5553 5656 5724 5894 5902 5938 6052 6170-6173 
6 1 76 62 14 6307 6336 6369 6374 6793 6894-6897 6979 705 8 
7169 7455 7492-7493 7495-7499 7501 7504 7577 7586 7761 
7792 7864.7870 8035 8065 8085 81 10 8120 8140 8224 8226 
8298 8372 8427 8452 8456 8535 8648 8672 8674-8679 8681- 
8684 8816 8838-8839 8870 8898 9012 9041 9079 9128 9257 
9264 9304 93 17 9460 9503 95 17 9567 96"^ 3 9734 9781 9792- 
9798 9929 9964 9999 10296 10330 10469-10470 10578 10679 
10778 10786 10895 10984-10986 11032 11052 11069 11130 
1 1 145 1 1239 1 1289 1 1402 1 1 8 1 8 1 1 862 1 1 870- 1 1 876 1 1 878- 
11881 12017 12037 12127 12160 12294 12363 12375 12405 
12424 12438 12467 12539 12570 12590 12615-12616 12618 
12685 12688 12712 12739 12748 12830 12913 12916 12948- 
12950 13002 13064 13073 13083 13141 13150 13153 13164- 
13166 13257 13391 13456 13479 13489 13492 13494 13499 
13501 13503 13560 13595-13596 13627 13645 13679 13782 
13795 13861 13866 13869 13882 


adult brain 


GIBCO 


ABD003 


67 83 142 443 587 598 608-609 611 613-624 633 731 734 737- 
742 760 799-800 809 1148 1152 1167-1184 1193 1346 1433 
1516 1552 1575 1671 1756 1774 1833 1974 21382145 2176- 
2178 2237 2266 2299-2301 2303-2306 2343 2363 2412 2444 
2449 251 1 2516 2555 2569 2576 2614 2716 2809 2876 291 1 
2926 3001 3093 3114 31193121-3124 3126 3128-31303234 
3254-3256 3258-3263 3265-3267 3270-3274 3276-3277 3280- 
3281 3284 3286 3348 3356 3378 3435 3459 3484 3537 3548 
3595 3605 3625 3627 3634 3686-3697 3700 3702 3709 371 1 
3720 3722 3737 3757 3797 3804 3810 3839 3856 4006 4019 
4025 4040 4055 4057-4058 4060 4078 4194 4201 4246 4253 
4277 4282 4390 4405 4412 4431 4620 4622 4641 4689 475 1- 
4764 4791 4808 4837 4845 4847-48494852-4858 4860-4862 
4864-4869 4940 4957 4962 4972 4998 5021 5031 5037-5038 
5040 5076 5093 5 1 08 5 1 1 8 5167 5 169 5 1 7 1-5 172 525 1-526 1 
5263-5265 5270 5364 5401 5481 5492 5521 5523 5535 5656 
5674 5693 5766 5788 5817 5906-5909 5938 6005 6027 6057 
6064 6147 6178 6180-6182 6189 6214 6229-6233 6254 6272 
6369 6371 6421-6426 6555 6595 6598 6601 6799 6803 6825 
6836 6886 6894 6913 6972 6995 7058 7104 7130 7133 7148 
7164 7169 7339 7347 7386 7426 7455 7494 7502 7507 7509 
751 1-7512 7516 7520 7584-7587 7590-7596 7598-7601 7603- 
7604 7608 7632 7677 7743 7748 7761 7768 7792 7797 7807 
78 15 7839 7849-7861 7864 7870 7930 7937 8035 8065 8067 
8080 8087 8095 8110 8120 8139-8140 8209 8224 8226 8235 
8246 8262 8285 8298 8320 8323 8336 8354 8361 8365 8370 
8375 8387 8452 8456 8535 8556 8576-8577 8603 8630 8648 
8674 8685-8686 8688-8690 8693 8695 8702 8712 8742 8760- 
8761 8763-8764 8766-8769 8813 8815-8816 8830 8834 8838- 
8839 8848 8863 8870 8898 8921 8943-8944 8951 8989 9010 
9041 9050-9056 9058-9064 9076 9079 9092 9097 9128 9144- 
9145 9257 9264 927 1 9278-9279 9304 93 15 93 1 7 9455 9466 
9472 9475 9480 9503 95 1 1 9517 9525 9539 9689 9734 9773 
9781 9791 9799-9802 9847 9852 9873 9928-9929 9964 9999- 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








10001 10003-10004 10053 10175 10267 10276 10330 10349 
10434 10449 10466 10471-10474 10492 10496 10509 10566 
10578 10597-10599 10645 10679 10788 10891 10948 10988- 
10990 11032 1103911041-11043 11052 11065 11069 11105 
11108 11130 11145 11167 11196 1 1203-1 1207 11209 1 1239 
J1399 11401-11402 11406 11459 1 1470 1 1604 1 1606 11642 
11761 11818 11862 11877 11882-11884 11886 11889-11893 
11944-11946 11981 11988 12016 12019 12022 12037 12083 
12127 12143 12164-12165 12168-12171 12178 12195 12236 
12265 12305 12327 12363 12375 12405 12423-12424 12430 
12438 12546 12570 12590 12594 12612 12615-12618 12630 
12670 12674 12685-12688 12693 12704 12706-12707 12748 
12772 12830 12885 12904 12913 12916 12923 12933 12951 
12956 12993 13001 13020-13021 13038 13047 13064 13072- 
13073 13084-13085 13092 13117 13142 13167-13171 13191 
13254 13257 13260 13295 13390-13391 13394 13456 13479 
13483 13489 13497 13501 13503 13505-1350713512 13516 
13546 13551 13555 13575 13590 13592 13597 13613-13614 
13645 13649 13659 13711 13782 13795 13838 13861 13869 
13875 13882 13884-13885 13888 13892 13896 


adult brain 


Clontech 


ABROOl 


142 858 1542 21742407 2483 2652 3272 3287 3460 3492 3535 
3595 3737 3839 4005 4060 4282 4434 4791 4972 5040 5293 
5523 5530 5535 5788 5906 6082 6601 6799 6980 7373 7577 
7587 7759 7788 7851 8081-8082 8110 9167 9455 9466 9781 
9928 10422 10774 10791 11069 11401 1 1406 11459 11604 
11607 11791 11818 11865 11961 11979 1202212122 12160 
12327 12442 12594 12615 12640 12670 12705 12935 12957 
12985 13047 13197 13257 13456 13511-13512 13546 13554 
13646 13793 13885 13889 13893 


adult brain 


Clontech 


ABR006 


6 67 1004 1908 3272 3286 3548 401 1 4282 4998 5923 5928 
6374 6730 6815 6867 6890 7067 8365 9264 9729 9780 10776 
11587 11618 12596 12601 12605 12704 12749 12754 12951 
13047 13051 13090 13479 13488 13498-13499 13503 13512 
13575 13882 


adult brain 


Clontech 


ABR008 


6 11 21 41 51 88 142 364 376 579 598 651 736 800 1050 1 148 
1184 1251-1265 1291 1346 1404 1479 1529 1543 1671-1674 
1697 1699-1710 1820 1830 1832-1838 1840 1848-1849 1908 
1914 1919 1927 1957 1964 1974 1976 1978-19792005-2006 
2050 2081 2090 2110-2111 2129 2150 2174 22002310 2327 
2342 2408-24 10 2420 2444 2449 246 1 -2467 2484 2490 2499- 
2506 251 1 2553 2574 2576 26 1 1 2652 2809 2827 2866 2894 
3032 3207 3535 3591 3610 3634 3715 3722 3737 3766-3770 
3819 4006-4007 4011 4025 4032 4060 4078 4095 4109 4128 
4 1 43 -4 1 55 4 1 82 4 1 94 4247-4257 4277 4282 4294 4296 43 1 0 
4330 4348 4355 4360 4381 4395 4399 441 1 443 1 4543 4641 
4662 4694 4698 4767 4781 4791 4808 4833 4837 4985 5001 
5022 5040 5075 5094 5108 5163 5303-5306 5308-5314 5320 
5380 5523 5553 5615 5625-5626 5634 5638-5644 5701 5706 
5711 5727-5742 5766 5772 5775 5783 5801 5814 5817 5820- 
5821 5829 5837 5851 5855 5858 5864 5867 5874 5885 5890 
5897 590 1 5906 5923 6057 6125 6214 6223 6288 6302 6456- 
6464 6545 6598 6601-6602 6624 6676-6685 6699 6726 6728 
6746-6749 6765 6799 6805 6854 6860 6893-6894 7004-7007 
7049 7076 7078 7081-7083 7105 7117 71 19 7133 7153 7166 
7431 7579 7708 7768 7849 7900-7905 8018 8083-8084 8095 
8110 8196-8208 8262 8288 8312 8320 8331 8336 8356 8375 
8452 8482 8633 8681 8710 8739 8777 8815 8817 8830 8839 
8963 8965 8983 90 1 0 9097 9 1 00 9 1 02-9 1089111912891 42 
9257 9264 9313 9364 9378-9384 9401 9454-9455 9458 9460- 
9464 9503 9509 951 1 9515-9516 9522 9528-9529 9533 9539 
9542 9544 9573 9577 9646 9773 9780 9924 10000 10025- 
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Tissue 
Origin 


RNA 
Source 


Library 


SEQ ID NOS: 








10029 10097 10148 10209 10218-10223 10225-10229 10267 
10275-10276 10284 10292 10296 10303 10313 10326 10329- 
10331 10333-10334 10343 10346 10393 10430 10494 10496 
10513 10542 10557 10613-10616 10679 10688 10691 10719- 
10721 10729 10743 10758 10760 10776 10782 10786 10795 
10895 11100 11114 11132 11195 11240 11242-11254 11401 
1 1406 1 1437 1 1454 1 1459 1 1462 1 1466-1 1473 1 1475-1 1476 
11519 11532-11535 11550 11553 11555-1155611559 11588- 
11589 11606 11615 11618 11621 11627-1162811633 11761 
11791 11807 11818 11932 11955 12006 12039 12041 12092 
12212-12224 12231 12236 12305 12363 12368 12396-12399 
12405 12424 12439 12442 12465-12466 12468 12477 12479- 
12482 12490 12521 12546 12552-12553 12576-12577 12579 
12582 12585 12590 12601 12609 12617-12618 12636 12658 
12707 12725 12735-12736 12749 12754 12776-12779 12859 
12868 12894 12905 12909 12940 12955 12959 12977 12980 
12990 13002 13004-13005 13020 13035 13038 13042-13044 
13047 13051-13052 13056 13062 13073 13082-13083 13196 
13249 13280 13311-13312 13336 13387 13417 13421-13426 
13436 13445 13456 13458 13479-13482 13488 13490 13494- 
13495 13497-13500 13503 13507 13512 13516 13533 13546 
13554-13555 13590 13613 13630 13649 13659 13670 13678 
13713 13724 13769 13793-13794 13808 13827-13828 13838 
13861 13867-13868 13875 13882 13884-13885 13888-13889 
13893 13896 13898 


adult brain 


Clontech 


ABROll 


1006 1257 3797 4006 4025 5535 6057 7169 7870 8262 8937 
8966 9257 10778 12736 13394 13679 13793 13861 


adult brain 


BioChain 


ABR012 


88 598 1007 1 134 2597 3557 3590 3627 3797 4006 4192 4246 
4282 4391 4940 5523 5535 6288 6338 7138 8110 8898 9076 
9401 9455 9476 10772 11061 11114 12989 13394 13511 13866 


adult brain 


Invitrogen 


ABR013 


598 2614 3191 4355 4391 5523 5788 8085 8486 11513 12521 
12989 13861 


adult brain 


Invitrogen 


ABT004 


40 51 598 1050-1057 1148 1777-1778 1947 1976 2270-2272 
2327 2490 2617 3050 3600-3602 3722 3987 4390-4391 4434 
4543 4689 503 1 5 1 5 7-5 1 5 9 5 1 67 5 1 69 546 6 5 505 5 682-5 683 
5701 5766 5778 5794 5902 6147 6367-6371 6459 6545 6709 
6728 6783 6801 6971 7104 7175 7815 7839 7864 8139 8342 
8345 8355 8363 8372 8452 8633 8963 8975-89769012 9133 
9423-9424 9511 9515 9517 9528 9556 9827 9949 10260 10267 
10275 10570-10571 10733 10767 1 1132 1 1 159 11406 11459 
11932 12009 12092 12109-12111 12127 12283 12428 12511 
12579 12605 12725 12747 12830 12885-12886 12910 12913 
12954 12987-12989 13051 13054 13062 13073 13090 13249- 
13253 13438 13445 13456 13489 13500 13512 13516 13533 
13546 13590 13622 13649 13683-13684 13713 13803 13838 
13861 13866 13896 


cultured 
pre adipocytes 


Strategene 


ADPOOl 


1 134 1346 2343 2614 3272 3426 3610 3720 3839 3885 401 1 
4277 4282 4297 4346 4388 4391 4405 4434 4641 4833 4940 
4985 5018 5030 5040 5163 5167 5523 5581 5778 5788 5794 
5895 5951 6082 6147 6272 6607 7067 7141 8093 8235 8285 
8312 8363 8629 8648 8830 8839 9290 9401 9466 9503 9781 
10346 10470 10776 10795 10971 11108 11170 11513 11818 
12034 12037 12046 12093 12375 12387 12405 12424 12570 
12636 12670 12674 12688 12735 12749 12913 12940 13126 
13163 13295 13489 13494 13497 13499 13511 13516 13575 
13652-13653 13866 13888-13889 


adrenal gland 


Clontech 


ADR002 


8 83 142 225 351 443 551 569 731 864 1134 1266-1271 1273- 
1274 1276-1292 1294-1295 1381 1391 1544-1545 1658 1671 
1908 1959 1983 2010 2023 2145 2175 2283 23102328-2334 
2343 2444 2449 2510 2522 2576 3032 3069 3153 3166 3272 
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Origin 


RNA 


Name 


SEQIDNOS: 








3378 34163548 3625 3709 3711 3771-3788 3790-3791 3797 
3839 3870 3899 3985 4019 4054 4060 4109 4173 4192 4194 
4201 4253 4277 4282 4389 439 1 4395-4396 443 1 4667 4687 
4694 4783 4957 4966 4985 4998 5037 5108 5293 5316-5323 
5325-5336 5481 5505 5527 5531-5533 5560 5628 5656 5701 
5766 5865 5874 5902 5939 5979 6020 6052 6057 6227 6288 
6354 6405 6449 6465-6482 6521 6603 6726 6894 6979 7008- 
701 1 7155-71567204 7604 7634 7845 7871 7906-7912 7915- 
7918 7920-7930 8022 8067 8085-8086 8095 8110 8116 8224 
8262 8363 8365 8412 8520 8535 8554 8699 8742 8831 8870 
8950-8951 9002 9010 9012 9097 9109-91 19 9121-9127 9190 
9264 9280 9304 9317 9455 9457 9495 9503 9553 9556 9586 
9709 9780-9781 9895 9927 10007 10030-10032 10034-10035 
10037-10045 10119 10277 10284 10346 10595 10617-10620 
10623-10627 10645 10675 10679 10760-10761 10766 10774 
1 0782 1 1 108 1 1 194 1 1255 1 1258-1 1270 1 1289 1 1400 1 1406 
11457 1151911588 1160011621 11626118181195212080 
12159 12225 12227-12230 12232-12237 12239-12240 12242- 
12249 12251 12305 12363 12375 12404-12405 12424 12439 
12577 12599 12601 12630 12636 12657-12658 12663 12688 
12693 12706 12713 12715 12735 12749 12754 12780-12787 
12848 12863 12909 12913 12923 12957 12990 13006-13007 
13020-13021 13064 13095 13104 13295 13313 13315 13388- 
13389 13391 13456 13473 13494 13507 13515 13546 13613 
13631 13679 13725-13733 13858 13866 13872 13883-13884 
13888-13889 


adult heart 


GIBCO 


AHROOl 


51 83 88 94 221 239 360 366-367 404 410-41 ] 4)3 415 458- 
459 461 465-468 471 473-478 486 545-546 559 567 616 625- 
630 743-744 799 802-806 808 810-835 837-842 959 1004 1066- 
1085 1134 1178 1184-1193 1346 1512 1516 1546-1547 1556 
1575 1671 1727 1774 1829 1959 1976-19772090 2096 2108 
21 10 2128 2138 2145-2147 2161 2179 2195-2198 2257 2276 
2278-2281 2302 2307-2309 2363 2398-2399 2409 241 1-2412 
2444 2449 2497 25 1 6 2529 2563 2569 2575-2576 2597 2605 
2614 2617 2762 2809 2816 2879-2880 2911-29242926 2961 
2978-2980 2985-2986 2993 2995-3002 3032 3042 3051 3058 
3069 3081 3091-3094 3109 3114 3132-3133 3135 3137-3138 
3 141 3 191 3196 3199 3215 3263 3272 3282-3286 3317 3349- 
3350 3353-3359 3361-3362 3364-3367 3370-3393 3396-3399 
3403 3406 3425 3451 3465-3466 3479 3500 3503 3537-3538 
3544 3548 3550 3555 3557 3590-3591 3595 3604 3606-3612 
3614-3621 3623-3627 3634 3689 3697-3698 3701-3709 3711- 
3713 3720 3722 3737 3757 3797 3839 3885 3898 3988-3989 
3996 4005-4006 4008 4010-401 1 4019 4021-4022 4025 4040 
4043 4054-4055 4058-4060 4078 4183 4192 41944201 4246 
4253 4269 4277 4282 4341 4351 4391 4403 4405 4434 4517 
4543 4553 45904615 4622 4631 4633-46344641 4654 4664 
4666-4657 4672-4675 4688-4689 4693-4694 4738 4740 4755 
4783 4828 4870-4871 4905-49064909-4915 4917-49194921- 
4924 49264927 4940 4957 4972 4985 4991 4998 5001 5030 
5037-5038 5040 5076 5118 5163 5171-5175 5177-5178 5180- 
5182 5266-5268 5380 5422 5481 5490 5492 5506-5507 5523 
5534-5535 5581 5656 5682 571 1 5740 5766 5788 5895 5908 
5923-5924 5938 6047 6057 6083 6085 6104 61 10-61 11 61 17 
6147 6184-6185 6189 6195 6215 6255-6257 6259-6266 6268 
6272 6288 6307 6336 6375-6379 6382 6427-6429 6458 6555 
6588 6595 6607 6629 6665 6689 6765 6767 6799 6815 6871- 
6873 6879 6881-6882 6898 6926-6928 6975-6978 6987 7058 
7090 7148 7313-73147341 7343-7344 7346-7347 7350 7354 
7359 7362 7373 7380 7394 7402 7407-7408 7410 7413 7415- 
7416 7418-7419 7426 7431 7468 7473 7480 7494 7505 7513- 
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Library 
Name 
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7516 7520 7544 7561 7584 7587 7599 7601-7604 7635 7638- 
7643 7645-7649 7651-7655 7657 7659-7662 7733 7735 7743 
7748 7783-7796 7815 7852 7857 7862-7863 7865-7870 7930 
7933 7983 8062-8065 8067 8087-8088 8093 8095 81 10 8116 
8120 8139-8140 8224 8226 8235 8262 8298 8336 8344-8345 
8354 8356 8363 8368 8372 8378 8387 8410 8427 8452 8456 
8531-8532 8534-8535 8563-8569 8572 8576-8577 8592-8593 
8597 8603 8606 8610 8613-8614 8616-8617 8646 8648 8670 
8681 8691 8698-8699 8702 8712 8742 8756 8760 8763 8795 
8807-8809 881 1-8814 8816-8819 8821-8833 8835-8839 8858 
8863 8870 8898 8921 8927 8936 8939 8943 8946 8950-8951 
8966 8988-8992 8994-8995 9010 9049 9059 9065-9067 9070- 
9072 9076 9097-9098 9167 9190 9257-9260 9262 9269 9281 
9287 9301 9304 9315 9317 9401 9454-9455 9466 9476 9480 
9484 9556 9577 9612 9689 9698 9720-9721 9734 9741-9743 
9747-9750 9758 9781 9791 9804 9871-9882 9884-9885 9928 
9939 9942 9954-9960 9999-10000 10005 10175 10179 10275 
10284 10292 10296 10329-10331 10346 10400 10422 10430- 
10431 10437 10442 10444-10447 10475 10511-10512 10514- 
10521 10557 10576-10577 10616 10645 10679 10691 10729 
10742 10744 10772 10774 10777-10778 10782 10788 10839 
10891 10894-10895 10902 10917-10920 10937 10942 10946 
10948 10969-1097010992 11032 11044-1104511061 11066- 
11074 11108 11114 11132 11145 11153 11165-11170 11173 
11205 11208 11210-11215 11283 11289 11386 11388 11401- 
11402 11406 11462 11559 11565 11576 11596 11606 11615 
11620 11744 11788-11789 11793 11818 11823-11825 11831- 
11832 11869 11S94 11947 11961 11982-11989 12000 12006 
12009 12011 12019-12020 12028 12037 12044 12078 12081 
12093 12119-12122 12143 12160 12166 12172-12175 12177- 
12179 12197 12305 12335 12363 12375-12376 12383 12387 
12400 12402 12405 12424 12428 12438 12479 12521 12523 
12546 12560 12564 12570 12590 12599 12601 12605 12609 
12611 12616-12618 12653 12662-12663 12670 12674 12688 
12717-12720 12734 12752 12754 12772 12905 12907 12914 
12916-12917 12923 12925 12940 12961 12963-12965 12989 
12991 12993 13020-13021 13033 13035 13072-13073 13082- 
13083 13104 13117 13126 13132 13136 13141-13142 13148 
13169 13203-13210 13212 13260-13261 13293 13295 13326 
13377 13394 13413 13442 13456 13477 13480 13488 13490 
13494-13496 13501-13503 13506-13507 13511 13516 13533 
13568 13570-13571 13575 13582 13592 13613 13624-13630 
13632 13644 13646 13659-13660 13678-13679 13689 13701 
13711-13713 13775 13782 13795 13797 13866-13869 13872 
13882 13884-13885 13893 


adult kidney 


GIBCO 


AKDOOl 


49 67 83-84 142 354 405-407 41 5-429 43 1-432 445 460 462 

479-484 486 488 492-493 524 548-549 598 616 63 1-638 744 
787 809 841 1004 1068 1086 1160 1163-1166 1171 1184 1193 
1346 1359 1449 1479 1516 1552 1556 1671 1724 1727 1774 
1826 1858 1914 1974 1976 1978-1979 2081 2097-2099 2111- 
21 13 2117-21 18 2129-21302138 2145 2148-21502161 2186 
2218 2266 2268 2302 2310 2327 2343 2363 2409 2412 2444 
2449 2468 2483 2523 2569 2576 2614 2617 2827 2845 2876 
2910 2915 2926-2931 2933-2934 2938-2943 2945-2947 2955- 
2956 2976 2981 2983-2984 3001-3016 3018 3053 3109 31 14 
3 140 3 1 42-3 1 46 3 1 49-3 1 50 3 1 99 3254 3265 3272 3283-3284 
3286 3333 3350 3356 3378 3406 3435 3445 3460 3492 3503 
3535 3537 3544 3548 3590-3591 3598 3625 3627 3634 3676- 
3677 3679-3685 3689 3697 3709 371 1 3720 3722 3737 3757 
3797 3808 3810 3839 3885 3989 4005-40064011 4019 4022 
4025 4040 4043 4054-4055 4060 4078 4109 4192 4194 4201 
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Tissue 
Origin 


RNA 
Source 


Librarj' 
Name 


SEQ ID NOS: 








4246 4253 4269 4277 4341 4354 4387-4388 4390 4398 4402 
4405 4410 4434 4473 4540 4543 4583 4591-4592 4607 4637 
4640-4641 4649 4654 4665 4668 4676-4677 4680-4685 4688- 
4689 471 1-4712 475 1 4758 4766-4768 4783 4808 4833 4836- 
4837 4845 4874 4904 4940 4957 4962-4963 4972 4983 4991 
4998 5022 5037 5040 5076 5093 5118 5143 5163 5171-5172 
5246-5249 5281 5362 5364 5380 5422 5481 5521 5523 5526- 
5527 5535 5656 5693 5726 5740 5766 5769 5778 5788 5794 
5894 5902 5908 5911 5916-5917 5923 5928 5938 5990 6005 
6049-6051 6057 6078 6082 6086 6088 6105-6107 6112-6117 
6120-6122 6147 6159 618^-6189 6215 6257 6272 6288 6369 
6418 6423 6430 6555 6595 6598 6738 6743 6765 6767 6793 
6799 6805 6815 6836 6857 6867-6870 6872-6873 6878 6884 
6886 6893 6899-6900 6931 6947 6976 6979 6988 7024 7045 
7058 7104 71 18 7155 7275 7281 7306 7312 7338 7347-7348 
735 1-7352 7355 7357 7359-7363 7366 7369 7371-7373 7381 
7383 7386 7400 7402 7404-7406 7408 7420 7422 7424 7426- 
7428 7430-743 1 7433-7435 7439 7455 7465-7468 7494 7502 
7506 7517-7521 7579 7587 7604 7634 7639 7642 7733 7735 
7743 7748 7792 7797 7839 7842-7846 7848 7852 7857 7862 
7864 7870 7930 7933 8065 8077 8084 8087 8093 8095 8105 
8110 81 16 8120 8139-8140 8156 8224 8226 8235 8262 8320 
8336 8345 8351 8354 8359 8364 8368 8370 8372 8375 8377 
8387 8427 8452 8456 8461 8486 8518 8520 8527 8535 8563- 
8564 8566 8572-8573 8575-8577 8580-8583 8585 8588-8589 
8597 8603 8618-8621 8623 8625-8626 8628 8630 8634 8647- 
8650 8664 8674 8677-8678 8691 8694 8701-8704 8706 871 1 
8722 8740 8742 8763 8792 8798 8806 8812-8813 8816 8830 
8838-8839 8848 8853 8863 8876 8898 8921 8935 8943-8944 
8948 8951 8966 8989 9012 9041 9047-9049 9076 9092 9128 
9 137 9167 9218 9244 9249 9257 9264 9287 9304 93 15 93 17 
9424 9455-9457 9466 9472 9475 9484 9503 951 1 9517 9529 
9536 9542 9544 9558 9612 9683 9699 9703 9722-9723 9725 
9734 9744 9752-9755 9758 9772-9773 9780-9781 9805-9807 
9841 9843 9927-9929 9939 9942 9969 9998-10000 10007 
10175 10275 10284 10287 10292 I'03]9 10346 10376 10423- 
10424 10431 10442 10446 10448 10450 10475 10480 10496 
10542 10557 10645 10679 10774 10778 10782 10788 10895 
10921-10928 10947 10949-10953 10986 10994-10996 11032 
11052 11061 11069 11114 11130 11132 11143 11145 11197- 
11202 11205 11208 11212 11239 1 1289 11401-11402 11406 
11459 11513 11586 11596 11604 11607 11618 11620 11695 
11711 11759 11761 11790-11791 11793-11794 11818 11820- 
11822 11834-118361185711865 1186911895-11899 11950 
11970 12006 12022 12041 12078 12143 12159-12160 12178 
12195 12197 12256 12265 12305 12327 12359 12363 12375 
12387 12405 12424 12428 12438 12442 12467 12511 12521 
12523 12535 12539 12546 12564 12570 12590 12599 12601 
12609-12611 12616 12618 12630 12653 12656 12664 12666- 
12667 12670 12674 12688-12689 12691 12739 12754 12830 
12834 12904 12913-12914 12916-12917 12923 12940-12941 
12951-12952 12956 12972 12976-12977 12993 12999 13002 
13052 13062 13064 13066 13072-13073 13082-13083 13095 
13104 13127 13131 13133 13136 13141 13143-13146 13148 
13154-13155 13169 13172 13184 13195-13196 13249 13254 
13260 13263 13280 13295 13307 13319 13394 13442 13456 
13477 13479-13480 13488-13490 13492 13494-13495 13497- 
13503 13506-13507 13512 13515-13516 13546 13551 13554- 
13556 13572-13574 13590 13592 13613 13627 13631 13644- 
13645 13656 13659-13660 13665 13670 13710 13713 13776 
13782 13795 13859-13860 13864 13866-13868 13872 13882- 
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Source 












13885 13888 13891-13892 


adult kidney 


Invitrogen 


AKT002 


67 142 550-552 598 639-641 1004 1015 1493 1516 19472215 
2299 2343 2363 2449 2618 3054-3055 3152-3153 3155-3157 
3434 3535 3590 3709 3797 3808 3839 3885 4011 4022 4025 
4040 4253 4277 4391 4405 4671 4759 4767 4769 4837 4949 
4972 5001 5035 5037 5052 5108 5526 5581 5615 5726 5788 
5895 6062 6139 6190-6191 6803 6900 6947 6975 7468-7469 
7473 7733 8095 81 10 8139 8262 8323 8361 8363 8375 8520 
8539 8648 871 1 8798 8912 8950 8966 8983 9076 9264 9368 
9510 9517 9665 9703 9734 10175 10476 10791 10989 10997 
11132 11618 11745 11900 12006 12039 12160 12363 12375 
12405 12424 12685 12702 12707 12904 13035 13060 13104 
13136 13295 13394 13456 13488 13495 13501 13512 13533 
13554 13583 13644 13670 13679 13696 13713 13795 13866 
13888 13891 


adult'lung 


GIBCO 


ALGOOI 


83-84 553 598 642-644 650 747 975 1004 1009 1015-1022 
1449 1516 1816 2161 21862215 2343 2444 2968 3056 3158 
3 160-3162 3345 3466 3503 3519 35663568-3573 3595 3709 
3797 3810 3839 3885 4025 4039 4059 41 94 4405 4622 4755 
4767 4770 4797 4865 4940 4949 4963 4972 4987 4991 4998 
5001 5117-51295171-5172 5233 5401 5481 5581 5724 5788 
5938 5960 6123 6125 6140 6215 6322 6336 6343-6348 6371 
6886 6966-6967 7024 7143 7275 7441 7444 7502 7522-7523 
7749-7752 8093 81 10 8140 8336 8345 8375 8378 8449 8535 
8612 8622 8635 8648 8651 8674 8702 8707-8708 8838-8839 
8898 8941-8942 8944 8948 8951 9076 9128 9457 9466 9475 
9497 9503 9544 9567 9703 9756 9758 9808-9S10 9843 9873 
9926 9999 10161 10175 10275 10329 10344 10400 10458 
10554-10555 10796 10998 11032 11046 11069 11132 11144- 
11145 11483 11627 11818 11837 11901 12006 12028 12039 
12041 12087-12092 12146 12363 12375 12424 12438 12539 
12570 12601 12617 12661 12670 12674 12688 12738 12749 
12754 12904 12940 12974" 13062 13173 13195 13232 13234 
13260 13295 13307 13456 13472 13477 13490 13494 13497- 
13498 13569 13584 13592 13660 13663-13666 13670 13743 
13776 13882 13885 13889 13891 


lymph node 


Clontech 


ALNOOl 


83 142 364 487 495-497 554 629 645-646 648-650 716 938-951 
953-962 1134 1516 1549 1671 1774 197621382225-2232 
2234-2235 2340 2879 3019-3020 3057 3103 3163-3165 3272 
3356 3498-3505 3507-3513 3515-3516 3538 3548 3628 3697 
4194 4201 4253 4405 4641 4687-4688 4771-4772 4783 4808 
4845 4963 4972 4987 4998 5042-5065 5076 5163 5504 5523 
5835 5895 5917 6027 6142 6192 6272 6288 6308-63 1 1 6313- 
63 14 6765 6805 6871 6949-695 1 7700-7705 7707-7709 8085 
8088 8091 8110 8235 8375 8387 8432 8629 8631 8633 8648 
8677 8709 8713-8715 8830 8863 8887-8889 8891-8896 8943- 
8944 8966 9010 9076 91 1 1 9128 9142 9222 9455 9472 9520 
9544 9734 9774 9780 981 1 9905-9907 9928 9939 9999 10027 
10129 10296 10439 10452 10501 10543-10545 10679 10777- 
10778 1078810891 10999-11000 11111-11113 11115 11130 
11145 11344 11406 11513 11584 11885 12006 12028 12050- 
12056 12143 12256 12363 12405 12442 12570 12674 12690 
12913 12917 12940 12968-12970 13173 13220-13221 13394 
13400 13492 13503 13511 13533 13630 13642 13645 13713 
13868 13885 13889 13891 


young liver 


GIBCO 


ALVOOl 


211 498 598 651-653 1008-1014 1193 1264 1575 1976-1977 
2131 2161 2254-225522692363 25682617 26272633 2636 
2961 3021 3059 3093 3166 321^ 3272 3356 3378 3426 3479 
3559-3565 3590 3597 3627 3634 3673 3709 3797 3810 3885 
3993 4006 401 1 4019 4025 4194 4246 4253 4277 4422 4426 
4431 4434 4437-4438 4634 4654 4687 4714 4722 4755 4773- 
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4774 4808 4957 5001 5037 5052 5113-5115 5163 5167 5 1 71 
5508 5510 5526 5581 5616 5693 5726 5757 5772 5788 5928 
6125 6141-6142 6193 6288 6340-6342 6589 6765 6815 6886 
6901 7024 71 18 7141 7442 7524 7735 7747-7748 7792 7852 
7870 7924 7930 8066 8110 8120 8262 8375 8378 8518 8520 
8648 8652 8691 8702 8716-8718 8816 8838-8839 8938-8940 
9097 91 19 9263-9264 9301 93 17 9424 9454-9455 9544 9689 
9703 97349758 9775 9925 9947 10393 10477 10769 10774 
11001 1113211140-1114211173 112081131811406 11587- 
11588 11725 11804 11902 12081-12086 12118 12160 12178 
12375 12377 12405 12424 12479 12511 12570 12599 12601 
12618 12670 12674 12688 12737 12830 12913 12916 12933 
12953 12977 12993 13047 13062 13073 13126 13169 13174 
13230-13231 13295 13488-13489 13494 13498-13499 13502 
13506-13507 13575 13592 13646 13660-13662 13670 13866 
13868-13869 13882 13888 13901 


adult liver 


Invitrogen 


ALV002 


6 25 60 142 598 1004 1213 1296-1301 1381 1493 1513-1514 
1977 2139 2161 2269 2310 2335 2342 2400 2413 2449 2458 
2497 2568-2569 2576 2636 2809 2827 2880 2926 3471 3484 
3503 3597 3722 3792-3796 3885 3994-3995 4025 4060 4201 
4326 4422 4426 4618 4689 4767 4786 4985 4998 5093 5163 
5337-5339 551 1 5526 5581 5693 5701 5724 5726 5757 5895 
5922-5923 5979 6020 6027 6083 6125 6192 6195 6253 6333 
6346 6483-6485 6716 6765 6797 6975 7169 7422 7468 7614 
7642 7807 7932 8089 81 10 8140 8262 8323 8378 8633 8677 
871 1 8740 8759 8786 8834 8839 8944 9002 9128-9129 9264- 
9265 9282 9466 9484-9485 9517 9677 9700 9773 10007 10046- 
10047 10135 10616 10669 10777 1 1032 1 1 114 11132 1 1142 
11194 11271 11389 11462 11502 11560 1 1587 11602 11818 
1 1980 11989 12086 12160 12195 12253-12254 12316 12327 
12363 12378-12379 12411 12424 1251 1 12570 12630 12693 
12706 12788-12789 12840 12863 12913-12914 13047 13062 
13072-13073 13090 13242 13256 13378-13379 13389 13479 
13487 13489 13497 13555 13734-13738 13859 13864 13889 


adult liver 


Clontech 


ALV003 


346 2529 3548 3797 3885 4438 4940 5101 5801 5902 9597 
10769 10778 11587 11927 12086 


adult ovary 


Invitrogen 


AOVOOl 


6 13 40 67 83 88 142 196 444 522 555-560 577 579 598 609 
654-657 659-665 667-668 708 740 745-749 751-754 756-761 
806 841 871 1004 1023 1028-1034 1036-1039 1041-1047 1055 
1087-1115 1119 1151-1162 1298 13461359 1456 1493 1507 
1516 1658 1697 1752 1774 1812 1826 1848 1914 1974 1976- 
1977 1979 1983 2099 211 1 2138 2153-21552161 2175 2180- 
2181 2186 2258-2267 2284-2286 2288-2290 2298-2299 2342- 
2343 2449 2483 2523 2529 2555 2569 2576 2591 2597 2618 
2708 2750 2783 2818 28392926 2961-29622988 3007 3022 
3031 3060-3061 3063 30693166-31723174-31753177-3189 
3226 3272 3286 3288-3289 3291-3293 3295-3299 3435 3450 
3460 3479 3486 3503 3535 3577-3589 3612 3627 3629 3631- 
3643 3673 3675 3720 3722 3737 3797 3806 3810 3812-3813 
3839 3885 3985 4006 4012-4014 4023 4025 4040 4043 4060 
4078 4133 4192 4194 4201 4246 4269 4277 4282 4341 4362 
4381 4383 4385 4388 4390-4391 4396 4399 4402 4405 4410 
4434 4465 4473 4515 4543 4582 4610 4654 4689 4694 4715- 
4717 4722 4759 4767 4775-4782 4836-4837 4873-4878 4880 
4940 4944 4957 4960 4962-4964 4972 4975 4981 4985 4998 
5001-5002 5018 5030 5037 5040 5045 5075-5076 5093 5108 
5135-5136 5138-5146 5148 5163 5191-52095213 5238-5245 
5262 5374 5380 5451 5455 5490 5503 5505 5521 5523 5527 
5560 5581 5674 5701 5724 5757 5778 5783 5788 5794 5817 
5874 5894-5895 5902 5906 5908-5909 5916-5917 5923 5932 
5938 5979 5989-5990 6005 6027 6048 6054 6057 6061 6078 
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6082 6086 6125 6140 6143-6144 6165 6171 6192 6194-6200 
6215 6235-6239 6241 6270 6288 6307 6333 6336 6354-6360 
6362-6364 6374 6385-6392 6410 6412 6415-6417 6458 6461 
6508 6555 6583 6595 6598 6604 6607 6624 6626 6629 6708 
6765 6767 6793 6797 6799 6801 6805 6825 6860 6870-6871 
6873 6886 6888 6900 6902-6904 6915-6917 6947 6969-6970 
6975 6980-6987 6998 7054 7058 7104 7109 7118 7133 7137 
7141 7155 7166 7169 7171 7191 7204 7281 7386 7408 7431 
7443 7468 7471-7472 7494 7507 7525-7528 7604-7609 7614 
7642 7687 7716 7729 7762-7763 7765-7767 7769-7771 7773- 
7775 7788 7802-7818 7829-7835 7838-7841 7864 7905 7937 
7977 8007 8035 8077 8088 8093 8095 8110 8120 8134 8139- 
8140 8143 8156 8235 8246 8262 8292 8320 8336 8345 8355 
8358 8363 8365 8368 8370 8372 8375 8387 8444 8452 8456 
8486 8520 8593 8610 8633-8634 8653 8719-8724 8773-8777 
8779 8781-8782 8792 8798 8813 8830 8839 8853 8863 8870 
8876 8898 8928 8944 8950 8956-8961 8963-8971 8989 9004- 
9007 9009-9019 9042-9046 9055 9076 9097 9128 9134 9185 
9264 9283-92849291 9313 9358 9424 9445 9454-9455 9457 
9460 9466 9471-9472 9475 9480 95 11 9517 9533 9539 9542 
9544 9553 9565 9577 9586 9612 9626 9677 9694 9703 9722 
9734 9757-9758 9773 9776-9778 9791 98 12-98 19 9827 9838 
9843 9848-9853 9922 9927-9940 9942 9962-9970 9989-9992 
9994-9997 10007 10027 10097 10149-10152 10249 10252 
10275 10298 10333-10334 10346 10349 10351 10383 10386 
10417 10438-10439 10452 10459 10470 10478-10480 10493- 
10498 10557-10562 10564-10565 10580-10586 10594-10596 
10645 10658 10676 10679 10767 10772 10774 10778 10782 
10788 10791 10954 10971-10972 11002-11004 11032 11047- 
11052 11061 11069 11083 11108 11130 11132 11151-11153 
1 1 155 1 1 157-1 1 158 1 1 167 1 1 170 1 1 172-1 1 176 1 1 193-1 1 196 
11274 11343113691140611411 11431 11588 1159611600 
11604 11606-11607 11618 11620-11621 11629 11668 11814 
11860 11865 11876 11903-11909 11932 11949-11954 11980 
11985 12000 12006 12017 12022 12033 12039 12080 12083 
12092-12093 12095-12104 12127-12129 12131-12142 12146 
12159-12160 12162-12163 12178 12213 12216 12236 12256 
12260 12305 12327 12363 12368 12371 12375 12379 12387 
12401-12402 12405 12424 12430 12467 12520 12522 12546 
12570 12576 12590 12594 12599 12605 12609 12611 12615 ' 
12617-12618 12630 12636 12643 12657-12658 12663 12670 
12674 12685 12688 12691 12693 12702 12705-12707 12713 
12724 12729 12735 12740-12745 12749 12754-12760 12765- 
12770 12777 12842 12848 12875 12904 12906 12910 12913 
12916-12917 12935 12940 12955 12957 12963 12972 12977- 
12980 12982-12984 12992-12994 12997-13000 13020 13034 
13047 13051 13054 13056 13060 13062 13066 13070 13072- 
13073 13075 13082 13090 13092-13093 13104 13126 13136 
13141 13175-13177 13179-13180 13193-13194 13196-13197 
13202 13236-13241 13243 13245 13249 13254 13263-13275 
13277 13286-13292 13295 13319 13351 13377 13389-13391 
13394 13420 13436 13456 13477 13479 13488 13492 13494- 
13495 13497-13499 13502-13503 13506 13512 13516 13533 
13546 13549 13554-13555 13575 13590 13597 13600-13601 
13613 13616-13620 13627 13631 13644-13645 13649 13659- 
13660 13670 13674-13679 13693-13699 13707-13709 13713 
13782 13803 13864 13866 13868-13869 13872 13875 13882- 
13885 13888-13889 13892 


adult placenta 


Clontech 


APLOOl 


669-671 1006 1134 1184 15512053 2090 2156-2158 2342 
2490 2716 3001 3064-3065 3190 3272 3625 3670 4019 4055 
4194 4201 4246 4641 4718 4720-4721 4783-47864957 5523 
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Library 
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5536 5581 5788 5895 6145-6146 6201-6202 6358 6803 7049 
7204 7529 8367 8375 8609 8611 8630 8725-8727 9097 9480 
9734 9820 10319 10460 10579 10795 10895 11519 11723 
11737 11776 11827-11828 11959 12403 12546 12570 12663 
12848 12875 12913 13051 13394 13782 13869 13889 


placenta 


Invitrogen 


APL002 


142 561 1023 1551 1976 2449 2569 2614 3191 3340 3592 3668 
4060 4346 4767 4787-4789 4892 4985 5536 5674 5693 5757 
5772 5794 5906 6147 6203 6215 6598 6788 6799 6979 7158 
7530-7531 8139 8361 8611 8863 94579484 95179563 9703 
9758 9821 9927 9939 9942 10481 10557 10778 10802 10973 
11006-11011 11723 11731 11737 1177611807 11866 11910- 
11916 11959 12379 12403 12590 12685 13020 13054 13392 
13489 13533 13554-13555 13867 13882 13888 13891 


adult spleen 


GIDCO 


ASPOOl 


463 499 562-563 598 672 990-993 995 1004 1020 1346 1515- 
1516 1556 1774 1877 1914 1977 21592161 2215 2248 2257 
2363 2529 2569 2614 26172812 2827 3093 3192 3195 3199 
3272 3286 3350 3356 3434 3470 3503 3537 3544-3546 3548 
3604 3634 3689 3709 3720 3722 3796-3798 3812 3839 3885 
4006 401 1 4022 4025 4043 4194 4201 4246 4253 4277 4388 
4391 4396 4405 4434 4641 4689-4690 4781 4957 4987 4998 
5001 5017 5030 5037 5052 5076 5118 5512-5513 5523 5526 
5701 5740 5778 5788 5796 5801 5895 5938 6005 6064 6125 
6140 6147 6192 6204 6272 6329 6458 6551 6590 6607 6873 
6886 6931 6958 7018 7058 7109 7118 7133 7171 7233 7506 
7532-7533 7561 7586 7733 7736 7807 7842 7845 7933 8065 
8085 8093 8095 81 10 81 16 8139-8140 8226 8235 8262 8323 
8359 8363 8365 8368 8372 8456 8535 8648 8702 8792 8816 
8838-8839 8858 8863 8865 8876 8923-8928 8943 8950-8951 
9128 9257 9264 9317 9455 9457 9466 9474 9544 9560 9773 
9781 9919-99209927 9939 9942 10000 10048-10049 10155 
10175 10267 10275 10296 10331 10549 10566 10774 10777 
10828 10944 11032 11061 11108 11113 11132 11173 11272 
11401 11406 11620-11621 11804 11818 11861 1191712006 
12039 12041 12073-12074 12078 12118 12127 12236 12327 
12375 12387 12402 12405 12424 12426 12511 12546 12560 
12570 12601 12617-12618 12670 12674 12734-12735 12749 
12772 12790 12913 12916-12917 12972 12977 13002 13021 
13066 13082 13084-13085 13169 13228-13229 13256 13260 
13295 13316-13317 13394 13456 13488-13489 13494 13497 
13512 13516 13546 13645 13649 13651 13659 13679 13739 
13776 13864 13866-13867 13888 


testis 


GIBCO 


ATSOOl 


50 142 500-502 564-565 598 673-678 963-968 1193 1346 1556 
1671 1826 1968 1977 2160-2161 2236-2237 2555 2590 2597 
3024 3093 3196-3202 3284 3378 3517-3522 3524 3720 3797 
3839 3885 4006 4025 4055 4109 4253 4354 4377 4388 4405 
4434 4473 4480 4543 4688 4792 4940 4957 4962 4964 4975 
5001 5018 5030 5037 5067-5072 5076 5118 5171 5481 5525 
5535 5740 5757 5788 5907 5924 6125 6147 6257 6316-6322 
6324 6607 6886 6952 7058 7141 7288 7445 7484 7534-7540 
7677 7710-7711 7713-7717 7735 7792 7815 8087 8110 8120 
8262 8368 8370 8375 8535 8577 8648 8655 8702 8729 8838- 
8839 8848 8898-8906 8936 8939 8966 9076 9087 9244 9264 
9304 9455 9466 9485 9542 9567 9703 9758 9822-9825 9908- 
9910 9928 10462 10475 10478 10482 10551 10583 10679 
10772 10778 10795 10955 10974 11108 11113 11116 11118- 
1 1 120 1 1 132 1 1 173 1 1239 1 1425 1 1 606 1 1620 1 1 629 1 1 695 
1 1807 11862 11918-11919 12000 12006 12017 12033 12057- 
12061 12375 12405 12424 12570 12577 12599 12601 12605 
12609 12668 12692 12730-12731 12748 12904 12917 12923 
12929 12935 12940 12956 12971 13047 13051 13073 13090 
13169 13222-13225 13249 13394 13442 13456 13479 13494 
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Library 
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13497 13502-13503 13506 13512 13515 13533 13549 13555 
13575 13613 13616 13627 13644-13646 13660 13866 13875 
13882 13889 


Genomic 
DNA from 
BAG 63118 


Research 

Genetics 

(CITB 

BAG 

Library) 

Invitrogen 


BAGOOl 
BlTDObl ~' 


8711 


adult bladder 


731 1710 1779-1791 20762367 2479-2481 40674208-4214 
4681 47674775 5163 5169 5553 55605581 5674 5684-5688 
5794 6082 6244 6716 6718-6721 6799 6931 7045 7094-7097 
7967 8110 8226 8249-8258 8364 8648 8737 9012 9097 9425- 
9428 9430 9626 9703 9928 10007 10261 10557 10734-10735 
1 1459 1 1504-1 1506 12000 12212 12512-12513 12515-12517 
12670 12735 12914 13085 13439 13512 13687 13838-13841 
13866 


bone marrow 


Glontech 


BMDOOl 


11 70 83 85 142 150 162-184 186-198200-210230-243 245- 
277 279-281 306 370 373-387 389 433-435 438-440 457 483 
503-510 566 568-569 595-596 598 609 616 679-683 740 770 
843-857 859 861 864 867-876 880-884 886-890 892-893 896 
900 902-920 1000 1004 1116-1120 1122-1126 1128-1134 1184 
1211 1346 1359 1516 1552-1554 1575 1583 1671 1724 1774 
1877 1894 1927 1974 1976 1983 2012-2024 2031 2033-2038 
2040 2043-2044 2084-2088 21 1 1 2120-2121 2132-2133 2137- 
2138 21612163-216421862189 2199-22002202-2203 2205- 
2210 2213-2218 2266 2291-2295 2342-2343 2414-2416 2444 
2529 2555 2566 2569 2575-2576 259 1 2597 2652 2681-2709 
271 1-2716 2735-2738 2740-2744 2746-2748 2750-2756 2758- 
2759 2761-2764 2766-2768 2770 2772-2781 2783-2787 2806 
2812-2813 2816 2873 2875 2880-2881 2883-2885 2887-2897 
2899 2901 2926 2948 2954 2958 2977 2984 3026-3029 3032 
3068-3069 3071 3081 3093 3096-3097 3139 3203-3205 3207 
3238 3257 3272 3282-3283 3286 3317 3345 3356 3404-3418 
3422-3434 3437-3438 3440 3442 3447 3449-3450 3456-3459 
3461-34643466-3473 3483 3497 3535 3538 3548 3557 3572 
3588 3593 3600 3604-3605 3610 3612 3625-3627 3634 3644- 
3647 3649-3651 3653-3657 3709 3711 3722 3725 3727 3737 
3797 3804 3808 3810 3839 3899 4005-40064011 4015-4019 
4023 4025 4040 4043 4058 4060 4129 4132 4192 4194 4201 
4246 4253 4277 4282 4367 4403 4466-4472 4474-4477 4479- 
4484 4486-4490 4492-4497 4509 4512-4540 4582 4595 4597- 
4606 4608-4615 4622 4642 4648 4650 4654 4667 4691-4695 
4723 4741-4742 4759 4767 4783 4794-47964808 4836 4928- 
4929 4931-4934 4938 4940 4942 4944-4948 4950-4953 4955- 
4960 4962-4964 4967-4971 4973 4976-4979 4985-4987 4992- 
4998 5000 5004-5016 5030 5037 5052 5075-5076 5093 5143 
5149-5151 5163 5169 5210-5215 5217-5226 5262 5317 5357 
5503 5523 5535 5537-5540 5560 5604 5695 5740 5748 5766 
5788 5796 5801 5862 5874 5895 5906 5908 5938 5965-5967 
5969-5979 5991-5994 5996-6005 6007-6009 6027 6047 6057 
6065 6067-6068 6071-6072 6082 6086 6089-6091 6112 6125- 
6127 6148-6152 6195 6214 6233 6257 6269-6276 6278-6280 
6282-6283 6285-6289 6292-6300 6321 6374 6393-6403 6508 
6555 6605 6607 6722 6730 6788 6815-6822 6826-6832 5836 
6859-6861 6867 6870-6871 6873-6874 6905 6915 6929 6933- 
6937 6940-6942 6944-6945 6988-6992 7051 7126 7155 7166 
7169 7199-7210 7212-7219 7226 7233-7253 7256-72647275 
73 17-73 18 7320 7322-7326 7347 7349 7373-7374 7379 7382 
7446-7448 7473-7474 7491 7502 7542 7544 7569 7577 7579 
7663-7671 7674 7677-7678 7680-7684 7686 7689-76967742 
7768 7791 7815 7819-7824 7847 7864 7930 7995 8065 8085 
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Source 


Library 
Name 


SEQ ID NOS: 








8088 8090-8093 8095 8110 8120 8134 8139 8224 8226 8235 
8241 8246 8262 8336 8354 8359 8363 8365 8368 8375 8421- 
8427 8429-8440 8444 8452 8456-8460 8463-8465 8467-8476 
8478-8489 8538-8539 8542-8551 8563 8566 8577 8593-8595 
8597 8603 8633 8635-8637 8648 8656-8657 8671 8677 8679 
8701 8792 8796 8817 8830 8840-8845 8847-8849 8852 8855- 
8860 8863-8868 8870 8873 8875 8879-8881 8898 8935 8944 
8963 8965 8995 9010 9012 9020-9031 9049 9051 9066 9076 
9092 9097-9098 9128 9257 9264 9285-9288 9304 9317 9339 
9381 9399 9401 9454-9455 9466 9471-9472 9474-9476 9516- 
9517 9519 9529 9544 9622 9630-9643 9646 9650-9665 9689 
9705-9712 9730 9734 9740 97S8 9779-9781 9826 9885-9889 
9892-9902 9928-9929 9939 9942 9972-9976 9978-9982 9999- 
10000 10068 10153 10175 10249 10275 10277 10284 10287 
10296 10319 10321 10343-10344 10346 10364-10374 10379- 
10382 10385-10394 10414 10416 10426 10429 10439 10447 
10452 10467-10468 10475 10483 10520 10522-10526 10528- 
10529 10531-10535 10587-10590 10595 10616 10677-10679 
10691 10750 10760 10767 10772 10778 10788 10794-10795 
10828 10831-10837 10846-10847 10849-10851 10853-10858 
10891 10895 10897 10899-10903 10905-10907 10931-10933 
10956-10959 10975-10976 11012 11032 11046 11057 11059 
1 1 06 1 11 077- 1 1 082 1 1 085- 1 1 086 1 1 088-1 1 098 1 1 1 08 1 1 1 1 3 
11132 11145 11177-11181 11194 11208 1 1289 11344 1 1401- 
11402 11513 11618 11620 11631 11673-1167511677-11690 
11695 11697-1170411706 11708-11715 11736 11742 11762- 
1 1772 11774 1 1795-1 1796 1 1 802-1 1804 1 1 817-1 18181 1829 
11839-11842 11863 11920 11946 11992-1200212004-12008 
12010 12012-12013 12015-12019 12022-12029 12033 12041 
12081 12142-1214912160 12178 12195 12200 12231 12236 
12283 12305 12308 12316 12327 12363 12368 12404-12405 
12424 12426 12430 12439 12527 12546 12570 12576 12590 
12608 12610 12616 12634-12636 12641-12645 12654-12655 
12657-12659 12669-12670 12674 12721-12724 12754 12761 
12834 12840 12842 12904-12905 12910-12911 12913 12916- 
12917 12922-12923 12925-12928 12933 12938 12956 12966 
12977 12989-12990 12993 12995 13020 13033 13047 13060 
13062 13064 13066 13095 13098 13103-13105 13107 13111- 
13113 13129 13135 13137 13141 13150 13163 13173 13181 
13213-13217 13234 13276-13278 13280 13391 13394 13400 
13456 13472 13477 13490 13492 13494 13498 13501-13502 
13506-13507 13511 13516 13528 13530-13534 13538 13540 
13546 13551-13552 13554-13555 13560 13585-13586 13594 
13613 13630 13633-13639 13644-13646 13649 13659-13660 
13670 13673 13679 13713 13775 13795 13866 13872 13875 
13882-13885 13889 13891 13893 


bone marrow 


Clontech 


BMD002 


51 242 442 654 1004 1134 1841-1904 1908 1927 2023 2107 
2215 2342 2408 2507-2529 2576 2597 2806 2866 3286 3434 
3722 3736-3737 3817 3823 3839 4060 4246 4258-42904389 
4396 4411 4618 4641 4828 4836 4957 4987 5030 5037 5052 
5108 5163 5526 571 1 5743-5772 5774-5804 5813 5895 5939 
6131 6236 6266 6287 6306-6307 6333 6655 6675 6728 6730 
6750-6762 6764-6769 6788 7049 7106-71 17 7137 7156 7738 
8068 8085 8134 8156 8290-8297 8299-8305 8307-8312 8323 
8368 8482 8499 8858 8863 8865 8928 8935 9076 9111 9128 
9465-9502 9626 9700 9703 9923 9928-9929 9942 10277-10280 
10282-10284 10286-10297 10396 10434 10515 10551 10645 
10675 10691 10744-10753 10772 10778 11057 11098 11108 
1 1 132 1 1232 1 1252 11 5 1 9 1 1 536- 1 1 553 1 1 606 1 1620 1 2033 
12039 12146 12260 12305 12387 12402 12405 12500 12554- 
12578 12594 12599 12608 12674 12754 12777 12839 12895- 
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12904 12916 12923 13020-13021 13043 13057-13064 13169 
13249 13446-13434 13479 13492 13494 13513 13533 13555 
13659 13713 13775 13803 13863-13875 13877-13887 13889 
13891 


b„„e„.™ 


Clontech 


BMD004 


2249 2529 3286 3494 3548 3551 3797 3839 4025 4058 4201 
4277 4282 5052 5108 6545 6961 8262 8898 9474 10000 11098 
11818 13021 13893 


bone marrow 


Clontech 


BMD007 


8539 9780 9927 13021 


adult colon 


Invitrogen 


CLNOOl 


319 346487 731 799 1792 1848 20502161 24492482-2483 
3431 39014215-42174940495749875163 523955605689- 
5695 5865 5911 5923 6722 6765 7098 7815 78647880 8110 
8259-8262 8486 8597 8951 9484 9529 9542 9556 10376 1 1507- 
11508 1161711869 12127 12236 12424 12518-12523 12601 
12610 12777 12976 13062 13073 13367 13440 13507 13512 
13630 13713 13843-13844 13864 13868-13869 13888 


Mixture of 16 
tissues - 
mRNAs* 


Various 
Vendors* 


CTL016 


6815 10776 12977 13064 13512 


Mixture of 16 
mRNAs* 


Various 
Vendors* 


CTL021 


1671 6738 8432 8648 8863 8944 9511 10769 13021 13062 
13064 


adult cervix 


BioChain 


CVXOOl 


50 67 142 158 308 332 346 475 598 654 895 1004 1086 1286 
1449 1516 1671 1698 1701 1711-1756 1758-1776 1828 1848 
1959 2134 2186 2257 2267 2343 2408 2414 2468-24742476- 
2478 2608 2716 3002 3136 3166 3191 3199 3529 3535 3554 
3572 36273722 3737 3777 3797 3839 3985 4158-41764178- 
4195 4197 4199-4207 4246 4277 4391 4396 4434 4641 4667 
4759 4783 4828 4885 4940 4957 4963 4987 4998 5001 5038 
5075 5108 5163 5293-5294 5455 5481 5523 5552 5581 5646- 
5652 5654-56595661-5671 5673-5681 5687 5701 5711 5723 
5740 5788 5794 5848 5902 5908 5923-5924 5964 6020 6052 
6057 6062 6091 6106 6112 6125 6129 6181 6350 6371 6374 
6410 64466458 6504 6508 6512 6551 6598 6686-6687 6689- 
6705 6707-6715 6788 6873 6893 6917 6998 7008 7045 7078 
7084-7093 7095 7130 7141 7148 7169 7204 7507 7579 7608 
7675 7733 7768 7815 7871 7880 7893 8078 8138 8209-8215 
8217-8236 8238-8242 8244-8248 8298 8345 8370 8444 8456 
8486 8499 8535 8558 8592 8633 8635 8648 8669 8679 8742 
8853 8863 8870 8898 8921 8939 8948 9012 9061 9098 9107 
9128 9 137 91 53 9304 9308 93 17-93 1 8 9355 9385-939 1 9393- 
9403 9405-9406 9408-9418 9420-9422 9457 9466 9475 9510 
9539 9612 9734 9773 9927-9928 9939 9947 9960 101 10 10175 
10230-10256 10258-10259 10267 10274 10319 10329 10344 
10491 10496 10540 10616 10660 10691 10722-10732 10778 
10782 11055 11145 11217 11376 11462 11477-11489 11491- 
11503 115191158411604 11695 11853 1186911891 11980 
12006 12066 12081 12127 12160 12195 12216 12240 12266 
12308 12363 12379 12402 12405 12424 12438 12483-12494 
12496-12510 12579 12605 12610-12611 12617-12618 12643 
12653 12670 12674 12688 12691 12703 12707 12735 12740 
12754 12830 12840 12866 12870-12881 12883-12884 12905 



* The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult Iddney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal Iddney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 
10) human leukemia lymphablastic mRNA (Clontech), 11) human thymus mRNA (Clontech), 12) human lympb 
node mRNA (Clontech), 13) human spinal cord mRNA (Qontech), 14) human thyroid mRNA (Clontech), 15) 
human esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain). 
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12913-12914 12917 12923 12951 12955 12957 12965 12989- 
12990 12993 13002 13020 13035 13045-13050 13062 13064 
13072 13092 13136 13141 13174 13197 13254 13283 13307 
13391 13428 13430-13437 13442 13473 13479 13492 13494- 
13495 13497-13498 13502 13532 13554-13555 13575 13590 
13597 13613 13616 13627 13644 13679 13713 13775 13829- 
13837 13866 13868-13869 13872 13884 13888 13891 


diaphragm 


BioChain 


DIA002 


731 1346 3548 3711 3885 4282 4654 5895 6873 8120 8931 
8936 9455 11132 11818 12405 12609 


endothelial 
cells 


Strategene 


EDTOOl 


21 51 67 83 332 569 598 609 762 796 1004 1024-1026 1086 
1561 1848 1928 1959 1976-1977 1983 2138 2161 2166 2257 
2282-2283 24172483 2490 2555 2569 2614 2926 3042 3189 
3191 3272 3300-3303 3426 3494 3503 3548 3574-35763605 
3627-3628 3673 3709 3720 3722 3737 3797 3839 3885 4005 
401 1 4019 4055 4133 4192 4246 4269 4282 4340 4354 4365 
4384 4388 4399 4405 4410-441 1 4434 4543 4641 4654 4767 
4797-4799 4802 4881-4882 4885-4886 4888 4940 4957 4964 
4972 4985 4998 5002 5017 5030 5076 5103-5104 5132-5 133 
5163 5167 5183-51855187-5190 53805523 552755355541- 
5542 5544 5674 5684 5693 5724 5766 5778 5788 5794 5796 
5874 5895 5916 5923 5928 5938 6005 6048 6057 6068 6082 
6165 6205 6215 6240-6241 6307 6321-6322 6349-6353 6383 
6458 6595 6598 6606-6607 6765 6799 6805 6815 6860 6871 
6873 6890 6918 6968 6972 6976 6979-6980 6998 7058 7067 
7104 71 13 7116 7137 7139 7169 7275 7468 7613 7716 7755- 
7757 7759-7760 7797 7799-7801 7930 8077 8084 8093-8095 
8120 8139 8235 8262 8320 8323 8335-8336 8345 8354 8358 
8363-8364 8370 8372 8375 8387 8452 8592 8648 8786 8788 
8792 8813 8863 8898 8944-8955 8965 8996-9001 9051 9076 
9097 9128 9264 9289-9291 9304 9315 9414 9455-9456 9466 
9472-9473 9475 9484 9504 9517 9529 9542 9563 9570 9626 
9703 9780-9781 9843 9927 9939 9961 10000 10027 10154 
10267 10285 10321 10330-10331 10342 10344 10349 10496 
10500 10550 10556-10557 10579 10679 10772 10776 10778 
10788 10795 10802 11013 11132 1 1136 1 1 146-11149 11406 
11483 11565 11588 11600 11606-11607 1 1615 11626 11807 
11818 11932 11955 12006 12034 12037 12041 12044 12078 
12092-12094 12123-12126 12150 12213 12375 12381 12387 
12405 12411 12424 12426 12522 12570 12576 12590 12601 
12610 12612 12615 12617-12618 12663 12670 12674 12707 
12729 12739 12749 12753-12754 12777 12830 12842 12913- 
12914 12916-12918 12929 12940 12972 12975-12977 13002 
13024 13047 13051 13054 13062 13064 13082-13084 13090 
13092 13094 13123 13126 13136 13195 13235 13263 13380 
13389 13392 13394 13400 13456 13479 13488-13489 13492 
13494-13499 13502 13506-13507 13514 13516 13546 13555 
13568 13575 13590 13592 13613 13616 13621 13630 13649 
13659-13660 13667-13668 13670-13673 13678-13679 13690- 
13692 13713 13796-13797 13838 13866-1386913872 13882 
13884 13888 13893 


Genomic 
clones from 
the short arm 
of 

chromosome 
8 


Genomic 

DNA 

from 

Genetic 

Research 


EPMOOl 


150 2023 2327 2490 4109 4783 5503 5560 10267 10760 12017 
12160 12557 12582 12923 13020 13514 


Genomic 
clones from 
the short arm 
of 

chromosome 


Genomic 

DNA 

from 

Genetic 

Research 


EPM003 


5560 12017 12146 
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8 








Genomic 
clones from 
the short arm 
of 

chromosome 
8 


Genomic 
DNA 

Genetic 
Research 


EPM004 


4783 4798 5560 10817 11926 12017 12160 


esophagus 


BioChaia 


ESO002 


999-1000 2449 3272 3315 3548 3550 3634 3697 3796 401 1 
4025 4058 4201 4282 5106 5163 5553 6082 6873 7739 9304 
10296 11133 11818 12033 12570 13869 


fetal brain 


Clontech 


FBROOl 


51 142 1 184 3664 4060 4109 4940 5021 5270 5523 5553 61 12 
6805 6908 7294 8558 9457 10376 11059 11985 12006 12122 
12160 12754 13438 13507 13888 


fetal brain 


Clontech 


FBR004 


60 2704 371 1 4025 4109 4783 5001 6082 7597 9010 9504 
9949 1 1837 12033 12039 12363 12705 12905 13020 13503 
13512 13891 


fetal brain 


Clontech 


FBR006 


6 60 67 598 800 932 1004 1170 1793-1794 1796-1797 1799- 
1805 1905-1914 1916-1958 1974 1976 1979 1983 2057 2129 
2174 2221 2407 2444 2449 2484-2492 2530-25542556-2561 
2563 2576 2857 3064 3207 3479 3556 3673 3709 3722 4060 
4078 4157 4218-4221 4223-4224 4277 4291-4334 4338 4355 
4364 4369 4431 4957 5001 5109 5270 5380 5553 5634 5696- 
5706 571 1 5724 5766 5788 5794 5801 5805-5832 5834-5879 
5882-5901 5936 5990 6057 6723-6732 6765 6770-6791 6797 
6805 6894 7049-7050 7100-7102 7105 7118-7123 7125 7127 
7169 7905 8263-8265 8267-8273 8294 8312-8333 8359 8361 
8375 8452 8633 8664 8740 8757 8884 9010 91 1 1 9432-9436 
9503-9516 9518-9545 9547-9551 9556 9570 9577 9780 9895 
9923-9924 9928 9942 10007 10027 10202 10263-10268 10276 
10284 10298-10310 10329 10331 10496 10542 10595 10621 
10736-10737 10755-10761 10772 10774 10795 11108 11132 
1 1406 1 1483 1 1509-1 1523 1 1555-1 1582 1 1589-1 1 590 1 1 600 
11606 11621 11713 11729 11807 11837 12006 12039 12044 
12092 12113 1221S 12231 12236 12327 12363 12398 12405 
12465 12511 12524-12530 12576-12577 12579-12601 12729 
12735 12754 12863 12869 12889 12906-12910 12914 12954 
12973 13020-13021 13051-13052 13054 13065 13082-13083 
13427 13445 13455-13470 13488 13490 13496 13498-13501 
13507 13516 13560 13613 13630 13649 13708 13713 13769 
13831 13845-13855 13868 13872 13882 13884 13888-13894 
13896-13900 


fetal brain 


Clontech 


FBRs03 


1005 4405 5111 6337 6964 7742 13084 13864 13891 


fetal brain 


Invitrogen 


FBT002 


51 83 142 321 430 746 932 1054 1058-1065 1493 1833 1947 
2273-2275 2299 2444 2449 2926 3479 3492 3885 4347 4354 
4391 4405 44104434 45304804 4985 4998 5075 5160-5169 
5380 5428 5466 5750 5788 5801 5895 6132 6215 6371-6374 
6458 6598 6973-6974 7067 7096 7776-7778 77S0-7782 7937 
8143 8323 8361 8364 8372 8377 8452 8633 8977-8984 8986 
9010 9142 9264 9332 9457 9474 9503 95 1 1 95 17 9539 9582 
9827 9848 9927 9950-9953 10027 10161 10329 10430 10492 
10573-10575 11014 11160-11164 11406 1 1628 11742 11814 
11830 11985 12092 12112-12114 12116-12117 12127 12424 
12511 12521 12570 12576 12643 12696 12735 12748-12751 
12754 12830 12835 12913 12957 12977 12990 13002 13020 
13062 13072 13083 13117 13254-13259 13377 13486 13489 
13496 13499 13507 13590 13649 13685-13688 13713 13867 
13888 13891 13893 


fetal heart 


Invitrogen 


FHROOl 


1001 100422504025 6334 6765 7740 8933 8935 9457 9544 
10000 11132 12599 12609 13021 13568 13656 13866 


fetal kidney 


Clontech 


FKDOOl 


142 346 364 511-517 570-572 574 598 685-690 969-970 972- 
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981 983-987 1 134 1346 2123 2167-2169 2238-2246 2342 2444 
2483 25 16 2555 2617 2728 2843 2876 3032 3049 3072 3206 
3208 3282-3283 3525-3531 3533-3543 3548 3591 3709 3722 
3797 3839 3878 4015 4019 4043 4246 4277 4367 4405 4696- 
4698 4725 4767 4805-4810 4940 4947 4957 4986 4998 5037 
5056 5073-5080 5082-5091 5099-5100 5108 5258 5504 5523 
5560 5923 6005 6207-6208 6225 6272 6288 6325-6332 6478 
6603 6702 6793 6815 6906 6953-6959 7045 7058 7204 7355 
7426 7449-7450 7520 7543-7546 7561 7587 7718-7732 7930 
8077 8097 8262 8375 8387 8452 8520 8638 8658 8736-8737 
8834 8863 8898 8907-8918 8922 8950 9010 9134 9257 9401 
9457 9544 9597 9760 9781 9791 9828-9830 9912 9914-9918 
10296 10440 10484 10546-10548 10772 11108 11121-11129 
11131-11132 11170 11513 11638 11695 11923-11924 12006 
12033 12062-12070 12072 12160 12405 12522 12570 12594 
12599 12605 12626 12663 12670 12732-12733 12749 12848 
12904 12914 12940-12941 12990 13020 13083 13188 13226- 
13227 13234 13263 13277 13280 13351 13391 13394 13491 
13501 13512 13590 13644 13647-13650 13713 13782 13867- 
13868 13872 13875 


fetal kidney 


Clontech 


FKD002 


3286 5030 5037 5105 11108 12033 12490 12570 13494 13866 


fetal kidney 


Invitrogen 


FKD007 


3272 38064025 4253 4277 4654 5112 5535 5788 5801 8863 
8935 9401 9466 10553 11628 11818 13494 13646 13866 


fetal lung 


Clontech 


FLGOOl 


79 2367 2395 3010 3460 3885 4828 4948 4962 5001 5723 5748 
5902 5908 6186 6738 7051 7067 76777759 9264 9553 9700 
10007 10478 11098 12017 12383 12417 12424 12749 12917 
13020 13169 13472 13554 13644 13782 13835 


fetal lupg 


Invitrogen 


FLQ003 


142 319 364 629 1671 1806-1814 1816-1819 1877 2129 2161 
2169 2367 2449 2493 2529 3191 3503 3610 4109 4225-4234 
4367 4434 4957 5108 5380 5421 5581 5707-5710 5712 5714 
5788 5801 6057 6733-6741 7034 7103 8274-8278 8365 8597 
8948 9264 9327 9437-9442 9444 9466 9510 9525 9530 9539 
9677 9773 9841 10007 10190 10198 10269-10271 10329 11519 
1 1524-11527 11927 12531-12539 12848 12890 12904 13021 
13072 13249 13445 13472 13489 13551 13575 13649 13670 
13679 13856-13857 


fetal lung 


Clontech 


FLG004 


1003-1004 2597 51 10 6963 9924 10552 1 1 138-1 1139 12080 
12990 13659 


fetal liver- 
spleen 


Columbia 
University 


FLSOOl 


-2 4-14 16-22 24 26 28-3 1 33-46 48-49 51-61 63-68 71-91 93- 
102 104-110 112-124 126-156 158-162 282-283 285-290 292- 
299 301-304 307-3 12 3 14-326 328-338 340-344 346-353 355- 
365 369 390-400 402 436 441 483 557 567 575-585 595 598 
629 673 678 691-699 701-702 708 731 736 763-767 769-776 
778-786 788-791 793-794 796 925 975 1004 1015 1023 1038 
1068 1104 1134 1144 1184 1192 1216 1264 1298 1346 1482 
1493 1516 1518-1521 1551 1556 1575 1583 1594 1636 1641 
1707 1724 1774 1826-1829 1841 1858 1927 1959 1962-1965 
1967-1972 1974-1979 1981-1998 2000-2009201 1 2045-2051 
2053-2055 2057-2058 2060-2063 2065-2083 2089-2094 2100- 
2101 2161 2170 2174 2184-21942215 2222 2269 2290 23 10 
2342 2409 241 1 2414 2444 2449 2458 2483 2490 2497-2498 
25 10 2516 2523 2529 2555 2562 2566-2576 2578-2586 2588- 
2591 2593-2601 2604 2607-26082611-26122614-26182620 
2622-2642 2644 2646-2653 2655-2664 2666 2668 2670-2680 
2696 2750 278R-2793 2795-281 1 2814-2826 2S28-283 5 2837- 
2842 2844 2846-2848 285 1-2858 2860 2862-2871 2876 2878 
2893 2900-2905 2907-2909 2926 2929 2949 2952-2953 2959- 
2960 2984 2992 3032 3058 3069 3073-30763078-3080 3082 
3093 3166 3194 3196 3207 3210-3211 3213-3214 3217-3225 
3249 3257 3272 3282 3286-3287 3304-3307 3310-331 1 3314- 
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3317 3319 3321-3332 3334-3346 3356 3419 3426 3434 3446 
3450 3455 3468 3470 3479 3484 3494 3503 3535 3537 3548 
3552 3557 3572 3590 3594 3597 3604-3605 3610 3612 3625 
3627 3634 3668 3670 3709 371 1 3720 3722 3729 3737 3777 
3797 3806 3808 3810 3813 3839 3885 3926 3990 3996-3997 
4006 4009 401 1 4019-4022 4025 4040 4043 4060 4078 4095 
4109 4129 4192 4194 4201 4245-4246 4253 4261 4277 4282 
4297 4335-4338 4340-4357 4359-436 1 4364 4366-4367 4370- 
4372 4374-4377 4379-4382 4384-4386 4388-4392 4395-4396 
4398-4414 4417-4424 4426-4455 4457-4458 4460-4465 4542- 
4543 4545 4547-4553 4555-4562 4565-4575 4577 4579 4581- 
4582 4585-4588 4593 4596 4607 4616-4629 4644-4645 4647 
4654 4671 4676 4687 4689 4694 4721 4726-4729 4759 4767 
4775 4783-47844788 4790-4791 4811-4819 4830 4837 4845 
4862 4874 4889-4902 4930 4940 4948-4949 4957 4962-4965 
4972 4985 4998 5022 5029-5030 5037 5040 5075-5076 5092 
5108 5132 5152 5163 5167 5171-5172 5335 5380 5398 5473 
5503 5514 5516 5523 5526-5527 5535-5536 5553 5581 5598 
5604 5616 5674 5684 5691 5693 571 1 5715 5724 5726 5748 
5750 5757 5778 5788 5794 5801 5817 5865 5874 5894-5895 
5902-5914 5916-5919 5921-5928 5930-5949 5951-5954 5956- 
5964 5966 5979 6005 6010 6013-6015 6017-6018 6020-6022 
6024-6025 6028-6032 6034-6048 6052-6062 6064 6068 6073- 
6081 6086 6092-6094 6104 61 12 6125 6135 6140 6147 6149 
6153-61576165 6186 6195 6209-6211 6240-6241 6243-6248 
6250-6253 6272 6276 6287-6288 6307 63 13 6338 6371 6374 
6419 6430 6446 6451 6458 6478 6496 6508 6545 6579 6595 
6598 6607 661 1 6624 6629 6642 6658 6695 6726 6728 6730 
6738 6745 6754 6765 6767 6788 6793-6797 6799-6808 6810- 
6815 6833-6843 6845-6847 6850-6856 6858 6860 6862-6865 
6870-6871 6875 6877 6891 6893-6894 6900 6907-6908 6917 
6919-6925 6950 6979 7049 7058 7067 7077 7096 7109 7116 
71187128-7131 7133-7135 7137-7141 7144-71477149-7150 
7 1 52-7 153 7 1 55-7 1 79 7 1 8 1 -7 1 92 7 1 94-7 198 7265-727 1 7273- 
7280 7282-7305 7307-7312 7316 7327-7336 7375-7376 7383 
7386 745 1 7455 7468 7473 7475-7477 7479 7484 7548-7553 
7555-7558 7561 7608 7617-7618 7620-7631 7642 7675 7687 
7695 7716 7768 7809 7811 7839 7842 7864 7883 7933 7977 
8069 8093 8095 8105 8110 8116 8139 8224 8226 8235 8241 
8262 8323 8334-8340 8342-8350 8352-8355 8357-8359 8361- 
8408 8410-841 1 8414-8419 8452 8456 8490-8498 8500-8502 
8504-8526 8529-8530 8537 8540-8541 8553-8560 8563 8566 
8568 8577 8592 861 1 8639 8648 8659-8661 8664 8668-8669 
8710 8738-8739 8741 8777 8792-8804 8830 8834 8839 8853 
8858 8863 8865 8876 8898 8926 8935 8939 8950-8951 8957 
8963 9010 9012 9076 9092 9097 91 12 9119 9128 9257 9264 
9266 9284 9291 9301 9304 93 13 9318 9352 9375 9399 9414 
9424 9445 9455 9457 9466 9473-9474 9476 9480 9484-9485 
951 1 9517 9528-9529 9536 9539 9542 9544 9552-9557 9559 
9561-9564 9566-9572 9574-9577 9579-9630 9646 9655 9666- 
9667 9669-9671 9673 9675-9697 9700-9701 9703-97049713- 
9715 9718-9719 9722 9726-9729 9734 9752 9758 9761 9777 
9780-9784 9813 9831 9854-9867 9869 9906 9927 9929 9939 
9942-9943 9969 9983 10000 10007 10205 10267 10284 10296 
10305 10311 I03I3-10316 10318-10325 10327 10329-10331 
10333-10354 10356-10363 10395-10397 10400-10403 10405- 
10410 10413 10417-10422 10427 10446-10447 10463 10470 
10485 10496 10502-10507 10515 10542 10557 10565-10566 
10645 10679 10688 10691 10729 10750 10760 10762 10764- 
10769 10772-10774 10776-10782 10784 10786-10795 10797- 
10815 10817-10830 10859 10861-10869 10871-10879 10881- 
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10893 10908-10916 10929 10934 10943 10948 10968 10971 
10975 10977-10983 11005 11009 11011 11015-11018 11028 
11031 11054-11063 11094 11108 11114 11132 11136 11142 
11194 11208 11213 1124611390-11391 11401 11406 11411 
11414 1145911462 11483 11508 11513 11521 11534 11559- 
11560 11565 11573 11585-11603 11606-11610 11612-11616 
11618-11638 11640 11642-1165711659-1166711669-11670 
11672 11699 11716-11718 11720-11722 11724-11741 11743- 
11758 11760-11761 11775-11787 11797 11799-11801 11818 
11830 11862 11864-11866 11869 11876 11889 11911 11925- 
11928 11958-11961 11963-11976 11979-11980 12000 12006 
12017 12019 12028 12033-12034 12041 12078 12086 12092 
12122 12127 12159-12160 12178 12216 12218 12236 12245 
12256 12260 12289 12303 12305 12308 12327 12330 12363 
12375 12382 12387 12402-12405 12424 12426 12439 12442 
12465 12470 12476 12511 12522-12523 12546 12551 12560 
12570 12576 12590 12601-12602 12604-12619 12621-12633 
12636 12639 12646-12653 12670-12671 12674 12682-12683 
12688 12693 12696 12706 12709-12715 12735 12749-12750 
12772 12790 12823 12830-12832 12835 12842 12849 12855 
12863 12904-12905 12910-12911 12913-12916 12918-12924 
12929 12931-12932 12934-12937 12939-12940 12954-12955 
12965 12973 12976-12977 12989-12990 12993 12999 13020- 
13021 13035 13051 13054-13055 13060 13062 13066-13067 
13069-1308713089-13095 13097-13102 13114-1311713119- 
13120 13122-13126 13134-13136 13141 13147 13156 13174 
13193 13195-13196 13198-13201 13254 13260 13264 13277 
13280 13295 13351 13368 13377 13391 13394 13400 13409 
13412 13420 13456 13472-13477 13479-13484 13486-13492 
13494-13496 13498-13499 13501-13510 13512-13516 13518- 
13527 13533 13541-13544 13546-13551 13553-13555 13560 
13575 13587-1358913597 13603-13604 13613 13616 13622- 
13623 13630-13631 13644-13646 13649 13659 13670 13679 
13713 13743 13748 13769 13775 13782 13793 13803 13808 
13818-13819 13858-13860 13864 13866-13869 13872 13882 
13884 13888-13889 13891 13893 13901 


fetal liver- 
spleen 


Columbia 
University 


FLS002 


6 16 24 30-3163 67 81 83 89 95 103 115 117126 140142 147 
150-151 158 162 21 1 225 287 308 332 356 358 390-391 438- 
439 483 551 556 641 654 694 701 708 731 788 997 1006 1012 
1047 1082 1151 1154 1178 1184 1208 1212 1356 1480 1507 
1551 1556 1623-1647 1649-1667 1669-1671 1675 1677-1698 
1828-1829 1858 1877 1885 1889 1914 1927 1947 1961 1997 
1999 2028 2057 2070 2092 2098 2138-2139 2154 2161 2174- 
2175 2189 2191 2257 2282 2327 2342 2363 2399 2403 2409 
2440-2441 2443-2450 2452-24592490 2498 2522-2523 2544 
2555 2568 2575-2576 2592 2597 2605 2619 2623 2659 2806 
2809 2812 2860 2869 2879 2903 2926 2932 2988 303 1 3066 
3075 3089 3188-3189 3286-3287 33 19 3343-3344 3356 3426 
3535 3548 3552 3554-3555 3583 3591 3610 3625 3634 3673 
3709 371 1 3720 3722 3839 3899 3926 3985 4005-4006 401 1 
4020 4025 4055 4058 4060 4078 4091-41 1641 18-4126 4128- 
4142 4172 4194 4201 4253 4277 42824347 4350 4353-4355 
4362 4367 4374 4379 4386 439 1 4394 4396 4402 4426 443 1 
4435 4437-4439 4512 4553 4578 4586 4607 4622 4644 4654 
4671 4758 4767 4783 4798 4836 4845 4899 4940 4948 4962- 
4963 4985 4991 4998 5001 5037 5108 5167 5171 5177 5198 
5237 5293-5294 5380 5400 5523 5535-5536 5581-5591 5593- 
5599 5601-5613 5615-5624 5627-5637 5653 5674 5691 5693 
571 1 5724 5726 5733 5748 5757 5772 5778 5794 5817 5874 
5894 5902 5904 5906-5907 5909 591 1 5916-5919 5923-5924 
5927 5929 5932 5938 5941 5948 5957 5959-5960 5962 5964 
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5979 6005 6020 6027 6037-6038 6052 6054 6057 6060-6062 
6064 6068 61 12 6140 6195 6225 6246 6287-6288 6306-6307 
6336 6354 6371 6374 6423 6430 6451 6458 6461 6469 6478 
6496 6508 6538 6545 6563 6580 6583 6595 6598 6607 6629 
6638 6641-6643 6645-6675 6709 6726 6730 6765-6166 6788 
6792-6793 6795 6797-6798 6800-6801 6808 6812 6836 6843 
6850-685 1 6855 6860 6864 6870 6875 6888 6908 6924 6950 
6958 6968 6998 7003 7018 7045 7049 7056 7071-7079 7095- 
7096 7109 71 16 71 18 7126 7133 7135 7137-7139 7155-7156 
7166-7167 7169 7175 7178 7275 7291 7294 7329 7422 7426 
7478 7608 7627 7687 7695 7716 7815 7839 7871 7893 7914 
7937 7977 8022 8095 8120 8133-8134 8148-8149 8151-8167 
8169-8182 8184 8186-8192 8194-8196 8241 8246 8276 8289 
8298 8336 8339 8343 8345 8349 8355-8356 8361 8363 8365 
8367-8368 8370 8373 8375-8378 8385 8388 8417 8496 8518 
8520 8543 8558 8561 8563 8646 8667 8710 8738-8740 8786 
8803 8813 8865 8926 8946 8948 8963 8970 9010 9049 91 19 
9128 9142 9164 9222 9264 9289 9296 9301 9317-9318 9320 
9322 9335-9341 9343-9362 9365-9368 9370 9372-9373 9375- 
9377 9399 9455 9457 9466 9472 9475 9480 9483 9495 9526 
9533 9536 9553 9556 9558 9560 9563 9567 9570 9582 9597 
9601 9630 9646 9655 9671 9683 9695 9700 9703 9715 9722 
9729 9733 9752 9758 9783 9843 9848 9855 9880 9936 9942 
9983 10007 10027 10103 10142 10186-10217 10274 10284 
10287 10296 10313-1031410329 10331 10349 10352 10354 
10400 10405 10430 10439 10496 10500 10507 10542 10621 
10709-10710 10712-10717 10750 10760 10776-10779 10782 
10794 10809 10816 10837 10885 10891 10895 10913 10960 
10971 10975 11057 11071 11098 11142-11143 11148 11194 
11246 11401 11406 11430-11453 11455-11464 11483 11504 
11508 11556 11560 11589 11596 11615 11618 11626 11631 
11714 11729 11830 11833 11865 11988 12000 12006 12017 
12019-12020 12041 12044 12081 12092 12178 12195 12245 
12256 12277 12297 12327 12363 12383 12402-12403 12405 
12426-12427 12444-12455 12457-12465 12467-12473 12475- 
12476 12520 12546 12576 12599 12607 12611 12614-12615 
12617-12618 12622 12627 12630-12632 12636 12647-12648 
12650 12685 12688 12691 12693 12696 12703 12739-12740 
12743 12763 12808 12830 12834 12842 12849-12858 12860- 
12866 12885 12895 12906 12910 12913 12915 12917 12920 
12922 12929 12933 12940^12941 12954-12955 12957 12965 
12977 12990 12993 12999 13021 13039-1304013047 13051 
13054-13055 13060 13062 13072-13073 13077 13082-13083 
13085 13092-13095 13098 13117 13123 1313613173 13195- 
13197 13202 13249 13254 13362 13377 13391 13394 13406- 
13409 13411-13415 13417-13420 13456 13473-13474 13477 
13480 13484 13486 13488-13489 13496-13498 13500 13502 
13504 13506-13507 13511-13512 13515 13521 13546 13551 
13554-13555 13560 13575 13590 13597 13613 13616-13617 
13622 13630 13659-13660 13670 13678-13679 13695-13696 
13701 13708 13713 13726 13748 13775 13795-13796 13815- 
13826 13835 13838 13859-13860 13864 13867 13869 13872 
13882-13885 13888-13889 13891 13893 


fetal liver- 
spleen 


Columbia 
University 


FLS003 


525 2269 2529 2627 2636 3552 3554-3555 3597 4201 4246 
4253 4422 4426 4434 4438 5536 5801 8935 9536 1 1 136 1 1 142 
11519 11626 11927 12028 12077-12079 12086 12305 12403 
12424 13021 13472 


fetal liver 


Invitrogen 


FLVOOl 


40 51 60 598 731 1023 1048-1049 1820 19762111 2268-2269 
2449 2483 2529 2568 2576 2614 2809 3207 3220 3468 3595- 
3599 3885 4020 4346 4390 4422 4434 4543 4615 4767 4833 
5092 5152-5155 5270 5505 5526 5553 5581 5715 5724 5726 
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5750 5766 5788 5794 5801 5902 6125 6147 6458 6598 6624 
6645 6765 6799 6805 6862 7104 7856 8070 8156 8359 8361 
8432 8740 8928 8935 8972-8974 9012 9041 9128 9264 9445 
9474 9485 951 1 9536 9593 9597 9703 9780 9927 9943 9945- 
9946 9948 10007 10137 10275 10334 10529 10557 10566- 
10569 10776 10828 11142 11560 11587 11618 11927 12070 
12086 12105-12108 12127 12218 12236 12327 12424 12511 
12541-12542 12605 12746 12916 12977 12985-12986 13020- 
13021 13060 13064 13135 13246-13247 13472 13479 13488- 
13489 13497 13533 13554 13590 13659 13680-13682 13859- 
13860 13882 13888-13889 


fetal liver 


Clontech 


FLV002 


360 996 1 134 3226 5102 9401 


fetal liver 


Clontech 


FLV004 


998 1927 2449 2627 4025 4043 4426 4438 4834 503 0 5726 
6730 9474 9942 10769 1 1 142 1 1587 12590 12608 13889 


fetal muscle 


Invitrogen 


FMSOOl 


40 150 731 894 1264 1555 1613 1821-1822 1883 1959 2161 
2343 2494-2496 2555 3203 4235-4237 4434 4671 4694 4783 
4885 4940 4985 5037 5045 5481 5716-5718 5724 5788 5902 
5932 6624 6742-6743 6825 7054 7104 7141 7632 7800 8279- 
8281 8863 8935 9204 9264 9446-9449 9451 9474 951 1 9536 
9556 9722 9780-9781 10097 10273-10274 10330 10738-10739 
10750 11098 11406 11528-11530 11560 12305 12544-12546 
12590 12609 12618 12663 12670 12702 12735 12891-12892 
12916 13051 13053 13195 13441-13442 13479 13613 13782 


fetal muscle 


Invitrogen 


FMS002 


3378 43484434 8932 11132 12033 12570 13782 


fetal skin 


Invitrogen 


FSKOOl 


60 142 235 319 64 1 683 800 1015 1050 1346 1774 1823-1825 
2044 2099 2 1 1 1 2 1 6 1 22 1 5 2223 2280 2367 240 1 2408 25 1 3 
25 16 26 14 28 12 287 1 2926 3207 3356 3468 3490 3503 3548 
3599 3720 3722 3885 4020 4032 4060 4109 4238 4240-4244 
4246 4253 4277 4340 4353 4355 4388-4389 4391 4405 4434 
4543 4562 4568 4654 4667 4671 4767 4940 4944 4947 5030 
5037 5075 5163 5198 5296 5380 5481 5514 5581 5656 5674 
5691 5719-5724 5748 5757 5801 5894 5902 5923 6057 6076 
6078 6125 6147 6215 63366374 64096437-6438 6551 6563 
6744 6783 6803 6805 6815 6873 6900 6908 6917 6985 6987 
7037 7054 7058 7067 7075 7275 7355 743 1 7484 7507 7587 
7627 7642 7653 7687 7871 7977 8110 8134 8186 8226 8235 
8251 8262 8282-8287 8298 8356 8365 8368 8452 8455-8456 
8539 8597 8648 8679 8813 8863 8898 8921 8935 8939 8948 
8951 8957 8983 9002 9041 9061 91229128 9257 9285 9414 
9452-9453 9466 9474 9484 9503 9517 9529 9553 9563 9671 
9677 9703 9714 9781 9841 9927 10007 10175 10296 10343 
10491 10741-10742 10778 10795 10907 10948 11239 11513 
11531 11573 11606 11626 11807 11869 11952 12000 12006 
12160 12190 12218 12256 12327 12363 12371 12375 12378 
12426 12438 12465 12470 12488 12549-12550 12570 12604 
12615 12617 12663 12670 12735 12749 12754 12830 12893 
12904 12910 12916-12917 12976-12977 13038 13084 13090 
13116 13249 13254 13367 13389 13391 13443-13445 13456 
13472 13479-13480 13494 13496 13499 13505 13512 13516 
13551 13554-13555 13575 13590 13613 13630-13631 13644 
13670 13713 13782 13784 13793 13803 13858 13&66 13869 
13882 13891 


fetal skin 


Invitrogen 


FSK002 


1004 3544 4834 5523 9922 9942 1 1 134-1 1 135 12570 13495 
13499 13793 1388413889 


fetal spleen 


BioChain 


FSPOOl 


997 5030 9466 11108 12033 12749 13590 


umbilical 


BioChain 


FUCOOl 


60 83 89 142 166 567 609 760 997 1302-1304 1306-1307 1309 
1671 169717241848 19171978 21112154 216122072215 
23 15 2343 2444 2569 2576 2591 2597 2652 2866 2926 3468 
3526 3599 3602 3625 3697 3722 3799-3805 3813 3839 3885 
4025 4060 4173 4197 4246 4277 4340 4364 4387 4391 4395- 
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Name 


SEQTDNOS: 








4396 4405 4410 4434 4543 4589 4667 4694 4767 4834 4865 
4940 4957 4963 4985 5030 5093 5096 5108 5163 5171 5213 
5313 5340 5342-5344 5380 5481 5521 5526 5627 5656 5674 
5714 5743 5750 5757 5801 5895 5902 5912 5938 6027 6061 
6082 6125 6186 6195 6218 6241 6313 6321 64ia6458 6486- 
6495 6595 6607-6608 6668 6745 6825 6860 6870 6979 7054 
7141 7260-7261 7275 7329 7355 7373 7383 74447579 7586 
7677 7716 7807 7930 7935-7936 7995 8077 8093 8226 8246 
8262 8298 8343 8345 8358 8387 8452 8535 8563 8635 8669 
871 1 8819 8858 8863 8921 8935 8943-8944 8948 8963 9001- 
9002 9012 9097 9130 9132 9134-9138 9264 9303-9304 9313 
9332 9401 9424 9466 9472 9474 9544 9597 9665 9677 9700 
9722 9734 9758 9834 9841 9848 10050 10175 10183 10267 
10277 10311 10337 10446 10470 10508 10529 10566 10691 
10772 10828 10961 11108 11132 11145 11239 11241 11273- 
11276 11293 11408 11484 11513 11596 11604 11607 11620 
11695 11823 11909 11927 11942 12000 12077 12160 12245 
12255-12256 12292 12305 12308 12363 12383 12404-12405 
12426 12438 12511 12570 12601 12605 12630 12700 12706 
12754 12875 12913 12916 12935 12957 12993 13002 13009 
13020-13021 13062 13135-13136 13141 1325413283 13318- 
13319 13336 13394 13400 13472 13479 13481 13488-13489 
13495-13496 13499 13507 13511-13512 13516 13551 13555 
13568 13575 13590 13613 13630 13644 13713 13740-13742 
13866-13867 13882 13884 13888 


fetal brain 


GIBCO 


HFBOOI 


51 70 211-213 215-222 224-229 445 586-588 598 608 703-710 
712-716 1068 1148 1178 1184 1193 1308 1516 1556 1774 1778 
1957 1974 1976-19772010 2025-20292134-2135 2145 2161 
2171 2174 2220 2223 2257 2266 2283 2444 2449 2510 2555 
2563 2569 2590 2597 2614 2617 27172719-2723 2726 2729- 
2734 2809 2997 3002 3081 3083 3086 3109 3226-3237 3239 
3257 3272 3278 3282-3283 3286 3356 3419 3460 3479 3492 
3495 3544 3548 3557 3590 3604-3605 3625 3627 3634 3673 
3689 3697 3709 3722 3797 3810 3839 3885 4006 401 1 4019 
4022 4025 4040 4054 4059 4095 4194 4201 4253 4277 4282 
4355 4364 4383 4399 4405 4410 4412 4434 4473 4480 4498- 
4504 4507-4508 4517 4543 4654 4689 4694 4730-4733 4735 
4755 4783 4820-4823 4825-4830 4845 4885 4940 4949 4957 
4962 4972 4985 5017 5019 5022 5031 5037 5108 5132 5167 
5171-5172 5262 5380 5481 5521 5523 5527 5553 5616 5656 
5711 5740 5788 5801 5894 5902 5906 5908 5923 5938 5979- 
5989 6057 6082 6096 6125 6147 6159 6161-6163 6165 6207 
6212-6213 6215-6222 6241 6257 6288 6336 63696374 6419 
6422 6512 6551 6555 6595 6598 6607 671 1 6767 6788 6823- 
6825 6860 6871 6879 6892-6894 6900 6909-6910 7058 71 18- 
71197169 7220 7222 7224-7225 7228-72297231-7233 7275 
7426 7431 7444 7481-7483 7485-7488 7560-7567 7569 7608 
7743 7768 7792 7814 7845 7864 7930 7977 7995 8093 8095 
8110 8140 8226 8235 8262 8345 8358 8361 8370 8372 8387 
8441-8443 8445 8447 8450-8455 8520 8535 8558 8597 8603 
8648 8663-8666 8677 8702 8742-8749 881 1 8838-8839 8943 
8951 9010 9092 9134 9137 9257 9287 9304 9317 9455 9457 
9466 9472 9475 9484 9510 9515 9533 9553 9567 9644-9645 
9647-9649 9734 9758 9781 9785-9786 9791 9832-9837 9927- 
9928 9939 9942 9970 10053 10175 10275 10277 10296 10329 
10375-10378 10434 10464 10486 10496 10645 10679 10691 
10778 10782 10791 10838-10844 10928 11019-11025 11027 
11032 11055 11061 11108 11132 11145 11153 11208 11239 
11343 11483 11513 11588 11596 11604 11606 11620-11621 
11668 11691-11695 11818 11867 11869 11929-11930 12006 
12033 12039 12041 12044 12047 12066 12078 12197 12218 
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Tissue 
Origin 


BJVA 
Source 


Library 


SEQ ID NOS: 








12327 12375 12405 1241 1 12424 12438 12521-12522 12564 
12570 12576 12582 12590 12599 12601 12604 12611 12626 
12630 12637-12640 12670 12674 12688 12694-12695 12703 
12734 12748 12754 12785 12835 12840 12913 12916-12917 
12929-12930 12935 12940 12946 12959 12989 12993 13020 
13051-13052 13054 13073 13083 13090 13108-13110 13117 
13131 13136 13148 13158 13160 13169 13184-13186 13254 
13263 13277 13295 13389 13394 13438 13456 13477 13479- 
13481 13489-13490 13494-13500 13502-13503 13506 13512 
13516 13535-13536 13555 13575 13590 13592 13605-13606 
13613 13616 13627 13630 13644-13645 13660 13670 13679 
13687 13713 13793 13803 13838 13861 13866 13868-13869 
13875 13888 13891 13896 


macrophage 


Invitrogen 


HMPOOl 


1002 2253 3548 401 1 4058 4201 4246 4282 5526 8093 8262 
11137 12039 12426 12511 12521 13888 


infent brain 


Columbia 
University 


IB2002 


6 89 142 21 1 276 307 518 589-590 598 644 717-718 720-721 
773 841 921-924 926-932 937 1023 1091 1 147-1 150 1289 1493 
1522-1525 1557-1564 1724 1778 1928 1947 1976 1978 2097 
2111 21382172 2218-2223 2257 2283 2401 2418-24192444 
2449 2516 2555 2563 2569 2575 25972627 2809 2818 2880 
2932 3087 3 166 3226 3241-3245 3272 3435 3474 3476-3477 
3481 3483-34863488-3489 3492 3494-3495 3497 3538 3544 
3548 3572 3595 3597 3669 3671-3672 3709 3720 3722 3737 
3797 3817 3885 3985 3997-3999 4006 401 1 4022-403 1 4040 
4095 4201 4246 4253 4277 4297 4355 4405 4410 4434 4676 
4689 4767 4775 4831-4834 4837 4885 4962 4983 4998 5017- 
5019 5021-5031 5035-5037 5040 5095 5143 5164 5167 5169 
5233-5236 5380 5505 55 17-5519 5523 5536 5546-5548 5691 
5695 5724 5750 5766 5788 5801 5895 6027 6048 6082 6095 
6111 61196132 6135 61476223 6301-6303 6305-6307 6374 
641 1 6459 6555 6563 6591-6595 6601 6606-6607 6609-661 1 
6613-6614 6624 671 1 6767 6799 6805 6909 6946 7018 7052 
7067 7118 7133 7166 7168-7169 7386 7464 7469 7473 7571- 
7573 7697-7698 7716 7814 7827-7828 7905 7934 7977 8071- 
8073 8084 8098-8104 8235 8320 8331 8351 8354-8356 8364- 
8365 8370 8372 8376 8452 8520 8572 8667-8668 8750-8752 
8813 8830 8863 8883-8885 8950 8958 8963 9040-9041 9251 
9257 9267 9293-9298 93 1 3 9424 9454 9456-9457 9466 95 10 
9514-9515 9533 9542 9556 9576-9577 9597 9626 9646 9722 
9758 9787 9903-9904 9988 10027 10138-10139 10155-10160 
10284 10296 10319 10346 10496 10536-10542 10557 10670- 
10672 10680 10682 10778 10791 10897 10971 11028-11030 
11052 11100-1110911122 11132 11192 11392 11403-11404 
11447 11513 11568 11595 11606 11626 11638 11818 11927 
1 1931-11933 12028 12030-12031 12033-12034 12036-12039 
12041 12047 12049 12146 12155-12157 12160 12327 12344 
12363 12383-12385 12402 12405-12408 12424 12439 12521- 
12522 12570 12590 12601 12611 12615-1261612626 12639 
12684-12685 12688 12696-1269712707 12725-12727 12729 
12748-12749 12754 12763-12764 12830 12904 12913 12916- 
12917 12923 12929 12959 12977 12990 12996 13000 13047 
13051 13054 13062 13073 13082-13084 13117 13141 13187- 
13188 13196 13218-13219 13249 13257 13277 13284-13285 
13351 13389 13394-13395 13456 13458 13479-13481 13488 
13494-13495 13499 13503 13512 13516 13530 13535 13575 
13607-13608 13613 13616 13641 13649 13659 13679 13705- 
13706 13708 13713 13793 13798-13801 13803 13861 13869 
13872 13875 13883-13884 13888 13891-13893 13896 


infant brain 


Columbia 
University 


IB2003 


6 46 746 1914 1947 1959 1994 21 1 1 2220 2257 2516 2962 
3166 3226 3272 3435 3572 3885 4277 4377 4410 4833 4837 
5029-5030 5040 5147 5259 585 1 6147 6423 6595 6598 661 1 
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Name 


SEQIDNOS: 








7145 7169 7716 8262 8354-8355 8364 8452 8863 9264 9457 
9515 9556 9577 9626 9646 9715 9781 9928 9970 10027 10329 
10376 10557 10791 11083 11289 11406 11459 11513 11663 
11933 12039 12405 12546 12570 12601 12616 12685 12754 

12913 12916 12954 12977 12990 13000 13056 13062 13195 
13257 13392 13456 13458 13480-13481 13488 13497 13499 
13506-13507 13511-13512 13514 13516 13549 13616-13617 
13649 13793 13861 13884 13888 13891 13896 


infant brain 


Columbia 
University 


IBM002 


1564-1565 19762218 2420 3166 4391 5788 6147 6302 10329 
1 1108 11513 12039 12729 12830 13062 13257 13512 13617 
13803 13859 13861 


infant brain 


Columbia 
University 


IBSOOl 


927 931 1564 2915 3166 3737 3885 5029 5044 50955143 5701 
5894-5895 6135 6307 7018 8452 8963 9424 9556 9626 10296 
10542 11083 11100 11108 11167 12383 12611 12785 13257 
13280 13479 13495 13506 13617 13679 13713 13896 


lung, 
fibroblast 


Strategene 


LFBOOl 


67 235 552 641 746 1346209021862343 27702812 2871 
3203 3537 3548 3634 3812 3839 4073 4253 4434 4654 4783 
4786 4821 4940 4957 4963 4985 5163 5172 5492 5523 5788 
591 1 5938 6027 6048 6057 6223 6272 6322 6599 6873 6886 
6997 7204 7759 7815 7933 7977 8226 8235 8262 8354 8452 
8648 8742 8791 8830 8863 889S 8944 8948 8951 9000 9076 
9097 9128 9290 9304 9414 9455 9466 9472 9476 9503 9544 
9563 9671 10000 10050 10342 10376 10434 10447 10679 
10760 10857 11055 11094 11289 11402 11607 11818 11837 
12000 12006 12178 12363 12405 12462 12582 12617 12636 
12670 12754 12913 12916-12917 12925 12940 13126 13377 
13394 13446 13456 13494 13497 13499 13502 13575 13613 
13670 13679 13691 13795 13797 13867-13868 13882 


lung tumor 


Invitrogen 


LGT002 


51 69 88 142 158 211 444 598 608 722-723 733 795-797799 
1004 1023 1135-1145 1308 1338 1346 1493 1526-1533 1535 
1566-1569 1626 1654 1816 1841 1848 1860 1976 1983 2090 
2 150 2161 2173 2223 2257 2266 2296-2297 2342 2402 2449 
2483 2555 25692576 2591 261 1 2623 2724 2809 2868 2880 
2926 3088 3166 3272 3347 3571 3658-3666 3673 3722 3737 
3797 3839 3885 4000-4002 4006 401 1 4025 4032-40344060 
4133 4201 4246 4253 4277 4282 4340-4341 4384 4388 4391 
4399 4402 4405 4434 4543 4622 4671 4686 4767 4783 4791 
4833-4834 4836 4885 4903 4940 4957 4987 4998 5001 5017- 
5018 5030 5037 5052 5108 5163 5210 5227-5232 5422 5451 
5520-5528 5581 5627 5637 5674 5693 5713 5724 5748 5766 
5772 5788 5894-5895 5902 5907 5917 5938 5990 6005 6020 
6094 6129 6147 6171 6224-6225 6239 6241 6253 6322 6336 
6404-6410 6461 6508 6595-6598 6624 6765 6793 6798 6851 
6870 6873 6994 7003 7008 7045-7047 7053-7055 7067 7085 
7109 71 16 7139 7141 7158 7169 7187 7468 7517 7524 7579 
7608 7676 7687 7716 7761 7825-7826 8074-8078 8105-8106 . 
8139 8235 8298 8323 8335 8345 8354 8359 8363 8365 8370 
8372 8375 8456 8563 8633 8648 8678 8741 8792 8798 8805 
8831 8863 8870 8989 9002 9010 9012 9032-9037 9039 9051 
9076 9080 9092 9128 9257 9269-9271 9273-9274 9276 9299- 
9301 9399 9414 9424 9457 9466 9472 9480 9484-9485 9517 
9533 9536 9539 9544 9567 9612 9626 9665 9677 9689 9700 
9703 9758 9773 9784 9788 9838-9841 9870 9928-9929 9942 
9970 9983-9986 10140-10143 10149 10161 10175 10275 10296 
10319 10330 10346 10349 10386 10496 10508 10561 10591- 
10593 10673-10674 10742 10774 10778 10782 10788 10802 
10836 10971 11031-11033 11064 11108 11132 11182-11191 
11393-11395 11406 11459 11462 11560 11565 11588 11596 
11604 11606-11607 11615 11618 11629 11821 11823 11845 
1 1869 1 1932 1 1934 1 1961 1 1978-1 1980 12000 12006 12017 
12022 12033 12078 12080 12127 12150-1215412178 12236 
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Tissue 
Origin 


RNA 


Library 


SEQ ID NOS: 








12327 12363 12371 12379 12386-12393 12409 12424 12521 
12570 12590 12616-12618 12630 12637 12674 12691 12693 
12698-12699 12716 12749 12754 12760 12762 12830 12832- 
12833 12842 12904 12907 12913 12917 12923 12993 13020 
13031-13032 13035 13060 13062 13064 13072 13083 13090 
13136 13150 13193 13197 13202 13234 13277 13279 13281- 
13282 13295-13296 13381-13383 13385-13386 13394 13400 
13456 13477 13479-13480 13484 13488-13489 13492 13494 
13496-13497 13499 13503 13506-13507 13511 13554 13575 
13597 13616 13630 13646 13649 13659-13660 13670 13701- 
13704 13713 13743 13782 13788-13789 13791-13792 13802- 
13803 13864 13867-13868 13872 13882-13884 13888 13892 


lymphocytes 


ATCC 


LPCOOl 


142 316 1034 1977 2161 2342 2367 2483 2513 2529 2932 2962 
3272 371 1 3722 3839 4006 4277 4282 4405 4434 4836 4970 
5037 5239 5451 5706 5796 5924 5941 6048 6147 6301 6765 
6793 6805 6815 7133 7171 7275 7320 7687 7815 8033 8093 
8095 8105 8110 8246 8262 8345 8365 8368 8370 8535 8664 
8674 8722 8834 8858 8898 8919 8963 9012 9257 9456-9457 
9484 9539 9544 9560 9758 9928 9939 10175 10201 10284 
10333 10496 10520 10529 10679 10744 10766 11032 11108 
11508 11513 11519 11853 11862 12033 12160 12363 12387 
12402 12424 12438 12520 12560 12570 12663 12830 12913 
12923 12968 13021 13066 13072 13085 13140 13147-13148 
13394 13479 13488 13495 13499 13507 13575 13867 13869 
13888 


leukocyte 


GIBCO 


LUCOOl 


21 49 5 1 67 83 88 94 142 21 1 316 326 3 40 3 68 3 71 403 438 
443-455 464 485 518-523 525-530 532-535 591-592 616 724- 
726 744 800 841 889 943 1004 1346 1556 1570-1572 1826 
1914 1916 1927 1959 1976-19772095 2103-21072124-2127 
2134 2138 2145 2161 2163 2175 2215 2223 2237 2266 2343 
2363 2412 2421 2449 2483 2529 2555 2569 2575 2597 2617- 
2618 2695-2696 2827 2876-28772899 2903 2961 2965-2975 
3002 3035-30383040-3046 3109 3132 3166 3199 3203 3207 
3246-3251 3265 3272 3286-32873312 3344 3356 3361 3378 
3400 3434 3451 3468 3479 3484 3494 3503 3552 3590 3610 
3627 3634 3704 3709 3720 3722 3725 3757 3797 3804 3810 
3839 3885 3985 3996 4006 4009-401 1 4019 4022-4023 4025 
4035-4037 4058 4060 4194 4253 4269 4277 4297 4341 4362 
4388 4390-4391 4396 4399-4400 4402 4405 4410 4412 443 1 
4434 4534 4543 4594 4615 4630 4641 4651-4663 4670 4688- 
4689 4694 4699-4706 4736 4755 4758-4759 4767 4775 4783 
4791 4798 4828 4835-4840 4845 4865 4930 4940 4947-4948 
4957 4962 4964 4972 4976 4985 499 1 4998 500 1 5003 50 1 7 
5022 5030 5037 5040 5075-5076 5108 51 18 5143 5163 5171- 
5172 5313 5481 5503 5505 5521 5523 5526 5535 5549 5552 
5656 5691 5724 5726 5740 5750 5766 5772 5788 5794 5796 
5801 5865 5874 5894-5895 5906 5908 5923-5924 5928 5938 
5989 6027 6042 6057 6063-6064 6082 6094-6103 6125 6130- 
6137 6142 6147 6166-6167 6171 6181 6214 6226 6239 6253 
6301 6307 6371 6374 6418 6512 6662 6716 6730 6788 6792 
6799 6815 6836 6860 6873 6876-6877 6886-6887 6945 6975 
6979 7018 7037 7056 7058 7067 7116 7118 7137 7155 7158 
7171 7275 7315 7384-7385 7387-7389 7391-7393 7395-7398 
7452-7463 7468 7494 7608 7671 7676 7687 7714 7733 7792 
7815 7845 7864 7870 7905 7930 8093 8107 8110 8120 8139- 
8140 8224 8226 8262 8276 8320 8363-8365 8368 8375 8387 
8432 8452 8456 8520 8535-8536 8539 8562-8563 8577 8579 
8597 8599-8601 8603-8605 8608 8640-8646 8648 8664 8669 
8674 8677 8691 8702 8722 8753 8755 8798 8815-8816 8830 
8838-8839 8858 8863 8870 8876 8898 8943-8944 8948 8951 
9001 9010 9012 9061 9076 9092 9097 9128 9257 9304 9454- 
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9456 9466 9472-9475 9484 9497 9511 9515 9517 9529 9536 
9542 9556 9567 9612 9626 9702-9703 9723 9731-9738 9745 
9752 9758 9762-9770 9780-9781 9789 9791 9808 9832 9842 
9927-9928 9939 9942 9970 10000 10175 10275 10319 10329- 
10331 10333 10345-10346 10412 10428 10430 10433 10435- 
10436 10453-10454 10456 10465 10470 10475 10488 10496 
10513 10566 10645 10679 10683 10691 10774 10776 10778 
10788 10791 10795 10802 10891 10928 10930 10935-10941 
10948 10961-10962 10964-10967110341105511108 11132 
11136 11145 11239 11289 11405-11406 11459 11483 11519 
11588 11604 11606 11618 11620 11693 11695 11805 11809- 
11816 11818 11830 11833 11848 11850-11856 11868 11927 
11932 11934-11935 11937-11939 11954 11988 12000 12006 
12009 12028 12033 12037 12039 12047 12053 12092 12097 
12118 12127 12143 12159-12160 12178 12186 12327 12336 
12363 12375 12387 12402 12404-12405 12410 12424 12438 
12442 12490 12511 12521-12522 12556 12560 12564 12570 
12590 12599 12601 12605 12615-12617 12636 12643 12653 
12660 12670 12672-12680 12693 12700-12701 12706-12707 
12715 12735 12754 12823 12895 12904 12910 12913 12916^ 
12918 12923 12925 12935 12945 12947 12955-12956 12972 
12977 12993 13002 13020-13021 13051 13062 13066 13072- 
13073 13082 13085 13117 13126 13130-13131 13135 13138- 
13140 13147-13151 13161 13169 13189 13254 13295 13391 
13394 13425 13436 13442 13456 13477 13479 13488-13490 
13494-13496 13498-13503 13505-13507 13512 13530 13546 
13550-13551 13554 13557-13564 13575-13581 13590 13592- 
13593 13609 13613-13616 13627 13630 13645-13646 13660 
13679 13713 13782 13803 13864 13866-13869 13872 13883 
13885 13888 13893 


leukocyte 


Clontech 


LUC003 


536 539 541-543 593 728 1552 1927 197430893252-3253 
3434 3548 3709 371 1 3722 3797 401 1 4019 4040 4060 4194 
4201 4277 4282 4622 4707 4791 4841-4842 4949 4998 5001 
5030 5345 5895 6052 6138 6227 6419 6595 6754 6765 6767 
6788 6911-6912 7049 7139 7171 7464 7575-7577 8093 8110 
81 16 8365 8370 8375 8592 8648 8830 8863 8944 9466 9544 
9612 9758 9771 9773 9790 9929 10326 10346 10679 10779 
10961 11035-1103611132 11401 11513 11853 1194012160 
12256 12405 12670 12674 12681 12840 12904 12906 13020 
13051 13162 13280 13400 13488 13511 13516 13554 13670 
13679 13860 13869 13883 13889 


melanoma 
from cell line 
ATCC#CRL 
1424 


Clontech 


MEL004 


83 142 360447 841 1061 1346 1516 1573 1724 1959 1963 
2159 2175 2343 2367 2513 2652 28122876 2961 3001 3272 
3345 3604 3627 3722 3754 3839-3840 3985 3988 4040 4194 
4246 4341 4515 4568 4641 4667 48624940 5075 5132 5163 
5481 5492 5523 5740 5788 6008 61126119 6147 6189 6195 
6272 6287 6418 6423 6508 6563 6726 6860 6870 7134 7166 
7169 7275 7294 7579 7677 7748 783 1 7852 7980 81 10 81 16 
8226 8235 8262 8320 8345 8363 8370 8456 8563-8564 8633 
8838 9001-9002 9128 9296 9304 93 13 9484 9542 9544 9646 
9703 9758 9780-9781 9808 9999 10007 10027 10296 10346 
10470 11145 11239 11289 11401 1 1406 1 1508 11596 11606 
11620 11823 11899 11950 12019 12107 12160 12292 12329 
12363 12405 12436 12523 12599 12658 12670 12749 12754 
12774 12842 12930 13020 13296 13394 13458 13489 13498 
13501 13507 13551 13554 13575 13613 1361613649 13660 
13743 13775 13868 


mammary 
gland 


Invitrogen 


MMGOOl 


51 67 142 449 594 598 616 708 729-733 10041060 1194-1196 
1310-1315 1329 1536-1541 1826 1848 1858 1914 1947 1974 
1976-1978 2174-2175 2283 2299 23 10 2336 2343 2403-2406 
2449 2483 2555 2563 2576 2597 261 1 2614 2617 2812 2827 
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2926 3 191 3207 3490 3503 3673 3714-3715 3722 3737 3777 
3796 3807-38193885 4003-40044060 4330 43464355 4391 
4399 4434 4543 4671 4767 4791 4836-4837 4843 4940 4964 
4975 4985 4998 5001 5017 5019 5022 5052 5108 5155 5163 
5168-5169 5269 5345-5347 5466 5526-5527 5529 5581 5615 
5724 5772 5778 5788 5794 5894-5895 5902 5907 591 1 5917 
5923-5924 6048 6057 6064 6094 6109 6125 6147 6156-6157 
6215 6228 6321 6430 6458 6496-6499 6545 6579 6598-6599 
6607 671 1 6765 6780 6793 6797 6799 6805 6880 693 1 6979 
6987 6996 7012-7013 7048 7054 7057 7104 7141 7158 7169 
7517 7578-7581 7583 7872 7937-7940 8110 8139 8156 8224 
8262 8292 8298 8336 8359 8363 8365 8372 8452 8619 8628 
8646 8648 8722 8758-8759 8782 8813 8815 8863 8882 8939 
8963 8983 9012 9097 9111 9139-9141 9143 9157 9164 9264 
93 13 9327 9347 9466 9484 951 1 9517 9544 9553 9563 9577 
9646 9700 9703 9734 9746 9780 9791 9844-9846 9927-9928 
9942 10006-10007 10051-10055 10144 10146-10147 10162 
10285 10330 10346 10439 10490-10491 10496 10542 10557 
10600 10628 10645 10691 10729 10772 10796 10983 11038 
1 1 108 1 1 132 1 1216-11217 1 1277-1 1279 1 1396-1 1397 1 1406 
11459 11513 11596 11606-11607 11626 11628 11823 11830 
1 1833 11902 11941-11943 11979 12000 12009 12037 12047 
12078 12093 12146 12160 12181 12258-12263 12281 12327 
12371 12383 12387 12395 12424 12426 12438 12442 12484 
12511 12564 12570 12590 12594 12601 12605 12609 12615- 
12616 12618 12621 12653 12670 12691 12699 12702-12703 
12749 12771 12791-12794 12834-12835 12885 12898 12905 
12913 12916-1291712920 12923 12955 12976-12977 13002 
13020 13035 13047 13062 13064 13066 13072-13073 13084 
13090 13092 13127 13135 13169 13190 13193 13195 13249 
13254 13296 13320-13322 13392 13456 13477 13479-13480 
13488-13489 13494 13497-13499 13505-1350713512 13514 
13516 13546 13551 13554-13555 13590 13611-13612 13631 
13649 13670 13713 13743-13745 13838 13864 13866-13867 
13869 13872 13875 13882-13885 13888 13891 


induced 
neuron cells 


Strategene 


NTDOOl 


88 1493 1552 15612034 2090 2510 2522 257025762623 3226 
3272 3839 4006 4058 4282 4384 4694 4783 4888 5103-5104 
5172 5523 5541-5542 5684 5766 5788 5794 5938 6082 6195 
6418 6449 6607 6960 6972 7058 7494 7815 7937 8370 8929- 
8930 8996 9000 9003 9128 9157 9289-9291 9472 9928 10275 
10550 10729 10760 11607 11736 11818 1203912075 12094 
12256 12363 12381 12423 12522 12707 12735 12863 12906 
12958-12959 12963 13020 13052 13054 13196 13202 13310 
13368 13378 13438 13479 13486 13506-13507 13546 13616 
13654 13659 13691 13795 13891 


retinoid acid 
induced 
neuronal cells 


Strategene 


NTROOl 


1552 383938854282 4434 5030 9308 9466 1110811132 
12034 12039 12405 12424 12570 12905 13047 


neuronal cells 


Strategene 


NTUOOl 


88 708 1552 1561 1947 2223 2490 2614 3272 3610 3625 3627 
4032 4201 4253 4297 4337 4434 4543 4622 4641 5104 5163 
5523 5541 5684 5788 6094 6147 6307 6601 6862 7755 8648 
8740 8996 9097 9128 9289 9308 9466 951 1 9544 10333 10779 
10963 11032 11108 11132 12034 12039 12075 12405 12424 
12570 12590 12706 12749 12830 12905 12917 12963 13035 
13073 13280 13490 13507 13613 13616 


pituitary 
gland 


Clontech 


PIT004 


1 148 5674 6068 6371 7759 8093 10430 12605 12636 12658 
12916 13021 13082 13875 


placenta 


Clontech 


PLA003 


3885 4025 5030 5536 5711 5923 8349 8367 9921 12033 12076 
12403 12511 13655 13885 


prostate 


Clontech 


PRTOOl 


67 83 142225 235 379486 572 616 1286 1316-1329 1538 
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Library 


SEQ ID NOS: 








2023 2139 2147 2161 2266 2290 2337-2344 2346 2367 2422 
24442511 2631 2809 3001-3002 3084 3419 3563 3625 3627 
3797 3810 3821-38273829-3835 4019 4022 4173 4282 4332 
4434 4667 4755 4791 4862 4865 4998 500 1 5066 5171 5349 
5352-5356 5358-5361 5363-5364 5481 5581 5656 5674 5723 
5740 5902 6027 6047 6125 6321 6371 6374 6427 6458 6463 
6500-6507 6509 6563 6598 6643 6793 6803 6871 7014-7015 
7058 7104 7275 7320 7608 7635 7733 7842 7852 7864 7941- 
7943 7946-7951 7953-7954 8077 8087 8093 8110 8224 8226 
8452 8487 8520 8558 8635 8677 8863 8950 8963 8983 8997 
9012 9145-9148 9150-9154 9269 9302 93 17 9466 9503 9528 
9646 9703 9780 10000 10027 10056-10060 10491 10629 10691 
10777 10891 11145 11200 11239 11281 11283-11295 11344 
11406 11761 11837 11862 12006 12166 12264-12278 12305 
12363 12368 12411-12412 12438 12467 12685 12691 12729 
12734 12795-12798 12800 12830 12863 12904 13010-13011 
13104 13146 13295 13323-13326 13377 13394 13456 13473 
13477 13489 13516 13533 13550 13611 13678-13679 13746 
13866-13867 13884 13889 


rectum 


Invitrogen 


RECOOl 


6 67 142 683 731 997 1178 1909 19592005 2023 2596 2611 
2614 2809 2926 3314 3333 3455 3722 3780 3870 4173 4355 
4837 4949 4987 5526 5580 5615 5674 5691 5724 5788 5895 
5909 5924 6057 61 12 6195 6765 6805 7815 7833 8095 8664 
8863 9517 9539 9544 9884 9927 10400 10666 1 1401 1 1 513 
11606 11985 12160 12327 12428 12693 12848 12910 12977 
13051 13064 13072 13420 13494 13497 13507 13512 13515- 
13516 13866 13869 13884 13888 


salivary gland 


Clontech 


SALOOl 


67 731 800 997 1054 1914 2267 2395 2529 3 136 3595 3627 
401 1 4192 4246 4330 4434 4641 4957 4987 5040 5052 5163 
5451 5481 5706 5723 5788 5895 6219 6621 6801 6900 6975 
7045 7733 8] 10 8372 8535 8563 8635 8830 8951 9000 9010 
9051 9313 9472 9475 9671 9724 9758 9927 10027 1 1 145 
11695 11725 12017 12284 12363 12424 12427 12570 12609 
12670 12674 12693 12977 13035 13307 13554 13617 13867 
13872 13889 13891 


salivary gland 


Clontech 


SALs03 


1516 1724 1858 5030 6186 13657 13864 


skin fibroblast 


ATCC 


SFBOOl 


2251-2252 5788 6068 12511 


skin fibroblast 


ATCC 


SFB002 


6068 8951 12511 


skin fibroblast 


ATCC 


SFB003 


4025 5895 7741 


small 
intestine 


Clontech 


SINOOl 


142 319 627 654 1034 1063 1197-1198 1330-1338 1340-1359 
1575 1646 1774 1814 19782161 2347-23542409 2876 3046 
3419 3460 3605 3716-3718 3737 3797 3837-3839 3841-3843 
3845-3857 3885 3986 40604201 43014351 4385 4568 4689 
4694 5076 5163 5270-5273 5304 5326 5365 5367-5372 5374 
5503 5550 5701 5772 6064 6094 6171 6288 6427 6430-6432 
6438 6510-6522 6598 6615 6793 6815 6997-6998 7016-7018 
7054 7058 7072 7309 7450 7604 7769 781 1 7873-7876 7955 
7957 7959-7962 7964 8120 8298 8350 8452 8830 8863 8950- 
8951 8966 9010 9073-9075 9119 9126 9128 9155-9166 9303 
9544 9560 9780 9884 9928-9929 10008-10010 10061-10068 
10097 10262 10330 10351 10601 10630-1063410760 10983 
1 1061 11219 1 1296-1 1308 11310-1 1313 1 1513 1 1620 11693 
12182-12183 12280-12287 12327 12363 12488 12707 12799- 
12801 12922 12991 13012-13014 13035 13051 13064 13297 
13307 13328-13332 13335 13382 13499 13506 13554 13560 
13575 13631 13695 13714 13747-13749 13751 13882 13884 


skeletal 
muscle 


Clontech 


SKMOOl 


1104 1346 2363 2367 2495 2555 2876 2880 3555 3634 3722 
4011 402241944201 4253 4277 4282 4434 4641 4940 4972 
4998 5343 5481 5523 5801 6005 6336 6873 7408 7995 81 10 
8120 8235 8262 8292 8345 8372 8576 8740 8830 8936 8951 
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9303 9689 10616 10679 11132 11145 11825 11869 11927 
12387 12442 12467 12570 12663 12735 12916 13280 13503 

13872 13885 


skeletal 
muscle 


Clontech 


SKM002 


8535 


skeletal 
muscle 


Clontech 


SKMsOS 


6336 6962 8936 


skeletal 
muscle 


Clontech 


SKMs04 


770 1724 3797 4277 6336 12405 13658 


spinal cord 


Clontech 


SPCOOl 


83 142 390 415 598 668 708 731 1184 1199-1207 1360-1375 
1377-1396 1516 1574-1576 1595-1596 1849 192720702129 
2161 231 1-23 14 2345 2355-2368 2423-2424 2430 2484 2529 
2569 2576 2876 3215 3249 3272 3283 3532 3584 3627 3634 
3711 3719-3722 3737 3839 3860-3884 401 1 4025 4038-4039 
4043 4055-4056 4173 4246 4282 4354 4375 4391 4434 4681 
4767 4781 4808 4964 4985 4998 5037 5163 5233 5274-5277 
5375-5392 5394-5402 5523 5569-5570 5581 5615 5723 5788 
5835 5902 5928 5936 6047 6078 6082 62 1 1 6288 6374 6433- 
6435 6512 6523-6531 6534 6595 6616 6625-6626 6788 6894 
6979 6999 7018-7026 7126 7166 7359 7473 7642 7653 7807 
7814 7877-7879 7965-7968 7970 7972-7980 8105 8108-81 10 
8139 8246 8298 8345 8363 8368 8482 8603 8646 8884 8898 
8981 9010 9012 9076-9078 9098 9167-9179 9184-9189 9264 
9302 9304 9319-9320 9455 9466 9520 9530 9544 9556 9567 
9781 9895 9901 9928 9942 9947 9969 9999 10007 10069- 
10077 10079-10085 10177 10296 10326 10346 10376 10422 
10566 10602 10635-10638 10679 10685-10686 10729 10776 
11132 11220 11246 11314-11323 11325-1133011417-11418 
11459 11513 11818 12000 12011 12017 12033 12039 12160 
12184-12185 12288-12292 12295-1'^'^99 1^301-1^305 l'^363 
12375 12383 12387 12402 12413 12442 12468 12527 12605 
12617 12636 12657-12658 12739-12740 12754 12772 12802- 
12809 12830 12835 12841-12842 12905 12923 12940 12976 
13003 13015 13017-13021 13051-13052 13117 13126 13136 
13260 13277 13283 13295 13336-13343 13367 13442 13456 
13473 13477 13481 13495 13497 13499-1350013507 13516 
13659 13670 13713 13715-13716 13748 13752-13759 13803 
13869 13872 13884-13885 13888 13893 13896 


adult spleen 


aontech 


SPLcOl 


800 1927 4032 4834 6064 6135 6195 6446 6788 6873 7166 
7455 8966 9929 10744 12402 12564 12590 12691 12904 12933 
13082 13500 13506-13507 13516 13575 13864 13869 13883 
13889 


stomach 


Clontech 


STOOOl 


21 83 142 1004 1208-1215 1217-1219 1397 1399-1405 1671 
23 15-23 16 2345 2369-2373 2375 2575-2576 2809 2846 2984 
3 136 3166 3537 3610 3698 3723-3725 3839 3885-3897 4057- 
4059 4173 4277 4410 4480 4667 4791 4808 4940 4987 5262 
5278-5281 5283-5284 5403-5405 5407-5424 5481 56565674 
5796 5904 6418 6436-6440 6535-6540 6563 6627-6629 6765 
6940 7000-7001 7027-7030 7064 7135 7509 7604 7880-7885 
7981-7990 8087 8110 8120 8143 8226 8452 8535 9010 9079- 
9081 9191 9193-9196 9304-9306 9313 9317 9321 9715 10007 
10011-10013 10086-10093 10178-10179 10603-10605 10640- 
10642 11069 11167 11221-11222 11331-1133711339-11343 
11419 11513 11818 12186-12190 12307-1231412327 12363 
12425-12427 12438 12617 12773-12774 12810-12811 12834 
13082 13103 13298-13299 13344-13349 13592 13630 13670 
13717 13760-1376413782 13888 


thalamus 


Clontech 


THA002 


579 598 616 1065 1148 1220-1221 1223-1226 1407-1432 1597 
2266 2317-23192340 2342 2376-23782380 2431 2444 2555 
3093 3230 3286 3537 3722 3726-3732 3737 3898-3902 3904- 
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Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








3918 3920 3922 4060-4062 4173 4201 4282 4360 4391 5270 
5285-5288 5425-5438 5440-5449 5701 5902 5938 6137 6437 
6458 6461 6541-6544 6546-6550 6630 6755 7031 7095 711$ 
7166 7484 7579 7815 7849 7905 7977 7991-8000 8002-8005 
8126-8128 8134 8363 8558 8664 8786 8890 8930 8963 9082- 
9085 9128 9197-9199 9201-9216 9251 9264 9308 9503 95 15 
9556 9646 9703 9928 10014-10017 10027 10094-10100 10102 
10267 10496 10606-10608 10643-10650 10696 10891 11145 
11223-11225 11344-11355 11406-11407 11420 11513 11604 
11695 11791 11950 12022 12107 12191-1219912315-12331 
12363 12375 12405 12428-12430 12688 12706 12735 12748 
12754 12812-12813 12815 12835 12914 12959 13020 13023- 
13025 13060 13073 13300 13351-13358 13402-13403 13489 
13496 13506 13512 13719-13720 13765-13768 13770 13872 
13883 


thymus 


Clontech 


THMOOl 


51 142 150 332 346 360 438 546 731 760 895 1004 1104 1227- 
1234 1264 1391 1516 1577-1585 1812 1860 187721292174 
2215 2263 2321-2322 2408 2414 2425 2449 2490 2555 2569 
2575 261 1 2847 2880 3435 3530 3722 3727 3733 3735-3742 
3839 3870 4006 4041 4043-40464060 4109 4375 4396 4399 
4434 4667 4671 4759 4791 4885 4976 4987 5108 5289-5290 
5466 5481 5526 5553-5554 5796 5956 5979 6020 6186 6253 
6336 6371 641 1 6438 6441-6447 6617-6621 6765 6788 6797 
6870 6886 6908 6972 6994 7059-7060 7126 7141 7166 7168 
73107383 7450 7494 7632 7716 7779 7887 7889-7891 8087 
811 1-81 14 81 17-8120 8139 8226 8343 8368 8370 8452 8456 
8633 8830 8898 8921 8963 9010 9076 9086-9087 9128 9134 
9264 9304 9307-93 13 9401 9466 9497 9544 9563 9582 9612 
9626 9646 9758 9781 9927 9937 9969 10018 10065 10163- 
10166 10175 10284 10329 10414 10557 10609-10610 10617 
10679 10687-10691 10742 10760 10772 10776 10778 10881 
10891 1 1 187 1 1 194 1 1226-11229 1 1274 11406 1 1408-1 1410 
11412 11459 11513 11676 11695 11830 11865 11942 12000 
12006 12016 12022 12092 12160 12186 12200-12204 12327 
12363 12414-12417 12427 12462 12470 12490 12564 12693 
12706 12735 12740 12835 12840 12898 12905 12910 12913- 
12914 12916 12977 13020 13036 13051 13062 13111 13126 
13141 13295 13301-13305 13326 13351 13391 13396-13397 
13456 13484 13498 13505 13507 13512 13516 13546 13713 
13803 13805 13808 13866 13869 13885 13888 


thymus 


Clontech 


TElMc02 


16 27 51 67 142 390 598 1233 1493 1508 1586-1588 1598-1603 
1724 1841 1918 1927 1959 1976 1979 1989 2057 2107 2161 
2223 2290 2407 2426-2428 2432-2433 2444 2487 2569 2617 
3537 3664 3711 3720 3722 3737 3780 3817 4021 4025 4047- 
4049 4060 4063-4068 4246 4277 4377 4405 4688 4694 4759 
4791 4837 4949 4957 4992 4998 5037 5052 5108 5505 5553 
5555-5564 5571-5572 5683 5766 5772 5788 5796 5894 591 1 
6048 6186 6225 6288 6306 6333 6621 6728 6730 6765 6767 
6780 6788 6815 6867 6873 6979 7049 7061-7062 7166 7169 
7676 7687 7809 8084 8121 8129-8130 8179 8262 8354 8363 
8365 8375 8482 8597 8654 8740 8786 8791 8963 9076 9157 
9257 9264 9314 9323-9324 9401 9454 9466 9519 9529 9536 
9637 9700 9703 9927-9929 9940 10004 10007 10070 10167 
10169-10172 10180 10182 10267 10284 10326 10331 10383 
10439 10452 10542 10605 10691 10693-1069410697 10744 
10775 10778 10795 11046 11098 11274 11413-11415 11462 
11519 11576 11592 11606 11618 11621 11627-11628 11693 
11807 11814 12022 12034 12044 12080-12081 12086 12160 
12236 12256 12327 12335 12363 12368 12387 12418-12419 
12424 12433 12556 12560 12564 12570 12577 12594 12599 
12612 12663 12735 12754 12836-12839 12844 12905 12913 
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12955 12977 13020 13035 13037 13051 13062 13072 13087 
13193 13280 13295 13398 13404 13456 13488 13499-13500 
13503 13507 13516 13533 13535 13546 13611 13613 13630 
13646 13649 13659 13670 13678-13679 13713 13769 13775 
13803 13806-13808 13810-13811 13869 13872 13882-13883 
13885 13888-13889 


thyroid gland 


Clontech 


THROOl 


49 115 142 360 641 698 800 1004 1134 1193 1233 1235 1237- 
1241 1329 1433-1471 1473-1476 1478-1479 1481 1589-1591 
1593-1594 1604 1606-1612 1614-1615 1639 1671 1906 1927 
1959 1976 19942090 2099 2134 2150 2161 21742186 2215 
2343 2367 2381-2387 2429 2434-2437 2449 2490 2510 2529 
2555 2569 2662 2827 2901 2961 2997 3001 3109 3265 3286 
3425 3490 3503 3560 3643 3698 3722 3737 3743-3747 3797 
3839 3885 3923-3941 3943-3954 3956-3961 4009 4050-4053 
4060 4069-4082 4109 4173 4194 4253 4277 4282 4348 4354- 
4355 4376 4391 4396 4412 4434 4641 4689 4694 4755 4759 
4783 4791 4834 4957 4972 4985 4987 5018 5037 5052 5075- 
5076 51085147 5163 5171 5270 5292-52945317 5380 5450- 
5461 5464-5481 5503 5523 5565-5568 5573-5574 5693 5711 
5724 5757 5772 5788 5894-5895 5917 5923 5938 5959 5966 
6005 6013 6027 6047-6048 6061 6064 6125 6135 6139 6189 
6215 6240 6336 6371 6374 6448-6449 6458 6508 6538 6551- 
6554 6556-6561 6563-6572 6595 6598 6607 6622-6623 6631- 
6636 6793 6803 6815 6873 6899 6955 6958 7032-7038 7040 
7043 7063 7066-7067 7079 71 16 7155 7233 7275 7455 7669 
7743 7792 7839 7845 7857 7892-7893 7910 7930 7989 8006- 
8008 8010-8033 8053 8067 8087 8092-8093 8110 8116 8120 
8122-8124 8131-8143 8196 8226 8235 8262 8345 8365 8368 
8370 8372 8375 8452 8520 8535 8543 8597 8646 87-U 8760 
8816 8839 8863 8898 8930 8948 8951 8963 8966 9010 9055 
9088-9091 9142 9157 9217-9220 9222-9227 9229-9242 9244 . 
9257 9264 9304 9315-9316 9325-9330 9368 9401 9455 9466 
9484 9525 9542 9544 9563 9646 9695 9703 9780-9781 9855 
9928 9942 9947 10000 10019 10027 10103-101 17 101 19-10120 
10173-10175 10183-10185 10346 10376 10440 10470 10496 
10611 10651-10657 10666 10679 10695 10698-10704 10760 
10772 10778 10782 10788 10891 11033 11051-11052 11061 
11063 1111211231 1125611356-1135711359-1136011362- 
11367 11369-11372 11376 11385 11406 11416 11421-11425 
11462 11513 11588 11605-11607 11620 11833 11869 11932 
11980 11985 12006 12017 12033 12143 12160 12193 12195 
12205 12207-12208 12292 12332-12336 12338-12345 12347 
12349-12356 12358-12360 12363 12368 12405 12420 12422 
12424 12428 12434-12439 12465 12523 12601 12605 12609- 
12611 12617 12636 12645 12670 12691 12707 12740 12749 
12754 12768 12775 12777 12817-12824 12830 12840 12845- 
12848 12863 12911 12940 12955 12977 13020 13026-13028 
13038 13047 13051 13062 13072-13073 13090 13104 13117 
13126 13195 13202 13254 13295-13296 13360-13371 13389 
13391 13399 13412 13479 13488-13489 13492 13494 13498 
13500 13503-13504 13506-13507 13511-13512 13516 13533 
13551 13554 13590 13613 13627 13644 13646 13656 13659- 
13660 13670 13713 13721 13743 13771-13774 13782 13803 
13809 13812-13814 13860 13866-13867 13869 13872 13882 
13884-13885 13888 13891 13893 


trachea 


Clontech 


TRCOOl 


83 438 483 858 1006 1034 1242-1245 1359 1483-1494 1616- 
1619 1621 167122662324 2388-23912409 2438-24392444 
2575 3001 3136 3272 3425 3460 3535 3548 3748-3756 3810 
3962-3967 4083-4086 4530 4755 4758 4949 4957 4987 5075 
5213 5295-5299 5481-5482 5484 5486-5488 5490-5491 5535 
5576-5580 5656 5941 6091 6450-6453 6574-6578 6637 6797 
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6955 7041-7042 7049 7068 7204 7347 7733 7864 7894 8034- 
8043 8045 8120 8139 8144 8563 8635 8646 9012 9092-9099 
9245-9251 9317 9331 9333 9484 9503 9517 9734 10020-10022 
10027 10097 10121-10123 10125-10127 10284 10344 10478 
10612 10658-10659 10705 10812 10948 11145 11233-11239 
11373-11376 11428-11429 11942 12022 12209-1221 1 12292 
12361-12363 12365-12366 12438 12440-12441 12611 12617 
12734-12735 12825-12826 12900 12904-12905 12916 12923 
12965 12972 12976 12989 13029-13030 13306-13308 13372- 
13373 13492 13670 13713 13722 13775-13781 13803 13884 


uterus 


Clontech 


UTROOl 


1036 1134 1246-1250 1324 1493 1495-1511 1622 1671 1928 
2145 2266 2310 2326 2343 2367 2392-2397 2555 3281 3479 
3720 3757-3760 3762 3969-39864088-4090 4192 4201 4246 
4277 4390-4391 4434 4515 4998 5002 5184 5300-5301 5481 
5492-5504 5656 5695 5712 5794 5936 61 16 6475 6579-6587 
6595 6870 7002 7043-7044 73 1 1 7484 7895-7899 8046-8055 
8057-8059 8061 8110 8146-8147 8152 8226 8262 8622 8677 
9008 9092 9137 9252-9257 9334 9370 9414 9466 9539 9703 
10023-10024 10128-10134 10346 10595 10660-10668 10706- 
10708 11145 11239 11289 11377 11379-11385 11695 11830 
12367-12369 12371-12374 12442-12443 12570 12670 12693 
12827-12829 12914 13002 13047 13073 13083-13084 13131 
13260 13277 13309-13310 13374-13376 13394 13489 13512 
13713 13782 13784-13787 13866 13869 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO; 








Waterman 
"flT^ 


Iilciitit 

y 


— 




5 

M3 6501 


-. 

Homo sapiens 


— — 

alpha-2-niacroglobuiin 





-fg 




AFl 18090 


Homo sapiens 


PRO2044 






— 

A 


XO 1 683 


Homo sapiens 


alpha 1 -antitrypsin 


"Ud 


— 


4 


- 


Homo sapiens 


reverse transcriptase 


— 

79 


— 






Homo sapiens 


unknown protein 




40 




AF015539 


Mytilus edulis 


precollagen P 


}^^. . .. 


33 




X03325 


Homo sapiens 


apolipoprotein B fragment 


540 


83 




AB019280 


Mus musculus 


sprouty-4 


91 






D88152 


Homo sapiens 


acetyl-coenzyme A transporter 


625 


87 


10 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


58 








4409. 








AL049569 


Homo sapiens 


dJ37C10.5 (KIAA0445) 


182 


82 


12 


AJ242540 


Volvox carteri f. 


hydroxyproline-ricb glycoprotein DZ- 


85 


37 






nagariensis 


HRGP 






13 


L27428 


Homo sapiens 


reverse transcriptase 


135 


61 


14 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


266 


72 








similar to pogo element 






15 


U93569 


Homo sapiens 


putative pi 50 


135 


37 


16 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


67 








6913. 






17 


X53581 


Rattus 


0RF4 


124 


34 






norvegicus 








18 


AF183961 


Homo sapiens 


carbon catabolite repression 4 protein 


431 


75 








homolog 






19 


AJ002190 


Homo sapiens 


dihydroxyacetone phosphate 


551 


88 








acyltransferase 






20 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


127 


45 


21 


AK001269 


Homo sapiens 


unnamed protein product 


1643 


99 


22 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 


275 


58 








similar to pogo element 






23 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


75 








7874. _ 






24 


AF156550 


Mus musculus 


putative E1-E2 ATPase 


168 


58 


25 


AFI 19856 


Homo sapiens 


PR01851 


585 


83 


26 


U49974 


Homo sapiens 


mariner transposase 


187 


46 


27 


G00901 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


86 


30 








4982. 






28 


AF295773 


Homo sapiens 


ral guanine nucleotide dissociation 


126 


74 








stimulator 






29 


AFl 13685 


Homo sapiens 


PRO0974 


92 


73 


30 


U83j03 


Homo sapiens 


line- 1 reverse transcriptase 


102 


50 


31 


Y91577 


Homo sapiens 


Human secreted protein sequence 


279 


75 








encoded by gene 2 SEQ ID NO:250. 








AF003535 


Homo sapiens 


0KF2-like protein 




.7' 


. -^^ ,. 


M15386 


Homo sapiens 


gamma-globin 


j70 


84 


"^5 


M19419 


Mus musculus 


proline-rich salivary protein 


3ll 






AF211943 


Homo sapiens 






~ 






WWOX 






36 


X13885 


Nicotiana 
tabacum 


extensin (AA 1-620) 


103 


35 


37 


U93563 


Homo sapiens 


putative pl50 


127 


58 


38 


U93564 


Homo sapiens 


putative pi 50 


103 


77 


39 


AF069732 


Homo sapiens 


ADA2-like protein 


524 


88 


40 


X61046 


Hydra sp. 


mini-collagen 


101 


34 


41 


AK000322 


Homo sapiens 


unnamed protein product 


566 


80 


42 


G03646 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


57 



130 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQID 
NO: 


Accession No, 


Species 


Description 


Watcrman 


Hentlt 








7727. 




~ 


43 


AF071081 


Mycobacteriuin 
tuberculosis 


proline-rich mucin homolog 


104 


41 


44 


AF092135 


Homo sapiens 


PTD014 


228 


41 


45 


Y73353 


Homo sapiens 


HTRM clone 1870914 protein 

sequence. 


295 


56 


46 


AF 118082 


Homo sapiens 


PRO1902 


119 


44 


47 


X78926 


Homo sapiens 


zinc finger protein 


442 


52 


48 


X54326 


Homo sapiens 


glutaminyl-tKNA synthetase 


542 


81 


49 


D50645 


Homo sapiens 


SDF2 


321 


95 


50 


M92439 


Homo sapiens 


leucine-rich protein 


344 


80 


51 


U28963 


Homo sapiens 


Gps2 


593 


82 


52 


U41806 


Homo sapiens 


p60 


660 


81 


53 


API 8 1490 


Homo sapiens 


preiiylcysteine lyase 


461 


78 


54 


U93570 


Homo sapiens 


putative pi 50 


147 


36 


55 


W73499 






529 


76 


56 


AFl 19851 


Homo sapiens 


PRO 1722 


126 


57 


57 


G03203 


Homo sapiens 


7284, ^ ' Q 


115 


61 


58 


AL021939 


Homo sapiens 


dJ352A20.2 (aldehyde dehydrogenase 
family protein) 


422 


90 


59 


L24158 


Homo sapiens 


integrin alpha 9 protein 


117 


71 


60 


Y32157 


Homo sapiens 


Human SH3D1A protein. 


530 


91 


61 


X61296 


norvegicus 


open reading frame 2 


117 


31 


62 


AK002064 


Homo sapiens 


unnamed protein product 


330 




63 


AB012223 


Canis familiaris 


0RF2 


80 


56 


64 










37 


65 


U 15647 


"MiiTmusculus 


fe"versc'traifscriptas e 


-|29 


55 


66 


G0341 1 


llomo sa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


126 




67 


Y48359 


Homo sapiens 


Human prostate cancer-associated 
protein 56. 


590 


99 


68 


W74879 


Homo sapiens 


Human secreted protein encoded by 
gene 151 clone HTLEF62. 


368 


98 


69 


AFl 04921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


604 


93 


70 


'AFl 75265 


Homo sapiens 


vacuolar sorting protein 35 


632 


88 


71 


U93571 


Homo sapiens 


p40 


106 


33 


72 


X15324 


Homo sapiens 


angiotensinogen 


330 


84 


73 


Z98204 


Hordeum 
vulgare 


extensin 


111 


38 


74 


Y30713 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


232 


61 


75 


AFl 18092 


Homo sapiens 


PRO2061 


453 


79 


76 


M63175 


Homo sapiens 


autocrine motility factor receptor 


190 


85 


77 


M26361 


Mus musculus 


LINE/Ig H-chain fusion protein 


153 


38 


78 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


171 


58 


79 


X78926 


Homo sapiens 


zinc finger protein 


199 


37 


80 


M77381 


Homo sapiens 




98 


54 


81 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


129 


44 


82 


U93569 


Homo sapiens 


putative pl50 


94 


38 


83 


AP000616 


Oryza sativa 


similar to RING-H2 fmger protein 
RHAla(AF078683) 


132 


100 


84 


AF255446 


Crypfliecodinium 
cohnii 


Dip 1 -associated protein C 


129 


34 


85 


R59837 


Homo sapiens 


Sequence of human microtubule- 


82 


48 



131 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


rdentit 




















associated protein tau 




— 


86 


Y02749 


Homo sapiens 


Human seCTeted protein encoded by 


157 


76 








gene 100 clone HNFIU96. 






87 


AFl 16712 


Homo sapiens 


PR02738 


91 


58 


88 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


77 


44 








6566. 






89 


Y82742 


Homo sapiens 


DNA replication and repair associated 


315 


79 








protein (DRASP). 






90 


Ml 6961 


Homo sapiens 


alpha-2-HS-glycoprotein 


138 


74 


91 


U49973 


Homo sapiens 


ORF 1; MER37; putative transposase 


250 


56 














92 


AF220656 


Homo sapiens 


apoptosis-associated nuclear protein 


62 


69 








PHLDAl 






93 


U65928 


Homo sapiens 


Jun activation domain binding protein 


188 


75 


94 


U93568 


Homo sapiens 


putative pl50 


102 


48 


95 


S80119 


Rattus sp. 


reverse transcriptase homolog 


130 


53 


96 


U93563 




putative pi 50 


242 


50 


97 


Y86472 


Homo sapiens 




105 


46 








fragment, SEQ ID NO:387. 






98 


G03714 


Homo sapiens 


77^™ ^^'^^^'^ protein, SEQ ID NO. 


132 


~62 














99 


^^j^Q^^ 


E uuscaballus 


preproalbumin 


289 


"65 




AFl 18090 


Homosa iens 


PRO2044 


"ill 


~90 


101 




Homo sapiens 

omo sapiens 


ORFl; MER37; putative transposase 













similar to pogo element 






102 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


53 








7214. 






103 


Y86573 


Homo sapiens 


Human gene 9 1 -encoded protein 


225 


82 








fragment, SEQ ID NO:490. 






104 


AF003535 


Homo sapiens 


0RF2-like protein 


114 


47 


105 


AF130079 


Homo sapiens 


PR02852 


133 


56 


106 


AF 130089 


Homo sapiens 


PRO2550 


107 


71 


107 


M63473 


Homo sapiens 


alpIia-5 type IV collagen 


131 


45 


108 


AFl 16661 


Homo sapiens 


PRO 143 8 


112 


54 


109 


X92485 


vivax 


pval 


101 


41 


110 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


80 


111 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 


199 


69 








similar to pogo element 






112 


AF194537 


Homo sapiens 


NAG 13 


104 


44 


113 


L27428 


Homo sapiens 


reverse transcriptase 


160 


34 


114 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


137 


56 


115 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


191 


67 








similar to pogo element 






116 


AF130052 


Homo sapiens 


PRO0956 


163 


47 


117 


L27428 


Homo sapiens 


reverse transcriptase 


117 


36 


118 


U93569 


Ploino sapiens 


putative pi 50 


104 


66 


119 


W48351 


Homo sapiens 














BCRB2. 






120 


G00357 


Homo sapiens 




81 


~57 








4438. 






121 


G03438 


Homo sapiens 


Human secreted proteins SEQ ID NO; 


78 


51 








7519. 






122 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 










6613. 






123 


X61296 


Rattus 


open reading frame 2 


94 


36 






norvegicus 








124 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


60 








7882. 







132 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Waterman 


Idcntit 


125 


AFl 19900 


Homo sapiens 


PR02822 


168 


68 


126 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


209 


-|| 


127 


■L27428 


Homo sapiens 




102 




128 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


99 


63 


129 






Human secreted protein, SEQ ID NO: 
7795. 


113 




130 


G03133 




7214. ^ 






131 


U93572 










132 


U37263 


Homo sapiens 


KRAB zinc finger protein; IVlethod: 
author 


155 


57 




G 00403 




Human secreted protein SBQ ID NO* 

4484. 








B08918 


Homo sapiens 


encoded by gene 28 SEQ ID NO;75. 


58 


61 


135 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


66 


136 


GOBI 72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


38 


137 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


230 


55 


138 


AK000496 


Homo sapiens 


un^'ame^ pr^ein product 


TzT 


~46 


139 


X53581 


Rattus 


0RF4 


136 


38 


140 


U49973 


Homo sapiens 


OREl; MER37; putative transposase 


158 


48 


141 


U49973 


Homo sapiens 


ORFl i MER37; putative tran.sposase 
similar to pogo element 


l62 


"65 


142 


AF090930 


Homo sa"ien"^ 


PRO0478 







143 

"144^ 


G03790 


omo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 




~ 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 

fragment, SEQ ID NO:387. 


98 


65 


145 


AJ238588 


Sciurus vulgaris 


cytochrome c oxidase subimit III 


417 


72 


146 


Q03801 




Human secreted protein, SEQ ID NO: 
7882. 


139 




147 


Y36156 










148 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


121 


70 


149 


Y76184 


omo sapiens 


^rae 61 ^^"^^^'^ protein encoded by 






150 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


95 


57 


151 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


126 


66 


152 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


105 




153 


AFl 19900 


Homo sapiens 


PR02822 


116 


62 


154 


G03789 




Human secreted protein, SEQ ID NO: 
7870. 






155 


AB009993 


Mus musculus 


collagen al(V) 


105 


36 


156 


U49973 


Homo sapiens 


ORFl: 1VIER37; putative transposase 
similar to pogo element 


155 


69 


157 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


348 


71 


158 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


146 


54 


159 


AF247705 


Oryctolagus 
cuniculus 


alpha 1 type X collagen 


102 


42 



133 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQ m 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Id en tit 


160 


R95913 


Homo sapiens 


Neural thread protein. 


99 


56 


161 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


52 


162 


X7I442 


Rattus 


ORF 1; putative 


96 


47 


163 


U93570 


Homo sapiens 


putative pi 50 


118 


38 


164 


U23515 


Caenorhabditis 
e egans 


weak similarity to adenylyl cyclase- 

chabaudUdamimajo^ 

antigen protein (PIR:A32555). Final 

strand'^^'^'^''^ ^^"^ predicted on other 


93 


37 


165 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


66 


~J66 


AF 130079 


Homo sapiens 


PR02852 


1^3 


"90 


167 


L27428 


Homo sapiens 


reverse transcriptase 


200 


53 


168 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


92 




169 


R95913 


llomo sapiens 


T^feuraTBireadproteE" 


lT6 


"54 


170 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


324 


85 


TTi 


U83303 


Homo sa iens ' 




TTl 


^0 — 


172 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


TTT 


"69 


173 


AF130089 


"Homosa iens 


PRO2550 


126 


59 


174 


S80119 


"Rattus s 









175 


G0245 1 


T4^nw"sapk5Hs 


Human sTcrrteTpTotcix^^ ID NO" — 
6532. 




~ 




^UT5647 


Mus musculus 


reverse transcriptase 


104 


46 


177 


M24732 


Homo sapiens 


lamin-lilce protein 


112 


42 


178 


AF041330 








38 


179 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


58 


180 


AFI94537 




NAG 13 


92 








Homo sapiens 







~53 


182 


U93574 


"Homo sapiens 


"putative pi 50 




"46 


183 


Y14166 


Gallus gallus 


attachment region binding protein 


91 


40 




G03807 




Human secreted protein SEQ ID NO' 
7888. 










Homo sapiens 


ORPl; MER37; putative transposase 


18j 


73 


186 


U93572 


Homo sapiens 


pi^tive pl50° 


139 


64 


187 


M22332 


Homo sapiens 


unknown protein 


79 


41 


188 


Y87202 




IDNO:241. 








U70935 


maniculatus 








190 


S80119 




NAGn'^^^'^"^^^^^ homolog 


— ^ 




191 


AFl 94537 


Homo sa iens 






~l5 






Homo sapiens 


Human secreted protein fragment 

encoded from gene 28. 


"Tl2 


""64 


193 


X92485 


vivax°'*''™ 


pval 


96 


40 


194 


R13556 


Homo sapiens 


Protein encoded downstream of Wic M 
oncoprotein. 


112 


45 


195 


L27428 


Homo sapiens 


reverse transcriptase 


141 


37 


196 


U93570 


Homo sapiens 


putative pl50 


201 


41 


197 


X92485 


Plasmodium 
vivax 


pval 


120 


48 


198 


AF130089 


Homo sapiens 


PRO2550 


137 


60 



134 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 







Species 


Descnption 






NO: 








Waterman 


Identit 










Score 


y 


199 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subimit) 


377 


84 


200 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-N1P 


119 


52 


201 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 


151 


68 








fragment, SEQ ID NO:490. 






202 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 


247 


78 








similar to pogo element 






203 


W48351 


Homo sapiens 


Human breast cancer related protein 


126 


56 








BCRB2. 






204 


Y91577 


Homo sapiens 


Human secreted protein sequence 


362 


80 








encoded by gene 2 SEQ ID NO250. 






205 


R59842 


Homo sapiens 


ApoE4Ll protease. 


100 


82 


206 


AF161356 


Homo sapiens 


HSPC093 


78 


62 


207 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


48 








4581. 






208 


U83280 


Leishmania 


39 kDa antigen 


121 


53 






donovani 








209 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


80 








7795. 






210 


U49973 


Homo sapiens 


ORFl; 1M1ER37; putative transposase 


146 


72 








similar to pogo element 






211 


M69197 


Homo sapiens 


haptoglobin-related protein 


344 


"92 ' 


212 


AF0346I1 


Homo sapiens 


intrinsic factor-B12 receptor precursor; 


123 


37 








cubilin 






213 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


129 


65 








SEQ ID NO: 163. 






214 


V00662 


Homo sapiens 


cytochrome oxidase 1 


485 


87 


215 


G02514 


Homo sapiens 


Human .secreted proteijj, SEQ ID NO: 


224 


80 








6595. 






216 


U35312 


Mus musculus 


nuclear receptor co-repressor 


115 


47 


217 


L26953 


Homo sapiens 


chromosomal protein 


143 


77 


218 


U12690 


Homo sapiens 


cytochrome oxidase subunit 11 


224 


70 


219 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


75 








7534. 






220 


AB018114 


Arabidopsis 


RING finger protein-like 


111 


38 






tiialiaua 








221 


D38112 


Homo sapiens 


ATPasesubunit6 


475 


84 


222 


V00662 


Homo sapiens 


cytochrome B 


466 


77 


223 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


175 


85 








SEQIDNO:163. 






224 


AF010144 . 


Homo sapiens 


neuronal thread protein AD7C-NTP 


125 


38 


225 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


86 


65 








encoded by gene 121. 






226 


X77816 


Rattus 


PR-Vbetal 


130 


54 






norvegicus 








227 


U09500 


Homo sapiens 


cytochrome b 


274 


62 


228 


.'U7081104 


Mus musculus 


ORF2 


111 


36 






domesticus 








229 


AF090942 


Homo sapiens 


PRO0657 


88 


57 


230 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


76 


57 








8172. 






231 


G03472 


Homo sapieas 


Human secreted piotein, SEQ ID NO: 


101 


66 








7553. 






232 


B12310 


Homo sapiens 


Human secreted protein encoded by 


116 


54 








gene 10 clone HDPGP94. 






233 


AF0I0400 


Homo sapiens 


transaldolase-related protein 


253 


77 


234 


AF197913 


Helicoverpa 


basic DNA-binding protein BDBP 


137 


50 






annigera nuclear 












polyhedrosis 












virus 









135 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 




Description 


Smitb- 

WafermHn 

Score 


% 
y 


235 


AF090931 


Homo sapiens 


PRO0483 


123 


75 


236 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


185 


86 


237 


MI 9503 


Homo sapiens 


ORFl; putative 


99 


40 


238 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4449. 


93 


58 


239 


AF014883 


Homo sapiens 


NADH dehydrogenase subunit 2 


305 


65 


240 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7183. 


60 


44 


241 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ IDNO:387. 


79 


50 


242 


U15647 


Mus musculus 


reverse transcriptase 


117 


47 


243 


AF010144 


Homo sapiens 


neuronal tliread protein AD7c-NrP 


104 


56 


244 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7214. 


94 


45 


245 


AK023542 


Homo sapiens 


unnamed protein product 


82 


38 


246 


X55702 


I>rosophila 
melanogaster 


polycomb protein 


84 


31 


247 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7870. 


129 


65 


248 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


154 


70 


249 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO: 1051. 


142 


63 


250 


Y17832 


Human 
eadogenous 
retrovirus T< 


env protein 


297 


71 


251 


U93568 


Homo sapiens 


p40 


103 


46 


252 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


201 


47 


253 


AF090895 


Homo sapiens 


PROG 117 


139 


60 


254 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


185 


72 


255 


L27428 


Homo sapiens 


reverse transcriptase 


156 


40 


256 


U49973 


Homo sapiens 


ORP 1 ; MER37; putative transposase 
similar to pogo element 


157 


59 


257 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


118 


70 


258 


AF194537 


Homo sapiens 


NAG 13 


141 


38 


259 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


115 


71 


260 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4718. 


113 


53 


261 


M22332 


Homo sapiens 


unknown protein 


78 


45 


262 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


70 


78 


263 


X61296 


Rattus 
norvegicus 


open reading frame 2 


108 


36 


264 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


178 


45 


265 


G03303 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7384. 


81 


63 


266 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 
WO9901020. 


116 


84 


267 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7867. 


118 


52 


268 


AFl 19855 


Homo sapiens 


PROI847 


74 


70 


269 


AF 109907 


Homo sapiens 


SI64 


85 


61 


270 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


137 


63 


271 


X92485 


Plasmodium 
vivax 


pval 


107 


72 


272 


AF194537 


Homo sapiens 


NAGI3 


167 


51 i 



136 



wo 01/64835 PCT/llSOl/04927 





Accession No. 


Species 


Descnption 






NO: 








Waterman 


Wentit 

y 


273 


U93564 


Homo sapiens 


p40 


104 


40 


274 


L27428 


Homo sapiens 


reverse transcriptase 


142 


56 


275 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


60 








4435. 






276 


X61296 


Rattus 


open reading frame 2 


96 


48 






norvegicus 








277 


AF090931 


Homo sapiens 


PRO0483 


140 


65 


278 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


231 


66 








similar to pogo element 






279 


AFI30089 


Homo sapiens 


PRO2550 


164 


60 


280 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


98 


58 








protein 






281 


L22548 


Homo sapiens 


collagen type XVIII alpha 1 


92 


38 


282 


AP000616 


Or>'za sativa 


similar to RING-H2 finger protem 


130 


68 








RHAla(AF078683) 






283 


AFn6715 


Homo sapiens 


PR02829 


160 


75 


284 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


111 


53 


285 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


53 








7871. 






286 


W48351 


Homo sapiens 


Human breast cancer related protein 


120 


53 








BCRB2. 






287 


U49973 


Homo sapiens 


ORFl ; MBR37; putative transposase 


147 


66 








similar to pogo element 






288 


U93572 


Homo sapiens 


putative pi 50 


125 


32 


289 


AL050399 


Arabidopsis 


putative proline-rich protein 


142 


44 






thaliana 








290 


X92485 


Plasmodium 


pval 


147 


43 






vivax 








291 


AB047600 


Macaca 


hypothetical protein 


172 


66 






fascicularis 








292 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


55 








7870. 






293 


D38115 


Pongo pygmaeus 


NADH dehydrogenase subunit 5 


342 


71 


294 


AF090942 


Homo sapiens 


PRO0657 


99 


66 


295 


M61185 


Bos taurus 


glutamic acid-rich protein 


114 


44 


296 


M13100 


Rattus 


unknown protein 


107 


43 






norvegicus 








297 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


114 


50 


298 


X92485 


Plasmodium 


pval 


93 


78 






vivax 








299 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


75 








7253. 






300 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


115 


42 


301 


L24321 


Homo sapiens 


transformation-related protein 


117 


60 


302 


X83413 


Human 


U88 


219 


49 






herpesvinis 6 








303 


U93567 


Homo sapiens 


putative pl50 


130 


48 


304 


B08918 


Homo sapiens 


Human secreted protein sequence 


72 


61 








encoded by gene 28 SEQ ID ^0:75. 






305 


AJ242540 


Volvox carteri f. 


hydroxyproline-rich glycoprotein DZ- 


153 


68 






nagariensis 


HRGP 






306 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


532 


79 


307 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


53 








7795. 






308 


L27428 


Homo sapiens 


reverse transcriptase 


151 


72 


309 


M69297 


Homo sapiens 


ORF3 


145 


43 


310 


X92485 


Plasmodium 


pval 


81 


60 






vivax 








311 


L27428 


Homo sapiens 


reverse transcriptase 


103 


41 



137 



w o 01/64835 



PCT/llSOl/04927 





Accession No. 


Species 


Description 






NO: 








Waterman 


Uentit 












y 


312 


AF130079 


Homo sapiens 


PR02852 


135 


49 


313 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


58 








7867. 






314 


AF090928 


Homo sapiens 


PRO0470 


88 


48 


315 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


175 


64 








4433. 






316 


U93568 


Homo sapiens 


putative pi 50 


148 


46 


317 


AFl 19855 


Homo sapiens 


PRO] 847 


108 


84 


318 


P60839 


Homo sapiens 


Sequence of human serum albumin 


175 


50 








(HSA)onplasmidpXL53. 






319 


W46424 


Homo sapiens 


Hirnian macrophage stimulating protein 


257 


69 








(MSP). 






320 


AL049547 


Homo sapiens 


dJ34F7.2 (CREB-RP (013)) 


247 


64 


321 


G02532 


Homo sapiens 


Human secareted protein. SEQ ID NO'. 


110 


66 








6613. 






322 


AF016099 


Mus musculus 


endonuclease/'reverse transcriptase 


102 


48 


323 


AF090930 


Homo sapiens 


PRO0478 


141 


72 


324 


U49973 


Homo sapiens 


ORFl ; IVIER37; putative transposase 


126 


44 








similar to pogo element 






325 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


59 








4435. 






326 


R59842 


Homo sapiens 


ApoE4Ll protease. 


95 


60 


327 


W48351 


Homo sapiens 


Human breast cancer related protein 


84 


61 








BCRB2. 






328 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


66 








4435. 






329 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


65 








4435. 






330 


AFl 19855 


Homo sapiens 


PR01847 


121 


80 


331 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


364 


71 


332 


AK000496 


Homo sapiens 


unnamed protein product 


145 


41 


333 


D00570 


Mus musculus 


open reading frame (196 AA) 


153 


53 


334 


AF119855 


Homo sapiens 


PRO 1847 


116 


74 


335 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


50 








7871. 






336 


R13556 


Homo sapiens 


Protein encoded downstteam of hhc_M 


129 


56 








oncoprotein. 






337 


G02994 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


107 


59 








7075. 






338 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


115 


72 








clone HCACJ81. 






339 


AL359782 


Trypanosoma 


possible (hhv-6) ul 1 02, variant a dna. 


117 


50 






brucei 


complete virion genome. 






340 


AK022217 


Homo sapiens 


unnamed protein product 


127 


70 


341 


U43360 


Peromyscus 


reverse transcriptase 


115 


75 






maniculatus 








342 


AF11R086 


Homo sapiens 


PR01992 


141 


73 


343 


X92485 


Plasmodium 
vivax 


pval 


96 


59 


344 


AFl 06677 


Drosophila 


dissatisfaction 


90 


48 






melanogaster 








345 


U 12693 


Homo sapiens 


cytochrome oxidase subunit II 


239 


91 


346 


L27428 


Homo sapiens 


reverse transcriptase 


95 


56 


347 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


69 








7795. 






348 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


51 








6613. 






349 


X51616 


Volvox carteri 


SULFATED SURFACE 


110 


41 








GLYCOPROTEIN 185 







138 



w o 01/64835 



PCT/USOl/04927 







Species 


Description 






NO: 








Waterman 


Wentit 

y 


350 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


55 








7870. 






351 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


183 


60 








similar to pogo element 






352 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


151 


5i 






major 








353 


R95913 


Homo sapiens 


Neural thread protein. 


95 


56 


354 


AF061340 


Artibeus 


cytochrome c oxidase subunit in 


346 


70 






jamaicensis 








355 


AF090895 


Homo sapiens 


PRO0117 


126 


60 


356 


AFO 16099 


Mus jnusculus 


endonucJease/reverse transcriptase 




48 


357 


AFl 18086 


Homo sapiens 


PRO 1992 


159 


73 


358 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


85 


89 






encoded by gene 17. 






359 


AF010144 


Homo sapiens 


neuronal tliread protein AD7c-N'I P 


98 


50 


360 


AF010144 


Homo sapiens 


neuronal rhread protein AD7c-NTP 


110 


57 


361 


M13100 


Rattus 
norvegicus 


unknown protein 


122 


34 


362 


Y36203 


Homo sapiens 


Human secreted protein #75. 


108 


63 


363 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


75 








7133. 






364 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


212 


65 








7878. 






365 


AF130051 


Homo sapiens 


PRO0898 


136 


71 


366 


AF068294 


Homo sapiens 


HDCMB45P 


188 


65 


367 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


225 


70 


368 


S80U9 


Rattus sp. 


reverse transcriptase homolog 


188 


45 


369 


U70935 


Peromyscus 


reverse transcriptase 


75 


48 






maniculatus 








370 


AFl 18082 


Homo sapiens 


PRO 1902 


98 


79 


371 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


106 


40 


372 


AFO 14903 


Pan troglodytes 


NADH dehydrogenase subunit 2 


169 


41 


fn 


AFO 10 144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


93 


48 


374 


W48351 


Homo sapiens 


Human breast cancer related protein 


94 


66 








BCRB2. 






375 


G03107 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


80 








7188, 






376 


U93568 


Homo sapiens 


putative pi 50 


140 


56 


377 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


50 








7075. 






378 


AF090942 


Homo sapiens 


PRO0657 


154 


66 


379 


U93568 


Homo sapiens 


putative pi 50 


149 


36 


380 


U93570 


Homo sapiens 


p40 


184 


57 


381 


L27428 


Homo sapiens 


reverse transcriptase 


128 


60 


382 


AFl 94537 


Homo sapiens 


NAG13 


114 


35 


383 


AFl 167 12 


Homo sapiens 


PR02738 


109 


56 


384 


D38112 


Homo sapiens 


cytochrome c ojcidase subunit 3 


405 


81 


385 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


116 


63 








RHAla (AF078683) 






386 


M22334 


Homo sapiens 


unlcnovm protein 


124 


39 


387 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


52 








6953. 






388 


D38112 


Homo sapiens 


NADH dehydrogenase subtmit 5 


327 


94 


389 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


70 








7645. 






390 


AF010144 


Homo sapiens 


neuronal teead protein AD7c-NTP 


129 


62 


391 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


259 


66 








similar to pogo element 







139 



w o 01/64835 



PCT/llSOl/04927 





Accession No. 


Species 


Description 






NO: 








Weram 


Idetitit 










Score ' " 


y 


392 


G00577 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO: 


137 


63 








4658. 






393 


U43360 


Peromyscus 


reverse transcriptase 


129 


54 






maniculatus 








394 


AK023582 


Homo sapiens 


unnamed protein product 


148 


46 


395 


M22332 


Homo sapiens 


unknown protein 


128 


41 


396 


AF 118086 


Homo sapiens 


PRO 1992 


160 


71 


397 


002363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


52 








6444. 






398 


038112 


Homo sapiens 


cytochrome c oxidase subunit 3 


199 


66 


399 


U49973 


Homo sapiens 


ORFl; iyiER37; putative transposase 


234 


78 








similar to pogo element 






400 


AL3 59782 


Trypanosoma 


probable similar to ring-h2 finger 


107 


40 






brucei 


protein rhala. 






401 


AF000996 


Homo sapiens 


ubiquitous TPR motif, Y isoform 


116 


61 


402 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


145 


37 


403 


AFl 18082 


Homo sapiens 


PRO1902 


97 


55 


404 


AF202635 


Homo sapiens 


PP1200 


126 


55 


405 


V00662 


Homo sapiens 


cytochrome oxidase I 


352 


68 


406 


AF229067 


Homo sapiens 


PADI-H protein 


129 


71 


407 


AL390114 


Leishmania 


extremely cysteineAfaline rich protein 


197 








major 








408 


L26251 


Trj'panosoma 


CR5 


95 


46 






brucei 








409 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


310 


62 








similar to pogo element 






410 


X92485 


Plasmodium 
vivax 


pval 


96 


68 


411 


M64793 


Rattus 


salivary proline-rich protein 


128 


40 






norvegicus 








412 


Y19192 


Talpa europaea 


cytochrome oxidase suiiunit I 


431 


83 


413 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


299 


86 


414 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


220 


81 








SBQIDNO:163. 






415 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


296 


89 


416 


X58438 


Mus musculus 


proline rich protein 


112 


50 


417 


U70932 


Peromyscus 


reverse transcriptase 


89 


51 






leucopus 








418 


V00662 


Homo sapiens 


cytochrome oxidase III 


200 


84 


419 


AF017789 


Homo sapiens 


putative transcription fector CAI50 


120 


41 


420 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


183 


69 


421 


AL359782 


Trypanosoma 


possible (hhv-6) u] 102, variant a dna, 


166 


44 






brucei 


complete virion genome. 






422 


AFl 30051 


Homo sapiens 


PRO0898 


158 


59 


423 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


86 


52 


424 


D38U6 


Pan paniscus 


cytochrome c oxidase subunit 3 


342 


82 


425 


U93570 


Homo sapiens 


putative p]50 


133 


41 


426 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


165 


67 








SEQ ID NO: 163. 






427 


D13951 


Nicotiana 


extensin precursor 


140 


42 






tabacum 








428 


L27428 


Homo sapiens 


reverse transcriptase 


104 


34 


429 


R95913 


Homo sapiens 


Neural thread protein. 


118 


49 


430 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


206 


97 








SEQ ID NO: 163. 






431 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


88 


55 


432 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


98 


54 


433 


.T05042 


Oiyctolagus 


alpha- 1 (Vni) collagen precursor 


91 


48 



140 



w o 01/64835 
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Accession No. 


Species 


Description 


Smith- 














Identit 










Score 








cuniculus 








434 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


56 








7348. 






435 


U93572 


Homo sapiens 


putative pl50 


118 


38 


436 


U93569 


Homo sapiens 


putative pl50 


100 


30 


437 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


81 








7437. 






438 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


71 








6613. 






439 


U52077 


Homo sapiens 


manner transposase 


187 


52 


440 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


45 








7214. 






441 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


71 


442 


AE003727 


Drosophila 


CG16718 gene product 


301 


48 






melanogaster 








443 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


221 


74 








clone HPMBQ32. 






444 


U35730 


Mus musculus 


jerky 


159 


26 


445 


X5358I 


Rattus 


0RF3 


192 


46 






norvegicus 








446 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


52 








4497. 






447 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 


156 


38 








similar to pogo element 






448 


AF194537 


Homo sapiens 


NAG 13 


315 


70 


449 


W48351 


Homo sapiens 


Human breast cancer related protein 


93 


66 








BCRB2. 






450 


X92099 


Brugia pahangi 


collagen 


126 


44 


451 


AF090930 


Homo sapiens 


PRO0478 


88 


60 


452 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


93 


40 








SEQ ID NO: 163. 






453 


AF081114 


Mus musculus 


0RF2 


108 


32 






domesticus 








454 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


173 


65 








7871. 






455 


S801I9 


Rattus sp. 


reverse transcriptase homolog 


197 


54 


456 


G02535 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


68 








6616. 






457 


R95913 


Homo sapiens 


Neural thread protein. 


114 


48 


458 


U09116 


Homo sapiens 


ORFl, encodes a 40 kDa product 


160 


39 


459 


X92485 


Plasmodium 
vivax 


pval 


99 


52 


460 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


52 








8172. 






461 


S80119 


Rattus sp. 


reverse transcriptase homolog 


138 


48 


462 


Y91577 


Homo sapiens 


Human secreted protein sequence 


211 


67 








encoded by gene 2 SEQ ID NO:250. 






463 


X97707 


Pongo pygmaeus 


stopcodon created by 


229 


76 






abelii 


posttranscriptional polyadenylation 






464 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 


67 


465 


L27428 


Homo sapiens 


reverse transcriptase 


154 


40 


466 


AK000496 


Homo sapiens 


unnamed protein product 


140 


69 


467 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


61 








4718. 






468 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


69 








4670. 






469 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


286 


79 


470 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


448 


86 


471 


M10546 


Homo sapiens 


cytochrome oxidase I 


296 


79 



141 



wo 01/64835 PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 


Identit 

y 




Y86248 


Homo sapiens 


SEQ ID NO: 163. 


102 


48 


473 


AL080253 


Arabidopsis 

thaliana 




Tos 


"42 




X99452 


esculentum 


extensin-like protem Dif54 


108 


25 


475 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


68 


33 


476 


AB012223 


Canis lamillaris 


0RF2 


78 


"66 


477 


AF 130089 


Homo sapiens 


PRO2550 


113 


~Ti 


478 


G 03 652 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7733. 


390 


~97 


479 


AF210651 


Homo sapiens 


NAG 18 


146 


80 


480 


AB029309 


Homo sapiens 


Npw38-binding protein NpwBP 


103 


40 


481 


AF194537 


Homo sapiens 


NAG 13 


118 




482 


R96800 










483 


L27428 


Homo sapiens 


reverse transcriptase 


184 


47 


-^1^ 






nag'b 








- ^p)9453y 


"^omo sapiens 







"55 


486 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO* 1 63 


162 


82 


487 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO' 1 63 


204 


86 


488 


¥86248 


Homo sapiens 


Human secreted protein HCHPF68, 


140 


53 


489 


U93574 


Homo sapiens 


putative pi 50 


86 


54 






Homo sapiens 


Human breast cancer related protein 
BCRB2. 






491 


AJ27 1 872 


Nicotians 


extensin 


220 


47 


~492 


U11288 


tirosophila 

melanogaster 


diaphanous protein 





~ 


493 


G04091 




8172. 






494 






7795. 








AFl 19900 




PR02822 






496 


AB026542 


Homo sapiens 


WASP-family protein 


96 


38 




D86853 


rose'us'^^"*^^ 


extensin 


104 


^4 


498 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 
binding protein KIO (NID:g8148) 


109 


47 


499 


G00492 


Homo sa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


"109 


~fi7 


500 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4497. 


112 


62 


501 


AFl 19901 


Homo sapiens 


PR0283 1 


"Tie 


"82 


502 


AF238235 


Entamoeba 


diaphanous protein 


120 


41 


503 


M22332 


Homo sapiens 


unknown protein 


123 


49 




AFl 19851 




PRO 1722 






505 


X61296 


Rattus 

norvegicus 


open reading frame 2 


107 


~ 


506 


AFl 18082 


Homo sapiens 


PRO 1902 


145 


52 


507 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


103 


68 


508 


X55685 


Lycopersicon 
esculentum 


extensin (class 1) 


175 


39 


509 


X92485 


Plasmodium 


pval 1 117 


52 



142 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 






Smith- 
Waterman 
_Score 


% 

Identit 

J! 














^To" — 


AF090942 


Honwsa iens 


PRO0657 


"95 




511 


U93569 


Homo sapiens 


putative pi 50 


120 


54 














513 


AJ242540 


Volvox carteri f. 
naganensis 


hydroxyproline-rich glycoprotein DZ- 

^^^^ : 


196 


63 


^Ti 


L27428 




reverse transcriptase 


~n9 





"sTs 


^^^ggg 


Homo sapiens 




"Wi 




516 


U93574 


Homo sapiens 


putative pi 50 


178 


35 




G02753 




Human secreted protein, SEQ ID NO' 
6834. 






518 


AF05j5jS 


Al\inella 


fibrillar collagen chEin FAp 1 alpha 


1 12 


36 


519 


X52235 


Homo sapiens 


ORFII 


148 


35 




AF130051 


Homo sapiens 


PRO0898 


98 




"551 


L02106 


Drosophila 
melanogaster 


ribonucleoprotein 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 









U93570 


Homo sapiens 


putative p 1 50 


159 






Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

A Tri xrri- 1 

ctr,\^ UJ riKj. loj. 


210 


100 


"525 


■1^2^4.2^ 


Homo sapiens 




~T28 




526 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


96 


65 


527 


X53581 


Rattus 

norvegicus 


0RF4 


130 


42 


"528 


U9357b 


Homo sapiens 


: . 

putative p 150 






529 


AF130089 


Homo sapiens 


PRO2550 


132 


43 




AK024455 


Homo sapiens 


FLJ00047 protein 






531 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


61 


532 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


210 


44 


533 


AF010I44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


109 


42 


5j4 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


62 


~535 


M64793 


"Rattus — ' 

norvegicus 


—r— r =^ = 

salivary proline-ricn protein 






536 


R1j556 


Homo sapiens 


Protem encoded downstream of li]ic_M 


122 


50 


^37 


(300637 


Homo sapiens 


Humans^eted protein SEQ ID NO- — 

4718. 


118 


^ - 


538 












539 


AF010I44 


Homo sapiens 


neuronal tiiread protein AD7c-NTP 


116 


39 




G04091 




8172. 








SSOl 19 










542 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


155 


77 


543 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


125 


91 


544 


W48351 


Homo sapiens 


Human breast cancer related protein 

BCRB2. 


"146 


— 


545 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


74 


45 


546 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
cloneHCACJSI. 


94 


75 


547 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


510 


86 


548 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


146 


68 



143 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



NO: 








Waterman 
Score 


y 








RHAla(AF0786833 






549 


Y08061 


Homo sapiens 


Htjman c-myb protein fiagment 


128 


82 


550 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


153 


77 


551 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7077. 


121 


56 


552 


X92485 


Plasmodium 
vivax 


pval 


103 


50 


553 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32, 


129 


46 


554 


L27428 


Homo sapiens 


reverse transcriptase 


149 


44 


555 


AF194537 


Homo sapiens 


NAG13 


157 


45 


556 


Y13247 


Homo sapiens 


FB19 protein 


106 


42 


557 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


100 


54 


558 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


176 


68 


559 


S80119 


Rattus sp. 


reverse transcriptase liomolog 


113 


43 


560 


AY008270 


Homo sapiens 


cholesteryl ester transfer protein 


107 


95 


561 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


140 


63 


562 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


347 


68 


563 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein KIO (NID:g8148) 


93 


40 


564 


D38114 


Gorilla gorilla 


cytochrome c oxidase subunit 3 (COIII) 


329 


74 


565 


Y36156 


Homo sapiens 


Human secreted protein #28. 


153 


56 


566 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


131 


48 


567 


D38I12 


Homo sapiens 


cytochrome c oxidase subimit 3 


406 


94 


568 


AF130079 


Homo sapiens 


PR02852 


101 


55 


569 


AF081114 


Mus musculus 
domesticus 


0RF2 


123 


40 


570 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


65 


45 


571 


D86853 


Catharanthus 


extensin 


168 


39 


572 


AF104415 


Mus musculus 


gene trap locus- 13 


179 


66 


573 


AF130089 


Homo sapiens 


PRO2550 


114 


56 


574 


X67863 


Mus musculus 


T2 


115 


42 


575 


S80119 


Rattus sp. 


reverse transcriptase homolog 


101 


28 


576 


S80119 


Rattus sp. 


reverse transcriptase homolog 


150 


57 


577 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo elemeait 


142 


74 


578 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


106 


57 


579 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


46 


580 


L24521 


Homo sapiens 


transformation-related protein 


110 


38 


581 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


537 


84 


582 


AF090895 


Homo sapiens 


PRO0117 


127 


80 


583 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7284. 


145 


70 


584 


X55681 


Lycopersicon 
esculentum 


extensin (class I) 


112 


38 


585 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


473 


60 


586 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


144 


68 


587 


U47855 


Araneus 
diadematus 


fibroin-3 


124 


30 


588 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


150 


75 


589 


U93567 


Homo sapiens 


putative pl50 


225 


47 



144 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEO ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 


590 


X71602 


Mcotiana 
tabacum 


extensin 


147 


33 


591 


X57527 


Homo sapiens 


alpha UVIIl) collagen 


103 


42 


592 


G03112 


Homo sapiens 


Human secreted proteLn, SEQ ID NO; 
7193. 


75 


47 


593 


R28916 


Homo sapiens 


Type III procollagen (prior art). 


116 


48 


594 


R95913 


Homo sapiens 


Neural thread protein. 


116 


37 


595 


U11880 


Petromyzon 
marinus 


cytochrome oxidase subunit I 


127 


52 


596 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


84 


62 


597 


L27428 


Homo sapiens 


reverse transcriptase 


158 


40 


598 


M55251 


Canis familxaris 


glycoprotein 80 


559 


86 


599 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


600 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


143 


33 


601 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


154 


54 


602 


X73481 


Drosophila hydei 


mstlOIC2) 


107 


42 


603 


MS 1321 


fascicularis 




114 


39 


604 


X05561 


Homo sapiens 


917) (2953 is 2nd base in codon) 




42 


605 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


82 


62 


606 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7284. 


83 


64 


607 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


Tot 


44 


608 


L27428 


Homo sapiens 


reverse ti'anscriptase 


147 


43 


609 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


113 


61 


610 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


151 


82 


611 


U93568 


Homo sapiens 


putative pl50 


144 


32 


612 


AB022223 


Arabidopsis 
thaliana 


extensin protein-like 


186 


58 


613 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


108 


49 


614 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


133 


36 


615 


X92485 


Plasmodium 




120 


42 


616 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


175 


89 


617 


U83280 


Leishmania 
donovani 


39 kDa antigen 


111 


51 


618 


AL 160493 


Leislimania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


137 


67 


619 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


120 


53 


620 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


92 


45 


621 


AF130089 


Homo sapiens 


PRO2550 


123 


34 


622 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


133 


59 


623 


AF010144 


Homo sapiens 


neuronal thread protem AD7c-NTP 


127 


45 


624 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


205 


66 



145 



w o 01/64835 



PCT/llSOl/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


625 


X07882 


Homo sapiens 


Po protein 


119 


39 


626 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


70 


100 


627 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


141 


-jj 


628 


G00637 


Homo sapiens 


4718. 




54 


629 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


115 


46 


630 


X63368 


Homo sapiens 


HSJlb 


151 


52 


631 


AF 130089 


Homo sapiens 


PRO2550 


155 




632 


X92485 


vi\^°'^'""^ 


pval 


102 


-gj 


633 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


102 


39 


634 


U49973 




ORF 1 J MER37; putative transposase 
similar to pogo element 






635 


X92485 




pval 


95 


73 


636 


S80119 


Rattus sp. 


reverse transcriptase homolog 


114 


58 


637 












638 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


130 


76 


639 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


83 


36 


640 


AF041330 


Bode saltans 


NADH dehydrogenase subunit 5 


96 


34 






no^egicus 








642 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


35 


643 


'U23172 


Caenorhabditis — 
elegans 


sknflar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 

(PIR.JH0162) 


"lis 


~35 


644 


^FOSTl 11 


Mus musculus 

domesticus 


ORF2 


^68 


~33 


645 


AK027208 


Homo sapiens 


unnamed protein product 


90 


51 


646 


AFC 16099 


Mus musculus 


endonuc lease/reverse transcriptase 


101 


59 


647 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


149 


76 


648 


AF273441 




NADH dehydrc^enase subunit 3 


121 


4^ — 


649 


L27428 


Homo sapiens 


reverse transcriptBse 






650 


AF119851 


Homo sapiens 


PRO 1722 


176 


53 


651 


Y91452 


Homo sapiens 


encoded by gene 2 SEQ ID NO:125. 






652 


AF130089 


Homo sapiens 


PRO2550 


130 


~36 


653 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


126 


41 


654 


AK0003B5 


Homo sapiens 


unnamed protein product 


195 


63 


655 


AB041881 


Rattus 
norvegicus 


cytoplasmic dynein heavy chain 


158 


100 


656 


X61047 


Hydra sp. 








657 


M22332 


Homo sapiens 


unknown protein 


ICQ 


50 


658 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


62 


659 


AFl 94537 


Homo sapiens 


NAG 13 


95 


48 


660 


Y67470 


Homo sapiens 


Np70 protein carboxy tenninal region. 


120 


50 


661 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


86 


32 


662 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
8144. 


169 


53 


663 


W48351 


Homo sapiens 


Humjui breast cancer related protein I 120 


72 



146 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










-^2211 


y 


-— 







BCRB2 






-H— 1 




-— j 

Mus mnscu us 


reverse tianscriptase 


TZs 


-— 

50 


_^r£ 


5 — £ 


Homo sapiens 


Neural thread protein- 





-12. 


666 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO ; 


134 


80 














667 


67 


Homo sapiens 


Human secreted protem, SEQ ID NOi 


117 


44 

















668 


ABO 18705 


; 

Mus musculus 


ORF2 


115 


32 


669 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


280 




670 


X5j581 


Rattus 


0RF4 


71 


39 






norvegicus 








. . 
671 


U49973 


Homo sapiens 


- 

ORFl; MER37; putative transposase 


201 


66 


— — 







similar to pogo element 






672 




: 

Homo sapiens 


Neural tiiread protein. 










Homo sapiens 


ORFl; MER37; putative transposase 


164 


46 








similar to pogo element 






-^^1 


Ar 1 18082 


Homo sapiens 


PRO1902 


— 




6/5 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




43 








JZl^ — T — — 






-^^^ 


UUUj /U 


Mus musculus 


open readmg frame (251 AA) 


4^1 — 





— — 


AF194537 


Homo sapiens 


^"^^^^ _ 




-2 


678 


M13 100 


Rattus 


unknown protem 


146 




— — 




noivegicus 








— 




— ^2—1 


Mus muscu us 


; 

reverse transcriptase 


-123 


— 

-4- — 


680 


R95913 


Homo sapiens 


Neural thread protein. 


145 


55 


681 


R59842 


Homo sapiens 


ApoE4Ll protease. 


^97 ,., 


63 


682 


AF130089 


Homo sapiens 


PRO2550 


94 




683 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


129 


69 


684 


AF093748 


Homo sapiens 


KH type splicing regulatory protein; 


93 


50 








KSRP 






685 


U93569 


Homo sapiens 


putative pl50 


133 


58 


686 


G03095 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


64 


.. 






_ 






687 


Yj6243 


Homo sapiens 


Human secreted protein encoded by 


„.. 
69 


73 

















Ar 1 10/ iz 


_- 

Homo sapiens 







.- 
56 


689 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


53 














-— 


. 


— : 

Homo sapiens 


— — ^ TFa 

putative plSO 


—p^ 


— 

49 




TT40Q 


Homo sapiens 


ORFl; MER37; putative transposase 




70 














"fiOT 


AF090S95 


Homo sapiens 


PROO 117 


— 

~t4 — 





"695 


AF130089 


omo sapiens 


PRO2550 






— — 


S801 19 


Kattussp. 


reverse transcriptase homo log 


17n 




— 




-— 

-~ 




Mus musculus 


reverse transcriptase 


4^2 







G03789 


Homo sapiens 


?8-7n^" secreted protein, SEQ ID NO. 


















— 




— -. 

Homo sapiens 


' — n: — A = — A T-vT i.TTn 

neuronal thread protem AD7c-NTP 


— 

"Tr^ 


— 

-Zr 


-~ 




Homo sapiens 


neuronal thread protein AD7c-NTP 


182 


— 1 


-J— 




Homo sapiens 


cytochrome c oxidase subunit 1 







loo — 


G00403 






-~ 










4484. 






701 


AF003535 


Homo sapiens 


0RF2-like protein 


125 


49 


702 


AF025467 


Caenorliabditis 


contains similarity to drosophila DNA- 


89 


41 






elegans 


binding protein KIO (NID:g8148) 






703 


L27428 


Homo sapiens 


reverse transcriptase 


255 


50 


704 


AF130089 


Homo sapiens 


PRO2550 


87 


55 


705 


G03052 


Homo sapiais 


Human secreted protein, SEQ ID NO: 


113 


52 



147 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 
-5£5I! 


Identit 

y 








^7T33 






"TOfi 


Y79140 


— : 

omo sapiens 


-77— — V. 11 

Human haemopoietic stem cell 


— — 

211 


88 









regulatory protein SCM3. 






"707 







us muscu us 


reverse transcriptase 


— 


...'^'^ 


"iM 




omo sapiens 


Human, secreted protein encoded by 




66 








gene 44 clone HTDAD22. 






^709 


G03807 


~R ^ 

omo sapiens 


Human secreted protein, SEQ ID NO; 


-— 

07 




50 








riR^Fi '■ — w 






1T6 


U49973 


'■ 

omo sapiens 




-— 


— 

64 








similar to po^o e^emenT 






711 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


65 














712 


^ggggg 


Homo sapiens 


pubrtive pi 50 


To8~ 





713 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


37 














"714 


M2473'7 


H — o~a"^ 

omo sapiens 


r— pr —■ 

— ~i — — h — ="1 


"07 


— 

-£r 


^15 


D381 12 


omo sapiens 


cytociromecoxi asesu uni 


— — 

306 


79 


"716 


U97674 


esocricetus 
fl^''^ ^ : 


cytochrome c oxidase chain I 


462 


85 


"tTt 


AF004715 


omo sapiens 


I J u i 

jerlcy gene product homolog 




100 


42 


^tTr 


X92485 


asmo lum 


pval 


84 


48 


"719 


AF130089 


Homo sa iens 


PRO2550 


"132 


~ 


720 


AL451017 


Neurospora 


related to U 1 SMALL NUCLEAR 


125 


43 


— — 




^^^^^ -. 


lvil5UlNU\^JLii\-'rK\JlXillN \j 






-— 




Homo sapiens 


FLJ00047 protein 


108 


68 




G03 133 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


146 


46 








- 






Tn 




1 

Human 


U88 


269 


41 






iierpesvirus 6 








"774 

— — 


X92485 




Plasmodium 

V^^^ ,. 




pval 




117 


43 


725 




Plasmodium 




pval 


97 


41 






w^ax ^ 








— 





Homo sapiens 


,.- --- 

Human secreted protein, SEQ ID NO: 


102 


73 














"777 


OUUO:? / 


"ft ■■ '■ — 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


5 

118 


43 








'^1}^' i 






^77S 

— — 


j^22j^6 


— 1 — 

e a vu garis 


cnitinase 1 


.. 








W48351 


Homo sapiens 


Human breast cancer related protein 


107 


49 









BCR!B2 






^7T0 




omo sapiens 


Human secreted protein, SEQ ID NO. 


— 


-^^^ 

59 








-— '- : 5 






'Tvi 


W4y /I / 


Homo sapiens 


Protem polymer adhesive substrate 




148 


29 








PPASl-C 






^732 


W50193 




omo sapiens 


Amino ^idswiuence of salivary 


95 


45 








S—. — 






"tT^ 





— : 

Ostertagia 


cuticular collagen 




104 


37 






circumcincta 








^734 

— — 


AF130089 


omo sapiens 


PRO2550 








G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


100 








6836. 






736 


Y91577 


Homo sapiens 


Human secreted protein sequence 


494 


86 








encoded by gene 2 SEQ ID NO:250. 






737 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


188 


69 








7874. 






738 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


593 


91 


739 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 


102 


62 








homolog 







148 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Waterman 


% 

Identit 


740 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


111 


54 


741 


A23786 


Beta vulgaris 


chitinase 1 


106 


~36 


742 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 


72 


743 


U49973 


Homo sapiens 


OKF 1 ; MER37 : putative transposase 
similar to pogo element 


184 


61 


744 


AE003629 


melanogaster 


CG 17108 gene product 






745 


U93563 


Homo sapiens 


putative pi 50 


145 




746 


G01931 


Homo sapiens 


Human secreted proteia, SEQ ID NO: 
6012. 


68 


66 


747 


AF2 17973 


Homo sapiens 


unknown 


113 


79 


748 


AC006161 


thaliana 


putative CENP-B/ARS-binding 
protein-Uke protein 








G02996 




7077. 






750 


AP000616 


Oryza sativa 


RHAla (AF078683) 


115 


54 


751 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


99 


72 


752 


G03133 


Homosa iens 

m pi ns 


Human secreted protein, SEQ ID NO: 
7214. 


~85 


"48 


753 


X92485 


Plasmodium 


pval 


89 


73 


754 


U93563 


Homo sapiens 


putative pl50 


T86 


-gg 


755 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


116 


50 


756 


AF 194537 


Homo sapiens 


NAG 13 


~l3S 


To" — ~ 


757 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


112 


79 




002994 


Homo sapiens 


7075. 


119 


61 


759 


AF130079 


Homo sapiens 


PR02852 


138 


40 


760 


X92485 


Plasmodium 


pval 






761 


U93050 


Mus musculus 


poly(A) binding protein 11 


95 


^5 


762 


V01555 


Human 
herpesvirus 4 


BYRFl, encodes EBNA-2 (Dambaugh 
et al, 1984; Dillner et al, 1984) 


80 


n — 


763 


V00662 


Homo sapiens 


ATPase 6 


337 


83 


764 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


117 


41 


765 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


423 


81 


766 


R95913 


Homo sapiens 


Neural thread protein. 


114 


66 


767 


V00662 


Homo sapiens 


cytochrome oxidase 1 


223 


83 


768 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


268 


83 


769 


V00662 


Homo sapiens 


cytochrome oxidase 1 


357 


81 


770 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


296 


71 


771 


AF026211 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


95 


39 


772 


AK024455 




FLJ00047 protein 


108 


13 


773 


X92485 


Plasmo^faim"^ 

vivax 


pval 




~39 


774 


AF130051 


Homo sapiens 


PRO0898 


123 


38 


775 


ABO 12223 


Canis familiaris 


0RF2 


174 


51 


776 


AB028664 


Paralichthys 
olivaceus 


cytochrome oxidase subunit-3 


268 


57 


777 


V00662 


Homo sapiens 


cytochrome oxidase I 


436 


85 


778 


X52235 


Homo sapiens 


ORFII 


125 


47 


779 


U93569 


Homo sapiens 


putative pi 50 


235 


49 



149 



w o 01/64835 



PCT/USOl/04927 



SEQID 
INO: 


Accession No. 


Species 


Description 


Smitli- 
Watcrman 


Identit 

y 


780 


U49973 


Homo sapiens 


ORFl; MER37: putative transposase 


262 


76 


781 


U49973 


Homo sapiens 


ORFI; MER37; putative transposase 


239 


70 


782 


AF090942 


Homo sapiens 


PRO0657 


146 


69 




G00454 


Homo sapiens 


Human secreted protein SEQ ID NO* 


85 


66 


784 


U49973 


Homo sapiens 


ORFl* MER37' putative transposase 
similar to pogo element 


^09 


~62 








ORFl' MER37' putative transposase 
similar to pogo element 






786 


U49973 




ORFl; MER37; putative transposase 
similar to pogo element 








V00662 




URF 4 (NADH dehydrogenase subunit) 


Til 




"^788 


AF 130089 


Homo sapiens 


PRO2550 




~P 


789 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5~H P — SFO rn isin- — 


131 


57 


^790 


G00354 


15 Hi 

omo sapiens 


^mnansecr e proem, Q 


"Tnn 


— j 


^791 


G02987 


Homo a i ns 

omo sapiens 


Human secreted protein SEQ ID NO* 

7068. 


-rrj2 


"ro 


792 


Y36203 


Homo sapiens 


Human secreted protein #75. 


104 


77 


793 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


117 


58 


794 


AF090930 


Homo sapiens 


PRO0478 


130 


68 


795 


AK024455 




FLJ00047 protein 






796 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


141 


70 




AF130089 




PRO2550 






^798 


W88627 


Homo sapiens 


clone HPMBQ32. 


~T53 


-gg 




AF130051 




PRO0898 






800 


G03560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


179 


70 


80J 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 


75 


51 


"802 


AF041330 


Bodo saltans 


NAIMtlehj^ogenase subunit 5 


~l03 


— 


803 


D38484 


Hylobates 


Cytochrome C oxidase subunit 1 


263 


70 


804 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


41 


805 


X924S5 


Plasmodium 


pval 


103 


38 


806 


AF194537 


Homo sapiens 


NAG13 


285 


51 


807 


AF121360 


mdano^'^er 


DNZDEDHC/NEWl zinc finger protein 


179 


47 


808 


X92485 


Plasmodium 

vivax 


"pvai 


~13l 


~46 


809 




tabacum 










Y01400 


omo sapiens 


clone HNHF029. 


174 


73 


811 


U93565 


Homo sapiens 


putative pi 50 


118 


38 


812 


AF 118082 


Homo sapiens 


PRO1902 


88 


54 


813 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


132 


42 


814 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


114 


50 


815 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


113 


43 


816 


K02576 


Homo sapiens 


saUvaiy proline-rich protein 1 


148 


40 



150 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 

— 


Description 


Smith- 
Waterman 




% 

Identit 

y 


"817 


AF2 17449 


Schistosoma 
mekongi 


u ^ 

NAIDH dehydrogenase subunit 6 








AB012223 


Can is familiaris 








819 


X71602' 


Nicotiana 
tabacum 


extensin 


162 


45 


820 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

: 


154 


36 


"III 


^^g^^^ 


— : 

Homo sapiens 


Neural thread protein. 


T53 




822 


G00357 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


125 


61 




U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


364 


57 


"824 


U43360 


_ 

Peromyscus 
maniculatus 


reverse transcriptase 








AF194537 










826 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


437 


72 


-|— 


ArUj 1 foJ. 


Homo sapiens 


diaphanous 1 


108 


38 




AF194537 


Homo sapiens 


NAGI3 


92 


45 


829 




Homo sapiens 


cytochrome c oxidase subunit 3 






830 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


110 


46 


831 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


108 


31 


832 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


117 


34 


833 


U93564 


Homo sapiens 


putative pi 50 


84 


40 






Homo sapiens 


putative pi 50 


262 


54 




U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 






836 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 








AFl 94537 


Homo sapiens 








839 


U52077 


Homo sapiens 


mariner transposase 


344 


67 




AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein KIO (NID:g8148) 




46 


841 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 


130 


92 


842 


M6479I 


Rattus 


salivary proline-rich protein 


114 


36 


843 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


196 


43 


844 


X92485 


Plasmodium 


pval 


102 


73 


~i45 


"xeioJs 


Hydra sp 


— — — n 

mini-coilagen 






846 


W48351 


Homo sapiens 


Human breast cancer related protein 


92 


48 


847 


W48351 


Homo sapiens 


Human breast cancer related protein 


112 


54 


848 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


68 


849 


AF194537 


Homo sapiens 


NAG13 


141 


46 


850 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


121 


45 


851 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


153 


58 


852 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


220 


60 



151 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


853 


X92485 


Plasmodium 


pval 


127 


45 


854 


AF 1343 05 


Homo sapiens 


"Scar3 


99 


"38 


855 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


343 


68 












56 


857 


AF130089 


Homo sapiens 


PRO2550 


112 


40 




AK024372 


Homo sapiens 










~859 


G03790 


omo sapiens 


HumM^ecretecTp^rm^^ ID NO" — 

7871. 




76 


860 


D38112 


Homo sapiens 


c).1:ochrome c oxidase subunit 3 


279 


80 




GOB 133 




7214. 








U49973 




similar to pogo element 






AF2 1 065 1 










864 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


109 


51 






Lycopersicon. 
esculentum 


extensin (class I) 








G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 










Plasmodium 

vivax 


pval 








G00397 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4478. 






869 


X92485 


Plasmodium 
w^ax __ — 


pval 


105 


38 


■"sTn 


G00397 


Homo sapiens 


— T : 

Human secreted protein, SEQ ID NO." 
'W^ Td P dTb 


90 


56 


T/i 


Y2isi\ 


Horn sa'en 

omo sapiens 


geneNo^s'^^ proeinenco e y 


___ 





872 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


36 


873 


AF 13 0089 


Homo sapiens 


PRO2550 


160 


82 




AFl 18082 




PRO 1 902 






875 


U93564 


Homo sapiens 


putative pi 50 


180 


44 




Ml 0546 




cytochrome oxidase I 






877 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


127 


52 


878 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


j5 


~879 


^^^^gg 


Drosophila 

melanogaster 


dLphanous protein 


~^ 


~46 








protein rha la. 






881 


B08942 


Homo sapiens 


encoded by gene 18 SEQ ID NO* 99 


95 


40 


882 


AF130089 


Homo sapiens 


PRO2550 


137 


~ 


883 


AF090942 


Homo sapiens 


PRO0657 


142 


73 




G03787 


Homo sapiens 


7^868^ ^s'^^*^^ protein, SEQ ID NO. 


97 


60 


885 


X61296 


Rattus 


open reading frame 2 


106 


43 


886 


Y 19767 


Homo^sapiens 


SEQ ID NO 485 from W09922243. 


Ti 


~57 


887 


X 14963 


Homo sapiens 


collagen-like protein (447 AA) 


130 


51 


888 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


117 


55 


889 


L25616 


Homo sapiens 


CGI protein 


150 


62 


890 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


171 


66 


891 


Y86472 


Homo sapiens 


Hirnian gene 52-encoded protein 


107 


40 



152 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
_Scoi;e 


% 

Identit 

y 








fragment SEQ ID NO'387 






~SQ9 


X523 18 


"r — t 

OS aurus 


histone H2A Z (AA 1-127) 





— ^ 





V00662 


omo sapiens 


URF 1 (NADH dehydrogenase subunit) 


~Tm 


46 


~«Qd 




omo sapiens 


cytoc rome c oxi ase su uni 




-— ^ 




"80S 


X92485 




pva 




— 

47 


896 


L76159 


Homo sapiens 


FRGl gene product 


100 


35 





D50926 


Homo sapiens 


The KIAA01.)6 gene product is novel. — 








X040 1 1 






"TTZ 


~ 


^899 


M90656 


Homo sapiens 


gamma'glutamylcy'steine synthetase 


"Toi 


■~ 


900 


R95913 


Homo sapiens 


Neural thread protein. 


95 


75 




L27428 


Homo sapiens 


reverse transcriptase 


81 


47 







Mus niusculus 


Pro-Pol- dUTPase polyprotein 






903 
— — 


G00412 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
^f^i , 


105 


76 


90 


AF010144 


— : 

Homo sapiens 


neuronal thread protein AD7c-NTP 


126 


37 




AF130089 


Homo sapiens 


PRO2550 


88 


82 


06 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 










Homo sapiens 


reverse transcriptase 






908 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


Ti 


909 


U93570 


Homo sapiens 


putative pi 50 


130 


41 




AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 






911 


L27428 


Homo sapiens 


reverse transcriptase 


168 


46 






Plasmodium 


pval 


91 


60 


913 

— — 


Q03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
- - .. 


103 


64 


914 


— — — 

G03789 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


64 


915 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NOrlOSl, 


121 


51 


916 


G0250I 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


77 


917 


Ml 2099 


Mus musculus 


proline-rich protein 


129 


44 




M i 5 53 0 


Homo sapiens 


B-cell growth fector 






919 


AP' 13 0079 


Homo sapiens 


PR02852 


158 


88 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 










norvegicus 








922 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


110 


50 


923 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


99 


36 


924 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


80 


47 


925 


AC006127 


Homo sapiens 


BRG-l-HUMAN ; nuclear protein 
GRBl; homeotic gene regulator; SNF2- 
BETA; MITOTIC GROWTH AND 
TRANSCRIPTION ACTIVATOR; 
POSSIBLE GLOBAL 
TRANSCRIPTION ACTIVATOR 
SNF2L4 


442 


92 


926 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


126 


39 


927 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


321 


76 



153 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Waterman 


% 


928 


AC006161 


Arabidopsis 


putative CENP-B/ARS-binding 


97 


35 


929 


X02873 


Daucus carota 


put. precursor 


112 


38 


930 


U93563 










931 


ABO 12223 


Canis familiaris 


ORF2 


202 


50 


932 


AF053538 


Alvinella 


fibrillar coUe^en chain FApl alpha 


114 


37 


933 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


57 


934 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


283 


54 


935 


U41017 


Caenorhabditis 


repetitive region; weakly similar to E. 
^tein (PIR:A43417) 


107 


33 




U47855 


diadematus 








937 


G03133 


Homo sapiens 


Human secreted protein, SBQ ID NOr 


95 


43 


"0^8 


W48351 


omo sapiens 


Human breast cancer related protein 


— — 


— 


939 


M13101 


Rattus 

Homo^sa^Iens 


unknown protein 


121 


40 


940 


G00689 


mo apiens 


Human secreted protein, SEQ ID NO: 
4770. 


TTt 


"46 


941 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


88 


62 














943 


U93564 


Homo sapiens 


putative pi 50 


279 


40 


944 


Y64S90 


Homo sapiens 


ID NO: 1051. 


110 


48 


945 


L27428 


Homo sapiens 


reverse transcriptase 


238 


66 




AF194537 










947 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


55 


"948 


AC006161 


■ : 

tliaUana''^'^ 


TP rFNP R/ARS h' H' 

protein^like protein ' ^ '"^ 


Tn 




949 


AF 194537 


Homo sapiens 


NAG13 


106 


66 








gene 27 clone HSAWA27. 






951 


G04067 




Humaii secreted protein, SEQ ID NO: 

8148. 












SEQ ID NO 485 from W09922243. 


"tIs 


"^5 


953 


U44838 


Glycinrm^^ 








954 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


131 


39 


955 


G03801 


Homosa iens 

omo sapiens 


Human secreted protein SEQ ID NO" 
7882. 


T28 


~53 


956 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4435. 


103 


74 


957 


AF130089 


Homo sapiens 


PRO2550 


120 


~yj — 


958 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


71 


959 


AF090942 


Homo sapiens 


PRO0657 


83 


63 


960 


L27428 


Homo sapiens 


reverse transcriptase 


121 


30 


961 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


157 


68 


962 


AF090942 


Homo sapiens 


PRO0657 


138 


61 


963 


U83280 


Leishmania 
donovani 


39 kDa antigen 


101 


53 


964 


U93566 


Homo sapiens 


p40 


102 


31 



154 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQID 


Accession No. 


Species 


Descriptinn 


Smith- 




NO: 








Waterman 


Identit 


965 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


^3 








7867. 






966 


G00689 


Homo sapiens 

m 


Human secreted proteiiis SBQ ID NO: 


136 


~40 — ~ 








4770. 






967 


AF090930 


Homo sapiens 


PRO0478 


158 


~80 


968 


AB012223 


Canis familiaris 


ORF2 


94 


16 


969 


AF010144 


Ratos ^^^'^"^ 




127 


~54 


970 


X53581 




ORF4 


















971 


G03453 


Homo sa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 


To9 


~56 








7534. 






"972 


QQ2U4 


omo sapiens 


Human secreted protein, SEQ ID NO. 


^R7 


'~A7 














~973 





^^^^ 

omo sapiens 


"a — tur h N tvl Iv ' — r 

transferase "Tte' 


~Q5 


~8n 








~H ^^^^ pro g'M- — SEQ ID NO- — 






"974 


G03095 


omo sapiens 


^uman secrete proem, Q O. 


-— 


— 














975 


U93574 


Homo sa iens 


putative pl50 


T40 


~43 


976 


W48351 


~HoHo sa'^iens 

om sapiens 




~l9 


~65 








BCRB2. ^ 






~977 


Y08062 


~H ~ 

omo sapiens 




Uf, 


~55 








derived from DNA35638. 






"978 


QQg^^^ 


„ ^ — a~^e 

omo sapiens 




"148 


-7- 








7492. 






979 


AF130114 


Homo sapiens 


PR02459 


121 


61 


980 


K03202 


Homo sapiens 

__ E 


salivary proline-rich protein precursor — 






4ll 


AF 1 1 6909 


Homo sa^Ien"^ 




"TTs 


~ 






omo sapiens 


Human'" creted r tein SEO ID NO — 





-~ 








7795 P ' V 






983 


Yl^^j^ 


Plasmodium 


lutamate cysteine li ase 


To6 


~66 






fal ' 








984 


AF229067 


Hom^o^"iens 


PADI-H protein 


T52 


~66 


985 


AFl 19900 


Homo sapiens 


PR02822 


~I42 


"dR 


986 




Homo sapiens 

omo sapiens 


Human secreted protein, SEQ ID NO: 


TT9 


~ 








7795. 






~987 


j^^gggg 


Homo sa iens 

omo sapiens 




Tofi 


'M) 








oncoprotein. 






988 


G03II9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


"5I 








7200. 






990 


U35730 


Mus musculus 




~l33 


~29 


991 


AFl 13685 


Homo sapiens 


PRO0974 


136 


~63 


992 










^77 


993 


Z97211 


Schizosaccharom 


kinesin-like protein 


197 


^47 






yces pombe 








994 


AF010144 




neuronal thread protein AD7c-NTP 


"124 


-go 


995 


U93563 


Homo salens 


utative 150 ~~ 


157 


"30 — 


996 


AB015802 


Acetobacter 


sknnar tocellulose complementing 




~54 






xylinus 


protein of A. xylinum ATCC23869 






997 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO' 


"144 


""60 








6444. 






998 


j)28\l6 


Pan paniscus 




"352 


"75 


999 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 


524 


78 








I (NDF-1). 






1000 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


377 


70 


1001 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


50 








8153. 






1002 


AL390114 


Leislimania 


extremely cysteine/valine rich protein 


249 


61 






major 








1003 


M14702 


IVIurine leukemia 


pol polyprotein 


206 


48 



155 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


species 


Description 


Smith- 
Waterman 


Identit 

y 














1004 


Z21507 


Homo sapiens 


fluman elongation factor- 1 -delta 


511 


85 




L27428 











^^06 




Homo ^a^iens 


NADH dddydropenase subunit 5 


-~ 




^007 


AF090895 


Homo sapiens 


PRO0117 


~T^2 


~6^ 


1008 




G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

— — 


140 
-— 


65 




U44838 


— : 

Plasmodhlm 






-:- 


"ToTo — 


AF251290 


falciparum 


ghitamic acid ricli protein 


lu 


~ 








reverse transcriptase 






1012 


AF1300S9 


Homo sapiens 


PRO2550 


114 


77 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
8173. 






1014 


AF090895 


Homo sapiens 


PROOl 17 


97 


65 




AF130089 


Homo sapiens 


PRO2550 






1016 


AF079367 


Mesocricetus 
auratus 


cytochrome c oxidase subunit III 


276 


52 





- 

AF055985 


Onciiocerca 
volvulus 


pyrrolidone-nch antigen 










Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 








.AF090944 


Homo sapiens 


PRO0663 






1020 


Y21166 


Homo sapiens 


Human bcl2 proto-oncogene mutant 
protein fragment 14. 


84 

^40"' — 


36 




AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 




1022 


AL049608 


Arabidopsis 
tlialiana 


extensin-like protein 


105 


34 


1023 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene ICQ clone HNFIU96. 


152 


52 


IFm — 




— : 

Homo sapiens 


Human secreted protein, SEQ ID NOr 

7795. 




.... 




M12140 


Homo sapiens 


envelope protein 






1026 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


200 


58 


1027 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


266 


67 


1028 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


97 


31 


1029 


G02950 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7031. 


102 


56 






Homo sapiens 


Human secreted protein, SEQ ID NO; 

6563. 






1031 

— — 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

J : 


74 


56 






— , 

Homo sapiens 


Protem sequence and annotation 

at present via e-mail from 
LABEIT@EMBL-Heidelberg DE 


389 


100 


'"1033 


(302654 


omo sapiens 


ELman secreted protein, SEQ ID NO. 


"gfi 





"T034 


AF010144 


Homosa iens 


neuronal tiiread protein AD7c-NTP 


"^Me 


~60 


1035 


AF194537 


Homo sapiens 


NAG13 


-~ 


~52 


1036 


L27428 


Homo sapiens 


reverse transcriptase 


166 


51 


1037 


W48351 


Homo sapiens 


Human breast cancer related protein 

BCRB2. 


107 


44 


1038 


AFl 16638 


Homo sapiens 


PRO 1546 


56 


61 


1039 


U93570 


Homo sapiens 


putative pi 50 


138 


40 


1040 


AF130089 


Homo sapiens 


PRO2550 


150 


91 


1041 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


68 



156 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQ ID 
NO: 




Accession No. 




Species 


Description 


Smith- 
-^f^ 


% 




^ 


omo sapiens 


-— . ^ 

— uman secrete protem . 






-TqT^ 








-— — 




"704? 


Ij^ggj^ 


Homo sapiens 


ORfT MES37- putative transposase 

similar to pogo element 


"208 


4t — 














1046 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


64 


1047 


U93563 


Homo sapiens 


putative pi 50 


127 


30 


l^^q 


AF1301 14 


Homo sapiens 


PR02459 





^ 





U12919 


us muscu us 


adenylyl cyclase type VII 


—^5 


-5? 




AC008054 


eishmania 


L8453.1 


129 


30 


1051 


X99467 


Medicago 
truncatula 


ENOD20 


110 


38 


1052 
— — 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




150 


76 




AFl 16712 


Homo sapiens 


PR02738 


-— 




... 


To5^ — 


jVI96256 


Homo sapiens 


rapamycin binding protein 






1055 


U15647 


Mus musculus 


reverse transcriptase 


86 


37 




AL024498 


Homo sapiens 


dJ4]7M14.2 (novel serine/threonine- 
protein kinase (ortholog of mouse and 
rat MAK (male germ cell-associated 
kinase)) 








AF090942 


Homo sapiens 


PRO0657 






1058 


AFO 10144 


Homo sapiens 


neuronal ttiread protein AD7c-NTP 


156 


68 


1059 


AF081 1 14 


Mus musculus 
domesticus 


ORF2 

~. 


134 


47 


T060 — 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 








U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


126 




1062 


G03790 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

, 


84 


55 


TofiT 


Ijgg^^^ 


— j 

Mus musculus 


reverse transcriptase 


— 


-|| 


T064 — 








"Tfi 




1065 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


130 


66 


T066 — 


X92485 


Plasmodium 


pva 


Tl9 


~ 




U93567 










1068 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


540 


84 


T^TS — 


U93570 


Chdbolobur 












tuberculatus 


cytochrome^c oxidase subunit III 


"355 


"71 




U93567 










1072 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


66 


1073 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO* 163 


136 


59 


1074 


U93572 


Homo sapiens 


putative pi 50 


140 


53 


1075 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


66 


43 


1076 


AL049608 


Arabidopsis 
thaliana 


extensin-like protein 


105 


37 


1077 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


94 


66 


1078 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


81 


57 


1079 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


35 



157 



w o 01/64835 



PCT/llSOl/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 
_Score 


Identit 










"8172 








AF] 62149 


— 

ycop asm a 


Hzi r r '■ 

variable surface lipoprotein 


—— 


41 






ovis ^ 








"Tosl — 


G03172 


omo sapiens 


-fj -z^ • sFo in NO- — 


•— 


75 








7253^^"^^ P^*****"' ^ 






~T082 


U43360 





— — n^i 

reverse anscnp se 


-jj- 


-— 






m^aniculatus 








T083 


^^2564 


-jj ^ 

Horn" ^a^ien^ 


~p40 


— 


■— 

"^T 


T084 




AF229067 


omo sapiens 


PADI-H protein 


-— 






U49973 


omo sapiens 


ORFl, MERj7, putative traiisposase 


— — 


-— 

60 








simi ar to pogo e ement 






"lose — 


^ 


n ' 

Horn" sa^'ien^ 




Tfis 


— 

"^T 


~I087 


"Y16ViZ 


omo sapiens 




— 





T088 




"aFiWst — 


omo sapiens 




-— 

.." 






B08976 


Homo sapiens 


~ : 

Human secreted protein sequence 




"~ 




- — 




encoded by gene 28 SEQID NO; 133. 






^1 


AF 194537 


^ 

Homo sapiens 


^^"^^^ 


155 


40 


1091 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


35 










^ 








AF010144 


— -, 

Homo sapiens 


neuronal thread protein AD7c-NTP 


195 


40 


1093 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 


91 


46 










similar to pogo element 








- ■-■ ^ 




ORF 4 


106 


62 





— — — — 


norvegicus 










AL 1 60493 


Leishmania 


— TT 7 : 1 

probable (hhv-6) ul 102, variant a 


129 


51 






^^•^'•"^ : 


DNA, complete virion genome 






1096 




Homo sapiens 


Human gene 52-encoded protein 


72 . 


33 


— — 






fragment, SEQ ID NO:388. 






~m^5 


U40342 


— j 

Mus musculus 


ninein 


__ 

r 


„ . 


'-2?? 1 


M24732 


Homo sapiens 


lamin-like protein 


92 


37 


1099 


X92485 


Plasmodium 


pval 


111 


67 






vivax 








1100 




Homo sapiens 


Human secreted protein encoded by 


156 


86 








gene No. 107. 






01 


AJrUUUOlO 


— : 

Or}'za sativa 


similar to RING-H2 finger protein 


116 


90 














117)9 


G00437 


— ■■ 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


. ,.. 
71 









f^^^ 3 : — — 








G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




97 


~ 

j6 









— — _ 








^^^^^^ 


— : 

Homo sapiens 


putative pl50 




168 


56 


TTo3 — 
— 


U93570 


Homo sapiens 


putative pl50 


'T^e 




— 


L27428 


omo sapiens 


reverse transcriptase 





— 




X53581 


a s ^ 


ORF4 




~ 

4j 







norvegicus 








Hos — 




omo sapiens 


Human secreted protein sequence 


— — 


— 

11 








encoded by gene 2 SEQ ID NO*250 






ll09 — 


(302490 


Ti ale 

omo sapiens 


Human secreted protein, SEQ ID NO. 


-— 


— 














Hlo — 


^g^^g^ 


~- : 


putative pi 50 


-— 

156 


— 

38 


TTTi — 


AF118086 


Horn" ^^^1^°^ 


PROI992 






1112 




Homo sapiens 




~71 


"is — 








gene 22 clone HMSJW18. 






1113 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


124 


62 


1114 


L27428 


Homo sapiens 


reverse transcriptase 


200 


40 


1115 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


50 








7077. 






1116 


L27428 


Homo sapiens 


reverse transcriptase 


122 


70 


1117 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


65 



158 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 
y 














TTTs — 


W48351 


Homo sapiens 


Himan breast cancer related protein 

BCRB2. 


Tie 


— 




D38484 


syndactylus 


(COXI) 






"TT^T — 


AF010144 


Homo sapiens 


n euronal thread protein AD7c-KTP — 


n4i — 







G03786 


omo sapiens 


Human secreted protein, StiQ ID NO: 




58 


1122 


AFC 13990 


Homo sapiens 


ubiquitin C-terminal hydrolase 


163 


50 








Human secreted protein, SBQ ID NO: 
7867. 






1124 


G 02490 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO. 


154 


65 


Tvzs — ' 


AF010144 


Homo a ien 


neuronal thread protein AD7cl4TP 


Tn6 





1126 


AF130089 


Homo sapiens 


PRO2550 


~76 


'12 




G02832 


Homo sapiens 


6913. 


~ 


~59 






Homo sapiens 


encodeTby^gei^l4 ^^^''^^^^^ protein 


80 


59 


1129 


AFl 19855 


Homo sapiens 


PRO 1847 


T46 


"to — 


1130 


AF 19453 7 


Homo sapiens 


NAG13 


182 


66 


— 


L27428 


Homo sapiens 


reverse transcriptase ^ 





-P 






Homo sapiens 


putative pi 50 






1133 


G00407 




Human secreted protein, SEQ ID NO; 

4488. 


"BT 


68 


TT^^ — 


AJ004810 


Zea mays 


cytochrome P450 monooxy^eiiase 


T^R 






AFC 10144 








~ 


TT36 — 


GO 1502 


Ilomo sapiens 


HumaTsecret^ed'pro^ SEq'iD NO- — 

5583. 


~ 





"Till — 


AB018705 















nQiTi™s'apiel"s 


luoteni serine tliieonine kinase 




~- 


1139 


W48351 


Homo sapiens 


Human breast cancer related protein 


92 


60 


1140 


AF194537 


Homo sapiens 


NAG 13 


115 


33 














1142 


D86853 


Catharanthus 
roseus 


extensin 


142 


37 


TT43 — 

— 


D00570 


us muscu us 


-p TT- 

open rea ing e (, — ) — 




-—^ 


— 




G03203 


omo sapiens 


^uman secrete protein, SfiQ I NO: 


62 


~ 

38 


~TT45 — 


270684 


Caenorhabditis — 
elegans 


„ 


~T05 





1146 


G03133 


Homo sapiens 


Human secreted protein SEQ ID NO- — 
7214. 


~T9 


"54 


1147 


Y36156 


Homo sapiens 


Human secreted protein #28. 


151 


62 


1148 












1149 


AF129756 


Homo sapiens 


BAT2 


177 


52 


1150 


AF010144 










1151 


L27428 


Homo sapiens 


reverse transcriptase 


77 


31 




U34044 










1153 


AK024455 


Homo sapiens 


FLJ00047 protein 


"TF 


~ 


1154 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


197 


67 


1155 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


120 


64 


1156 


U25281 


Rattus 
norvegicus 


SH3 domain binding protein 


98 


34 


1157 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


54 



159 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 












y 


1\5S 


AF 194537 


H ale~s 

omo sapiens 


V 


■—^ 





Q 


U49973 


omo sapiens 


5 — &an 

. . ' 'P"*^ sposase 




69 









suni ar o pogo e emen 








^g'<^j2 


— : 

omo sapiens 




— 

-y^j 




-^^ 


TTei — 


G03203 


omo sapiens 


H ~Td f SEOIDNO- — 




68 


— 







7284™^^'''^^^ 










— -. 

Homo sapiens 


-TT — — : — 

Human secreted protein, SfiQ JD NO: 


-JTyi 


_ 

57 














~TT63 — 

— 


AFl 16712 


Homo a lens 

omo sapiens 


PR02738 


T?q 







V00662 




cytochrome oxidase I — 


— — 


— 




T165 


G00416 


H°mQ ^r'ens 

omo sapiens 




"TTr 











4497™ ^^'^'^ ^ proem, Q 

' — — 






g 




-j— = 

__usmu^u_iis 


IS one H e 


r 


— ^ 


1T67 




ic an 


ex ensin 










H mo T lens 








TT68 — 



AF130051 


omo sapiens 


PRO0898 


ITt 


— r 




G03714 


omo sapiens 


Human secreted protein, S£Q lO NO: 


— — 

10 


— — 

48 









: 






Ttto — 
— — — 




_ ; 

omo sapiens 


reverse scnptase — ^ ^ 


— — 





1171 




G00497 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


— 

108 


— 

62 


— — 







. 








AF0j0277 


; 

Tragelaphus 


cytochrome oxidase subunit III 


266 


54 


— 


— — — 


S£ekii 




— 




— 


" 


ycine max 


__ : 

nydroxyproiine-nch glycoprotein 


.11- 


— 

_38 


1174 


U49973 


Homo sapiens 


ORFI; MER37; putative transposase 


168 


45 


— . 







similar to pogo element 






1175 


U43627 


— : 

Arabidopsis 


extensin 


111 


42 






thaliana 








TT76 — 





Arabidopsis 


: 

extensin 


— 

98 


— 

29 






„ '^"^ — '■ 








■ — 
— — — 


1 , 




omo ^^P'^"^ 


— ST Tso 

pu ivep 


— 




22 


78 


01 


Caenorhabditis 


collagen 


108 


39 






^^^^^ — . 








Tr79 — 




omo sapiens 


Human secreted protein SEQ ID NO" — 


~rjn 


"68 








6613. 






"fiso — 


AFO 16099 


Ha i 

usmuscu us 


endonuclease/reversetranscriptas^ 


-ppj 




nTai 




AF0I0144 


omo sapiens 


neuron rea pro eui J^' 


"r^I ' 






AFO 101 44 


^omo sapiens — _ 




~ 1 




TT83 — 


Y86248 


omo sapiens 


Human secreted proteto HCHPF68 


~T?3 


"is 


— 






SEQ ID NO: 163. 








Pgg^j^ 


omo sapiens 


^'t^^e'^^Ab^ d"^^ ^"^"^'y 


— — 


"rS 


TT85 — 


U87607 




pu a ive 1 mg p e 


To6 


"51 














1186 


yg^g^g 


HoiT^srien 

omo sapiens 


Human gene 91 encoded protein 


Hi 


"75 








fragment, SEQ ED NO:490. 






TTsT — 


j-^gg-^^^ 


Homosa iens 

Homo sa^ien^ 


lijie*'f"e'°^ e temf^^ ^^'^'""^ ^ 


~?^4 


19 


"TT88 — 


U83303 


omo sapiens 


'W~ m+H \^ SFOTDNO- — 





"35 


1189 


G03789 


omo sapiens 


787n^ ^ ^™ 


Tso 


-To 














TT90 — 


G02490 


omo sapiens 


-jj '■ J : 

Human secreted protein, SEQ ID NO: 


-— 

146 


— 

68 














im — 


AFl 18086 


Homo sapiens 


PRO 1992 


~146 


"^1 


1192 


W12842 


Homo sapiens 


Truncated pro-alphal (III) chain. 


106 


35 


1193 


AF000298 


Caenorhabditis 


weak similarity to coUagens; glycine- 


67 


34 






elegans 


and proline-rich 






1194 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


74 








7637. 






1195 


¥30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 


132 


12 








human tumor suppressor gene 2^P-1. 







160 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 


1196 




Homo sapiens 


Human secreted protein, SEQ ID NOt 


75 


48 


1197 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


254 


73 


1198 


U93570 


Homo sapiens 


p40 


103 


30 






Leishmania 


extremely cysteineyvaline rich protein 






1200 


AK024455 


Homo sapiens 


FLJ00047 protein 


115 


56 


1201 


AF090942 


Homo sapiens 


PRO0657 


88 


64 


1202 


GO 0442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


70 


1203 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. _ — 


113 


46 


. - 


GO 19 84 


^ 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


41 


1205 


U35730 


Mus musculus 


jerky 


107 


27 


1206 


U 15647 


Mus musculus 


reverse transcriptase 




45 






Mus musculus 


reverse transcriptase 






1208 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


58 


1209 


AF010144 


Homo- sapiens 


neuronal thread protein AD7c-NTP 


91 


54 




AFl 19900 


Homo sapiens 


PR02822 




81 


1211 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


68 


1212 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


45 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 




61 






Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


102 


61 


1215 


G03560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7641. 


101 


46 


1216 


AJ242540 , 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 


270 


58 


1217 


Y86472 


Homo sapiens 


Human gene 5 2-encoded protein 
fragment, SEQ ID NO:387. 


105 


46 


1218 


AC002483 


Homo sapiens 


putative product from mKNA sequence 
CG003 from BRCA2 region; match to 
U50534(NID:gl685103) 


378 


97 




AF090895 


Homo sapiens 


PRODI 17 


130 


58 


1220 


AFl 13685 


Homo sapiens 


PRO0974 


117 


50 


1221 


X61295 


Rattus 


LI retroposon, a portion of its ORF2 


126 


50 


1222 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 ■ 


70 


T223 


W48351 


omo sapiens 


Hvunan breast cancer related protein 

BCRB2 


^99 


— 





^^2574 


— ■ 

omo sapiens 







-— 


1325 


AF130051 




PRO0898 


-pr^ 


— 


T226 


U93563 


Horn" sa^rr 




~M — 


~47 


I227 


U49973 


Homo sapiens 


ORf 1^MER37' putative transposase 

similar to pogo element 


^20 — 


— 














1229 


W21733 


Homo sapiens 


NIP- 1 encoded by clone 59. 


138 


63 


1230 


U 15647 


Mus musculus 


reverse transcriptase 


105 


42 


1231 


U93563 


Homo sapiens 


putative pi 50 


299 


54 


1232 


R95913 


Homo sapiens 


Neural thread protein. 


138 


51 


1233 


AF130079 


Homo sapiens 


PR02852 


203 


70 


1234 


X53581 


Rattus 
norvegicus 


ORF3 


106 


60 



161 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 


— — 











y 





AFl 18086 


_ : 

Homo sapiens 




PRO 1992 




81 


1236 


X92485 




Plasmodium 
vjvax _ 


pval 


125 



71 


1_237 


AF010144 


Homo sapiens 


: — — ; 

neuronal thread protein AD7c-NTP 




75 






Homo sapiens 


putative pi 50 






1239 


GO 1249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


69 


56 








5330. 






1240 


AF130089 


Homo sapiens 


PRO2550 


136 


41 


1241 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


53 








4478. 






1242 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


75 








7795, 






124j 


G02752 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
683j, 




87 


45 


1244 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


58 








7868. 






1245 


u9j570 


Homo sapiens 


putative pi 50 


161 


50 


1246 


Z70684 


Caenorhabditis 


F28D1.8 


121 


45 






elegans 








1247 


AF257305 


Homo sapiens 


ASH I 


576 


89 


1248 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


98 


68 








7344. 






1249 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


107 


43 






elegans 


binding protein KIO (NID:g8148) 






1250 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 


130 


91 








RHAla (AF078683) 













Homo sapiens 


FAP protein 




61 






G0409 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


5j 








. 










AF068294 


- 

Homo sapiens 


HDCMB45P 


-51 


6j 




AF090895 


Homo sapiens 


PRO0117 


111 


60 




G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


201 


75 


1256 


AFO 1 0 1 44 


Homo sapiens 


neiu-onal thread protein AD7c-NTP 


116 


51 


1257 


AB033032 


Homo sapiens 


KIAA1206 protein 


115 


80 


1258 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


122 


60 








4446. 






1259 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


137 


68 








7882. 






1260 


AB032906 


Hylobates 


dopamine receptor D4 


96 


35 






pileatus 








1261 


AF022985 


Caenorhabditis 


Similar to collagen; coded for by C. 


106 


38 






elegans 


elegans cDNA yk55f3.3; coded for by 












C. elegans cDNA yk66d5.3; coded for 












by C. elegans cDNA yk71e4.3; coded 












for by C. elegans cDNA yk55f3.5; 












coded for by C. elegans cDNA 












yk66d5.5; coded for by C. elegans 














cDNAyk71e4.5 






-^25± 


rf^^f; 


; 

Homo sapiens 


p40 182 


— 

J2. — 




L2009o 


IVIanduca sexta 


rihosomal protein s7 






T264 


AFl 1990] 




PR02831 


103 


"71 — 


1265 


D87446 


Homo sapiens 


Similar to a C.elegans protein encoded 


219 


97 








in cosmid C27F2 (U40419) 






1266 


D90279 


Homo sapiens 


collagen alplia 1(V) chain precursor 


120 


42 


1267 


L27428 


Homo sapiens 


reverse transcriptase 




41 


1268 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


68 








4493. 






1269 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


102 


34 



162 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 




AF130089 




PRO2550 






1271 


U12707 


Homo sapiens 


Wiskott-Aldrich syndrome protein 


121 


45 




AF1653 10 


Homo sapiens 


ATP cassette binding transporter 1 




— 






Homo sapiens 


Neural thread protein- 


■TTS 




1274 


X92485 


vivax 






~ 




G03 1 72 




Human secreted protein, SEQ ID NO: 

7253. 










Homo sapiens 


Human secreted protein, SEQ ID NOr 
7645. 









^^^^^^ 


Mus musculus 




_10£ 










Homo sapiens 


putative p] 50 


98 




Homo sapiens 


Human secreted proteinj SEQ ID NO: 
4488. 






1280 


G00j57 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


167 


71 


1281 


AJ271871 


Nicotiana 
sylvestris 


putative extensfn 


105 


36 


1282 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


119 


32 


1283 


go'ih"^"*^ — 


Homo sapiens 


neuronal thread protein AD7c~NTP 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 






1?86 




Nicotiana alata 


^ J : FT^ 




36 




GO'^87'^ 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

6953. 






1287 

1-?^ 

— — 


L149973 


Plomo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element — 


289 


61 


S801 19 




Kattus sp. 


reverse transcriptase homoJog 











U49973 


Homo sapiens 


ORFl J lV[ER37j putative transposase 


203 


76 

33 - 


1290 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 








7253. ^ ' ^ 








AF130089 




PRO2550 


"iIt 




T293 


AF003535 


Homo sapiens 


ORF2-like protein 




"11 


1294 


Y19610 


Homo sapiens 


SEQ ID NO 328 firom W09922243. 


100 


42 






Homo sapiens 


reverse transcriptase 






1296 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

— 


140 


71 


~1297 — 


^^^^^ 


— 

Oncorhynchus 
raylciss 


complement component C3 


. 






AC0043 8 1 










1299 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO . 


180 


64 


Hoo — 


Y 14479 


-Tj -■ 

omo sapiens 


Fragnent of human secreted protein 

encoded by gene 14. 


"135 


-— 

66 


~1301 


Y^g^jgi 


Mus musculus 


reverse transcriptase 


106 


"1^ 


1302 


D13951 


Nicotiana 


extensin precursor 


134 


42 


1303 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


73 


1304 


Yi'y]i2 


Mus musculus 


Pro-PoI-dUTPase polyprotein 


93 


~38 


1305 


Y13620 


Homo sapiens 


BCL9 . 102 


39 


1306 


U93567 


Homo sapiens 


putative pi 50 245 


63 


1307 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 \ 130 
clone HNHF029. 


70 


1308 


AL355774 


Streptomyces 
coelicolor A3(2) 


putative integral membrane protein 


136 


40 


1309 


W54966 


Homo sapiens 


Synthetic human type III collagen 


124 


41 



163 



wo 01/64835 PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 

y 








SYN-C3. 








M20670 


Plasmodium 
vivax 


circumsporozoite protein 








AF151366 


Arabidopsis 
thaliana 


arginine/'serine-rich protein 






lol2 


G03099 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

T}^^' -, : ji^T^^fK^TTS 


76 


43 




G03258 


—J ; 

Homo sapiens 


Human secreted protein, SEQ ID NO; 
7339. 


1 9 






W88627 


Homo sapiens 


clone HPMBQ32. 












Human secreted protein, S£Q ID NOr 

6566, 








R959 1 3 


Homo sapiens 


Neural thread protein. 


98 


58 




AFl 13685 


Homo sapiens 


PRO0974 




47 






Homo sapiens 


Human secreted protein encoded by 
gene 51 done HUKEX85. 








G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

4493. 








AJ242540 


Volvox carteri f. 


hydroxyproline-rich glycoprotein DZ- 
HRGP 






1j21 


^CAAAA 


Homo sapiens 


putative pi 50 


124 


.^^ 




AF090931 


Homo sapiens 


PRO0483 




85 




G04091 


Homo sapiens 


Human secreted protein, SEQ JUD NO* 


85 


44 


1324 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


131 


43 


~F25 


^^^^^^ 


Homo sapiens 


— , 

Human breast cancer related protein 
BCRB2. 










Homo sapiens 








"1357 — 






revCTse^ranscriptase 


111 


4' 


1328 


U93568 


Homo sapiens 


putative pi 50 


112 


30 






Homo sapiens 


Human secreted protein, SBQ ID NOi 

7870. 










Homo sapiens 


reverse transcriptase 






1331 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


88 


1332 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


55 






Nicotiana 
tabacum 


extensin 










Homo sapiens 


Human secreted protein, SBQ ID NO: 

7214. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 






1338 


U43360 


Peromyscus 


reverse transcriptase 


114 


60 


1339 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


55 


1340 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


123 


31 


1341 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


118 


70 


1342 


X71629 


Mus musculus 


msgl 


106 


57 


1343 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


105 


79 



164 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 




G03435 


Homo sapiens 


Human secreted protem, SEQ ID NO. 




85 


1345 


AF161356 


Homo sapiens 


HSPC093 


88 


88 




JO 1435 


norvegicus 


ATPase 


:>48 


66 


1347 


U93563 


Homo sapiens 


putative pl50 


142 


36 






omo sapiens 


Human secretBd protein, SEQ ID NO. 


87 


66 


"T349 


AF090942 


Homo sapiens 


PRO0657 


152 


"54 


1350 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


63 


T35T — 


^^2572 


Homosa iens 


— FT ffn 


1l3 


~84 


1152 


X92485 


Plasmodium 

vivax 


pva^l '^^ 


13^ 


"to — 


1353 


X61047 










1354 


AFl 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


128 


46 




G03133 




7214. ^ ' ^ 








AFl 94537 










1357 


AFl 3 0079 


Homo sapiens 


PR02852 


139 


73 






norvegicus 










Y 8 6248 


Homo sapiens 


SEQ ID NO: 163. 






1360 


U49973 


Homo sapiens 


ORFl ; MEEG7; putative transposase 


328 


69 


1361 


W80406 


Homo sapiens 


A.'secTeted^proteti^oded by clone 

dii40 3. 


126 


~65 








6532. ^ ' 










norvegicus 




















1365 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


114 


63 






norvegicus 


sequence 








Y02749 


Homo sapiens 


gene 100 clone HNFIU96. 






1368 


G04091 


Homo sapiens 


Human secreted protem, SEQ ID NO. 


80 


46 


1369 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


26 


1370 


X92485 


Plasmodium 

vivax 


"plir 


IO6 


16 


1371 


U90946 


^iscoiWeum'™ 


myosin heavy chain kinase B 






1372 


L27428 


Homo sapiens 


reverse transcriptase 


98 


61 


1373 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 






1374 


U49974 


Homosa iens 




I37 


~57 


1375 


AC004891 


Homo sa iens 

omo sapiens 


contTata'^Hke^ sMar to U87224 

(PID:gl 857710) 


~234 


~58 


1376 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


132 


60 


1377 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


317 


51 


1378 


AF118082 


Homo sapiens 


PRO1902 


102 


42 


1379 


AF115549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


186 


45 


1380 


U93567 


Homo sapiens 


putative pi 50 


116 


38 


1381 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


218 


47 



165 



wo 01/64835 PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Watcrman 


Identit 








similar to o o element 




— 


1382 


B08918 


Homo sapiens 


Human secrrted protein sequence 

encoded by gene 28 SEQ ID NO:75. 


87 


"66 




AF090895 




PRODI 17 


68 




1384 


U93570 


Homo sapiens 


putative pi 50 


178 


39 








Human secreted protein, SEQ ID NOi 

7214. 








AF 13 0089 




PRO2550 







HI? — 


^^■^296 


H^mo ^a^lens 

Rattus ^^^'^"^ 


reverse fransm^ase^ 




~ 


1388 




a s 


open re ing e 


123 


"di 


1389 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


67 


1390 


Y026TI 


Homo sapiens 


Hurnan secreted protein encoded by 

gene 22 clone HMSJW18. 


147 


37 














1392 


R95913 


Homo sapiens 


Neural thread protein. 


104 


35 


— 










1^ 




W80406 


Homo sapiens 


AlecretedTprotefe^Micoded by clone 

dh40 3. 


~wf 






AF2 16972 










1396 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
ORFrMER37-^'^'t't"^ 


160 


58 


T397 — 





Tl a^' 

omo sapiens 


sim'lar elein 'T ^P*^^^^^ 


112 


-— 


T398 


AFO 1 0 1 44 


Homo sapiens 


neuronaTthre^ prot^ AD7c NTP 


"97 





1399 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


125 


54 


1400 


X92485 


Plasmodium 


pval 


124 


39 


1401 


U93563 


Homo sapiens 


putative pi 50 


131 


36 





^^^^^^ 


Mus musculus 







-^1 








~T En — T te~l 






T404 


QQJJl^ 


H^m" ^a'^lens 

omo sapiens 


Hu ^ em' ted"" te"^" SEC ID NO' — 
^ uman sec e pro in, y i . 


"rZ 


— 


1405 


^QQgg^ 


Homo sapiens 


Human secreted proteiti, SEQ ID NO: 
4435. 


~T\ 


~ 


1406 


Ml 8094 


Phaseolus 
vulgaris 




138 






AFI34304 








"^5 


1408 


Y08061 


Homo sapiens 


Human c myb protein fragment 


"TIT 




1409 


U93574 


Homo sapiens 


putative pi 50 


179 


43 














1411 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


124 


46 


1412 — 


QQ24i\ 


Homosa iens 

omo apien 


Human secreted protein, SEQ ID NO: 

7492. 


"T49 


"56 


1413 


AFO 10 144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


48 


1414 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




82 


1415 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7348. 


92 


52 


1416 


AF119855 


Homo sapiens 


PR01847 


82 


70 


1417 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


34 


1418 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


99 


39 


1419 


AF130079 


Homo sapiens 


PR02852 


114 


69 


1420 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


133 


52 



166 



SEQID 


Accession No. 


Species 


Description 


Smitli- 


% 


NO: 








Waterman 


IdenKt 












y 


1421 


R13556 


Homo sapiens 




TIT^ 


44 








oncoprotein. 






1422 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


77 








7867." 






1423 


R95913 


Homo sapiens 


Neural thread protein. 


128 


80 


1424 


L26953 


Homo sapiens 


chromosomal protein 


104 


34 


1425 


U83280 


Leishmania 


39 kDa antigen 


105 


51 






donovani 






1426 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


98 


75 


1427 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


149 


40 


1428 


AF090895 


Homo sapiens 


PRODI 17 


111 


75 


1429 


AF119855 


Homo sapiens 


PRO 1847 


88 


56 


1430 


AF229067 




PADI-H protein 







1431 


D38112 


Homo sapiens 






^79 




U4997J 


H mo a ien 

omo sapiens 


UKr 1 ; Mc.K37j putative transposase 


-—— 


— 














T433 


AF0r0144 


Homosa iens 


neuronaUhr^pri^efa AD7c NTP 


"149 


~V7 


1434 


AF161356 


Homo sapiens 


HSPC093 


Tso 


~46 




U93570 


Homo sa iens 




TT6 


"Zr 


1436 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


Tti 


~7i 














T437 


U83280 


Leisiimania 


39 kDa antigen 


T06 


~80 






donovani 








1438 


G03793 


omo sapiens 






73 








7874. 






1439 


U 15647 


Mus muscuius 


reverse transcriptase 


233 


44 




L27428 


Homo sapiens 






4^ 


"T441 


Y01400 


omo sapiens 


SecretecTprotefa^encod by gene 1 8 


"TTs 










clone HNHF029. 






T442 


G03807 


-— : 

omo sapiens 


Human secreted protein, SEQ ID NO: 


- - — 


63 


"1443 


M2'7332 


Homosa iens 

Ratos ^^^'^"^ 


"untoT FT'' 

"roline-richsaXai — rotem 


"TS3 


-— 


1444 


M11901 




p 7p em 


~T02 


"40 














T445 


Y02999 


Hoino^apiens 


Fra ment of human ecreted P 


loi 


"66 








H H h I?! 






T446 


gq^g^g 


Homosa iens 

1 pie 


Human secreteT r te n SBQ ID NO' — 


Its 


~14. 








6596 P ' ' V 






1447 


G02480 


Homo sapiens 

0 0 pien 


Human secreted protein, SEQ ID NO' 


^0 


"66 








6561. 






T448 


X76208 


"Do hT 


protem 33-specific exons 


~m 


"Zx 






melanoeastei 








T449 


Y86248 


Homosa iens 

omo sapiens 


Human secreted protein HCHPF68 


Ttr 


^70 








SEQ ID NO: 163. 






1450 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


"45 








8172. 






1451 


S80119 


Rattus sp. 


reverse tisnscriptase homolog 


115 


54 


1452 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


70 


63 








7795. 






1453 


U42471 


Mus muscuius 




To5 


~40 








homolog 






1454 


AK024455 


Homo sapiens 


FLJ00047 protein 


109 


53 


1455 


AC007258 


Arabidopsis 


Hypothetical protein 


105 


37 






tlialiana 








1456 


AF194537 


Homo sapiens 


NAG 13 


208 


52 


1457 


U63542 


Homo sapiens 


FA? protein 


111 


84 


1458 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


111 


65 








gene 22 clone HMSJWI8. 






1459 


AF090931 


Homo sapiens 


PRO0483 


84 


43 



167 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% " 


NO. 










Identit 














1460 


R95913 


Homo sapiens 


Neural thread protein. 


106 


"69 


1461 


R95913 


Homo sapiens 


Neural thread protein. 


109 


40 


1462 


U93564 


Homo sapiens 


putative pl50 


237 


42 


1463 


AB029309 


Homo sapiens 


Npw3 8-binding protein NpwBP 


97 


37 


1464 


U44838 


Glycine max 










AL050341 


Homo sapiens 


dJ3 9^022 i (rearranged L myc fusion 


Ui 


^5 








sequence (ZN-15 related zinc finger 


















1465 


L27428 


Homo sapiens 


reverse transcriptase 


94 


"34 


1467 


AP010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 




"58 






Homo sapiens 


Ilunian secreted protein (clone fk317- 


~M~ 


~4S 


. . ,q 




H salen 

omo sapiens 


lin -^Td 1"^ — SEO TD NO- — 


Tyi 











^mnanse^ree proem, 






T470 


G03790 


Homosa iens 

omo sapiens 


Human se"reted protein SEQ ID NO- — 


l47 


— 








7871. 






~14T\ 


API 09907 


Homo sapiens 







Ts " 

li 


1472 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


li2 










clone HPMBQ32. 






1473 


Y86248 


Homosa iens 




132 


"on 








SEQ ID NO: 163. 






T474 


QQ2\^2 


Homosa iens 

omo sapiens 


HLiman secreted protein, SEQ ID NO. 


— — 


— 














1475 


U49973 


Homosa iens 

omo sapiens 


ORFl j MER37^ putative transposase 




^8 






similar to pogo element 






1476 


U49974 


Homo sapiens 


mariner transposasc^ 


~50l 


~59 


1477 


U49974 






l06 


"fin 


1478 


W48351 


Homo sapiens 


Hunmn^toeast'cancer related protein 


TTt 


^79 








BCRB2 






1479 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NOr 


l46 


~82 














1480 


U93567 


Homo sapiens 


pumtiveptSQ 


"^02 


■4? 


1481 


K02576 


Homo sapiens 


salivary proHne-rich protein 1 


loi 


"46 




U87607 




putative RNA binding protein 1 


To?) 


"tt 






norvegicus 








l48^^ 


G03790 


Homo sapiens 


Human secreted protein SEQ ID NO- — 


1?4 


"ts 








7871. 






1484 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


62 








4409. 






1485 


U15647 


Mus musculus 


reverse tran.scnptase 


115 


73 


1486 


AF194537 


Tv^ —^'-^ 


NAG 13 


III 




1487 


Ml 1902 








"55 


1488 


G0313J. 


omo sapiens 


Humem segeted^proteto ^SE ID NO- — 


~m7 


64 








7214. 






1489 


AF009668 


— wl 

^lerosis 


polyprotem 


— — r 


— 






associated 








1490 


AK023542 


Homo sapiens 


unnamed protein product 


1T4 




1491 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


50 








7867. 






1492 


U49973 


Homo sapiens 


ORFl; MEI137; putative Iransposase 


184 


47 








similar to pogo element 






1493 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


222 


53 


1494 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


50 


1495 


AF 109907 


Homo sapiens 


S164 


259 


45 


1496 


R95913 


Homo sapiens 


Neural thread protein. 


110 


51 


1497 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


299 


80 








similar to pogo element 







168 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


1498 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


65 


1499 


L26953 


Homo sapiens 


chromosomal protein 


104 


67 




AF090895 




PR nn 1 1 T 
JrtvVJU 11/ 






1501 


U93572 


Homo sapiens 


p40 


115 


42 


1502 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


13j 


50 


1503 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


56 


1504 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

=—e i J 


119 


69 


T505 


W00838 


H 

omosap,e„s 


Tumour necrosis factor-related gene 
product. 






YJ!^ — 


AF109907 


Homo sapiens 


^T^l = 


184 


43 




U49973 


Homo sapiens 


ORF 1; MER375 putative transposBse 
similar to pogo element 


288 


67 


1508 


R13536 


Homo sapiens 


Protein encoded downstream of hhc M 


131 


69 


T3o9 — 


AK000241 


Homo sapiens 


unnameiTprotein product 


"167 


— 


1510 


G02532 


Homo sapiens ^ 


Human secreted protein, SEQ ID NO: 


153 


60 


nil — 


S62928 


Homosa iens 


PRBIM protein precursor ' 


~T57 


"39 


1512 


ABO 12223 


Canis familiaris 


ORF2 


116 


40 




G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 






1514 


AF220264 


Homo sapiens 


MOST-l 


108 








norvegicus 










V00662 


Homo sapiens 


cytoclirome oxidase III 






1517 
... 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


263 


70 




U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 








M24732 


Homo sapiens 


lamin-llke protein 






1520 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
-S—- — ii 1 — J : 


131 


51 




W48351 


— : 

Homo sapiens 


Human breast cancer related protein 
BCRB2 


81 


68 


~T522 — 


AF194537 


„ ^ . 




"TTS 


— 


T523 


X92485 


I'lasmo^dkim"^ 

vivax" '™ 


^^^^^ 


~85 


"ii — 


1524 


G0409 1 


Homo sapiens 


8172. ^ ^ 


125 


50 


1525 


AB041228 


Homo sapiens 


G protein-coupled receptor TGR-1 


220 


100 




AF010144 










1527 


U52077 


Homo sapiens 


mariner transposase 


237 


56 




L27428 










1529 

— — 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


355 


78 




j^-^2QiQ 


— f 

Mus inus cuius 


IFN-response element binding factor 2 


— 

90 


37 




uUUJ j4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


64 


1532 


U11288 


Drosophila 
melanogaster 


diaphanous protein 


138 


38 


1533 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


429 


63 


1534 


L27428 


Homo sapiens 


reverse transcriptase 


249 


55 


1535 


U93570 


Homo sapiens 


putative p 150 


114 


31 


1536 


AF130089 


Homo sapiens 


PRO2550 


111 


47 



169 
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SEQID 
NO: 






Description 


Smith- 
Waterman 
Score 


y 






Lycopersicon 
esculentum 


extensin (class II) 




28 






Mus musculus 


reverse transcriptase 






1539 


AK024455 


Homo sapiens 


FLJ00G47 protein 


139 


80 




G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 




48 


1541 


AC024788 


Gaenorhabditis 


Hypothetical protein Y46E12A.d 


80 


46 


1542 




W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


132 


73 


1543 
— — 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


90 


32 




G02971 


— -. 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


— 

71 


— 

63 


1545 


R95913 


Homo sapiens 


Neural thread protein. 


lis 


55 






Pan paniscus 


cytochrome c oxidase subunit 1 






1547 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


370 


71 


1548 


W48351 


Homo sapiens 


Human breast cancer related protein 


119 


65 


1549 


AJ004810 


Zea mays 


cytochrome P450 monooxygenase 


140 


70 




AFl 136S5 


Homo sapiens 


PRO0974 


115 


47 


1551 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


172 


75 


1552 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


366 


94 


1553 


W40113 


Homo sapiens 


Human alplia-2(IV) collagen protein. 


117 


59 




G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 










Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 






1556 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


502 


89 




AF 1 30089 


Homo sapiens 


PRO2550 






1558 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


109 


43 


1559 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


183 


52 


1560 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


180 


52 


156 




AF202893 


Mus musculus 








1562 
-— 


M63421 




Drosophila 
melanogaster 


csp32 


104 


39 


63 




Homo sapiens 


— : — — 

Human secreted protein, SEQ ID NO: 

l-Hl', — r-^ 




129 




69 


T564 


U49973 


— : 

Homo sapiens 


ORF I ; MER37j putative transposase 
similar to pogo element 








AJ132106 


Bos taurus 


SCO-spondin 






1566 


AL390114 


Leishmania 
^^-'^'^ ^ 


extremely cysteine/valine rich protein 




119 


66 


1367 


AF161356 


Homo sapiens 


HSPC09.? 


1M 


— 


1568 


AF119851 


Homo sapiens 


PRO 1722 


94 


72 




^•^^'^•^^^ 


Homo sapiens 


reverse transcriptase 







— 




Lycopersicon 
esculentum 


extensui-like protein DiflO 







1571 


YD 1400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


126 


59 


1572 


X7348I 


Drosophila hydei 


mstl01(2) 


105 


41 


1573 


AF025467 


Gaenorhabditis 
elegans 


contains similaritj' to drosophila DNA- 
binding protein KIO (NID;g8148) 


133 


50 


1574 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


51 



170 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 
_Score 


Identit 

y 








~8l44 






1575 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


323 


83 




AF062008 


Caenorhabditis 
elegans 


unknown 








X92485 


Plasmodium 


pval 










Homo sapiens 








1579 


AF090944 


Homo sapiens 


PRO0663 


132 


59 




AL 137798 


Homo sapiens 


dJ1182A14.5.1 (novel gene (isofonn 

1)) 




53 






Plasmodium 


pval 








G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 






1583 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


135 


54 


1584 


G00637 




Homo sapiens 

„.,. .... 


Human secreted protein, SEQ ID NO: 

4718. 


119 


52 


1 8 




Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 










Homo sapiens 


Human secreted protein, SEQ ID NO*. 
7075. 






l^«S 


D28482 


Homo sapiens 


SCR2 


j90 


8j 




AFOl 0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


122 


59 




AFOlO 144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


50 


1590 


Ar 1 i tsU /o 


Homo sapiens 


PR01848 






1591 


G00985 


Homo sapiens 

... 


Human secreted protein, SEQ ID NO: 


56 


78 




G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 






— 


y^^^^^ 


Homo sapiens 


i = 


'"^^ - 








Mus musculus 


Pro-Pol- dUTPase polyprotein , 






1595 


L27428 


Homo sapiens 


reverse transcriptase 


138 


45 


1596 


X97707 


Pongo pygmaeus 
_^elii _ 


stopcodon created by 
posttranscriptional polyadenylation 


139 


61 


"1597 


X98710 


Homo sapiens 


COLl Al and PDGFB fusion transcript 






1598 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6561. 


97 


58 




AF2 1 065 1 


Homo sapiens 








1600 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


48 


1601 


J03770 


Mus musculus 


homeobox protein 


99 


35 


1602 


AFl 19901 


Homo sapiens 


PR02831 


119 


56 


1603 


AL031673 


Homo sapiens 


dJ694B14.1 (PUTATIVE novel KRAB 
box protein with 1 8 C2H2 type Zinc 


233 


44 


1604 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

11^^' J : 


130 


66 


-— - — 





— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 




89 


— 

44 


T606 


D88461 


ttu 


N-WASP 


"rv5 


-— 


T607 — 


AF090942 


Homo sapiens 


PRO0657 


"loT 


— 


1608 


U35730 


Mus musculus 


jerky 


105 


34 


1609 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein KIO 0siID:g8148) 


108 


45 


1610 


B06334 


Homo sapiens 


Human subtilisin-kexin isoenzyme 1. 


474 


84 


1611 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


101 


90 


1612 


AK024455 


Homo sapiens 


FLJ00047 protein 


83 


55 


1613 


D86853 


Catharanthus 


extensm 


123 


39 



171 
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SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identif 













y 






loseus 








1614 


AF119851 


Homo sapiens 


PRO 1722 


135 


— 


1615 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


142 


"57 








clone HCACJ81. 






1616 


AFl 94537 


Homo sapiens 


NAG 13 


154 


"63 


1617 


AFl 19851 


Homo sapiens 


PRO 1722 ^ 


91 


~69 






Homo saptens 




92 


"si — 








BCRB2. 






"1619 


AK024455 


Homo sapiens 


FLJ00047 protein 


—rpj 


— 


1620 


AF217973 


Homo sapiens 




TT^ 


"67 




AF000298 


Caenorhabditis 

aenor a i i 


weak s" "1 tv t U — ' 


-— — 


-— 








"and r™'^'^^'^" agens, g ycine- 






1622 




^Q2516 


Homo sa iens 


"saliv — "ror"'e-r'ch teT 


~To8 


-J- 




G04067 


li 

omo sapiens 




-— 

135 


-— 

44 








8 148^ i> V WO. 






T624 


^j^g 


Homo a iens 

omo sapiens 


ORFl" MER37* pu+ative transposase 




— 








ORFTMER37-'^'u"ative tran a 






1625 


l]/^gC)-J2 


Homo sapiens 




~rs6 


— 








similar to o^o element ^^P*^^^^® 






T&56 — 


~RT3556 


omo sapiens 


p'rot'hi eiicodeci^dow^n"tre i f hh M 
1 0 L,in enco e owns ream o c_ 


TT3 


— ^ 

















'X92485 


Plasmodium 


pva 


— 


45 






vivax ° 








1628 


AF090895 


Homo sapiens 


PRO0117 


"156 


~61 





AFl 16661 


Rattu"^^^^^"^ 


PRO 143 8 


"R7 


-— 




M13100 


a s _ 


unknown protein 


-ylg 


~ 






~H5^iFTens^ 








1631 


G03203 




"Human secreted r t i — SEO ID NO- 


T47 


-Tjr 








7284 P ' V 






1632 


AFl 19851 


Homo sapiens 


PRO 1722 


"Tot 


"lo — 


1633 


M64792 




salivary proline-rich protein 


~To9 


"46 






norveaicus 








1634 


L27428 


HomosapLns 


reverse transcriptase 


~l09 


"38 




AFl 18082 


Homo a iens 


PRO 1 902 


-RTi 


— 

"ts 


1636 


R95913 


Homo sapiens 


Human^^ecr t ^d^^^'te' — SEO ID NO- — 


HI 






G02490 


Hom sa iens 

omo sapiens 




-— 


— 








6571 ' ^ X 






1638 


U93570 


Homosa iens 


putative pi 50 


"T21 


"^4 


1639 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


"127 


"69 














1640 


U49973 


Homosa iens 


ORFl; MER37; putative transposase 


~Tn 


"68 














1641 


AF194537 


Homo sapiens 


NAG 13 


"T40 


""66 


1642 


G02493 


Homo sapiens 


Human secreted protein, SBQ ID NO: 


"Tol 


"lo — 








6574. 






1643 


M64793 




salivary proline-rich protein 


117 


33 






norvegicus 








1644 


AFO 10144 




neuronal thread protein AD7c-NTP 


"T23 


~50 


1645 


G03787 


Homo sapiens 


Human secreted protein, SEQ LD NO: 


"96 


"to 














1646 


L27428 


Homo sapiens 


rwerse transcriptase 


~86 


~84 


1647 


X92485 


Plasmodium 


^ — ans ptase 


137 


40 


1648 


U 15647 


Mus musculus 


reverse transcriptase 


93 


68 


1649 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


41 


1650 


AF116712 


Homo sapiens 


PR02738 


107 


57 


1651 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


72 , 








6720. 






1652 


X05472 


Rattus 


ORF3 


84 


51 



172 



wo 01/64835 PCT/USOl/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Idendt 

y 






norvegicus 








1653 


G00588 


Homo sapiens 


Human secreted protein» SEQ ID NO: 

4669. 


122 


Ti 


1654 


U93566 


Homo sapiens 


p40 


117 




1655 


AF2 17536 


Homo sapiens 


truncated mevalonate kinase 


141 


70 


1656 


AF090895 




PRODI 17 






1657 


X92485 


Plasmodium 


pval 


114 


45 


1658 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-TsrTP 


192 


61 


1659 


Y86473 


Homo sapiens 


fragment, SEQ ID NO:388. 


73 


30 


1660 


Y91577 


Homo sapiens 


Human secreted protein sequence 


331 


74 


1661 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7519. 


"l27 


"69 


1662 


LJ49973 


Homo sapiens 


simflar to po^o 'e^e^metiT ''^^"^^"^^^^ 


j12 


67 


1663 


S80119 


Rattus sp. 


reverse transcriptase liomolog 


99 


59 














TS^^ 


M76729 




Hum^an'tocast'cancer^r^^^ 


~ 






W4835 1 


Homo sip'icnl 


BCRB2. 




~ 




AF169388 




alpha 4 collagen IV 






1668 


W48351 


Homo sapiens 


Human breast cancer related protein 


84 


66 


"1669 


U49973 


Homosa lens 

omo sapiens 


ORFl; MER37" putative transposase 


~Jjk 


"8?i 


1670 


M29622 


Mus musculus 


open reading frame 2 


74 


46 


1671 


W90838 


omo sapiens 


#6. 


98 


100 


1672 


G00328 


Homo sapiens 


Human sscreted protein, SEQ ID NO; 
4409. 





"58 


1673 


AF09093 1 


Homo sapiens 


PRO0483 


72 




1674 


AF051782 


Rattus ^^^'^"^ 


diaphanous 1 


116 


"49 


1675 


U57361 


norvegicus 


collagen XII alpha 1 


108 


~48 


1676 


AF 182844 


Homo sapiens 


VPS28 protein 


395 


95 


1677 


L27428 


Homo sapiens 


reverse transcriptase 


189 


47 


1678 


G02532 


Homo sapiens 


Human sscreted protein, SEQ ID NO: 
6613. 


142 


53 


1679 


U93565 


Homo sapiens 


putative pi 50 214 


40 


1680 


AK002129 


Homo sapiens 


unnamed protein product 128 


57 


1681 


X03 145 


Homo sapiens 


pot. ORF V 93 


48 


1682 


X63005 


iVlus musculus 


proline-rich protein ^ 108 




1683 


AFl 18082 




PRO1902 117 




1684 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_lVI 157 
oncoprotein. 


~M 


1685 


R95913 




Neural thread protein. 


92 





1686 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6566. 


111 




1687 


X61296 


norvegicus 


open reading frame 2 


104 




1688 


ABO 12223 


Canis familiaris 


ORF2 


98 


39 


1689 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


278 


72 


1690 


U52077 


Homo sapiens 


mariner transposase 


175 


56 


1691 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


83 


46 



173 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


1692 


AF061128 


Plasmodium 


merozoite surface protein 1 


85 


44 


1693 


X77722 


Hotmfsapiens 


interferon alpiia/beta receptor 


~89 


~60 


1694 


M13100 


Rattus 


unknown protein 


94 


40 


1695 


AF202635 


Homo sapiens 


PP1200 


114 


60 


^11^ 


AKOO 11 16 


Homo sapiens 












G00403 


omo sapiens 


Human secreted protein SEQ ID NO* 






1698 


AFl 18078 


Homo sapiens 


PRO 1848 


93 


42 






vivax 


pval 


147 


49 


1700 


M63819 


Plasmodium 


malaria antigen 


101 


64 


Ttoi — 


AF090930 


HonwsapLns 


PRO0478 


146 


— 


1702 


AB009993 


Mus niusculus 


collagen al(V) 


94 


40 




AL3901 14 


Leishmania 


extremely cysteine/valine rich protein 


169 


66 


T704 


AFl 30089 


Homosa iens 

Human'^^'^"^ 


PRO2550 


— 


— 

^1 


^705 




herpesvirus 6 


U88 


Til 




1706 


S60088 


Homo sapiens 


putative adhesion molecule=ADMLX 


151 


86 


1707 


GO 1246 


Homo sapiens 


secreted protein, SEQ ID NO. 


176 


52 


1708 


AFl 30079 


Homo sapiens 


PR02852 


174 


41 




Y28682 










1710 


Ml 4423 


Mus musculus 


pro-alpha- 1 type I collagen 


112 


34 














1712 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


41 


Trn — 


, 


~R '■ 


plakophilin 




60 


T7T4 — 


QQ^^^^ 


Homo sa*^iens 

omo sapiens 


7534. 


i" 




1715 


U49973 


Homo sapiens 


ORF 1 j MER37; putative transposase 


171 


48 


1716 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


63 


1717 


S80119 


Rattus sp. 


reverse transcriptase homolog 


140 


57 


TTil — 


L27428 













G03790 


Homo sapiens 


Human secre^d protein SEQ ID NO- — 
7871. 









AF130089 




PRO2550 






1721 


G03652 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


49 


1722 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


111 


72 


1723 


AFO 16099 


Mus musculus 


endonuclease/i-everse transcriptase 


132 


60 


1724 


X92485 


Plasmodium 
vivax 


pval 


96 


62 




U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


— — 


-— 


1726 


M81321 


Macaca } proline-rich protein 
fascicularis I 


"T32 


~45 


1727 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6588. 


119 


69 


1728 


U93564 


Homo sapiens 


p40 


129 


58 


1729 


U93574 


Homo sapiens 


putative pi 50 


113 


76 


1730 


AF130089 


Homo sapiens 


PRO2550 


136 


61 


1731 


L056G8 


Cercopithecine 


glycoprotein gl 


100 


46 



174 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 






herpesvirus 2 




















1733 


U93574 


Homo sapiens 


putative pl50 


187 


43 




G03714 


Homo sapiens 


Human secreted protein, SEQ ID NOi 


155 


64 


1735 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. — ^ _ 


102 


64 


T736 


Y02999 


Homo sapiens 


Fragment 01 human secreted protein 
encoded by gene 121. 


TT8 


~ 


17j7 


G00490 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO. 


110 


58 


T738 

T739 


AF090942 


Homosa iens 


PRO0657 


T63 


^5 




15rosophik ° 

melanogaster 




~149 


~46 




L27428 










1741 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


76 


54 


1742 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


165 


59 


— ^ 


AC003682 




^tensto"^ 




'% 






Volvoxcarteri 




'TT 




1745 


Q00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


62 


1746 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


89 


41 


1747 


— rF 

G0j787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
23^^ 


115 


. 

42 


T748 


■ ^jggggj 


: 

Homo sapiens 


putative pi 50 


~125 


— 


1749 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


45 


1750 


y02999 


Homo sapiens 


Fragment of human secreted protein 


139 


53 


1751 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


87 


80 


— 


L27428 


Homo sapiens 


reverse transcriptase 











Homo sapiens 


putative pl50 







1754 


U22376 


Homo sapiens 


alternatively spliced product using exon 


134 


50 


1755 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


175 


76 


1756 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


113 


60 


1757 


U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


158 


44 


1758 


Y86579 


Homo sapiens 


Human gene 92-encoded protein 
fragment, SEQ ID NO:496. 


109 


88 


1759 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


73 


T766 — 


yggg^g 


Homo sapiens 


Hmnan gene 92 encoded protein 

fragment, SEQ IDNO:496. 


m 


"99 


1761 


Y86579 


Homo sapiens 


fragment, SEQ ID NO:496. 


no 


81 


1762 


U08020 


Mus musculus 


collagen pro-alpha-1 type I chain 


105 


34 


176o 


G0j790 


Homo sapiens 


7871. 


90 


45 


1764 


U93569 


Homo sapiens 


putative pl50 


148 


36 


1765 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


116 


45 


1766 


X02873 


Daucus carota 


put. precursor 


112 


47 


1767 


X92485 


Plasmodium 
vivax 


pval 


100 


45 


1768 


R95913 


Homo sapiens 


Neural thread protein. 


96 


65 



175 



wo 01/64835 PCT/llSOl/04927 



SEQID 


Accession No. 


Species 


Bescription 


Smith- 




NO: 








Waterman 


Identit 










_Scm;e 


y 


T769 


G00637 


Homosa iens 


Human secreted protein, SEQ ID NO: 


















1770 


X97675 


Homo sapiens 


plakophiiin 2b 


"TTs 


^70 


1771 


G00637 


Homo sapiens 

omo sapien 




Tm 


^8 








4718. 






"T772 


QQ2522 


Homosa iens 

omo sapiens 


Human secreted protein, SBQ ID NOr 


-jyj 


— 








6613. 






1773 


AF130089 


Homo sapiens 


PRO2550 


T58 


"So 


1774 


U23552 


AiluroDoda 


cytochrome b 


"^06 


^78 






melanoleuca 








1775 


AF010144 


Homo sapiens 


neuronal tliread protein AD7c-NTP 


"98 


33 




AF116715 


Leishmad^"^ 


PR02829 


~r4 


"67 


Tttt — 


AC008054 




L8453.1 


TT4 









major 








1778 


AF037364 


Homo sapiens 


paraneoplastic neuronal antigen MAI 


"397 


n — 




R96800 






~P2 


"60 


Ttso — 


G 024 80 


Homo sapiens 

omo sapiens 


Human secretedpr^td^ 


TSs 


~Ka 








6561, 






"PtsI — 


U49973 


Homo sa iens 

omo sapiens 


OR-F 1 ■ MER3 7 ' putative transposase 


~Rt1 

















T789 


AF 174482 


Homosa iens 

omo sapiens 


- " . 


T33 


~46 


-p=^= 




Homo sapiens 


^neuronal^thr-ad — ot ' AD7 NTP 


— — 


— 




T78i 


AF010144 






-~ 




T785 


AF194537 


llomo sapiens — 


i^GiT — -^-2L-^15? ^ 


"m 


"sl — 


TTRfi 


U21123 


r s p 1 a 


i 

ena polypeptide 


— 

120 


— 

44 






mel^ogaster 








— — 

1787 


Arzuui 5 / 




11 : 

EBNA2-like protein 


108 


30 

















G02872 


Ij = 

Homo sapiens 


— —J —. — 

rluman secreted protein, ocQ ID NO: 


_ 

)1 




56 


















M64792 




salivary proline-ricii protein 


— — 

128 


— 

40 














T790 


^□2485 


H^^^dium 


— ai 

pva 


"98 


-j-j 














T79l 




AF010144 


Homosa iens 


neuronal thread rotein AD7c-NTP 


Ti4 


^1 




(J49973 


~H 

omo sapiens 


OKr 1 J MiiKj?, putative transposase 


— — 

197 


81 








Hum^*^c°Jed^ roteto SEO ID NO- — 






tto — 


G00328 


Homosa iens 

omo sapiens 




"TIs 


— 








4409 P ' V 






1794 


AF104923 


Homo sapiens 


putative transcription factor 


l42 


~59 


1795 


W48351 


Homo sapiens 


Human breast cancer related protein 


133 


60 








BCRB2. 






1796 


AC003113 


Arabidopsis ' 


F2401.6 


"57 


"62 






thaliana 








1797 


M22332 


Homo sapiens 


unknown protein 


118 


29 


1798 


Y02693 


Homo sapiens 


Human secreted protein encoded by 












gene 44 clone HTDAD22. 






1799 


AL390114 


Leistimania 


extremely cysteine/valine rich protein 


154 


"It — 


"1800 




Homo sipiens 


— 40 


— 7— r 


""sfi — 


TWi — 




L>copcrsicon 


cxtensin lil-e rotein Dif54 

cx ensin- KG pro em i 


— — 


"^8 






esculentum 








1802 


L27428 


Homo sapiens 


reverse transcriptase 


102 


34 


1803 


L27428 


Homo sapiens 


reverse transcriptase 


141 


43 


1804 


Ml 8933 


Mus musculus 


alpha- 1 type-Ill collagen precursor 


118 


30 


1805 


X92485 


Plasmodium 


pval 


106 


67 






vivax 








1806 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




65 








7492. 







176 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Wentit 


1807 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


33 


1808 


X97675 


Homo sapiens 


plakophilin 2b 


154 


61 


1809 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

"p^sr 


130 


53 


"TsTo — 


X92485 


Plasmodium 

vivax 




T33 


— 




(503 807 




7888. ' ^ 










endogenous 
retrovirus K 


env protem 


119 


81 


— Ill — 


AF119851 


Rattiis ^^^'^'^^ 


PR01722 










norvegicus 


ORF4 


Til 


"55 


1815 


G0o47j 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


111 


74 


1816 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


153 


68 


1817 


M19155 




-antigen precursor 


164 


50 


1818 


AFl 18082 


Homo sapiens 


PRO1902 


90 


75 


1819 


W40353 


Homo sapiens 


US5702907^ 


110 


52 


1820 


U93563 


Homo sapiens 




~TT4 


"35 


1821 


U41538 


Caenorhabditis 


proline rich 


95 


52 


"1822 


AF010144 


Homo sapiens 


neuronal thread protein AD7c NTP 


T54 


~45 


1823 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


53 


1824 


AF 130089 


Homo sapiens 


PRO2550 


128 


40 


4^5? — 


AF090944 




PRO0663 




4s — 




AC003113 


tliaUana''^'^ 


F2401 18 


10 




1827 


AF194537 


Homo sapiens 


NAG 1 3 


85 


28 


1828 


AF009668 


^lerosfs 

associated 

retrovirus 


polyprotem 


185 


41 


1829 


AFO 16099 


Mus rtiusculus 


endonuclease/reverse transcriptase 


155 


"42 


1830 


X69465 




ryanodine receptor 1 


516 


~86 


1831 


G00454 


Homo sapiens 


liuman secreted protein, SEQ ID NO: 
4535. 




"40 


1832 


U88966 


Homo sapiens 


rapamycin associated protein FRAP2 


434 


89 


1833 


M19155 


Plasmodium 
falciparum 


S-antigen precursor 


105 


32 


1834 


AF085809 


Mus musculus 


synapsin lb 


98 


33 


1835 


AK023003 


Homo sapiens 


uimamed protein product 


393 


81 


1836 


Y41740 


Homo sapiens 


Human PRO701 protein sequence. 


429 


78 


^— 


M369I3 


Mus'inusculus 










X63005 




prolke^rich^IMioteto^'^ ^' ^'^''^'''^^ 


^8 





1839 


X83413 


Human 


U88 


149 


45 


1840 


AF134304 


Homo sapiens 


Scar2 


87 


37 


1841 


AC024772 


Caenorhabditis 
elegans 


contains similarity to Mus musculus 
alpha-NAC, muscle-specific form 
(GB:U48363) 


131 


25 


1842 


AB002366 


Homo sapiens 


K1AA0368 


153 


75 


1843 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


140 


47 



177 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


T844 


AL035526 


^^^.j — sis 


exte s' I'k P 

ex ensin- i -e pro em 



















Q2(,i56 


riomo sapiens 


nolNrZu 


-Q- 


— 


T846 


M14228 


Hoino ?^iens 




^98 




1847 


AK022217 


^omo ^^^^^ 


unnamed^protdn^^^ 


"97 


— 1 

— ^ 


g 


AJ250042 


omo sapiens 


Rab5 GDP/GTP exchan<Je factor 


"174 











homologue — _ _ 






— sTo 


W23949 


— : 

Homo sapiens 


Human phosphomositide 30H-kinase 


143 


28 








Hum ^"^"'^ J p — SEO ID NO — 






1850 


G03786 


Homo sa iens 

omo sapiens 




— j 


-— 








^imatisecr e proem, g 






1851 


ABO 17 114 


Homosa iens 


~ad1 


113 


Itm — 


1852 




D00570 


Mus musculus 


open reading frame (251 AA) 


174 


~55~ 




A67510 


Ivf i 

Mus musculus 


\/ti Tc ^/^T Tcr*! tt t tc i^v*Ktr\\jnn tmvt a 
mUo IVlUouULiUo vJCXNUMlC JJINA 


TT^ 


— 

42 








CONTAINING N ALLELE OF FVl 












mari^r trans osase 






1854 




U49974 


Homosa iens 


__ P 


-JTJ 


"82 






omo sapiens 







— j 

-|i 


1856 




D89729 


omo sapiens 


CRMl protein 


~- 

475 


90 




AF090895 


omo sapiens 


PROG 1 17 






1858 


AFO 15926 


omo sapiens 


^u"^'*^ i.^^*'''?^°cn"^ binding 


"TTt 

117 


~ 

73 








pnospnoprotem-50 






l85g — 


^^^^^^ 


"Gil — ni 

a us ga us 


NF-kB p65 siibunit _ 


— — 

^TT 


— 


Imi 




K03204 


omo sapiens 


salivary prohne-ricn protein precursor 


-iii 


~ 


1861 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


195 


.... 
52 








clone HPMBQ32 






1862 


U49973 


Homo sapiens 


ORF I ; MER37; putative fransposase 


331 


69 










similar to pogo element 






1863 


GO 4091 


; 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


35 









. 









— — — ^ 


"FT ■■ 

Homo sapiens 


reverse transcriptase 


— 








Homo sapiens 


ORFlj MER37j putative transpos^se 


106 


46 









similar to pogo element 






l86fi 


u 

— 


— — : 

ycobacteriuin 

^'^'^ 3^-^ 


fibronectin attachment protein 


86 


36 




1867 


AF 255446 


Crypthecodin lum 


— r-; : — J : 

Dipl-associated protein C 


134 


45 






H^'' a iens 








1868 





omo sapiens 


H — an reTd P — SEO ID NO — 


-— 


— 

63 








^™ansec e proem, Q i 






1869 


APOOOOOl 


-p 


235aal n h — otheti al ts' 

J aa ong ypo e ica pro ein 


ITTs 









h rile shii 
Rattur " 








1870 


M13100 




unlciiown rotein 

un lownproem 


l2l 


-j= 






norveaicus 








UtI — 


X67863 


Mus musculus 


~T2 


Inl 


— 


1872 


S80119 


Ratti^sD^^ 


reverse transcriptase horn log 


l5l 


"43 


1873 


W73633 


Homo sapiens 


Human secretedirotein clone 


"Tio 


"44 


— 


!^f7oo?i 


Uln — a iens 

omo sapiens 


— — Tj^^g '■ 

myosin- — _ 


— 


— 


1875 


W48351 


Homo sapiens 


Human breast cancer related protein 


— 

92 




54 






. 


rn^Hff tTri ti 






lR7fi 


M21097 


Horn" sa'^'ens 




— 

lln 


— 


1877 


(304091 


omo sapiens 


Human secrrtedVotefa SEQ ID NO — 




~ 








8172. 






1878 


U25281 


Rattus 


SHS domain binding protein 


108 


36 






norvegicus 








1879 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


48 








8172. 






1880 


X73113 


Homo sapiens 


fastMyBP-C 


599 


77 


1881 


AX028128 


Homo sapiens 


unnamed protein product 


162 


43 


1882 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


62 



178 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCTAJSOl/04927 



SEQID 
NO: 




Species 


Description 


Smitli- 
Waterman 


% 

rdentif 

y 








7870. 






jgg^ 


Y00664 


Homo ^^P^^"^ 


T^" 'h^t'cTr^^^/s^khi ^ " ^?ivf 










U86587 


US muscu us 


^ub ^^it^ i'o'dTt"° " '^^ ^ 




77 


1885 


X67337 


Homo sapiens 

omo sapien 


Hum^^ — mkNA cleava e factor 1 68 
koT^bu^ eavage or 


148 




1886 


Y13829 


Homo sapiens 


MBNL protein 


117 


63 








Human secreted protein, SEQ ID NO' 
6012, 


















1889 


L39059 


Homo sapiens 


transcription fector SLl 


92 


34 




M69297 


Homo sapiens 




~5T^ 





T891 






Neijgal thread rotein 







T892 ' 


228201 


Sac'ch^omyces — 
cerevisiae 


rM?P Vl^T OfV?«/ 


— 




189j 


G00427 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO. 


90 


51 


1894 


U49973 


Homo sapiens 


ORFl- MER37- putative transposase 

similar to pogo element 


183 


IT — 








phosphomositide 3-kinase 


"H? — 




"1896 — 


AF093775 


Mu"*" uT'^lu^ 

_ ^ 


a p a-a ^ — m-Mn- — 







"1897 — 


G0ci797 


omo sapiens 


7878^^*'^^ 


— 


— 

56 


"1898 


M21904 


Homo sapiens 


4F2 heavy chain antigen 


-TTgg 


~80 


1899 


AB028997 


Homo sapiens 


KIAA1074 protein 


145 


38 






herpesvirus 4 








"ifs^ — 


ABO 1 1 1 42 


Homo sapiens 


KIAA0570 protein 




95 




AF194537 


"Homo sa^r"^ 


^^^^ 150 







1903 — 










15 


— 


M29399 


Homo sapiens 


eiydiroc^e membrane protein band 4 2 " 


IP 


I0 


1905 


Y27400 


Homo sapiens 


Human P450 reductase fimctional 


294 


67 


1906 


Y28503 


Homo sapiens 


HGFH3 Human Growth Factor 


167 


100 


1907 — 


AF128625 
^B029147^ 


Homo sapiens 


CD'c42'tod^ing protein kinase beta 


"509 


"89 


1908 


Cucumis sativus 


expressed in cucumber hypocotyls 


98 


41 














1910 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 


330 


48 


I9TI 


AF 129075 


It ' 

omo sapiens 


T-COMPLEX PROTEIN 1 THETA 
SUB UNIT (TCP-l-THETA) 


— 

511 


— 

90 


1912 


AFO 11450 


Mus musculus 


type X V collagen 


87 


28 




Y14482 


Homo sapiens 


en^^ded b"^*^™^!? ^^'^'^^^^'^ protein 


124 


75 


I9T4 — 


AF171230 


y.^^ 

unguiculata 


phosphatidif acfd phosphatase beta 


IT2 


— 


1915 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


100 


47 


1§T^ — 


M94131 














Nicotiiffla"^"^ 

tabacum 


extensm i-ozv) 


15o 




1918 


AF 186605 




MLL2 protein 






1919 


M12130 


Mus musculus 


RNA polymerase II 


498 


83 


1920 


AL049794 


Homo sapiens 


dJ777L9.1 (novel protein similar to 
mouse kinesin-like proteins KIFIA and 
KIFIB) 


514 


90 


1921 


D83703 


Homo sapiens 


peroxisome assembly fector-2 


233 


64 


1922 


Y 11922 


Homo sapiens 


Human 5' EST secreted protein SEQ ID 
No: 522. 


162 


77 



179 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 




Identit 


T923 


AC00248 1 


Homo sapieni 


similar to nitrogen permease refflilatoF^ 
similar to P39923 (PID:g730170X 
match to AA233630 (NID'gl 856833) 
and AA399402 (NID:g2053U7) 




"79 


1924 


D82060 


Homo sapiens 


membrane protein with histidine rich 


115 


40 


1925 


U49974 


Homo sapiens 


mariner transposase 


195 


61 




S80119 










"1927 


G00354 


Homo sa" iens 

omo sapiens 


Human secreted protein SEQ ID NO* 


~79 


~ 


1928 




Homosa iens 

omo sapiens 


EXT 1 putative tumour 
exostoses candidate gene 


~430 


~88 




X58063 


Brugia pahangi 


mfcomaridTheaft^°°^°* "'^^^ 


104 


43 


1930 


AF 119855 


Homo sapiens 


PRO 1847 


132 


64 


1931 


AJ005577 


Homo sapiens 


6-phosphofructo-2-kinase 


326 


90 




AF02026I 






"Hi — 




1933 


AJ272204 


HOTno'^ tens™ 


h>po&etical^protehi 




15 


1934 


AL451017 


Neuroq^ora"^ 

crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


1l6 


19 














1936 


U93564 


Homo sapiens 


putative pl50 


217 


54 






herpesvirus 4 










API 9453 7 






1^3 




"1555 — 


AF226044 


Homo sapiens 


HSNFRK 




"15 


1940 


X83413 


Human 

herpesvirus 6 


U88 


303 


50 


1941 


AJ007628 


a s ^ 


ELK channel 1 


-— 


— 


~T942 


AF 104328 


AraMdopTis 

thaliana 


cell wall plasma membrane Tinker 

protein homolog 


"135 


13 


1943 


AF 130089 




PRO2550 










Homo sapiens 

omo sapiens 


Human secreted protein, SEQ ID NO. 


~W2 


~ 


1945 


N419419 


Mus musculus 


nrolinc-rich salivarv nrotcni 


~Tn 


40 


1946 


U57316 




histone acetvltransferase 


132 




1947 


AL16jj02 


Homo sapiens 


human type XVIII collagen 


79 


34 


1948 


AB020746 


Aiabidopsis 
thaliana 


protein kiiiase-like protein 


117 


36 


1949 


AJ011738 


Homo sapiens 


Inilb 


209 


85 


1950 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


155 


48 


1951 


AFl 19569 


Homo sapiens 


patched 2 


166 


89 


1952 


AB002107 


Homo sapiens 


hPer 


118 


39 


1953 


X98834 


Homo sapiens 


zinc finger protein HsaI2 


430 


70 


1954 


Y19641 


Homo sapiens 


SEQ ID NO 359 from W09922243. 


96 


64 


1955 


U93569 


Homo sapiens 


putative pi 50 


157 


44 


1956 


G04091 




8172. 


85 


53 


1957 


AB013729 










1958 


AC005360 


Homo sapiens 


FAA 


338 


68 


1959 


AJ223075 


Homo sapiens 


TRIP protein 


598 


95 


1960 


AC004022 


Homo sapiens 


serum paraoxonasearylesterase 3 


147 


62 


1961 


AF076776 


Drosophila 
melanogaster 


helicase DOMINO A 


157 


45 


1962 


AF265555 


Homo sapiens 


ubiquitin-con jugating BIR-domain 
enzyme APOLLON 


587 


77 


1963 


AF229642 


Mus musculus 


DXImx46e protein 


127 


91 



180 



SEQID 
NO: 






Description 


Smitli- 
Score 


Identit 

y 


1964 


U72520 ! Musmusculus 


mena protein 


89 


31 




W40309 j Homo sapiens 


Human ITAK protein. 




32 


1966 


AJ388557 ; Canis familiaris 


zinc finger protein 


826 


56 




Y92515 1 Homo sapiens 


Human OXRE-12, 




53 


1968 


Y17832 


Human 
endogenous 
retrovirus K 


pol protein 


187 


49 


1969 


Y41245 Homo sapiens 


Human Y218 protein. 


220 


78 


1970 


AB052738 | Sus scrofa 


Smadj 


366 


85 


1971 


AB007644 Arabidopsis 
thaliana 


contains similarity to phytocyanin/early 
nodulin-like proteiii'-gene id:K19P 1 7.3 


106 


32 


1972 


G00333 Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


57 


107 


AL357472 [ Homo sapiens 


VPS33B 


112 




974 


AF 1 5 1902 1 Homo sapiens 


CGI-144 protein 






1975 


AL 1 37260 1 Homo sapiens 


iiypothetical protein 


148 


92 


1976 


U89505 Homo sapiens 


HIark 


408 


89 


1977 


U67328 Musmusculus 


NIPI-like protein 


168 


73 


1978 


AK026435 | Homo sapiens 


imnamed protein product 


601 


94 




Y143I8 


Homo sapiens 


peroxisomal ABC-transporter 


507 


96 


1980 


U63630 


Homo sapiens 


MCM4 


570 


90 


1981 


AFl 18090 


Homo sapiens 


PRO2044 


154 


84 




AFO 16370 


Homo sapiens 


U4AJ6 small nuclear ribonucleqprotein 
hPrp3 


422 


63 


1983 


AB0ni54 


Homo sapiens 


KIAA0582 protein 


420 


80 


1984 


ABO 11422 


Homo sapiens 


Trad 


201 


67 




AL390156 


Homo sapiens 


hypothetical protein 


230 


97 


1986 


AJ242540 


Volvox caiteri f. 
nagarlensis 


hydtoxyproline-rich glycoprotein DZ- 
HROP 


163 


44 




Ml 2523 


Homo sapiens 


alloalbumin Venezia 




91 


1988 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7077. 


145 


63 




Y36156 


Homo sapiens 


Human secreted protein #28. 


107 


80 


1990 


AFL61356 


Homo sapiens 


HSPC093 


156 


57 


1991 


G03443 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7524. 


132 


72 


1992 


AF119851 


Homo sapiens 


PRO 1722 


126 


52 




G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


228 


60 




AF010I44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 




1995 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6550. 


118 


58 


1996 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

.- 


137 


57 


1997 


X75068 


Bos taurus 


plasmalemmal ponn 


85 


85 


1998 


Y64869 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO: 1030. 


92 


80 


1999 


Y91577 


Homo sapiens 


Human secreted protein sequence 

encoded by gene 2 SEQ ID NO:250. 


275 


79 




AF 11 8078 


Homo sapiens 


PRO 1848 


111 


57 


"5o5T — 


S80119 


Rattus sp. 


reverse transcriptase homolog 






2002 


ABOllllO 


Homo sapiens 


KIAA0538 protein 


146 


73 


2003 


Ml 5530 


Homo sapiens 


B-cell growth factor 


94 


64 


2004 


AF225918 


Mus musculus 


intestinal cell kinase 


216 


77 


2005 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


85 


2006 


D84391 


Mus musculus 


reverse transcriptase 


135 


38 


2007 


AF090930 


Homo sapiens 


PRO0478 


101 


86 



181 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Wateiraan 

-— ^ 


Hentit 

y 


^008 


G03807 


Homo sapiens 


Human secreted protein SEQ ID NO- — 
7888. 












gene 22 clone HMSJW18. 








AFl 13685 




PRO0974 






2011 


AF 130079 


Homo sapiens 


PR02852 


141 


78 


1 


AF010144 












AF090944 


Homo sapiens 


PRO0663 


"ST 




2014 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived jGrom DNA35638. 


151 


58 


2015 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


223 


75 


2016 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

: 


137 


78 


"2017 


^2Qg^^ 


Homo sapiens 


chromosomal protein 


^20 


"74 


2018 


G03646 

-— — — 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7727. 


68 


57 


^oT9 — 


G 03 646 


— : — 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
7727. 


. 








Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 










Homo sapiens 


chromosomal protein 






2022 


W4835] 


Homo sapiens 


Human breast cancer related protein 
BCILB2 


139 


69 


2023 


U93569 


Homo sapiens 


putative pi 50 


187 


89 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 






2025 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


142 


71 


2026 


Y76198 


Homo sapiens 


Human secreted protein encoded by 


77 


66 


2027 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxy pro line-rich glycoprotein DZ- 
HRGP 


372 


73 


2028 


U49973 


Homo sapiens 


ORFl ; 1VCER37; putative transposase 
similar to pogo element 


159 


80 


2029 


AFl 9453 7 


Homo sapiens 


NAG13 


158 


57 






Trypanosoma 
brucei 




1 10 


41 




AF197913 


Helicoverpa 
annigera nuclear 
polyhedrosis 


basic DNA-binding protein BDBP 




55 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 






20j3 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


82 


2034 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


75 


2035 


G03646 


Homo sapiens 


Human secreted prwtein, SEQ ID NO: 


104 


67 


2036 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


71 


2037 


AF 130089 


Homo sapiens 


PRO2550 


146 


55 


2038 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


81 


73 


2039 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


66 


2040 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


159 


68 



182 



wo 01/64835 PCT/llSOl/04927 



SEQ ID 




Species 


Description 


Smith- 




NO: 








Waterman 
S^re 


Identit 


^2043 




Homosa iens 

Human^^^'^"^ 


Neuron associated protein 




~7Q 

-Ip- 


^042 


yQ^^-g 




BYRFl encodes EBNA-2 (Dambaugh 


Tos 


77 






he" esvirus4 


et al, 1984; Dillner et al, 1984) 






"2043 


QQ2S\^ 


Homosa iens 

omo sapiens 


Human secreted protein, SEQ ID NO. 


"125 


— 

65 







. 










AFl 18086 


omo sapiens 


PRO 1992 


-— 


— 




^045 


G02532 


omo sapiens 


Human secreted protein, SEQ ID NO. 


-— 


64 














■^7046 


W50192 


omo sapiens 


Amino acid sequence of salivary 


Tor 


— 









protein CON-1. 








Y08062 


Homosa iens 

omo sapiens 


Hurnan PR0245 protein fragment 




— 








derivea irom DN A35638. 






"7048 


AL3901 14 


eis mama 


DNa'c^ ^ 


155 


64 








-—J ; 










U49973 


Homo sapiens 


ORFl; ]vIER37j putative transposase 


— 

125 




53 









similar to pogo element — ^ 








AT 'ioni 1 A 


Leisnmania 


extremely cysteine/valine rich protein 


— 

112 


— 

56 








"^^■^"'^ -. 












Homo sapiens 


— -3 ; — 

Human secreted protem, SEQ ID NO: 


— 

70 


— r 

6^ 








— 












Human*^ 


I"'"*^" 




"tIx 





^053 


^ ^^^^ 















herpesvirus 6 










G02902 


omo sapiens 


Human secreted protein SEQ ID NO- — 


— 











6983. 






"2055 





n ^ 

omo sapiens 




TnH — 

4 


— 








6613. 






"2056 


^. ,,^4 


^. .. 


coding region (AA 1 - 437) 


-j~ 









disco Weum"" 








^^^057 


1R95913 


Homosa iens 

Homosa'^'iens 


Neural thread protem 


"83 


"77 


"^058 


AFl 18080 




PRO 1880 





"6l 


~2059 




AJ276003 


TTi aiens 

omo sapiens 


GARl protein 


—2 

— i 







G02485 


omo sapiens 


Human secreted protein, SEQ ID NO: 


120 




70 









i — j-T r-' 








X01918 


-fr — j 

Urosophua 


salivary gland glue protein 


Tm 

179 


— 

40 






melanogaster 








"^2062 


U00029 


cerevisiae'^^'^^^ 


Yhr217cp 


ITo 


"dfi 














^063 


AF130051 


Homosa iens 


PRO0898 


~l4^ 


"80 


^064 


AF00606 1 


Homo sapiens 

omo sapiens 




T2T 


"r^ 








hGH-V3 






2Qgg 


(302409 


Homo sa iens 

omo sapiens 




Ttq 











6490. ^ ' Q 






2066 


U82303 


Homo sapiens 




92 


"68 


2067 


AJ242540 


Volvox carteri f. 


hydroxyproline-rich glycoprotein DZ- 


J168 


"79 






Hwno^sTTens 








2068 


L27428 




rev^fe Uanscriptase 


"186 


"38 




AF130051 


Homo sa iens 


PRO0898 


-— 


^ 


2070 


U49973 


Homo ^piens 


ORFl; MER37; putative transposase 


"271 


"^5 








similar to pogo element 








AF130051 




PRO0898 






2072 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-lsTP 


117 


67 


2073 


AF157706 


Human 


B4 


104 


49 






herpesviras 6B 








2074 


AL049608 


Arabidopsis 


extensin-like protein 


120 


41 






thaliana 








2075 


AP002460 


Arabidopsis 
thaliana 


geneid'.Fl D9.26~unknown protein 


150 


27 


2076 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


85 



183 



wo 01/64835 PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitli- 

Waterman 

Score 


Idcntit 


2077 


Ml 1897 


Mus musculus 


proline-rich salivary protein 


125 


42 


2078 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


66 


75 


2079 


AF041330 


Bodo saltans 


NADH dehydrogenase subimit 5 


185 


47 


^2081 


AJ006470 


Homo sapiens 


cartilage-associated protein (CASP) 




84 


AK024509 


Homo sapiens 


unnamed protein product 


132 


83 


2082 


Uy3564 


Homo sapiens 


putative pi 50 


137 


67 


2083 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


99 


71 


2084 


B01372 


Homo sapiens 


Neuron-associated protein. 


148 


81 


2085 


AL16037] 


Leishraania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


114 


76 


2086 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


255 


63 




AFl 94537 


Homo sapiens 


NAG 13 


122 


78 


2088 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


108 


74 


2089 


AFl 3 0079 


Homo sapiens 


PR02852 


149 


77 


2090 


U93574 


Homo sapiens 


p40 


260 


96 


2091 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


99 


76 


2092 


AF194537 


Homo sapiens 


NAG 1 3 


125 


51 


2093 
"2094 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


319 


72 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


159 


56 


2095 


U93572 


Homo sapiens 


putative plSO 


195 


57 


2096 


X83413 


Human 
herpesvirus 6 


U88 


132 


^57 




AFO 10400 


Homo sapiens 


transaldolase-related protein 


463 


89 


2098 


U93563 


Homo sapiens 


putative pi 50 


128 


35 


2099 


U49973 


Homo sapiens 


ORFl ; MER37, putative transposase 
similar to pogo element 


160 


43 


2100 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


133 


43 


2101 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


97 


71 


2102 


K02401 


Homo sapiens 


chorionic somatomammotropin 


628 


90 


2103 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


141 


80 


2104 


U49973 


Homo sapiens 


ORE 1 ; MER37; putative transposase 
similar to pogo element 


309 


63 




AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


192 


37 


2106 


Y59772 


Homo sapiens 


Human normal ovarian tissue derived 
protein 49. 


261 


89 




AF202051 


Homo sapiens 


NM23-H8 


680 


100 


2108 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


127 


54 






Glycine max 


hydroxyproline-rich glycoprotein 


121 


36 


2110 


D26135 


Homo sapiens 


diacylglycerol kinase gamma 


172 


100 


2111 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


159 


56 




X65488 


Homo sapiens 


hnRNP U protein 


117 


70 


2113 


AL359782 


Trypanosoma 
bmcei 


probable similar to ring-h2 finger 
protein ihal a. 


116 


35 


2114 


AF130051 


Homo sapiens 


PRO0898 


93 


62 


2115 


AJ242540 


Yolvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


260 


53 


2116 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


148 


70 



184 



SEQID 
NO: 






Description 


Smith- 


Identit 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 




71 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
6720. 


70 


66 


2119 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


84 


2120 


S79410 


Mus musculus 


nuclear locali2ation signal binding 
protein 


94 


94 


2121 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similai" lo pogo element 


304 


73 


2122 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


83 


32 


2123 


AK025047 i Homo sapiens 


unnamed protein product 


116 


48 


2124 


Q03371 Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


72 


2125 


AF138883 


Bos taurus 


type II collogen cyanogen bromide 
fragment CB 10 


103 


40 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


186 


100 




G03114 Homo sapiens 


Human secreted protein, SEQ ID NO; 
7195. 








AF194537 Homo sapiens 




126 


66 


2129 


ABO 15 802 1 Acetobacter 
i xylinus 


similar to cellulose complementing 
protein of A. xylinum ATCC23869 


109 


80 


2130 


AF1S7147 1 Musmusculus 


drebrin A 


1 10 


38 




W54966 Homo sapiens 


Synthetic human type EI collagen 

SYN-C3. 


148 


42 


2132 


L36341 


Aspergillus 
nidulans 


regulatory protein 


130 


49 


2133 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


58 


2134 


API 16689 


Homo sapiens 


PR02168 


113 


81 




AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


191 


38 


2136 


M81321 


Macaca 
fascicularis 


proline-rioh protein 


125 


43 


2137 


A18812 


Brass ica napus 


extensm 


106 


34 




AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


104 


41 


2139 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


191 


92 


2140 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


196 


90 


2141 


Y57285 


Homo sapiens 


Human GPCR protein (HGPRP) 
sequence (clone ID 2214673). 


507 


78 




AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 




47 




AF090944 


Homo sapiens 


PRO0663 




53 


2144 
..... 


Y86573 


Homo sapiens 


Human gene 91-encoded protein 
fragment, SEQ ID NO:490. 


390 


76 


2145 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


198 


70 


^T46 — 


R96800 


Homo sapiens 




Ti9 


"82 


2147 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRQP 


194 


45 


2148 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7143. 


91 


60 


2149 


AF090942 


Homo sapiens 


PRO0657 


129 


66 


2150 


AF130089 


Homo sapiens 


PRO2550 


372 


S2 


2151 


AC009991 


Arabidopsis 


unknown protein 


81 


59 



185 



wo 01/64835 PCT/llSOl/04927 



SEQID 


Accession No. 


species 


Description 


Smith- 


% 


NO: 




















Score 


>■ 






thaliana 








2152 


AF090942 


Homo sapiens 


PRO0657 


93 


53 


2153 


R13319 


Homo sapiens 


Partial Human Natural Killer receptor. 


215 


89 


2154 


AC008075 


Arabidopsis 


F24J5.4 


139 


36 






thaliana 








2155 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


148 


50 


2156 


S79410 


Mus musculus 


nuclear locah'zation signal binding 


112 


58 








protein 






2157 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


66 


2158 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


85 


80 








4457. 






2159 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


75 








7339. 






2160 


W50193 


Homo sapiens 


Amino acid sequence of salivary 


145 


45 








protein CON-2. 






2161 


AF 119900 


Homo sapiens 


PR02822 


138 


71 


2162 


AJ223953 


Homo sapiens 


hPTTG 


106 


62 


2163 


AK023542 


Homo sapiens 


unnamed protein product 


76 


52 


2164 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


84 








4425. 






2165 


U41038 


Caenorhabditis 


Similar to cadherin-type repeat 


137 


65 






elegans 








2166 


Y91577 


Homo sapiens 


Human secreted protein sequence 


348 


88 








encoded by gene 2 SEQ ID NO;250. 






2167 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


108 


84 






brucei 


complete virion genome. 






2168 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


48 








5327. 






2169 


AP000616 


Ory za sativa 


similar to RING-H2 finger protein 


125 


67 








RHAla (AF078683) 






2170 


AF130089 


Homo sapiens 


PRO2550 


142 


65 


2171 


Y 19609 


Homo sapiens 


SEQ ID NO 327 from W09922243. 


103 


65 


2172 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


39 


2173 


G03114 


Homo sapiens 


Human secreted protein, SHQ ID NO: 


91 


50 








7195. 






2174 


D38435 


Homo sapiens 


homologue of yeast PMSl 


314 


96 


2175 


AFl 19851 


Homo sapiens 


PRO 1722 


230 


69 


2176 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


100 


63 








6065. 






2177 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


137 


44 








RHAla (AF078683) 






2178 


U23172 


Caenorhabditis 


sunilar to myoblast cell surface antigen 


139 


42 






elegans 


(SP:CS24_HUMAN, P23246) and D. 












melanogaster No-on-transient A protein 












(PIR: JHO 1 62) 






2179 


AL049608 


Arabidopsis 
thaliana 


extensin-lilce protein 


286 


57 


2180 


L17318 


Rattus 


proline-rich proteoglycan 


148 


40 






norvegicus 








2181 


AK024455 


Homo sapiens 


FLJ00047 protein 


97 


63 






Homo sapiens 


Neural thread protein. 




69 


2183 


AF266479 


Homo sapiens 


rectachrome 1 


148 


81 


2184 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


64 


70 








4658. 






2185 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


40 








6953. 






2186 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


84 








4409. 







186 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 


Wentit 


2187 


AFl 37273 


Gallus gallus 


alpha 1 (V) collagen 


103 


43 


-^11^ 


K03'^05 


Homo sapiens 


salivary proline-ricli protein precursor 


^^•^ 


36 




D90064 


Homo sapiens 


NCA-W272 






2190 


AF130089 


Homo sapiens 


PRO2550 


137 


68 




^^A^l 


Volvox carter! 


extensin 


11 J 


62 




Tj49973 


Homo sapiens 


ORFl; MER37: putative transposase 
similar to pogo element 




59 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
7868- 




59 






Homo sapiens 


mariner transposase 


257 


53 


2195 


B12310 


Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 


98 


79 


2196 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


180 


77 


2197 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


121 


65 


219 


Y36366 


-— ; 

Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 






2199 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


67 


2200 


AJ007042 


Homo sapiens 


TRX5 protein 


264 


75 


2201 


AF130051 


Homo sapiens 


PRO0898 


71 


61 




AK024455 


Homo sapiens 


FLJ00047 protein 


153 


62 


2203 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


95 


52 




G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 




82 




AF090895 


Homo sapiens 


PRODI 17 


163 


69 


2206 


G03806 


Homo sapiens 



Homo sapiens 


Human secreted protein, SEQ ID NO: 

7887. 


154 


69 


Neural thread protein. 


103 


86 


2208 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


78 


2209 


W48351 


Homo sapiens 


Human breast cancer related protein 


100 


65 


2210 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


111 


45 


2211 


AF090931 


Homo sapiens 


PRO0483 


63 


90 


2212 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


100 


74 


"22T3 


Y02693 


— ^ 

Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 






2214 
— — 


M7667 1 




Lycopersicon 
esculentum 


extensin (class II) 


137 


35 




j^g^g J 


omo sapiens 


r-j r=— j 3, 2: 

pot. unidentified reading n:ame 


~T^o 





^216 




omo sapiens 








^ttTt — 


AFl 18086 




PRO 1992 




79 


-=2Yg — 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 






2219 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


84 


2220 


AF141347 


Homo sapiens 


alpha-tubulin 


571 


94 




AF071172 


Homo sapiens 




187 


84 


2222 


M17783 


Homo sapiens 


glia-dertved nexin precursor 


529 


83 




AF081484 




alpha-tubulin isoform 1 






2224 


AK026072 


Homo sapiens 


unnamed protein product 


199 


57 


2225 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


161 


61 


2226 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


126 


41 


2227 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


124 


63 



187 



wo 01/64835 PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 




Description 


Smith- 
Watcrmai 


Identit 

y 




AF 11 9855 


Homo sapiens 


PRO 1847 


108 


74 


2229 


G03133 ' 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


55 


2230 


AF090944 


Homo sapiens 


PRO0663 


146 


61 


2231 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


152 


74 


2232 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


59 


2233 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


58 


2234 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


60 


2235 


AK024455 


Homo sapiens 


FLJ00047 protein 


133 


71 


2236 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 142. 


99 


59 




U87607 


norvegicus 


putative RNA binding protein 1 


111 


38 




G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


119 


63 




G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4438. 


75 


65 


2240 


L27428 


Homo sapiens 


reverse transcriptase 


136 


40 


2241 


R96800 


Homo sapiens 


Human liistiocyte-secreted factor HSF. 


122 


82 




G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


105 


95 


2243 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


79 


48 




G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


145 


75 


2245 


U09116 


Homo sapiens 


ORE 1, encodes a 40 kDa product 


346 


87 


2246 


U82303 


Homo sapiens 


unknown 


155 


79 


2247 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6536. 


105 


45 


2248 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


105 


47 


2249 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 


63 


2250 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


124 


62 




X81206 


Drosophila hydei 


histone H3.3 


101 


71 


2252 


AF155581 


Danio rerio 


proteasome subunit beta 7 


92 


52 


2253 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


172 


60 


2254 


AF084225 


Homo sapiens 


cytochrome P450 2E1 


114 


46 


2255 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


136 


64 


2256 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


147 


90 


2257 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


297 


59 


2258 


U93563 


Homo sapiens 


putative pl50 


218 


66 






Homo sapiens 


early lymphoid activation protein 


82 


55 


2260 


AF194537 


Homo sapiens 


NAG 13 


117 


56 






Peromyscus 
maniculatus 


reverse transcriptase 




82 


2262 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4447. 


86 


66 


2263 


AL0219I8 


Homo sapiens 


b34I8. 1 (Kruppel related Zinc Finger 
protein 184) 


372 


61 


2264 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine lich protein 


122 


71 



188 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

Identit 

y 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


123 


79 


2266 


Y86248 


Homo sapiens 



Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


204 


95 


^267 — 


^^gQ2g 


Chlamydomonas 
reinhardtii 


amino acid feature: Rod protein 
domain, aa266 .. 468; amino acid 
■feature: globular protein domain, aa 32 
.265 










Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 












start codon not included 






2270 


ABO 14554 


Homo sapiens 


KIAA0654 protein 


141 


63 




G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8148. 




60 


2272 


U4997J 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


118 


54 






Homo sapiens 


Human neurofilaraent-H mutant protein 
fragment 1 1. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 

7534. 




66 


2275 


AL J 59782 


Trypanosoma 
Mus^musculus 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


198 


64 


"^^276 


fj^^^^-j 




reverse transcriptase 






2277 


AJ004810 


Zea mays 


cytoclarome P450 monooxygenase 


129 


43 


2278 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


67 


2279 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


68 


2280 


L27428 


Homo sapiens 


reverse transcriptase 


150 


60 


_2281 




Homo sapiens 


salivary proline-rich protein precursor 


142 


40 




U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 






2283 


D00570 


Mus musculus 


open reading frame (251 AA) 


169 


56 






Homo sapiens 


Fragment of human secreted protein 

encoded by gene 90. 








AL359782 


Trypanosoma 
brucei 


possible (lihv-6) ul 102, variant a dna, 
complete virion genome. 






2286 


AFl 19901 


Homo sapiens 


PR0283 1 


116 




2287 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


73 


2288 


AF130089 


Homo sapiens 


PRO2550 


102 


75 


2289 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


70 


2290 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


127 


64 


"^291 


^26gS3 


Homo sapiens 


chromosomal protein 






2292 


AF161356 


Homo sapiens 


HSPC093 


100 


57 






Homo sapiens ■ 


Human sectetsd protein, SEQ ID NO; 

4409. 










Homo sapiens 


pot. ORF I 






2295 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


52 


2296 


AK024455 


Homo sapiens 


FLJ00047 protein 


98 


66 


2297 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


123 


73 


2298 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


349 


73 


2299 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


109 


61 



189 



wo 01/64835 PCT/llSOl/04927 



SEQID 




Species 


Sescription 


Smitli- 




NO: 








Waterman 


Identit 










Score 


y 


2300 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


81 








5327. 






2301 


AF251290 


Plasmodium 


glutamic acid-rich protein 


112 


40 




...... 


falciparum 








2j02 


AFl 55232 


Pisum sati\aim 


extensin 


89 


36 


2303 


AFl 30089. 


Homo sapiens 


PRO2550 


83 


50 






Mus musculus 


Wiscott-Aldrich Syndrome protein 


84 


51 








homolog 






2305 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


86 


30 


2306 


AP002031 


Arabidopsis 


geneJd:K3D20.3~ 


168 


44 






thaliana 








2307 


AF157321 


Homo sapiens 


30 kDa protein 


309 


64 


2308 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


57 








6613. 






2309 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


198 


42 


2310 


Y27607 


Homo sapiens 


Human secreted protein encoded by 


207 


100 








gene No. 41. 






2311 


AFl 30089 


Homo sapiens 


PRO2550 


106 


66 


2312 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


60 








4518. 






2313 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 


114 


80 








antigenic protein encoded by 












phageclone lambda HC2533. 






2314 


AB032910 


Hyiobates 


dopamine receptor D4 


108 


40 






muelleri 








2315 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


159 


59 








gene No. 107. 






2316 


AK024455 


Homo sapiens 


FLJ00047 protein 


134 


58 


2317 


AF238235 


Entamoeba 


diaphanous protein 


103 


51 






histolytica 








2318 


U15647 


Mus musculus 


reverse transcriptase 


101 


37 


2319 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


117 


51 


2320 


¥86248 


Homo sapiens 


Human secreted protein HCHPF68, 


163 


78 








SEQ ID NO: 1 63. 






2321 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


48 








7871. 






2322 


D00570 


Mus musculus 


open reading frame (196 AA) 


122 


39 


2323 


AP000616 


Ovyzd sativa 


similar to RING-H2 finger protein 


122 


75 








RHAla (AF078683) 






2324 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


79 








6720. 






2325 


U44838 


Glycine max 


cxtcnsin 


126 


36 


2326 


Y87103 


Homo sapiens 


Human .secreted protein sequence SEC) 


99 


59 








ID NO; 142. 






2327 


Y86573 


Homo sapiens 


Human gene 9 1 -encoded protein 


408 


79 








fragment, SEQ ID NO:490. 






2328 


AF194537 


Homo sapiens 


NAG 13 


181 


66 


2329 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


405 


75 








similar to pogo element 






2330 


X53581 


Rattus 


ORF4 


116 


45 






norvegicus 








2331 


L26953 


Homo sapiens 


chromosomal protein 


108 


67 


2332 


L27428 


Homo sapiens 


reverse transcriptase 


117 


76 


.2333 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


50 








4718. 






2334 


AF191687 


Homo sapiens 


alanine-glyoxylate aminotransferase 


100 


37 








homolog 






2335 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


84 


68 



190 



w o 01/64835 
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SEQID 


Accession No. 




Description 






NO: 








Watcrmaii 
Score 


Ideiitit 


2336 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


106 


53 






major 








2337 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


168 


65 








SEQID NO: 163. 






2338 


R95913 


Homo sapiens 


Neural thread protein. 


164 


67 


2339 


AF090942 


Homo sapiens 


PRO0657 


122 


81 


2340 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


90 








8159. 






2341 


AF090942 


Homo sapiens 


PRO0657 


124 


60 


2342 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


126 


83 








7124. 






2343 


AB013454 


Rattus 


NaPi-2 beta 


143 


77 






norvegicus 








2344 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


127 


73 








gene 44 clone HTDAD22. 






2345 


AJ242540 


Volvox carteri f. 


hydroxyproline-rich glycoprotein DZ- 


146 


45 






nagariensis 


HRQP 






2346 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


120 


40 


2347 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


55 








7124. 






2348 


Y36203 


Homo sapiens 


Human secreted protein #75. 


138 


78 


2349 


L26953 


Homo sapiens 


chromosomal protein 


131 


57 


2350 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


65 








6613. 






2351 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


144 


58 








terminal, clone XEH.Sc} 






2352 


AF130051 


Homo sapiens 


PRO0898 


155 


70 


2353 


G00357 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
4438. 


134 


76 


2354 


L27428 


Homo sapiens 


reverse transcriptase 


141 


71 


2355 


L26953 


Homo sapiens 


chromosomal protein 


128 


66 


2356 


W48351 


Homo sapiens 


Human breast cancer related protein 


148 


75 








BCRB2. 






2357 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


62 








7867. 






2358 


B12310 


Homo sapiens 


Human secreted protein encoded by 


115 


50 








gene 10 clone HDPGP94. 






2359 


M76546 


Helianthus 
anniius 


hydrox3^roline-rich protein 


103 


43 


2360 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


82 


80 








encoded by gene 17. 






2361 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


95 


72 








isoform 






2362 


M64793 


Rattus 


salivary proKne-rich protein 


117 


41 






norvegicus 








2363 


W03642 


Homo sapiens 


Human caiuiabiiioid GPR N-terminal 


98 


80 








sequence. 






2364 


AL451017 


Neurospora 


related to Ul SMALL NUCLEAR 


152 


45 






crassa 


RIBONUCLEOPROTEIN C 






2365 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


85 


55 








gene 22 clone HMSJWl 8. 










Homo sapiens 


Human secreted protein, SEQ ID NO; 


101 


77 








7871. 






2367 


AF093097 


Homo sapiens 


putative RNA-binding protein Q99 


248 


97 


2368 


U52077 


Homo sapiens 


mariner transposase 


227 


74 


2369 


X07882 


Homo sapiens 


Po protein 


102 


38 


2370 


U44838 


Glycine max 


extensin 


102 


32 


2371 


ABO 12223 


Canis familiaris 


ORF2 


158 


60 


2372 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


104 


42 



191 



wo 01/64835 
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SEQ ID 
NO: 


Accession No. 


Species 




Waterman 
Score 


Identlt 






elegans 


binding protein KIO (NID:g8148) 






2373 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


107 


40 


2374 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


106 


67 


2375 


AFl 18086 


Homo sapiens 


PR01992 


169 


65 


2376 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


95 


55 


Till 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


121 


59 


2378 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


141 


70 


2379 


R95913 


Homo sapiens 


Neural thread protein. 


120 


50 


2380 


U93572 


Homo sapiens 


putative pi 50 


124 


53 


2381 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


195 


66 


2382 


U52077 


Homo sapiens 


mariner transposase 


282 


67 


2383 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


113 


43 


2384 


AJ242540 


Volvox cartei i f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


112 


37 


2385 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


88 


75 


2386 


AF130089 


Homo sapiens 


PRO2550 


160 


62 


2387 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone m^MBQ32. 


166 


51 


2388 


AF130089 


Homo sapiens 


PRO2550 


103 


71 


2389 


AB027890 


Schizosaccliarom 
yces pombe 


Hypothetical protein 


116 


100 


2390 


1/93570 


Homo sapiens 


putative pi 50 


151 


54 


2391 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


110 


71 


2392 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7878. 


109 


67 


2393 


AE001381 


Plasmodium 
falciparum 


hypothetical protein 


94 


34 


2394 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


156 


62 


2395 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7133. 


57 


56 


2396 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


107 


43 


2397 


AF2 10651 


Homo sapiens 


NAG 18 


162 


54 


2398 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


116 


34 


2399 


S80864 


Homo sapiens 


cytochrome o-Iike polypeptide 


115 


68 


2400 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4446. 


141 


54 


2401 


M13100 


Rattus 
norvegicus 


unknown protein 


148 


46 


2402 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7874. 


133 


70 






Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250- 


366 


94 


2404 


AFl 13685 


Homo sapiens 


PRO0974 


112 


56 


2405 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


98 


67 


2406 


R41001 


Homo sapiens 


Human myotonic dystrophy gene 
protein. 


207 


68 


2407 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


127 


63 



192 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 




Species 


Description 




% 


NO: 








Waterman 
Score 


Idertit 

y 






major 


DNA, complete virion genome 






2408 


G03101 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


378 


95 








7182. 






2409 


Y91452 


Homo sapiens 


Human secreted protein sequence 


250 


86 








encoded by gene 2 SEQ ID NO;]25. 






2410 


U72520 


Mus musculus 


mena protein 


115 


40 


2411 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


132 


44 








RHAla(AF078683) 






2412 


AF095770 


Homo sapiens 


PTH -responsive osteosarcoma Dl 


109 


85 








protein 






2413 


AFl 19901 


Homo sapiens 


PR0283 1 


113 


66 


2414 


R95913 


Homo sapiens 


Neural thread protein. 


93 


57 


2415 


B01372 


Homo sapiens 


Neuron-associated protein. 


104 


83 


2416 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


50 








6913. 






2417 


J02459 


bacteriophage 


K(tail component; 199) 


720 


92 






lambda 








^418 


104694^" 


Mus musculus 


alpha-1 type IV collagen 


103 


43 


2419 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


61 








4463. 






2420 


W26496 


Homo sapiens 


CD2 associated intracellular protein 


115 


80 








CAIP LS02-21. 






2421 


AF000298 


Caenorhabditis 


weak similarity to collagens; glycine- 


145 


38 






elegans 


and proline-rich 






2422 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


73 








4523. 






2423 


Y86248 


Homo sapiens 


Human secreted protein 1ICHPF68, 


156 


70 








SEQIDNO:163. 






2424 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


61 








6561. 






2425 


Y30822 


Homo sapiens 


Human secreted protein encoded from 


95 


59 








gene 12. 






2426 


U93565 


Homo sapiens 


putative pl50 


147 


75 


2427 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


246 


53 






major 








2428 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


138 


61 


2429 


Y19767 


Homo sapiens 


SEQ ID NO 485 fixjm W09922243. 


112 


75 


2430 


D13892 


Homo sapiens 


carboxyl methyltransferase 


181 


68 


2431 


X97675 


Homo sapiens 


plakophilin 2b 


131 


65 


2432 


G02896 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


108 


70 








6977. 






2433 


X03145 


Homo sapiens 


pot. ORFIII 1 180 


82 


2434 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


77 


86 








6613. 






2435 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


421 


67 








similar to pogo element 






2436 


M64793 


Rattus 


salivary proline-rich protein 


111 


38 






norvegicus 








2437 


AF2 10651 


Homo sapiens 


NAG 18 


128 


72 


2438 


AFO 16099 


IVIus musculus 


endonuclease/reverse transcriptase 


232 


46 


2439 


AL 160493 


Leishmania 


probable (hhv-6) ul 102, variant a 


122 


53 








DNA, complete virion genome 






2440 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


80 








4457. 






2441 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


139 


54 








encoded by gene 121. 






2442 


AF090895 


Homo sapiens 


PRO0117 


115 


61 


2443 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


70 








4446. 







193 



SEQID 
NO: 


Accession No. 


species 


Description 


Waterman 


Hentit 


2444 


G00594 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4675. 


71 


80 


2445 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


81 


37 


2446 


AF090895 


Homo sapiens 


PROOl 17 


86 


64 


2447 


G00442 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
4523. 


103 


86 


2448 


W48351 


Homo sapiens 


Human b.'east cancer related protein 


94 


61 


2449 


L21936 


Homo sapiens 


succinate dehydrogenase flavoprotein 
subunit 


168 


91 




AF090942 


Homo sapiens 


PRO0657 


134 


71 


2451 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7339. 


126 


87 


2452 


K03179 


Homo sapiens 


pro-alpha- 1 type-I collagen 


120 


44 


2453 


AF 118082 


Homo sapiens 


PRO1902 


143 


46 


2454 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


114 


68 


2455 


AF 130089 


Homo sapiens 


PRO2550 


125 


89 


2456 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4413. 


134 


71 


2457 


Y36243 


Homo sapiens 


Human secreted protein encoded by 
gene 20. 


93 


75 


2458 


Y09945 


Rattus 


putative integral membrane transport 
protein 


166 


46 


2459 


AF130052 


Homo sapiens 


PRO0956 


75 


48 


2460 
2461 " 


G00328 

R96800" 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


162 


76 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


123 


69 


2462 


AL390114 


Leishmania 
major 


probable Chhv-6) ul 102, variant a 
DNA, complete virion genome 


157 


48 


2463 


X83413 


Human 
herpesvirus 6 


U88 


236 


50 


2464 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4598. 


181 


53 


2465 


X83413 


Human 
herpesvirus 6 


U88 


218 


50 


2466 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 


147 


57 


2467 


X83413 


Human 
herpesvirus 6 


U88 


196 


53 


2468 

2469 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


165 


61 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


101 


46 


2470 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


221 


56 


2471 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4508. 


136 


63 


2472 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


122 


94 




AF161361 


Homo sapiens 


HSPC098 


120 


60 


2474 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


142 


40 


2475 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SKQIDNO:163. 


130 


58 


2476 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6596. 


78 


48 


2477 


AL 160371 


Leishmania 


probable (hhv-6) ul 102, variant a 


86 


46 



194 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 






major 


DNA, complete virion genome 








GO 1478 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5559. 


96 


66 


2479 


AL033545 


Arabidopsis 
thaliana 


extensin-like protein 


114 


40 


2480 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


142 


59 


2481 


AL390114 


Leishmania 
major 


probable (hhv-6) ull02, variant a 
DNA, complete virion genome 


146 


62 




AL 1 32902 


Caenorhabditis 
elegans 


Y71A12B.4 




37 




AF 154502 


Homo sapiens 


quiescent cell proline dipeptidase 


439 


88 




AFO 10326 


Drosophila 
melanogaster 


short fom of CHIP 


56 


42 




W80406 


Homo sapiens 


A secreted protein encoded by clone 
dli40 3. 


1 19 


50 


2486 


W31186 


Homo sapiens 


Human p 160 polypeptide 160.2. 


115 


60 


2487 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


103 


74 


2488 


L26953 


Homo sapiens 


chromosomal protein 


129 


68 


2489 


Y21418 


Homo sapiens 


Human high mobility group protein 
HMGI-C mutant fragment 2. 


96 


46 


"2490 ~ 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


366 


94 


2491 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


158 


76 


2492 


U12707 


Homo sapiens 


Wiskott-Aldrich syndrome protein 


144 


47 


2493 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


117 


81 






Plasmodium 
vivax 


pval 


116 


64 


2495 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


92 


64 


2497 


AF161483 


Homo sapiens 


HSPC134 


„ 

211 


78 




X05006 


Homo sapiens 


S-protein 


298 


92 


2499 


AB032911 


Hylobates agilis 


dopamine receptor D4 


89 


42 




M98502 


Mus musculus 


pMLZ-4 


325 


90 


2501 


Y94920 


Homo sapiens 


Human secreted protein clone 
pm412_l 2 protein sequence SEQ ID 


569 


85 


2502 


AB049054 


Homo sapiens 


brain link protein-1 


224 


90 




AF064604 


Homo sapiens 


KE03 protein 


224 


37 


2504 


Z69727 


Schizosaccharom 
yces pombe 


putative dna-directed ma polymerase iii 
130 kd polypeptide (ec 2.7.7.6) 


384 


61 


2505 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


147 


43 


2506 


A.TO 10604 


Mus musculus 


L-Sox5 protein 


366 


87 


2507 


X83413 


Human 
herpesvirus 6 


U88 


203 


46 




A.T277425 


Globodera 
pallida 


putative cuticular collagen 


125 


40 




AB017919 


Homo sapiens 


peptidylarginine deiminase type V 






2510 


AF001947 


Homo sapiens 


U4AJ6-associated RNA splicing factor 


583 


97 


2511 


AJ238520 


Homo sapiens 


putative transcription factor-like 
nuclear regulator 


722 


100 


2512 


AL3 90736 


Homo sapiens 


b A209J 1 9. 1 . 1 (G Wl 1 2 protein) 


557 


84 


2513 


D84223 


Homo sapiens 


leucyl tRNA synthetase 


1113 


100 


2514 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


174 


56 



195 



/;/E:A^¥O01 84835 cpd 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identit 










Score 


y 


2515 


U60803 


Homo sapiens 


clathrin heavy chain 2 




92 


2516 


AJ388557 


Canis familiaris 


zinc finger protein 


826 


56 


2517 


AB02725 1 


Homo sapiens 


zinc finger protein (ZFD25) 


631 


85 


2518 


AK023160 


Homo sapiens 


unnamed protein product 


168 


54 


2519 


D87326 


Mus musculus 


GSG2 


575 


73 


2520 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


42 








7452. 






2521 ■ 


X67688 


Homo sapiens 


transketolase 


120 


66 


2522 


AF074086 


Homo sapiens 


protease 


390 


86 


2523 


AF220509 


Homo sapiens 


transcription associated fector 


801 


99 








TAFIBIL 






2524 


AL078463 


Homo sapiens 


dJ365I19.1 {K1AA0456) 


374 


92 


2525 


AF038995 


Mus musculus 


putative RNA helicase RCK 


160 


93 


2526 


M60618 


Homo sapiens 


nuclear autoantigen 


116 


75 


2527 


G03133 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO: 


101 


57 








7214. 






2528 


X68790 


Homo sapiens 


Bactericidal /Permeability Increasing 


136 


90 








Protein 






2529 


A01592 


Homo sapiens 


haemoglobin A beta chain 


508 


94 


2530 


U89277 


Homo sapiens 


polyhomeotic 1 homolog 


404 


79 


2531 


X90845 


Rattus 


alphall spectrin 


507 


87 






norvegicus 








2532 


AL137081 


Arabidopsis 


phenylalanine-tRNA synthetase-like 


178 


33 






thaliana 


protein 






2533 


Y 18046 


Homo sapiens 


FGFRI oncogene partner (FOP) 


363 


100 


2534 


W54966 


Homo sapiens 


Synthetic human lype III collagen 


113 


42 








SYN-C3. 






2535 


X61047 


Hydra sp. 


mini-collagen 


102 


38 


2536 


AB006330 


Mus musculus 


S0X5 


559 


94 


2537 


AC006283 


Arabidopsis 


En/Spm-like transposon protein 


149 


33 






thaliana 








2538 


Af 196779 


Homo sapiens 


JMII protein 


139 


57 


2539 


AK.000741 


Homo sapiens 


unnamed protein product 


233 


47 


2540 


¥18046 


Homo sapiens 


FGFRI oncogene partner (FOP) 


315 


84 


2541 


U54996 


Homo sapiens 


HZWIO 


371 


81 


2542 


G0263 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


45 








6712. 






2543 


M95610 


Homo sapiens 


alpha-2 IX collagen 


93 


34 


2544 


AF071173 


Mus musculus 


Herc2 


349 


82 


2545 


AB000516 


Homo sapiens 


DSIF pi 60 


600 


93 


2546 


AL355178 


Homo sapiens 


dJ947L8.1.6 (novel CUB and Sushi 


395 


90 








(SCR repeat) domain protein) 






2547 


U93574 


Homo sapiens 


putative pi 50 


235 ~" 


76 


2548 


X8G035 


Oi'yctolagus 


cysteine rich hair keratin associated 


99 


31 






cuniculus 


protein 






2549 


U58088 


Homo sapiens 


Hs-CUL-2 


592 


86 


2550 


L26953 


Homo sapiens 


chromosomal protein 


131 


63 


2551 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


61 








4435. 






2552 


U39412 


Homo sapiens 


alpha SNAP 


159 


54 




AF248651 


Homo sapiens 


RNA-binding protein BRUN0L4 


569 


93 


2554 


AJ006519 


Rattus 


ASIC-beta 


164 


73 






norvegicus 








2555 


U93570 


Homo sapiens 


putative pi 50 


169 


52 


2556 


AFl 04328 


Arabidopsis 


cell wall-plasma membrane linker 


105 


29 






thaliana 


protein homolog 






2557 


AB028975 


Homo sapiens 


KIAA1052 protein 


610 


83 


2558 


AJ245621 


Homo sapiens 


CTL2 protein 


286 


47 



196 



wo 01/64835 PCT/llSOl/04927 



SEQ ID 




Species 


Description 






NO: 








Waterman 
Score 


Identit 

y 


2559 


AF090944 


Homo sapiens 


PRO0663 


174 


78 


2560 


Y17832 


Human 


pol protein 


411 


63 






endogenous 












retrovirus K 










AF109907 


Homo sapiens 


S164 


279 


44 


2562 


^09561 


synthetic 


human serum albumin 


680 


91 






construct 








2563 


U4S359 


Gallus gallus 


kinesin light chain 


575 


81 




R63235 


Homo sapiens 


CNS neural thread protein HB4. 


435 


88 


2565 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


623 


89 


2566 


AF154916 


Chlamydomonas 


variable flagellar number protein 


117 


36 






reinhardtii 








2567 


AF181657 


Drosophila 


BcDNA.LD34475 


261 


42 






melanogaster 








2568 


M12530 


Homo sapiens 


transferrin precursor 


693 


87 


2569 


U96915 


Homo sapiens 


sin3 associated polypeptide pl8 


651 


93 


2570 


R74205 


Homo sapiens 


Human death associated protein DAP- 

2. 


285 


27 


2571 


AL078593 


Homo sapiens 


dJ210Bl.l (KIAA0680) 


107 


37 


2572 


AF293405 


Phaseolus 


seed-miax)pylar-endotlieIium-specific 


88 


39 






coccrneus 


protein 






2573 


G03 1 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


49 








7195. 






2574 


AB024601 


Pseudomonas 


uridylyl transferase 


120 


37 






aeruginosa 








2575 


AB050893 


Anadara nodifera 


cytochrome c oxidase subunit 1 


111 


79 


2576 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


130 


72 








7253. 






^77^ 


M17697 


Homo sapiens 


glutamate dehydrogenase 


369 


61 


2578 


AF045640 


Caenorhabditis 


C11D2.4 gene product 


264 


43 






elegans 








2579 


U71382 


Homo sapiens 


OB binding protein- 1 


273 


86 


2580 


AF116661 


Homo sapiens 


PRO 143 8 


114 


61 


2581 


AK000496 


Homo sapiens 


unnamed protein product 


153 


55 


2582 


A67510 


Mus musculus 


MUS MUSCIIIIJS GHNOMIC DNA 


190 


38 








CONTAINING N ALLELE OF FVl 


















2583 


Y44851 


Homo sapiens 


Human CD39-L66 protein. 


205 


97 


2584 


AB029151 


Homo sapiens 


D29 


253 


73 


2585 


AF039023 


Homo sapiens 


Ran-GTP binding protein; RanBP6 


765 


94 


2586 


AF054180 


Homo sapiens 


hematopoietic cell derived zinc finger 


116 


34 








protein 






2587 


AF283645 


Homo sapiens 


folate transporter/carrier 


580 


82 


2588 


AF090930 


Homo sapiens 


PRO0478 


132 


67 


2589 


AC002339 


Arabidopsis 
thaliana 


putative ABC transporter 


238 


34 


2590 


M34668 


Homo sapiens 


protein tyrosine phosphatase (EC 


627 


86 








3.1.3.48) 






2591 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO. 


141 


63 








6613. 






2592 


W80406 


Homo sapiens 


A secreted protein encoded by clone 


142 


63 














2593 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


146 


54 








oncoprotein. 






2594 


AF047437 


Homo sapiens 


sperm acrosomal protein 


526 


74 


2595 


U28789 


Mus musculus 


PACT 


528 


71 


2596 


AB0023 66 


Homo sapiens 


KIAA0368 


615 


83 


2597 


AL049610 


Homo sapiens 


dJ1055C14.2(KIAA0026 


704 


93 








(transcription factor-like protein 







197 



SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 
Score 


Identit 

y 




- -. 




MRGX)) 






2598 


GO 1279 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


385 


79 









^^^'^ 






oo 

2599 


AF090895 


: 

Homo sapiens 


PRODI 17 


142 


61 


2600 


D29763 


Mus musculus 


seizure-related gene product 6 


119 


76 


— — 






precursor 









U20158 


Homo sapiens 


NAnlf ri iTh 




53 







Homo sapiens 


Tvt A nM su unit 


— = 


— ^ 


— — 




omo sapiens 


— — ehydrogenase subunit2 






— — 


L24804 


Homo sapiens 


p23 


0 

259 


62 


2605 


Y12102 


Homo sapiens 


Human 5' EST secreted protein SEQ ID 


132 


92 








NO. 415. 






— — 

2606 





— : 

Homo sapiens 


SEC14L 


614 


81 


^|5Z 


AF202635 


Homo sapiens 


PP1200 — — ^ 





_47 




AF277374 


Homo sapiens 


enhancer of polycomb 




85 




X03484 


Homo sapiens 


raf protein (aa 1-648) 


600 






U 15637 


Homo sapiens 


CD40 binding protien 


465 


82 


2611 


U40265 


Trypanosoma 


ATPase subunit 6 


98 


31 




. . . 


cruzi 


. 






^11= 


AF090942 


Homo sapiens 


PRO0657 








V00662 


Homo sapiens 


cytochrome oxidase I 


05 




— — 

— 


U46920 


— omo sapiens 


metaxm 






2615 


V 19757 


Homo sapiens 


SEQ ID NO 475 from W09922243. 




— -'^ - 


— 


2616 _ 


AK025047 


Homo sapiens 


unnamed protein product 


173 


58 


2617 


D13866 


Homo sapiens 


alpha-catenin 


569 


96 


2618 


U20536 


Homo sapiens 


cysteine protease Mcli2 isofotrn alpha 


588 


87 


2619 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 




86 


2620 


AF190168 


Homo sapiens 


serum albumin precursor 


522 


78 


2621 


A06977 


Homo sapiens 


albumin 


607 


93 


2622 


R14584 


Homo sapiens 


TGF beta 1 binding protein encoded by 


339 


81 








clone BPA 13. 






2623 


X56698 


Xenopus laevis 


42Sp48 


1 17 


47 


2624 


W48351 


Homo sapiens 


Human breast cancer related protein 


131 


57 








BCRB2. 






2625 


X07881 


Homo sapiens 


proline-rich protein Gl 


1 16 


32 


2626 


W48351 


Homo sapiens 


Human breast cancer related protein 


89 


58 







.. 


BCRB2. 






2627 


A00279 


synthetic 


Human senini albumm 


564 


95 






construct 








— 


AJS-UZiOi J 


Homo sapiens 


-j 

unnamed protein product 




70 


629 


M23613 


Homo sapiens 


nucleophosmin 


486 


82 


2630 


AF243424 


Homo sapiens 


SG2NA beta isofonn 


256 


98 


2631 


G00506 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


60 






. ... 








^11? 


^'^.flll 


Homo sapiens 


albumin 


457 


74 






Homo sapiens 


albumin 






634 


M8 1088 


Rattus 


EF-1 -alpha 


165 


68 


- 




norvegicus 










AL3 39587 


Homo sapiens 


- 

hypothetical protein 


496 


82 






Homo sapiens 


serum albiuniiL 


576 


91 





G02872 




Human secreted protein, SEQ ID NO: 












6953. 






2638 


U93563 


Homo sapiens 


putative pi 50 


186 


38 


2639 


U68729 


Meloidogyne 


cuticle preprocoUagen 


113 


34 






incognita 








2640 


AB033056 


Homo sapiens 


jaAA1230 protein 


269 


94 


2641 


AF116712 


Homo sapiens 


PR02738 


115 


61 


2642 


W48353 


Homo sapiens 


Human breast cancer related protein 


124 


72 



198 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Watcrman 


Idenm 








BCFLT2. 






2643 


G02455 


Homo sapiens 


6536. 


~98 " 


~S4 


2644 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


127 


64 


2645 


AK025116 


Homo sapiens 


unnamed protein product 


104 


64 


2646 


X92485 


vivax 


pval 




47 


2647 


AF130089 


Homo sapiens 


PRO2550 


122 


~51 


2648 


AFl 94537 


Homo sapiens 


NAG 13 


279 


83 








encoded by gene 17. 












7519. 






2651 


AFl 16661 




PRO 143 8 






2652 


U63542 


Homo sapiens 


FA? protein 


128 


79 




AL3 59782 


bmcei 


possible (hliv-6) ul 102 variant a dna 
complete virion genome. 








Y85062 




homologue (ICEL) protein sequence. 












Human secreted protein, SEQ ID NOi 

6613. 








R07057 




(hhcm) gene preoduct. 








G00637 




4718. ^ 






2658 


B25722 


Homo sapiens 


encoded by gene 9 SEQ ID NO"l 1 1 


91 


88 


~2659 


X92485 


Plasmodium 


pval 

B 


141 


60 


"1660 


y^g^gg 


Homosa iens 


tfomm secreted protein encoded by 


"68 


"68 


2661 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12 ^ 


"96 


19 


2662 


AF090931 


Homo sapiens 


PRO0483 


114 


74 


2663 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


129 


57 


2664 


AF090930 


Homo sapiens 


PRO0478 


Tto 


"78 


2665 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


460 


77 


2666 


D38112 











2667 


AF090931 


Homo sapiens 


PRO0483 




-~- 


2668 


AF010144 






in 


~66 ~ 


2669 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


169 


73 


2670 


AF194537 


Homo sapiens 


NAG 13 


170 


~47 


2671 


X86003 


Rattus 


neuron-derived orphan receptor 


104 


50 


2672 


U63542 


Homo sapiens 


FAP protein 


139 


"75 


2673 


AF090930 


Homo sapiens 


PRO0478 




~69 


2674 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


161 


75 


2675 


AK024455 


Homo sapiens 


FLJ00047 protein 


105 


53 


2676 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


142 


67 


2677 


AK000496 


Homo sapiens 


uimamed protein product 


110 


67 


2678 


AFl 16715 


Homo sapiens 


PR02829 


151 


78 


2679 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


160 


72 


2680 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


81 



199 



SEQID 






Description 


Smith- 




NO: 








Waterman 
Score 


Identit 

y 








6613. 






2681 , 


Y76198 


Homo sapiens 


Human secreted protein encoded by 


131 


IS 








gene 75. 






2682 


U76604 


Homo sapiens 


1 80 kDa bullous pemphigoid antigen 


96 


37 








2/type XVII collagen 






2683 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


85 


55 


2684 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


80 








4488. 






2685 


AF090895 


Homo sapiens 


PRODI 17 


81 


51 


2686 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


52 








7871. 






2687 


M64793 


Rattus 


salivary proline-rich protein 


122 


41 






norvegicus 








2688 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


145 


71 


2689 


AFl 30089 


Homo sapiens 


PRO2550 


149 


78 


2690 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


102 


47 








clone HPMBQ32. 






2691 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


74 








7124. 






2692 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


60 








7145. 






2693 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


106 


26 


2694 


AE001158 


Borrelia 


conserved hypothetical integral 


81 


37 






burgdorferi 


membrane protein 






2695 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


76 








7867. 






2696 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


114 


100 








RHAla (AF078683) 






2697 


AF194537 


Homo sapiens 


NAG13 


126 


90 


2698 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


69 








7795. 






2699 


U93563 


Homo sapiens 


putative pl50 


138 


37 


2700 


AF090928 


Homo sapiens 


PRO0470 


177 


57 


2701 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


75 










7795. 






2702 


X92485 


Plasmodium 
vivax 


pval 


116 


50 


2703 


AK024455 


Homo sapiens 


FLJ00047 protein 


103 


57 


2704 


D38114 


Gorilla gorilla 


NADH dehydrogenase subunit 2 (ND2) 


137 


81 


2705 


AF090895 


Homo sapiens 


PRO0117 


105 


51 


2706 


AF090944 


Homo sapiens 


PRO0663 


121 


45 


2707 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


57 








7344. 






2708 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


85 








7253. 






2709 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


283 


84 


2710 


AF130079 


Homo sapiens 


PR02852 


125 


48 


2711 


M22334 


Homo sapiens 


unknown protein 


131 


50 


2712 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


58 








4497. 






2713 


S80119 


Rattus sp. 


reverse transcriptase homolog 


230 


57 




AF09093 1 


Homo sapiens 


PRO0483 


98 


85 


2715 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


79 








5327. 






2716 


L00016 


Homo sapiens 


urf5 


295 


93 


2717 


Y02693 


Homo sapiens 


Human secreted protem encoded by 


135 


82 








gene 44 clone HTDAD22. 






2718 


M81321 


Macaca 


proline-rich protein 


117 


38 






fescicularis 









200 
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SEQ ID 
NO: 


Accession No. 




Description 


Score 


% 

Identit 


2719 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


57 


2720 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhal a. 


147 


56 


2721 




Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


132 


67 




Y02749 


_ , 

omo sapiens 


Human secreted protein encoded by 
gene 100 clone HN'F1U96 




129 


56 


^723 


pgg^ 


Homo sapiens 







~ 


2724 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO' 1 63 


182 


79 


2725 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO*163 


195 


92 


2726 


W97293 


Homo sapiens 


An annexin binding protein (NABP-1). 


193 


40 




U12690 




cytochrome oxidase subunit II 






2728 


V00662 


Homo sapiens 


cytochrome oxidase I 


500 


92 




W03 642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 








AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 






2732 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


149 


51 




AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 




44 


2734 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


288 


81 


2735 


Z74036 


Caenorhabditis 
e^^ans 


contains similarity to Pfam domain; 
PFO 1391 (Collagen triple helix repeat 
(20 copies)), Score=58.9, E- 
value-3.5e-14,N-3 — — 


117 


37 


^736 


G04063 


Homosa lens 

omo sapiens 


Human secreted protem, SEQ ID NO: 
8144. 


- 







AK024455 


Homo sapiens 


FLJ00047 protein 





60 




G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO." 
4658. 








AP002460 


thaliana 


gene_idiFlD9.26~uiikn.own protein 










Pan troglodytes 


cytochrome c oxidase subunit 1 






2741 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


143 


64 


2742 


L27428 


Homo sapiens 


reverse transcriptase 


119 


45 






Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 










Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 






2745 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


51 


2746 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63 . 


159 


75 


^747 — 


G00437 


Homo sapiens 


4518. 


162 


— 


2748 


W48351 


^Hoino sapiens 


Human breast cancer related protein 
BCRB2. 


107 


49 


2749 


AK024455 


Homo sapiens 


FLJ00047 protein 


165 


69 


2750 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


167 


72 


2751 


M22332 


Homo sapiens 


unknown protein 


208 


46 



201 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Idcntit 

y 


^752 


lRl3556 


— : 

xiomo sapiens 


Protem encoded downstream or hhc__M 
oncoprotein. 






2753 


Rla556 


Homo sapiens 


Protein encoded downstream of hhcJVI 

mostV" 


150 


65 


~T754 


AF220264 


Homo sapiens 




T^o 


32 


2755 


AK024455 


Homo sapiens 


FLJ00047 protein 


128 


58 





AF13005 1 


Homo sapiens 


PRO0898 


162 


73 






Homo sapiens 


NADH dehydrogenase subunit 3 


259 


86 


2 58 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7795. 








AJ243666 


Homo sapiens 


NICE-5 protein 






2760 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


533 


90 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

6563. . 






2762 


U 1 2690 


Homo sapiens 


cytochrome oxidase subunit II 


257 








Homo sapiens 


HTR.VI clone 438283 protein sequence. 






2764 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJVI 
oncoprotein. 


156 


45 


2765 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


58 


2766 


X92485 


Plasmodium 


pval 


149 


53 


2767 


AFl 13685 


Homo sapiens 


PRO0974 


131 


63 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 






2769 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


154 


71 


2770 

... 


X97707 
. .-. - - 


Pongo pygmaeus 
- 


stopcodon created by 
posttranscriptional polyadenylation 


185 


87 


2771 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18 


110 


59 


2772 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


60 


2773 


AF090944 


Homo sapiens 


PRO0663 


118 


62 






Homo sapiens 


chromosomal protein 






2775 


AF090930 


Homo sapiens 


PRO0478 


127 


59 


"277^ — 


AK024455 


Homo sapiens 


FLJ00047 protein 


154 


... 






Homo sapiens 


ApoE4Lx2 protease. 






2778 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. _ 


101 


55 


T779 — 


YQ'y^-j^ 


-n ■■ 

Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJWI8. 






2780 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


56 


^782 


j-^ggj^^ 


— 

HonioTa'^iens^^ — 


NADH d h H h — Ta 


-— 


— 


^783 


Y02749 


omo sapiens 


Human secreted^roteiif encoded by 

gene 100 clone HNFIU96. 


124 


68 


"^715 — 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 







U43360 


Peromyscus 
maniculatus 


reverse transcriptase 






-^^1^ — 


W58700 


Homo sapiens 


Human ST-1 partial sequence. 


"T^ 






R96800 








~w — 


2788 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


519 


76 


2789 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


104 


44 


2790 


AF130089 


Homo sapiens 


PRO2550 


121 


39 


2791 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


135 


63 


2792 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


50 



202 



w o 01/64835 



PCT/llSOl/04927 



SEQID 


Accession No. 






Smith- 




NO: 








Waterman 


Identit 










Score 


y 








7133. 






2793 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


153 


80 


2794 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


461 


84 


2795 


AF090931 


Homo sapiens 


PRO0483 


114 


76 


2796 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


53 








7870. 






2797 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


114 


78 


2798 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


71 








7871. 






2799 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 13 


79 








6596. 






2800 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 


117 


48 








clone HNHF029. 






2801 


X92485 


Plasmodium 
vivax 


pval 


97 


80 


2802 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


471 


80 


2803 


AF130051 


Homo sapiens 


PRO0898 


118 


78 


2804 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


73 








7437. 






2805 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


143 


69 








gene 100 clone HNFIU96. 






2806 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


62 








7496. 






2807 


K02401 


Homo sapiens 


chorionic somatomammotropin 


543 


88 


2808 


Y18522 


Corvus 


cytochrome oxidase subunit I 


382 


60 






frugilegus 








2809 


G01194 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


555 


93 








5275. 






2810 


BOS 148 


Homo sapiens 


Human neuronal differentiation factor- 


399 


77 








1 (NDF-1). 






2811 


AF007826 


Homo sapiens 


bax epsilon 


94 


39 


2812 


AFC 10 144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


220 


67 


2813 


U12690 


Homo sapiens 


cytochrome oxidase subunit n 


495 


81 


2814 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


48 






7492. 






2815 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


75 








7339. 






2816 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


437 


83 


2817 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


66 








7878. 






2818 


X55733 


Homo sapiens 


initation factor 4B 


458 


80 


2819 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


65 








7882. 






2820 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


347 


68 


2821 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


62 


2822 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


118 


54 








oncoprotein. 






2823 


AF 130079 


Homo sapiens 


PR02852 


143 


54 


2824 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


52 








4409. 






2825 


Y76198 


Homo sapiens 


Human secreted protein encoded by 


161 


67 














2826 


AFl 19851 


Homo sapiens 


PRO 1722 


104 


70 


2827 


D49489 


Homo sapiens 


human P5 


523 


91 


2828 


Ml 0546 


Homo sapiens 


cj-tochrome oxidase I 


206 


81 


2829 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


138 


42 








oncoprotein. 






2830 


AF090931 


Homo sapiens 


PRO0483 


97 


80 


283] 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


65 



203 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Watcrman 
^core 


% 

Idcntit 

y 








7871. 






2832 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


112 


35 


2833 


X92485 


Plasmodium 
vivax 


pval 


103 


43 






Homo sapiens 


Rpl5-TIA-1. 


152 


54 


2835 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4438. 


153 


59 


2836 


W48351 


Homo sapiens 


Human breast cancer related protein 

BCRB2, 


148 


59 


2837 


AF090931 


Homo sapiens 


PRO0483 


149 


81 


2838 


AF130089 


Homo sapiens 


PRO2550 


123 


49 


2839 


AF090942 


Homo sapiens 


PRO0657 


135 


54 


2840 


G037S7 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7868. 


124 


54 


2841 


AF090895 


Homo sapiens 


PRO0117 


146 


55 


2842 


API 32200 


Homo sapiens 


PRO 1751 


98 


74 


2843 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


194 


75 


2844 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4497. 


61 


59 


2845 


ALj90114 


Leishmania 
major 


extremely cysteine/valine rich protein 


171 


37 


2846 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


155 


69 


2847 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


141 


58 


2848 


G02532 


Homo sapiens 


Human secreted protein, SKQ ID NO: 
6613. 


108 


65 


2849 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


441 


77 


2850 


D38I12 


Homo sapiens 


cytochrome c oxidase subunit 1 


490 


88 


2851 


K02401 


Homo sapiens 


chorionic somatomammotropin 


488 


90 


2852 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7339. 


110 


49 


2853 


U83280 


Leistimania 
donovani 


39 kDa antigen 


131 


47 


2854 


AF090944 


Homo sapiens 


PRO0663 


173 


72 


2855 


V00662 


Homo sapiens 


ATPase 6 


392 


71 


2856 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


147 


50 


2857 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


113 


76 


2858 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


178 


84 


2859 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


192 


92 


2860 


Z81068 


Caenorhabditis 
elegans 


contauis similarity to Pfam domdn: 
PF00102 (Protein-tyrosme 
phosphatase), Score=232.1, E- 
value=2.6e-66, N=l~cDNA EST 
yk299h6.3 comes from this 
gene-cDNA EST yk:420b4.3 comes 
from this gene-cDNA EST yk439g6.3 
comes from this gene-cDNA EST 
yk299h6.5 comes from this 
gene-cDNA EST yk420b4.5 comes 
from this gene-cDNA EST yk439g6.5 
comes from this gene 


104 


36 


2861 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


444 


85 


2862 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


168 


61 


2863 


U93564 


Homo sapiens 


putative pl50 


132 


34 



204 



wo 01/64835 PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


Identit 

y 


2864 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dli40 3. 


152 


50 


2865 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


124 


76 


2866 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


145 


46 


2867 


AF09093O 


Homo sapiens 


PRO0478 


161 


61 






Homo sapiens 


Human secreted protein encoded from 

gene 12. 




48 


2869 


U15647 


Mus musculus 


reverse transcriptase 


197 


40 




U09500 


Homo sapiens 


cytochrome b 






2871 


U09500 


Homo sapiens 


cytochrome b 


614 


92 






Homo sapiens 


NADH dehydrogenase subunit 4 


516 


87 


2873 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


458 


84 


2874 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


516 


92 


2875 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


87 


55 


2876 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


624 


96 


2877 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


160 


58 


2878 


Lf32174 


Dictyostelium 
discoideum 


non-receptor tyrosine kinase 


120 


33 


2879 


D38I12 


Homo sapiens 


ATPase subunit 6 


141 


54 


2880 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


140 


84 


2881 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


93 


79 


2882 


V00662 


Hdmo sapiens 


ATPase 6 


130 


87 


2883 


AL390114 


Leishmania 
major 


probable proteophosphoglycan 


117 


50 


2884 


AFn9851 


Homo sapiens 


PR01722 


79 


55 


2885 


X92485 


Plasmodium 
vivax 


pval 


106 


48 


2886 


V00662 


Homo sapiens 


ATPase 6 


127 


87 


2887 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


191 


83 




G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


138 


58 


2889 


X92485 


Plasmodium 


pval 


107 


63 


2890 


Ml 0546 


Homo sapiens 


cytochrome oxidase 1 


163 


89 


2891 


AF090942 


Homo sapiens 


PRO0657 


72 


37 


2892 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4497. 


136 


56 


2893 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


139 


66 


2894 


AK024455 


Homo sapiens 


FLJ00047 protein 


139 


57 


2895 


AL3 59782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


111 


44 


2896 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


182 


62 


2897 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


73 


2898 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


248 


53 


2899 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


366 


68 


2900 


X07882 


Homo sapiens 


Po protein 


146 


41 


2901 


V00662 


Homo sapiens 


cytochrome B 


517 


91 


2902 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


378 


67 


2903 


AF090931 


Homo sapiens 


PRO0483 


117 


67 


2904 


AF003540 


Homo sapiens 


Krueppel family zinc finger protein 


99 


54 



205 



wo 01/64835 PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


Identit 


-^22^ — 


AF 13 0079 


Homo sapiens 


PR0^85^ 





^ 


" ^ — 


D381 12 


omo sapiens 




~T7^ 


79 


" — 


AF202635 


omo sapiens 


PPl^OO 


"TT^ 


66 




AF041330 


0 


NADH dehydrogenase subunit 5 





41 


— 

— - — 




omo sapiens — 


FAP protein — • rr-r\j\ — 




J2 


^Itt — 


D38114 


Oorula gonJla 


c3/tochronie c oxidase subunit 1 (COI) 










AP002031 


Homo'sa iens 


gene id'K3D20 3-' 


8 


35 


^^2912 


j-^^g^^^ 




cytochrome c oxidase subunit 1 


^35 


80 


2913 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


459 


83 


"^§t7 — 


l^^^ll^ 


Homo sapiens 


cytochrome c oxidase subunit 3 


_480 


_80 








Homo sapiens 


cytochrome c oxidase subunit 1 


4t5 — 










cytochrome c oxidase subunit 3 




80 


"291? 





Homo sapiens 


cytochrome c oxidase subunit 3 


"T^^ 




2918 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


512 


75 


2919 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


458 


79 


2920 


G03797 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

... 


152 


65 




V00662 


Homo sapiens 


LRF 2 (NADH dehydrogenase subunit) 


J 24 


87 


2922 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


530 


84 






Homo sapiens 


30 i<Da protein 






2924 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


174 


71 


2925 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


503 




2926 


U47624 


Xenopus laevis 


alpha(E)-catcnin 


243 


94 


2927 


AF010144 


Homo sapiens 


neuronal tliread protein AD7c-NTP 


135 


47 






Homo sapiens 


cytochrome c oxidase subunit 3 




78 


2929 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


465 


81 


2930 


002994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


151 


58 


2931 


D 16480 


Homo sapiens 


enoyl-CoA hydratase/3-hydroxyacyl- 
CoA dehydrogenase alpha-subunit of 
trifunctional protein 


345 


67 


2932 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


212 


95 




AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 
RHAla(AF078683) 


123 


69 


2934 


X83427 


Oniithorhynchus 
anatmus 


cytochrome c oxidase subunit 1 


460 


82 


2935 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


527 


83 






Homo sapiens 


NADH dehydrogenase subunit 4 






2937 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


183 


77 


2938 
^939 


X15081 
^FT94537~ 


Crithidia 
Homo sapiens 


MURF2 protein (AA 1-348) 


105 


42 


NAG 13 


151 


52 


2940 


AJ27 1872 


Nicotiana 
sylvestris 


extensin 


165 


37 


2941 

- . 


AP000616 


... 


similar to RING-H2 finger protein 
RHAla (AF078683) 


134 


91 


^lii 


j^^gj^^ 


Homo sapiens 


cytochrome c oxidase subunit 3 




69 










509 




2944 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


108 


95 


2945 


J01415 


Homo sapiens 


MTND4 


429 


76 


2946 


ACO 10793 


Arabidopsis 
thaliana 


F20B 17.16 


99, 


41 


2947 


AF116712 


Homo sapiens 


PR02738 


132 


60 


2948 


X89658 


Homo sapiens 


CAP- 18 protein 


136 


49 



206 
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SEQID 


Accession No. 


Species 


DescipHon 


Smith- 
Waterman 
Score 


Identit 


2949 


V00662 


Homo sapiens 


C3^ochrome oxidase 1 


449 


86 






Homo sapiens 


NADH dehydrogenase subunit 4 




82 


2951 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


363 


76 


2952 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 
RHAla (AF078683) 


129 


92 


- 




. 

Cnthidia 
oncopelti 


NADH dehydrogenase subunit 5 


- - - 








Homo sapiens 


Human secreted protein, SEQ ID NO: 

4658. 


110 


68 


2955 


Y76287 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 20. 


345 


78 






Homo sapiens 


cytochrome c oxidase subunit 3 






2957 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


456 


83 


2958 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


76 


2959 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


84 


2960 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


400 


70 


2961 


X69978 


Homo sapiens 


XP-G factor 


539 


88 




AF203687 


Homo sapiens 


prolactin regulatory element-binding 

protein 


453 


77 






Homo sapiens 


cytochrome c oxidase subunit 1 


561 


83 


2964 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


92 


2965 


W80406 


Homo sapiens 


A secreted protein encoded by clone 


119 


63 


2966 


U52077 


Homo sapiens 


mariner transposase 


469 


83 


2967 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


130 


53 


2968 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


103 


42 


2969 


X92485 


Plasmodium 


pval 


144 


42 


2970 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


127 


63 


2971 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


129 


63 






Homo sapiens 


Human secreted protein (clone fk317- 

3). 


91 


56 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 






2974 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

- 


130 


68 






1 

Homo sapiens 




167 


46 




V00662 










2977 


AP000616 


Or>'za sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


121 


44 


2978 


Y86248 

. .. 


Homo sapiens 




Human secreted protein HCHPF68, 
SEQ IDNOil63. 


126 


72 






Homo sapiens 


putative plSO 






2980 


M12099 


Mus inusculus 


proline-rich protein 


119 


39 






Mus musculus 


histone HI 






2982 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein KIO (NIDrgSMS) 


172 


51 


2983 


AC002291 


Arabidopsis 
thaliana 


extensin 


110 


35 


2984 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


295 


92 


2985 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


171 


65 



207 



SEQID 
NO: 




Species 


nescription 


Smith- 
Waterman 
Score 


Identit 

y 


2986_ 


U '°985 


Homo sapiens 


triadin 


416 


83 


2987 


API 55232 


Pisum sativum 


extensin 


159 


41 


2988 


X03 145 


Homo sapiens 


pot. ORF V 


133 


44 






Homo sapiens 


cytochrome oxidase subunit II 




84 


2990 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


53 


2991 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


538 


83 




V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


471 


75 


2993 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


511 


72 


2994 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


301 


79 


2995 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


526 


78 


2996 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


634 


82 


2997 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68,. 
SEQ ID NO: 163. 


158 


76 


2998 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


103 


50 


2999 


V00662 


Homo sapiens 


cytochrome oxidase I 


535 


78 


3000 


L38908 


Nicotians 
tabacum 


extensm 


146 


38 


3001 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


494 


87 


3002 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


111 


66 


3003 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


117 


46 


3004 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


93 


47 


3005 


U43627 


Arabidopsis 
tlialiana 




118 


37 


3006 


AFl 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


150 


44 


3007 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7637. 


114 


73 


3008 


W54966 


Homo sapiens 


Synthetic human type HI collagen 
SYN-C3. 


111 


39 


3009 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


84 


3010 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


212 


77 


3011 


X65718 


Prunus dulcis 


extensin 


129 


42 


3012 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 


76 


3013 


Z38128 


Mus musculus 


histone HI 


104 


32 


3014 


X56010 


Sorghum bicolor 


hydroxyproline-rich glycoprotein 


102 


34 


3015 


Y86248 


Homo sapiens 


Human secreted protein 1ICHPF68, 
SEQ ID NO; 163. 


128 


80 


3016 


Y34068 


Homo sapiens 


Histone HI isofonn HI. 5 pANCA- 
reactive fragment (residues 69-226). 


99 


34 


3017 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


142 


32 


3018 


AJ133050 


Panulirus argus 


c>tochrome c oxidase subunit I 


125 


71 


3019 
- 


G03597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7678. 


175 


68 






Homo sapiens 


putative pi 50 


97 




3021 


D38116 


Pan paniscus 


cytochrome c oxidase subunit I 


555 


82 


3022 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


147 


76 


3023 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


632 


82 


3024 


AF155232 


Pisum sativum 


extensin 


181 


42 


3025 


V00662 


Homo sapiens 


cytochrome oxidase I 


567 


85 



208 
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SEQID 
NO: 




Species 


Description 


Smith- 
Score 


Identit 

y 


3026 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


130 


58 


3027 


W00838 

■ 


Homo sapiens 


Tumour necrosis factor-related gene 

product, 


141 


67 


^029 


AF 13 0089 
^F096944 


Homo sapiens 
Homo sapiens 


PRO2550 
PROOTes 






168 


76 


3030 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


122 


92 






Homo sapiens 


NADH dehydrogenase subunit 4 




91 


3032 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


351 


87 


3033 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


99 


75 


3034 


S75895 


Homo sapiens 


NADH dehydiogenase subunit 2, ND2 


1 13 


85 


3035 


U93567 


Homo sapiens 


putative pl50 


173 


41 


3036 


X92485 


Plasmodium 
vivax 


pval 


122 


41 


3037 


AF202635 


Homo sapiens 


PPI200 


128 


67 


3038 
T039 ^ 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


151 


83 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


415 


74 


3040 


D86853 


Catharanthus 
roseus 


extensin 


198 


40 


3041 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


110 


47 


3042 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


377 


95 


3043 


S80119 


Rattus sp. 


reverse transcriptase homolog 


106 


50 


3044 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7284. 


137 


55 


3045 


Y36156 


Homo sapiens 


Human secreted protein #28. 


91 


58 


3046 


U93574 


Homo sapiens 


putative pi 50 


177 


49 


3047 


AF063866 


Melanoplus 
sanguinipes 
entomopoxvirus 


ORF MSV233 hypoftetical protein 


98 


37 


3048 


D38112 


Homo sapiens j 


cytochrome c oxidase subunit 3 


568 


87 


3049 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


144 


63 


3050 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


80 


3051 


AK027208 


Homo sapiens 


unnamed protein product 


145 


72 


3052 


G 03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


166 


75 


3053 


AFl 18082 


Homo sapiens 


PRO 1902 


84 


50 


3054 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8153. 


118 


41 


3055 


X92485 


Plasmodium 
vivax 




132 


52 


3056 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4484. 


117 


74 


3057 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


99 


72 


3058 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


627 


89 


3059 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


98 


57 




G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 






3061 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


80 


3062 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


529 


80 


3063 


AF130051 


Homo sapiens 


PRa0898 


159 


71 


3064 


AF195418 


Mus musculus 


0DZ3 


386 


94 


3065 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


109 


51 
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SEQID 


Accession No. 




Description 


Smith- 




NO: 








Waterman 


Idcntit 








„ . 


Score 


y 








clone HCACJ8 1 . 






3066 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


338 


80 


3067 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


120 


50 


— — 





brucei 


protein rhala. 








647 


Miis miisculiis 


reverse transcriptase 




134 


41 


3069 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


397 


92 


j070 


AF 130089 


Homo sapiens 


PRO2550 


149 


55 


3071 


U93565 


Homo sapiens 


putative pi 50 


143 


37 


3072 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


45 








7214. 






J 073 


AF090930 


Homo sapiens 


PRO0478 


1j>5 


78 


3074 


AF 130089 


Homo sapiens 


PRO2550 


156 


58 


3075 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 


267 


65 








fragment, SEQ IDNO:387. 






3076 


U83280 


Leishmania 


39 kDa antigen 


98 


51 






donovani 








3077 


D38112 


Homo sapiens 


cytochrome c oxidase subimit 3 


626 


86 


3078 


AFl 16712 


Homo sapiens 


PR02738 


1 14 


59 


3079 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 


112 


46 








homolog 








D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


492 


80 


3081 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


523 


86 


3082 


D38I12 


Homo sapiens 


cytochrome c oxidase subunit 3 


515 


79 


3083 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 


150 


41 








homolog 






3084 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


191 


97 








SEQ ID NO: 163. 






3085 


U I 2690 


Homo sapiens 


cytochrome oxidase subunit 0 


537 


78 


3086 


D38112 


Homo sapiens 


ATPase subunit 6 


300 


58 


3087 


X8603 1 


Sus scrofa 


homologue of proline/arginine-rich 


104 


37 








antibacterial protein 






3088 


K02247 


Rattus 


proline-rich protein 


108 


41 






norvegicus 








3089 


W48351 


Homo sapiens 


Human breast cancer related protein 


130 


48 




.. , . 




BCRB2. 






3090 


V00662 


Homo sapiens 


cytochrome oxidase I 


597 


84 


3091 


Ml 8094 


Phaseolus 


hydroxyproline-rich glycoprotein 


221 


43 






vulgaris 








3092 


D38113 


Pan troglodytes 


C3rtochrome c oxidase subunit 1 


392 


59 


309j 


D38116 


Pan patiiscus 


c5^ochrome c oxidase subunit 1 


593 


84 


3094 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


471 


76 






Homo sapiens 


CNS neural thread protein HB4. 




98 


3096 


L26953 


Homo sapiens 


chromosomal protein 


115 


57 


3097 


G00591 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


57 








4672. 






3098 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


152 


91 








SEQ ID NO: 163. 






3099 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


96 






6836. 






3100 


AC006233 


Arabidopsis 
thaliana 


hypothetical protein 


65 


44 




AF289098 


Cladrastis 












kentukea 








3102 


AFl 55232 


Pisum sativum 


extensin 


142 


43 


3103 


AFl 3 0089 


Homo sapiens 


PRO2550 


122 


81 


3104 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


143 


80 








SEQ ID NO: 163. 






3105 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


104 


59 








sequence. 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 


Identit 

y 


3106 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


112 


76 


3107 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


117 


72 


3108 


D38112 


Homo sapiens 


NADH dehydrogenase subunit4 


334 


85 






Arabidopsis 
thaliana 


extensin 




30 




AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 




100 




AF 13 0089 


Homo sapiens 


PRO2550 




37 


3112 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


220 


93 


3113 


A18812 


Brassica napus 


extensin 


128 


32 


3114 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


176 


63 


3115 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


92 


69 


3116 


X92485 


Plasmodium 
vivax 


pval 


148 


45 


3117 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


574 


80 


3118 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 

(PIR:JH0162) 


125 


41 


3119 


U43627 


Arabidopsis 
thaliana 


extensin 


129 


32 


3120 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


171 


82 




YI2950 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


90 


59 


3122 


AF130051 


Homo sapiens 


PRO0898 


134 


82 




M8132I 


Macaca 
fascicularis 


proline-rich protein 


158 


48 


3124 


U93563 


Homo sapiens 


putative pi 50 


126 


32 


3125 


D38114 


Gorilla gorilla 


NADH dehydrogenase subunit 1 (NDl) 


337 . 


92 


3126 


AF130089 


Homo sapiens 


PRO2550 


155 


77 


3127 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


162 


76 


3128 


AF090901 


Homo sapiens 


PROG 195 


109 


53 


3129 


AF003736 


Murine leukemia 
virus 


reverse transcriptase 


164 


40 


3130 


U25281 


Rattiis 
norvegicus 


SH3 domain binding protein 


88 


39 


3131 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


116 


71 


3132 


D86853 


Catharanthus 


extensin 


110 


33 


3133 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


506 


76 


3134 


V00662 


Homo sapiens 


cytochiome oxidase I 


583 


88 


3135 


Y36366 


Homo sapiens 


Fragment of human secreted protein 

encoded by gene 3. 


103 


68 




AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 






3137 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


119 


54 


3138 


AFl 18082 


Homo sapiens 


PRO 1902 


118 




3139 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


270 


87 


3140 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-li2 finger 
protein rhal a. 


103 


48 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 









Waterman 

_§£SE 


Identit 

y 


3141 


D381 12 


: 

Homo sapiens 


NADH dehydiogenase subunit 5 


353 


74 


3142 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


77 
















Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 












SEQIDNO:163. 






J 144 


AF090930 


Homo sapiens 


PRO0478 


138 


73 


3145 


AFl 55232 


Pisum sativum 


extensin 


no 


34 


3146 


M69008 


Homo sapiens 


alpha-1 type XIII collagen 


107 


37 


3147 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


103 


63 






^^'^^^ . .. . 


protein rhal a. 






3148 


D38112 


Homo sapiens 


cytochrome c oxidase subvinit 1 


505 


84 


3149 


M77194 


Rat leukemia 
viras 


polymerase 


167 


35 




AP002543 


Arabidopsis 


gb| AAD230 1 5 . l~gene_id:F 1 5M7. 1 6~si 










thaliana 


milar to unknown protein 






3151 


AK024455 


Homo sapiens 


FLJ00047 protein 


109 


59 


3152 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


89 


39 


3153 


J04794 


Homo sapiens 


aldehyde reductase (EC 1 . 1 . 1 .2) 


137 


58 


3154 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


70 








7795. 






3155 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


160 


53 


J 156 


X92485 


Plasmodium 
vivax 


pval 


104 


50 


3 157 


AFl 94537 


Homo sapiens 


NAG 13 


207 





3158 


AF063693 


Mus musculus 


type XIII collagen 


104 


37 


3 159 


AP0006I6 


Oryza sativa 


similar to R1NG-H2 finger protein 


130 


91 








RHAla (AF078683) 






3160 


AF020191 


Mus musculus 


proline-rich protein 13 


107 




^' 




AFl 94537 


Homo sapiens 








3162 


G02535 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


41 








6616. 






3163 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


161 


58 








gene No. 101. 






3164 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


139 


46 


3165 


U93570 


Homo sapiens 


putative pi 50 


151 


68 


J 166 




Homo sapiens 


SEQ ID NO 485 from W09922243. 


144 


71 


3 167 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


147 


66 


3168 


L27428 


Homo sapiens 


reverse transcriptase 


94 


51 




•^.^^^^^ 


Homo sapiens 


plakophilin 2b 


IOj 


76 


J 170 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68, 


115 


54 








SEQ ID NO; 163. 






"17 


AK 02445 5 


— 

omo sapiens 


FLJ00047 protein _ 





_rl 


-|i2? 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


146 


38 


J 


D38116 


Pan paniscus 


NADH dehydrogenase subunit 4 


467 


81 


3174 


X92485 




Plasmodium 

vivax 


pval 




88 


69 


j175 


L26953 


Homo sapiens 


chromosomal protein 


124 


78 


3176 


AF042 1 69 


Homo sapiens 


putative ATP-dependent mitochondrial 


223 


95 






- 


RNA helicase 








Ar ijuu/y 




PR02852 






3178 


AF2732I7 


Mus musculus 


ceil proliferation related protein CAP 


97 


36 


3179 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


69 








7496. 






3180 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


146 


78 


3181 


AF118082 


Homo sapiens 


PRO 1902 


150 


58 


3182 


U60315 


Molluscum 


MCI07L 


101 


44 






contagiosum 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 













Waterman 
Score 


Mentit 

y 






virus subtype 1 








3183 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


148 


61 








similar to pogo element 










Homo sapiens 


putative pi 50 




58 


3185 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


117 


79 









SEQ ID NO: 163. 






3186 


U87607 


Rattus 


putative RNA binding protein 1 


167 


j9 






norvegicus 








3187 


G0j789 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


160 


63 








7870. 






3188 


S80119 


Rattus sp. 


reverse transcriptase homolog 


152 


49 


3189 


X05300 


Rattus 
norvegicus 


ribophorin I 


122 


46 


3190 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


106 


80 


3191 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


83 








7124. 






3192 


G03 1 72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


80 








7253. 






3193 


AF000298 


Caenorhabditis 


wealc similarity to collagens; glycine- 


98 


34 






elegans 


and proUne-rich 






3194 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


87 








7124. 






3195 


AF022985 


Caenorhabditis 


Similar to collagen 


97 


40 






elegans 








3196 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


452 


81 


3197 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 10 


58 








7492. 






3198 


X99452 


Lycopersicon 


extensin-like protein Dif54 


100 


36 






esculentiun 








3199 


D86853 


Catharanthus 
roseus 


extensin 


119 


36 


3200 


AF025424 


Rattus 


RNA polymerase I 127 kDa subunit 


215 


74 






norvegicus 








3201 


U93563 


Homo sapiens 


putative pi 50 


359 


77 


3202 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


112 


37 


3203 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


84 








4581. 






3204 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


39 








4770. 






3205 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


76 








4718. 






3206 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


127 


64 








clone HCACJ81. 






3207 


X97675 


Homo sapiens 


plakophiiin 2b 


142 


51 


3208 


GOO 63 7 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


58 








4718. 






3209 


AB026512 


Ecnomiosa sp. 


cytochrome c oxidase subunit 1 


254 


62 






Ecnl ^ 








j210 


V 


Homo sapiens 


- 

Human secreted protem, SEQ ID NO: 


163 


59 


— — 






=- — ^ 








D38 1 12 


— : 

Homo sapiens 


ATPase subunit 6 


"48^ 


^ 


"3212 


V00662 










3213 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


66 








7519. 






3214 


D38112 


Homo sapiens 


ATPase subunit 6 


400 


72 


3215 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


569 


88 


3216 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


472 


82 


3217 


AF090930 


Homo sapiens 


PRO0478 


96 


64 


3218 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


46 
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SEQ ID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7253. 






3219 


AF006061 


Homo sapiens 


placental growth honnone isoform 


290 


68 


3220 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


63 


3221 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


441 


71 


3222 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


59 


3223 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


497 


81 


3224 


L26953 


Homo sapiens 


chromosomal protein 


95 


59 


3225 


U83280 


Leislimania 
donovani 


39 kDa antigen 


117 


90 


3226 


AF239615 


Manihot 
esculenta 


CRANTZ hydroxyproline-rich 
glycoprotein 


124 


43 


3227 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


442 


74 


3228 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


172 


67 


3229 


X92591 


Mus musculus 


transcription factor 


109 


43 


3230 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


81 


54 


3231 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 1 


476 


72 


3232 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


54 


3233 


U96416 


Demiyus 
distinctus 
timjonesi 


cytochrome b 


95 


40 


3234 


Y86248 


Homo sapiens 


Human secreted protem HCHPF68, 
SEQ ID NO: 163. 


161 


86 


3235 


M76546 


Helianthus 


hydroxyproline-rich protein 


101 


40 


3236 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


149 


75 


3237 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


532 


76 


3238 


V00662 


Homo sapiens 


URf 1 (NADH dehydrogenase subunit) 


576 


89 


3239 


U93567 


Homo sapiens 


p40 


155 


32 


3240 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


79 


3241 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


145 


65 




G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


76 


62 


3243 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


138 


69 


3244 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


115 


39 


3245 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


138 


54 


3246 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6292. 


153 


69 


3247 


U58736 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


107 


37 


. ... 


U87607 


norvegicus 


putative RNA binding protein 1 






3249 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


113 


85 


3250 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


166 


61 


3251 


U15183 


Mycobacterium 
leprae 


proline-rich antigen 


109 


40 


3252 


AF130089 


Homo sapiens 


PRO2550 


151 


73 



214 
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SEQID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


Identit 

y 


3253 


W4835] 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


58 


3254 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


162 


80 


3255 


AL359782 


To'panosoma 
brucei 


probable similar to riiig-h2 finger 
protein rha la. 


108 


57 


3256 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


156 


88 


3257 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


206 


93 


3258 


Y36203 


Homo sapiens 


Human secreted protein #75. 


111 


77 


3259 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


98 


57 


3260 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


345 


67 


3261 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


97 


56 


3262 


L38908 


tabacum 


extensm 


155 


40 


3263 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


95 


70 


3264 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


198 


87 


3265 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


69 


3266 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


158 


59 


3267 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


105 


56 


3268 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


159 


72 


3269 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


200 


95 


3270 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


138 


73 


3271 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


151 


90 


3272 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

SEQ ID NO: 163. 


193 


95 


3273 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


111 


52 


3274 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


87 


3275 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


161 


86 


3276 


S62928 


Homo sapiens 


PRBIM protein precursor 


104 


34 


3277 


G02832 


Homo sapiens 


Human secreted protein. SEQ ID NO; 
6913. 


154 


55 


3278 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


3279 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


212 


91 


3280 
- 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


118 


50 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 






3282 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


303 


95 


3283 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


455 


81 


3284 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


502 




3285 


AFl 18086 


Homo sapiens 


PR01992 


99 


88 


3286 


D38112 


Homo sapiens 


cytochrome c oxidase subunit I 


489 


83 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Score 


Idcntit 

y 


3287 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


485 


88 


3288 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


73 


3289 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ull02, vaiiant a dna, 
complete virion genome. 


158 


60 


3290 


X92485 


Plasmodium 

vivax 


pval 


99 


58 


3291 


AFl 16712 


Homo sapiens 


PR02738 


102 


51 


3292 


AF119851 


Homo sapiens 


PRO 1722 


99 


86 


3293 


AF090930 


Homo sapiens 


PRO0478 


141 


72 


3294 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


151 


66 


3295 


M64793 


Rattus 
norvegicus 


salivary proline-rioh protein 


107 


41 


3296 


AK024455 


Homo sapiens 


FLJ00047 protein 


97 


53 


3297 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


156 


60 


3298 


AC007654 


Arabidopsis 
thaliana 


T19E23.7 


97 


49 


3299 


Y17221 


Homo sapiens 


Human secreted protein (clone fk317- 
3). 


96 


44 


3300 


AJ243905 


Caenorhabditis 
elegans 


SFl protein 


104 


42 


3301 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


96 


41 


3302 


X89453 


Rattus 
norvegicus 


DRPLA 


83 


65 


3303 


U 10099 


Homo sapiens 


POM-ZP3 


227 


52 


3304 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


553 


89 


3305 


L27428 


Homo sapiens 


reverse transcriptase 


144 


43 


3306 


D38112 


Homo sapiens 


cytoclirome c oxidase subunit 3 


388 


71 


3307 


AF217374 


Acan til aster 
planci 


cytochrome oxidase subunit I 


439 


78 


3308 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


'439 


82 


3309 


D38112 


Homo sapiens 


cytoclirome c oxidase subunit 1 


492 


88 


3310 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

4433. 


81 


65 


3311 


X53581 


Rattus 
norvegicus 


0RF4 


140 


48 


3312 


V00662 


Homo sapiens 


cytochrome B 


487 


88 


3313 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


451 


80 


3314 


V00662 


Homo sapiens 


cytochrome oxidase I 


524 


88 


3315 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


518 


87 


3316 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4497. 


115 


50 


3317 


V00662 


Homo sapiens 


URP 1 (NADH dehydrogenase subunit) 


356 


69 


3318 


U 12693 


Homo sapiens 


cytochrome oxidase subunit H 


477 


78 


3319 


Z67990 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01484 (Nematode cuticle collagen 

N-terminal domain), Score=27.5, E- 
value=0.0001,N=l 


75 


40 


3320 


Q01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


126 


75 


3321 


AF090930 


Homo sapiens 


PRO0478 


166 


65 


3322 


X92485 


Plasmodium 




104 


44 


3323 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


513 


80 


3324 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


483 


79 
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SEQID 
NO: 


Accession No. 


Species 




Waterman 
Score 


Identit 

y 


3325 


AF130089 


Homo sapiens 


PRO2550 


121 


88 


3326 


AC026234 


Unknown 


Contains weak similarity to an 
unknown protein T23E1 8.5 


191 


66 




V00662 


Homo sapiens 


cytochrome B 






3328 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


316 


75 




AF091711 


Homo sapiens 


splice variant AKAP350 


106 


52 


3330 


U93565 


Homo sapiens 


putative pi 50 


125 


45 




AF119851 


Homo sapiens 


PRO 1722 


88 


72 


3332 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


489 


81 


3333 


V00662 


Homo sapiens 


cytochrome oxidase 1 


544 


90 


3334 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


475 


80 


3335 


Y 14482 


Homo sapiens 


Fragment of hiunan secreted protein 

encoded by gene 17. 


168 


66 


3336 


AF146688 


Takifugu 
rubripes 


serine-threonine kinase 9 


101 


43 


3337 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


101 


50 


3338 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


110 


50 


3339 


V00662 


Homo sapiens 


cytochrome oxidase I 


458 


80 


3340 


K02401 


Homo sapiens 


chorionic somatomammotropin 


209 


97 


3341 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


1 12 


52 


3342 


X65165 


Volvox carteri 


extensin 


146 


41 


3343 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


119 


51 




V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


559 


84 


3345 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


508 


89 


3346 


Ml 1 897 


Mus musculus 


proiine-rich salivary protein 


96 


41 


3347 


U92455 ■ 


Mus musculus 


WW domain binding protein 7; WBP7 


113 


38 


3348 


U83280 


Leishmania 
donovani 


39 kDa antigen 


105 


40 


3349 


K03205 


Homo sapiens 


saUvary proline-rich protein precursor 


] 13 


38 


3350 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


542 


79 


3351 


B03148 


Homo sapiens 


Human neuronal differentiation fector- 
1 (NDF-1). 


664 


88 


3352 


GO 3043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


117 


85 


3353 


A18812 


Brassica napus 


extensiQ 


103 


31 




Ml 0546 


Homo sapiens 


cytochrome oxidase I 


125 


78 


3355 


MS1321 


Macaca 
fascicularis 


proline-rich protein 


107 


44 


3356 


M10546 


Homo sapiens 


cytochrome oxidase I 


284 


90 


3357 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


513 


80 


3358 


D38n2 


Homo sapiens 


Qlochrome c oxidase subunit 1 


541 


79 


3359 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


543 


87 


3360 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


563 


86 


3361 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


179 


57 


3362 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


62 


3363 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


151 


46 


3364 


D38n2 


Homo sapiens 


cytociirome c oxidase subunit 3 


569 


82 


3365 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


102 


40 


3366 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


143 


76 


3367 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID MO: 
6490. 


145 


84 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 





.. 


- 




Score 


y 




_Dj81_1^ 


Homo sapiens 


cytochrome c oxidase subunit 1 


587 


87 


3369 


038112 


Homo sapiens 


cytochrome c oxidase subunit 1 


549 


82 


jj70 


AL035526 


Arabidopsis 


extensin-like protein 


130 


39 






thaliana 










J04543 


Homo sapiens 


synexm 


101 


40 






Didelphis 


cytochrome c oxidase subunit 3 


134 


71 






virginiana 








337j 


U9j568 


Homo sapiens 


putative p 1 50 


176 


59 


3374 


L28748 


Bos taurus 


putative 


146 


67 


3375 


038112 


Homo sapiens 


cytochrome c oxidase subunit 1 


512 


75 


3376 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


509 


78 


3377 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


51 








6619. 






3378 


U 97 674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


557 


82 


3379 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


120 


35 








SEQ ID NO: 163. 








X92485 


Plasmodium 


pval 


124 


38' 






vivax 








3381 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


78 








6619. 






3382 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


108 


67 


3383 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


115 


86 








RHAla(AF078683) 






3384 


X98296 


Homo sapiens 


ubiquitin hydrolase 


238 


83 


3385 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


494 


75 


3386 


Y 86472 


Homo sapiens 


Human gene 52-encoded protein 


155 


66 








fragment, SEQ ID NO:387. 






3387 


AL359782 


Trypanosoma 


probable similar to ring-h2 fmger 


116 


65 






brucei 


protein rhala. 






3388 


W73624 


Homo sapiens 


Human secreted protein clone aw92_l . 


327 


62 


3389 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


76 








7888. 






3390 


G00j61 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


73 


61 






_ - 


4442. 






3391 


U35730 


Mus musculus 


jerky 


152 


30 




U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


100 


41 


3393 


U97674 


Mesocricetus 


cytoclirome c oxidase chain I 


483 


77 






auratus 






3394 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


539 


81 


3395 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


608 


87 


3396 


X71602 


Nicotiana 


extensm 


104 


35 





... 


tabacum 








3397 


L26J53 


Homo sapiens 


chromosomal protein 


100 


60 


3398 


AF 197832 


Cyanocitta 
cristata 


cytochrome oxidase I 


488 


76 


3399 


X53581 


Rattus 


0RF4 


140 


42 






norvegicus 








3400 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


597 


85 


340^ 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


656 


89 






Homo sapiens 


chromosomal protein 






3403 


AF052298 


Drosophila 


Y box protein 


114 


30 






silvestris 








3404 


U43360 


Peromyscus 


reverse transcriptase 


133 


53 






maniculatus 








3405 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


65 


3406 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


277 


82 


3407 


U01849 


Trj^anosoma 


ORF2 


94 


30 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterni.in 


% 

Identit 

y 






brucei 








3408 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


115 


65 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 




86 


3410 


U93570 


Homo sapiens 


putative pi 50 


233 


52 




G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8172. 


104 


44 


3412 


AF090944 


Homo sapiens 


PRO0663 


97 


56 


3413 


X61048 


Hydra sp. 


mini-collagen 


128 


44 


3414 


AFl 18086 


Homo sapiens 


PRO 1992 


128 


81 


3415 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


145 


61 


3416 


AF116695 


Homo sapiens 


PR02221 


169 


69 


3417 


U31778 


Human 

papillomavirus 
type 20 


putative 


107 


50 


3418 


X92485 


Plasmodium 


pval 


97 


54 


3419 


U12690 


Homo sapiens 


cytochrome oxidase subunit 11 


195 


,94 


3420 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


144 


69 


3421 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


139 


71 


3422 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


111 


63 


3423 


X15917 


Paramecium 
aurelia 


0RF4 protein (AA 1-156) 


93 


41 


3424 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


99 


69 


3425 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


115 


60 


3426 


y 14486 


Homo sapiens 


cj'tosolic serine 
hydroxymethyltransferase 


196 


61 


3427 


U93569 


Homo sapiens 


putative pl50 


110 


44 


3428 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


67 


3429 


ABO 12223 


Canis familiaris 


0RF2 


169 


44 


3430 


X97675 


Homo sapiens 


plalcophilin 2b 


118 


60 


3431 


AF220264 


Homo sapiens 


MOST-1 


153 


72 


3432 


X64173 


Zea 

diploperemiis 


hydroxyproline-rich glycoprotein 


104 


41 


d433 


AFl 13685 


Homo sapiens 


PRO0974 


104 


72 




AB002306 


Homo sapiens 


KIAA0308 






3435 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


113 


83 




ABO 12223 


Canis familiaris 








3437 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


65 




U52077 


Homo sapiens 


mariner transposase 


381 


64 


3439 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


425 


84 


3440 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


63 




G02624 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6705. 


90 


78 


3442 


Y36156 


Homo sapiens 


Human secreted protein ft28. 


126 


71 


3443 


AF191032 


Myxine 


RJNG3 


84 


63 


3444 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


141 


57 


3445 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 




65 
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SEQID 
NO: 


Accession No. 


Species 




Smith- 
Waterman 


% 
y 


3446 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


157 


67 






Homo sapiens 


cytochrome oxidase subunit II 






3448 


AFn2481 


Homo sapiens 


RAD54B protein 


392 


87 




G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4578. 


98 


55 


3450 


Y 86248 


Homo sapiens 


Hmnan secreted protein HCHPF68, 
SEQ ID NO: 163. 


133 


90 


3451 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


418 


82 


3452 


AF 13 0079 


Homo sapiens 


PR02852 


110 


80 


3453 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


143 


73 


3454 


AB032254 


Homo sapiens 


bromodamain adjacent to zinc finger 
domain 2A 


447 


85 


3455 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


134 


62 


3456 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


79 


68 


3457 


Dj8112 


Homo sapiens 


cjtochrome c oxidase subunit 3 


545 


84 




G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


134 


83 


3459 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


277 


80 




AK000867 


Homo sapiens 


unnamed protein product 






3461 


Y27854 


Homo sapiens 


Human secreted protein encoded by 

gene No. 101. 


123 


52 


3462 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


89 


30 


3463 


U93568 


Homo sapiens 


putative pi 50 


100 


46 


3464 


AL390114 


LeislmTania 
major 


extremely cysteine/valine rich protein 


156 


43 


3465 


D381I2 


Homo sapiens 


NADH dehydrogenase subunit 3 


340 


88 


3466 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


119 


100 


3467 


BO 1372 


Homo sapiens 


Neuron-asscx;iated protein. 


142 


75 


3468 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


103 


40 


3469 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4438. 


139 


80 


3470 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 
haemoglobin gene contained in pSSl . 


207 


89 




AFl 16661 


Homo sapiens 


PRO 1438 


1 19 


47 


3472 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


129 


59 


3473 


AF130089 


Homo sapiens 


PRO2550 


108 


86 


3474 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


113 


80 


3475 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


590 


79 




AF118086 


Homo sapiens 


PRO 1992 






3477 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


47 


3478 


J01415 


Homo sapiens 


MTND4 


482 


3479 


X03404 


Bos taurus 


alpha subunit (aa 1-394) 


583 


89 






Homo sapiens 


cytochrome c oxidase subunit I 






3481 


G03628 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7709. 


242 


80 


3482 


G00407 


Homo sapiens 


Human secreted protein, SHQ ID NO: 
4488. 


121 


61 


3483 


X83427 


Omithorhynchus 


cytochrome c oxidase subunit I 


508 


78 


3484 


Y07754 


Homo sapiens 


Human secreted protein fragment 


549 


93 
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SEQID 
NO: 




Species 


Description 


Waterman 
Score 


Idcntit 

y 








encoded from gene 1 1 . 






3485 


X97567 


Homo sapiens 


porl 


388 


69 


3486 


X97567 


Homo sapiens 


porl 


608 


84 


3487 


K02401 


Homo sapiens 


chorionic somatomammotropin 


641 


93 


3488 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


135 


71 


3489 


S62941 


Homo sapiens 


Ps 2=basic proline-rich protein(PRBlL 
precursor protein=basic proline-rich 
proteins (Ps, PmF, PmS, and Pe) 
precursor) {C-termuial} 


116 


36 


3490 


S74728 


Homo sapiens 


antiquitin=26g turgor protein homolog 


549 


84 


3491 


L13329 


Homo sapiens 


iduronate-2-sulfatase 


564 


85 


3492 


X79535 


Homo sapiens 


beta tubulin 


620 


88 


3493 


AF08I484 


Homo sapiens 


alpha-tubulin isoform 1 


578 


87 


3494 


U09823 


Oryctolagus 
cuniculus 


elongation fector 1 alpha 


631 


89 


3495 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


616 


90 


3496 


M12140 


Homo sapiens 


envelope protein 


430 


53 


3497 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


123 


70 


3498 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLTl. 


104 


51 


3499 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


134 


54 


3500 


Y36156 


Homo sapiens 


Human secreted protein #28. 


143 


75 


3501 


AFl 13685 


Homo sapiens 


PRO0974 


77 


81 


3502 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4446. 


157 


62 


3503 


AF090944 


Homo sapiens 


PRO0663 


164 


81 


3504 


AF090931 


Homo sapiens 


PRO0483 


103 


70 


3505 


U93570 


Homo sapiens 


p40 


258 


47 


3506 


ABO 16601 


Drosophila 
alpina 


cytochrome c oxidase subunit I 


108 


82 


3507 


ABO 16601 


Drosophila 


cytochrome c oxidase subunit I 


108 


82 


3508 


AFl 94537 


Homo sapiens 


NAG 13 


449 


78 


3509 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


153 


60 


3510 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


102 


37 


3511 


AC008113 


Arabidopsis 
thaliana 


F12A21.10 


90 


48 


3512 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


104 


33 


3513 


AF090944 


Homo sapiens 


PRO0663 


115 


52 . 


3514 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 
protein 1 


152 


67 


3515 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


123 


48 


3516 


AF090942 


Homo sapiens 


PRO0657 


128 


47 


3517 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


286 


56 


3518 


AF255661 


Crypthecodiniura 
cohnii 


Dinapl -interacting protein 5; Dip5 


108 


36 


3519 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


209 


93 


3520 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


38 


3521 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


131 


64 


3522 


AL359782 


Trj'panosoma 


possible (hhv-6) ul 102, variant a dna. 


105 


42 



221 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 








complete virion genome. 






3523 


R95913 


Homo sapiens 


Neural thread protein. 


118 


72 


3524 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


59 


3525 




Homo sapiens 


reverse transcriptase 


162 


50 


3526 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


118 


58 


3527 


AF090930 


Homo sapiens 


PRO0478 


146 


66 


3528 


G00528 


Homo sapiens 


Human secreted protein, SEQ ED NO: 

4609. 


169 


50 


3529 


AF194537 


Homo sapiens 


NAG 13 


119 


88 


3530 


U93564 


Homo sapiens 


putative pl50 


234 


84 


3531 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7284. 


208 


63 


3532 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


275 


92 


3533 


AF090931 


Homo sapiens 


PRO0483 


126 


66 


3534 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 
product CL6.5. 40. 


117 


85 


3535 


G03076 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7157. 


173 


77 


3536 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


117 


45 


3537 


AF209061 


Eubranchipus sp. 


cytochrome c oxidase I 


127 


80 


3538 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


174 


85 


3539 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


118 


45 


3540 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


88 


50 


3541 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


128 


71 


3542 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/fork head 
homolog llA 


111 


77 


3543 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


149 


69 


3544 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


470 


80 


3545 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


120 


54 


3546 


AL359782 


Trypanosoma 


probable similar to ring-h2 fmger 
protein rhala. 


106 


42 


3547 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


584 


83 


3548 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


635 


95 


3549 


D38112 


Homo sapiens 


cytochrome o oxidase subunit 3 


661 


91 


3550 


D38112 


Homo sapiens 


cytochrome o oxidase subunit 1 


607 


86 


3551 


AK024455 


Homo sapiens 


FLJ00047 protein 


88 


51 


3552 


W88957 


Homo sapiens 


Polypeptide fragment encoded by gene 
128. 


500 


86 


3553 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


228 


95 


3554 


AF130092 


Homo sapiens 


PRO2620 


165 


93 


3555 


AL121845 


Homo sapiens 


dJ583P15.5.1 (novel protein (isofomi 
D) 


659 


91 


3556 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7143. 


170 


72 


3557 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


599 


88 


3558 


D38112 


Homo sapiens 




578 


84 


3559 


AK000385 


Homo sapiens 


uimam edproteinproduct 138 


47 


3560 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


112 


61 


3561 


X97675 


Homo sapiens 


plakophilin 2b 


123 


65 


3562 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


94 


62 
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NO: 








Waterman 


Identit 










Score 


y 








6595. 






3563 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


198 


90 








SEQIDNO:163. 






3564 


X86029 


Vigna 


extensin-like protein 


141 


36 






unguiculata 








3565 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


77 


60 








7795. 






3566 


AP000616 


Ory'za sativa 


similar to RING-H2 finger protein 


110 


94 








RHAla(AF078683) 






3567 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


203 


93 








SEQ ID NO: 163. 






3568 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


146 


72 








SEQIDNO:163. 






3569 


S62928 


Homo sapiens 


PRBIM protein precursor 


102 


44 


3570 


D86853 


Catliaranthus 
roseus 


extensin 


123 


40 


3571 


U93574 


Homo sapiens 


putative pi 50 


279 


56 


3572 


Y02785 


Homo sapiens 


Human secreted protein encoded by 


94 


82 








gene 51 clone HUKEX85. 






3573 


U45964 


Herpesvirus 


LIVtPl 


127 


35 






papio 








3574 


Y08319 


Homo sapiens 


kinesin-2 


254 


55 


3575 


X97675 


Homo sapiens 


plakophilin 2b 


134 


54 


3576 


AF00315] 


Caenorhabditis 


contains similarity to an KNA 


137 


43 






elegans 


recognition motif 






3577 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


66 








4658. 






3578 


U80447 


Caenorhabditis 


coded for by C. elegans cDNA 


117 


50 






elegans 


yk187f6.5; coded for by C. elegans 












cDNA ykl87f6.3; coded for by C. 












elegans cDNA ykI46fl 1.5 






3579 


U23172 


Caenorhabditis 


similar to myoblast cell surfece antigen 


126 


31 






elegans 


(SP:CS24_HUMAN, P23246) and D. 












melanogaster No-on-transient A protein 












(P1R:JH0162) 






3580 


AJ249395 


Globodera 


NADH-ubiquinone oxidoreductase 


85 


33 






pallida 


subunit 4 






3581 


U09] 16 


Homo sapiens 


ORF 1 , encodes a 40 kDa product 


139 


55 


3582 


X97675 


Homo sapiens 


plakophilin 2b 


125 


73 


3583 


L26953 


Homo sapiens 


chromosomal protein 


101 


55 


3584 


S80343 


Homo sapiens 


arginyl-tRNA synthetase, ArgRS 


110 


67 


3585 


X97675 


Homo sapiens 


plakophilin 2b 


116 


84 


3586 


X92485 


Plasmodium 


pval 


124 


50 






vivax 








3587 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


56 








7795. 






3588 


6031 72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


56 






7253. 






3589 


U63542 


Homo sapiens 


FAP protein 


128 


75 


3590 


D38112- 


Homo sapiens 


cytochrome c oxidase subunit 3 


642 


88 


3591 


D38112 


Homo sapiens 


NADH dehydrogenase subunii 1 


612 


87 




K02401 


Homo sapiens 


chorionic somatomammotropin 






3593 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


40 








4770. 






3594 


V00662 


Homo sapiens 


cytochrome oxidase I 


459 


76 


3595 


AC008262 


Arabidopsis 


F4N2.10 


136 


31 






tlialiana 








3596 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


52 








7871. 







223 



w o 01/64835 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitb- 

Waterman 

Score 


% 

Identit 

y 


3597 


AF116645 


Homo sapiens 


PRO1708 


623 


91 


3598 


M2302S 


Human 
herpesvirus 4 


nuclear antigen precursor 


121 


35 


3599 


W88816 


Homo sapiens 


Polypeptide fragment encoded by gene 
58. 


100 


33 






Plasmodium 
vivax 


pva 








G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 




78 




AK000496 


Homo sapiens 


unnamed protein product 






3603 


AF083929 


Mus musculus 


ES18 


98 


40 


3604 




Homo sapiens 


cytochrome c oxidase subunit 1 


537 


83 


3605 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


548 


85 


3606 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


158 


57 


3607 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


429 


75 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
7437. 






3609 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


88 


3610 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


224 


80 


3611 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


481 


70 


3612 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


166 


92 


3613 


D38112 


Homo sapiens 


cytoclirome c oxidase subunit 1 


582 


83 


3614 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


167 


91 


3615 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


582 


77 


3616" ~ 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


101 


40 


3617 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


591 


85 


3618 


AB037275 


Cynomolgus 
Epstein-Barr 
Virus TsB-B6 


EBNA-1 


119 


42 


3619 


AF061944 


Homo sapiens 


kinase deficient protein KDP 


581 


91 


3620 


Y34068 


Homo sapiens 


Histone HI isoform Hi. 5 pANCA- 
reactive fragment (residues 69-226). 


100 


33 


3621 


AL049608 


Arabidopsis 
tlialiana 


extensin-like protein 


105 


27 


3622 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


127 


69 


3623 


S71569 


Neocallimastix 
patriciarum, 
Peptide, 860 aa 


Xylanase B, XYLB {EC 3.2.1.8} 


108 


43 


3624 


m3172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-fransient A protein 
(PIR:JH0162) 


101 


37 


^62^ 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


390 


83 






Homo sapiens 


cytochrome c oxidase subunit 1 






3627 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


637 


90 


3628 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


169 


39 


3629 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


142 


77 


3630 


U93564 


Homo sapiens 


putative pi 50 


330 


90 


3631 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


42 



224 
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SEQID 




Species 


Description 


Smith- 




NO: 










Identit 










Score 


y 


3632 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


80 








6613. 






3633 


G02360 


Homo sapiens 


Human secreted protein, SEQ EDNO: 


139 


65 








6441. 






3634 


U74071 


Phascolosoma 


cytochrome c oxidase subunit I 


248 


72 






sp. 'California' 








3635 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


121 


87 








4442. 






3636 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


12S 


65 








oncoprotein. 






3637 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


124 


63 








gene 22 clone I1MSJW18. 






3638 


Y08062 


Homo sapiens 


Human PR0245 protein (ragmen: 


159 


51 








derived from DNA35638. 






3639 


X65582 


Mus musculus 


alpha-2 collagen 


102 


53 


3640 


AF090942 


Homo sapiens 


PRO0657 


97 


76 


3641 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


146 


69 








clone HPMBQ32. 






3642 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


55 








6619. 






3643 


X14576 


Murine leukemia 


gag pis protein 


134 


44 






virus 








3644 


AF130051 


Homo sapiens 


PRO0898 


177 


48 


3646 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


59 








7145. 






3647 


AK000385 


Homo sapiens 


unnamed protein product 


105 


40 


3648 


V00662 


Homo sapiens 


cytochrome oxidase I 


533 


83 


3649 


AF090944 


Homo sapiens 


PRO0663 


129 


73 


3650 


AF090944 


Homo sapiens 


PRO0663 


129 


73 


3651 


W48351 


Homo sapiens 


Human breast cancer related protein 


88 


60 








BCRB2. 






3652 


AF090930 


Homo sapiens 


PRO0478 


92 


57 


3653 


G03172 


Homo sapiens 


Hiunan secreted protein, SEQ ID NO: 


148 


73 








7253. 






3654 


ABO 18440 


Echinococcus 


NADH dehydrogenase subunit 2 


100 


32 






multilocularis 






3655 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


68 








5327. 






3656 


AF161356 


Homo sapiens 


HSPC093 


106 


48 


3657 


AF090930 


Homo sapiens 


PRO0478 


158 


85 


3658 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


82 








7124. 






3659 


U93574 


Homo sapiens 


putative pi 50 


117 


44 


3660 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


149 


51 








gene 22 clone HMSJW18. 






3661 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


67 


70 








gene 22 clone HMSJWl 8. 






3662 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


99 


62 








SEQ ID NO; 163. 






3663 


AL451017 


Neurospora 


related to Ul SMALL NUCLEAR 


154 


45 






crassa 


RIBONUCLEOPROTEIN C 










Homo sapiens 


Human secreted protein encoded by 




77 








gene No. 5. 






3665 


W48351 


Homo sapiens 


Human breast cancer related protein 


128 


59 








BCRB2. 






3666 


U47855 


Araneus 


fibroin-3 


109 


39 






diadematus 








3667 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


604 


88 


3668 


K02401 


Homo sapiens 


chorionic somatomammotropin 


635 


93 
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SEQ ID 
NO: 




Species 


Description 


Smith- 


Identit 

y 


3669 


AF083929 


Mus musculus 


ESI 8 


133 


46 


3670 


J01415 


Homo sapiens 


MTND4 


620 


87 


3671 


AK025047 


Homo sapiens 


umiamed protein product 


160 


60 


3672 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


106 


31 


3673 


ABO 1903 8 


Homo sapiens 


beta- 1,4 mannosyltransferase 


766 


94 


3674 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/foifchead 
homolog 11 A 


113 


69 


3675 


U80447 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
y]cl87f6.5; coded for by C. elegans 
cDN A ykl 87f6.3; coded for by C. 
elegans cDNA ykl46fl 1.5 


113 


39 


3676 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


197 


68 


3677 


U52077 


Homo sapiens 


mariner transposase 


500 


74 


3678 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


174 


97 


3679 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


191 


86 


3680 


AC003113 


Arabidopsis 
tlialiana 


F2401.6 


102 


83 ' 


3681 


AF005370 


Alcelaphine 
herpesvirus 1 


putative immediate early protein 


153 


42 


3682 


GO 1502 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5583. 


111 


62 


3683 


AP002543 


Arabidopsis 
thaliana 


gb|AAD23015. l~geneJd:F15M7.16~si 
milar to unknown protein 


131 


40 


3684 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


146 


64 


3685 


U93572 


Homo sapiens 


putative pi 50 


158 


43 


3686 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 
reactive fragment (residues 69-226). 


98 


43 


3687 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


106 


54 


3688 


L26953 


Homo sapiens 


chromosomal protein 


128 


76 


3689 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


563 


87 


3690 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


109 


67 


3691 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6 8, 
SEQ ID NO: 163. 




75 


3692 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


165 


85 




M20789 


Homo sapiens 


alpha- 1 type I collagen 




43 


3694 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 1 63 . 


162 


50 


3695 


AF2 10651 


Homo sapiens 


NAG 18 


124 


67 




AF 13 0079 


Homo sapiens 


PR02852 


94 


53 


3697 


Y19192 


Talpa europaea 


cytochrome oxidase subunit I 


618 


85 


3698 


D38il2 


Homo sapiens 


NADH dehydrogenase subunit 2 


414 


86 


3699 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


487 


76 


3700 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


109 


42 


3701 


AF2 17374 


Acanth aster 


cytochrome oxidase subunit I 


563 


85 


3702 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein KIO (NlD:g8148) 


106 


48 


3703 


D38112 


Homo sapiens 


cytochirome c oxidase subunit 1 


522 


79 


3704 


D67066 


Bos tauras 


N-WASP 


219 


42 


3705 


Y3 63 66 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


51 


3706 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


44 



226 
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SEQID 


Accession No. 




Description 


Smith- 


% 


NO: 








Waterman 












Score 


y 








7879. 






3107 


Z70684 


Caenorhabditis 


F28D1.8 


105 


37 






elegans 








3708 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


125 


74 








7871. 






3709 




D38112 




Homo sapiens 


cytochrome c oxidase subunit 1 


558 


88 


3710 


AF217374 


Acanth aster. 


cytochrome oxidase subunit I 


512 


85 






plane i 








3711 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


186 


80 


3712 


AF197854 


Melithreptus 


cytochrome oxidase I 


189 


83 






lunatus 








3713 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


129 


95 








RHAla(AF078683) 






3714 


AB029027 


Homo sapiens 


KIAAl 104 protein 


141 


81 


3715 


L26251 


Trypanosoma 


CR5 


73 


31 






brucei 








3716 


R55913 


Homo sapiens 


Neural thread protein. 


117 


44 


3717 


G00397 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


95 


61 








4478. 






3718 


Y36112 


Homo sapiens 


Extended human secreted protein 


108 


81 








sequence, SEQ ID NO. 497. 






3719 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 


132 


55 








(hhcm) gene preoduct. 






3720 


M64923 


Bos taurus 


CIO protein 


194 


94 


3721 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


115 


75 








gene 22 clone HMSJW18. 






3722 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


95 








7124. 






3723 


AF302773 


Homo sapiens 


ninein-Lm isoform 


157 


57 


3724 


AK024455 


Homo sapiens 


FLJ00047 protein 


134 


59 


3725 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


161 


80 








gene 22 clone HMSJW18. 






3726 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


121 


52 


3727 


AF220264 


Homo sapiens 


MOST-1 


86 


88 


3728 


L27428 


Homo sapiens 


reverse transcriptase 


316 


62 


3729 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


112 


46 


3730 




Homo sapiens 


Human breast cancer related protein 


145 


68 








BCRB2. 






3731 


W48351 


Homo sapiens 


Human breast cancer related protein 


118 


56 








BCKB2. 






3732 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


65 








7124. 






3733 


AF090930 


Homo sapiens 


PRO0478 


148 


71 


3734 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


140 


72 








gene 22 clone HMSJWI8. 






3735 


U93569 


Homo sapiens 


putative pl50 


270 


59 


3736 


U93568 


Homo sapiens 


putative pl50 


151 


38 


3737 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


124 


82 








6574. 






3738 


X92485 


Plasmodium 
vivax 


pval 


101 


44 






Homo sapiens 


Human secreted protein, SEQ ID NO: 












6619. 






3740 


AF090944 


Homo sapiens 


PRO0663 


124 


56 


3741 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


47 








6613. 






3742 


AFl 18086 


Homo sapiens 


PRO 1992 


124 


61 


3743 


AF083929 


Mus musculus 


ES18 


108 


35 


3744 


L34685 


Arabidopsis 


cell wall protein 


120 


34 
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SEQ ID 


Accession No. 




Description 


Smith- 




NO: 








Score 


Identif 

y 






thaliana 








3745 


AF090931 


Homo sapiens 


PRO0483 


117 


54 


3746 


AF090930 


Homo sapiens 


PRO0478 


136 


75 


3747 


G00397 


Homo sapiens 


Hmiian secreted protein, SEQ ID NO: 


154 


59 








4478. 






37'18 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


44 








4449. 






3749 


AP000616 


Oryza sativa 


similar to RIN'G-H2 finger protein 


98 


42 








RHAla(AF078683) 






3750 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


165 


73 








gene 22 clone HMSJWl 8. 






3751 


AP000616 


Oiyza sativa 


similar to RING-H2 finger protein 


113 


87 








RHAla(AF078683) 






3752 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


58 








7868. 






3753 


U93574 


Homo sapiens 


putative pi 50 


157 


60 


3754 


X56015 


Crithidia 


NADH dehydrogenase subunit 5 


103 


30 






oncopelti 








3755 


M80613 


Homo sapiens 


putative 


114 


26 


3756 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


54 








4414. 






3757 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


210 


89 


3758 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


58 








7871. 






3759 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


110 


46 






elegans 


binding protein KIO {NID:g8148) 






3760 


M13100 


Rattus 


unknown protein 


119 


46 






norvegicus 








3761 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


135 


52 






elegans _ 


binding protein KIO (NlD;g8148) 






3762 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


132 


84 








RHAla(AF078683) 






3763 


Ml 1900 


Mus musculus 


15-kDa proline-rich salivary protein 


125 


40 


3764 


U63542 


Homo sapiens 


FAP protein 


131 


72 


3765 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


76 








4493. 






3766 


X92485 


Plasmodium 
vivax 


pval 


119 


54 


3767 


W50193 


Homo sapiens 


Amino acid sequence of salivary 


81 


56 








protein CON-2. 






3768 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


65 








7492. 






3769 


AB005216 


Homo sapiens 


Nek, Ash and phospholipase C gamma- 


487 


90 








binding protein NAP4 






3770 


AF026211 


Caenorhabditis 


Similar to cuticular collagen 


102 


37 






elegans 








3771 


W48351 


Homo sapiens 


Human breast cancer related protein 


91 


55 








BCRB2. 






3772 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


76 








6566. 






3773 


AK024455 


Homo sapiens 


FL J00047 protein 


138 


66 




AF090895 


Homo sapiens 


PRODI 17 


152 


60 


3775 


AF130051 


Homo sapiens 


PRO0898 


122 


78 


3776 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


63 








5327. 






3777 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


187 


75 








7888. 






3778 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


56 








4523. 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 


3779 


X55685 


Lycopersicon 
esculentura 


extensin (class I) 


132 


38 


3780 


L27428 


Homo sapiens 


reverse transcriptase 


116 


53 


3781 


Y71066 


Homo sapiens 


Human membrane transport protein, 
MTRP-U. 


188 


86 


3782 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


122 


67 


3783 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


462 


75 


3784 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4414. 


127 


67 






Homo sapiens 


Np70 protein carboxy terminal region. 






3786 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


152 


78 


3787 


S80119 


Rattus sp. 


reverse transcriptase homolog 


143 


40 


3788 


AF090942 


Homo sapiens 


PRO0657 


134 


67 


3789 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


135 


63 


3790 


AF116661 


Homo sapiens 


PR01438 


128 


45 


3791 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


130 


38 


3792 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4484. 


127 


74 


3793 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON- 1. 


101 


48 


3794 


AF000298 


Caenoriiabditis 
elegans 


weak similariiy to coUagens; glycine- 

and proline-rich 


175 


46 


3795 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


117 


40 


3796 

, . 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


185 


88 


3797 


D38112 


Homo sapiens 


cytochrome c oxidase subimit 3 


610 


84 


3798 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


156 


56 


3799 


AL3 59782 


brucei 


probable similar to ring-h2 finger 
protein rha la. 


71 


52 


3800 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8172. 


127 


77 


3801 


AF130079 


Homo sapiens 


PR02852 


164 


54 


3802 


AL3597S2 


Ttypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


118 


45 


3803 


U42580 


Paramecium 
bursaria 

Chlorella virus 1 


Pro-rich, PAPK (20X); similar to 
Arabidopsis anter-specific Pro-rich 
protem, corresponds to Swiss-Prot 
Accession Number P40602 


101 


36 


3804 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


173 


87 


3805 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


126 


72 


3806 


D38112 


Homo sapiens 


NADH dehydrogenase suhunit 4 


225 


85 


3807 


U88587 


Nicotiana alata 


120 kDa style glycoprotein 


118 


38 


3808 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


129 


82 


3809 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


67 


3810 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


213 


97 


3811 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


106 


63 


3812 


AF169974 


Homo sapiens 


serme racemase 


153 


56 


3813 


L27428 


Homo sapiens 


reverse transcriptase 


175 


43 


3814 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


109 


43 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smitli- 

Score 


% 


3815 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


78 


3816 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4433. 


112 


65 


3817 


AFH9901 


Homo sapiens 


PR02831 




55 


3818 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


99 


89 


3819 


AL 160371 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


91 


66 


3820 


X92485 


Plasmodium 
vivax 




108 


46 


3821 
3822 


^AK024455 
Y27854 


Homo sapiens 


FLJ00047 protein 


79 


59 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


178 


87 


3823 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ IDNO:387. 


99 


50 


3824 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCS 1 1 037. 


86 


94 


3825 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


138 


53 




AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


88 


3827 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


77 


61 


3828 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


139 


51 


3829 


K03208 


Homo sapiens 


salivary proline-rich protein precursor 


161 


41 


3830 


U44838 


Glycine max 


extensin 


108 


47 


3831 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


136 


71 


3832 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


205 


84 


3833 


AL163912 


Arabidopsis 
thaliana 


glycine-rich protein atGRP-7 


117 


37 


3834 


AF130089 


Homo sapiens 


PRO2550 


138 


45 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


92 


76 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


152 


70 


3837 


X56010 


Sorghum bicolor 


hydroxyproline-rich glycoprotein 


98 


37 


3838 


X68249 


Xenopus laevis 


Proline rich protein 


92 


66 






Homo sapiens 


cytochrome oxidase subunit 11 




92 


3840 


R63235 


Homo sapiens 


CNS neural thread protein HR4. 


186 


100 






Homo .apieii. 


Human secreted protein, SEQ ID NO: 
8172. 








Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


116 


62 


3843 


AF164614 


Homo sapiens 


envelope protein 


508 


77 


3844 


AFl 30051 


Homo sapiens 


PRO0898 


130 


73 


3845 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


144 


74 




G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 






3847 


Ml 3 100 


Rattus 
norvegicus 


unknown protein 


129 


46 


3848 


X97675 


Homo sapiens 


plakophilin 2b 


145 


67 


3849 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


141 


65 


3850 


K03206 


Homo sapiens 


salivary proline-rich protein precursor 


114 


42 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 


Identit 

y 


3851 


U93574 


Homo sapiens 


putative pi 50 


165 


82 


3852 


AF09093 1 


Homo sapiens 


PRO0483 


97 


78 


3853 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 

similar to pogo element 


279 


80 


3854 


U93570 


Homo sapiens 


putative pi 50 


135 


38 


3855 


M13100 


Rattus 
norvegicus 


unknown protein 


100 


57 


3856 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


166 


63 


3857 


Y48576 


Homo sapiens 


Human breast tumour-associated 
protein 37. 


117 


49 


3858 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


147 


45 


3859 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


150 


71 


3860 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


165 


69 


3861 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


131 


83 


3862 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


171 


81 


3863 


AK000241 


Homo sapiens 


unnamed protein product 


100 


48 


3864 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


121 


80 


3865 


AK024455 


Homo sapiens 


FLJ00047 protein 


124 


69 


3866 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


87 


64 


3867 


AFl 16712 


Homo sapiens 


PR02738 


92 


48 


3868 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


165 


62 


3869 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4497. 


128 


62 


3870 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


94 


75 


3871 


AK024455 


Homo sapiens 


FLJ00047 protein 


83 


72 


3872 


AJ242540 


Vol vox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


147 


44 


3873 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


113 


51 


3874 


AF090930 


Homo sapiens 


PRO0478 


120 


63 


3875 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


145 


70 


3876 


D88674 


Homo sapiens 


anti2yme inhibitor 


147 


75 


3877 


AF263744 


Homo sapiens 


erbb2-inteiacting protein ERBIN 


212 


37 


3878 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


140 


82 


3879 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4435. 


82 


35 


3880 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


144 


60 


3881 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NtP 


122 


60 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
4658. 


134 


75 


3883 


M8I321 


Macaca 
fascicularis 


proline-rich protein 


101 


41 


3884 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


109 


56 


3885 


AFO 13214 


Bos taurus 


acidic ribosomal phosphoprotein PO 


177 


94 ■ 


3886 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


77 
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SEQ ID 
NO: 


Accession No. 


Species 


»esertp«on 


Smith- 
Waterman 


% 








4484. 




— 




Y59860 


Homo sapiens 


Human nomial uterus tissue derived 
protein 23 . 




72 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 




72 


3889 


M64792 


Rattas . 
norvegicus 


salivary proline-rich protein 


104 


40 


3890 


AF156228 


Drosophila 
melanogaster 


salivary gland secretion protein 


104 


35 




V 11 s 


Homo sapiens 


— T , — -pj- -g 

Secreted protein encoded by gene 18 
Clone rio/\.CJ6 J . 


90 


47 


3892 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


138 


38 


3893 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


146 


65 




G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7871. 










Daucus carota 


put precursor 






3896 


AF090942 


Homo sapiens 


PRO0657 


135 


50 




YS6248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


195 


77 


3898 


X92485 


. Plasmodium 

vivax 


pval 


1 1 1 


62 




G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 








AL 160493 


Leishmania 
major 


probable (hhv-6) u 1 102, variant a 
DNA, complete virion genome 






3901 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


83 


3902 


AF090942 


Homo sapiens 


PRO0657 


134 


65 




AF220264 


Homo sapiens 








3904 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


53 


3905 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


85 


3906 


AK025047 


Homo sapiens 


unnamed protein product 


127 


73 


3907 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7124. 


135 


86 




G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 










norvegicus 


PR-Vbetal 








G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6735. 








AK024455 


Homo sapiens 


ri_ijuuu4/ proteui 






3912 


U93567 


Homo sapiens 


putative pi 50 


233 


63 




W54966 


Homo sapiens 


Synthetic human type III collagen 

SYN-C3. 






j914 


X94976 


Brassica napus 


cell wall-plasma membrane linker 
protem 


104 


36 


3915 


P92219 


Homo sapiens 
(human) 


CRl protein. 


125 


73 


^916 


X92485 


vivax 


~pval 


94 


64 


3917 


Y36495 


Homo sapiens 


Fragment of human secreted protein 

encoded by gene 27. 


105 


52 


3918 


AF118086 


Homo sapiens 


PRO 1992 


130 


87 


3919 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7124. 


125 


80 


3920 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


79 



232 
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SEQID 
NO: 




Species 


Description 


Smith- 


Identit 

7 














3921 


AF090930 


Homo sapiens 


PRO0478 


128 


64 




Q00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


75 


3923 


W48351 


Homo sapiens 


Human breast cancer related protein 


130 


65 


3924 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


134 


71 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


102 


58 






Homo sapiens 


The KIAA0153 gene product is related 
to a putative C.elegans gene encoded in 
cosmid F42A8. 


141 


100 


3927 


U21I23 


Drosophila 
melanogaster 


ena polypeptide 


117 


37 


3928 


X58438 


Mus musculus 


proline rich protein 


136 


36 




G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


152 


71 


3930 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


73 


3931 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


86 


48 


3932 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


110 


65 


3933 




Plasmodium 
vivax 


pval 


116 


36 


3934 


X97675 


Homo sapiens 


plakophilin 2b 


129 


82 


3935 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7339. 


84 


58 


3937 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


107 


56 


3938 


AF090895 


Homo sapiens 


PRO0117 


166 


68 


3939 


K02550 


Oncorhynciius 


70-kilodalton heat shock protein 


104 


38 


3940 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


74 






Homo sapiens 


chromosomal protein 






3942 
3943 


U63542 


Homo sapiens 


FAF protein 


135 


75 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


139 


80 






Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 




67 




AK 02445 5 


Homo sapiens 


F1J00047 protein 






3946 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


92 


52 


3947 


W48927 


Homo sapiens 


Schwannomin-binding proteir C- 
terminal fragment. 


103 


56 


3948 


AF252293 


Homo sapiens 


PAR3 


161 


44 


3949 


Y02749 


Homo sapiens 

... 


Human secreted protein encoded by 
gene 1 00 clone HNF1U96. 


125 


47 


3950 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63. 


128 


59 


3951 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


174 


87 


3952 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


152 


75 


3953 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


168 


64 


3954 


R95913 


Homo sapiens 


Neural thread protein. 


110 


54 


3955 


AF130089 


Homo sapiens 


PRO2550 


122 


63 


3956 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


48 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waferniari 


% 

Identit 






Homo sapiens 


Human secreted protein #75. 






3958 


Y86248 


Homo sapiens 


Human secreted" protein HCHPF68, 
SEQ ID NO: 1 63 . 


177 


61 


3959 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


69 


3960 


U93563 


Homo sapiens 


putative pI50 


227 


62 






Catliaranthus 






40 






Cloning vector 
pSportl 


beta-galactosidase alpha peptide 








GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 






3964 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

'^'^2^- 


121 


42 


3965 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

■72^ ^ 7^ -■ 


150 


66 


3966 


\/T1 1 Ortl 

Ml lyui 


^Ittus 

norvegicus 


proline-rich salivary protein 


3ol 


— 


3967 


AF229126 


Homo sapiens 


acetylcholinesterase collagein-like tail 
subunit isofbrm VII 


108 


36 





AJ006770 


_ _ 

Homo^sap^i'ens^ — 




— ^ 




3969 


AF130089 




PRO2550 


~T28 1 


82 


3970 


U93563 


Homo sapiens 


putative pi 50 


99 


48 




AK000496 


Homo sapiens 


unnamed protein product 






3972 


Z70684 




Caenorhabditis 
elegans 


F28D1.8 


108 


42 


3 "" 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 






3974 


AK024455 


Homo sapiens 


FLJ00047 protein 


132 


65 




AK024455 


Homo sapiens 


FL.T00047 protein 


128 


85 


3976 


X01918 


Drosophila 
melanogaster 


salivary gland glue protein 


99 


40 


3977 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


46 


3978 


AF229067 


Homo sapiens 


PADl-H protein 


107 


87 




M64793 


norvegicus 


salivary proline-rich protein 






3980 


S80119 


Rattus sp. 


reverse transcriptase homolog 


134 


49 


3981 


AF090901 


Homo sapiens 


PRO0195 


106 


94 


3982 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein KIO (NTD:g8I48) 


111 


42 


39^ 





Plasmodium 

Honf SI ions 


pval 


— — 

10- 


^^0 


3984 


I 


omo sapiens 


ir~na Td P — SEO ID NO- — 

67™^" secre e pro em, Q O. 


—r= 


■fil 


3985 


AB012162 


Homo sapiens 


APCL protein 


190 


42 


"lll^ 


AFO 1 6099 


Mus musculus 


endonuclease/reverse transcriptase 





— 




U93570 










3988 


D381 12 


Homo sapiens 


NADH^deliydrogenase subunit 5 


3o2 


1^ 


3989 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


244 


85 




V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


420 


92 




^Igg^jg 


Homo sapiens 


NADH dehydrogenase subunit 4 






3992 — 




tabacum 




"TT5 


"15 


3993 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


64 


3994 


AF090944 


Homo sapiens 


PRO0663 


99 


39 


3995 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7888. 


150 


75 


3996 


D38113 


Pan troglodytes 


cj-tochrome c oxidase subunit 1 


574 


87 


3997 


D38112 


Homo sapiens 


NADH dehydiogenase subunit 4 


488 


76 ■ 
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SEQID 
NO; 


Accession No. 


Species 


Description 


Smitli- 
Score 


% 

Identit 

y 


3998 


U52077 


Homo sapiens 


mariner transposase 


475 


70 


3999_ 


U93570 


Homo sapiens 


p40 


111 


34 






Mas musculus 


reverse transcriptase 


137 


43 


4001 


AFl 16712 


Homo sapiens 


PR02738 


105 


52 


4002 


AFl 13685 


Homo sapiens 


PRO0974 


125 


53 


4003 


W48351 


Homo sapiens 


Human breast cancer related protein 


92 


78 


4004 


AFl 16715 


Homo sapiens 


PR02829 


115 


73 






Homo sapiens 


NADH dehydrogenase subunit 2 




80 


4006 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


614 


86 


4007 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


106 


45 


4008 


U90268 


Homo sapiens 


Kritl 


152 


52 


4009 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


472 


96 


4010 


B03148 


Homo sapiens 


Human neuronal differentiation fector- 
1 (NDF-1). 


304 


100 


4011 


BOS 148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


472 


98 


4012 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


102 


50 


4013 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


73 


4014 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


118 


47 


4015 


U 12690 


Homo sapiens 


cytochrome oxidase subunit K 


522 


81 


4016 


G02480 
AFO 10144 


Homo sapiens 
Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 

neuronal thread protein AD7c~NTP 


147 


52 


106 


57 


4018 


AB003476 


Homo sapiens 




549 


91 


4019 


D38112 


Homo sapiens 


NADU dehydrogenase subunit 4 


391 


93 


4020 


tJ44949 


Xenopus laevis 


zona pellucida A glycoprotein homolog 


143 


32 


4021 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


571 


88 


4022 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


554 


87 


4023 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


165 


88 




AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


131 


38 


4025 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


583 


86 


4026 


AF090895 


Homo sapiens 


PRO0117 


104 


49 


4027 


AFl 3 0056 


Homo sapiens 


PRO 1 367 


80 


60 


4028 


X97675 


Homo sapiens 


plakophilin 2b 


157 


83 


4029 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor 1 68 
kDa subunit 


114 


35 


4030 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


110 


74 


4031 


D38112 


Homo sapiens 


ATPase subunit 6 


395 


80 


4032 


S70718 


Hemicentrotus 
pulcherrimus=se 
a urchins, tests, 
Peptide, 632 aa 


fibrillar collagen alpha 120 and 140 
chains 


104 


33 


4033 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


355 


66 




AC004497 


Homo sapiens 


MX2 


464 


58 


4035 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
44)4. 


151 


61 


4036 


[AF 11 7888 


Homo sapiens 


myosin-IXa 


113 


34 


4038 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID"NO:163. 


100 


48 


4039 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


153 


73 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 

Watcrman 

Score 


Identit 

y 


4040 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


338 


86 


4041 


AF116661 


Homo sapiens 


PRO 143 8 


136 


53 


4042 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


120 


84 


4043 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


493 


85 


4044 


M90516 


Homo sapiens 


glutamine:fructose-6-phosphate 
amidotransferase 


261 


72 


4045 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478, 


102 


64 


4046 


AF130089 


Homo sapiens 


PRO2550 


143 


78 


4047 


AF090942 


Homo sapiens 


PRO0657 


119 


41 


4048 


R95913 


Homo sapiens 


Neural thread protein. 


138 


47 


4049 


AF010144 


Homo sapiens 


neuronal tiiread protein AD7c-NTP 


1 11 


59 


4050 


X77962 


Tetrahymena 
thermophila 


fibrillarin 


105 


49 


4051 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4419. 


144 


71 


4052 


X02794 


Friend murine 
leukemia virus 


Pr65 


107 


37 


4053 


Y00994 


Homo sapiens 


Human CSR3 protein sequence. 


109 


38 


4054 


AL121585 


Homo sapiens 


bA504H3.1 (SNX5 (sorting nexin 5}) 


299 


74 


4055 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


314 


80 


4056 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7519. 


65 


54 


4057 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

SEQ ID NO: 163. 


187 


67 


4058 


AF205385 


Pan troglodytes 


NADH dehiydrogenase subunit 5 


202 


89 


4059 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


100 


4060 


U63542 


Homo sapiens 


FAP protein 


142 


73 


4061 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7868. 


158 


50 


4062 


Y95028 


Homo sapiens 


Human clone vp7 1 0RF2, SEQ ID 
NO: 128. 


123 


44 


4063 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


76 


65 


4064 


AF000298 


Caenorhabditis 
elegans 


weak similarity to coUagens; glycine- 
and proline-rich 


165 


46 


4065 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


147 


93 


4066 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


109 


76 


4067 


AB035523 


Gallus gallus 


avenain 


101 


48 


4068 


AC079829 


Arabidopsis 
tiialiana 


Pto kinase interactor, putative 


105 


43 


4069 


AF2 10651 


Homo sapiens 


NAG18 


97 


77 


4070 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


111 


43 


4071 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


158 


65 


4072 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


43 




AF038007 


Homo sapiens 


FICl 


153 


96 


4074 


M33228 


Trypanosoma 


ATPase 6 


103 


37 


4075 


U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


119 


46 


4076 


M81321 


Macaca 
fascicularis 


proline-rich protein 


104 


36 


4077 


X52235 


Homo sapiens 


ORFll 


192 


39 


4078 


G00328 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


126 


83 
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SEQ ID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 


Identit 

y 








4409. 






4079 


AB026542 


Homo sapiens 


WASP-family protein 


98 


41 


4080 


L 173 18 


Rattus 
norvegicus 


proline-rich proteoglycan 


105 


42 


4081 


AF 130051 


Homo sapiens 


PRO0898 


117 


42 


4082 


AF130089 


Homo sapiens 


PRO2550 


85 


79 


4083 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


117 


48 


4084 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


89 


40 


4085 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


163 


80 


4086 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4484. 


144 


74 


4087 


AF085691 


Homo sapiens 


multidrug resistance-associated protein 

3A 


197 


49 


4088 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6571. 


109 


62 


4089 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


139 


51 


4090 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


103 


65 


4091 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4718. 


159 


68 


4092 


X92485 


Plasmodium 
vivax 


pval 


95 


38 


4093 


U93569 


Homo sapiens 


p40 


205 


40 


4094 
4095 


L 16864 


African swine 
fever virus 


cd2 homologue 


98 


45 


X71413 


Homo sapiens 


ELEl 


675 


98 


4096 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8153. 


149 


60 


4097 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


116 


70 


4098 


AF090930 


Homo sapiens 


PRO0478 


149 


78 


4099 


AF090942 


Homo sapiens 


PRO0657 


124 


56 


4100 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


139 


62 


4101 


L26953 


Homo sapiens 


chromosomal protein 


104 


54 


4102 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7253. 


128 


70 


4103 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


94 


58 


4104 


U73199 


Mus musculus 


Rho-guanine nucleotide exchange 
factor 


370 


56 


4105 


AF090931 


Homo sapiens 


PRO0483 


104 


90 


4106 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


133 


72 


4107 


AFl 18082 


Homo sapiens 


PRO 1902 


145 


49 


4108 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


116 


50 


4109 


U93564 


Homo sapiens 


p40 


539 


91 


4110 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


144 


54 


4111 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
' 7888. 


138 


65 


4112 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


91 


73 


4113 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


120 


36 


4114 


M17522 


Paracoccus 


cytochrome cl precursor (EC 1.10.2.2) 


101 


41 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


Identit 

y 






denitrificans 








4115 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


121 


59 


4116 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


148 


73 


4117 


G01931 


Hoaio sapiens 


Human secreted protein, SEQ ID NO: 

6012. 


80 


85 


4118 


X04758 


Homo sapiens 


pro- alpha (V)collagen (AA 1099) 


106 


39 


4119 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4435. 


122 


47 


4120 


AF130089 


Homo sapiens 


PRO2550 


132 


68 


4121 


AF090942 


Homo sapiens 


PRO0657 


136 


48 


4122 


YD 1400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


115 


66 


4123 


W50193 


Homo sapiens 


Amino acid sequence of salivaiy 
protein CON-2. 


72 


48 


4124 


X92485 


Plasmodium 


pval 


104 


54 


4125 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


131 


59 


4126 


AK024455 


Homo sapiens 


FLJ00047 protein 


86 


61 


4127 


AFl 19855 


Homo sapiens 


PRO 1847 


99 


68 


4128 


S80119 


Rattus sp. 


reverse transcriptase homolog 


129 


39 


4129 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
4425. 


156 


62 


4130 


G00416 


Homo sapiens 


Human secreted protein. SHQ ID NO: 
4497. 


135 


72 


4131 


R86406 


Homo sapiens 


Human matrix metalloprotease 
MMPmla. 


108 


83 


4132 


L16461 


Chlamydomonas 
reinhardtii 


structural wall protein 


87 


37 


4133 


Y30713 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


232 


95 


4134 


Y27854 


Homo sapiens 


Human secreted protein encoded by 

gene No. 101. 


121 


69 


4135 


G01984 


Homo sapiens 


Human secreted protein, ST=Q ID NO: 
6065. 


135 


70 


4136 


K02576 


Homo sapiens i salivary prolinc-rich protein 1 


134 


42 


4137 


Y15173 


Human 

papillomavirus 
type 75 


E4 protein 


101 


38 


4138 


AF130089 


Homo sapiens 


PRO2550 


130 


35 


4139 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


111 


38 


4140 


Y36203 


Homo sapiens 


Human secreted protein #75. 


120 


48 


4141 


AF130089 


Homo sapiens 


PRO2550 


139 


36 


4142 


S80119 


Rattus sp. 


reverse transcriptase homolog 


170 


47 


4143 


U54636 


Staphylococcus 
aureus 


protein A 


135 


35 


4144 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


157 


55 




G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


112 


54 


4146 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


117 


37 


4147 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


98 


61 


4148 


AF130051 


Homo sapiens 


PRO0898 


124 


88 


4149 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


67 
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SEQID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 
_Scorc 


Wentit 

y 








~4754 






"4T5O 


Z29481 


Homo sapiens 


" rp Tj-r- 

j-hyaroxyanthranilic acid dioxygenase 


^44 


"62 


4151 


M81321 


Macaca 


proline-rich protein 


172 


50 


4152 


G00416 , 


Homo sapiens 


Human secreted protein, SEQ TD NO: 


129 


66 


4153 


AFl 19851 


Homo sapiens 


PR01722 


128 


62 




AB037826 


Homo sapiens 


KIAA1405 protein 






4155 


M12099 


Mus rausculus 


proline-rich protein 


124 


37 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 






4157 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


124 


41 


4158 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4446. 


134 


83 


4159 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


141 


74 


4160 


M76546 


Helianthus 


hydroxyproline-rich protein 


135 


39 


4161 


U35730 


Mus musculus 


jerky 


116 


31 


4162 


Z81525 


Caenorhabditis 
elegans 


contains similarity to Pfam domain; 
PFO 1391 (Collagen triple helix repeat 
(20 copies)), Score=35.5, E-value=4e- 

07, N=2 


130 


50 


4163 


AFl 13685 


Homo sapiens 


PRO0974 


129 


45 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 






4165 


U4997J 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


137 


70 


4166 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


175 


72 




G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 








AB000505 


Daucus carota 


unnamed protein product 






4169 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


113 


30 


4170 


AFl 19900 


Homo sapiens 


PR02822 


154 


59 


4171 


AE001381 


Plasmodium 


hypothetical protein 


106 


33 


4172 


AE000034 


Mycoplasma 
pneumoniae 


bifimctional threonine dehydrogenase; 
galactosylttansferase 


100 


28 


~4fr — 


^^2jgQ 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


Too 




4174 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


68 


4175 


AK024455 


Homo sapiens 


FLJ00047 prbtem 


116 


60 


4176 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


94 


44 


- 


AK024455 


Homo sapiens 


FLJ00047 protein 






4178 


AY007557 


Mycobacterium 
avium subsp. 
paratubercuJosis 


fibronectin-attachment protein FAP-P 


98 


42 




I U14UU 




Secreted protein encoded by gene 18 
clone HNHF029. 






4180 


M12100 


Mus musculus 


proline-rich protein MP-3 


132 


44 


4181 


AF090930 


Homo sapiens 


PRO0478 


138 


63 


4182 


G03172 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7253. 


152 


75 


4183 


Y86248 


Homo sapiens 


Hiunan secreted protein HCHPF68, 
SEQIDNO:163. 


184 


84 
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SEQID 


Accession No. 




Description 


Smith- 


% 


NO: 








Waterman 
Score 


Identit 

y 


4184 


K03205 


Homo sapiens 


sah'vary proline-rich protein precursor 


150 


43 


4185 


G04067 


Homo sapiens 


Human secreted protein, SEQ IDNO: 


128 




4186 


Ar090944 


Homo sapiens 


PRO0663 


124 


49 


4187 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68. 


93 


62 








SEQ ID NO: 163. 






4188 


D38112 


Homo sapiens 


ATPase subunit 6 


4j9 


78 


4189 


AFl 30051 


Homo sapiens 


PRO0898 


89 


57 


4190 


AF090930 


Homo sapiens 


PRO 04 7 8 


137 


83 


4191 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


100 


35 


4192 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


212 


91 


4193 


Y13247 


Homo sapiens 


FB19 protein 


142 


46 


4194 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


268 


93 


4195 


AFl 24729 


Mus musculus 


acinusS' 


140 


42 


4196 


AJ277425 


Globodera 


putative cuticular collagen 


156 


43 






pallida 








4197 


¥86248 


Homo sapiens 


Human secreted protein HCHPF68, 


161 


77 








SEQ ID NO: 163. 






4198 


Z70684 


Caenorliabditis 


F28D1.8 


97 


45 






elegans 








4199 


M64793 


Rattus 


salivary proline-rich protein 


119 


36 






norvegicus 








4200 


X62379 


Mus musculus 


form in, isoform IV 


115 


48 


4201 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


224 


93 


4202 


Ml 4820 


Trypanosoma 


0RF2 bases 1807-2850; first start 


103 


29 






brucei 


codon at 2032; putative 






4203 


U93572 


Homo sapiens 


p40 


182 


34 


4204 


D82026 


Silene latifolia 


glycine-rich protein 


98 


49 


4205 


AF003151 


Caenorhabditis 


contains similarity to an RNA 


107 


40 






elegans 


recognition motif 






4206 


L07545 


Leishmania 


A 'c' was inserted aflernt 369 (=nt 


101 


35 






tarentolae 


10459 in genomic sequence (M10126)) 












to correct -1 frameshifl probably due to 












gel compression 






4207 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


104 


38 


4208 


AF229126 


Homo sapiens 


acetylcholinesterase collagen-like tail 


96 


75 








subunit isoform VII 






4209 


AF090895 


Homo sapiens 


PRO0117 


142 


68 


4210 


X92485 


Plasmodium 
vivax 


pval 


104 


35 


4211 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


173 


73 








6909. 






4212 


G02828 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


151 


69 








6909. 






. 4213 


AFl 15549 


Homo sapiens 


Wiskott-Aldrich Sjoidrome protein 


122 


43 


4214 


AFO 90944 


Homo sapiens 


PRO0663 


97 


39 


4215 


X12544 


Homo sapiens 


3 HLA-DR B protein precursor (AA - 


111 


60 








29 to 267) 






4216 


M69065 


human 


ORFl 


89 


38 






herpesvirus 2 








4217 


AF090895 


Homo sapiens 


PRO0117 


133 


78 






Homo sapiens 


Neuron-associated protein. 






"4219 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


71 


47 








SEQIDNO:163. 






4220 


U93567 


Homo sapiens 


p40 


170 


35 


4221 


D00570 


Mus musculus 


open reading frame (25 1 AA) 


108 


28 


4222 


X97675 


Homo sapiens 


plakophilin 2b 


122 


47 


4223 


AF270937 


Plutella 


PxORF73 peptide 


99 


54 






xylostella 









240 
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SEQ ID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


Identit 






granulovirus 








4224 


AF130089 


Homo sapiens 


PRO2550 


144 


69 




G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 




82 


4226 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


99 


62 


4227 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


172 


59 


4228 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


161 


74 


4229 


AL049537 


Homo sapiens 


dJ1164I10.1 (brefeldin A-inhibited 
guanine nucleotide-exchange protein 2) 


212 


90 


4230 


AFl 18086 


Homo sapiens 


PRO 1992 


150 


69 




M8 1321 


Macaca 
fascicularis 


proline-rich protein 




41 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 




68 


4233 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


132 


56 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 






4235 


R13556 


Homo sapiens 


Protein encoded downstream of Mic_M 
oncoprotein. 


124 


32 


4236 




Homo sapiens 


Hum^ secreted protein, SEQ ID NO: 


119 


42 


4237 


X93498 


Homo sapiens 


21 -Glutamic Acid-Rich Protein 


368 


56 






Homo sapiens 


Neural tliread protein. 






4239 

4240 
4241 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


63 


X97675 


Homo sapiens 


plakophilin 2b 


108 


78 ■ 


\ j6156 


Homo sapiens 


Human secreted protein #28. 


137 


65 




AK000385 


Homo sapiens 


unnamed protein product 


131 


33 


4243 


AJ252253 


human 

herpesvirus 2 


glycoprotein G-2 


107 


36 


4244 


9447 


Homo sapiens 


Human PR01556 (UNQ7o4) ammo 
acid sequence SEQ ID NO:372. 






4245 


Y05398 


Homo sapiens 


Human TIE ligand NL8 protein 

^tf4^°'^^ 


424 


84 


"4946 


LOGO 16 


15 




— — 


— 


~4247 


AF134579 


Zeamays"^"^ 


M-abino'^alactan protein 







4248 


734465 


Zea mays 


extensin-like protein 


123 


33 




AFO 1 0 1 44 










4250 


,102695 


Plasmodium 


circumsporo/oite protein 


110 


33 


^25l 


AK0''4455 


Homo sapiens 


FLJ00047 protein 


^36 


^65 


4252 


U02570 


Homo sapiens 


CDC42 GTPase-activating protein 


566 


93 








cytochrome c oxidase subunit 1 






4254 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


111 


63 


4255 


M61143 


Bovine 
herpesvirus 1 


latency-related open reading frame 2; 


101 


42 




AFO 101 44 


Homo sapiens 


neiu'onal thread protein AD7c-NTP 






4257 


Y06294 


Homo sapiens 


Human transcription regulator MOP6 
partial sequence. 


410 


79 


4258 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


112 


61 


4259 


D90252 


Human 

papillomavirus 
type 5b 


E4 protein 


110 


32 



241 
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SEQ ID 


Accession No. 


Species 


Description 


Smitli- 

Waterman 

Score 


Identit 

y 




X90569 


Homo sapiens 


elastic titin 


660 


95 




AF0909jO 


Homo sapiens 


PRO0478 






4262 


AF132209 


Homo sapiens 


prepro-major basic protein liomolog 


422 


72 




AF043 1 02 


Pneumocystis 


surface glycoprotein A 






4264 


G03790 



Homo sapiens 
. 


Human secreted protein, SEQ ID NO: 


132 







AF090895 


Homo sapiens 


——Fr 

PRODI 17 


159 


62 






Homo sapiens 


PRO0478 


143 


50 


4267 


GOB 133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5 ii i 


119 


34 


"4268 


P70494 


Homosa iens 

omo sapiens 


Sec|uence of human B-cell growui 
factor (BCOF). 


^ 






D23660 


Homo sapiens 


ribosomal protein 


493 


73 




AFI30089 


Homo sapiens 


PT> r\o^<f\ 


119 


39 


"4571 — 












4272 


AB033615 


Mus musculus 


phospholipase C-L2 


485 


79 




AF006082 


Homo sapiens 


iJ 






4274 


Y 12293 


Mus musculus 




104 


33 






Bos taurus 


cGMP-dependent protein kinase 
(isoform I beta) 


561 


81 


4276 


API 19855 


Homo sapiens 


PRO 1847 


155 


71 


4277 


J01415 


Homo sapiens 


MTND4 


372 


63 


4278 


AB021078 


Plasmid Collb- 


ybbA 


101 


30 


4279 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


73 


4280 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4409. 


110 


73 




AFl 19851 


Homo sapiens 


PRO 1722 


110 


60 


4282 


D38116 


Pan paniscus 


NADH dehydrogenase subunit 1 


346 


92 






Homo sapiens 


DIA-12C protein 


109 


52 


4284 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of tlie 
human tumor suppressor gene ZAP- 1 . 


122 


76 


4285 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


121 


42 


4286 


AK024455 


Homo sapiens 


FLJ00047 protein 


135 


57 


4287 


Y11525 


Homo sapiens 


CCAAT/enhancer binding protein 


100 


32 


4288 


AF033122 


Homo sapiens 


non-p53 regulated PA26-T1 nuclear 
protem 


128 


81 


4289 


W48351 


Homo sapiens 


Human breast cancer related protein 


111 


39 


4290 


W48351 


Homo sapiens 


Human breast cancer related protein 

BCRB2. v-r—. ; 


140 


58 


4291 




-~j — — 

Volvox carteri i. 
nagariensis 


hydroxyprohne-rich glycoprotein DZ- 
HRGP 










Zea mays 


proiin rich protein 






4293 


U09367 


Homo sapiens 


zinc finger protein ZNF136 


461 


60 




AF038960 


Homo sapiens 


SKDl homolog 






4295 


AL390114 


Leislimania 


extremely cysteine/valine rich protem 


128 


54 


4296 


U66561 


Homo sapiens 


Icruppel-related zinc finger protein 


512 


89 


4297 


AF043706 


Caenorhabditis 
elegans 


contains similarity to granulins 


104 


55 


4298 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


65 


4299 


AF152510 


Homo sapiens 


protocadherin gamma A3 short form 
protein 


520 


87 



242 
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SEQID 
NO: 




Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


4300 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


142 


72 


4301 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


109 


50 


4302 


AL035440 


Arabidopsis 


putative protein 


103 


31 


4303 


L05500 


Homo sapiens 


adenylyl cyclase 


655 


96 




AF1167I2 


Homo sapiens 








4305 


D 13757 


Homo sapiens 


am idophosphoribosy Itransferase 


182 


92 


4306 


L20450 


Mus musculus 


DNA-binding protein 


470 


64 


4307 


AF2 13386 


Mus musculus 


ATP-binding cassette protein 


175 


97 


4308 


X76850 


Mus musculus 


MAP kinase-activated protein kinase 2 


154 


69 






Rattus sp. 


reverse transcriptase homolog 




55 


4310 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor 1 68 
IcDa subunit 


110 


32 


4311 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


58 


4312 


Z46236 




Iceratinocyte growth factor 


210 


75 


4313 


AB020700 


Homo sapiens 


KIAA0893 protein 


569 


87 


4314 


AF124727 


Homo sapiens 


acinusS 


498 


88 


4315 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


88 


32 


4316 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


122 


53 




API 19900 


Homo sapiens 


PR02822 


135 


58 


4318 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/fork head 
homolog 1 1 A 


104 


70 


4319 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


51 


4320 


AB026054 


Homo sapiens 


brain finger protein 


412 


90 


4321 


AF090944 


Homo sapiens 


PRO0663 


133 


71 


4322 


AF165513 


Homo sapiens 


vacuolar protein sorting 45 isoform 


712 


97 


4323 


Y99418 


Homo sapiens 


Human PR013I7 (UNQ783) amino 
acid sequence SEQ ID NO:277. 


532 


97 


4324 




m.Jor 




142 


37 


4325 


00^807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


128 


59 


4326 


W40054 


Homo sapiens 


P300/CBP-associated transcriptional 
cofactor P/CAF C-terminus. 


175 


94 


4327 


AF2 17411 


Homo sapiens 


neuroligin 3 isoform HNL3 


549 


84 


4328 


U94836 


Homo sapiens 


ERPROT 213-21 


175 


87 


4329 


Y20763 


Homo sapiens 


Human neuro£ilament-M mutant 
protein fragment 45. 


501 


87 


4330 


AB037745 


Homo sapiens 


KIAA 1324 protein 


1014 


99 


4331 


M91563 


Rattus 
norvegicus 


NMDA receptor subtype 2C 


116 


39 


4332 


G03704 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7785. 


286 


76 


4333 


AC006841 


Arabidopsis 
thaliana 


Mutator-like transposase 


130 


70 


4334 


AP000373 


Arabidopsis 
thaliana 


jasmonate inducible protein; 
myrosinase binding protein-like 


130 


53 


4335 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
concqjtual translation supplied by 


377 


90 


4336 


AJ277426 


Globodera 
pallida 


putative cuticular collagen 


112 


37 



243 
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SEQID 


Accession No. 










NO: 








Waterman 


Identit 










Score 


y 


4337 


U I 6802 


Rattus 


Ca2+-dependeiit activator protein; 


560 


79 






norvegicus 


calcium-dependent actin-binding 












protein 






4338 


Y27907 


Homo sapiens 


Hmnan secreted protein encoded by 


118 


56 








gene No. 119. 






4339 


AB024520 


Homo sapiens 


notch4 


93 




33 


4340 


AF 128406 


Homo sapiens 


nuclear prelamin A recogpition fector 


241 


100 


4341 


W19771 


Homo sapiens 


Beta-1 integrin modulator B171 . 


169 


100 


4342 


AL096828 


Homo sapiens 


dJ963E22.1 (Novel protein similar to 


666 


88 








NY-REN-2 Antigen) 










Homo sapiens 


uKATP-1 




80 


4344 


U84487 


Homo sapiens 


CX3C chemokine precursor 


371 


78 


4345 


U93569 


Homo sapiens 


putative pi 50 


111 


47 


4346 


AL1617S5 


Streptomyces 


putative serine/threonine protein kinase 


102 


30 






coelicolor A3(2) 








4347 


AK026162 


Homo sapiens 


unnamed protein product 


377 


98 


4348 


AF221759 


Homo sapiens 


Maml 


314 


47 


4349 


AFl 65926 


Homo sapiens 


NUP155 


147 


84 


4351 


U50185 


Rattus 
non'egicus 


PP-IM 


144 


52 


4352 


D50455 


Rattus 


phodpholipase C delta4 


196 


70 






norvegicus 








4353 


X54131 


Homo sapiens 


protein-tyrosine phosphatase 


261 


87 


4354 


AF151850 


Homo sapiens 


CGI-92 protein 


255 


92 


4355 


G03996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


95 








8077. 






4356 


U93564 


Homo sapiens 


putative pi 50 


171 


72 


4357 


AP001507 


Bacillus 


unknown conserved protein 


159 


34 






halodurans 






4358 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


60 








7868. 






4359 


AF072697 


Mus musculus 


SHYC 


177 


97 


4360 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


69 








7870. 






4361 


M98776 


Homo sapiens 


keratin 1 


449 


77 


4362 


AK024436 


Homo sapiens 


FLJ00026 protein 


671 


91 


4363 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


155 


49 








clone HPMBQ32. 






4364 


D13896 


Rattus 


cytoplasmic dynein heavy chain 


327 


92 






norvegicus 








4365 


X79389 


Homo sapiens 


glutathione transferase Tl 


164 


96 


4366 


AB006458 


Mus musculus 


alpha-D-mannosidase 


177 


56 


4367 


X01060 


Homo sapiens 


put. transferrin receptor (aa 1-760) 


160 


96 


4368 


W48351 


Homo sapiens 


Human breast cancer related protein 


103 


54 








BCRB2. 






4369 


AL353715 


Homo sapiens 


bK3 1 84A7.3 . 1 (helicase-like protein 


485 


100 








NHL) 






4370 


AF001631 


Oryctolagus 


glucose-regulated protein GRP94 


118 


92 






cuniculus 








4371 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


80 








4497. 








AL121741 


Schizosaccharom 


putative negative regulator of vesicle 


200 


41 






yces pombe 


formation 






4373 


M96982 


Homo sapiens 


U2 snRNP auxiliary factor small 


279 


70 








subunit 






4374 


U47741 


Homo sapiens 


CREB-binding protein 


123 


96 


4375 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


127 


75 








gene 22 clone HMSJWl 8. 






_4376_j 


Y12781 


Homo sapiens 


transducin (beta) like 1 protein 


1056 


90 



244 
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SEQ ID 




Species 


Description 






NO: 








Waterman 
Score 


Identit 

y 


4377 


W69424 


Homo sapiens 


Human secreted protein bgl40_l . 


183 


97 


4378 


AL096700 


Homo sapiens 


dJ499B10.2 (phosphorylase kinase. 


639 


86 








alpha 2 (liver) (PYK)) 






4379 


AF 1773 90 


Manduca sexta 


antennal specific membrane protein 


378 


51 








AMP 






4380 


X97675 


Homo sapiens 


plakophilin 2b 


156 


75 


4381 


R33713 


Homo sapiens 


PgllOl. 


104 


100 


4382 


ABO 15473 


Arabidopsis 
thaliana 


gene_id:MCM23. l~unknown protein 


113 


61 


4383 


AFl 16715 


Homo sapiens 


PR02829 


133 


50 


4384 


AL357472 


Homo sapiens 


VPS33B 


676 


99 


4385 


AF09093 1 


Homo sapiens 


PRO0483 


155 


58 


4386 


U92645 


Gecarcinus 


alpha- 1 -tubulin 


511 


75 






lateralis 






4387 


Y36095 


Homo sapiens 


Extended human secreted protein 


202 


90 








sequence, SEQ ID NO. 480. 






4388 


M64923 


Bos taurus 


CIO protein 


218 


95 


4389 


X98264 


Homo sapiens 


M-phase phosphoprotein 4 


197 


100 


4390 


AK000264 


Homo sapiens 


unnamed protein product 


239 


73 


4391 


Ml 2937 


Homo sapiens 


ferritin heavy subunit 


662 


84 


4392 


P92219 


Homo sapiens 


CRl protein. 


116 


80 






(human) 








4393 


X16135 


Homo sapiens 


L protein (AA 1-558) 


759 


99 


4394 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


58 








4478. 






4395 


AC005591 


Homo sapiens 


PkB-like 


170 


97 


4396 


AF161426 


Homo sapiens 


HSPC308 


313 


77 


4397 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


146 


61 








gene 22 clone HMSJW18. 






4398 


AFl 04921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


679 


88 


4399 


AF257330 


Homo sapiens 


COBW-like protein 


586 


90 


4400 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


65 








7888. 






4401 


AFl 19851 


Homo sapiens 


PRO 1722 


101 


79 


4402 


M55542 


Homo sapiens 


guanylate binding protein isofonn I 


230 


76 


4403 


Y07752 


Volvox carteri 


pherophorin-S 


459 


88 


4404 


W48351 


Homo sapiens 


Human breast cancer related protein 


113 


44 








BCRB2. 






4405 


U07786 


Sus scrofa 


beta actin 


519 


90 


4406 


A06977 


Homo sapiens 


albimiin 


586 


88 


4407 


L20755 


Cuscuta reflexa 


hybrid proline-rich protein; cytokinin- 


112 


41 








induced;haustona 






4408 


AB002299 


Homo sapiens 


KIAA0301 


612 


98 


4409 


AB001424 


Mus musculus 


KIF17 


104 


47 


4410 


M88108 


Homo sapiens 


p62 


574 


92 


4411 


AL121673 


Homo sapiens 


bA3d5P22.2 (novel protein) 


415 


71 


4412 


AF064553 


Mus musculus 


NSDl protein 


224 


64 


4413 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


107 


36 








similar to pogo element 






4414 


A06977 


Homo sapiens 


albumin 


505 


82 




A06977 


Homo sapiens 


albumin 


596 


90 


4416 


Y87116 


Homo sapiens 


Human secreted protein sequence SEQ 


123 


50 








IDNO:155. 






4417 


AL359782 


Trypanosoma 


possible (hhv-6) ul 1 02, variant a dna, 


103 


85 






brucei 


complete virion genome. 






4418 


G03053 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


54 








7134. 






4429 


AF049606 


Mus musculus 


transcription factor NF-ATc isofonn b 


114 


90 
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SEQID 


Accession No. 




Description 






NO: 








Waterman 


Identit 










Score 


y 


4420 


Z11190_cdl 


Homo sapiens 


ll-DEC-1998 Interleukin-3 coding 


354 


97 








sequence from b2HFLS20W cDNA 












library. 






442] 


W64469 


Homo sapiens 


Human secreted protein fi'om clone 


203 


100 








CW795_2- 






4422 


lvll2523 


Homo sap-ens 


alloalbuinin Venezia 


350 


94 


4423 


AF 13 0077 


Homo sapiens 


PR02619 


561 


89 


4424 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


161 


64 








gene 44 clone HTDAD22. 






4425 


Y91577 


Homo sapiens 


Human secreted protein sequence 


393 


91 








encoded by gene 2 SEQ ID NO:250. 






4426 


A09561 


synthetic 


human serum albumin 


651 


96 






construct 








4427 


AL133215 


Homo sapiens 


bA108L7.5 (novel protein similar to 


392 


100 








Plasmodium POMl and C. elegans 












F46G11.1 (Tr:Q20485)) 






4428 


W33663 


Homo sapiens 


Human puromycin-sensitive 


172 


96 








aminopeptidase (PSA)-68. 






4429 


AB021654 


Homo sapiens 


DD2/bi]e acid-binding 


184 


81 








protein/AKRlC2/3alpha- 












hydroxysteroid dehydrogenase type 3 






4430 


W63683 


Homo sapiens 


Human secreted protein 3. 


114 


42 


4431 


AY008763 


Homo sapiens 


sentrin/SUMO-specific protease 


447 


96 


4432 


U52965 


Homo sapiens 


ENX-1 


176 


94 


4433 


AF 180470 


Mus musculus 


Kiaa0575 


423 


77 


4434 


XI 7206 


Homo sapiens 


put. LLRep3 protein (AA 1-221) 


581 


99 


4435" 


Y70929 


Homo sapiens 


Human zilla4 splice variant protein. 


621 


100 


4436 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


71 








4497. 






4437 


A06977 


Homo sapiens 


albumin 


610 


97 


4438 


A00279 


synthetic 


Human serum albumin 


621 


94 






construct 








4439 


L29028 


Chlamydomonas 


amino acid feature; N-glycosylation 


105 


36 






eugametos 


sites, aa 41 .. 43, 46 „ 48, 51 ,. 53, 72 ,. 












74, 107 .. 109, 128 .. 130, 132.. 134, 












158 .. 160, 163 .. 165; amino acid 












feature: Rod protein domain, aa 169 .. 












340; amino acid feature: globular 












protein domain, aa 32 ,, 168 






4440 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


65 


4441 


U49973 


Homo sapiens 


ORFl; ]V[ER37; putative transposase 


133 


69 








similar to pogo element 






4442 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


60 








6953. 






4443 


AFl 13685 


Homo sapiens 


PRO0974 


101 


54 


4444 


W03627 


Homo sapiens 


Human follicle stimulating hoimone 


181 


42 








GPR N-terminal sequence. 






4445 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


76 








7871. 






4446 


G004 12 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


54 


- 






4493. 








L02867 


Homo sapiens 


paraneoplastic antigen 






4448 


D38435 


Homo sapiens 


homologue of yeast PMSl 


108 


66 


4449 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


65 








4409. 






4450 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


68 


4451 


AF220264 


Homo sapiens 


MOST-1 


101 


40 


4452 


AF130079 


Homo sapiens 


PR02852 


97 


59 


4453 


AFl 16715 


Homo sapiens 


PR02829 


139 


69 
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SEQID 




Species 


Description 






NO: 








Waterman 
Score 


Identit 

y 


4454 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


53 








7888. 






4455 


AF130087 


Homo sapiens 


PR024I1 


108 


73 


4456 


AF279891 


Homo sapiens 


dead box protein 15 


108 


58 


4457 


L08258 


Strongylocentrot 


Icinesin light chain isoform 4 


176 


94 






us purpuratus 








4458 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


78 








4493. 






4460 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


74 








7253. 






4461 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


52 








7882. 






4462 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


118 


72 


4463 


AF 1 19900 


Homo sapiens 


PR02822 


144 


80 


4464 


M15530 


Homo sapiens 


B-cell growth factor 


92 


76 


4465 


Y17833 


Human 


env protein 


107 


62 






endogenous 












retrovirus K 








4466 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


78 


4467 


AF090895 


Homo sapiens 


PRO0117 


112 


79 


4468 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


64 








8153. 






4469 


X97675 


Homo sapiens 


plakophilin2b 


119 


78 


4470 


AFII8081 


Homo sapiens 


PRO 1900 


119 


74 


4471 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


60 








7645. 






4472 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


113 








7795. 






4473 


Y86248 


Homo sapiens 


Human seciBted protein HCHPF68, 


205 


73 






SEQ ID NO: 163. 






4474 


AF130114 


Homo sapiens 


PR02459 


121 


67 


4475 


AFI78534 


Homo sapiens 


talin 


213 


67 


4476 


Y27907 


Homo sapiens 


Human secreted protein encoded by 


122 


92 








gene No. 119. 






4477 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


99 


66 








clone HCACJ81. 






4478 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


142 


62 








gene No. 107. 






4479 


G00689 


Homo sapiens 


Human secreted protein, SEQID NO: 


153 


55 








4770. 






4480 


AF2 17374 


Acanth aster 
planci 


cytochrome oxidase subunit I 


130 


100 


4481 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


57 








7795. • 






4482 


Y20983 


Homo sapiens 


Human glial fibrillary acidic protein 


110 


63 








GFAP wild type fragment 9. 






4483 


AF221552 


Or>'za sativa 


proline-rich protein RiP- 15 


L119 


33 


4484 


L25941 


Homo sapiens 


integral nuclear envelope inner 


110 


84 








membrane protein 






4485 


AF130089 


Homo sapiens 


PRO2550 


161 


81 


4486 


U39742 


Caenorhabditis 


coded for by C. elegans cDNA 


98 


72 






elegans 


yk25e5.3; coded for by C. elegans 












cDNA yk25e5.5; similar to repeat 












guanylate kinase domain of D. 












melanogaster ]ethal(l) discs large- 1 












tumor suppressor protein 












(SP:DLGl_DROME, P3 1007) and R. 












norvegicus postsynaptic density protein 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Watcrman 


Identit 








95 (PSD-95) (SP:PSD9_RAT, P31016) 






4487 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


89 


81 


4488 


X97675 


Homo sapiens 


plakophilin 2b 


127 


88 


4489 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


105 


71 


4490 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


160 


64 


4491 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


77 


4492 


U83280 


Leishmania 
donovani 


39 kDa antigen 


107 


76 


4493 


AF023142 


Homo sapiens 


pre-mRNA splicing SR protein rA4 


141 


40 






Homo sapiens 


salivary proline-ricli protein 1 


119 


40 


4495 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


98 


45 


4496 


AF130089 


Homo sapiens 


PRO2550 


132 


75 


4497 


AF090930 


Homo sapiens 


PRO0478 


160 


76 


4498 


API 19900 


Homo sapiens 


PR02822 


158 


55 


4499 


AFl 16661 


Homo sapiens 


PR01438 


126 


44 


4500 


AFl 16661 


Homo sapiens 


PROI438 


118 


42 


4501 


AL 160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


113 


48 


4502 


M81321 


Macaca 
fascicularis 


proline-rich protein 


154 


44 


4503 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


154 


96 


4504 


D44596 


Saccliaromyces 
cerevisiae 


Mdjlp heat sliock protein 


93 


42 


4505 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


163 


76 


4506 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


207 


93 


4507 


U93564 


Homo sapiens 


p40 


520 


86 


4508 


Y02999 


Homo sapiens 


Fragment of human secreted protein 

encoded by gene 121. 


135 


55 


4509 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


77 


4510 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718, 


146 


66 


4511 


L27428 


Homo sapiens 


reva^ transcriptase 


143 


87 


4512 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


105 


80 


4513 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7868. 


97 


75 


4514 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


60 


4515 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 
RHAla(AF078683) 


120 


73 


4516 


AF130051 


Homo sapiens 


PRO0898 


141 


86 


4517 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


185 


94 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


130 


100 


4519 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


68 


4520 


AF210651 


Homo sapiens 


NAG 18 


161 


88 


4521 


Y02693 


Homo sapiens 


Human secreted protem encoded by 
gene 44 clone HTDAD22. 


140 


68 


4522 


AF2 17374 


Acanth aster 
planci 


cytochrome oxidase subunit I 


131 


76 
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SEQ ID 
NO: 


Accession No. 




Description 


Waterman 


y 


4523 ■ 


U72520 


Mus musculus 


mena protein 


102 


42 


4524 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


129 


75 


4525 


AL 160493 


Leishmania 


probable (hhv-6) ulI02, variant a 
DNA, complete virion genome 


108 


84 


4526 


G03043 


Homo sapiens 


Human secreted protein, SEQ TD NO: 

7124. 


139 


76 


4527 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


138 


89 


4528 


AF090931 


Homo sapiens 


PRO0483 


91 


88 


4529 
4530 


Y86248 
002515 


Homo sapiens 
Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


111 


51 


Human secreted protein, SEQ ID NO: 
6596. 


130 


92 


4531 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


123 


64 


4532 


L26953 


Homo sapiens 


cliromosomal protein 


102 


79 


4533 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6913. 


88 


54 


4534 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7284. 


122 


40 


4535 


AF090944 


Homo sapiens 


PRO0663 


113 


53 


4536 


U 15647 


Mus musculus 


expressed in select embryonal 
carcinoma cells and testicular cells; 
similar to Swiss-Prot Accession 
Number PI 1260 


119 


35 


4537 


L13858 


Homo sapiens 


guanine nucleotide exchange factor 


150 


81 


4538 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4438. 


138 


68 


4539 


AF090930 


Homo sapiens 


PRO0478 


104 


53 


4540 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


112 


64 


4541 


K02401 


Homo sapiens 


chorionic somatomammotropin 


572 


96 


4542 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


171 


67 


4543 


J04208 


Homo sapiens 


inosine-5'-monophosphate 
dehydrogenase (EC 1.1.1.205) 


455 


92 


4544 


K02401 


Homo sapiens 


chorionic somatomammotropin 


397 


88 


4545 


Y08062 


Homo sapiens 


Human PR0245 protein firagment 
derived from DNA35638. 


155 


66 


4546 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


161 


68 


4547 


R24056 


Homo sapiens 


hGH variant #8 - 172Arg 174Thr 
176Tyr 178His. 


117 


52 


4548 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


111 


68 


4549 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


124 


55 


4550 


AF119855 


Homo sapiens 


PR01847 


152 


68 


4551 


G03172 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7253. 


153 


70 


4552 


M59217 


Homo sapiens 


alpha- 1 type XIII collagen 


98 


57 






Homo sapiens 


Human cisplatin resistance protein. 


136 


96 


4554 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


72 


4555 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


91 


60 


4556 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


70 


4557 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


101 


44 
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SEQID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


Idcntit 








isoforai 






4558 


AF130089 


Homo sapiens 


PRO2550 


150 


85 


4559 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


161 


56 


4560 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


81 


64 


4561 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

6619. 


115 


55 


4562 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


106 


82 


4563 


AF090931 


Homo sapiens 


PRO0483 


145 


73 


4564 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


358 


77 


4565 


AFI30089 


Homo sapiens 


PRO2550 


175 


85 


4566 


Z93244 


Homo sapiens 


bKI 16F5.2 (PUTATIVE RhoGAP 
(CDC42 GTPAse Activating Protein) 
LIKE protein) 


125 


61 


4567 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


142 


56 


4568 


AF130089 


Homo sapiens 


PRO2550 


125 


75 


4569 


AF130089 


Homo sapiens 


PRO2550 


172 


82 


4570 


AF225961 


Rattus 
norvegicus 


RhoGEF glutamate transport modulator 
GTRAP48 


149 


78 


4571 


W4835I 


Homo sapiens 


Human breast cancer related protein 

BCRB2. 


143 


64 


4572 


G02640 


Homo sapiens ^ 


Human secreted protein, SEQ ID NO: 
6721. 


157 


70 


4573 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


91 


73 


4574 


X92485 


Plasmodium 


pval 


111 


52 


4575 


U93567 


Homo sapiens 


p40 


151 


71 


4576 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


90 


57 


4577 


AC007508 


Arabidopsis 
thaliana 


F1K23.4 


143 


42 


4578 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


53 


4579 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


175 


73 




G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4419. 






4581 


AFT 3 0079 


Homo sapiens 


PR02852 


117 


70 


4582 


AF090930 


Homo sapiens 


PRO0478 


109 


57 


4583 


K96800 


Homo sapiens 


Human liistiocyte -secreted factor HSF. 


134 


79 


4584 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


149 


62 


4585 


AF 11 8090 


Homo sapiens 


PRO2044 


175 


97 


4586 


AL080243 


Homo sapiens 


E 1 A binding protein pSOO; match: 
proteins: Sw:Q09472 Sw:Q92793 
Sw:P45481 Wp:CE00571 
Wp:CE21I17 Tr:O01368 Wp:CE08856 
Wp:CE00570 Wp:CE08453 Tr:O44076 


144 


93 


4587 


¥85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


140 


62 


4588 


AJ131245 


Homo sapiens 


Sec24B protein 


131 


81 


4589 


W02105 


Homo sapiens 


Human L-asparaginase. 


559 


100 


4590 


L26953 


Homo sapiens 


chromosomal protein 


146 


71 


4591 


Ml 6976 


Glycine max 


N-75 


125 


36 
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SEQID 




Species 




Smith- 




NO: 










Identit 










Score 


y 


4592 


AFl 19851 


Homo sapiens 


PRO 1722 


118 


61 


4593 


AF090931 


Homo sapiens 


PRO0483 


99 


85 


4594 


M20789 


Homo sapiens 


alpha- 1 type I collagen 


103 


41 


4595 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


72 








4462. 






4596 


AF130089 


Homo sapiens 


PRO2550 


124 


71 


4597 


Ci004]6 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


65 








4497. 






4598 


AF 11 8082 


Homo sapiens 


PRO 1902 


122 


64 


4599 


AK023034 


Homo sapiens 


unnamed protein product 


156 


100 


4600 


AFl 18082 


Homo sapiens 


PRO1902 


148 


62 


4601 


AF119855 


Homo sapiens 


PRO 1847 


130 


50 


4602 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


77 








4484. 






4603 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


65 








7075. 






4604 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


135 


54 








gene 100 clone HNFIU96. 






4605 


ALl 60371 


Leishmania 


probable (hhv-6) ul 102, variant a 


123 


75 








DNA, complete virion genome 






4606 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


74 








4658. 






4607 


V00672 


Pan troglodytes 


reading frame protein 4 


194 


72 


4608 


X86791 


Sus scrofa 


beta-globin 


156 


87 


4609 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


130 


88 








clone HPMBQ32. 






4610 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


146 


90 






SEQ ID NO: 163. 






4611 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


68 








7870. 






4612 


AB029309 


Homo sapiens 


Npw38-binding protein NpwBP 


120 


38 


4613 


W48351 


Homo sapiens 


Human breast cancer related protein 


148 


53 






BCRB2. 






4614 


AK024372 


Homo sapiens 


unnamed protein product 


124 


63 


4615 


AC005360 


Homo sapiens 


FAA 


175 


91 


4616 


D38113 


Pan troglodytes 


ATPase subunit 6 


204 


69 


4617 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


122 


45 








oncoprotein. 






4618 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


81 








4478. 






4619 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


90 








4478. 






4620 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


50 








7124. 






4621 


S79410 


Mus musculus 


nuclear localization signal binding 


107 


45 








protein 






4622 


AF126163 


Homo sapiens 


HHLA3 protein 


124 


75 


4623 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


69 








5327. 






4624 


U38184 


Tr>'panosoma 
cruzi 


ATPase subunit 6 


102 


45 


4625 


U01849 


Tr>'panosoma 


ORFl 


97 


36 






brucei 








4626 


G03058 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


57 








7139. 






4627 


M33228 


Tr>panosoma 


ATPase 6 


126 


37 






brucei 








4628 


AFl 18086 


Homo sapiens 


PRO 1992 


113 


60 


4629 


AF041330- 


Bodo saltans 


NADH dehydrogenase subunit 5 


149 


41 



251 



wo 01/64835 PCT/llSOl/04927 



NO: 








Waterman 


Idcntit 

y 


4630 


X12544 


Homo sapiens 


3 HL A-DR B protein precursor (AA - 
29 to 267) 


108 


48 


4631 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


111 


46 


4632 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:]63. 


124 


60 


4633 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


151 


60 


4634 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


138 


100 


4635 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


135 


56 


4636 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


189 


100 


4637 


X67320 


Rattus 
norvegicus 


Hld-Histone 


122 


40 


4638 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


181 


76 


4639 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


96 


73 


4640 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


136 


54 


4641 


AP000616 


Orj'za sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


70 


4642 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


190 


56 


4643 


M29580 


Homo sapiens 


zinc finger protein 7 (ZFP7) 


103 


38 


4645 


U47741 


Homo sapiens 


CREB-binding protein 


207 


94 


4646 


AL035461 


Homo sapiens 


dJ967N2I.5 (novel MCM2/3/5 family 
member) 


213 


80 


4647 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7516. 


76 


60 


4648 


AF227948 


Homo sapiens 


HBV pX associated protein-S; XAP-8 


402 


86 


4649 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


143 


86 


4650 


U93574 


Homo sapiens 


putative pi 50 


121 


33 


4651 


A02739 


synthetic 
construct 


chloramphenicol acetyltransfisrase 


622 


99 


4652 


AF 190449 


Mus musculus 


putative transcription factor ALF-4 


274 


83 


4653 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 


108 


76 


4654 


X15187 


Homo sapiens 


precursor polypeptide (AA -21 to 782) 


763 


98 


4655 


AP000616 


Oo'za sativa 


similar to RING-H2 finger protein 

RnAla(AF078683) 


128 


91 


4656 


Y0S062 


Homo sapiens 


liuiiian PR0245 protein fragment 
derived from DNA35638. 


169 


77 


4657 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


135 


83 


4658 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


60 


4659 


AC004850 


Homo sapiens 


vacuolar assembly protein VPS41 
homolog (S53) 


187 


92 


4660 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


181 


92 




G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4446. 


138 


58 


4662 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


154 


78 


4663 


M69180 


Homo sapiens 


noimiuscle myosin heavy chain-A 


106 


84 


4664 


AF130089 


Homo sapiens 


PRO2550 


117 


65 


4665 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


91 



252 



w o 01/64835 



PCT/llSOl/04927 



SEQID 


Accession No. 


Species 


Description 


Smitii- 




NO: 








Waterman 
Score 


Identit 

y 


4666 


G03714 


Homo sapiens 


- 

Human secreted protein, SEQ ID NO; 


139 


62 









J'^'^^- 






... 

4667 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


111 


78 









RHAla (AF078683) 






4668 


AFl 13695 


Homo sapiens 


PRO 1365 




^1 




AF003540 


Homo sapiens 


Krueppel family 7inc finger protein 


1 1 1 


64 


4670 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


1j4 


49 








RHAla (AF078683) 






4671 


ALl 09658 


Homo sapiens 


dJ776F14.1 (orthologof mouse P47) 


110 


100 


4672 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


62 








.. 






467j 


AP000616 


.. 

Or>'za sativa 


similar to RJNG-H2 finger protein 


1j8 


60 








RHA 1 a (AF078683) 






4674 


X51591 


Homo sapiens 


beta-myosin heavy chain (1151 AA) 


442 


97 


4675 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


148 


76 








gene No. 101. 






4676 


D86971 


Homo sapiens 


no similarities to reported gene 


133 


100 








products 










Homo sapiens 


salivary proline-rich protein 1 


142 


... 


4678 


AF044205 


Gossypiinn 


proline-rich protein precursor 


117 


39 






hirsutum 








4679 


D38116 


Pan paniscus 


NADH dehydrogenase subunit4 


243 


98 


4680 


AFl 13685 


Homo sapiens 


PRO0974 


117 


74 


4681 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


193 


90 








SEQ ID NO; 163. 






4682 


L26163 


Mus musculus 


histone Hie 


111 


37 


4683 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


145 


78 








SEQ IDNO:163. 






4684 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


112 


53 








SEQ ID NO: 1 63. 






4685 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 


110 


60 








clone HNHF029. 






4686 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


72 














"4^687 — 


Pggjj^ 


Homosa iens 

omo sapiens 


NADH deh dro en esubun"t6 


l65 


Too — 


— 





Homo sapiens 


— T : — 

Amino acid sequence or 2l neterogenous 


"r^Q 


—— — 

100 








ribonucleotide protein. 






"4680 


Y00281 


"H 

omo sapiens 


aIa'tis^'^ 


— — 

— |- 


— — — 

4?^ — 


"dfion — 


AF2 1 065 1 


omo sapiens 








— — j 


^^^^^^ 


Homo sapiens 


— h i '■ 

c omosoma protem 


— — 


~ 

? 


"2^09 




omo sapiens 


c romosoma pro ein — 






^— 

-— 





_ omo s^ienb 


cytoc rome oxi ase subuni 


—-— 


~- — 

80 


469 


9 TS' 

L2742S 


Homo sapiens 


reverse tonscnptase 


144 


87 




Yi 624,_ 


Homo sapiens 


Hainan secreted protein HCHPFoSj 


165 


74 


— — 






SEQ IDlNO:i63. 








,nn4 

W29474 


Homo sapiens 


Human histone HI isofonn HIS-I. 


9j 




41 


^ y 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-N I'P 


126 


56 


4698 


S79410 


Mus musculus 


nuclear localization signal binding 


138 


62 








protein 






4699 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


54 








: -— — 






~4700 — 


■qq^j-^^ 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


"162 


— 








7795. 






4701 


AF003540 


Homo sapiens 


Krueppel family zinc finger protein 


112 


88 


4702 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


59 








4414. 






4703 


AF241242 


Mus musculus 


T-cell-specific T-box transcription 


294 


75 








factor T-bet 






4704 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


152 


80 



253 



SEQID 


Accession No. 


Species 


Description 


Smitli- 




NO: 








Watcrman 


Identit 




















SEQ ID NO: 163. 






4705 


AF038963 


Homo sapiens 


KNA helicase 


102 


70 


4706 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


148 


63 








RHAla(AF078683) 






4707 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


70 








6444. 






4708 


AF289099 


Maackia 


ENOD2f 


113 


42 


4709 


AF130087 


Homo sapiens 


PR02411 


124 


57 


4710 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


188 


86 








SEQ ID NO: 163. 






4711 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


86 








7124. 






4712 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


143 


78 








SEQ ID NO: 163. 






4713 


AF273133 


Ochotona 
macrotis 


NADH dehydrogenase subunit 4 


154 


76 


4714 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


48 








7534. 






4715 


Z18361 


Ovis aries 


trichohyalin 


120 


33 


4716 


AF130089 


Homo sapiens 


PRO2550 


152 


75 


4717 


AF119851 


Homo sapiens 


PRO 1722 


143 


60 


4718 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


145 


45 


4719 


K02401 


Homo sapiens 


chorionic somatomammotropin 


619 


96 


4720 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


114 


52 








SEQ ID NO: 163. 






4721 


J03756 


Homo sapiens 


hGlI-V2 


515 


74 


4722 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


194 


93 






SEQ ID NO: 163. 






4723 


G00985 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


128 


56 






5066. 






4724 


D49490 


Homo sapiens 


protein disulfide isomerase-related 


213 


55 








protein (PDIR) 






4725 


G03807 


Homo sapiens 


Human secreted protem, SEQ ID NO: 

7888. 


138 


66 




4726 




K02401 




Homo sapiens 


chorionic somatomammotropin 


657 




98 


4727 


D87459 


Homo sapiens 


Similar to Volbox carteri extensin 


162 


80 








(S22697) 






4728 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


99 


34 






brucei 


complete virion genome. 






4729 


AF090944 


Homo sapiens 


PRO0663 


151 


63 


4730 


M32305 


Human 


E 1/E4 fusion protein 


106 


40 






papillomavirus 












type 47 








4731 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


57 








4414. 






4732 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


76 








7795. 






4733 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


114 


43 






elegans 


binding protein KIO ClMID:g8148) 






4734 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


133 


72 








RHAIa (AF078683) 






4735 


M76546 


Helianthus 


hydroxyproline-rich protein 


113 


43 


4736 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


59 








6913. 






4737 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


47 








7867. 






4738 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


133 


73 



254 



wo 01/64835 PCT/llSOl/04927 



SEQ 10 
NO: 


Accession No. 


Species 


Oescripdcn 


Smith- 
Waterman 
Score 


Identit 

y 








RHAla(AF078683) 






4739 


AF044205 


Gossypium 
hirsututn _j 


proline-rich protein precursor 


105 


42 


4740 


S78854 


Oryctolagus 


alpha-tropomyosin 


159 


87 


4741 


G03789 


Homo sapiens 


HuinairseCT^dproteinrsWroNOr ~ 
7870. 


T51 




4742 


B01372 


Homo sapiens 


Neuron-associated protein. 


159 


85 


4743 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


122 


100 


4744 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


176 


82 


4745 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


154 


78 


4746 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


164 


80 


4747 


AP000616 


Oryza sativa 


similar to RIKG-H2 finger protein 
RHAla (AF078683) 


120 


100 


4748 


AL3 59782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rha la. 


117 


53 


4749 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


208 


93 


4750 


AF 155232 


Pisum sativum 


extensin 


172 


40 


4751 


AP000616 


Oryza sativa 


similar to SJNG-H2 finger protein 
lUIAla (AF078683) 


117 


76 


4752 


AF 130079 


Homo sapiens 


PR02852 


104 


70 


4753 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


154 


50 


4754 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8153. 


167 


50 


4755 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAIa(AF078683) 


138 


100 


4756 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


163 


66 


4757 


MH901 


Rattus 


protine-ricli salivary protein 


102 


45 


4758 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


242 


94 


4759 


AF046935 


Homo sapiens 


PCFl Ip homolog 


244 


94 


4760 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


165 


41 


4761 


Y30681 


Homo sapiens 


Splice variant ZAP-IB protein of the 
human tumor suppressor gene ZAP-1. 


115 


60 


4762 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


178 


84 


4763 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


134 


84 


4764 


R95913 


Homo sapiens 


Neural thread protein. 


136 


70 


4765 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


139 


69 


4766 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


102 


56 




U63159 


Mus musculus 


transaldolase 


218 


100 


4768 


AF230279 


Caenorliabditis 
elegans 


SW13-lilse protein; PSA-1 


106 


38 


4769 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
78S2. 


116 


56 


4770 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4497. 


136 


67 


4771 


X92485 


Plasmodium 


pval 


85 


64 



255 



/;/E:A^¥O01 84835 cpd 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith 
Waterman 


Identit 

y 






vivax 












Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 








AC002398 


Homo sapiens 








4774 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63 . 


110 


75 


4775 


AF120151 


Homo sapiens 


cytokine receptor-like molecule 9 


112 


100 


4776 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


74 




G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


75 


4778 


GOB 172 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

725j. 


117 


55 


4779 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 

isofomi 


101 






— 
AB007860 


Homo sapiens 


KIAA0400 






4781 


S58722 


Homo sapiens 


X-linlced retinopathy protein {C- 
terminal, clone XEH.8c} 


122 


8S 


4782 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

■ 


98 


84 






Homo sapiens 


unknown protein 






4784 


Y36203 


Homo sapiens 


Human secreted protein #75. 


350 


76 


4785 


K0240 1 


Homo sapiens 


chorionic somatomamniotropin 


ISS 


97 




G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 








G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 








M38451 


Homo sapiens 


growth hormone 






4789 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


146 


77 


4790 


K02401 


Homo sapiens 


chorionic somatomammotropin 


250 


96 




G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 






4792 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


125 


78 


4793 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


131 


51 


4794 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


57 


4795 


Ml 8095 


Phaseolus 
vulgaris 


hydrox^prorme-rich glycoprotein 


103 


34 


4796 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


131 


69 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 






4798 


AL132841 


Caenorhabditis 


Y15E3A.3 


120 


51 


4799 


AL009146 


Drosophila 
melanogaster 


alternatively spliced form 


79 


94 






Homo sapiens 


IK factor=cytokine down-regulating 
HLA class II 






4801 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


121 


100 


4802 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


94 


90 


4803 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


101 


90 



256 



wo 01/64835 PCT/llSOl/04927 



SEQ ID 




Species 


Description 


Smith- 




NO: 








Waterman 
Score 


Identit 

y 


4804 


R95913 


Homo sapiens 


Neural thread protein. 


139 


71 


4805 


X61045 


Hydra sp. 


mini-collagen 


126 


68 


4806 


X12544 


Homo sapiens 


3 HL A-DR B protein precursor (AA - 


114 


56 








29 to 267) 






4807 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


160 


84 








clone HPMBQ32. 






4808 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


277 


94 


4809 


L26953 


Homo sapiens 


chromosomal protein 


90 


60 


4810 


AP000616 


Or>^za sativa 


similar to RING-H2 finger protein 


108 


76 








RHAla(AF078683) 






4811 


L26953 


Homo sapiens 


chromosomal protein 


121 


62 


4812 


Y27907 


Homo sapiens 


Human secreted protein encoded by 


124 


71 








gene No. 1 19. 






4813 


G03787- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


46 








7868. 






4814 


W48351 


Homo sapiens 


Human breast cancer related protein 


100 


78 








BCRB2. 






4815 


AF152961 


Homo sapiens 


chromatin-specific transcription 


152 


96 








elongation factor FACT 140 kDa 












subunit 






4816 


U93569 


Homo sapiens 


putative pi 50 


334 


82 


4817 


AB014575 


Homo sapiens 


KIAA0675 protein 


125 


67 


4818 


G02532 


Homo sapiens 


Human secreted protein, SRQ ID NO: 


114 


90 








6613. 






4819 


G00354 


Homo sapiens 


Human secreted protein, SEQ DD NO: 


129 


45 








4435. 






4820 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


134 


48 






brucei 


protein rhala. 






4821 


W48351 • 


Homo sapiens 


Human breast cancer related protein 


~l29 


"61 








BCRB2. 






4822 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


137 


83 


4823 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


55 








6571. 






4824 


AFl 13685 


Homo sapiens 


PRO0974 


110 


54 


4825 


U63542 


Homo sapiens 


FAP protein 


136 


70 


4826 


X83413 


Human 


U88 


130 


54 






herpesvirus 6 








4827 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


76 








7878. 






4828 


U76609 


Homo sapiens 


ribosomal L5 protein 


134 


89 


4829 


W48351 


Homo sapiens 


Human breast cancer related protein 


110 


57 








BCRB2. 






4830 


AC006161 


Arabidopsis 


putative CENP-B/ARS-binding 


141 


31 






thaliana 


protein-like protein 






4831 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


71 








7637. 






4832 


U94189 


Rattus 


Duo 


158 


79 






norvegicus 








4833 


J04031 


Homo sapiens 


MDMCSF (EC 1.5.1.5; EC 3.5.4.9; EC 


193 


97 








6.3.4.3) 






4834 


Y13397 


Homo sapiens 


Amino acid sequence of protein 


654 


97 








PR0334. 






4835 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


53 








7519. 






4836 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


464 


93 








5327. 






4837 


AF272833 


Homo sapiens 




413 


87 


4838 


AF130089 


Homo sapiens 


PRO2550 


132 


71 


4839 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


54 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 

y 








4414. 








G00437 


Homo sapiens 


Human secreted protein, SEQ ID ISIO: 
4518. 








W77354 


Homosap.ens 


2 truncated protein. 






— 




Homo sapiens 


r JvUUo J / 









AF090944 


Homo sapiens 


PRO0663 




~ 


~5l44 — 




Homo sapiens 


Human secreted protein HCHPF685 
SEQ ID NO: 163. 


~~ 


"IT 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 








Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 










Homo sapiens 


cytochrome oxidase subunit II 






4848 


AB023485 


Mus musculus 


transcription fectorCAlSOb 


118 


42 






Caenorhabditis 
elegans 




98 


42 




AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein KIO (NID:g8148) 






4851 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


179 


92 


4852 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


145 

123 


82 
T7 


4853 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


4854 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


55 


4855 


U66464 


Homo sapiens 


hematopoietic progenitor kinase 


110 


81 


4856 


ACOl 1708 


Arabidopsis 
thaliana 


putative RING anc finger protein 


121 


43 


4857 


M81321 


Macaca 
fascicularis 


proline-rich protein 


126 


35 


4858 


U42471 


iVIus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


104 


40 


4859 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


195 


75 


4860 


X65165 


Volvox carteri 


extensin 


248 


55 






Flomo sapiens 


Human secreted protein, SEQ ID NO: 

7253, 






4862 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


185 


92 




Y86248 


omo sapiens 


SEQ ID NO' 1 63 


— 1 

142 


75 


"4864 


AP000616 


Ory'za sativa 


RHAla(AF078683) 


"T^7 


Too — 




Y86248 




Human secreted protein HCHPF68 
SEQ ID NO: 163. 








AB023229 




ivIAA1012 protein 






4867 


R96800 


Homo sapiens 


Human histiocytje-secreted factor HSF. 


144 


80 








chromosomal protein 






4869 


U63542 


Homo sapiens 


FAP protein 


134 


40 




JJod<Sj J 


C atharanthus 
roseus 


extensm 






4871 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


152 


69 


4872 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


136 


55 


4873 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


156 


84 
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NO: 








Waterman 


Identit 

y 


4874 


Y 17832 


Human 
endogenous 
retrovirus K 


env protein 


219 


67 


4875 


AF217517 


Homo sapiens 


uncharacterized bone marrow protein 
BM041 


81 


38 


4876 


L27428 


Homo sapiens 


reverse transcriptase 


105 


91 


4877 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


146 


46 


4878 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


99 


38 


4879 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


105 


77 


4880 


R58816 


Homo sapiens 


Human c-myc far upstream element 
(FUSE) binding protein (FBP)variant 
from PBL clone 3 1-10. 


87 


39 


4881 


G03258 


Homo sapiens 


Human secreted protem, SEQ ID NO: 

7339. 


150 


67 


4882 


AF 175223 


Drosophila 
melanogaster 


SANT domain protein SMRTER 


84 


89 


4883 


AF 174482 


Homo sapiens 


polycomb 3 


136 


57 


4884 


Y54324 


Homo sapiens 


Amino acid sequence of a human 
gastric cancer antigen protein. 


147 


66 


4885 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


141 


71 


4886 


U55376 


Caenorhabditis 
elegans 


F16H 11:2 gene product 


95 


100 


4887 


AF261918 


Homo sapiens 


disintegrin metalloproteinase with 
thrombospondin repeats 


262 


72 


4888 


J02459 


bacteriophage 
lambda 


K (tail component; 1 99) 


610 


100 


4889 


Y95435 


Homo sapiens 


Human calcium channel SOC- 
2/CRAC-l. 


219 


89 


4890 


B01372 


Homo sapiens 


Neuron-associated protein. 


134 


80 


4891 


X64712 


Gallus gallus 


collagen-alpha-3 type K 


112 


41 


4892 


K02401 


Homo sapiens 


chorionic somatomammotropin 


556 


87 


4893 


AF0I0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


108 


55 


4894 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


62 


4895 


L26953 


Homo sapiens 


chromosomal protein 


93 


81 


4896 


AJ277740 


Homo sapiens 


RPBllblbeta protein 


102 


66 


4897 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


145 


87 


4898 


AL359782 


Tri-panosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


131 


67 


4899 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


153 


64 


4900 


U90304 


Homo sapiens 


iroquois-class homeodomain protein 
IRX-2a 


109 


41 


4901 


D29956 


Homo sapiens 


This gene is similar to tre 
oncogene(X63547). 


120 


73 


4902 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


116 


70 


4903 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase protein. 


102 


82 


4904 


AF 119900 


Homo sapiens 


PR02822 


157 


57 


4905 


U93565 


Homo sapiens 


putative pi 50 


146 


55 


4906 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


120 


67 


4907 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


160 


76 


4908 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


136 


55 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 


Id en tit 

y 








SBQ ID NO: 163. 






4909 


AFl 55232 


Pisum sativum 


extensin 


98 


43 


4910 


U15647 


Mus musculus 


reverse transcriptase 


196 


50 




G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 








AF225419 


Homo sapiens 


HSCARG 






4913 


M76546 


Helianthus 


hydroxyproline-rich protein 


110 


47 


4914 


AL 117382 


Homo sapiens 


dJ881L22.2 (novel protein) 


312 


100 




AB037814 


Homo sapiens 


KIAA1393 protein 




68 


4916 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


169 


89 


4917 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


105 


90 




AP000616 


Oryza sadva 


similar to RING-H2 finger protein 
RHAla(AF078683) 


116 


86 


4919 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


132 


100 


4920 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


183 


65 


4921 


AF090944 


Homo sapiens 


PRO0663 


141 


57 




AF090931 


Homo sapiens 


PRO0483 


159 




4923 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


81 


4924 


U09500 


Homo sapiens 


C3^ochrome b 


276 


94 


4925 


AB002377 


Homo sapiens 


KIAA0379 protein 




98 


4926 


M22334 


Homo sapiens 


unknown protein 


201 


62 


4927 


AF0I0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


65 


4928 


AF130089 


Homo sapiens 


PRO2550 


116 


82 


4929 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


94 


41 


4930 


W75159 


Homo sapiens 


Human secreted protein encoded by 
gene 45 clone HTTDS54. 


294 


82 


4931 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


96 


74 


4932 


G02460 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


57 


4933 


Y0267I 

- 


Homo sapiens 

- 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


142 


70 


4934 


AF130087 


Homo sapiens 


PR0241 1 


137 


80 


4935 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


56 


4936 


,AB032604 


Mus musculus 


MIWI (piwi) 


313 


50 






Homo sapiens 


2'-5' oligoadenylate binding protein 






4938 


AF054658 


Homo sapiens 


kappa 1 immunoglobulin light chain 
variable region 


120 


87 


4939 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


134 


67 






Felis catus 


ribosomai protein SI 7 


118 


100 


4941 


AB028956 


Homo sapiens 


KIAA1033 protein 


527 


88 


4942 


G00429 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4510. 


281 


86 


4943 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


71 


4944 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


108 


95 


4945 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6571. 


170 


76 


4946 


AF090895 


Homo sapiens 


PROG 117 


125 


65 


4947 


D 13866 


Homo sapiens 


alpha-catenin 


575 


100 


4948 


L01664 


Homo sapiens 


lysophospholipase 


166 


96 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


Hcntit 

y 


4949 


AF 11 8085 


Homo sapiens 


PRO 1975 


572 


96 


4950 


AK024455 


Homo sapiens 


FLJ00047 protein 


95 


81 


4951 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


113 


58 


4952 


ABOl 1 142 


Homo sapiens 


BtJAA0570 protein 


120 


88 


4953 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


108 


39 


4954 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


175 


69 


4955 


AK024455 


Homo sapiens 


FLJ00047 protein 


119 


64 


4956 


AF116715 


Homo sapiens 


PR02829 


142 


75 




M3 1 520 


Homo sapiens 


ribosomal protein S24 


158 


100 


4958 


D38113 


Pan troglodytes 


ATPase subunit 8 


98 


64 


4959 


AF130089 


Homo sapiens 


PRO2550 


156 


75 




AP000616 


Oiyza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


136 


95 


4961 


S61950 


Oryctolagus 
cuniculus 


alpha 2(1) procollagen, alpha 2(1) {C- 

terminal} 


159 


93 


4962 


AF070540 


Homo sapiens 


putative nuclear protein 


513 


93 


4963 


S61950 


Orj'ctolagus 
cuniculus 


alpha 2(1) procollagen, alpha 2(1) {C- 

temiinal} 


159 


93 


4964 


AF146191 


Homo sapiens 


FRGl 


343 


92 


4965 


Y 16624 


Homo sapiens 


Human phosphoprotein 32 (pp32) 
protein sequence. 


114 


100 


4966 


X67813 


Canis familiaris 


signal recognition particle,72 kDa 
subunit 


127 


100 


4967 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


127 


63 


4968 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


117 


69 


4969 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


101 


80 


4970 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


173 


59 


4971 


D381 12 


Homo sapiens 


ATPase subunit 6 


1 12 


91 


4972 


Y 8 6248 


Homo sapiens 


Human secreted protein HCHPF68, 

SEQ ID KO : 1 63 . 


209 


97 


4973 


G00328 


Homo sapiens 


Human secreted protein. SBQ ID MO: 


162 


80 


4974 


AL050399 


Arabidopsis 


putative pro line-rich protein 


75 


91 


4975 


LI 1668 


Bos taurus 


cyclophilin-40 


134 


100 


4976 


M55169 


Homo sapiens 


tripeptidyl peptidase II 


385 


90 


4977 


AF090944 


Homo sapiens 


PRO0663 


154 


88 


4978 


G00388 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


102 


62 


4979 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


70 


4980 


G00721 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4802. 


104 


95 


4981 


R54656 


Homo sapiens 


Prostaglandin 12 production stimulating 
protein B. 


101 


100 


"4982 — 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


1 4 




4983 


AF273053 


Homo sapiens 


CTCL tumor antigen se89-l 


281 


96 


4984 


M15530 


Homo sapiens 


B-cell growth fector 


117 


58 


4985 


D13641 


Homo sapiens 


mitochondrial outer membrane protein 
19 


534 


94 


4986 


L11316 


IVtus musculus 


ect2 


528 


92 


4987 


X51755 


Homo sapiens 


open reading frame (458 AA) 


108 


90 


4988 


M61877 


Homo sapiens 


erythroid alpha spectrin 


129 


87 
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Species 








NO: 








Waterman 


Idcntit 










Score 


y 


4989 


X57316 


Saccharomyces 


carboxypeptidase s 


134 


50 






cerevisiae 








4990 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


78 








4419. 






4991 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


173 


61 








SEQ ID NO: 163. 






4992 


M34019 


Bos taums 


beta-adrenergic receptor Icmase 


172 


100 


4993 


L26953 


Homo sapiens 


cluomosomal protein 


135 


86 


4994 


X97675 


Homo sapiens 


plakophilin 2b 


124 


63 


4995 


Y36112 


Homo sapiens 


Extended human secreted protein 


153 


77 








sequence, SEQ ID NO. 497. 






4996 


AF130089 


Homo sapiens 


PRO2550 


175 


48 


4997 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


68 








6582. 






4998 


¥86214 


Homo sapiens 


Nuclear transport protein clone hfb341 


312 


98 








protein sequence. 






4999 


G03681 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


95 








7762. 






5000 


AF152961 


Homo sapiens 


chroraatin-specific transcription 


127 


96 








elongation factor FACT 140 kDa 












subunit 






5001 


U70063 


Homo sapiens 


acid ceramidase 


315 


96 


5002 


AF041373 


Rattus 


clathrin assembly protein short form 


163 


63 






norvegicus 








5003 


AK001841 


Homo sapiens 


umiamed protein product 


468 


83 


5004 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


61 








4414. 






5005 


S79410 


Mus musculus 


nuclear localization signal binding 


116 


47 








protein 






5006 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


55 








7214. 






5007 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


76 








4419. 






5008 


Z15005 


Homo sapiens 


CENP-E 


281 


84 


5009 


AF130089 


Homo sapiens 


PRO2550 


122 


75 


5010 


AL049698 


Homo sapiens 


dJ470B24.1.1 (myeloid/lymphoid or 


77 


33 








mixed-lineage leukemia (tritiiorax 












(Drosophila) homolog); translocated to, 












4 (AF-6) (isoform 1)) 






5011 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


62 








6463. 






5012 


AL023780 


Schizosaccharom 


DNA binding protein 


145 


36 






yces pombe 








5013 


R13556 


Homo sapiens 


Protein encoded downstream oflihc_M 


93 


46 








oncoprolein. 






5014 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


43 








7881. 






5015 


AF010144 


Homo sapiens 


neuronal tluead protein AD7c-NTP 


104 


55 


5016 


U15647 


Mus musculus 


expressed in select embryonal 


119 


35 








carcinoma cells and testicular cells; 












similar to Swiss-Prot Accession 












Number PI 1260 






5017 


B10545 


Homo sapiens 


Human aspartate protease psn-like4 


649 


98 








protein. 






5018 


AF089745 


Homo sapiens 


FK506-bind]ng protein 


656 


98 


5019 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


634 


97 


5020 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


603 


92 


5021 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


623 


94 


5022 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


671 


100 



262 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Score 


% 

Identit 

y 


5023 


AFl 30089 


Homo sapiens 


PRO2550 


109 


71 


5024 


AB013452 


Homo sapiens 


ATPasell 


208 


77 


5025 


R27654 


Homo sapiens 


Human calcium channel 27980/16. 


183 


100 


5026 


Y36156 


Homo sapiens 


Human secreted protein #28. 


114 


64 


5027 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


74 


5028 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4658. 


149 


59 


5029 


S74728 


Homo sapiens 


antiquitin=26g turgor protein homolog 


652 


98 


5030 


L41498 


Homo sapiens 


longation factor 1-alpha 1 


592 


87 




AB040672 


Homo sapiens 


UDP-GalNAc: polypeptide N- 
acetylgalactosaminyltransferase 


454 


70 


5032 


AJ22394S 


Homo sapiens 


RNA helicase 


574 


84 


5033 


AF021935 


Rattiis 
norvegicus 


mytonic dystrophy kinase-related 
Cdc42-binding kinase 


670 


97 


5034 


AFl 50755 


Mus musculus 


microtubule-actin crosslinking factor 


526 


80 


5035 


AFl 50755 


Mus musculus 


microtubule-actin crosslinking factor 


602 


89 


5036 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


598 


89 


5037 


U09823 


Oryctolagus 
cuniculus 


elongation factor 1 alpha 


708 


96 


5038 


AE003621 


Drosophila 
melanogaster 


CG13384 gene product 


279 


45 


5039 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


107 


68 


5040 


AF047469 


Homo sapiens 


arsenite translocating ATPase 


650 


94 


5041 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


109 


41 


5042 


AJ007798 


Homo sapiens 


stromal antigen 3, (STAG3) 


288 


100 


5043 


L26953 


Homo sapiens 


chromosomal protein 


124 


67 


5044 


AB005047 


Homo sapiens 


SH3 binding protein 


105 


88 


5045 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


143 


78 


5046 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


164 


83 


5047 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


121 


63 


5048 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


153 


64 


5049 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


120 


59 


5050 


Z25535 


Homo sapiens 


nuclear pore complex protein hnupl53 


115 


88 


5051 


J03176 


Bradyrhizobium 
japonicum 


cytochrome b/cl precursor 


97 


35 


5052 


X85995 


Homo sapiens 


immunoglobulin light chain 


200 


100 


5053 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


164 


76 


5054 


B01372 


Homo sapiens 


Neuron-associated protein. 


109 


80 


5055 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


209 


54 


5056 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


110 


51 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
7871. 


93 




5058 


X92474 


Homo sapiens 


ch-TOG 


147 


100 


5059 


L27428 


Homo sapiens 


reverse transcriptase 


264 


75 


5060 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


119 


44 


5061 


L25616 


Homo sapiens 


CGI protein ' 


119 


77 


5062 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


99 


62 



263 
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SEQID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


Identit 

y 








7871. 






5063 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


164 


59 


5064 


AF118082 


Homo sapiens 


PRO 1902 


175 


53 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


84 


51 




AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


108 


100 




AF 155232 






137 


45 


5068 


X63005 


Mus musculus 


proline-rich protein 


98 


35 


5069 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


307 


56 


5070 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


173 


89 


5071 


G03801 


Homo sapiens 


Human secj eted protein, SEQ ID NO: 
7882. 


153 


58 


5072 


Y86248 


Homo sapiens 


Human secreted protem HCHPF68, 
SEQ ID NO: 163. 


164 


73 


5073 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


109 


73 


5074 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


121 


80 


5075 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


113 


72 


5076 


U12690 


Homo sapiens 


cytochrome oxidase subunit 11 


151 


88 


5077 


AF090944 


Homo sapiens 


PRO0663 


138 


78 


5078 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWIB. 


150 


74 


5079 


X97675 


Homo sapiens 


plakophilin 2b 


179 


93 


5080 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


123 


76 


5081 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 
protein 1 


176 


59 


5082 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


100 


54 


5083 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


99 


59 


5084 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


101 


57 


5085 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


118 


77 


5086 


AB 047600 


Macaca 
fesciciilaris 


hypothetical protein 


93 


69 




G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4433. 


88 


80 


5088 


AF000298 


Caenorhabditis 
elegans 


weak sunilarity to coUagens; glycine- 
and proline-rich 


112 


57 


5089 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


61 


5090 


G04078 
- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


74 


61 


50 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 






5092 


AF065484 


Homo sapiens 


sorting nexin I A 


172 


81 


5093 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


139 


90 


5094 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


78 


5095 


U02313 


Mus musculus 


protein kinase 


147 


84 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 


IdenHt 

y 


5096 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


129 


67 


5097 


U64849 


Caenorhabditis 
elegans 


Contains similarity to Pfam domain: 
PF00646 (F-box), Score=28.7, E- 
value=4.3e-05, N=l 


197 


39 


5098 


AF006010 


Homo sapiens 


progestin induced protein 


174 


86 


5099 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7452. 


119 


55 


5100 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


132 


82 


5101 


G 00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


159 


77 


5102 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


99 


70 


5103 


J02459 


bacteriophage 
lambda 


J (taiI:host specificity; 1 132) 


485 


96 


5104 


J02459 


bacteriophage 
lambda 


li (taircoi^onent;853) 


823 


98 


5105 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein Kl 0 (NID:g8 148) 


106 


42 


5106 


AF130089 


Homo sapiens 


PRO2550 


121 


35 


5107 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


135 


81 


5108 


P92219 


Homo sapiens 
(human) 


CRl protein. 


123 


88 


5109 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


83 


78 


5110 


U41751 


Mus musculus 


EI24 


560 


83 


5111 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


150 


69 


5112 


AF130089 


Homo sapiens 


PRO2550 


164 


88 


5113 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7165. 


160 


72 


5114 


AF052831 


Tr>'panosoma 


unknown 


107 


70 


5115 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ull02, variant a dna, 
complete virion genome. 


128 


68 


5116 


X55684 


Lycopersicou 
esculentura 


extensin (class I) 


60 


29 


5117 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


124 


64 


5118 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


115 


60 


5119 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


79 


5120 


X65718 


Pninus dulcis 


extensin 


94 


33 


5121 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF07868j) 


132 


100 


5122 


AJ007714 


Homo sapiens 


lysine-ketoglutarate reductase 
/saccharopine dehydrogenase 


154 


96 


5123 


AJ007714 


Homo sapiens 


lysine-kctoglutaratc reductase 
/saccharopine dehydrogena.se 


158 


80 


5124 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


109 


44 


5125 


AF130089 


Homo sapiens 


PRO2550 


112 


46 


5126 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


129 


60 


5127 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


99 


68 
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SEQID 
NO: 








Score 


Identit 

y 


5128 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


143 


57 


5129 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


140 


55 


5130 


J02459 


bacteriophage 
lambda 


H (tail component;853) 


378 


100 


5131 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


123 


78 


5132 


J04088 


Homo sapiens 


DNA topoisomerase II (EC 5.99.1.3) 


163 


96 


5133 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


147 


85 


5134 


D90827 


Escherichia coli 


Serine/Threonine protein phosphatase 
(EC 3.1.3.16). 


329 


46 


5135 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


159 


62 


5136 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


161 


91 


5137 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


136 


96 


5138 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


129 


67 


5139 


G03797 


Homo sapiens 


Hiunan secreted protein, SEQ ID NO: 
7878. 


93 


31 


5140 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


141 


46 


5141 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


166 


67 


5142 


AF003535 


Homo sapiens 


ORf 2-like protein 


168 


50 


5143 


AF259792 


Homo sapiens 


SWIlLbeta protein 


684 


98 


5144 


Y36203 


Homo sapiens 


Human secreted protein #75. 


130 


63 


5145 


AF130089 


Homo sapiens 


PRO2550 


151 


76 


5146 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


128 


52 


5147 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


129 


85 


5148 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 

protein rhal a. 


101 


67 


5149 


W48351 


Homo sapiens 


Human breast cancer related protein 

BCRB2. 


118 


56 


5150 


AF130051 


Homo sapiens 


PRO0898 


80 


72 


5151 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


165 


63 


5152 


X56832 


Homo sapiens 


muscle specific enolase 


573 


90 


5153 


X76930 


Homo sapiens 


hepatocyte nuclear factor 4 


544 


91 


5154 


U12919 


Mus musculus 


adenylyl cyclase type VII 


156 


75 


5155 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


91 


76 


5156 


G 04072 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

8153. 


133 


50 


5157 


Y08387 


Homo sapiens 


mu-adaptin-related protein 2 


252 


86 


5158 


R13556 


Homo sapiens 


Protem aicoded downstream of hhc M 
oncoprotein. 


126 


66 


5159 


AF090942 


Homo sapiens 


PRO0657 


139 


77 


5160 


AF090942 


Homo sapiens 


PRO0657 


120 


70 


5161 


D67066 


Bos taurus 


N-WASP 


135 


42 


5162 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


115 


73 


5163 


AF044496 


Canis familiaris 


acidic ribosomal phosphoprotein PO 


112 


78 


5164 


W48351 


Homo sapiens 


Human breast cancer related protein 


106 


58 
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SEQ ID 
NO: 


Acce.i„„No. 


Species 


Description 


Smith- 
Waterman 
Score 


Idcjitit 

y 








BCRB2. 






5165 


AJ250095 


Bordetella 
bronchiseptica 


pertactin (P.68) 


119 


64 


5156 


U96411 


Mus musculus 


otogelin; MLEMP 


609 


90 


5167 


AFl 16238 


Homo sapiens 


pseudouridine synthase 1 


237 


87 


5168 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4457. 


151 


78 


5169 


AL050342 


Homo sapiens 


dJ655I<:7.1 (novel protein) 


263 


100 


5170 


AK024455 


Homo sapiens 


FLJ00047 protein 


177 


72 


5171 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


94 


100 


5172 


L38908 


tabacum 


extensin 


101 


47 


5173 


AF055904 


Myxococcus 
xanthus 


unknown 


107 


36 


5174 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


143 


59 


5175 


¥00337 


Homo sapiens 


Human secreted protein encoded by 
gene 81. 


123 


86 


5176 


AJ0I0957 


Hippopotamus 
amphibius 


NADHl 


212 


90 


5177 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7519. 


84 


76 


5178 


U83280 


Leishmania 
donovani 


3 9 kDa antigen 


128 


90 


5179 


AF025467 


Caenoriiabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein ICIO (NID:g8148) 


183 


45 


5180 


L27428 
AF025467 


Homo sapiens 
Caenorhabditis 
elegans 


reverse transcriptase 

contains similarity to drosophila DNA- 

binding protein KIO (NID:g8148) 


106 


4S 


5181 


144 


45 


5182 


R83119 


Homo sapiens 


Human cisplatin resistance protein. 


142 


96 


5183 


R95330 


Homo sapiens 


Tumor necrosis factor receptor 1 death 
domain iigand (clone 1 TU). 


475 


71 


5184 


AF090930 


Homo sapiens 


PRO0478 


121 


57 


5185 


AF193613 


Homo sapiens 


cell recognition molecule Caspr2 


338 


50 


5186 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


102 


36 


5187 


AL031118 


Homo sapiens 


dJ 153014.3 (novel C2H2 type Zinc 
Finger protein) 


465 


62 


5188 


AFl 83 144 


Mus musculus 


left-right dynein 


199 


83 


5189 


J02459 


bacteriopliage 
lambda 


J (tailrhost specificity; 1 132) 


748 


95 


5190 


AFl 936 1 3 


Homo sapiens 


cell recognition molecule Caspr2 


229 


50 


5191 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6595. 


135 


65 


5192 


AP000616 


Oryza sativa 


similai to RING-H2 finger protein 
RHAla(AF078683) 


120 


84 


5193 


Y00311 


Homo sapiens 


Human secreted protein encoded by 

gene 54. 


112 


49 


5194 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4414. 


126 


56 


5195 


AFl 04261 


Mus musculus 


Pax transcription activation domain 

interacting protein PTIP 


487 


72 


5196 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


132 


65 


5197 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


80 


5198 


X55777 


Homo sapiens 


52kD protein 


106 


30 


5199 


AF092170 


Homo sapiens 


polycystic kidney disease 2 related 


122 


91 
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Species 


Description 


Smith- 


% 


NO: 








Waterman 












Score 


y 








protein 






5200 


AF092170 


Homo sapiens 


polycystic kidney disease 2 related 


138 


96 








protein 






5201 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


107 


67 








7879. 






5202 


AP000616 


Oiyza sativa 


similar to RING-H2 finger protein 


1 10 


81 








RHA 1 a (AF078683) 






5203 


R95913 


Homo sapiens 


Neural thread protein. 


96 


61 


5204 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


83 


55 


5205 


AF3 19476 


Homo sapiens 


GKAP42 


705 


100 


5206 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


62 








6913. 






5207 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


119 


75 








4446. 






5208 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


50 








7871. 






5209 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


139 


39 








4770. 






5210 


AF090931 


Homo sapiens 


PRO0483 


94 


90 


5211 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


114 


74 








4438. 






5212 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


70 








5327. 






5213 


U53209 


Homo sapiens 


trans former-2 alpha 


124 


95 


52l4 " 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


58 








6566. 






5215 


Y 02671 


Homo sapiens 


Human secreted protein encoded by 


151 


49 








gene 22 clone HMSJWI8. 






5216 


L26953 


Homo sapiens 


chromosomal protein 


95 


52 


5217 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


66 






6613. 






5218 


AF220264 


Homo sapiens 


MOST- 1 


151 


65 


5219 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


53 








7214. 






5220 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


65 








4718. 






5221 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


66 








7888. 






5222 


API 94537 


Homo sapiens 


NAG 1 3 


120 


60 


5223 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


60 








4457. 






5224 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


144 


58 








encoded by gene 17. , 






5225 


AF000996 


Homo sapiens 


ubiquitous TPR motif, Y isoform 


164 


70 


5226 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


73 








4413. 






5227 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


164 


69 








7519. 






5228 


M34451 


Caenorhabditis 


collagen (rol-6) 


100 


41 






elegans 












Homo sapiens 


precursor polypeptide (AA -29 to 3 15) 






5230 


AL3 59782 


Tr>'panosoma 


possible (hhv-6) ul 102, variant a dna. 


112 


69 






brucei 


complete virion genome. 






5231 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


75 








7879. 






5232 


D86982 


Homo sapiens 


similar to human ankyrin 1 (SO 8275) 


201 


100 


5233 


AFl 16712 


Homo sapiens 


PR02738 


135 


49 


5234 


AL049758 


Homo sapiens 


dJ437M21.I (novel protein) 


184 


70 
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SEQID 
NO: 


Accession No. 


Species 


Descn-pfon 


Smifh- 

Watcrraan 

Score 


% 

Idcntit 


5235 


ABO 14527 


Homo sapiens 


KIAA0627 protein 


164 


100 


5236 


AF194537 


Homo sapiens 


NAG 13 


183 


64 


5237 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(lihcm) gene preoduct. 


170 


43 


5238 


AF135439 


JVIiis musciilus 


formin binding protein 1 1 


230 


100 


5239 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


69 


5240 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


168 


63 


5241 


AF116661 


Homo sapiens 


PROMS 8 


152 


58 


5242 
^243 


L27428 
AF 13 00 89 


Homo sapiens 
Homo sapiens 


reverse transcriptase 
PRO2550 


193 
141 


53 
60 


5244 


AF118086 


Homo sapiens 


PRO 1992 


135 


71 


5245 


AJ228863 


Mus musculus 


CLlP-115 


195 


97 


5246 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


148 


81 


5247 


AC007887 


Arabidopsis 
thaliana 


Fl 504.29 


76 


93 


5248 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


151 


75 


5249 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


176 


66 


5250 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


136 


70 


5251. 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


162 


80 


5252 

T253 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


112 


54 


AF090944 


Homo sapiens 


PRO0663 


177 


62 


5254 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

SEQ ID NO: 163. 


132 


57 


5255 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


129 


74 


5256 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


157 


91 


5257 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


160 


78 


5258 


U 16296 


Homo sapiens 


TIAMl protein 


133 


100 


5259 


AL 16037 1 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


92 


93 


5260 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


324 


78 


5261 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


50 


5262 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


217 


100 


5263 


AB047600 


Macaca 
fascicular is 


hypothetical protein 


125 


57 


5264 


U83246 


Homo sapiens 


copine I 


199 


86 


5265 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


150 


72 


5266 


U91329 


Homo sapiens 


kinesin-like motor protein BCIFIC 


132 


93 


5267 


J00672 


Oryctol^s 


beta-myosin 


109 


100 


5268 


AL451017 


Neurospora 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


74 


43 


5269 


AF119851 


Homo sapiens 


PRO 1722 


96 


61 


5270 


AF090930 


Homo sapiens 


PRO0478 


113 


91 


5271 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


123 


62 


5272 


B01372 


Homo sapiens 


Neuron-associated protein. 


122 


60 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Waterman 


Identit 

y 


5273 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


116 


78 


5274 


AF090931 


Homo sapiens 


PRO0483 


144 


67 


5275 


AL 1 60493 


Leishmania 


probable (hhv-6) uI102, variant a 
DNA, complete virion genome 


1 17 


68 


5276 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


111 


58 


5277 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


147 


67 


5278 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHA 1 a (AF07 8683) 


120 


67 


5279 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


69 


5280 


AF0I0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


196 


45 


5281 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 15 


73 


5282 


A18812 


Brassica napus 


extensin 


120 


38 






Homo sapiens 


Imogen 38 


291 


75 


52 S4 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


97 


37 


5285 


AF130089 


Homo sapiens 


PRO2550 


181 


73 


5286 


AF130089 


Homo sapiens 


PRO2550 


147 


67 


5287 


AF090942 


Homo sapiens 


PRO0657 


90 


50 


5288 


G03043 


Homo sapiens 
Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


116 


74 


5289 


G03714 


Human secreted protein, SEQ ID NO: 
7795. 


143 


64 


5290 


D86972 


Homo sapiens 


similar to E.coli hypothetical 29.6 KD 
protein (P 1 : YIGW_ECOLI) 


111 


100 


5291 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


145 


57 


5292 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


131 


65 




. - 
Y08061 


Homo sapiens 


Human c-myb protein fragment 


213 


76 


5294 


AF090895 


Homo sapiens 


PROG 117 


134 


52 


5295 


AF017635 


Homo sapiens 


DCHT 


145 


100 


5296 


G02558 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6639. 


136 


60 


5297 


AP000616 


Or>'za sativa 


similar to RING-H2 finger protein 
RH.A.la (AF078683) 


118 


67 


5298 


AK024455 


Homo sapiens 


FLJ00047 protein 


67 


80 


5299 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63 . 


164 


75 


5300 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


129 


55 


5301 


R 14643 


Homo sapiens 


Gamma interferon receptor. 


107 


82 




V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 




90 


5303 


Y08062 


Homo sapiens 


Human PR0245 protein fi-agment 
derived from DNA35638. 


205 


68 


5304 


AF130089 


Homo sapiens 


PRO2550 


100 


77 


5305 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


117 


44 


5306 


X92485 


Plasmodium 


pval 


128 


61 


5307 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


77 


57 


5308 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 140 
clone HPMBQ32. 


42 


5309 


Y91577 


Homo sapiens 


Human secreted protein sequence j 489 


78 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 


Iilertit 

y 








encoded by **ene 2 SEQ ID NO*250 






5310 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


65 


— 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


— — 

129 


65 


"3312 


AF119851 


Tl '' 


PRO 1722 


~97 


~65 


^sTs — 


G02987 


Homo sapiens 

omo sapiens 


7068. 


"l60 


~90 




G03043 




7124. 








AF07I3 10 










5316 


AF090942 


Homo sapiens 


PRO0657 


162 


70 


5317 


L83j03 


Homo sapiens 


lineal leveisetianscuptase 


109 


83 




AF090894 


Homo sapiens 








5319 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


129 


66 






G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7124. 










Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 






5322 


l^A^iot 


Homo sapiens 


reverse transcriptase 


139 


52 


5323 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 




1324 


AFl 1 8082 


"Homo sapiens 


r KU 1 yuz 


Tot 


— 


5325 


M81321 


Macaca 


proline-ricli protein 


114 


37 


5326 


G02435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


79 


5327 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


99 


72 




AF090930 


Homo sapiens 


PRO0478 






5329 


L27428 


Homo sapiens 


reverse transcriptase 


267 


65 




G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 








AL 160493 


Leislimania 


probable (hhiv-6) ull02, variant a 
DNA, complete virion genome 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 








G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7881. 








G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 






5335 


AP000060 


AeropjTum 


103aa long hypothetical protein 


71 


50 


5336 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


75 


5337 


Z93784 


Homo sapiens 


dJ398C22.1 (novel protein, orthologof 
mouse brain protein E46) 


280 


68 


5338 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


64 


5339 


AF210651 


Homo sapiens 


NAG18 


160 


56 


5340 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


80 


5341 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


151 


75 


5342 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


105 


50 


5343 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


103 


75 


5344 


G03021 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


65 
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SEQID 
NO: 


Accession No. 




Description 


Waterman 


% 

Idcntit 








7102. 












ID NO: 142. ^ ^ 








AL12189] 




dJl 187M17J2 (KIAA0552 protein) 






5347 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


33 


5348 


AF090944 


Homo sapiens 


PRO0663 


205 


72 






Homo sapiens 


Human secreted protein, SEQ ID NOi 

7492. 








AL359782 


brucei 


protein rhala. 










Homo sapiens 


orphan neurotransmitter transporter 
NTT5 






5352 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


105 


75 


5353 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


167 


69 


5354 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


97 


60 


5355 


G00590 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


80 


5356 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


123 


67 




W88609 


Homo sapiens 


Secreted protein encoded by gene 76 
clone HK1XL73. 






5358 


AC004450 


Arabidopsis 


putative extensin 


110 


37 


5359 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

tCrlAia {ArV laoaj) 


120 


79 


5360 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


199 


74 


5361 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO' 163 


174 


97 


5362 


ABO 1743 7 


Gailus galius 




98 


44 


5363 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


155 


93 


5364 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO' 163. 


181 


70 


5365 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

m, — F^Tfr f — ' 


159 


68 








, 

omo sapiens 


derived ''rotetn^#153°'^ agmen 


"04 


-— 


"5367 


X52022 


Homosa iens 


colla'^en'type VI alpha 3 chain 


T50 


-jx 


~5368 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


^20 


^77 




G00397 




Human secreted protein, SEQ ID NO; 

4478. 


















5371 


AF130079 


Homo sapiens 


PR02852 


103 


95 














5373 


AF210651 


Homo sapiens 


NAG18 


137 


67 


5j74 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 


106 


76 


5375 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


109 


70 


5376 


AL109819 


Arabidopsis 
thaliana 


extensin-like protein 


123 


34 


5377 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


127 


79 


5378 


AF036170 


Dictyostelium 


homeobox-containing protein Wariai 


93 


83 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smitli- 
Score 


Identit 

y 






discoideum 










AF130089 


Homo sapiens 


PRO2550 


134 


75 


5380 


AF161384 


Homo sapiens 


HSPC266 


663 


96 


5381 


AF130051 


Homo sapiens 


PRO0898 


147 


77 


5382 


AF090944 


Homo sapiens 


PRO0663 


1 18 


83 


.383 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214, 


129 


55 


5384 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


110 


68 


5385 
5386 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


165 


53 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


145 


96 


5387 


U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


108 


51 


5388 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


104 


89 


5389 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


70 


5390 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


103 


72 


5391 


AL359782 


Trypanosoma 


possible (hhv-6) ull02, variant a dna, 
complete virion genome. 


119 


85 


5392 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


120 


65 


5393 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


131 


78 


5394 


G01552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5633. 


159 


90 


5395 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


110 


65 


5396 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


85 


59 


5397 


X86681 


Homo sapiens 


HNP36 protein 


248 


71 


5398 


L27428 


Homo sapiens 


reverse transcriptase 


133 


62 


5399 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


132 


54 


5400 


Al,080196 


Homo sapiens 


hypothetical protein 


164 


96 


5401 


038112 


Homo sapiens 


NADH dehydrogenase subunit 5 


119 


75 


5402 


AF090944 


Homo sapiens 


PRO0663 


99 


64 


5403 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4433. 


93 


58 


5404 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


103 


37 


5405 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 
product CL6,5.40. 


137 


80 


5406 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


102 


69 


5407 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


102 


69 


5408 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


142 


69 


5409 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7519. 


139 


63 


5410 


Y12950 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


65 ■ 


47 


5411 


AF181626 


Drosophila 
melanogaster 


BCDNA.GH03694 


171 


53 


5412 


U93565 


Homo sapiens 


putative pl50 


195 


70 


5413 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


170 


64 
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SEQID 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idenfit 

y 


5414 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


126 


61 


5415 


X83572 


Homo sapiens 


ARSD 


133 


100 


5416 


D86853 


Catharanthus 


extensin 


113 


41 


5417 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


139 


60 


IS 

5418 


. .- - 


- 

Homo sapiens 


Human secreted protein, SEQ JDD NO: 


.... 

175 


77 


5419 


R32010 


Homo sapiens 


Rpl5^TIA4. 


140 


83 


5420 


D67066 


Bos taurus 


N-WASP 


153 








Homo sapiens 


myosin-DCb 






5422 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


116 


74 


5423 


AF220264 


Homo sapiens 


MOST-1 


94 


78 


5424 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


128 


63 


5425 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


131 


57 


5426 


AF130089 


Homo sapiens 


PRO2550 


120 


58 


5427 


Z70204 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF00628 (PHD-finger), Score=63.1, E- 
value=1.9e-15, N=l~cDNA EST 
EMBL:T00556 comes from this 
gene-cDNA EST yk344f7.5 comes 
from this gene 


131 


55 


5428 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


158 


51 


5429 


AF130089 


Homo sapiens 


PRO2550 


112 


73 


5430 
5431 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


161 


52 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


72 


5432 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


79 


5433 


AF090942 


Homo sapiens 


PRO0657 


109 


67 


5434 


AE003588 


Drosophila 
melanogaster 


CG2839 gene product 


141 


50 




G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4658. 


154 


79 


5436 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7882. 


128 


41 


5437 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


143 


63 


5438 


G03172 




Homo sapiens 


Human secreted protein, SEQ ID NO; 
7253. 


117 


84 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
7874. 








AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAIa(AF078683) 




42 




AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 






5442 


AL 160493 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


193 


56 


5443 


X07495 


Homo sapiens 


translated region (AA 1-264) 


108 


41 


5444 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7075. 


135 


62 


5445 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


130 


41 


5446 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


136 


71 
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SEQ ID 


Accession No. 




Description 






NO: 








Waterman 
Score 


Idenfit 

y 


•5447 


YD 1400 


Homo sapiens 


Secreted protein encoded by gene 18 


123 


62 








clone HNHF029. 






5448 


W50193 


Homo sapiens 


Amino acid sequence of salivary 


101 


45 








protein CON-2. 






5449 


U13866 


unidentified 


non-functional lacZ alpha peptide 


106 


57 






cloning vector 








5450 


X01469 


Plasmodium 


histidine-rich protein 


162 


56 






lophurae 








5451 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


88 








6561. 






5452 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


155 


68 








encoded by gene 17. 






5453 


AFl 19900 


Homo sapiens 


PR02822 


162 


63 


5454 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 


120 


51 








clone HNHF029. 






5455 


AC007204 


Homo sapiens 


BC273239 1 


1485 


69 


5456 


L27428 


Homo sapiens 


reverse transcriptase 


122 


41 


5457 


Ml 8094 


Phaseolus 


hydroxyproline-rich glycoprotein 


141 


39 






vulgaris 








5458 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


118 


68 








RHAla (AF078683) 






5459 


AF090930 


Homo sapiens 


PRO0478 


144 


64 


5460 


AF130089 


Homo sapiens 


PRO2550 


124 


60 


5461 


Q00377 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


135 


69 








4458. 






5462 


X97675 


Homo sapiens 


plakophilin 2b 


103 


72 


5463 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


69 








7253. 






5464 


AF229067 


Homo sapiens 


PADI-H protein 


146 


65 


5465 


AFl 16715 


Homo sapiens 


PR02829 


83 


71 


5466 


AF090930 


Homo sapiens 


PRO0478 


146 


52 


5467 


U93574 


Homo sapiens 


putative pi 50 


203 


52 


5468 


W48351 


Homo sapiens 


Human breast cancer related protein 


89 


50 








BCRB2. 






5469 


AF191838 


Homo sapiens 


TANK binding Jdnase TBKl 


121 


95 


5470 


AFl 30079 


Homo sapiens 


PR02852 


146 


69 


5471 


AF090944 


Homo sapiens 


PRO0663 


90 


66 


5472 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


106 


63 








7882. 






5473 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


127 


60 








SEQrDNO:163. 






5474 


AF064553 


Mus musculus 


NSDl protein 


269 


100 


5475 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


132 


65 








4463. 






5476 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


130 


86 








7253. 






5477 


U11271 


Homo sapiens 


thromboxane A2 receptor 


118 


67 


5478 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


143 


60 








6613. 






5479 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


75 








7253. 








R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


119 


74 


5481 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


103 


85 








RHAla (AF078683) 






5482 


AF238235 


Entamoeba 


diaphanous protein 


108 


35 






histolytica 






5483 


AF090930 


Homo sapiens 


PRO0478 


120 


55 


5484 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


111 


84 








SEQ ID NO: 163. 







275 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQID 


Accession No. 




Description 






NO: 








Waterman 


Identit 










Score 


y 


5485 


M61883 


Sus scrofa 


apomucin 


252 


55 


5486 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


77 








7124. 






5487 


G'03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


46 








7124. 






5488 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


62 








4414. 






5489 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


124 


47 






elegans 


binding protein KIO (NID;g8148) 






5490 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


220 


71 








SEQ ID NO: 163. 






5491 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


195 


65 








SEQ ID NO: 163. 






5492 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


116 


38 


5493 


AF118086 


Homo sapiens 


PRO 1992 


135 


82 


5494 


AF090930 


Homo sapiens 


PRO0478 


105 


74 


5495 


Y94451 


Homo sapiens 


Human inflammation associated 


170 


61 








protein #8. 






5496 


G03465 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


72 


65 








7546. 






5497 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


78 








4409. 






5498 


X56123 


Mus musculus 


talin 


594 


88 


5499 


AL035652 


Homo sapiens 


dJlJ6.1 (topoisomerase (DNA) I) 


105 


86 


5500 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


81 








4409. 






5501 


Y45389 


Homo sapiens 


Human secreted protein fragment 


161 


93 








encoded from gene 31. 






5502 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


60 








4414. 






5503 


X03145 


Homo sapiens 


pot. ORF I 


215 


79 


5504 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 


180 


61 








protein 1 






5505 


AL390212 


Homo sapiens 


ciJ29MI0B.I.2 (protein kinase C 


1282 


93 






binding protein I (isofonn2)) 






5506 


AF 11 8086 


Homo sapiens 


PRO 1992 


146 


61 


5507 


S58722 


Homo sapiens 


X-linlced retinopathy protein {C- 


116 


74 








terminal, clone XEH.Sc} 






5508 


U42471 


Mus musculus 


Wiscott-Aldricli Syndrome protein 


165 


50 








homolog 






5509 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 5 


211 


95 


5510 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


114 


69 








SEQ ID NO: 1 63 . 






5511 


AF090944 


Homo sapiens 


PRO0663 


125 


58 


5512 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


53 








7492. 






5513 


Y36156 


Homo sapiens 


Human secreted protein w2S. 


110 


68 


5514 


AF118086 


Homo sapiens 


PRO 1992 


128 


76 


5515 


Z22866 


Mus musculus 


skelemin 


218 


95 


5516 


Z22866 


Mus musculus 


skelemin 


218 


95 




G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


71 








4457. 






5518 


AL133380 


Homo sapiens 


dJ862P8.3 (Similar to MAP3K10 


603 


77 








(mitogen-activated protein kinase 












kinase kinase 10)) 






5519 


G03438 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO; 


187 


78 








7519. 






5520 


AF090901 


Homo sapiens 


PRO0195 


115 


60 
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SEQID 
NO: 

.. . 


Accession No. 
.... 


Species 


Description 
. 


Smitli- 

Waterman 

Score 


IdenHt 

y 


5521 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

obi.^ lU INU. lOJ. 


180 


53 




/ 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


J 55 




5523 


W90847 


Homo sapiens 


Human lymphocjrte targeted peptide 


149 


100 


5524 


X97675 


Homo sapiens 


plakophilin 2b 


153 


70 






Mus musculus 


synaptotagminII/IP4BP 






5526 


M32319 


Homo sapiens 


HL A protein allele B5 1 


754 


94 


5527 


AF130051 


Homo sapiens 


PRO0898 


93 


64 


5528 


X92485 


Plasmodium 


pval 


88 


66 


5529 


AJ131890 


Homo sapiens 


DNA polymerase lambda 


205 


100 


5530 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 

isoform 


132 


68 




Y 29 6 


— : 

Homo sapiens 


Fragment of human secreted protein 
encoded by gene 10 1 . 








AF090931 


Homo sapiens 


PRO0483 






5533 


AF090942 


Homo sapiens 


PRO0657 


122 


66 


5534 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


145 


56 


5535 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


569 


91 






Homo sapiens 


chorionic somatomammotropin 


375 


98 


5537 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


136 


68 


5538 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


44 


5539 


S62928 


Homo sapiens 


PRBIM protein precursor 


104 


40 


5540 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


70 


5541 


AE000463 


Escherichia coli 


putative glycosidase 


254 


100 


5542 


J02459 


bacteriophage 
lambda 


H (tail component;853) 


761 


100 


5543 


Y85062 


Homo sapiens 


Interleukin 1 convating enzyme 
homologue (ICEL) protein sequence. 


119 


58 






Homo sapiens 


putative monocarboxylate transporter 


233 


87 


5545 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


187 


65 


5546 


D87446 


Homo sapiens 


Similar to a C.elegans protein encoded 
in cosmid C27F2 (U40419) 


556 


84 






... ... ... 

Plasmodium 
vivax 


pval 








AB023233 


Homo sapiens 


KIA A 1016 protein 






5549 

■ 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


95 


64 


5550 


AC002044 


Homo sapiens 


Alpha-fetoprotein enliancer binding 
protein (3' partial) 


111 


38 


5551 


AF026802' 


Homo sapiens 


alpha-3 type IX collagen 


117 


47 




G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 




88 




G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 






5554 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243 . 


104 


71 


5555 


AF130089 


Homo sapiens 


PRO2550 


172 


86 


5556 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


105 


72 


5557 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


160 


61 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 


Wentit 

y 


5558 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


131 


68 




G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


112 


67 


5560 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


159 


73 


5561 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


141 


67 


5562 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


150 


71 




G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

6561. 




SO 




AF090942 


Homo sapiens 


PRO0657 






5565 


M64793 


Rattus 
norvegicus 


salivary proline-ricli protein 


169 


43 




AL035526 


Arabidopsis 
thaliana 


extensin-like protein 


103 


40 




AF 130089 


Homo sapiens 


PRO2550 


132 


67 


5568 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


83 


5569 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor 1 68 
kDa subunit 


153 


42 




AF068266 


Homo sapiens 


EHT protein 






5571 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


42 


5572 


W48351 


Homo sapiens 


Human breast cancer related protein 


112 


68 


5573 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


71 


5574 


L20450 


Mus musculus 


DNA-binding protein 


283 


61 


5575 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


114 


95 


5576 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


140 


71 


5577 


Y20648 


Homo sapiens 


Human neurofilament-L wild type 
protein fragment 38. 


66 


37 


5578 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


112 


41 


5579 


AL359782 


Trypanosoma 


possible (hIiv-6) ull02, variant a dna, 

complete virion genome. 


142 


74 


5580 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

j : 


142 


77 


5581 


AT71 ^^*=;n^ 


: 

Homo sapiens 


angiopoietin-related protein 






5582 


AF090930 


Homo sapiens 


PRO0478 


149 


77 








DNA-directed RNA polymerase 






5584 


M76546 


Heliantlius 


hydroxyproline-rich protein 


101 


51 


5585 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


66 


47 


5586 


R96800 


Homo sapiens 


Human iiistiocyte-secreted fector HSF. 


130 


72 


5587 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


148 


56 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

4438. 


148 


66 


5589 


AC006014 


Homo sapiens 


similar to KIAA0618 and nucleai- 
envelope protein POM 121; 
alternatively spliced product; similar to 
PID:g3327050andP52591 
(PID:gl709213) 


108 


84 


5590 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


122 


84 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








oncoprotein. 






5591 


AB001684 


Chlorella 
vulgaris 


ORF74 


61 


47 


5592 


AF090930 


Homo sapiens 


PRO0478 


123 


70 




G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 








W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 






5595 


L26953 


Homo sapiens 


chromosomal protein 


116 




5596 


L2695j 


Homo sapiens 


chromosomal protein 


115 


67 


5597 

- 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
661j. 


157 


55 


5598 


13 


Homo sapiens 


Human irritable bowel disease related 
polypeptide IMX39. 


1j1 


44 


5599 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


45 


5600 


AF 130089 


Homo sapiens 


PRO2550 


142 


62 




AYO 14404 


Homo sapiens 


kinesin-like protein RBKrN2 


297 


98 


5602 


AF2 10651 


Homo sapiens 


NAG 18 


11j 


''^ 




AF090931 


Homo sapiens 


PRO04S3 






5604 


X01060 


Homo sapiens 


put transferrin receptor (aa 1-760) 


226 


80 


5605 


LJ49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


455 


75 


5606 


M64793 


Rattus 


salivary proline-rich protein 


119 


42 


5607 


A3 1039 


Nicotiana alata 


PRP3 


92 


36 




AF090944 


Homo sapiens 


PRO0663 


147 


64 


5609 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


64 


46 


5610 


X01065 


Cairina moschata 


histone H5 


88 


40 






Dictyostelium 
discoideum 


calcium binding protein 


104 


40 


5612 


AL 160493 


Leishmania 
major 


probable (hhv-6) ull 02, variant a 
DNA, complete virion genome 


144 


37 


5613 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


127 


65 


61 


G 12 


: 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
5299. 






5615 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

_ . . . .. 


122 


68 


4fiT^ 


AB029309 




Homo Stipiens 


Npw3 S-bmding protein NpwBP 




78 














"5618 


AF090930 


Homo sapiens 


PRO0478 





~ 


5^9" ~ 


D88461 


Rattus rattus 


N-WASP 


107 


40 


5620 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 


104 


j8 


3621 


y^^^^^ 


Homo sapiens 


Human s^reted'protein #28 






5622 


AF090894 


Homo sapiens 


PRO0113 


119 


55 




AFI30089 


Homo sapiens 


PRO2550 


130 


69 


562^ 


AF130089 


Homo sapiens 


PRO2550 






5625 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


49 


5626 


AF130089 


Homo sapiens 


PRO2550 


152 


80 


5627 


AF151837 


Homo sapiens 


CGI-79 protein 


127 


78 


5628 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


122 


84 


5629 


AJ237981 


Vitis vinifera 


putative proline-rich cell wall protein 


93 


50 


5630 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6582. 


110 


52 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 


5631 


AF 13 0089 


Homo sapiens 


PRO2550 


151 


62 





U47661 


— 


proline-nch protein PRP2 precursor 




28 




Y08061 






93 




5634 


AL133215 


Homo sapiens 


bA108L7.6 (semaphorin 4G (sema 
domain, inrununoglobulin domain (Ig), 
transmembrane domain (TM) and short 


323 


100 


~5635 


G03438 


Homo sapiens 


Huim'rsecreted'^otein SEQ ID NO' — 
7519. 


138 


67 




AF090942 




PRO0657 


184 


67 


5637 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


80 


~5638 


G03043 


lio ' 

omo sapiens 


Tl — ' Td 1? — SEO ID NO- — 

^mnansecree pro in, Q 


139 


83 


5639 


AF 11 9900 


Homo sapiens 


PR02822 


143 ! 62 




AL390114 






151 


61 


5641 


Y36243 


Homo sapiens 


Human seaeted protein encoded by 
gene 20, 


118 


72 


5642 


AF090895 


Homo sapiens 


PRO0117 


140 


56 






Homo sapiens 


Human HSGTl protein. 


611 


96 


5644 n 


AF 162692 


Homo sapiens 


putative voltage-gated calcium channel 

ganirria-4 subunit 


172 


73 


5645 


Z98883 


Homo sapiens 


C407A10.1 (GPIl (N- 
acetylglucosaininyl transferase 
component)) _ — 


229 


100 


TfM 


G00403 


omo sapiens 


Humoii secreted protein, SEQ ID NO; 


117 

T59 


80 


"5647 — 


QQ-^-y'j2 


Homo sapiens 


Fhiman secreted protein SEQ ID NO" 

7253. 


78 


5648 






BMO4/ ™^ ^ 


97 


49 


5649 


AF2 10651 






149 


58 


5650 


AK000385 


Homo sapiens 


unnamed protein product 


125 


36 










143 


96 


5652 


AF090944 


Homo sapiens 


PRO0663 


95 


43 




Y86248 




Human secreted protein HCHPF68j 
SEQ ID NO: 163. 


131 


61 




Y86248 




Human secreted protein HCHPF68, 
SEQIDNO:163. 


100 


58 




G02480 




6561. ^ 


157 


70 




AP000516 


Oryza sativa 


RHAla (AF078683) ° ^ 


120 


100 




M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


123 


47 


5658 


W48351 


Homo sapiens 


BCRB2'"^^^* cancer related protein 


159 


46 


5659 


G03714 Homo sapiens 


Human secreted protein. SEQ ID NO: 


163 


60 


5660 


Y86248 Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


105 


46 


5661 


Y27868 1 Homo sapiens 

i 


Human secreted protein encoded by 
gene No. 107. 


125 


71 


5662 


D87744 


Mus musculus 


DRPLA protein 


89 


40 


5663 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


65 


5664 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


87 


65 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score™"" 


% 

Identit 

y 




AF055985 


volvulus 


pyirohdone-nch antigen 


















5667 


AF130089 


Homo sapiens 


PRO2550 


128 


74 


— 


AF090930 


Homo sapiens 


PRO0478 




54 








tlm protein 






5670 


AB001684 


Chlorella 
vulgaris ^ 


ORF49b 


70 


62 


6 1 


GO ' 9 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7879. 






5672 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


78 


5673 


AF090942 


Homo sapiens 


PRO0657 


150 


60 


5674 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


78 


5675 


M22332 


Homo sapiens 


unknown protein 


121 


43 


5676 


W40353 


Homo sapiens 


Human unspecified protein from 
US5702907. 


128 


72 


5677 


AL049868 


Homo sapiens 


dJ927M24.2 (KIAA1219) 


403 


95 


5678 


Uj1785 


Human 

papillomavirus 
type 36 


putative 


103 


44 


5679 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


73 


5680 


AF068229 


Homo sapiens 


lysyl hydroxylase 3 


148 


92 




Y00358 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27, 


209 


8i 




AB008515 


Homo sapiens 


RanBPIVI 


142 


100 


568j 


ABO 10692 


Arabidopsis 
thaliana 


contains similarity to endo-beta-N- 
acetylglucosaminidase—gene id:Kl 812 
3.27 


148 


45 






bacteriophage 
lambda 


J (taihhost specificity; 1 132) 








AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 






5686 


AB049054 


Homo sapiens 


brain link protein- 1 


160 


100 


^68^ 


AF229067 


Homo sapiens 


PADI-H protein 


144 


68 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 








APOO 1313 


Arabidopsis 
thaliana 


contains similarity to 
pherophorin~geneJd:T5M7.14 






5690 


M76976 


Drosophila 


dorsal- ventral patterning protein 


167 


■^"^ 


5691 


L24559 


Homo sapiens 


DNA polymerase alpha 


175 


100 




AF071777 


Mus musculus 








5693 


AF061741 


Homo sapiens 


retinal short-chain 
dehydrogenase/reductase retSDRl 


704 


99 


5694 
- 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6613. 


178 


66 




AF092092 


Homo sapiens 


AP-3 adaptor complex mu3 A subunit 






5696 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


86 


5697 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


132 


75 


5698 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


116 


40 


5699 


AF161361 


Homo sapiens 


HSPC098 


128 


56 


5700 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


106 


57 



281 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitb- 
Wateinian 


% 


5701 


AFl 19901 


Homo sapiens 


PR02831 


113 


76 


5702 


^Aonlf 


Homo sapiens 


salivary proline-rich protein 1 


101 


49 




G0j043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 








AF130089 


Homo sapiens 


PRO2550 


122 


75 


5705 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


72 


5706 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
PA^DI H rotein 


113 


75 


~5707 




Homosa iens 








5708 


W40353 


Homo sapiens 


Human unspecified protein from 


129 


65 


5709 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


137 


81 




AK024455 


Homo sapiens 


FLJ00047 protein 






5711 




Y76381 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 75. 


247 


93 




AFl 16715 


Homo sapiens 


PR02829 






5713 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


71 


5714 


G04078 


Hoiiio sapiens 


Human secreted protein, SEQ ID NO; 
8159. 


Ml 


79 




G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


91 




5716 


AF090944 


Homo sapiens 


PRO0663 


136 


56 


5717 


Q003 82 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


153 


65 


,., 

5718 


G03807 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


109 


70 


~57l9 — 




^ 

Homo sapiens 


PRO2550 ! 140 


~86 


5720 

— — 


G03043 




Homo sapiens 


Human secreted protein, SEQ ID NO: 144 
7124. 1 


67 


"^fSi — 


AFl J 00 89 


"Ef 

Homo sapiens 


PRO2550 


146 


-— 

-^1 




API 1 Q/^Q/; 


Homo sapiens 


PRO 1992 


130 




5723 


D21230 


Homo sapiens 


alpha I (XV) collagen chain 


390 


78 





AL049559 


Homo sapiens 


dJ37C10.5 {KIAA0445) 


266 


^ .^ .. 




X04412 


Homo sapiens 


plasma gelsolin 


761 


\zl 


5726 




synthetic 

construct 


serine protease inhibitor 


206 


100 


"572^ 


y^^g^Q 




paraplegin 


218 






AF132021 


"Homo sapiens 


myosin X 


613 


"17 


5729 


ALI63491 


Arabidopsis 


putative protein 


236 


46 


5730 


L19704 


Homo sapiens 


alternative first exon 


561 


86 








peroxisome assembly factor-2 


240 




5732 


AB007936 


Homo sapiens 


KIAA0467 protein 


189 


11 




L38622 


Mus musculus 


mSinSB gene product 


280 




5734 


AF051944 


Gallus gallus 


Xin 


460 


68 


5735 


B07857 


Homo sapiens 


Amino acid sequence of Smadl 
interactor protein clone Sl+12-5. 


172 


67 


5736 


Z14020 


Nicotiana 


Pistil extensin like protein, partial CDS 
only 


105 


44 


5737 


L17308 


Gossypium 
hirsutum 


proline-rich cell wall protein 


145 


34 


5738 


Y76141 


Homo sapiens 


Human secreted protein encoded by 
gene 18. 


109 


72 


5739 


AF178534 


Homo sapiens 


talin 


583 


82 


5740 


D13435 


Homo sapiens 


PIQ-F 


143 


100 


5741 


AC005578 


Homo sapiens 


F20887 1, partial CDS 


584 


86 



282 



wo 01/64835 PCT/llSOl/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 

y 


~5742 


AC005578 


H mosa iens 

Homo sapiens 


F20887 1, partial CDS 







3743 


AJO 10479 




kinesin-like protein — 


T?V7 




^744 


M19419 


MuTmusculus 


proine-nc sa varypr em — _ 


-— = 


— 


~5745 


L29377 




CaenorhabdWs 

ele^ans 


ysme-nc 


— 


44 


> 




omo sapiens 


3l d P — SFO ID NO- — 


— 


-— 

67 








^uman secrete proem, 






"5747 


D84223 


"h ^ 

H°m° ^a^'en^ 


-j 1 tT?NA fhpti 

eucy syntnetase 


— 


— 


T748 ' 




J04990 


omo sapiens 


ca epsm . 


— 5 


inn 

— ^ — 


-^Zzz 




Homo sapiens 


Human secreted protem #75. 


122 


11 


— 


V00488 


Homo sapiens 


alpha globin 


181 


94 


_57m^ 


M[8642 


Homo sapiens 


lactoferrin 


122 


96 


5732 


W15286 


Homo sapiens 


Soluble type I insulin-like growth 


481 


87 








factor receptor. 






5753 


Z98595 


Schizosaccharom 


coronin-like protein 


192 


38 




— — — — — 


yces pombe 


.. 






5754 


AL035683 


Homo sapiens 


aJ10o3B2.1 (Beta-1,4- 


2:>9 


100 










galactosyltransferase) 






5755 


G00j33 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


61 














; 

5756 




G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


61 








— - — — 






; 1 

5757 


i 

D49490 


Homo sapiens 


protein disulflde isomerase-relaled 


669 


96 


— 






protein (PDIR) 








^- 


■■ 

Homo sapiens 


Secreted protein encoded by gene 94 


175 


12 


— " 






clone HPMBQ32 








^^^^..^ — 


-r-j j 

us muscu us 


= — ^ — 


"oT^ 


100 





G03801 


omo sapiens 


7s!8o^ ^'^^ protein, SEQ ID NO: 




51 








u'ui /uu r\ 11 no = 






~57fil 


AL 160493 


.eisimania 


probable (hnv-6) ul 102, variant a 


— — 

130 


_ 

76 








DNA, complete virion genome 






"5762 


AF 130079 


^ 

omo sapiens 


PR02852 


-— 


-— 




— 


AF143946 


omo sapiens 


transcriptional activator SRCAP 


— 




~ 





M80627 


Homo sapiens 


iielix- loop-helix protem 


127 


88 





AF03 8007 


Homo sapiens 






47 


57 


G01881 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


427 


97 








5962. 






57 7 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


172 


75 









7870. 








B12884 


— : 

omo sapiens 


Human alplia-bradeion protein 


465 




70 








sequence. 






5769 


Y66645 


Homo sapiens 


Membrane-bound protein PRO 1310. 


347 


100 


~S^T 


AB033 1 68 


Mus musculus 


nuclear protein ZAP 1 1 5 


39 




X8j575 


ricetu us 


CHOI antigen 


113 


57 






^'^^'•'^ : 








-T— 


AF 135422 




GDP-mannose pyrophospliorylase A 


764 





3773 


L38644 




karyopherin beta 


121 
















3774 


AB020746 




protein kinase- like protein 


107 




33 


3775 


Y 17920 


Drosophila 


CALO protein 


128 








melanogaster 








5776 


AK000309 


Homo sapiens 


unnamed protein product 


988 


98 


5777 


AFO 10404 


Homo sapiens 


ALR 


469 


97 


5778 


AB045131 


Homo sapiens 


NADPH-dependent retinol 


126 


92 








dehydrogenase/reductase 






5779 


AF 197927 


Homo sapiens 


AF5q31 protein 


530 


95 


5780 


X79198 


Homo sapiens 


host cell factor 


119 


74 


5781 


X69838 


Homo sapiens 


G9a 


157 


41 



283 



SEQ ID 


Accession No. 


Species 


Description 


Smitli 


% 


NO: 








Waterman 


JdenHt 















80 

5782 


Y94450 


Homo sapiens 


, 

Human mflammation associated 


227 


44 








protein #7. 






^783 


AF216804 




a us^^ 


nuclear matrix transcription fector 






98 








norvegicus 










K0o4oj 


Homo sapiens 


— : — ^ — -j 

Derived protein oi clone ICA 1 o 


-— 

6j8 


... 

91 







~— ~ 


(ATCC 40553) 








AF000I49 


Mus muscu us 


ATP-binding cassette transporter 




-2^? 


41 


— 


D89974 


omo sapiens 






78 








protekfoPl'so'^"'^'''' 






— 
— 


AF090942 


— omo sapiens 


rKvJUO 3 / 




63 


— 




Homo sapiens 


23 kD highly basic protein 


- 


100 




G03 172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


14 


6d 








7253. 






5790 


2972 11 


Schjzosaccharom 


kinesin-like protein 


346 


60 







yces pombe 








5791 


AF134707 


Homo sapiens 


— ^. -. = 

drsmtegrm and metalloproteinase 


144 


95 








domain 19 






579 




— : 

Homo sapiens 


N-acetyllactosaminide beta-l,6-N- 


547 


76 








acetylglucos&niinyltransferase 






"570^ 




AF090930 


-fj -. 

omo sapiens 




.. 

—^5 


- 




■ 


Y59879 


omo sapiens 


Human nonnal uterus tissue derived 


407 


97 


— — 






protein 42. 








G03714 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO) 


10 

149 


72 

















AF077208 


omo sapiens 




— 

— 1= 


— — 

-IH. — - 


„q 

■ 




omo sapiens 


^^^'^ — A . .p^- 




— 




AFO 1 0 1 44 


omo sapiens 


neuronal t rea protein AD7c-NTP 


__i 


-Z^ — 


~V7QQ 

— — 


M29487 


omo sapiens 


integrin alpha subunit precursor 






5800 


D87438 


Homo sapiens 


Similar to a C.elegans protein in 


354 


85 








cosmid C14H10 








5801 




W54282 


— ; 

Homo sapiens 


Protein sequence of the di-alpha 


195 


97 






. 


haemoglobin gene contained in pSS 1 . — 






— «f\ 




— 

AB03017d 


Homo sapiens 


peptidylarginme deiminase type II 


1 


4t^ — ■ 




D3 8441 


Homo sapiens 


acylamino acid-releasing enzyme 






— sTvi 


Y55939 


Homo sapiens 


Human GEK2 protein. 


157 


96 


5805 


X89626 


Bos taurus 


240K protein of rod photoreceptor cng- 


92 


42 








channel 






06 




Homo sapiens 


similar to Kelch proteins; similar to 


177 


38 










BAA77027 (PID:g4650844) 






5807 


Y 86275 


Homo sapiens 


Human secreted protein HSRGWl 6, 


— — 

436 


_ 

77 









SEQ ID NO' 190 






"5808 




1vr ""Fi 


carboxylesterase precursor 


-— 


-— 

55 






auratas"^^ ^ 








5809 


AF272148 


Homosa iens 


KRAB zinc fin er rotem 


~593 


"68 


5810 


W57260 


Homo sa lens 


Human sana'^rinY ^"^ 


"651 


~Ofi 




AF202893 


Mus musculus 


j^^f2-ih ' 


IqI 


— 


5812 


AC006963 


Homosa iens 

omo sapiens 


similar to Kelch proteins- similar to 


"^19 


-5p 








BAA77027 (PID:g4650844) 






~58T3 


AB017114 


■■ 


"^Mo 


— — 


— 


"5814 — 


AC007954 


Homo sapiens 




"Its 


-j~ 


"5815 


AF124726 


Homo sapiens 


acinusL 


-— — 


-— 


5816. 


AB015330 


Homo sapiens 


HR1HFB2007 


167 


"36 


5817 


AB020684 


Homo sapiens 


KIAA0877 protein 


699 


57 


5818 


Z15025 


Homo sapiens 


Bat2 


172 


49 


5819 


X95761 


Mus musculus 


new:Rhobin 


549 


79 


5820 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


68 








7871. 






5821 


X55126 


Mus musculus 


ZQ)-29 


633 


93 


5822 


Y99366 


Homo sapiens 


Human PR01475 (UNQ746) amino 


452 


98 



284 



w o 01/64835 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 








acid sequence SEQ ID NO:88. 






5823 


AB 040801 


Homo sapiens 


SREB3 


611 


95 


5824 


M27878 


Homo sapiens 


DNA binding protein 


462 


64 


5825 


Y13620 


Homo sapiens 


BCL9 


140 


40 


5826 


AF152510 


Homo sapiens 


protocadherin gamma A3 short fonn 


525 


92 








protein 






5827 


M23451 


Achlya 


steroid receptor 


337 


61 






ambisexLialis 








5828 


AJO 11305 


Homo sapiens 


guanine nucleotide exciiange factor, 


132 


100 








delta subunit 






5829 


Y85565 


Homo sapiens 


Human homologue of lINC-53 (Hs- 


710 


96 








lJNC-53/2) sequence. 






58j0 


U03277 


Drosophila 


cloned by ability to arrest the cell cycle 


258 


46 






melanogaster 


when expressed in the fission yeast 












Schizosaccharomyces pom be 








AF 193 342 


Mus musculus 


GCN2alpha 


469 






AF279265 


Homo sapiens 


putative anion transporter 1 


240 


91 


££ 1 




Homo sapiens 


unnamed protein product 


687 


97 


5834 


AK023998 


Homo sapiens 


unnamed protein product 


674 


96 


5835 


AF07I059 


Mus musculus 


zinc finger RNA binding protein 


243 


100 


5836 


AF178534 


Homo sapiens 


talin 


463 


62 


5837 


J04569 


Homo sapiens 


glial fibrillary acidic protein 


102 


55 


5838 


AF068625 


Mus musculus 


DNA cytosine-5 methyltransferase 3A 


130 


89 


5839 


AB037739 


Homo sapiens 


K1AA13I8 protein 


644 


97 


5840 


R86865 


Homo sapiens 


Human protocadherin pc3 . 


650 


96 


5841 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


84 








7124. 






5842 


X51486 




Callus gallus 


cNFl-Al protein (AA 1-522) 


590 


86 


5843 


U65785 


Homo sapiens 


150 kDa oxygen-regulated protein 


200 


95 








own 50 






5844 


U16126 


Homo sapiens 


EAA4 


679 


97 


5845 


AL035071 


Homo sapiens 


dJl 085F1 7. i .3 (ortholog of mouse 


717 


100 








DNMT3B (DNA Cytosine-5 












Methyltransferase 3B3) (isoform 3)) 






5846 


Y13620 


Homo sapiens 


BCL9 


505 


81 


5847 


AC004812 


Homo sapiens 


similar to human HsGCNl U77700 


316 


100 








(PID:g2282576); similar to yeast 












translation activator GCNl 












(PID:g462168) 






5848 


M77246 


Rattus 


beta-chain clathrin associated protein 


611 


89 






norvegicus 


complex AP-2 






5849 


AF007833 


Homo sapiens 


kruppel-related zinc finger protein 


569 


79 








hcKrox 






5850 


AF056617 


Homo sapiens 


BWSCR2 associated zinc-fmger 


175 


32 








protein BAZl 






5851 


D38548 


Homo sapiens 


The ha0936 gene product is novel. 


409 


65 


5852 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


55 








4609. 






5853 


D 87071 


Homo sapiens 


similar to C.elegans protein encoded in 


188 


76 








cosmid T20D3 (Z68220). 






5854 


M64979 


Homo sapiens 


glial factor-1 


557 


83 




G01480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 












5561. 






5856 


X06345 


Xenopus laevis 


laminACAAl-265) 


109 


68 


5857 


AB017499 


Oryctolagus 


Lipoprotein Receptor Related Protein 5 


196 


40 






cuniculus 








5858 


D43633 


Oryzias latipes 


G protein-coupled seven- 


281 


50 








transmembrane receptor 






5859 


AF067I65 


Homo sapiens 


zinc finger protein 3 


669 


87 



285 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identit 













-1 


'^M) — " 


AL031 118 


, 

Homo sapiens 


dJ153G14 3 (novel C2H2 type Zinc 




j8 








Finger protein) 






"5861 


U13897 


T? alen 

omo sapiens 


^rot '^Iris'ifformT''^'^^ ^^^^ 


T30 











pro em, isuorm 








M74002 


omo sapiens 


arginine-nc nuc ear s'" 


-— r 





-pj-^ 


X67855 


Oryctolagus 


ramcaciumc anne 


Hi 









^"•^"'"^ . 








"5864 


^^g^^^ 


omo sapiens 


T—. r-j— 

pTt*^ anscriptiona ac or 


~dl^ 

-— 


— 

— ^ 


-FoZS 


AF251038 


onto sapiens 


}'^^^° :— 










^^^^ 


Mus jnusculus 


DNA binding protem Rc 


— -— 

409 




867 




Rattus sp. 


Na47Cl(-^-dependent neurotransmitter 


582 


95 








transporter 








-^lll 


X97675 


_ ; 

Homo sapiens 


plakophilin 2b 


5 

1 




93 






Homo sapiens 


protocadherin alpha 13 short fonn 


680 


100 














"sRvn — 




AFl 18082 


omo sapiens 


PRO 1 902 


— — 


— 


5871 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




149 


49 








_ _ _ _ _ 






5872 


. 

A61971 


... 

unidentified 


MCSP 


621 


82 


5873 


Z54206 


Bovine 


UL51 


104 


33 






herpesvirus 1 








5874 


AF069765 


Homo sapiens 


signal recognition particle 72 


674 


97 


5875 


AF 1 79428 


Homo sapiens 


DNA polymerase zeta catalytic subunit 


692 


100 


. . 






variant 1 






5876 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


119 


48 








gene No. 101. 






5877 


y94990 


Homo sapiens 


Human secreted protein vb21_l, SEQ 


446 


98 








ID NO'20 






— ^ 

5878 




L0881 1 




Drosophila 


adherin 


264 


42 






melanogaster 










— — "-— 

D86983 


Homo sapiens 


^ — 

suniiar to D. melanogaster 


429 


100 








peroxidasin(Ul 1 052) 








AFl 52508 


Homo sapiens 


protocadlierin gamma A2 short form 


664 


r ■ 

100 








protein 






5881 


AF152508 


Homo sapiens 


protocadherin gamma A2 short form 


660 


98 








protein 






— — 

— 




— j 

Mus musculus 


K-Cl cotransporter 


"Too 







AB023229 


Homo sapiens 


KI A A 1012 protein 




107 


83 


5884 


U53420 


Rattus 


sodium-calcium exchanger form 3 


595 


98 






norvegicus 








"5885 


AF039019 


Homo sapiens 


—. 3 . 

zmc ger m g pro em 


"Mr 

■ 


— 








? r-. J- 






— — 

5886 




AF2 07661 


— : 

Homo sapiens 


sodium bicarbonate cotransporter-uke 


.. 

701 


100 








protein 






"5887 


AF130051 


"H '■ 

omo sapiens 


PRO0898 


-— 


74 


-^ooo 


AFl 52508 


omo sapiens 


protocadherin gamma A2 short form 


— 


— 

0 








protein 






— 


A ^ 1 ri'7Q 


— : 

Homo sapiens 


heptacellular carcinoma novel gene-3 


— — 

741 


— 

98 









j-jj TTxi 






5890 


R13556 


: 

Homo sapiens 


Protem encoded downstream of hhc_M 




118 











oncoprotem. 






"5891 


AF157326 


Homo sapiens 


TIP 120 protein 


Iti 


"85 


5892 


AF129170 


Homo sapiens 


apolipoprotein E receptor 2 


630 


99 


5893 


X05561 


Homo sapiens 


alpha- 1 chain precursor (AA -27 to 


197 


45 








917) (2953 is 2nd base in codon) 






5894 


AJ238374 


Homo sapiens 


putative protein THl 


641 


96 


5895 


X63526 


Homo sapiens 


homologue to elongation factor 1- 


709 


97 








gamma from A.salina 






5896 


Y11651 


Homo sapiens 


phosphate cyclase 


521 


80 



286 



wo 01/64835 PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Waterman 


Identit 

y 


5897 


Y 18448 


Homo sapiens 


Bassoon protein 


681 


100 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 






5899 


AF 1 52473 


Homo sapiens 


protocadiierin alplia CI short form 
Py*^^^'" : — 


733 


100 


"5900 


QQ^^^Q 


Homo sapiens 


Human secreted protein, SbQ ID NO: 
7871. 


T^s 


"66 






Homo sapiens 


Derived protein of clone ICA13 
(ATCC 40553). 








X 12928 


aestivum 


HLviw giuTcnin suDunit d ^aa i-o^o) 










Hydra sp. 








5904 


AF118082 


Homo sapiens 


PRO 1 902 












A00279 


construct 










T14722 cdl 


Homo sapiens 


01 -DEC- 1992 Human CD27 antigen 
cDNA. 










Homo sapiens 


Human secreted protein clone ye7_l 
protein sequence SEQ ID NO: 172. 






5909~ 


^Yi27lT 


Homo sapiens 


vacuolar sorting protein 35 






Mus musculus 


Pro-Pol-dUTPase polyprotein 


125 


50 




G00360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4441. 






5911 


W88762 


Homo sapiens 


Polypeptide fragment encoded by gene 

.- 


152 


50 


5912 


G01984 


- 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


68 


-— 





. 

Homo sapiens 


~Tr~ 3 : r-— 

Human secreted protein, SEQ ID NO: 


209 


71 


~59T4 


AB046048 


"Macaca 

fascicularis 


unnamed Dortein product 


l43 


"49 


"ilri — 


S80644 


Sus scrofa 


A Tlfh-^H^^' pH V- 




-^1 






us muscu us 


" /^noTI'i — 







-Tqyj 


X52622 






-— ^- 


— 


~59T8 


U49973 


Homrsa^iens 


Ol^^ftinction unknown 


TT4 


"64 


5919 


S71403 


Mus musculus 


m33-A isoform 


120 


42 




AF09093 1 




PRO0483 






5921 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7874. 


99 


36 


""SQ79 


AJ006770 


■j^- r—. 


extensin 


96 





"5923 


AJ250193 


MusmuTculur 


muscle protein 637 


730 




5924 


AF229067 


Homo sapiens 


PADI-H protein 


278 


63 


5925 


X92485 


Plasmodium 


pval 


99 


50 


"5926 


Q 


H^m sa ien 

omo sapiens 


Human secreted protein, SEQ ID NO: 84 

7795. J 


"66 


^927 


AF130089 


Homo sa iens 


PRO2550 114 


"64 


5928 


J00126 


Homo sapiens 


MHC factor B {237 


100 




U49973 




ORF 1 ; MER37 ; putative transposase i 25 1 
similar to pogo element 




5930 


AF090895 


Homo sapiens 


PROG 117 


63 


62 




AF041330 




NADH dehydrogenase subunit 5 


147 




5932 


U83857 


Homo sapiens 


Aacll 


209 


80 


5933 


AF068294 


Homo sapiens 


HDCMB45P 


144 


50 


5934 


X92485 


Plasmodium 
vivax 


pval 


135 


69 


5935 


L07545 


Leishmania 
tarentolae 


A 'c' was inserted after nt 369 (=nt 
10459 iii genomic sequence (M10126)) 
to correct -1 frameshift probably due to 


127 


36 



287 
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SEQ ID 

NO: 




Species 


Description 


Waterman 
Score 


y 








gel compression 










Homo sapiens 


Neural thread protein. 






5937 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


130 


65 


— 


Y^^Qj^^ 


Honfio sapiens 


Human secreted protein (clone 
bhl57_7). 






59j9 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


167 


68 


5940 


U93570 


Homo sapiens 


putative pi 50 


103 


41 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7553. 






^qA^ ■- 


AF010144 


Homo sapiens 


neuronal thread protein ADTc-NTP 


110 


^1 


5943 




AL3901 14 


Leishmania 
jnajor ^ 


probable (hliv-6) ul 1 02, variLuit a 
DNA, complete virion genome 


12- 


90 




— 

AF130089 


Homo sapiens 


PRO2550 


., 

101 


48 


945 




Homo sapiens 


Human secreted protein, SEQ ID NO; 

6953. 






5946 


X92485 


Plasmodium 


pval 


104 


70 


5947 


M29622 


Mus musculus 


opep reading ftame 2 


132 


64 




AF130089 


Homo sapiens 


PRO2550 




60 


5949 


AF130089 


Homo sapiens 


PRO2550 






5950 


AF130079 


Homo sapiens 


PR02852 


117 


52 






Homo sapiens 


Human secreted protein, SEQ ID NO; 

7870. 










Homo sapiens 


0RF2: function unknown 






5953 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


145 


55 




M22332 


Homo sapiens 


unknown protein 


169 


41 


5955 




Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


133 


72 


5956 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


100 


36 


5957 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


63 


"5958 — 


AF0I0I44 — 


— : 

Homo sapiens 


~~ — iT — J r~- — A TAT xTTn 

neuronal thread protem AD7C''NTP 


Tl2 




5959 


Z49944 


Canis familiaris 


CDC42 GTP-binding protein 


120 


40 




AFl 1 8082 


Homo sapiens 


PRO 1902 






5961 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


65 


5962 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


62 


5963 


AF 11 8082 


Homo sapiens 


PRO 1902 


113 


60 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 






5965 


U49973 


Homo sapiens 


ORF 1; MER37; putative transposase 
similar to pogo element 


120 


57 


96 




G00637 


- ,.- _ .. 
Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 








G03224 


Homo sapiens 


Huinan secreted protein, SEQ ID NO: 
7305. 












6012. ^ 






5969 


AFl 19855 


Homo sapiens 


PR01847 


162 


72 


5970 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


146 


46 


5971 


M19155 


Plasmodium 
falciparum 


S-antigen precursor 


122 


37 


5972 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


121 


66 
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SEQID 
NO: 


Accession No. 




Description 


Smifh- 
Waterman 


Identit 

y 


5973 


AL 160371 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, contpleto virion genome 


107 


60 


5974 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


68 


5975 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


47 


5976 


U93572 


Homo sapiens 


p40 


112 


33 


5977 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


58 


5978 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


70 


5979 


AL 160493 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


106 


54 


5980 


M64792 


Rattus 


salivary proline-rich protein 


105 


31 


5981 


X66285 


Mus musculus 


HCIORF 


86 


36 




M64793 


norvegicus 


salivary proline-rich protein 


1 10 




5983 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


131 


60 


5984 


Y36 66 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 






5985 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


47 


"5986 


Y86248 




Homo sapiens 


Human secreted protein HCHPF68. 
SEQIDNO:163. 


Tso 




^ggg 


AF 130079 




PR02852 











Homo sapiCTS 






~M 


5989 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

oCV^ lU Iriyj. lOo, 


202 


89 


5990 


AF130089 


Homo sapiens 


PRO2550 


238 


74 




AFl 18082 


Homo sapiens 


PRO 1 902 






5992 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

J]^ . 


131 


60 


-jg^ 


^^^^^g 


"tJ = 

Homo sapiens 


reverse transcriptase 


~no 




5994 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


98 


45 


~5995 


Y86248 


— : 

Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 










Homo sapiens 


36/8-8 fiision protein ^^'ith epitope for 
anti-lectin antibody 








Y08061 










5998 


AP002460 


Arabidopsis 


gene_id:FlD9,26~unlaiown protein 


100 


38 


5999 


U93569 


Homo sapiens 


p40 


103 


52 


6000 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWIS. 


100 


63 


"6001 


W48351 


Homosa iens 

omo sapiens 


BCRB2. 


~V2i 


1\ 




G03789 




Human secreted protein, SEQ ID NO: 
7870. 






6003 






7214. ^ 






6004 


AF130089 


Homo sapiens 


PRO2550 


122 


63 


6005 


R95913 


Homo sapiens 


Neural thread protein. 


235 


56 


6006 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


173 


58 


6007 


X92485 


Plasmodium 


pval 


96 


53 
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SEQID 
NO: 


Accession No. 


Species 




Smith- 
Score 


% 

Identit 

y 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


236 


62 


6009 


Y17832 


Human 
endogenous 


env protein 


103 


66 


6010 


AF 119900 


Homo sapiens 


PR02822 


91 


50 


6011 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


155 


55 


6012 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


201 


91 






Homo sapiens 


Human secreted protein #28. 






6014 


AF 130079 


Homo sapiens 


PR02852 


119 


36 


6015 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


63 


6016 


AF 130089 


Homo sapiens 


PRO2550 


HO 


40 


6017 


AF010144 


Homo sapiens 


neuronal 4read protein AD7c-NTP 


142 


57 


6018 


AF 116661 


Homo sapiens 


PRO 143 8 


147 


69 


6019 


AF 130089 


Homo sapiens 


PRO2550 


169 


71 


6020 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7879. 


208 


60 


6021 


AFl 19851 


Homo sapiens 


PRO 1722 


147 


65 


6022 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


139 


47 


6023 


AF130089 


Homo sapiens 


PRO2550 


142 


66 


6024 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


91 


76 


6025 


Ml 8093 


Phaseolus 

\ailgaris 


hydroxyproline-rich glycoprotein 


103 


48 


6026 


D86853 


Catharanthus 


extensin 


97 


"46~ 


6027 


AFl 30089 


Homo sapiens 


PRO2550 


95 


47 


6028 


X92485 


Plasmodium 


pval 


102 


61 


6029 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


149 


53 


6030 


G00643 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4724. 


96 


76 


6031 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8159. 


144 


44 


6032 


AB011483 


Arabidopsis 


contains similarity to root cap 
protein~gene_id:MUF9. 1 6 


117 


37 


6033 


X92485 


Plasmodium 


pval 


85 


40 


6034 


AFl 18082 


Homo sapiens 


PRO 1902 


112 


58 


6035 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4458. 


121 


59 


6036 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


65 


6037 


G00352 


Homo sapiens 


Human secreted protein, SEQ K) NO: 
4433. 


107 


71 


6038 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


65 


6039 


X92485 


Plasmodium 


pval 


84 


78 


6040 


AF130051 


Homo sapiens 


PRO0898 


107 


69 


6041 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18.' 


113 


58 


6042 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7878. 


209 


75 
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SEQID 
NO: 


Accession No. 




Description 


Waterman 
_STOre 


Identit 


"6043 




H sale 

omo sapiens 


"F ent fh — an secreted rot " 

encodeTby geaeZJ ^^'^^ ^™ 




"56 


"6044 


G03789 


Homosa iens 

omo sapiens 


Human secreted protein^ SEQ ID NO" 


"126 




6045 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


65 


6046 


Y36156 


Homo sapiens 


Human secreted protein #28. 


93 


62 


6047 


AF130079 


Homo sapiens 


PR02852 


205 




6048 


R959I3 


Homo sapiens 


Neural thread protein. 


148 


45 




D381 12 










6050 


D38113 


PMUrogbdytei — 


"^^HXhydroge^ieiJE^^ 





~ 


6051 


Mil 901 


Rattus 


proline-rich salivary protein 


96 


42 


6052 


AF130089 


Homo sapiens 


PRO2550 


348 


73 


g^g^ 


AF119851 




PRO 1 722 


"i^^ 









Homo sapiens 

omosap.ens 


7874. 








AF161356 




HSPC093 






6056 




X92485 


Plasmodium 

: 




155 


55 




G04000 


Homo sapiens 


— : — 

Human secreted protein, SEQ ID NO: 


520 


100 


6058 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


65 


6059 


W036'^6 


Homo sapiens 


Human thyrotropin GPRN-tenninal 


163 


53 


"6060 


G02872 


Homo sapiens 


HumaTsecreted protein SFQIDNO- — 

6953. 


"300 


"68 















"6062 ~ 


AF090895 


Homo sapiens 


PROO 1 1 7 







6063 


X92485 


Plasmodium 


pval 


110 


86 


"6064 — 


AF130089 


Homo sapiens 


PRO2550 


"T66 


"71 


6065 


M22332 


Homo sapiens 


unknown protein 


188 


48 




Y08062 


Homo sapiens 


derived from DNA3 563 8. ^ 






— 


AFO 101 44 








^5 




GO 05'^ 8 


Homo sapiens 


HmnaTsecreted^ro^^^ — 
4609. 


"W^ 






M22332 










6070 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RJIAla (AF078683) 


109 


82 


6071 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


73 


"6072 — 


AF130079 


Homo sapiens 


PR02852 


"143 


"60 


6073 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


132 


55 


6074 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
n\ V- — opn in NO- — 


158 


65 


"6075 


G02532 


-rj ; 

omo sapiens 


^umansecree proem, Q 


— 


— 


6076 


AF116712 


Homo sapiens 


PR02738 


102 


41 




W48351 




BCRB2. 






6078 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


109 


60 


6079 


AF118082 


Homo sapiens 


PROI902 


196 


61 


6080 


AFl 18082 


Homo sapiens 


PRO1902 


165 


59 


6081 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


153 


66 
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SEQ m 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 


6082 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


188 


^1 


6083 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


109 


60 


6084 


V00672 


Pan troglodytes 


reading frame protein 4 


185 


85 




AF130051 




PRO0898 






6086 


AF 13 0079 


Homo sapiens 


PR02852 


96 


72 




D381 12 










6088 


AF090895 


Homo sapiens 


PRO0117 


110 


67 






maniculatus 


reverse transcriptase 










Homo sapiens 


Human secreted protein, SEQ ID NOi 
7284. 










Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 










Homo sapiens 


Human secreted protein, SEQ ID NOr 
4447. 






6093 


G03789 


Homo sapiens 


Human secreted protein, SHQ ID NO; 

_2^]^ : •j-^T 


176 


59 








— : 

omo sapiens 


Human secreted protein encoded by 




50 


"6095 


U70935 


Peromyscus 

maniculatus 


reTerse^t^^cr^^e^^^'^ 

reverse 


~2T6 


"44 


6096 


X71442 


Rattus 


ORF 1 , putative 


109 


53 


6097 


API 19855 


Homo sapiens 


PRO 1847 


88 


71 














6099 


AF090895 


Homo sapiens 


PRO0I17 


119 


61 




U52077 










6101 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


73 


6102 


M22332 


Homo sapiens 


unknown protein 


132 


44 




G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 






6104 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


64 


47 


~6l05 


M6479 1 


^^ttus 

norvegicus 


salivary roline rich rotein 

saivarypro e-nc protem 


Tr 


~ 




Y86248 




SEQ ID NO: 163. 










Homo sapiens 


4478. 








AF068294 




HDCMB45P 






"STSP — 


AFl 18086 


H°m° ^a^'^"^ 

omo sapiens 


PRO 1992 


~Wi 


~ 


"6110 


M63819 


asmo um 


malaria antigen 


-J- 


~ 


6111 


G03093 


Homefsapiens 


Human secreted protein, SEQ ID NOt 
7174. 


"T30 


~65 


61 12 


G03790 




Human secreted protein, SEQ ID NO' 
7871. 








AF119851 




PRO 1722 






6114 


AF229067 


Homo sapiens 


PADI-H protein 


134 


48 




AF010144 




neuronal thread protein AD7c-NTP 






6116 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


201 


80 


6117 


AF285567 


Microcebus 
griseorufus 


cytochrome b 


185 


58 


6118 


AL 13 8656 


Arabidopsis 
thaliana 


extensin precursor-like protein 


141 


28 


6119 


X92485 


Plasmodium 


pval 


165 


61 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


6120 


G03714 


Homo sapiens 


7795™ secreted protein, SEQ ID NO. 


164 


84 


6121 


M13100 


Rattus 


unknown protein 


176 


49 


Tl22 


QQg^gg 


Homo°^Peni 


Human secreted protein, SEQ ID NO; 

7214. 


T77 


~ 




G02485 




6566. 






6124 


AF 130089 




PRO2550 


-||? 






G00637 


Homo sapiens 


Human secreted protein, SEQ ID NOc 

4718. 




~ 




AF 144054 










6127 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


70 


6128 


U49973 


Homo sapiens 


ORF I; MER37; putative transposase 


236 


77 


6129 J 


U93564 


Homo sapiens 


putative pi 50 


152 


53 


6130 


Y02999 


Homo sapiens 


enroded'by^gene'or'^'^^'^'^ protein 


129 


57 


6131 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


56 


-qY22 — 


G04072 


Homo sapiens 


ftunan secreted protein SEQ ID NO' 
8153. 


T95 


"67 




W50192 


Homo sapiens 


protein CON-1. 








TAIC000496 


"^smo^dtoi"^ 


unnamed pi otem product 


tII 


— 


irns — 






pva 








-|||^ — 


^36203 














Homo sapiens 


Human secrSd'protein'^S^ ID NO' — 
7867. 


-~ 


"TE 


6138 


Y02693 


Homo sapiens 


gene 44 clone HTDAD22. 


106 


36 


"6T39 — 


AF130051 


Homo a ens 

Homo sapiens 


PRO0898 


— 


~fS 


6140 


G03793 


m sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


"2^2 


^73 


6141 


AF119851 


Homo sapiens 


PRO 1722 


107 


63 


6142 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7867. 




57 


6143 


R95913 


Homo sapiens 


Neural thread protein. 


114 


70 


6144 


AF068294 


Homo sapiens 


HDCMB45P 


123 


56 


6145 


AFl 18082 


Homo sapiens 


PRO 1902 


113 


32 


6146 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


105 


42 


6147 


Y30731 


Homo sapiens 


Amino acid sequence of a human 


938 


100 


6148 


U93574 


Homo sapiens 


putative pi 50 


99 


47 


6149 


AF090894 




PROOl 13 






6150 


AF130079 


Homo sapiens 


PR02852 


125 


49 








7214, 










Homo sapiens 


7305. ^ 




60 


6153 


Y01398 


Homo sapiens 


Secreted protein encoded by gene 16 
clone HNGFR75. 


90 


82 


6154 


X92485 


Plasmodium 




110 


42 


6155 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


118 


48 


6156 


GO 1495 


Homo sapiens 


Human secreted protein, SEQ ED NO: ! 170 


89 
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SEQID 
NO: 


Accession NO. 


Species 


Description 


Smith- 
Score 


Wentit 














6157 


X92485 


Plasmodium 


pval 


157 


52 


6158 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


108 


58 






Cotumix 
japonica 


Cytochrome oxidase subunit 1 








AFl 1 8082 


Homo sapiens 


PRO 1902 






6161 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


154 


64 


6162 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


107 


37 


6163 


M22332 


Homo sapiens 


unknown protein 


125 




6164 


W48351 


Homo sapiens 


Human breast cancer related protein 






6165 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


235 


64 


6166 


G006S9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


36 


6167 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


139 


59 


6168 


D38il2 


Homo sapiens 


cytochrome c oxidase subunit 3 


562 


85 






Pan troglodytes 


NADH dehydrogenase subunit 4 




85 


6170 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
661j. 




G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7867. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 






6173 _ 


K03206 


Homo sapiens 


salivary proline-rich protein precursor 




36 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 








U935_69 _ 


Homo sapiens 


p40 


192 


40 






Nicotiana 
sylvestris 


extensui 








Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 






6178 


AFl 19855 


Homo sapiens 


PRO 1847 


114 


43 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3, 


98 








Homo sapiens 


pot. unidentified reading frame 


112 




6181 


1J49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


204 


76 


6182 


AL3 59782 ■ 


Trypanosoma 


possible (hlw-6)ull02, variant a dna, 110 
complete virion genome. 


73 


""eiss — 


Y)2SU2 


Homo sapiens 


NADH dehydrogenase subunit 3 240 


~88 


6184 


Y27868 


Homo sapiens 


Human secreted protein encoded by ■ 1 02 
gene No. 107. 


47 


6185 


X86031 


Sus scrofa 


homologueof proline/arginine-rich 99 
antibacterial protein ! 


49 


6186 


Y36156 


Homo sapiens 


Human secreted protein *28. [ 196 


75 


"lill — 


AF169387 


Mus musculus 


alphas collagen IV 


111 


38 




AF194537 




NAG13 


138 




6189 


X91836 


Vigna 
unguiculata 


extensin class 1 protein 


167 


33 


6190 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. ■ 


175 


63 


6191 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


123 


46 


6192 


X92485 


Plasmodium 


pval 


115 


54 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 

_5££r? 


% 

Identit 








"vivax 








6193 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


345 


65 


6194 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


82 


54 


6195 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


214 


56 


6196 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


87 


79 


6197 


Y08062 


Homo sapiens 


derived from DNA35638. 


118 


41 


"6198 


-^^269] 


Homo sapiens 


S24 


~215 


T3 


6199 


G02872 


Homo sapiens 


Huniian secreted protein, SEQ ID NO* 
6953. 


100 


46 


6200 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NOi 
7075. 


~H3 


^73 














6202 


U93572 


Homo sapiens 


putative pi 50 


99 


63 


6203 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


109 


45 


"6204 


AFO 10144 


lio len 


— — . • AD7 NTP 


163 


39 


"6205 




Homo sapiens 


B^^UgrowtlTfactor^'" ~ 






6206 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


50 


6207 


W88627 


Homo sapiens 


Secreted protein eucoded by gene 94 
clone HPMBQ32. 


181 


72 


6208 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


75 




AF010144 






r6'9 


"81 


6210 


Y36203 


Homo sapiens 


Human secreted protein #75. 


6211 


G0j790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




52 


6212 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


48 


~62l3 


D86854 


Catharanthus 

roseus 


extensin 


^20 


— 








6835. ^ ' ^ 








W33698 










6216 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


62 


6217 


U40342 


Mus musculus 


ninein 


77 


72 




G00637 




4718. ^ 








U49973 




ORFl ; MER37^ putative transposase 
similar to pogo element 






6220 


X92485 


viVaK^'^'"'" 


pval 


107 


46 


6221 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


40 


6222 


X86029 


Vigna 


extensin-Iike protein 


140 


35 


6223 


G03790 


Homosap^ns 


Human secreted protein, SEQ ID NO: 
7871. 


216 


"59 


6224 


AF068294 


Homo sapiens 


HDCMB45P 


164 


60 


6225 


Y50797 


Homo sapiens 


Human calmodulin/NEF protein 
binding inhibitor 2. 


135 


50 


6226 


AE003859 


Xylella 
fastidiosa 


hypothetical protein 


139 


41 


6227 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


58 
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SEQ ID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 


Identit 

y 














6228 


AF130089 


Homo sapiens 


PR0255Q 


126 


43 




Y86248 




SEQIDNO:163^ 








AP000616 




RHA]a(AF078683) 






6231 


Y86248 


Homo sapiens 


SEQ ID NO' 163 


174 


79 


6232 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO ■ 1 63 


168 


78 


6233 


U49973 




Homo sapiens 


ORFl; MEBG7; putative transposase 


164 


67 


"6234 




— -. 

omo sapiens 


IfamMi^secre^ed^prote^ sequence 

encoded by gene 2 SEQ ID NO'250 


-—3 


— 


"6235 


AF113685 


"Homo sapiens 


PRO0974 


Ti4 


~46 


6236 


C03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


40 


"6237 


AL096844 


Streptomyces 

coelicolor A3(2) 


"putative integral membrane protein 


"88 


— 





AFl 30079 






^65 






AF1300S9 


'Homo sapiens — 


lPRO2550^ 




~ 


6240 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


212 


69 


6241 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
1^64^° 


102 


54 


-~Z242 


AFl 09907 


^ 

omo sapiens 




~r- — 


"Ofi 


"6243 


Y02671 


omo sapiens 


Human secreted protein encoded by 

gene 22 clone HMSJWI 8. 


1r5 


-J- 


6244 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


52 




G03790 




7871. 






6246 


AF130089 




PRO2550 






6247 


AF0iai44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


91 


40 


"§§35 — 




Homo sapiens 










AFl 16661 




PRO 143 8 


Tn 




6250 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


68 


~6251 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 







6252 


Y64890 


Homo sapiens 


ID NO: 1 05 1 


106 


42 


6253 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


247 


68 




Y86248 


omo sapiens 


SEQ ID NO: 163. 


117 


75 


5255 


V00662 


Homo sapiens 


cytochrome oxidase I 


355 


78 


6256 


Y02999 




encoded by gene 121 . 


131 






D381 12 










6258 


AF061340 


Artibeus 


cytochrome c oxidase subunit III 


308 


63 


~6259 


AFO 16099 


Musmusculus 


endonuclease/reverse transcriptase 


~TT4 


"4^ 


6260 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


179 


78 


6261 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


192 


75 


6262 


G03714 


Homo sapiens 


Human secreted protein, SBQ ID NO: 

7795. 


146 


71 


6263 


U09116 


Homo sapiens 


ORF 1 , encodes a 40 kDa product 


114 


37 


6264 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7344. 


88 


60 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


Identit 

y 





U49974 


Homo sapiens 











Y0S062 


omo sapiens 


Hu^n^]^)245 pxrtein fiagment 

derived from DNA35638. 




72 


6267 


AF097473 


Mus musculus 


^^^^^■^ 


"86 


"40 




X70343 


Nicotiana 

sylvestris 




"95 


"33 


6269 


W48351 


Homo sapiens 


BCRB2. 


94 


84 


6270 


L27428 


Homo sapiens 


reverse transcriptase 


132 


48 


6271 


AF 11 8082 


Homo sapiens 


PRO 1902 


"M^ — 


53 


6272 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 






6273 


X03145 




pot. ORF V 





"35 


6274 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


90 


71 


6275 


U93572 


Homo sapiens 


p40 


83 


43 




AF058700 










6277 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


96 


65 


"55^5 — 


AF 13 0079 


Bl'^mo sa'^ien^ 


PR02852 









AF161356 


Rattuis ^^^^'^"^ 


HSPC093 


~TM 




"ffiso — 




norvegicus 


PR-Vbetal 


"To9 


-^y 


6281 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


IjO 


58 


"6282 — 


J 


Tl — 0 "e 

omo sapiens 


TT — ' Td te' — SEO ID NO- — 

74^^^^"*^ 


-— 


— 


6283 


AF068294 


Homo sapiens 


HDCMB45P 


98 


37 




AFl 16713 


Pla^od'^m"^ 


PR02829 







— 


X92485 


vivax 




'W' 




6286 


U93570 


Homo sa'^iens 




^^55 





g^gg 


Y36203 




Human secreted r te" #75 










Homo sapiens 


se"eiioprote^nP '■ 


~220 


"65 


6289 


X92485 


Plasmodium 


pval 


125 


70 


"6290 — 


-^g^gg^ 


Mu^'^ usculu 

RatLT"^^" 


^ ^ 


"TTi 


— 


6291 


Ml 1901 


norvegicus 


rolfne-Tch al" P 

pr me-ric s ivaryproem 


^94 




6292 


AFl 16661 


Homo sapiens 


PRO 143 8 


144 


46 




L27428 











6294 


G00397 


Homo sapiens 


Human ^creted^ITOtei^ SEQ ID NO- — 

4478. 




~ 


6295 


AF130089 




PRO2550 





"ll 


6296 


L27428 


Homo sapiens 












Homo sa iens 


Neuron associated^protein 


T34 


~m 


6298 


G02872 


Plomo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


110 


Is — 


6299 


AF130089 




PRO2550 


110 




6300 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


136 


63 


6301 


AF220264 


Homo sapiens 


MOST-1 


172 


90 


6302 


AF130079 




PR02852 


311 




6303 


AF068294 


Homo sapiens 


HDCMB45P 136 


53 


6304 


G03789 


Homo sapiens 


Human seca-eted protein, SEQ ID NO: 110 
7870. j 


62 


6305 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO; ! 1 13 
6953. 1 


45 


6306 


W48351 


Homo sapiens 


Human breast cancer related protein | 166 
BCRB2. ! 


58 
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SEQ ID 


Accession No. 


Species 


Description 


Smitli- 




NO: 








Waternian 


Identit 













y 


~6^^07 


gggggg 


■■ 

Homo s^Pj^l^^ 


WW? " 






~6l08 


G03786 


omo sapiens 


~Td ■ ggQ rp >TQ. 


Ts5 " 


~ 








78fi7 ° ^ 






"6309 


QQ2JU 


".jj : 

omo sapiens 


~H ' Fd • SEO ID NO- — 


— — 


— 








7795^°^^'^'^** 






~63T0 


AFl 19851 


-r- : 

Homo sa'^iens 


PRO 1 722 


-— 


— 


6311 


L27428 


omo ^^P|^||^ 




"q1 


~ 




AF130089 


omo sapiens 


PRO2550 


-rJT 


"Ts 


"6313 




AFl 30089 


omo sapiens 


PRO2550 


— -j 


"fiR 


6314 


X15917 


Paramecium 


ORF4 protein (AA 1-156) 


— = 

97 


~ 

38 







^""^^^^^ — . 












Homo sapiens 


— ^— 

Human PR0245 protein fragment 


— 

94 




~— 







derived from DNAjSGjS,^ 










-n -■ 

Homo sapiens 


Human secreted protein #75. 


124 


65 


T^T^ — 




Homo sapiens 


reverse transcriptase 


117 


56 


—i— — 


IVtl 1900 


Mus musculus 


15-kDa proline-rich salivary protein 


105 


41 


6319 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


175 


69 








n 






~Wvx) 


U92673 


— 

Eurythenes sp. 


cytochrome oxidase I 


59 


^'^ 














6321 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


283 


72 








gene 44 clone HTDAD22. 






"frw 




AK000496 


: 

— O'tio sapiens 


unnamed protein product 


— — 

206 




10 




AB047934 


Macaca 


hypothetical protein 


173 


68 


— — 




fascicularis 











fwvi — r 

AB02j485 


iVfus musculus 


transcription factor CA150b 




43 





AFO'^026'^ 


Bos taunis 


general protein transport factor pl6 






6 J 26 
-_ 


L 223 76 


Homo sapiens 


alternatively spliced product using exon 
'■'■'^ 


98 


57 


6327 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


115 


60 


— — — 







encoded by gene 90. 






-^£±1 — 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


100 


57 


6329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


67 










'232^ . 






~rtT3n — 




Peromyscus 


reverse transcriptase 




82 


— 






maniculatus 








"6^^31 — 


G03789 


omo sapiens 


-pj T : 


— 

138 


— 

57 








J • -. 






"6332 — 


K01664 


FT 

rosop la 


- 1 e protem 




12 


— 

37 






Homo°s?^ie^ns 








6333 


M69297 


omo sapiens 


^ 





— ^ 






H"m"'r'^ie"^ 


reverse transcriptase 


-— 

"Tnl 




~S3I5 — 


AF090894 


omo sapiens 


PROG 113 




"5^ 


y 


AF130089 


omo sapiens 


PR.(y2550 


"=r?o 


— 


-2^j= 


AJ271872 




extensui 


lo4 


— 

32 






sylvestt-^s 








6338 


AJ132694 


Homo sapiens 


small GTPase raclb 


99 


100 


-^11? — 


AF119851 


Homo sapiens 


PRO 1722 


123 


60 


6340 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


215 


56 








7870. 






"6341 — 


U83774 


"5 -■ '■ 

Pemis apivorus 


cytochrome oxidase I 






"6342 — 


(302538 






—~ 











6619. 






6343 


X924S5 


Plasmodium 


pval 


104 


81 


6344 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


64 








4335. 






6345 


U93572 


Homo sapiens 


p40 


79 


48 


6346 


X92485 


Plasmodium 


pval 


156 


55 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Watoman 
^'^"'^^ 


Identit 

y 














6347 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


85 


6348 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


158 


77 


6349 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


64 


6350 


AF130089 


Homo sapiens 


PRO2550 


187 


69 






norvegicus 












leucopus 








6353 


X55687 


escu^entum°" 


extensm (class 11) 


89 


44 


6354 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


57 


6355 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


76 ■ 


"6356 


Y86248 


Homo sapiens 


Hifnan secreted protein HCHPF68 

SEQ ID NO: 163. 


"98 


"61 


6357 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 




-|i 




Y02693 


omo sapiens 


°ene 44 clone HTDAD22. ^ 


"214 




6359 


G00427 


Homo sapiens 


4508. 


~75 


"88 




X07882 


l^°™ophfeoi-r 


cytochrome c oxidase subunit 3 






6361 


Z11592 


parasitica 


c> ociromecoxi ase su uni j 


7T 





6362"" " 


AF09lj944 " ~ 


Homo sapiens 


PRO0663 


130 


46 


6363 


AF010144 










6364 


AF229067 


Homo sapiens 


PADI-H protein 


155 


75 








gene No. 101. 








AK024501 




FLJOOI 10 protein 







6367 


AFl 18082 


Homo sapiens 


PRO 1902 




'W' — 


6368 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


53 


6369 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


96 


6370 


G03043 


Homo sapiens 


Himan secreted protein SEQ ID NO- — 

7124. 


"124 


^l3 


6371 


AF 13 0079 




PR02852 






6372 


X92485 


Plasmodium 


pval 


129 


42 


6373 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


63 


6374 


AF119851 


Homo sapiens 


PRO 1722 


251 


65 








7871. ^ 








X70343 


sylves^s 


extensm 


104 


35 


6377 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7305. 


70 


52 


6378 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


97 


69 


6379 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


109 


51 


6380 


Z38128 


Mus musculus 


histoneHl 


113 


36 


6381 


U93573 


Homo sapiens 


p40 


21'2 


43 
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SEQID 
NO: 


Accession No. 


Specie, 


Description 


Smitli- 
Waterman 


Idcntit 


6382 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


■ 109 


40 




AL096744 










6384 


AF220264 


Homo sapiens 


MOST-1 


119 


56 




X92485 


vivax 












norvegicus 








6387 


G02639 


omo sapiens 


6720™ ^^'^'^^'^'^ protein, SEQ ID NO. 


116 


52 


6388 


M13100 


Rattus 


unknown protein 


119 


48 


~6389 


AB047948 


Macacr^^ 

fascj^i^aris 


-r . . . —. 

ypo e ca pro em 


"80 




6390 


X53581 


Rattus 


ORF4 


139 ' 


55 


6391 


AK025947 


Homo sapiens 


unnamed protein product 


166 


52 


6392 


G03786 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO. 




75 


~6393 


j^^^gg^ 


Homosa iens 


unknown protein 


T66 


"39 


6394 


AF229067 


Homo sapiens 


PADI-H protein 


128 


59 


6395 


G02532 


Homo sapiens 


miraan secreted protein, SEQ ID NO. 


80 


70 


6396 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


44 


6397 


M26361 


Mus rausculus 


LINE/Ig H-chain fusion protein 


97 


61 




AF n 8082 




PRO 1902 






6399 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


116 


89 


6400 


U93570 


Homo sapiens 


putative pi 50 


88 


43 








Hiinian secreted protein fragment 
encoded from gene 28. 








G04072 




8153. ^ 








G 02 872 




6953. ^ ' ^ 






6404 


G0326J 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


155 


56 


6405 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO' 1051 


88 


39 


6406 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


68 


6407 


G00560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


244 


90 


"6408 


L27428 


Homosa iens 


reverse transcriptase 


~108 


~42 


~6409 


(302532 


Homo sapiens 

omo sapiens 


Human secreted protein, SEQ ID NO' 

6613. 





— 


6410 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


106 


44 


6411 


U70935 


Peromyscus 

maniculatus 




154 


"40 


6412 


AF090931 


Homo sapiens 


PRO0483 


99 


85 


6413 


AL031186 


Homo sapiens 


bK984Gl.l (supported by FGENES) 


249 


78 


6414 


AF1300S9 


Homo sapiens 


PRO2550 


100 


55 


6415 


AB 047963 


Macaca 
fascicularis 


hypoflietical protein 


165 


57 


6416 


M22332 


Homo sapiens 


unknown protein 


111 


42 


6417 
6418 


G02832 
AFO 16099 


Homo sapiens 
Mus musculus 


Human secreted protein, SEQ ED NO: 
6913. 

endonuclease/reverse transcriptase 


140 
147 


50 
60 



300 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 


Heiitit 


"6419 


pgg^j^ 


"Homo sapiens 


NADH dehydrogenase subunit 1 ' 




~M 


6420 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


144 


81 


6421 


U49973 


Homo sapiens 


0RP2: function unknown 


91 


62 








6835. 








AF072508 










6424 


AK000385 


Homo sapiens 


unnamed protein product 


161 


54 


6425 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


148 


54 


~6426 


X92485 


Plasmodium 

vivax 


pval 


To3 


~ 




Y 8624 8 




SEQ ID NO: 163. 












6913. ^ 








AFO 16099 










6430 


Y36203 


Homo sapiens 


Human secreted protein #75. 


139 


58 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

7344. 










Plasmodium 


pval 










Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 233. 








G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 








G02872 


Homo sapiens 


HuiTian secreted protein, SEQ ID NO; 
6953. 










Homo sapiens 


similar to salivary proline-rich protein 
P-B 






6437 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85 


lb 


58 


6438 


X99467 


Medicago 
Homo^a^iens 


ENOD20 


110 


37 


"6439 


yggg^^ 


omo sapiens 


Human ^ene 91 encoded protein 

fragment, SEQ lDNO:491. 


~T?9 


58 


6440 


X92485 


Plasmodium 
wwc ^ 


pval 


114 


46 


"6441 


AK024455 




FLJ00047 protein 


T73 


~V7 


"6442 


G03790 


Homo sapiens 


7S71. 


— 


~5l 




Y36203 










6444 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI8. 


138 


50 


6445 


M24732 


Homo sapiens 


larain-like protein 


71 


45 




AFl 19851 




PRO 1722 






6447 


AF126163 


Homo sapiens 


HHLA3 protein 


98 


53 


6448 


G00577 


Homo sapiens 


Human secreted protein, SBQ ID NO: 


151 


58 


"6449 


AF200 1 87 


cercopithicine 

herpesvirus 15 


EBNA2-like protein 


"125 


— 


6450 


X86031 




antibacterial protein 






6451 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


114 


54 


6452 


AF090942 


Homo sapiens 


PRO0657 


129 


65 


6453 


Ml 9503 


Homo sapiens 


OEIFI; putative 


86 


39 


6455 


X92485 


Plasmodium 




99 


40 



301 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 

SCOTC 


Ideiitit 


"6456 


X86030 


^atS — " 


— ^~Tk P ~ 

e ensin- e pro em 











un^uLuJata 








"6457 


Y08061 


Homosa iens 

Ondiocerca"^ 


Human c m b rotein fra°- ent 

— uman c my pr em — ^men^ 


Ti3 


"ri 


"6458 


AF055985 




pyrro i one-nc an igen 


"90 


— 






vol ulu 








"6459 


AFl 19851 


^^"^ . 

Rattu" ^^^'^"^ 


PRO 1722 


—75 


~ 


"6460 


J05497 




snRNP-associated polypeptide N 


~T7vi 


— 







norvegicus 











AF130089 


omo sapiens 


PRO2550 




-L- 


— 




W50192 


omo sapiens 


Ammo acid sequence of salivary 


93 


~ 










P*^*^ ^ : „_ ^ 






6463 


003789 


— : 

Homo sapiens 


Human secreted protem, SEQ ID NO: 


— — 

122 




72 








J . 











^ 


— ; 

Homo sapiens 


Human secreted protein #28. 


— 




~6465 


AL160493 


Leishmania 


probable (hhv-6) ull02, variant a 










major 


DNA, complete virion genome 






6466 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


36 






.... 


7214. 






6467 


G03y93 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


58 


1 






... ... _ . . 






6468 




Homo sapiens 


Human secreted protem, SEQ ID NO: 


121 


67 


















X92485 


— —. 

Plasmodiuin 
OTvax ^ 


pval 


— 

89 


— 

72 


"6470 


: 


omo sapiens 


T5 Th — P — SFO m NO — 


-— 


— 














"6471 


AF130089 


omo sapiens 


PRO2550 


-— 


— 


■"M79 




omo sapiens 




Tvi 





"6471 




AF130079 


omo sapiens 


PR02852 


~T7i 


~ ■ 




G03133 


omo sapiens 


Human secreted protein, SEQ ID NO. 












^ jl" 






"647^ 




AF220264 


. 

omo sapiens — 




— 

— i 




— 




G04067 


omo sapiens 


— 3 : 

Human secreted protein, SEQ ID NO: 




45 








-^^^ — i — 1 1 n 






"6477 


AFO 16099 


"ivf i 


en onuc eas reverse anscrip ase 


"ff)fi 


— — 





X92485 


P asmo lum 


pva 


— 

2 0 



















6480 




"rf 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


— 

112 


— 

61 









— '• i^;; 








U49973 


; : 

omo sapiens 


• "JPct'O" " — 


— 


— 

"^T 


"6482 






reverse anscrip ase 


"TTi 




"6483 


X92485 


Pl^na'dT" 

abino mm 


pva 


Tri 


"dO 

















L27428 


1? 


; 

reverse anscnptese _ 


-— 


— 

-^^ 


"6485 


Y73966 


Homo sapiens 

omo sapiens 


uman pros a e or ragmen 


"T3i 










erive P^*^^ ■ — — 






"6486 


G02538 


— : 

omo sapiens 




-— 

















"6487 


^ggg^^ 


— : 

omo sapiens 







— 


— ^ 


"mm — 


Y64890 


Homo sapiens 


Human 5 EST related polypeptide SEQ 


72 


42 








ID NO"1051 






"6489 — 


j^^^^gg 


Homo sapiens 




^"93 


"33 


6490 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


57 








8153. 






6491 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


70 








7795. 






6492 


U49973 


Homo sapiens 


ORFl; 1WER37; putative transposase 


251 


75 








similar to pogo element 






6493 


U15647 


Mus musculus 


reverse transcriptase 


100 


40 



302 
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SEQTD 


Accession No. 


Species 


Description 


Smitii- 


% 


NO: 










Waterman 


Identit 




AK027208 


: 

omo sapiens 


— d ' d — t 

unname protem pro uc ■ - — 


"TM^ 







~aZq^ 


G03714 


omo sapiens 




-— 










7795^ ^^^'^ ^™ 






"6496 


AF130089 




Homo sapiens 


PRO2550 


— r 


— 




"6497 


U49973 


omo sapiens 


ORF 1 ; MER37i putative transposase 


















6498 


AF090894 


Homo sapiens 


PRODI 13 


140 


68 


6499 


Y03000 


Homo sapiens 


Fragment of human secreted protein 


138 


84 








y_^5£ — ■ — — 






"6500 


G00357 


"h ^ 

omo sapiens 




— — 


— 








^^^ansecre pro in, Q O. 






6501 


M69297 


Homosa iens 


"ORFl 


"87 


"58 




1VI22332 


Homo sa lens 

_ omo sapiens 


— if P ' 

un own pro em 


"81 


"zz — 




X92485 


w^ax ^ 


pva 


—j 


— 


~6504 


AF090895 


omo sapiens 


PRO01I7 


— 


— 




6505 


Y7j9j2 


Homo sapiens 


Human prostate tumor EST fragment 


— 

97 




69 








derived protein ^^119. 






. 

6506 





__ : 

Homo sapiens 


putative pi 50 




J22 


- ..... 

-^^ 


^07 


X06O29 


Vigna 


extensin-like protein 


177 


32 


— 


— — — 


unguicuiata 








6508 


G0j790 


Homo sapiens 


_ : 

Human secreted protein, SEQ ID NO: 


282 


60 
















W4835 1 


~s = 

Homo sapiens 


Human breast cancer related protem 


- 

142 


61 









BCRB2. — 








G03 133 


omo sapiens 


Human secreted protein, SEQ ID NOi 












211$ 






~65ri — 
— 




" 


; 

omo sapiens 


mariner transposase — _ 


JIz 


-if 


6512 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


50 














"6513 


AF090928 


"h '■ 

omo sapiens 


PRO0470 


97 


— 

_|l 


~65T4 


AF134827 


ono ep IS 


small nuclear ribonucleoprotein B 


~1M 










domestica 












Homo sapiens 


— : — — 

Human secreted protein, SEQ ID NO: 


.. 

82 


44 









f^^^'v: — j-p 5-r-j 








AF068754 


"h '■ 

omo sapiens 




— — 

118 


— 

69 








HSBPl"*^ ^'^'"^ " ing proem 









AF130089 


— ; 

omo sapiens 


PRO2550 


— — 

T^a 





"65T8 




omo sapiens 


unlaiown protein — — — 






g 


G02493 


omo sapiens 




— 


— 








^^uman secre e pro ein, Q V. 






"6520 


G03119 


Ho osa iens 

omo sapiens 


'Hum reted rotein SEC IDNQ- — 


~65 


Tt 








72"oo''" ^^"^^ ^ ^™ 






6521 


AK023563 


Homosa iens 


unnamed r te 'n r duct 


"^268 


^8 




G00357 


Tiom ie 

omo sapiens 


^ et d t ' — SEO ID NO' — 


— — 











^man seer e pro em, Q 








AF2 10651 


Homosa iens 

omo sapiens 


j^^g'^g 


-— 


-— 





AF130079 


omo sapiens 


PR02852 




127 


— 

: 





G03786 


omo sapiens 


Human secreted protein, SEQ ID NO: 




51 








'1T~^ H 






"6526 


Y45382 


Homo -a len 

omo sapiens 


enTOded^from^<»ene28™ 


—J 


"62 














6527 


U42471 


Mus mus cuius 


Wiscott-Aldricli Syndrome protein 


115 


48 








homolog 






6528 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


86 


60 








6913. 






6529 


R59842 


Homo sapiens 


ApoE4Ll protease. 


92 


58 


6530 


R59842 


Homo sapiens 


ApoE4Ll protease. 


135 


77 


6531 


W48351 


Homo sapiens 


Human breast cancer related protein 


88 


60 ■ 



303 
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SEQID 
NO: 


Accession No. 


species 


Description 


Smitii- 

&wn 


% 

1. 








BCRB2. 






6532 


AFl 19851 


Homo sapiens 


PRO 1722 


Tn 


~6[ 


6533 


AF063866 


Melanoplus 
entomopoxvirus 


ORF MSV233 hypothetical protein 


85 


35 




G02469 




6550. 








R95913 






4p 




~6536 


AF130079 


H°mo ^a^iens 


PR02852 




~ 


"6537 


AB047600 


'w — 

fas^cicularis 


hypothetical protein 


T55 


^60 


6538 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


279 


68 


6539 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


112 


55 


6540 


AF130089 


Homo sapiens 


PRO2550 


124 


56 





U93574 


omo sapiens 




nrl 


-|| 




AFl 18082 




PRO 1902 






~6543 




Homo sapiens 


encoded by gene 17. 


Ti 


— 


6544 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


44 


~6545 


G03786 


Homo sapiens 


Human secreted protein SEQ ID NO' — 

7867. 


^38 


"62 





AFl 16661 




PR01438 









552928 


Homo sapiens 




ITS 




6548 


AB030192 


Mus musculus 


homologue of S70447 FlFo-ATP 
domain f subunit 


174 


63 




AK0251 16 










6550 


X92485 


Plasmodium 


pval 


105 


56 


"6551 


AF130079 


H^rn^ a ien 


PR02852 


"T72 





"6552 




Pe^omyscus"^ 




~84 


"42 








protein 








M76546 


annuus 










AF130089 




PRO2550 






6556 


AFl 3 0079 


Homo sapiens 


PR02852 


165 


48 




R07057 




(hhcm) gene preoduct 








(303307 




Human secreted protein, SBQ ID NO: 

7888. 








X92485 


vivax 










G02639 




6720. 








AF217374 


planci 














Human secreted protein SEQ ID NO' 
7795. 






6563 


Y95028 


Homo sapiens 


Human clone vp7 1 0RF2, SEQ ID 
NO:128. 


134 


84 


6564 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


66 


47 


6565 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


180 


49 



304 
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SEQ ro 

NO: 


Accession No. 




Description 


Smitli- 
Waterman 


Identit 

y 


6566 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


96 


57 


"6567 


X92485 


Plasmodium 

vivax" 


pvai*"° 


~96 


"58 








gene 44 clone HTDAD22. 






gg^Q 


AFl 19851 




PRO 1 722 









AF2 89098 


Cladrasds'^"^ 

kentukea 


EN0D2 


TTo 




-^^^i 


AFl 18086 


Homo sapiens 


PRO 1 992 










G02538 


omo sapiens 


Human secreted protein, SEQ ID NO. 






~6573 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

8144. 


~89 


1i — 





M22332 


Homo sapiens 










Y86248 


omo sapiens 


HumarLcretecfprotein HCHPF68 

SEQ ID NO: 163. 


"51 





6576 


AK021848 


Homo sapiens 




l3l 


^41 


6577 


AF161356 


Homo sapiens 


HSPC093 


155 


57 




AFl 16909 








76 


6579 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


53 


6580 


AF130089 


Homo sapiens 


I'RO2550 


276 


74 




K0^576 


Homo sapiens 


salivary proline-rich protein 1 


119 


3J 


6582 


U49973 


Homo sapiens 


ORF2: function unknown 










Homo sapiens 


Human secreted protein, SEQ ID NO; 

7492. 






6584 


AJ249395 


Globodera 


NADH-ubicjuinone oxidoreductBse 


99 


4] 


6585 


Y27907 


Homo sapiens 


Human secreted protein encoded by 


75 


62 


6586 


U49973 


Homo sapiens 


ORF2: fiinction unknown 


134 


58 




Y08061 










6588 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


217 


53 


6589 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


147 


41 


6590 


U43627 


Arabidopsis 


extensm 


103 


31 


6591 


X62677 


Oryctolagus 


retrovirus related reverse transcriptase 


71 


34 


6592 


¥91 577 


Homo sapiens 


Human secreted protein sequence 
encoded by ^ene 2 SEQ ID NO'250 


177 


66 


6593 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 



100 


72 


"6594 


M69297 


Homo sapiens 




~T37 


~43 


6595 


X68101 


Rattus 




372 


89 


6596 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


62 


~65Q7 


Y027 85 


— : 

omo sapiens 


^nan secreted protein encoded by 

gene 51 clone HUKEX85 


— 


-— 

43 


6598 


299714 


Homo sapiens 


bKl 048E9.5 (novel protein) 


1642 


98 


6599 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


162 


59 


6600 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


132 


65 


6601 


AK000385 


Homo sapiens 


unnamed protein product 


187 


74 


6602 


AF130051 


Homo sapiens 


PRO0898 


137 


54 


6603 


S58722 


Homo sapiens 


X-linlced retinopathy protein {C- 


88 


48 



305 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Wafcrman 
^core 


HcBht 









terminal, clone XliH.Sc} 








■^^^^^^ 


— — 

Hydra sp. 


mini-collagsn 


— — 

-~ 


— 

4t — 


"6655 ' 


G03807 


Homo sapiens 


7888™ secreted protein, SEQ ID NO. 


















"66()fi 


G02872 


"h ' 

omo sapiens 


iid P — SEO TD NO- — 

^^mnansecre proem, g 


^j— 


— 














6607 


AF232673 


Homo sapiens 


B17 long form 


Mil 


99 





AF 11 8082 


Homo sapiens 


PRO ! 902 


T^T 


"Iz — 




R1j556 


omo sapiens 


Protein encoded downstream of hhc__M 





















Y08062 


omo sapiens 


^Human PR024S protein fragment 


-TfT. 


— 








derived from DNA35638. 






"6611 




yg^^^^ 


"H "e 

omo sapiens 


Huinan secreted protein #75. 


T40 


"62 

— 




X92485 


vivax° ™ 


pva 


~iv7 

15 


72 


"66T3 


^g2077 


Homosa iens 


mariner transposase ' 


^60 


~67 


"6614 




AF161392 


~R 

omo sapiens 


HSPC274 


-pr^ 


~45 


6615 


G0o714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




93 


~ 

65 

















AFl 19851 


—J : 

omo sapiens 


PRO 1722 





— 





G03789 


omo sapiens 


Human secreted protein, SEQ ID NO: 


101 


~ 

63 














„ — 


■ 


T~^T '■ 


— i ■ — 7 — 1^ = 

extremely cysteine/valine rich protein 


— — 

117 


— 







: 










Y64890 


omo sapiens 


"Human 5' EST related polypeptide SEQ 


— — 


— 














"6620 




"Mu — inn — 




~T06 ~ 


^^8 


~662l 


AFl 3 0089 


Homo's'a'^iens 


PRO2550 


"TTl 


_|- 


6622 


X92485 


Plasmodium 


pval 


139 


44 






wvax ^ 








-^ZfS 


AF090895 


omo sapiens 


PROO 1 1 7 


~T6i ' 


— 


"6674 


M63730 


omo sapiens 


BPAG2 _ — — 


TriT 


— r 




6625 


QQ2258 


Homo sapiens 


Human secreted protein, SEQ ID NO.* 


TF" 


— 

51 


















ArU /ooOtJ 


Homo sapiens 


PTnnm 


"To 


— 

~P 




AL080253 


tlialiana 


put^vesnRNP rotein 

proem 


~T5I 




-^^^ 


AK000385 


Homo sapiens 


unnamed protein product — 


^^TS 







W60652 


omo sapiens 


'c- (a ema ive anscnpt). 






"fifi^n 


G028o2 


omo sapiens 


uman secre e pro em, Q 


-— — 


"fifi 

















AB046048 


— 

acaca 


' — H '■ H 

unname porteinpro uct 


— — 


— 

65 


• 




ascicuaris 








"6632 


X92485 


vivax ° 




pva 


"TT9 


~88 


"6633 


^Qgj^g 


Homo a ien 

omo sapiens 


potORFV 


98 


-Tq 








omo sapiens 


salivary proline-rich protein 1 


165 


-— 




g^^g 


rosop 1 a 


dynamin 


111 


"39 














"6636 


G03789 


Homo°T^' 

omo sapiens 


Pluman secreted protein, SEQ ID NO: 


162 


~50 








7870. 






6637 


AF130089 


Homo sapiens 


PRO2550 


141 


75 


6638 


AFl 18082 


Homo sapiens 


PRO1902 j 108 


63 


6639 


W48351 


Homo sapiens 


Human breast cancer related protein 


144 


60 








BCRB2. 






6640 


Y 197 67 


Homo sapiens 


SEQ ID NO 485 fi-om W09922243. 


104 


47 


6641 


AFl 18082 


Homo sapiens 


PRO1902 


121 


53 


6642 


AFl 18082 


Homo sapiens 


PRO 1902 


195 


68 


5643 


R95913 


Homo sapiens 


Neural thread protein. 


212 


65 



306 



SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Score"""" 


% 

Identit 


6644 


AF090928 


Homo sapiens 


PRO0470 


81 


~Z5 


6645 


AF 13 0079 


Homo sapiens 


PR02852 


151 


69 


6646 


AF003535 


Homo sapiens 


ORF2-like protein 


140 


35 




G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 




61 


6648 


AF010144 


Homo sapiens 


neuronal thread protein AD7(;-NTP 


100 


~42 


6649 


Ml 5 530 


Homo sapiens 


B-cell growth fector 


121 


76 


6650 


AF090942 




PRO0657 


1 19 





6651 


AF118082 


Homo sapiens 


PRO 1902 


130 




6652 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


143 


87 


6653 


AF130089 


Homo sapiens 


PRO2550 


143 


63 




AF119851 




PRO 1722 






~6655 


GO 1246 


Homo sapiens 


5327. ^ 


To3 


-~ 




API 19855 




PRO 1847 






6657 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


36 


6658 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


61 


"6659 


AFl 19851 


"Horn — aTe 

omo sapiens 


PRO 1722 


Ttt 


"sl 


~6660 


X92485 


vivax''* 


pval 


~lM 


-p^ 


6661 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


57 


6662 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


68 


"6663 


^^22029 


Glycine max 


hydroxyproline" rich glycoprotein 


TTs 


"35 


6664 


D00570 


Mus musculus 


open reading frame (25 1 AA) 


126 


35 




AF130051 




PRO0898 






6666 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


42 


6667 


G02851 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


71 


41 


6668 


AFl 3 0079 


Homo sapiens 


PR02852 


110 


59 




G03800 


Homo sapiens 


7881. 








G00527 




Human secreted proteiiij SEQ ID NO: 
4608. 








G03790 




7871. 










cuniculus 


retrovirus related reverse transcriptase 






6673 


V01201 


vinisr^ ^^^^^^ 


coding sequence of pol 


143 


47 


6674 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


78 


6675 


G00648 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


64 


56 


6676 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


48 


6677 


AF130089 


Homo sapiens 


PRO2550 


Too 


~l5 


6678 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


94 


68 


6679 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


105 


71 


6680 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7893. 


218 


53 


668] 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


115 


53 
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SEQID 
NO: 




Species 


Description 


Smith- 
Waterman 


Identit 

y 














"6682 


AF161356 


H m sa tens 


HSPC093 





— 


~6683 


G02538 


H°mo sapiens 

omo sapiens 


Hrnnan secreted protein, SEQ ID NO. 


^tF 


~ 

72 


6684 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


42 


6685 


S79410 


Mus musculus 


nuclear localization signal binding 


98 


43 


6686 


U93564 


Homo sapiens 


p40 


96 


41 




AFl 19855 




PRO 1 847 






6688 


U52077 


Homo sapiens 


mariner transposase 


268 


81 






vivax 














Human breast cancer related protein 
BCRB2. 








M63274 


falciparum 


malaria antigen 






6692 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NOi 


147 


80 


6693 


Mil 901 


Rattus 


proliiie-rich salivaiy protein 


115 


40 


6694 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


131 


50 




G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO* 
7878. 








AFC 68294 


Homo sapiens 


HDCMB45P 






6697 


X55685 


Lycopersicon 
esculentum 


extensin (class 1) 


112 


41 


"6698 


M83293 


immunodeficienc 


vpx protein 


— 


— 


"6699 


G01246 


Homo^apiens 


Human secreted protein SEQ ID NO- — 
5327. 


"T69 










6536. 






-^^^1 — 


AFI 18080 


^a'^o^d^m"^ 


PRO 1 880 





3I 






vivax 










AF161356 




HSPC093 






6704 


X92485 


Plasmodium 


pval 


164 


54 


"6705 


QQ^ggg 


Homo sapiens 


Human secreted protein SEQ ID NO" — 
6619. 


172 


l9 




AK025326 










"6707 




Homo sapiens 


HiunMf secreted^^crteh^ ID NO- — 

7284. 


TI5 





6708 


G03786 


Homo sapiens 


Human secreted protem, SEQ ID NO; 


119 


71 


6709 


AF176531 


Mus musculus 


F-box protein FBX 16 


324 


60 




X92485 


vivax 








6711 


AF13D089 




PRO2550 






6712 


GO 3411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


150 


59 


6713 


AF090944 


Homo sapiens 


PRO0663 


114 


43 


6714 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


110 


47 


6715 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


131 


60 
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SEQJD 
NO: 






Description 


Waterman 


Identit 

y 


6716 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


65 


6717 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


46 


"6718 


QQ^222 


Homosa iens 

omo sapiens 


fcman ecreted rotein SEC ID NO- — 

5303 ^'^ 


14 


Ton — 


6719 


y^^203 


Homosa iens 


Human secreted rotein #75 


"l29 


Ts — 




QQ-^-j^g 


Homo sapiens 


Human secreted protein SEQ ID NO' — 
7870. 


Tot 


"64 




Y85062 




homologue (ICEL) protein sequence. 










Homo sapiens 


Human secret protein 75 . 


~TT^ 




~6723 


AFO 1 0 1 44 


Homo sapiens 






46 


"6723 


R96800 




Himum^istiocyte'^secr^^ factor HSF — 


T33 




6725 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


88 


76 


6726 


Q00352 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


218 


61 





y 


'k ^ 

omo sapiens 


Human secreted protein sequence 

encoded by gene 2 SEQ ID NO'250 


~— 


— 


"6728 


U01877 


Homo sapiens 


p300 protein 


1I9^ 


~27 


6729 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


51 




W88627 




clone HPMBQ32. 












HCl ORF 






6732 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


114 


38 




AF057352 




hepatocellular carcinoma autoantigen 






6734 


AF090895 


Homo sapiens 


PRO0117 


152 


57 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 






6736 


X92485 ' 


Plasmodium 


pval 


92 


89 


6737 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


112 


49 


6738 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


278 


72 


6739 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


191 


47 


6740 


AF118082 


Homo sapiens 


PRO 1902 


83 


49 




G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 








Y36708 




encoded by gene 62. 






6743 


¥02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


177 


70 


6744 


Y91475 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 25 SEQ ID NO:148. 


159 


50 


6745 


X92485 


Plasmodium 




112 


39 


6746 


Z82268 


Caenorhabditis 
elegans 


predicted using Genefinder-contains 
similarity to Pfam domam: PF0I391 
(Collagen triple helix repeat (20 
copies)), Score=73.8, E-value?=1.2e-l 8, 
N=2; PF01484 (Nematode cuticle 
collagen N-termina] domain), 
Score=37.5, E-value=le-07, 
N=l~cDNA EST yk92fl.5 comes from 
this gene-cDNA EST yk55f6.5 comes 
from this gene-cDNA EST yk65b7.3 
comes from this gene-cDNA EST 


108 


36 
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SEQID 


Accession No. 


Species 


Description 


Smitli- 




NO: 








Waterman 


Idcntit 










Score 


y 








yk;58f3-5 comes from fliis gene~cDNA 












EST yk72h5.5 comes from Ihis 












gene~cDNA EST yk55c4.5 conies from 












this gen&-cDNA EST yk58hl.5 conies 












from this genC'^DT^A ESTyk65al2 5 












'^'k94f4^°co *es fronTthis^en^^'^ 














"h — °T ■ ' QFo^in NO- — 










— : 

omo sapiens 


^umansecree proem, Q : 


— — 


— — 














"6748 


g^^ggjg 


Homosa iens 


Neurai thread rotein 


~Jo& 


36 


^6749 


^-^^^^ 


MuTmif c'ulT 

u s us 


latelet a tivatin° fact r 


^09 


"60 








LetylVckol^risoform lb beta subunit 






6750 


QQg^^^ 


Homosa iens 

omo sapiens 


Human^secr^d'^rotein SEO ID No"^ 


T59 


"65 








7795 P ' V 






"6751 


G00637 


Homo sapiens 


Human secreted protein SEQ ID NO* 


"96 


^8 








4718. 






6752 


AF090895 




PRO0117 




-^^ 





AF217973 


n'^mo sapiens 

omo sapiens 




T?vs 







AFl 19851 


omo sapiens 


PRO 1 722 


-—2 


—2 






omo sapiens 


PRO 1847 


— — 





~6756 


G03789 


omo sapiens 


7870^ protein, SEQ ID NO. 


"163 
















"6757 




G00354 




"h '' 

omo sapiens 


—= — : — i : — ^ — 

umanseciee protein, Q 


— 5 


— 






"h ' 

omo sapiens 


~Tl~ ^fd ^ — SFO lb NO- — 

^mnanseciee protein,. iQ 


~T5f 


"7?^ 














"6759 


AF130051 


Homo sa iens 

omo sapiens 


PRO0898 


"98 


"68 




"6^61 


M36914 


eamays 


cell wall protein (put.), putative — 


— 




~- 




G02538 


omo sapiens 


uman secre d protein, Q ID O. 




~ 














"6762 





. 

omo sapiens 


- ■- • —s SFO ID NO- 


"88 


"65 














"6763 


Ml 9756 


omo sapiens 


MHC HLA B 47 FT 

-hi fhh ^^ i^i']^^""^ ^ J 


~s8ri 


"o? 

— = 


6764 


AL359782 






— -z 










complete virion enom'e^^^ ^ ^ 









AF010144 


"it ' 

Homo sapiens 


neuronal thread protein AL'7c-N I Jr 


— — 

"T^H 


— 




G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO. 





















G0379O 


— ^ 

Homo sapiens 


— — : — j — — — 

^ uman secre ed pro ein, S Q ID NO: 


-— 


— 

64 














"6768 


AF061340 


~A~7h 

r 1 eus 


7— V rq r- ^ 

cy oc ome c oxi ase su uni 





— 














6769 


T71265 cdi 


Homosapiens 


I9-JAN-1996 Neuronal apoptosis 


T88 


"59 








inhibitor protein (NAIF) cDNA. 






6770 


BO 1390 


Ifomosa iens 


Neuron-associated protein. 


"92 


"55 




AF017777 


"n 

rosop la 


a CO a 


~TTZ 


— 














6772 


^g^ggg 


Su^s scrofa 


rotein hos liafase 2A 65 kDa 


"33l 


"93 








re ulato iibun L al ha isoform 






6773 


X61296 


"^ttas 


r en readin " fiame 2^ ^ ""^ 

openrea mg ame 


101 


"It 






norvegicus 








6774 


ALl 17345 


Homo sapiens 


dJ416F21.2 (novel protein) 


639 


91 


6775 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


110 


61 








7867. 






6776 


AJ007798 


Homo sapiens 


stromal antigen 3, (STAGS) 


238 


76 


6777 


AB029065 


Mus musculus 


Ser/Thr kinase KKIAMRE-beta 


165 


66 


6778 


M37400 


Homo sapiens 


aspartate aminotransferase 


208 


68 


6779 


AF130089 


Homo sapiens 


PRO2550 


127 


52 


6780 


AF130079 


Homo sapiens 


PR02852 


332 


81 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 

— — 








Waterman 
^""^ 


Idcntit 

J . . 


6781 




AL049748 


; 

Homo sapiens 


dJ41P2.2 (RNA binding motif protein 9 


264 


60 








*^'^cr*"d d k' 4 






"6782 


j^^^QQ^ 





eye in- epen en mase 


^39 


— 














"6783 


AF3 19938 


Themus ^ 


— — etencefact rComEA 

c mp e c 


TTs 








h"l 








"6784 


AF010144 


HomoTa \ens 


neuronal thread rotein AD7c NTP 


"TT9 


"44 


6785 


LI 6864 


Afticanswine 


cd2^homologue ^'^ ~ 


152 


"34 







fever vinis 










ygg^Q^ 


Homo sapiens 


"o r^j . ■ 

Human secreted protein #75. 


~oo 


~Ad 





AF130087 


Homo sapiens 


PR0241 1 


'Tn 


~ 




AF208846 


Homo sapiens 


BM-004 


— 

664 




100 


6789 


G02485 


Homo sapiens 


Human secreted proteinj SEQ ID NO; 


90 


8j 








^^^^ . : 









AP000059 




Aeropyrum 


120aa long hypothetical protein 


— 

81 









pemix 












Homo sapiens 




Human secreted protein^ SEQ ID NO; 


146 




— ' 






-^^11: : J 






6/92 


-— — — 

Y 12961 


, 

Homo sapiens 


Amino acid sequence of a human 


175 


65 








secreted peptide. 






6793 


Y73442 


Homo sapiens 


Human secreted protein clone ya66_l 


565 


93 


— — 






protein sequence SEQ ID "MO; 106. 








Woooz / 


— : 

Homo sapiens 


Secreted protein encoded by gene 94 


136 


50 




— — 




clone HPJVIBQ32. 






6 9 


AC004022 


Homo sapiens 


serum paraoxonasearylesterase 3 






6796 


AF009668 


multiple 


polyprotein 


194 


60 






sclerosis 








— — 




associated 

re ovirus_ 


— — — — 








R59842 


Homo sapiens 


ApoE4LI protease. 




7, - ■ 




67QS 


U88167 


Caenorhabditis 


contains similarity to C2 domains 


171 








^^^^"^ — ^ 








"6799 


AF293024 


omo sapiens 


— : r—3 

s eroi recep or activator iso orm 


-— ; 


— 


"fisnn — ' 


AFOl 0144 


omo sapiens 


Tfh — H • AD? NTP 

ea proem — c- 


^ 


■fi^ 


— 
— 




Homo sapiens 


Human GTPase associated protein-28. 


— 






AL359782 


Trypanosoma 


possible (hhv-6) ull02, variant a dna, 




47 








complete virion genome. 






"fisni — 


AF078036 


Homo sapiens 


RelA associated inhibitory NF-kappaB 


— — 

1275 


95 








subunit p65 binding protein 






"6804 


G03203 


— : 

omo sapiens 


Human secreted protein, SEQ ID NO. 


— 
















-7— 





It ^ 

omo sapiens 


Human transmembrane protein 


— — 


—J- 

100 








HTMPN-70. 






"6806 


g 


^ 3a i n 

omo sapiens 


Human secreted protein SEQ ID NO" 


— 


"si 








— r-fY. YTd P 






"6807 


Y14482 


li Tn 

omo sapiens 


ragmeiiL 0 uman secre e pro ein 


Tto 









— 


enco e y gene — . 






"fiSOR 


AB046048 




unname po em pro uct 


~rr 


— 






fas^dcukris 








~6809 


K02401 


Homo sapiens 


— r — 1 T- — ; 

cnonomc somatomammotropm 


"dZfi 





6810 


AF130089 




PRO2550 


"150 


"56 


6811 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


123 


65 








gene 22 clone HMSJW18. 






6812 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


38 








7376. 






6813 


AF090930 


Homo sapiens 


PROD 478 


103 


65 


6814 


G0253S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


54 








6619. 







311 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 

~" 


Identit 

y 


"6815 


"B'H42^ 


Homo sapiens 


Human PR01244 protein sequence 

SEQ ID NO: 193. 






6816 


Ml 1901 


norveoicus 


proline-rich salivary protein 


105 


4j 


6817 


AF0I0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


96 


56 


6818 


"Vj0822 


Homo sapiens 


Humm secreted protein encoded from 


102 


48 


6819 


^g^gQg 


Homo sapiens 


unllnown 


T65 


^7 


6820 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


45 


6821 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


97 


39 




G01246 




5327. 






"^5^2 — 




Homo sapiens 


manner transposase 


411 







AC004237 




H^an secreted protein SEQ ID NO- — 







"6^25 


G03797 


Homo sapiens 


7878. 


HI 










7376, 






6827 


001246 




Hiniian secteteci protein, SEQ ID NO. 


108 


68 


6828 


U49974 


Homo sapiens 


mariner transposase 


103 


86 




Ar 1 lyoj 1 


Homo sapiens 








6830 


W48351 


Homo sapiens 


Human breast cancer related protein 


82 


62 


6831 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


73 


6832 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


131 


53 




G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

5066. 








W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 

7868. 










Plasmodium 
vivax 


pval 






6837 


AB046O48 


Macaca 


unnamed portein product 


147 


42 


6838 


AB047600 


Macaca 


hypothetical protein 


134 


52 


~6839 




Homo"sapfens 


Human secreted protein SEQ ID NO- — 

7795. 


14\ 


"60 




G00376 




4457. 








AF130O89 




PRO2550 






6842 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


56 


~6M^ 


AF090894 


Homo sapiens 


PRO0113 


148 


"62 


6844 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


51 


~6845 





H m sa ien 

omo sapiens 


Human secreted protein SEQ ID NO- — 

6619. 


— 


— 


6846 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


65 


44 


6847 


AF130089 


Homo sapiens 


PRO2550 


160 


76 


6848 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


132 


68 


6849 


AFl 16715 


Homo sapiens 


PR02829 


123 


67 


6850 


Y01437 


Homo sapiens 


Secreted protein encoded by gene 55 


142 


96 



312 
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SEQID 
NO: 


Accession No. 


Species 




Waterman 


Identit 








clone HLMBP 1 8. 






"6851 


G03798 


Homo sapiens 


7879. 


TTo 


— 




AF068294 




HDCMB45P 






6853 


G02532 


Homo sapiens 


Human seqreted protein, SEQ ID NO: 


111 


57 


6854 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


72 


~6S^5 


AF130051 


"h 


PROG 898 


— — 


— 


"6856 


G03798 


Homo sapiens 


7879. 


Tsl 


~ 






esculentum 










AL359782 


brucei 


complete virion genome. 









AF0I0144 


Homo sapiens 












API 19851 


omo sapiens 


PRO 1 722 







"SsSi 


G00418 


omo sapiens 


Human secreted protein, SEQ ID NO. 


-— r 


67 


6862 


G03789 


Homo sapiens 


Human secreted protein, SRQ ID NO: 


114 


53 


6863 


X924S3 


Plasmodium 


pval 


105 


63 


"6864 


AF068''94 


Honio sapiens 


HDCMB45P 


"?06 


"61 


6863 


AF068294 


Homo sapiens 


HDCMB45P 


12S 


46 







Homo sapiens 











X9'^485 


asmo lum 
wvax ^ 


pva 




89 


"6868 


Y86248 


omo sapiens 


Human secreted protein HCHPF68 

SEQ IDls'0:163. 





— 


6869 


AF130089 


Homo sapiens 


PRO2350 


162 


67 








encoded by gene 17. 












'^vto^ to'^ ^h^^^ 








'"6872 — 


M28016 


"TTm — alen 








"6873 


M24069 


Homo sapiens 


bindfaig protein A 


~1662 


"79 


6874 


AF130051 


Homo sapiens 


PRO0898 


163 


76 




AF006061 




liGH-V3 






6876 


G00985 


Homo sapiens 


5066^" secreted protein, SEQ ID NO. 


94 


80 


6877 


AF130079 


Homo sapiens 


PR02832 


hro 


76 




S80905 










6879 


X55683 


Lycopersicon 
RattiJs"*"'" 


extensin (class I) 


70 


35 


"6880 


^■^^259 


norvegicus 


nitric oxide synttiase 


106 


— 


-^11^ 


AK025047 


^a^odiiim"^ 


unnamed protein product 








X92485 


vivax 


pva 


TTU 







W4835 1 


omo sapiens 


BCRK^*^^^* cancer related protein 


185 


46 


6884 


AK024455 


Homo sapiens 


FLJ00047 prwtein 


81 


64 


6885 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


113 


47 


6886 


AFO 14008 




myocardial vascular inhibition factor 


158 


96 


6887 


AF265575 


Homo sapiens 


. ubiquitous TPR-niotif protein Y 
isofonn 


183 


83 


6888 


Y45382 


Homo sapiens 


Human secreted protein fragment 


199 


49 



313 



w o 01/64835 
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SEQ ID 
NO: 


Accession No. 






Smith- 
Waterman 
Score 


% 

Identit 

y 














gggg 


(302827 


Homo sapiens 


Humans^eL^^prokm SEQ ID NO- — 
6908. 


"140 


~67 




AF 130051 




PRO0898 






6891 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 


147 


65 


6892 


X72963 


Nicotiana 


pAP8 product 


80 


44 


6893 


AF073957 


Homo salens 


CXCchemokineBRAK 


399 


82 


6894 


GO380O 


Homo sapiens 


aiman secreted protein, SEQ ID NO. 


160 


80 


~6895 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 

SEQrDNO;163. 


Tso 


"^70 


6896 


Y86248 


Homo sapiens 


SEQ IDNO:163. 


178 


85 


6897 


AP000616 


Orj'za sativa 


RHAla (AF078683) 


l42 


"62 


6898 


AF 11 8082 


Homo sapiens 


PRO1902 


144 


53 




AF 130051 




PRO0898 


l^T 




ggQQ 


G02872 


Homo sapiens 


6953. 




~ 





AF130089 




PRO2550 










HcpaUds^C^vLis 




'W^ 





6903 


AF107406 


Homo sapiens 


GW128 


71 


63 


6904 


W036''7 


Homo sapiens 


Human follicle stimulating hormone 


173 


64 


6905 


AF130089 


Homo sapiens 


PRO2550 


88 


65 


6906 


G03789 


omo sapiens 


■^870*" protein, SEQ ID NO. 


78 


84 


6907 


U93569 


Homo sapiens 


■p40 


145 


48 




Y14482 




encoded by gene 17. 








AF130089 




PRO2550 






6910 


Ml 8093 


Phaseolus 


liydroxyproline-rich glycoprotein 


119 


38 


~69ll 


AF130089 


Hom^'sapiens 


PRQ2550 


139 


"47 


6912 


G00328 


Homo sapiens 


Tiuraan secreted protein, SEQ ID NO; 
4409. 


146 


60 


~69T3 — 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 
sequence. 


131 


~70 




Y86248 




Human secreted protein HCHPF68, 
SEQ IDNO:I63. 


106 




6915 


AF161361 




HSPC098 79 




6916 


Y02693 


Homo sapiens 


Human secreted protein encoded by 207 
gene 44 clone HTDAD22. 


60 


~69T7 — 


AF130051 


Homo a iens 


PRO0898 258 


"61 


"6918 — 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7881. 


108 


~65 


6919 


G03714 




Human secreted protein, SEQ ID NO; 
7795. 


131 






G026j5 


Homo sapient, 


I luman secreted protein, SEQ ID NO: 
6716. 


168 


80 


6921 


AF116661 


Homo sapiens 


PR01438 


71 


48 


6922 


AF090940 


Homo sapiens 


PRO0644 


72 


85 


6923 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


186 


70 


6924 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


89 


55 


6925 


AF229067 


Homo sapiens 


PADI-Hpiwtein 


122 


72 



314 
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SEQID 
NO: 








Smith- 
Watei-nian 


Identit 

y 


6926 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein ^^^-1. — ■ — 


105 


37 


"fiOTT 


G03800 


— -. 

omo sapiens 


^umansecre proem, Q ID 


i 


-— 


"6928 


X56603 


Mus musculus 


mouse 57-KD Calcium-binding protein 
(MCaBP) 


1^3 


~52 




AFl 19851 




PRO 1722 






6930 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


61 


693] 


AF077208 


Homo sapiens 


HSPC022 . 


155 


72 








SEQ ID NO 402 from W09922243 






6933 


M22332 


Homo sapiens 


unknown protein 


154 


77 






Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 






6935 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NOt 

_ : TT->-KT^ 


168 


63 


— — 

■ 


G00376 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
j-j : — Tr>>jn- — 


— 


— 

50 




G02532 


— : 

omo sapiens 


Human secre protein, EQIDNO. 


— 


— 

77 


6938 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


68 


6939 


AFl 18082 


Homo sapiens ■ 


PRO1902 


120 


54 




AF090942 




PRO0657 






6941 


M22332 


Homo sapiens 


unknown protein 


96 


51 


6942 


AF1300S7 


Homo sapiens 


PR02411 


132 


58 




Ml 5530 


Homo sapiens 


B-cell growth factor 






6944 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


52 


6945 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7871. 


73 


66 


6946 


Z10561_cdl 


Homo sapiens 


12-MAR-1998 DNA encoding G 
protein-coupled receptor protein 




96 


6947 


W53869 


Homo sapiens 


Gravin polypeptide fragment, residues 


170 


89 


6948 


W48351 


Homo sapiens 


Human breast cancer related protein 

BCRB2. 


164 


86 


6949 


AFl 18086 


Homo sapiens 


PRO 1992 


117 


86 




AF090928 




PRO0470 






6951 


AF130089 


Homo sapiens 


PRO2550 


124 


69 




Y383 97 


Homo sapiens 


Human secreted protein encoded by 
gene No. 12. 








G02532 


Homo sapiens 


6613. 






6954 


X92485 


vivax*"^'"™ 


pval 


108 


64 


6955 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


117 


85 


6956 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


53 


6957 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6983. 


l06 


"50 


6958 


AF130089 


Homo sapiens 


PRO2550 


127 


67 


6959 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 cbne HNFIU96. 


119 


49 


6960 


J02459 


bacteriophage 
lambda 


U (tail component; 131) 


492 


92 


6961 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


180 


68 
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SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identit 

y 








7214. 











H*^™" ™g 




"Hi 




^ 


AF130089 


omo sapiens 


PRO2550 




~ 


— — 


Ap/erse — 


omo sapiens 




— 

--| 


— 


^ 




Homo sapiens 






~ 

_Z§ 




AB 047934 


Macaca 


hypothetical protem 


'TIS 


58 


— — 




lasciculans 











X92485 

1 


Plasmodium 

vivax 


pval 


— — 

116 


55 




AFl 1 8082 


Homo sapiens 


PRO 1902 


85 


65 


6969 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


63 


. . 


.. 


. 


6908. 






-^^Z^ - 


AFl 16712 


Homo sapiens 


PR0273 8 


101 


.^^ 


6971 


X96667 


Homo sapiens 


beta-galactoside alpha-2,3- 


172 


97 








sialyltransferase 






- 

6972 


. . , ,. 


- . 

Homo sapiens 


Human gene 91 -encoded protein 


150 


82 








fragmentj SEQ ID ]\0:491. 






— — 





— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 




176 


58 








6619. 






6974 


T> 

AFl 62 149 


Mycoplasma 


variable surface lipoprotein 


143 


40 






, 










6975 


W17971 


Homo sapi&ns 


=— J=i = — 

RAG protein kinase C-tenninal binding 




81 














"6976 




AFl 3 0089 


TTm 1 

omo sapiens 


PRO2550 


"^MS 


"78 




AF130089 


omo sapiens 


PRO2550 


~To? 

-i^- 


— 


TOTR 


G0j714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


~ 

75 














"6979 




M60510 


-__ 


beta^D galactosidase fusion protein 


"06 


"47 


^ 


AFl 18086 


n '■ 

omo sapiens 


PRO 1992 


—— 

-iii 


"SI 





Ar 1 juusy 


Homo sapiens 


PRO2550 


"T^i 


4? — 






Homo sapiens 


rwerse transcriptase 


-H* 


-fr 





1193566 


Homo sapiens 




"tIr 


^ 


TqrZ 


Y08062 


Homo sapiens 


^mian PR0245 protein fragment 




78 








Qeriveu irom uinaj jojo. 






"69^- 




-jj2j42S 


"h ' 

omo sapiens 




-— 


-— 

—1 




AF090895 


omo sapiens 


PRO0117 


~r^o 






698 


UUU JZo 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


212 


— ^ 

60 









-4t-~- 7~3 ■ 











— : 

Homo sapiens 


Human secreted protein #75. 


— — 




-Z£ 





AFl 16712 


Homo sapiens 


PR0273 8 


— '■ 


-Z± 


6990 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


63 















— — j 

"^5^5 — 


AF090930 


— : 

Homo sapiens 


rRO0478 


Att 









Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


76 




.. ,. 


... 


^^^7- ...... 








002872 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


151 


48 


-— - — 

















Homo sapiens 


"r-~ J = — 

Human secreted protein, SEQ ID NO: 


-— 

113 


47 















995 




. -. 
Acanth amoeba 


myosin IB heavy diam 


120 


4j 






casteilanii 








"6996 


^^^^^^ 




Human secreted protein SEQ ID NO- — 


~]5l 


~69 








4433. 






6997 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


153 


60 








7867. 






6998 


Y12961 


Homo sapiens 


Amino acid sequence of a human 


114 


66 








secreted peptide. 






6999 


AFl 13685 


Homo sapiens 


PRO0974 


88 


58 


7000 


AF130089 


Homo sapiens 


PRO2550 


146 


58 



316 
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SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 










Waterman 
Srore 


Identit 

y 






^ 




Homo sapiens 


: r-jT^ 

putative pl50 





46 







Homo sapiens 


Human secreted ptoteiB encoded by 


146 


57 








gene 100 clone HNFIU96. 






~TC)m — 


„ 


-. 

Homo sapiens 


enco ™d^byftaie'2*S^ I^NO-2^50 


. 


92 


— 














AF130089 


— ^ 


PRO2550 


■ 


— 


^7005 

— 


AL137472 


Homo ^a^iens 

omo sapiens 


hypothetical protein 


T79 





7006 


Y91577 


Homo sapiens 


Human secreted protein secfuence 


— 


— 

98 








encoded by gene 2 SEQ ID NO''^50 * 






^7007 


G03790 


omo sapiens 


Human secreted protein, SEQ ID NO. 


-— 


— 









— ' 








AFT 9851 


: 




— 


— 


"7009 


AF2 1 8028 


Homo"^]!!!^ 


unknowr 


To4 


"55 


^7010 




Homo sapiens 


PRO^SS"' 


129 


~60 


7011 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 


110 


54 










ID NO' 1051 






^nTo 




Ti -■ 

Homo sapiens 


pot. ORF VI 


— — 

—4- 







^ 

AF090944 


Homo sapiens 


PRO0663 




— 

75 


70 


y02671 


Homo sapiens 


Human secreted protein encoded by 


- 

122 


65 








gene 22 clone HMSJWl 8. 






7015 


M26460 


Homo sapiens 


retinoblastoma 1 


113 


37 


7016 


X92485 


Plasmodium 
vivax 


pval 


112 


56 


7017 


X92485 


Plasmodium 


pval 


144 


57 






vivax 








7018 


AF 13 0079 


Homo sapiens 


PR02852 


224 


74 


7019 


Y02785 


Homo sapiens 


Human secreted protein encoded by 


129 


43 








gene 5 1 clone HUKEX8S. 






7020 


003714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 





58 








7795. 






7021 


Y02886 


Homo sapiens 


Fragment of human secreted protem 


... 

106 


69 








encoded by gene 90. 






"7022 


AF090901 


; 

omo sapiens 


PRO0195 


7(r 


^ 





AFl 16661 






T9n — 


-~ 

Jl 


— — 




Araoidopsis 


gene__id;FlD9.26^uiiknown protein 




60 







— , 










— ^— 


Homo sapiens 


_ 

Human prostate tumor EST xragment 


— ~7 

126 


63 








pp^rv^ooo*^^^^'" 






— — 


AFl 19900 


— : 

Homo sapiens 


PR02822 







_70 





AF130051 


Homo sapiens 


PRO0898 


133 


\^ 


707S 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


179 


68 








derived protein #153. 






"t^sC — 


Y19684 


■ ■ ■ 
Homo sapiens 


SEQ ID NO 402 ixom W09922243. 








G02532 


Homo sapiens 


Human secreted piotem, SEQ ID KG. 


~Vo 


~ 








— — — 














— 

Mus muscu us 


ferritin heavy cham 


— — 




— 


^7032 — 




omo sapiens 


Human secieted protein, SEQ ID NO. 




~ 














'Tcm — 


P70494 


Homo sapiens 


— ' jrr 5 Tj r 

Sequence othuman B-cell growth 


— 

0 


— 

65 








factor (BCGF). 






— 





Homo sapiens 


Human secreted protein ^^28. 







-J 


^7035 


^.gQ^gj 






Ti6 




7036 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


130 


52 








clone HPMBQ32. 






7037 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


55 








6953. 






7038 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


68 








6619. 






7039 


AF130089 


Homo sapiens 


PRO2550 


121 


65 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 




NO: 






Waterman 


Identit 











~ 


J! 


"^SZT — 




AFl 19855 


— : 

Homo sapiens 


— 

PRO 1847 


ttI 


65 




G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO.* 




6-. 








-It^ 3 ■■ — cp^ TT^ T-rr^ 






-— — 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


— 

7 


54 








-It^ H P — SFO TD NO- 






— 


G03789 


Homo sa iens 

omo sapiens 


Human secreted protem, SbQ ID NO. 


— r 

123 


— 

80 














— — — 


G03714 


Homo sapiens 


—J — ^ , 

Human secreted protein, SEQ ID NO; 


4 


~ 

63 















"T^5 — 


G0j786 


Homo sapiens 


Humim secreted protein, SEQ ID NO: 












7867. 








G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 












7881. 






7047 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


100 


57 






. .. 










Homo sapiens 


Human secreted protein, SEQ ID NO; 


.. 

152 


- . 






-— : 








— 

7049 

.... , 





Homo sapiens 


-TT-— '■ J = — - ^_ 

Human secreted protein, SEQ ID NO: 
6908. 


94 


-— 

72 


7050 


AF176323 


Macropus 


small nuclear ribonucleoprotein B' 

1 : =— ; 


103 


41 




AF255446 


Crypthecodinium 


Dipl-associated protem C 


127 








cohnii 








1 

7052 


M64792 


Rattus 


salivary proline-nch protein 


149 


42 






norvegicus 








7053 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


147 


69 








V^^- .......... ... .... ,. 






7054 




G03790 


... 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 




54 


-| 




, 1 








V 


Homo sapiens 


Human secreted protein sequence 


185 


86 








encoded by gene 2 SEQ ID NO:250. 








U60269 


— : 

Homo sapiens 


putative envelope protein; orf similar to 


125 


65 








wd to dass n HERV^^ ^ retroviruses 


















^7057 ' 


AF068294 


omo sapiens 


HDCMB45P 





— 5 


■"TfTR 


Y86248 


omo sapiens 




"Ton 


"0? 








SEQ ID "MO' 163 






7059 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


140 


62 








gene 22 clone HMSJW18 






— 


AF090894 


— : 

Homo sapiens 


rlvVJUl 1-? 


-— 

— 5r 


To 


— — 


AF107406 


Homo sapiens 


GW128 




-|5 


"7062 


W50193 


omo sapiens 




"86 










protein CON-2. 






"7063 


AF119851 


Homosa iens 


PRO 1 722 


Tit 


~74 


"7064 


X92483 


Plasmodi^'m"^ 




T^s 


""66 


7065 


(302532 


Homo sapiens 


Human secreted protein, SEQ ID NO' 


79 


59 














^7066 — 


^^gg^g 


li '' 


pu^ive 


"m 


— 


"7067 — 


AF119851 


H°iiio ^a^'en^ 

omo sapiens 


PRO 1722 


^01 


""69 


„- 


AF130079 




PR02852 


— — 


— j 


T069 


AF119851 


Homo sapiens 


PRO 1 722 


"l50 


""64 


7070 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


138 


48 


7071 


M11759 


Lycopersicon 


cell wall hydroxyproline-rich 


94 


39 






esculentum 


glycoprotein 






7072 


U49973 


Homo sapiens 


ORFl; 1VIER37; putative transposase 


145 


62 








similar to pogo element 






7073 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


44 



318 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 








4478. 




— 


7074 


AF130089 


Homo sapiens 


PRO2550 


146 


60 


7075 


U49974 






198 


83 


7076 


AF236085 


Homo sapiens 


CYP4F11 


427 


79 




AF1300S9 




PRO2550 


159 




7078 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


206 


85 


7079 


D32202 


Homo sapiens 


alpha IC adrenergic receptor isoform 2 


152 


71 








4718. ^ ' ^ 








AB046048 


fascicularis 














7795. 










norvegicus 










G02832 


Homosap.ens 


6913. 






7085 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


131 


64 


"7086 


G03295 


Homo sapiens 


Himan secreted protein SEQ ID NO- — 

7376. 


^1 


"66 








6613. 












7214. ^ 






^595 — 


Lr0802 1 




nicotinamide N-methyltiansferase 










Homo sapiens 




-~ 


'W~ — 


7091 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


56 


7092 


X92485 


Plasmodium 


pval 


125 


60 


7093 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


62 





AFl 18082 


n ' 


PRO 1902 


"T91 





"7095 




Plasmod'^m"^ 

vivax 




^29 


"67 


7096 


Y08062 


omo sapiens 


derived from DNA35638. ^ 


80 


66 


7097 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


162 


67 


7098 


AFl 16715 


Homo sapiens 


PR02829 


123 


63 














7100 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


48 


7101 


AF010144 


Homo sapiens 


neuronal ttiread protein AD7c-NTP 


l04 


"65 


7102 


D85030 


Schizosaccharom 


ribosomal protein S33 homolog 


139 


85 


7103 


G03021 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


177 


77 


7104 


U72543 


Sus scrofa 


ubiquitin-like/S30 ribosomal fusion 


129 


100 


7105 


AC007787 


AA 187-502 


NFI-X3=transcription factor 


190 


84 


7106 


G03793 


Homo sapiens 


^Human secreted protein, SEQ ID NO: 
7874. 


110 


38 


7107 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


157 


56 


7108 


Z69381 


Saccharomyces 
cerevisiae 


Nil 14 


450 


91 



319 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 

-|^^^ 


% 

Wentit 

y 


-jlQg 


V49572 cdl 


Homo sapiens 


13-NOV-1996 Human stomacli cancer 
clone HP10122cDNA#l. 






7110 


W50192 


Homo sapiens 


protein CON-1. 


99 


45 


7111 


AF068294 


Homo sapiens 


HDCMB45P 


132 


55 




AB033020 


Homo sapiens 


KI AA 1194 protein 







TTT5 — 


ABO 19692 




protein kinase PKNbeta 




-~ 


~7TT4 — 


M12140 


l^omosapiens 







^79 


7115 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


88. 


62 


"7TT6 — 


AF124491 


Homo sapiens 


ARJF GTPase-activating protein G1T2 


^215 


Too — 


7117 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


153 


54 


7118 


AF130089 


Homo sapiens 


PRO2550 


184 


64 




AF130079 




PR02852 






7120 


AL3 59782 


Trypanosoma 


possible (}ihv-6) ull02, variant a dna, 


112 


51 


7121 


AF032384 


Xenopus laevis 


metalloprotease-disintegrin 


304 


76 


7122 


KO 1 664 


Drosophila 


Hkni-like protein 


99 


71 


7123 


AJ238706 


Drosophila 
melanogaster 


monocarboxylate transporter 1 

homologue 


161 


38 


7124 


X92485 


Plasmodium 
^'^^ -. 




127 


54 


~7v5 — 


QQ2-j{4 


Homo sapiens 


: T^^XTr^ 

Human secreted protein, SEQ ID NO; 
7795. 


~80 


"65 




AFl 1671'^ 


Homo sapiens 


rtsXJZ/ JO 


118 


P. 


71 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


64 


1128 


G 024 85 


Ho ' 

orao sapiens 


Hiunan secreted protein SEQ ID NO- " 
6566. 








7129 


AFl 18090 


Homo sapiens 


PRO2044 


280 


80 




AFl 290 85 




protein 








AF054840 




tetraspan TM4SF; Tspan-3 






7132 


L27428 


Homo sapiens 


reverse transcriptase 


156 


51 




AKOO 1 647 










7134 


AF130051 


Homo sapiens 


PRO0898 


142 


84 


7135 


X53581 


norvegicus 


ORF4 


213 


46 


7136 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


86 


TTST — 


G02872 


Homo sapiens 


Human secreted protein SEQ ID NO- — 

6953. 


194 


"68 






torridus 










X60592 




CDw40 






TliS — 




Homo sapiens 


7795. 


"T?! 


~S 




M76979 










7142 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


54 


7143 


Y27677 


Homo sapiens 


Human secreted protein encoded by 

gene No. 111. 


323 


90 


7144 


AFl 16712 


Homo sapiens 


PR02738 


132 


60 


7145 


AB024057 


Homo sapiens 


vascular Rab-GAP/TBC-containing 
protein 


121 


45 


7146 


GOB 172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


144 


65 



320 



/;/E:A^¥O01 84835 cpd 
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SEQID 


Accession No. 


Species 


Description 


Smitli 




NO: 









Waterniiin 


Identit 

y 


AH 

7147 


ALj 59782 


Trypanosoma 


-, — ?s ■■ 3 

possible i,hhv-6) ul 102, variant a dna, 


141 


76 






brucei 


complete virion genome. 






— u 

7148 


?r jtt: 

R06400 


Homo sapiens 


Lambda gtl0ch2 encoded C-terminal 


jj7 




85 









alpha amfdating enzyme. 






7149 


S87759 


Homo sapiens 


protein phosphatase 2C alpha, 


427 


78 








PP2CaIpha 






— — 

— — 


]Vfl2140 


— : 

omo sapiens 


envelope protein _ 


— 

~17^ 


— 

_64 


7151 


APO00616 


Oryza sativa 


similar to R1NG-H2 finger protein 


106 


85 








RHAla (AF078683) 






7152 




Lycopersicon 


extensin (class I) 


70 


j8 






esculentum 








"rm — ' 


U49973 


omo sapiens 


• -1 ' *^ ' ^ transposase 


-— 


61 








similar to pogo elernent 






~lT54 

— — 


AF130089 


omo sapiens 


PRO2550 


~M 


~M) 


4t^^ — 


AF 100742 


Homo sapiens 


M-phase phosphoprotein homolog 


-— 

~TI^ 


— 




G025j8 


Homo sapiens 


Fluman secreted protein, SEQ ID NO: 




~ 

78 








6619. 






7157 


AL3901 14 


Leishmania 


probable (iihv-6) ul 102, variant a 


174 


79 








DNA, complete virion genome 











Homo sapiens 


dJ73M23.3 (KJAA0319) 


^^■^ 






AF04 1330 


Bodo saltans 


NADH dehydrogenase subunit 5 


155 


^7.., .... 


7160 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


129 


46 








encoded by gene 17. 






7161 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


150 


53 








7874. 








7162 


- - ... 

G00354 




Flomo sapiens 


Human secreted protein, SEQ ID NO: 


107 


55 








4435. 






716j 


Ml 3 043 


Mus musculus 


alpha- 1 (IV) colleen 








7164 


ABO 11004 


Homo sapiens 


UDP-N-acetylglucosamine 


257 


90 








pyrophosphorylase 








7165 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


- 

111 


47 








gene 22 clone HMSJW18. 






7166 


AFl 19851 


Homo sapiens 


PRO 1722 


279 


61 


7167 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


51 


— 






^}^' ; : -7 ; 






7168 




W73j27 


: 

Homo sapiens 


Bipolar affective disorder associated 


287 


6j 






geuG fsh 16 product. 






75 




— : 

Homo sapiens 


Extended human secreted protein 


... .. . . 

1231 


99 








sequence, SEQ ID NO. 195. 






— — — 

17 




GOO J 65 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


T^T 


— 

76 














"7171 









omo sapiens 


T~i ^ 

ine- reverse anscrip 


-— 


— 






Homo sapiens 


Human secreted protein, SEQ ID NO: 




141 


~ 

64 








■ ■ — 









AF129507 


— : 

omo sapiens 


transcription actor C ^ 


— — 

245 


— 

55 


'TPm 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


56 


— — 






JronV-»rv-7 






4t^^ — 


1 

AF161356 


— : 

Homo sapiens 


HSPC093 


— 

III 


-— 

55 






Homo sapiens 


unknown 


116 




— 

7177 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


52 














— 


G02872 


Homo sapiens 


Human secreted protein SEQ ID NO — 


"94 


""38 








6953. 






7179 


AF130089 


Homo sapiens 


PRO2550 


124 


47 


7180 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


109 


66 








RHAla (AF078683) 






7181 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


65 








7795. 






7182 


Y36156 


Homo sapiens 


Human secreted protein #28. 


127 


62 



321 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 











-y^p 


y 




G02607 


TJ ^ 

Homo sapiens 


~Lf r-j r"= cert Tr^ 

Human secreted protein, ofcC^ ID NOi 




75 








-^^55: 






Ura. — 


X92485 


-— J. 

asmo lum 


pva 


--— 

9 


-_ 

60 


TTss — 


AF130089 


liOTiosaleris 

Homo sapiens 


PRO2550 


TT6 ~ 





"7186 


AF130089 


omo sapiens 


PRO2550 


^37 


"64 


o 




omo sapiens 


zinc finger protein (583 AA) 


— 

"tIt 


— 




188 


X92485 




pval 




50 






vivax ° 








"tTrq — 


G02538 


omo sapiens 


— Tj r"= — — 

rluman secreted ptotein, SiiQ ID NO: 


-—— 


— 

57 


— 
















Homo sapiens 




——5 

4?^ — 


-— 





AFl 19855 


Homo sapiens 


PRO 1 847 




~ 




G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


■ 

57 














— To 


— — j ;— 

M155jO 


— , 

Homo sapiens 


— 7 

B-cell growth factor 


106 




7194 


G0j411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


41 








7492. 






7195 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


60 








. ._ .. .. 






7196 


— - 

G03801 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


94 


68 








_Z552: 








7197 


A < 

U60315 




Molluscum 


MC054L 


116 


36 






contagiosum 












virus subtype 1 








— — — 
7198 




Homo sapiens 


— : . . — 

Human secreted protem, SEQ ID NO: 


— 

1 16 




67 









-p^Ti rrs: 






■^2^^ 


A/11 ^^"3^ 
Ml JJjU 


— : 

Homo sapiens 


B-cell growth fector 


■ 


. 

-|? 


— — 


AF090894 


Homo sapiens 


PROOl 13 


— |i 


_zr 


7201 


G02832 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


160 


57 








^^^^ Fh S = 






"79n9 — 





-— 

omo sapiens 


ragmentof uman secreted protem 




129 


_ 

53 








-|j "° '— — 






— — 

7 Oj 





— : 

Homo sapiens 


Human secreted protem, SEQ ID NO: 


— 

92 




65 














^7204 


AF126163 


Homo a ie 


HHLA3 protein 


"TT 


Tx 


^7205 


X92485 




pval 


Tie 


~42 














^7206 


AF090931 


le 

omo sapiens 


PRO0483 





"R3 




"720^ 


y^^^g^ 


Homo sapiens 


Human secreted protein #28. 


— — 

TT^ 


— 

4? — 






Homo sapiens 


'^"ted*^''* ti^**^"*^^ human 


















7209 


Y36203 


Homosa iens 


Human secreted otein#75 


T24 


"56 — 


4in — 


M69008 


le 

omo sapiens 


. — 1 XIII 11 ' 


"ofi 


~— — 




G0j714 


omo sapiens 


P a- type ""te^^^oTjQ m vtq. — 


Til 


-— — 








^ uman secre e pro in, -i^ 












-r- : 

omo sapiens 


Tf — ' S P — — 


T^^^ 


-— — 








^ uman secrete proem, bQ 






7213 


AF130089 


Homosa iens 


PRO2550 


"163 


""86 





AF130089 


—J —. 

Homo sapiens 


PRO2550 


— -j 


— 




AFl 1 8082 




PRO 1902 


Tso 


~§3 


7216 


Y53663 


Homo sapiens 


A human B-cell growth factor related 


83 


66 








protein (BGFRP). 






7217 


G03714 


Homo sapiens 


niiraan secreted protein, SEQ ID NO: 


141 


68 








7795. 






7218 


AF220264 


Homo sapiens 


MOST-1 


123 


66 


7219 


AF229067 


Homo sapiens 


PADI-H protein 


121 


52 


7220 


AFiisose 


Homo sapiens 


PR01992 


130 


61 


7221 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


179 


100 



322 
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SEQID 


Accession No. 


Species 


Description 


Smith- - 


% 


NO: 








Waterman 


Identit 










Score 


y 









SEQ ID NO' 163 








Y 8624 8 


— : 

Homo sapiens 


HCHPF68, 


-— 

154 




78 








SEQ ID NO: 1 63 . 








Y86248 


Homo sapiens 


?^^^xi^^!^'i^'^'*^^"' HCHPF68, 


00 


95 


— — — 






SEQ ID NO:lo3. 






^§25 


^2^^2i 


Homo sapiens 


reverse transcnptase 


"tIt 


- 

55 




AB008227 


Adiantum 


Extensin 




43 







capi us-veneris — 











j-^^g^^^ 


Homo sapiens 


— T — r r-j r — ^- 

cytochronie c oxidase subunit 1 


— — 

113 


— 

78 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


153 


96 








SEQ ID NO: 1 63. 








Y86248 


-~ : 

Homo sapiens 


Human secreted protein HCHPF68j 


112 


74 









SEQ ID NO* 163 









^^^^^^ 


omo sapiens 


B-cell growth factor 


— 

-iiZ 


— 

-Z^ 




AF004341 


omo sapiens 


cytochrome c oxidase subunit III 




87 


^Tori 


G02872 


omo sapiens 


Human secreted protein, SEQ LD NO: 




46 








dKi/- > 








AP000616 


— -. ' 

ryza sativa 


similar to R1NG-H2 linger protein 


— — 

123 


— 

90 








RHAla (AF078683) 






"7233 


^^g^gg 


Mus musculus 




T05 


— 


7234 


M15530 


Homo sapiens 


B-cell growth fector 


113 


61 





AF 194537 


Homo sapiens 


NAG 13 


249 


65 


-^^1^ 





Homo sapiens 


Gly(5) beta-globin mutant. 


296 


80 






Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


50 








-3 . 






"7238 


yjg^^g 


omo sapiens 


>J ATlPI H^rV ^ " t' ■ 


— 




— 

"It 


"irw 

■ 


D381 12 


omo sapiens 


— e y rogenase subunit 3 




_2i 





G03800 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

,~^^-}' i 




122 


70 


7241 


A POOOo 1 6 


; 

Oryza sativa 


similar to RING-H2 finger protein 


. 

123 




90 








KrlAia \J\r\} loOaj) 






7242 


AF090942 


Homo sapiens 


PRO0657 


118 


47 


7243 


AFl 19851 


Homo sapiens 


PRO 1722 


112 


62 


7244 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


100 


53 


7-45 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


144 


66 








encoded by gene 17. 






7246 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


78 


— — 


- 




-^222: 






7248 




— 

Plasmodium 
vivax 


pval 


108 


58 


7249 


G02532 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


170 


74 









: 











-J- : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 




153, 


70 


— — 






r^^" 






7 





— 

Trypanosoma 




93 


39 






bmcei ^ 








^7252 


Y02886 


omo sapiens 


Fra°-ment of human secreted rotein 


TTs 


— 

54 








encoded by gene 90. 






^79?^^ 


- ^ 


omo sapiens 


Human secreted protein, SEQ ID NO: 


110 


83 









7554. 









Ar010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


167 




66 









Human secreted protein, SEQ ID NO: 


179 










6613. 






7256 


AF130079 


Homo sapiens 


PR02852 


147 


70 


7257 


AFl 18082 


Homo sapiens 


PRO 1902 1 80 


45 


7258 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


55 








6613. 






7259 


W48351 


Homo sapiens 


Human breast cancer related protein 


90 


65 








BCRB2. 







323 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Wateiraan 


Identit 





X92485 


Plasmodium 


"pval 

pva 





"Is — 




G03798 




7879. 








AF130089 




PRO2550 






7263 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone FfMSJWlS 


114 


60 


^7264 


AFl 16661 


Homo sapiens 


PRO 143 8 


~r9 


~67 


7265 


AF 104021 


Bovine viral 
diarrhea virus-2 — 


polyprotein 


161 


40 






Leishmania 


inn— Pin: — ^ ■ 

probable (hhv-6) ul 102, variant a. 

DNA, complete virion genome 


116 


4j 


7267 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


112 


62 


7268 


M15530 


Homo sapiens 


B-cell growth factor 


154 


60 


7269 


G03j56 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


54 


7270 


AFl 18082 


Homo sapiens 


PRO1902 


139 


62 






Mus sp. 


orfl 5' of EpoR 






7272 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


50 


7273 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


61 


7274 


AF130089 


Homo sapiens 


PRO2550 


124 


43 


7275 


X92485 


Plasmodium 


pval 


147 


44 


7276 


Q00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

.. .. 


81 


59 


7277 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


67 


7278 




G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


67 




AFl 30087 


"H '■ 


PR024 1 1 


^ 




~728U 


Y40039 


Homo sapiens 

omo sapiens 


human secreted protein. 


"TO 


69 














7282 


AFl 16661 


Homo sapiens 


PR01438 


117 


48 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 








A 171 1 

Ar 1 10 / ID 


Homo sapiens 


PR02829 






7286 


AFl 30079 


Homo sapiens 


PR02852 


122 


52 






Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 










Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 






7290 


W48351 


Homo sapiens 


Human breast cancer related protein 


146 


59 


7291 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


58 


^7292 


G02532 


Homo sapiens 


Human secreted protein SBQIDNO — 

6613. 


167 


"61 


7293 


X92485 


Plasmodium 
vivax 


pval 


122 


71 


7294 


AF130089 


Homo sapiens 


PRO2550 


87 


62 


7295 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


140 


50 


7296 


X92485 


Plasmodium 


pval 


152 


75 



324 
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SEQ ID 

NO: 




Species 


Description 


Smith- 
Waterman 
Score 


Identit 














7297 


AF090894 


Homo sapiens 


PRODI 13 


172 


65 




G00637 


Homo sapiens 


4718. 








AL3 59782 


brucei 


possible (hhv-6} ul 1 02, variant a dna, 
complete virion genome. 






7j00 


G 02827 


Homo sapiens 


^908^ protein, SEQ ID NO: 


228 


80 


7301 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


73 


7302 


U43360 


Peromyscus 


reverse transcriptase 


119 


46 


7303 


G 02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


76 


7304 


U58658 


Homo sapiens 


unknown 


102 


48 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 






7306 
— — — 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


63 




G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


149 


58 


"7308 


G00637 


Homo sapTelis' — 


~Buman secreted protein SEQ ID NO" — 
4718. 


"Tp 


-j^ 




G03263 




7344. 








M15530 










7311 


AFl 18082 


Homo sapiens 


PRO1902 


168 


46 


~^JI5 — 


^^g^^'j^ 


Homo sapiens 


CoxII/D-loop DNA fiision protein 












sapiens 




_3 


2^ — 


^7314 — 


W88627 


omo sapiens 


s'^reted protefa^enrod^b^ 94 

clone I-IPMDQ32. 




59 


7315 


AF289099 


Maackia 


ENOD2f 


107 


32 


7316 


U 17009 


Phytophthora 


SecY-independent transporter protein 


135 


41 


7317 


AF220264 


Homo sapiens 


MOST-I 


133 


72 


7318 


U49973 


Homo sapiens 


ORF2: function unlaiown 


111 


53 


-^1^^ — 


V 00 662 


Macaca^^'^"^ 


URF 2 (NADH dehydrogenase subunit) 


79* 


^6 




AB047600 


fascicularis 


nypofnetical proteui 






7321 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

SEQ ID NO 163 


205 


95 


7322 


AC005175 


Homo sapiens 


TA2R_HUMAN, BETA ISOFORM; 
TXA2-R' PROSTANOID TP 
RECEPTOR 


127 


57 




R48296 




Human PGF-2/NT-3. 






7324 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI 8 


157 


60 


7325 


AFl 19855 


Homo sapiens 


PRO 1847 


172 


66 




W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 








G02532 




6613. 






7328 


Y40039 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


98 


56 


7329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7870. 


161 


64 


7330 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


111 


72 



325 
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SEQID 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Hentit 

y 


^7331 


AF126163 


mo sa iens 

omo ^^P|™^ 


HHLA3 protein 


■ 




^7532 


G03258 


omo sapjens 


7339. 


T22 " 


~ 


7333 


AF130079 


Homo sapiens 


PR02852 


136 


54 


~f§^ 


AK023563 


H°mo sapiens 










G02532 


omo sapiens 


Human secreted protein SEQ ID NO* 

6613. 


TT5 


— 


7336 


Ml 5530 


Homo sapiens 


B-cell growth factor 


136 


~75 


7337 


D38112 


Homo sapiens 




168 


"81 


7338 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


339 


80 




Y86248 




Human secreted protein HCHPF68 
SEQ ID NO: 163. 












fielivclrocrenase subunit 3 






7341 


L00016 


Homo sapiens 


uif4 


295 


72 








N ADH dehydrogenase subunit 3 






7343 


M36647 


Homo sapiens 


mitochondrial lilnge protein precursor 


286 


82 


7344 


X15917 


Paramecium 

^'^^ : 


WD2 protein (AA 1-193) (unusual start 
codon) 


1^2 


34 


: 


■ 


omo sapiens 


SEQ ID NO:163!' 





-— 


"7346 


Y 1/^,^2,2 


omo sapiens 


e^^ed by^gen™l7 protein 


— 


— 


7347 


Y86248 


Homo sa iens 

omo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


T33 


~6Q 


7348 


D381 12 


Homo sapiens 




-_ 


Ti 


7349 


D38112 


Homo sapiens 


cytocbomTc Kdda^ 


"347 


~83 


7350 


V00662 


Homo sapiens 


cytochrome oxidase 1 


325 


85 


7351 


D38112 










7352 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


224 


79 


7353 


D38112 




NADH dehydrogenase subunit 1 


270 


86 


7354 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


167 


90 


7355 


D38113 


Pan troglodytes 




120 




7356 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


362 


84 








NADH dehy drosenase subunit 3 


4i5 — 




7358 


D38112 


Homo sapiens 


cvtoctoome^b'^°^^"~^~^~~ 




~ 


7359 


U09500 






T75 


~80 


7360 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


"209 


^73 


7361 


AFl 18078 




PRO 1848 


~95 


^8 


7362 


AB017116 


' iHomosapiens — " 


AD 1 


148 


^9 


7363 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


146 


42 


7364 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


203 


^82 


7365 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


206 


84 


7366 












7367 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


"IJB 


~ 


7368 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


481 ^ 


85 




X86031 




antibacterial protein 






^571 — 


AB017116 




A ■ xir^xmjico 





-|1 




Y86248 


Ho'^o ^^^-^"^ 

omo sapiens 


Human secreted protein HCHPFoS, 
SEQ ID NO: 163. 






7372 


AF118082 


Homo sapiens 


PRO 1902 


122 


54 


7373 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4573. 


163 


62 


7374 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


53 


7375 


AF090894 


Homo sapiens 


PRO0113 


128 


64 


7376 


AK000241 


Homo sapiens 


unnamed protein product 


131 


67 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smifh- 

"flf^ 


Identit 


^7377 


AFl 19855 


Homo sapiens 


PRO 1847 




"62 


7379 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


114 


37 






norvegicus 




















7382 


Y02671 


Homo sapiens 


HumMf^creted protein encoded by ' 

gene 22 clone HMSJW18. 


-~ 


— 


-^111 


AB040938 




KIAA1505 protein 








G03790 


Homo sapiens 


7871. ^ 


"155 


-~ 




Y86248 


Homo sapiens 


SEQID NO: les!* 






7386 


U63295 
£003 499 


BFos^hUr'*^ 

melanogaster 









CG 12706 gene product 









X03342 


Homo sapiens 








7389 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


58 


7390 




Y 86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO* 163 


154 


100 


AKO'^6800 


— : 

Homo sapiens 


unnamed protein product 


— — 

-iiZ 


-— 

52 


^7392 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


17 


73 


7393 


G04072 


omo sapiens 


^umansecr e proem, Q 


— 


— 


"7394 


Y86248 


„ — i 

omo sapiens 


Pluman secreted protein HCHPF68, 
SEQ ID NO: 163. 


161 


80 


7395 


Ml 5530 


Honio sapiens 




"98 


~57 


7396 


L 10908 


Mus musculus 


Gcapl gene product 


105 


51 


7"398~ 


AFl 19851 
1(99256 




PR01722 






Hylobates lar 


cytochrome oxidase subunit III 


319 


73 














7400 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


399 
170 


80 


7401 


L78671 


Homo sapiens 


CoxIl/D-loop DNA fusion protein 






Homo sapiens 


NADH dehydrogenase subunit 3 








"7555 — 


Y86248 


Homo sapiens 


SEQ ID NO: 163. 






7404 




Homo sapiens 


Human secreted protein, SEQ ID NOr 


115 


40 


"7405 


D83006 


Saccharomyces — 


gene required for phosphoylation of 

oligosaccharides/ has high homology 
with YIR061W 


Tti 


— 


"7406 


AF144636 


Callus alius 




los 


"44 


"^7407 


G02532 


Hom"o salens 


Human secreted protein SEQ ID NO" 
6613. 


"l66 


~io — 














7409 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


140 


87 


7410 


AP000616 


Oryza sativa 


RHAla (AF078683) 


111 


59 


7411 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


138 


82 










"ll^ 


"li 


7413 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 






7414 


AB036396 


nigromaculata 


cytochrome b 


109 


87 


7415 


AF090930 


Homo sapiens 


PRO0478 


121 


76 


7416 


B03148 


Homo sapiens 


Human neuronal differentiation fector- 
1 (NDF-1). 


171 


96 


7417 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


446 


91 
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SEQID 
NO: 


Accession No. 




Description 


Smitti- 
Watcrman 


Identit 

y 


"74T8 


AB018440 


Echinococcus 

multiioculads 


"NADHddi dro enase subunit 

e y ogenasesu uni _ 







7419 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


123 


66 


7420 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


116 


50 


7421 


AF 155232 


Pisum sativum 




128 


"37 


7422 


AF161356 


Homo sapiens 


HSPC093 


89 


48 






Por^o pygmaeus 




~tIt 





"^7424 


Y86248 


omo sapiens 


flumansecreted^l^^in^^^ 

SEQ ID NO; 163. 






7425 


AF275953 


synthetic 
construct 


Cox8ND6gfp fosion protein 


214 


93 


7426 


D38112 


Homo sapiens 


cvtodfirome c oxidase subunit 1 


T56 


~73 


7427 


G00637 


Homo sapiens 


Human secreted protein SEQ ID NO" — 
4718. 


Uo 


~59 


7428 


AK024455 


Homo sapiens 


FLJ00047 protein 






7429 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


212 


82 




S79H0 Mus musculus 


protein 








D38112 


Homo sapiens 








7432 


D381I2 


Homo sapiens 


cytochrome c oxidase subunit 1 


217 


84 




U87145 


Toxoplasma 








7434 


Y86248 


Homo sapiens 


Human secr6t6d protein HCHPF68, 
SEQ ID NO' 163 


213 


72 


7435 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107 


135 


57 


7436 


G03807 


Homo sapiens 


7888. 


HI 


— 




Y86248 


Homo sapiens 


SEQ ID NO: 163. 


190 


88 


7438 


D38I12 


Homo sapiens 


NADH dehydrogenase subunit 1 


141 


78 


7439 


AF250284 


Amsacta moorei 
entomopoxvirus 


AMV012 


88 


31 


7440 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


149 


85 




U43360 


Peromyscus 
manicuJatus 








7442 


AF275807 


Homo sapiens 


PNAS-110 






7443 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


126 


56 


7444 


D38112 


Homo sapiens 




~475 


"^5 


7445 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO' 

6953. 


146 


"ri 


7446 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7867. 


130 


60 


7447 


X92485 


Plasmodium 
vivax 


pval 


153 


45 


7448 


AFl 19851 


Homo sapiens 


PR01722 


96 


55 




AF130079 


Homo sapiens 


PRQ2852 


153 




"7550 — 


AF119851 


Homo sapiens 


PR01722 


238 


"55 


7451 


AL451017 


Neurospora 


related to Ul SMALL NUCLEAR j 113 
RIBONUCLEOPROTEIN C j 


18 


7452 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


147 


57 


7453 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


136 


63 


7454 


R14163 


Homo sapiens 


Cellular DNA-binding protein RS 1 . 


270 


42 


7455 


X63526 


Homo sapiens 


homologue to elongation factor 1- 


387 


80 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Waterman 


Identit 










gamma from A.salina 






7456 


G03I72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


140 


80 


7457 


AFl 19851 


Homo sapiens 


PRO 1722 


102 • 


55 


7458 


G02756 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6S37. 


83 




7459 


AFl 19851 


Homo sapiens 


PRO 1722 


132 


64 


7460 


L10908 


Mus musculus 


Gcapl gene product 


106 


64 


7461 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6913. ' 1 


124 




7462 


X77816 


norvegicus 


PR-Vbetal 


142 




7463 


Y34068 




Histone HI isofonn HI. 5 pANCA- 
reactive fragment (residues 69-226). 






7464 


AB046048 


fascicularis 


unnamed portein product 


302 




7465 


X92485 


vivax 


. 

P a 


TTs 


~47 


7466 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

6619. 


116 


65 


7467 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


91 


35 


7468 


AF161361 


Homo sapiens 


HSPC098 


92 


57 


7469 


AF216389 


Homo sapiens 


semaphorin Rs 


288 


88 


7470 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


153 


73 




AF1300S9 




PRO2550 






7472 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


159 


' 58 


7473 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


115 




7474 


X61296 


Rattus 
norvegicus 


open reading frame 2 




49 


7475 


U22376 


Homo sapiens 


13A 


165 




7476 


AF119851 


Homo sapiens 


PR01722 


133 


46 


7477 


AL359782 


Trypanosoma 


possible (lihv-6) u 1 1 02, variant a dna, 
complete virion genome. 


127 


69 


7478 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 






7479 


S79410 


IVlus musculus 


nuclear localization signal binding 

protein 


122 


59 


7480 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


168 


84 


7481 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 




80 


7482 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6292. 


122 


56 


7483 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7874. 


132 


37 


7484 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


7485 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


277 


72 


7486 


M29294 


Rattus 
norvegicus 


small nuclear ribonucleoparticle- 
associated protein 


132 


42 


7487 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHiVrSJW18. 


140 


50 


7488 


AF090901 


Homo sapiens 


PRO0195 


101 


62 


7489 


X97249 


Homo sapiens 


leucine-rich primary response protein 


304 


93 


7490 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


447 


84 


7491 


AK024455 


Homo sapiens 


ELJ00047 protein 


122 


63 


7492 


U87269 


Homo sapiens 


pI20E4F transcription factor 244 


66 


7493 


A3 1036 


Niootiana alata 


PRP2 1 87 


31 



329 
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SEQID 
NO: 




Species 


Description 


Smith- 
Waterman 


Ideiitit 


7494 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


206 


97 


7495 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


166 


91 


7496 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


161 


66 


7497 


AK024455 


Homo sapiens 


FLJ00047 protein 


67 


61 




Y86248 




SEQ ID NO:163. 












antibacterial peptides 






7500 


Y86248 


Homo sapiens 


SEQ ID NO: 163. 


166 


89 


"tJoI — 


Y27868 


Horn — T^en 

omo sapiens 


gene No 107. 


Ht 


"61 


7502 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


213 


60 


7503 


AB042770 


Pteropus 


cytochrome c oxidase subunit 3 


169 


89 


7504 


AF126163 


Homo^apiens 


HHLA3 protein 


TTo 


"62 


7505 


D38112 


Homo sapiens 


c)^ochrome c oxidase subunit 3 


299 


90 














7507 


W48351 


Homo sapiens 


Human breast cancer related protein 


128 


72 


"7508 


Y 8624 8 


Homo sapiens 


Hum^secreted protein HCHPF68 

SEQ ID NO:163. 


162 


"86 














7510 


L3S90S 


Nicotiana 


extensin 


114 


36 


"7512 





Homosapiens 


ORFl- MER37- putative transposase 

similar to pogo element 


"273 


"73 


-^lll 




Homo sapiens 


Human neuroMl dOTereirtfa^^ factor — 


4It — 






303148 


omo sapiens 


1 (NDF-I). 




~ 


"7515 


Y86248 


Homosa iens 

omo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63. 


TTo 


~46 


7516 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


170 


84 


7517 


G03263 




Human secreted protein, SHQ ID NO: 
7344. 






7518 












7519 


D38114 


Gorilto^gorilla 


NADH dehydrogenase subunit 3 (ND3) 


141 


84 


7520 


U12690 


Homo sapiens 








7521 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


112 


71 




AL359782 




protein Aa™^'^*° ™g-h2 finger 


96 


46 


7523 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


142 


57 


7524 


AF220264 


Homo sapiens 


MOST-I 


86 


62 




AF130089 




PRO2550 






7526 


G003S1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


79 


7527 


U49973 


Homo sapiens 


ORF2- function unknown 


105 


68 


7528 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


122 


62 


7529 


X05831 


Rattus 
norvegicus 


fibronectin precursor 


100 


64 


7530 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


175 


60 


7531 


X92485 


Plasmodium 




120 


59 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 






vivax 




















7533 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


321 


67 


7534 


AL359782 




TOmptete^irion^^mme^^^' ^ 


126 


72 


7535 


AFl 19851 


Homo sapiens 


PR01722 


110 


67 


"TSp — 


^^^^j^ 






"T?^ 


^5 






Homo sapiens 


HumM secrete*d iKX>te^ 1d NO- — 
7795. 








X92485 


vivax 








"^^ZS — 


AFl 19900 


Homo sapiens 


PRO^ 822 





-t^ 




G02532 


omo sapiens 


Huinaii secreied protein SEQ ID NO' 






7541 


M15530 


Homo sapiens 


B-cell growth factor 


136 


53 


7542 


X53581 


Rattus 


ORF4 


174 


46 


7543 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


138 


74 


"7544 


Yy2^^l2 


Horoo sapiens 




TT6 


~86 


7545 


M15530 


Homo sapiens 


B-cell growth factor 


96 


47 


7546 


X92485 


Plasmodium 


pval 


149 


58 


~7547 


AF113685 


'w ~ 


PRO0974 


— r 


-— 

"^ 


^7548 — ~ 


U49973 


Homo sapiens 


similar io pogo element 


"226 












-1^ 1 


-~| 


"7555 — 


S79410 


"MiTmu^ulus 


"^I^I^locaHz^^ signal binding 

protein 








AF130089 




PRO2550 






7552 


R95913 


Homo sapiens 


Neural thread protein. 


99 


56 














7554 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


75 


7555 


L78671 


Homo sapiens 


CoxII/D-loop DNA llision protein 


207 


56 


7556 




G03714 


Homo sapiens 


Human secreted protein, SEQ ID NOi 

Z7^^' —3 




59 




G02832 


"h '' 

orao sapiens 


6913™^^'^'* 


—— 


— 


7558 


Y86445 


Homo sapiens 


Human gene 42-encoded protein 
fragment, SEQ ID NO:360. 


95 


71 


7559 


X83413 


Human 


U88 


108 


32 


7560 


A18812 


Brassica napus 


extensin 


125 


38 


7561 


D381I2 


Homo sapiens 


ATPase subunit 6 


495 


87 


"75^5 — 


AF130089 


Homo sapiens 


PRCSSO 









AF265575 


omo sapiens 


ubiquitous TPR-motif protein Y 




~ 


^7564 


G00361 


Homo a iens 

omo sapiens 


H*^ °™ - t d te — SEO ID NO" — 


128 


— 


^7565 


^gQjgj 


Caeno h bdif 

ele^ans ^ ' 


C ele°ans collagen dpy 10 gene (Levy 
A.D., Yang, J. and Kramer, J.M. Mol. 
Biol Cell 4^803-17, 1993)' 


"ToT 


"tt 


7566 


V00672 


Pan troglodytes 


reading frame protein 4 


294 


73 


7567 


A18812 


Brassica napus 


extensin 


145 


48 


7568 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


241 


69 


7569 


LOGO 16 


Homo sapiens 


urf4 


284 


71 


7570 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


115 


50 


7571 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


65 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 


■ ■ 








-— — : J : 


— — 









— : 

Homo sapiens 


4433^° secrete protem, fcQ ID NO. 




— 

65 


7573 


X92485 


Plasmodium 


pval 


157 


49 


7574 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 
348 


54 


7575 


AFl 30089 


Homo sapiens 


PRO2550 


70 














1511 


L26953 


Homo sapiens 


chromosomal protein 


148 


38 






Homo sapiens 


Human secreted protein, SEQ ID NOr 
6913. 








AB047948 


Macaca 
fascicularis 


hypoflietical protein 








G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7882. 








003263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 










Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 






7583 


AC004079 


Homo sapiens 


liuman HOXA3 ; 95% similarity to 
e307530 (PID.*gl888441) 


127 


100 


"7584 


Y86248 


Homosa iens 

omo sapiens 


SEQIDNO;163^ 


Tfi 


Ti 




Y86248 


Homo sapiens 


SEQIDNO:163.^ 








Y 86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 








D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 






7588 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


115 


80 




Y 8 6248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 






7590 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


112 


60 






Homo sapiens 


Human secreted protein, SEQ ID NO; 

7795. 








A49807 


Neisseria 
meningitidis 


IGA-LPHA3 










Caenorliabditis 
elegans 


contains similarity to Pfam domain: 
PF01484 (Nematode cuticle collagen 
N-terminal domain}, Score=27.5, E- 
value=0.0001,N=l 










Arabidopsis 
thaliana 


extenstn 












encoded by gene 3. 








W4835 1 




BCRB2. 








Y86248 




SEQIDNO:163!' 






7598 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


148 


96 


7599 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


117 


95 


7600 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


192 


85 


7601 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


91 


7602 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


333 


71 



332 



SEQID Accession No. 
NO: ! 


Species 


Description 


Waterman 
Score 


y 


7603 ; D38112 










7604 I U09500 


Homo sapiens 


cytochrome b 


279 


93 


7605 


214016 


tabacum 


pistil extensin like protein, partial CDS 


84 


40 


7606 


G0380] 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7882. 


To8 


"ri 


7607 


G03636 




7717. 






7608 


G03714 




Human secreted protein, SEQ ID NO: 
7795. 






7609 1 Y19767 




SEQ ID NO 485 from W09922243. 






7610 1 D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


434 


87 


7611 


Y21418 




HMGI-C mutant fr^ment 2.^ 






7613 


X92485 


Plasmodium 


pval 


106 


58 


7614 


AC003O58 


Arabidopsis 

^ '^'^^ ■ 


unknown protein 


191 


88 


7615 


S74221 


omo sapiens 


:rr- — j r"= 

IK. iactoi=cytokine down-regulating 

HLA class II 


-— 


— 

65 


7616 


U28971 


elegms ^ ' 


RD protein^udK^ma 
protein) 


-— 





7617 


G04078 




8159. ^ 






7618 


L35013 










7619 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


441 


85 


7620 


AF116638 




PRO 1546 






7621 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


146 


46 


7622 


G03714 




7795. ^ EQIDNO. 






7623 


G02538 




6619. 






7624 


U41031 


eleganf^'''*'*'^ 


proline-rich 


92 


59 


7625 


G03453 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


130 


55 


7626 


L 19098 


Clioristoneura 


cytochrome oxidase I 


102 


66 


7627 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


99 


76 


7628 


AF126163 


Homo sapiens 


HHLA3 protein 


117 


67 


7629 


G00361 


Homo sapiens 


4442. 


69 


53 


7630 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO" 

7870. 


"83 


-j^ 


7631 


G00377 


Homo sapiens 


4458. ^ 


87 


64 


7632 


Y27907 


Homo sapiens 


gene No. 1 19. 


"120 


~48 


7633 


R95913 


Homo sapiens 


Neural thread protein. 


126 


61 


7634 


V00662 


Homo sapiens 


cytochrome oxidase I 


396 


85 


7635 


¥86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


205 


84 


7636 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


464 


85 


7637 
7638 


D38112 
AFl 18086 


Homo sapiens 


NADH dehydrogenase subunit 2 


165 


94 


Homo sapiens 


PRO 1992 


120 


68 


7639 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


176 


84 



333 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 


% 

Identit 

y 


7640 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 

clone HPMBQ32. 


95 


:y9 


7641 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


143 


68 


7642 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


192 


91 


7643 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


416 


80 


7644 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 






7645 


U93570 


Homo sapiens 


putative pi 50 


206 


42 


7646 


D38112 










7647 1 AF090942 


Homo sapiens 


PRO0657 


128 


71 


7648 


G02493 




6574. P in, EQ 






7649 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


115 


52 


7650 


Y36366 




Fragment of human secreted protein 
encoded by gene 3. 






7651 


X71442 


norvegicus 


ORF 1; putative 


105 


52 


7652 


L07545 


tarentolae 


A 'c* was inserted after nt 369 

1 0459 in genomic sequence (M10126)) 

to correct -1 firameshift probably due to 

Hu'marsecreted protein SEQ ID NO- — 






7653 


G02532 


Homo sapiens 


6613, 


~89 




~53 


7654 


G02473 




6554. 


7655 


G02538 




Human secretec; protein, SEQ ID NO: 
6619, 






7656 ' D38112 










7657 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


367 


80 


7658 


Y86248 




SEQIDN0:163''^ 






7659 


AL359782 




protein^rha^'^^ ^ ring-h2 finger 


109 


50 


7660 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


112 


77 


7661 


AP002460 


■ , . , ^ 

thaliana^^'^ 


gene_id.FlD9.26~unknown protein 


Tn 


~40 


7662 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


137 


96 


7663 


AF2 17973 


Homo sapiens 


unknown 


l24 


"31 


7664 


X92485 


Plasmodium 




TTs 


"59 


7665 


AFl 18082 


Homo sapiens 


PRO 1902 


123 




7666 


AB014563 


Homo sapiens 


KIAA0663 protein 


267 


90 


7667 


G00637 




Human secreted protein, SEQ ID NO' 
4718, 






7668 


AF119855 




PRO 1 847 







7669 


AK025n6 


Homo sapiens 




~w 




7670 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


55 


7671 


R14847 


Homo sapiens 


Protein associated with biochemical 

pathway involving cAMP. 


164 


89 


7672 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795, 


93 


68 


7673 


U37426 


Homo sapiens 


kinesin-like spindle protein HKSP 


247 


81 


7674 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


125 


75 


7675 


S67633 


Homo sapiens 


class I major histocompatibility antigen 


103 


59 
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SEQID 


Accession No. 


Species 


Description 


Smitli- 




NO: 








Watcrman 


Identit 


7676 


AF126163 


Homo sapiens 


HHLA3 protein 


146 


~2 




AC005192 


Ho 0 sa iens 

omo sapiens 




143 


~ 








protein PC4; 96% identical to P19182 












(PID;gl35861) 






~7678 


X07882 


Homosa iens 


Po protein 


T26 


39 


7679 


Y 14482 


Homo sapiens 




148 


~ 








encoded by gene 17. 






7680 


AJ242540 


Volvox carteri F — 




~V71 








uagariensis 


HRGP 






~768l 


AF090930 




PRO0478 


~80 


~5S 


7682 


X55654 


Homo sapiens 

MacTcr^'^"^ 




TTs 


"65 


7683 


AB046048 




unnaBeTportek'p^^ 

P 


145 


"67 






fascicularis 






7684 


AFl 30079 


Homo sapiens 


PR02852 


1 17 


60 


7685 


M13101 




unknown protein 










norveoicus 








7686 


Y01158 


Homo°sa iens 

mo sapiens 


Secreted protein encoded by °ene ! 8 


134 


14 








clone HCACJ81. 






7687 


AF126163 


Homo sapiens 


HHLA3 protein 


141 


^84 







Homo sa ic 

omo sapiun 


CC AAT box^'btidtatf ^ 


jtH 


"61 




M37197 






"07^ 


- ~ - 
"H 


^7690 


AL390114 


Q'^" ^^P'^s 

eis mama 


probable (hhv-6) ul 102 variant a 










l?^-'"'^ : 


P^^' virion genome 









AF220264 


omo sapiens 


— wrr~\ — p ' 


TTt 


-=y 

4t — 


^7^692 


AF063866 


san ^utd es 


.J ypo eica proem 


'W' 




















t 

_^ — opo — 








"7693 


G00357 


omo sapiens 


"h tf^ — V' — tpn rn Njn- — 


""ox 


-— — 








^umansecre proem, Q N . 






"7694 


AFl 18082 


^ 

H°m°^a'^ien^ 


PRO 1902 





— 


"7695 




omo sapiens 


Hmwan secreted protein, SEQ ID NO. 


-~ 


~ — 




















Ivf i 

us muscu us 


— r? — r '■ — t — 


-TT=; 


-— — — 








regfon'c^^ ''^^ ^^"^ ^'"""^ 






"7697 


Y64890 


-Yj = ' 

omo sapiens 


Human 5' EST related polypeptide SEQ 


"r^2 


— 








IDNO:1051. 






7698 


yQj^gg 


Homosa iens 




"155 


"77 — 








clone HCACJ81. 






1699 


Ml 5530 


Homosa iens 


Humaif seCTrte^'^rotein SEQ ID NO- — 


106 


"79 — 




G03790 


Homo sa iens 

omo sapiens 




"To8 


10 — 








7871 P ' V 






7701 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


62 














7702 


AF090931 


Homo sapiens 


PRO0483 


125 


68 


7703 


G03432 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


79 








7513. 






7704 


G02532 


Homosa iens 


Human secreted protein, SEQ ID NO: 


T75 


"63 














^705 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


14\ 


~35 


T706 


G02532 


Homosa iens 

omo sapiens 


Human secreted protein, SEQ ID NO' 


"149 


"11 








6613. 






7707 


X92485 


Plasmodium 




126 


42 


7708 


AK023563 


Homo sapiens 


unnamed protein product 


217 


72 


7709 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


124 


58 








RHAla(AF078683) 






7710 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


124 


56 








gene No. 107. 






7711 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


62 



335 
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SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 
Jcore 


Identit 

y 


— — 









— — — 




'^rrn 


^^gj^^ 


Homo sapiens 


' , ■ . r r-; 

NADH dehydrogenase subunit 1 


"T^Z 




J72 


_Z 1 


D381 12 


Homo sapiens 


cytochronie c oxidase subunit 1 




-ZZ. 


7714 


X92485 


Plasmodium 


pval 


123 


68 








_wmx ^ 












Homo sapiens 


— : 

Human secreted proteinj SEQ ID NO: 


1j7 




48 













-— 




77 


AFl 19851 


— : 

Homo sapiens 


PRO 1722 




— 

60 


7717 


U9j570 


Homo sapiens 


putative pi 50 


133 


35 


7718 


AF116712 


Homo sapiens 


PR02738 


94 


68 


7719 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


107 


55 








terminal, clone XEH.Sc} 






7720 


U49973 


Homo sapiens 


ORF2: function unknown 


200 


85 


7721 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 


234 


73 








similar to pogo element 






7722 


G00397 


Homo sapiens 


Human secretsd protein, SEQ ID NO; 


155 


76 








4478. 






7723 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


65 








6953. 






7724 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


70 


73 








gene 22 clone HMSJW18. 






7725 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


55 








7870. 






7726 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


130 


75 








terminal, clone XEH.Sc) 






7727 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


94 


78 








encoded by gene 90. 






7728 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


55 








7881. 






7729 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


1 1 1 


63 






gene 22 clone HMSJW18. 






7730 


W80406 


Homo sapiens 


A secreted protein encoded by clone 


119 


69 








dh40_3. 






7731 


... 
G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


84 








7075. 






7732 


Y08769 


Rattus 


microvascular endothelial 


94 


38 




_ 


norvegicus 


differentiation gene 2 






7733 


U09500 


Homo sapiens 


cytochrome b 


463 


84 






Homo sapiens 


cytochronie c oxidase subunit 3 






7735 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


399 


92 


77j)6 


Y12950 


Homo sapiens 


Amino acid sequence of a human 


128 


55 






secreted peptide. 






7737 


AC003058 


.. . . 
Arabidopsis 


unknown protein 


167 


66 






thaliana 


■ 






7738 


W48353 


Homo sapiens 


Human breast cancer related proteui 


86 


70 








BCFLT2. 






7739 


D86854 


Catharanthus 
roseas 


extensin 


154 


40 


7740 


G 03243 


Homo sapiens 


. .. 

Human secreted protein, SEQ ID NO; 


122 


57 








7324. 






7741 


X860j1 


Sus scrofa 


homologue of proline/arginine-rich 


99 


49 














7742 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


35 








6834. 






7743 


V00662 


Homo sapiens 


cytochrome oxidase I 


568 


89 


7744 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


145 


93 


7745 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


154 


100 


7746 


AP000616 


Oryza sativa 


sunilar to RING-H2 finger protein 


133 


61 








RHAla(AF078683) 







336 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Ideiitk 

y 








7888. 


















7749 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


71 


T750 


AFl 30079 


Homo sapiens 


PR02852 


147 


~ 


7751 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


104 


49 


7752 


U43627 


Arabidopsis 


extensin 


116 


j7 


7753 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


71 


7754 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


551 


83 






Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 






7756 


AF119851 


Homo sapiens 


PRO 1722 


97 


56 






Caenorhabditis 
elegans 


similar to RE repeat region of 
PIR;S27806 D. melanogaster homeotic 
protein BarH2 protein (dual bar 
protem) 






7758 


AC002045 


Homo sapiens 


Unknown protein product CIT987SK- 
A-Jovri i_i splice lorm z 


97 


94 


7759 


Y02775 


Homo sapiens 


Human secreted protein encoded by 
gene iz cione rir icu ly. 


288 


98 


"7760 


^^^^^^ 


Homo sapiens 


Human secreted pioteiiiy SEQ ID NO*. 

7795. 


— 


— 




Y86248 


Homo sapiens 


SEQIDNO:163. 








AF09093 1 


Homo sapiens 








7763 


X92485 


Plasmodium 


pval 


114 


62 


7764 


AF09093I 


Homo sapiens 


PRO0483 


107 


86 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

6833. 








Y86248 


Homo sapiens 


Hiiman secreted protein HCHPF68, 
SEQ ID NO: 163. 






7767 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene '-'2 clone HMSJWl 8 


148 


72 





Yg5948 


— -. 

Homo sapiens 


SEQ ID NO: 163. 






7769 


Y40039 


Homo sapiens 


Peptide sequence derived from a 


68 


72 


7770 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


62 


— 


U93569 


Homo sapiens 


p^^^' 


188 


"43 


7772 


AL049934 


Homo sapiens 


hypothetical protein 


113 


47 




AF09093 1 




PRO0483 


"tIt 





^7774 


AP000616 


Ory^a sativT^ 


RHA]a(AF078683) ^ ^ 






7775 


G00328 


Homo sapiens 


4409. 


14j 


65 


7776 


AF126163 


Homo sapiens 


HHLA3 protein 


141 


71 


7777 


G00423 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4504. 


93 


47 


7778 


AF090942 


Homo sapiens 


PRO0657 


148 


74 


7779 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


194 


83 


7780 


U 14966 


Homo sapiens 


ribosomal protein L5 


315 


81 
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SEQID 
NO: 


Accession No. 


specie. 


Description 


Smith- 
Waterman 


Lntit 


7781 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


126 


47 


— 


AFl 19855 




PRO 1847 


-^j^— 






G00344 


Homo sapiens 


4425. 




53 














7785 


G00331 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


64 


42 


^7786 — 


Y86248 


~H aT^eiT 

omo sapiens 


ftman secreted protein HCHPF68 

SEQIDNO:I63. 


— — 


— 


7787 


AFl 18082 


Homo sapiens 


PRO1902 


131 


48 






Homo sapiens 


7795. ^ , SEQ ID 








J03807 


us purpuratus 


bistone Hl"delt& 


















779] 


V00662 


Homo sapiens 


cytochrome oxidase I 


522 


80 


7792 


Yj6j66 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


153 


72 


7793 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

-^T~~ — Ti ; J : 


84 


53 


^794 


W483 5 1 


Homo sapiens 


Human breast CEincer relsited protein 
BCRB2. 












SEQ ID NO: 163. 






— 






ATPase subunit 8 








Y86248 


H^mo sa'^iens 

omo sapiens 


Human secreted protein HCHPF685 
SEQIDNO:163. 


"135 


~?B 




AF090931 




PRO0483 






7799 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


57 






omo sapiens 


Similar to human 22kDa, SM22 mRNA 
(HIJM22SM). 


99 




84 


7801 


X64707 


Homo sapiens 


BBCi 


232 


77 






Homo sapiens 


7879. 






7803 


Y86248 


Homo sapiens 


SEQ ID NO' 163. 


112 


50 


"7804 


y^g^^^ 


Homosa iens 




"T42 


"56 


7805 


AFI1985I 


Homo sapiens 


PRO 1722 


88 


46 


7806 


G0j790 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


149 


57 


7807 


X68600 


Hordeum 


pZE40 


90 


37 


7808 


AK025047 


Honmsapiens 


umiamed protein product 


"138 


14 


7809 


AF130089 


Homo sapiens 


PRO2550 


Its 


Ti 


7810 


GO 1249 




Human secreted protein, SEQ ID NO: 

5330. 


T24 


~65 


7811 


R59842 










7812 


AFO 16446 


Caenorhabditis 


similar to C. elegans cuticulin precursor 
CUT-2 (SP:P34682) 


95 


42 


7813 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


154 


65 


7814 


AK025047 






165 




7815 


W34499 


Homo sapiens 


Obesity receptor C protein. 


309 


89 


7816 


AJ222673 


Hepatitis C virus 
type 2c 


core protein 


95 


42 


7817 


AB011483 


Arabidopsis 
thaliana 


contains similarity to root cap 
protein~gene_id:MUF9. 1 6 


104 


61 


7818 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


154 


74 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smltli- 
Waterman 


% 

Idenht 


7819 


X92485 


Plasmodium 
vivax 


~pval 


1 12 


~ 


7820 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 






7821 


G03789 




Human secreted protein, SEQ ID NO! 
7870. 






7822 


AF130089 


Homo sapiens 


PRO2550 


107 


67 


7823 


AF 118086 


Homo sapiens 


PRO 1992 


143 


73 




G02493 




6574. 






7825 


AL390114 


major 


probable (hhv-6) ul 102^ variant a 




59 


7826 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


42 


7827 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


65 


7828 


AL359782 


ToTJanosoma 


possible (hhv-6) ul 102, variant a dna, 


104 


58 


7829 


AF 132961 


Homo sapiens 


CGI-27 protein 


154 


49 






sylvestris 










ALl 60371 


major 


DNA, complete virion genome 








G02480 




6561. 








W88627 


Homo saprens 


clone HPMBQ32. 


99 


46 


7834 


G02469 


Homo sapiens 


Human secreted pVotein, SEQ ID NO; 

6550. 


74 


65 


78j5 


G025j8 


Homo sapiens 


Human secreted protem, SEQ ID NO. 


163 


55 


7836 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7563. 


72 


52 


7837 


AF090942 


Homo sapiens 


PRO0657 


126 


59 






respiratory 
syncytial virus 








7839 


D63480 


Homo sapiens 


The KIAAO 1 46 gene product is novel. 


158 


89 




W48352 


Homo sapiens 


BCFLTl. 


88 


6j 


7841 


G03789 


Homo sapiens 


7870. 


T56 


~60 


7842 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


145 




7843 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


127 


66 


7844 


M28016 


Homo sapiens 


cytochrome b 


214 


78 


7845 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


283 


82 


7846 


G03714 




Human secreted protein, SEQ ID NO: 
7795. 






7847 












7848 


L27428 


Homo sapiens 


reverse transcriptase 


210 


67 


7849 


G 02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


156 


96 


7850 


X92485 


Plasmodium 
vivax 


pval 


121 


54 


7851 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


59 


7852 


D38112 - 


Homo sapiens 


NADH dehydrogenase subunit 2 


191 


93 



339 
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SEQID 




Species 


Description 


Smith- 
Score 


Identit 

y 


7853 


W48808 


Homo sapiens 


Homo sapiens clone CG109_1 protein. 


72 


69 


7854 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


142 


76 


7855 U43627 


Arabidopsis 
thaliana 


extensin 


124 


35 


7856 AP000616 

I 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


95 


7857 1 AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhala. 


95 


57 


7858 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


157 


91 


7859 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


108 


45 


7860 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


159 


54 


7861 G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 


63 


7862 1 D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


265 


77 


7863 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


145 


56 


7864 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


222 


100 


7865 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


409 


87 


7866 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
EUfAla(AF078683) 


112 


50 


7867 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 
SEQ IDNO:163. 


169 


74 


7868 


AL031004 


Arabidopsis 
thaliana 


RSZp22 splicing factor 


127 


35 


7869 


U83771 


Accipiter 
superciliosus 


cytochrome oxidase I 


168 


73 


7870 


U83771 


Accipiter 
superciliosus 


cytochrome oxidase I 


186 


81 


7871 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


203 


56 


7872 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


130 


71 


7873 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


44 


7874 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


138 


69 


7875 


AF119855 


Homo sapiens 


PRO 1847 


174 


70 


7876 


AF2 10651 


Homo sapiens 


NAG 18 


164 


76 


7877 


Z93244 


Homo sapiens 


bKl 16F5J2 (PUTATIVE RhoGAP 
(CDC42 GTPAse Activating Protein) 
LIKE protein) 


111 


45 


7878 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


227 


79 


7879 


AK024455 


Homo sapiens 


FLJ00047 protein 


120 


68 


7880 


Y36156 


Homo sapiens 


Human secreted protein #28. 


85 


68 


7881 


AF116661 


Homo sapiens 


PROI438 


138 


61 


7882 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


55 


7883 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


111 


63 


7884 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


106 


64 


7885 


AK022821 


Homo sapiens 


unnamed protein product 


175 


72 


7886 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:I63. 


117 


59 



340 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Waterman 


Mentit 






Homo sapiens 


reverse transcriptase 






7888 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

. .. 


139 


61 


89 


Ml 5530 


Homo sapiens 


B-cell growth lactor 






7890 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 


81 


64 


7891 


AB046048 


Macaca 
fasciculans 


unnamed portein product 


150 


61 


7892 


GO 1249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


73 


7893 


AF130089 


Homo sapiens 


PRO2550 


224 


61 




G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 

6619. 






7896 

- 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




149 


50 




Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 






7898 


Y73966 

- ... - 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein # 1 53. 


144 


65 


7899 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


83 


"7900 


AF026689 


Homo sa iens 


'^prostate specific trans-^lutaminase 


"104 


"57 


7901 


X03717 


Homo sapiens 


pot. unidentified reading frame 


105 


46 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

4435. 








X6 I 046 


Hydra sp. 


mini-collagen 






7904 


AF 124726 


Homo sapiens 


acinusL 


111 


44 


7905 


U77049 


Ovis aries 


bactinecin 1 1 


116 


38 


7906 


AFl 1 8086 


Homo sapiens 


PR01992 


130 


73 


7907 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


48 


7908 


AL359782 


Trypanosoma 


possible (hhv-6) u 1 102, variant a dna, 
complete virion genome. 


124 


83 


7909 


AC004450 


Arabidopsis 


putative extensin 


167 


34 


7910 


AFl 3 0079 


Homo sapiens 


PR02852 


110 


42 




AF090942 


Homo sapiens 


PRO0657 






7912 


AF09093I 


Homo sapiens 


PRO0483 


96 


80 


7913 


G0j203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


46 


7914 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


68 


7915 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


75 


7916 


X92485 


Plasmodium 


pval 


102 


67 


7917 


AF090942 


Homo sapiens 


PRO0657 


126 


76 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 






7920 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


138 


61 


7921 


L26251 


Trypanosoma 


CR5 


95 


37 


7922 


G03714 


Homo sapiens 


Human secretedproteihTSEQ ID NO: ~ 
7795. 


176 


T3 



341 
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SEQID 
NO: 


Accession No. 






Waterman 


Ideiitit 

y 




G02532 




6613. 


















7925 


Y36156 


Homo sapiens 


Human secreted protein #28. 


137 


67 


7926 


Yj>6495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


1 19 




7927 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


65 


7928 


Y02693 . 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22 


138 


60 


"7929 


G02832 


Homo sapiens 


6913. 


166 


^6l 




AF004341 










7931 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

^' : 


176 


53 


^7932 


Ygi^j-j 


Homosa iens 

omo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


"179 


— 




V0066'^ 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 






7934 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


132 


63 


~TT5 


1 

W4o351 


: 

Homo sapiens 


Human breast cancer related protein 

BCRB2. 








AFO 1 0 1 44 




neuronal thread protein AD7c-NTP 






7937 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 

Clone rLrlYliJ'^j^. 


320 


80 


"7938 


G00403 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


T59 






AF 144 054 




apoptosis related protein APR-4 






7940 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO'250 


314 


73 


7941 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


121 


57 


7942 


AB040936 


Homo sapiens 


KIAA1503 protein 


149 


71 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

7437. 






7944 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
exteifsln ^^"^ ^ 


74 


60 


"7945 


Yi^l^l 


Bromheadia 

finlaysoniana 




"68 


"35 


7946 


Y08062 


omo sapiens 


derived fix)m DNA35638. 


126 


50 


7947 


AP000735 


Arabidopsis 


extensin protein-like 


214 


40 


7948 


AF161356 


Homo"sapiens 


HSPC093 


l34 


"75 


7949 


AF090942 


Homo sapiens 


PRO0657 


110 


48 




AJ249395 


pallida 








"tIs — 


AF116661 




PRO 143 8 


"Til 






Y86248 


Homo sapiens 

omo sapiens 


SEQIDNO:163. 









M15530 




B-cell growth factor 






7954 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


137 


~ 


7955 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


130 


62 


7956 


AF 13 0089 


Homo sapiens 


PRO2550 


153 


62 


7957 


AL160371 


Leishmania 


probable (hhv-6) ull02, variant a 
DNA, complete virion genome 


99 


48 


7958 


AF161356 


Homo sapiens 


HSPC093 


154 


56 



342 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 


% " 

Identit 

y 


^7959 


"G02832 


"Homo sapiens 


Human secreted protein, SEQ ID NO: 

6913. 









X92485 


vivax 


pval 


107 


5j 


7961 


AF2 10651 


Homo sapiens 


NAG 18 


113 


77 


"tIs^ — 


AF130051 




PRO0898 








R59842 


Homo sapiens 




"Tie 


-— 


79"64~ " 


"AF09093r 


Homo sapiens 


PRO0483 


129 


56 


7965 


Y64890 


Homo sapiens 


TO NO' 1051 


151 




^7966 


j^gg^^ 


Homo sapiens 




Tio 


"40 


7967 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


180 


50 


7968 


AF090894 


Homo sapiens 


PRO0113 


104 


58 


7969 


Y2 1 040 


Homo sapiens 


GFAP mutant iragment 49 


285 


100 


"^7970 


AK001452 


Homo sapiens 




^l46 


~S6 


7971 


V00672 


Pan troglodytes 


reading frame protein 4 


134 


90 




Y86248 




SEQ ID NO: 163. 








W48351 


Homo sapiens 


BCRB2. ^ 








AFI 16661 


Homo sapiens 


rKvj 14 Jo 






^75 


AFl 19882 


Homo sapiens 


PR02492 


112 


48 




G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 










Homo sapiens 


X-Iinlced retinopathy protein {C- 
terminal, clone XEH.Sc} 








\/( \ 1 am 
iVil 1 y U 1 


norvegicus 


proline-rich salivary protein 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 






7980 


U4997j 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element — 


289 


84 


^70X1 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




59 


"7982 


^^^g^^ 


"RatEs 

norvegicus 


PR-Vbetal 


To9 


"69 


7983 


Y86248 


omo sapiens 


SEQ ID NO: 163. 


149 


73 


7984 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


128 


72 


7985 


R59842 


Homo sapiens 


ApoE4Ll protease. 


130 


82 


-^11^ 


AF090895 




PRODI 17 


~tIi 






X07881 


Homo sapiens 






~ 


7988 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


53 


"^79 89 


AF090894 


Homosa iens 


PROOl 13 


~T04 


"61 


7990 


Z93244 


Homo sapiens 


bKl 16F5.2 (PUTATIVE RhoGAP 
LIKE protein) 


100 


61 


7991 


AK024455 


Homo sapiens 


FLJ00047 protein 


131 


56 


7992 


W48351 




BCRB2. ^ 






7993 


AF2 10651 


Homo sapiens 


NAG18 


133 


61 


7994 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


92 


42 


7995 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


171 


82 


7996 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


54 



343 
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SEQID j Accession No. 


Species 


Description 


Smitli- 
Waterman 


% 

Identit 


7997 1 AF090928 


Homo sapiens 


PRO0470 


86 


76 


7998 


AJ251579 


toalkna''^^^ 


cef protein 


113 




7999 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871, 


119 


60 


8000 


R95913 


Homo sapiens 


Neural thread protein. 


105 


70 


8001 


AF229067 




PADI~H protein 






8002 


W88609 


Homo sapiens 


Secreted protein encoded by gene 76 
clone HKIXL73. 


83 


65 


8003 


R96800 


Homo sapiens 




"85 


"61 


8004 


S79410 


Mus musculus 


nuclear localization signal binding 


123 


43 


8005 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


93 


68 


8006 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


64 


8007 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


147 


38 


8008 


G03714 




Human secreted protein, SEQ ID NO: 
7795. 






8009 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


123 


71 


8010 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


54 


8011 1 G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


59 


8012 1 G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


66 


8013 ' AF161356 


I lomo sapiens 


HSPC093 


185 


64 


8014 G00357 


Homosap.ens 


4438. 






8015 


G00413 


Homo sapiens 


Humon secreted protein, SEQ ID NO: 
4494. 






8016 


Y02999 




Fragment of human secreted protein 
encoded by gene 121. 






8017 


Y36366 


Homo sapiens 


encoded by gene 3. 






8018 


W4S351 




BCRB2. 






8019 


G02532 




6613. ^ 






8020 


G03639 




7720. ^ 






8021 
8022 ^ 


G03714 




Huinan secreted protein, SEQ ID NO; 

7795. 








6908. 






8023 


G02827 


Homo sapiens 


6908^" protem, SEQ ID NO. 


195 


68 


8024 


G03786 


Homo sapiens 


Human secreted protein, SEQ.IDNO: 


158 


53 


8025 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


133 


58 


8026 


X92485 


Plasmodium 


pval 


128 


56 


8027 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


135 


61 


8028 


Y02749 


Homo sapiens 


Hmnan secreted protein encoded by 
gene 100 clone HNFIU96. 


105 


66 


8029 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


126 


59 



344 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Waterman 
Score 


Jidentit 

y 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 






8032 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


41 


8033 


Y40039 


Homo sapiens 


Peptide sequence derived from a 


113 


69 


8034 


YI4482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


102 


66 




Y86248 


Hotho sapiens 


Human secreted protein nCHPFoS, 
SEQ ID NO: 163. 


221 




8036 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 






8037 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


55 


8038 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ8 1 . 


108 


64 


8039 


AF0I0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


l^" . 




8040 1 G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


169 


86 


8041 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 

derived protein ^153. 


170 


56 


8042 


W48351 


Homo sapiens 


Human breast cancer related protein 


95 


58 


8043 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


68 


8044 


AF220264 


Homo sapiens 


... 


83 


68 


8045 


AK024455 


Homo sapiens 


FLJ00047 protein 






8046 


Y73966 


Homo sapiens 


Human prostate tumor EST fr^ment 
derived protein #153. 


181 


78 


8047 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


147 


... 
61 


8048 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


102 


43 


8049 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


59 


8050 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 






8051 1 D00570 


Mus musculus 


open reading frame (196 AA) 






8052 


AF040964 


Homo sapiens 


unicnown protein ITl 


143 


50 


8053 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

onl.^ LU JNU. 103. 


168 


8j 


8054 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

H '■ — ocn fn Nn- — 


108 


70 


8055 


G02538 


— : 

Homo sapiens 


lunan secrete protein, S Q NO. 


— 


— 


8056 


Y86248 


~H '■ 

omo sapiens 


Human secreted protein HCHPF68 

SEQ ID NO: 163. 


— — 


~^ 


8057 


AF090895 


Homosa lens 


PROOl 17 


TTo 


~77 


8058 


AP000616 


O^'za sativa"^ 

ryza sa iva 


similar to RING-H2 finger protein 
RHAla (AF078683) 


124 


"90 


8059 


AF118082 


Homo sapiens 


PRO 1902 


Tn 


"50 


8060 


AK022217 


Homo sapiens 




l45 


"53 


8061 


AF 13 0089 


Homo sapiens 


PRO2550 


160 


76 


8062 


AF119851 


Homo sapiens 


PRO 1722 


177 


54 


8063 1 D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


362 


73 


8064 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


426 


85 


8065 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


344 


90 


8066 


AF068294 


Homo sapiens 


HDCMB45P 


146 


60 


8067 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


461 


87 


8068 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


504 


90 



345 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 




Accession No. 


— 


Description 


Smith- 
Waterman 




Identit 

y 






X8603 1 


Sus scrota 


— J p — — :-— 

homoiogue or proline/arginine-nch 




40 


8070 


R59842 


Homo sapiens 


ApoE4Ll protease. 


147 


47 




AFl 1985 1 




PRO 17*^2 






8072 


X92485 


Plasmodium 


pval 


183 


45 


8073 


AFl 16712 


Homo sapiens 


PR02738 


95 


52 




AF213465 


Homo sapiens 


dual oxidase 






8075 


AF090895 


Homo sapiens 


PRO0n7 


153 


57 






Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 








T85403_cdl 


Homo sapiens 


19-APR-1993 Human cadherin-1 1 
coding sequence. 


284 


98 




Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO:1051. 








Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 








G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


Tso 


52 




G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


37 


8082 


X05285 


Drosophila 
melanogaster 


fibrillarin 


100 


34 


8083 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


142 


65 


8084 


G02216 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




380 


82 


8085 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


146 


64 


8086 


X92485 


Plasmodium 




138 


64 


8087 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 


117 


82 


8088 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 5 


80 ' 


.. 


8089 


U27143 


Homo sapiens 


protein l^inase C inhibitor-I 


119 


41 


8090 


AFl 19851 


Homo sapiens 


PRO 1 722 


85 


47 


8091 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


112 


73 


"8092 


S79410 


Mus musculus 


nuclear local iization signal binding 
protein 


133 


"64 


8093 


AFl 19854 


Homo sapiens 


PR01843 


522 


100 






Plasmodium 


pval 










Homo sapiens 


NADH dehydrogenase subunit 1 






8096 


AF097473 


Mus musculus 


ORFl 


109 


46 




AF097473 


Mus musculus 








8098 


Y17833 


Human 
endogenous 
retrovirus K 


pol protein 


281 


73 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 








G04072 




Human secreted protein, SEQ ID NO; 
8153. 






8101 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


8102 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


145 


70 


8103 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


163 


71 



346 
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SEQID 


Accession No. 


Species 


Description 


Smitli- 




NO: 








Waterman 


Identit 












y 


"8104 


AF 13 0089 


-p -. 

omo sapiens 


PRO2550 


"FT — — 


'% 


g 


G03411 


omo sapiens 


Human secreted protein, SEQ ID NO. 


TT6 
















8106 


Y73966 


Homosa iens 

omo sapiens 


Human restate tumor EST Ir "5 — nt 


T^2 











deriv^ protehf #1 53*"^ agraen 






8107 


AF090942 


Homosa iens 


PRO0657 


~123 


~\ 


8108 


AF010144 


—■ 

omo sapiens 


neuronal ttiread protein AD7c-NTP 


TO 


— 


8109 


AL160371 


eis ania 


rvwA^ ^ \''t ■ ■ '^^'^''^ 


~ 


~ 






m^^or ^ 


, comp e e ymon genome 






8110 


D38112 


omo sapiens 


cytoc n-ome c 0X1 asesu luii 


— 


— 

-IZ. 


8111 [ AF130089 


Homo sapiens 


PRO2550 









8112 


AP000616 


Oryza sativa 


similar to R1NG^H2 finger protein 


122 


~ 

90 








RHAla (AF078683) 






"8lT3~ 


~G02872 


llomosnen 

omo sapiens 


^'^'^^^^'^ protein, SEQ ID NO. 


l82 


^7 














8114 


G00354 


— : 

Homo sapiens 


— ^ : T^VT/^ 

Human secreted protein, SEQ ID NO: 


— rn 

101 


— 

77 








_^f£^^ '. ; — 






8115 


G00452 


; 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


72 


50 








^Att j u j u. ■• 1 ■ 






8116 


D38112 


_ : 

Homo sapiens 


NADH dehydrogenase subunit 3 


... .. 


88 


8117 


L17318 


Rattus _ 


proline-rich proteoglycan 


110 


45 






norvegicus 








8118 


S79410 


Mus musculus 


i i — r- — : = ■ . 

nuclear localization signal binding 




125 


62 














8119 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


78 








-Tj^^' ; 






8120 


D38112 




Homo sapiens 


NADH dehydrogenase subunit 1 


. 






8121 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


45 








.^^IL. , 






8122 


D86854 


u 

Catharanthus 


extensin 


— _ 

142 


j5 






"^^^"^ 








8123 


X924S5 


Plasmodium 
vivax ^ 


pval 


144 


5j> 


8124 


G03789 


Homo sapiens 


— : 

Human secreted protein, SEQ ID NO: 


138 


59 








7870L ^ 


1 




8125 


G02532 


: 

Homo sapiens 


Human secreted protein, SEQ ID NO; 


163 


67 














8126 


Y02671 


^ 

omo sapiens 


aiman secreted protein encoded by 


-— 


— 








gene 22 clone HMSJWI8. 






8127 


X07881 


-rj -. 

omo sapiens 


proline-rich protein 01 


To^ 


— — 




S128 


X70343 




ex ensin 


















8129 


W48351 


^ ■ 

omo sapiens 


r j: r-j—j = 





— 








BCRK '^^^ cancer re a e proem 






8130 


AF090895 


-— ^ 

omo sapiens 


PROOl 17 


-— 

T^^ 


— 


8131 


D00570 


us muscu us 


open reading frame (251 AA) 


— ^ 


~ 


8132 


G02532 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


106 


— 

61 








F^a'' e f h cr ted r te" 






8133 


Y 14482 


Homo sa iens 

omo sapiens 







— 








np^i^iQf o ^ ' 






8134 


AF130079 


— : 


PR02852 


— — 


1 — 


8135 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


~T09 


~38 








7795. 






8136 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


64 








7437. 






8137 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


112 


43 








protein CON-1. 






8138 


AF119900 


Homo sapiens 


PR02822 


113 


57 


8139 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


427 


86 



347 
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SEQ ID 
NO: 


Accession No. 


Species 




Waterman 


Identit 

y 




AF004341 










8141 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


58 


8142 


D70836 


Homo sapiens 


Zinc-finger protein 


156 


47 




AC00229 1 


ttialiana 








8144 


X55683 


esculentum"" 


extensin (class I) 


85 


j4 


8145 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


76 


8146 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


63 


8147 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


79 


8148 


M22332 


Homo sapiens 


unknown protein 


156 


40 


8149 


U23455 


Caenorhabditis 


protein BarH2 (PIR A41726) 


92 


76 


"8150 — 


G004I6 


Homosa iens 

omo sapiens 


4497. 


"Fn 


"66 




U49973 


Homo sapiens 


ORPl; MER37", putative ti'ansposase 
similar to pogo element 






8152 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


155 


67 


"8153 ' 


AF130089 


Homo sapiens ' 


PRO2550 


129 


"42 


8154 


AF130089 


Homo sapiens 


PRO2550 


173 


50 






Homo sapiens 


Human secreted pFOtein, SEQ ID NO; 
7795. 








G02532 




Human secreted protein, SEQ ID NO: 
6613. 


















8158 


U11271 


Homo sapiens 


thromboxane A2 receptor 


138 


73 


8159 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


5j 


8160 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5 --: TT^VT/-> 


139 


58 


"8161 — 


G02538 


Homo sapiens 


Human sscreted protein, SEQ ID NO: 
6619. 


"155 


"59 








7795. ^ . EQID 0 








AF161361 




HSPC098 






S164 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


58 


8165 


AF130089 


Homo sapiens 


PRO2550 


121 


63 


8166 


Y02671 


Homo sapiens 


gene 22 clone HMSJW18 


148 


85 


8167 


X92485 


Plasmodium 




97 


56 


"8168 


X92485 


Plasmodium 

vivax 


"pval 


"TTs 


— 








7795. 








AF010144 




neuronal thread protein AD7c-NTP 




60 


8171 


X92485 


Plasmodium 


pval 


105 


44 


8172 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


75 


64 


8173 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


106 


65 


8174 


X86031 


Sua scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


108 


44 



348 



SEQID 


Accession No. 


Species 


Description 


Smittj- 


% " ■ 


NO: 








Waterman 


Identit 












y 


"8175 


ygg^Qg 


Ho 0 sa iens 

h""^" sapiens 


Human secreted protein #75 


■ 





"8176 





omo sapiens 


Human secreted protein, SEQ ID NO" 


TI4 ' 










6613. 






~8TT7 


(302538 


Ti — 0 sapiens 

om pi 




T^^5 


~6x 








6619. 






"sTts — 


AF090928 


~H ^ 


PRO0470 


"TT9 


~T) 


"8T79 — ' 




Homo sapiens 




141 


~15 








gene 22 clone HMSJW 18. 








AF220264 




MOST- 1 






8181 


AFl 19851 


Homo sapiens 


PRO 1722 


143 


73 


8182 


W48351 


Homo sapiens 




1j4 


64 








BCRB2. ^ 






~8T83 




Homo sa iens 

omo sapiens 




"143 


~SQ 








7492. 






. 


X92485 


—. 

vivax° 


pval 


——= 


— 


"8185 


X92485 


Plasmodimn 

vivax° '™ 




pva 


T30 





"8186 


AF119851 — 


omo sapiens 


PRO 17'^'? 


~ 


— 


"sTsT — ' 


AF090942 


omo sapiens 


PRO0657 


--^ 


^ 


— 
— 


AF090931 


Homo sapiens 


PRO0483 


154 


66 




Y0269j 


Homo sapiens 




148 


43 








gene 44 clone HTDAD22 






"8190 


U49973 


Homo sapiens 




368 


15 








similar to pogo element 






8191 


G02639 


Homo sapiens 


^720^ ^^"^^^'^ protein, SEQ ID NO. 


110 


43 














"8192 


"Y6"267"i 


TTln — a ien 

omo sapiens 


Human secreted protein encoded by 


"129 


Ix 








gene 22 clone HMSJW18. 






"8193 


AFl 1 3685 


"h '' 

H°m° sa^r°^ 




~X6 


-— 


8194 




^ 


Human secreted protein '^J^-^ — 


~r2 


"61 


8195 


GO'' 827 


omo sapiens 


Mn^^" ^^^"^^ * 

_^ — 


"^FF^ 


— — — 
11 


— 


S801 19 


"R~th 


homdo "^"ue ot" mHne/ar"lniife rich 


-— 





"8197 


X8603 1 


Sus scrofa 

us scro a 




124 


15 








ant™acteriai° i^teii"^ aiginme-ric 






"8198 


^Q^g^^ 


~Hm — a"ien 

omo sapiens 


IHu — n ec ted r t — SEO ID NO- — 


Tr\ 


-— 








5953^" secre e pro em, y 






"8199 — ^ 


G00637 


Homosa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 


"91 


"68 














8200 


Y36203 


Homosa iens 


Btonan secreted rotein#75 


"Tsl 


"56 






„ ^ — aieni 


Human secreted protein #75 


~T56 


l4 


8202 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


"166 


l8 


8203 


G03790 


Homo sa iens 

omos piens 


Human secreted protein, SEQ ID NO' 





lo 








7871. 






"8204 — 


Y36203 


Homosa iens 




Ttt 


Is 


"8205 




Homo sapiens 

omo sapiens 


Human sea-eted protein 'sEQ ID NO- — 


— r^- 


-— 








7324. 






8206 


Y02671 


Homo sapiens 




"190 


I2 








aene 22 clone HRISJWIS. 






— 


A FO 9 09 31 


Homo STpiens 


PRO0483 


160 


It — 


8208 


AF126163 


Homo sapiens 


HHLA3 protein 


107 


67 


8209 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


41 








8144. 






8210 


Y36156 


Homo sapiens 


Human secreted protein #28. 


100 


62 


8211 


AF161356 


Homo sapiens 


HSPC093 


73 


60 


8212 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


34 








7871. 






8213 


AB046048 


Macaca 


unnamed portein product 


146 


61 



349 
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SEQD} 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Intit 

y 






fascicularis 










AF130079 




PR02852 






8215 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


109 


50 


8216 


AF090930 


Homo sapiens 


PRO0478 


153 


79 


8217 
^218 ^ 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
3?^^' ■ p— j p 


105 


60 


W48351 


omo sapiens 


BC]^2 cancer re a proem 


-— g 


-— 


8219 


Ml 1901 


Rattus 

noivegicus ^ 


proline-rich salivary protein 


102 


42 


"8220 — 


G00689 


omo sapiens 


~H ted n — SFD TD NO- 

4770™ ^^''^^ ^™ 


~P5 


— 


8221 


XS2235 


Homo sapiens 


ORFII 


174 


39 




AF130089 


Homo sapiens 


PRO2550 







~8223 — 


AF068294 




HDCMB45P 


T§T 




~8225 




Oryctolagus"^ 

cimiculus 




~219 


~90 




AFl 13685 




PRO0974 






8226 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


430 


85 


8227 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


1 15 


58 


8228 


Y36156 


Homo sapiens 


Human secreted protein #28. 


117 


50 




AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 






8230 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


50 


"823l — 


AP000616 


Oryza s'ativa 

ryzasativa 


^■j^^^^p-^^^^^j^^j^-jg-g^^ protein 

RHAla(AF078683) 


"105 


~46 


"l^l^ — 


S62928 


Homo sapiens 


PRB 1 M protein precursor 








AB047600 


fascicularis 




142 




82j4 


G02872 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO. 


104 


48 


8235 


U12695 


Pan paniscus 


cytochrome oxidase subunit 11 


259 


84 


82j6 


KOI 664 


mdano'^as^ter 


Bkm-like protein 


116 


61 


8237 


AF11666] 


Homo sapiens 


PR01438 


135 


56 




P70494 




factor (BCGF), 








yQ^g^^ 








41 — 


"1535 — 




Homo sapiens 


Human secreted protein encoded by ' 

gene 22 clone HMSJWl 8. 


"TWO 






270292 




chemokine CC-1 






8242 


AL160493 


Leishmania 


probable (hhv-6) ul 102, variant a 


119 


82 


8243 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ JD NO: 163. 


97 


83 


8244 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


114 


78 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4419. 








Y20717 


Homo sapiens 


Human neurofilament-M wild type 


74 


60 


8247 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


115 


58 


8248 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


127 


58 


8249 


G037S9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


107 


84 


8250 ; AF118082 


Homo sapiens 


PRO1902 


136 


50 



350 
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SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 




Identit 










"TF — = 

Homo sapiens 


— J -. — 

^wnan secrete protein, Q 




59 














"8252 


GO 1828 


Homosa iens 

omo sapiens 


Human secreted protein, SEQ ID NO; 


To6 


"si — 














~8253 


AF130089 


Homosa iens 


PRO2550 


^143 


33 


8254 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 







AK022821 


Homo sa iens 

omo sapiens 


minamed piotein product 




"68 





G03714 


Homo sapiens 


Human secreted protein, bbQ ID NO; 


Td3 

143 


— 

53 








fe^hi heav chain 






g 


M24509 


Mu nu cul 

— ^ — ~ 


eriiin eavy c am — ^ — — 


^47 


"69 


— 


G02872 


omo sapiens 


^uman secrete protein, Q ; 


— — 


— 

57 

















U93570 


■ 

omo sapiens _ 


- - 


"254 




— = 

— 




AF2 180^8 


omo sapiens 


^" ' — : — 








8261 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 


— 1 

11 


— 

68 








protein Jragmcnt 59. _ 








8262 


— 


, 

Homo sapiens 


cytochrome c o.x.idase aubunit 3 


— 

604 


— 


8263 


G028j2 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1j2 


60 




— — — -— — — 




J- = 








AL139298 


Streptornyces 


putative membrane protein 




34 






coelicolor A3 (2) 








8265 


— 1 


Homo sapiens 


Human secreted protein, SEQ ID NO; 




70 
















8266 


— — — ^ 

G03789 


-— ; 

Homo sapiens 


Human secreted protein, SEQ ID NO: 




100 


64 








^f7^". — _ _ . 






— — 

8267 


AF2o55/5 


— : 

Homo sapiens 


ubiquitous TPR-motif protem Y 


-— 

113 


— 

38 








isoform 






8268 


X92485 


Plasmodium 


pval 


102 


63 







VIV^C 










X61048 


Hydra sp. 


— _ — _ 

mini-coilagen 


110 


46 


8270 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


82 
















8271 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


. 

121 


75 














— 1 


W4835 1 


— ^ 

omo sapiens 


^Inan breast cancer related protein 


"TTvT 


— 








BCRB2 






-— — 


AF090944 


— ^ 

omo sapiens 


PRO0663 


199 


~i9 





G00407 


omo sapiens 


Human secreted protein, SEQ ID NO. 


— 


— 

















G03790 


— ; 

Homo sapiens 


— '■ T : — — 

Human secreted protem, SEQ N . 


— — 


— 














"8276 


AF130089 


omo sapiens 


PRO2550 


"97^ 


""S6 


— 


L27428 


omo sapiens 


reverse transcriptase — 


"Tn? 


— 


— — — 
8278 


S79410 


Mus musculus 


nuclear localization signal binding 


—— 

112 




54 








^^^"^ rt = — <;T?n m Mn- — 






— 





— 

omo sapiens 


uman secreted protein, SEQ NO. 


9 




73 














— 


G04072 


omo sapiens 


"Tf — '~ S~rt — P — <!T?n m >jr> — 

uman seer e proem, Q 


TT^Q 


"71 
















— -|i 


At 1 v4yz 1 


— ; : 

omo aapiens 


■' ,1 P A — — h Th h — ' — 


Trr 


— 




Y 60Z40 


omo sap.ens 


''lluimn sec°eted p'^ote^nlli'II^^~^ 












SEQ ID NO: 163. 






8283 


X55683 


Lycopersicon 


extensin (class I) 


96 


36 






esculentum 








8284 


AF225910 


Mus musculns 


DAZ-associated protein 1 


117 


33 


8285 


AF130089 


Homo sapiens 


PRO2550 


347 


75 


8286 


AB046100 


Macaca 


unnamed protein product 


140 


45 






fascicularis 








8287 


X55684 


Lycopersicon 


extensin (class I) 


67 


45 



351 
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SEQID 
NO: 






Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 














"8288 


AF029670 


Homosa iens 


RadSlC truncated r te' 


^86 


~ 


"8289 — 


Y02886 


Homo sapiens 


Fragment of human sea'gted protein 

encoded by gene 90. 


^49 


~50 


8290 


G03411 




Human secreted protein, SEQ ID NOi 
7492. 








G00357 


Homo sapiens 


4438. 


152 


50 


8292 


D89999 


Gallus gallus 


muscle derived protein 


1134 


66 


"P§l 














W61170 


Homo sapiens 


"^^"olypelrtides^uence from 

CP140 partial cDNA. 


^il 


~ 




S62077 




autoantigen 






— 


AFl 19882 


Homo ^^P|^||^ 


PR02492 





4i — 




G02532 


omo sapiens 


Human secreted protein, SEQ ID NO. 






8298 


AK025116 


Homo sapiens 


unnamed protein product 


259 


82 


8300 


X92485 




pval 


162 


47 


8301 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


76 


^302 


AF092176 


Homo sa iens 


MHC class II antigen 


"280 


"84 


~8303 


AFl 18082 


Homo sa iens 


PRO 1902 


"174 


"52 


'P04~' " 


G02493 


TiomoTall^ns 

omo sapiens 


6574. 


~To^ 


1o — 


8305 


AK023563 


Homo sapiens 


unnamed protein product 


244 


60 




G03714 


omo sapiens 


7795. 


1^6 


66 


8307 


U 15647 


Mus musculus 


reverse transcriptase 


107 


Is — 


8308 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


110 


40 




AB005540 




PCTAIRE2 






8310 


M37679 


Mus musculus 


Ig heavy chain precursor 


95 


"P 


8311 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


137 


82 


~83T2 


W29643 


Homo sapiens 




~yi2 


^00 — 


8313 


AFl 13685 


Homo sapiens 


PRO0974 


116 


39 




W80406 




dh40 3. 


















8316 


AE003536 


DrosopMr"^ 

melanogaster 


CG6451 gene product 


"Wf 


"IT 




M19419 











"8318 


G03786 


Homo sapiens 


EtonaTseCTeted^^tein ^ ID NO" — 
7867. 




"15 


8319 


Y29862 


Homo sapiens 


Human secreted protein clone 

du515 21. 


399 


8] 


8320 


Z34278 


Homo sapiens 


mucin 


103 


23 


8321 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


142 


91 







Homo sapiens 


SEQ ID NO 402 from W09922243. 


"1^2 






Z46791 


Caenorhabditis 
elegans 


PF01391 (Collagen triple helix repeat 
(20 copies)), Score-67.1, E- 
value=l ,2e-I6, N=3; PF01484 
(Nematode cuticle collagen N-tenninal 
domain), Score=82.0, E-value=3.9e-21, 
N=l 




-~ 


8324 


AF021232 1 Homo sapiens 


TRAIL-R4-A 


113 


100 



352 
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SEQ ID 
NO: 




Species 


Description 


Smitli- 

Waterman 

Score 


% 

Identit 


8j25 


AB046085 


fesctoilaris 


unnamed protein product 


292 


75 


8326 


AF113944 


Muntiacus 
muntjak 


prion proten 

pn p 1 


"96 — 


"39 


8327 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7878. 


125 


~63 


8328 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6292. 


123 


62 


8329 


AJ005897 


Homo sapiens 


JM5 


275 


73 


8330 


AFl 19900 


Homo sapiens 


PR02822 


138 


76 


8331 


AF163573 


Homo sapiens 


CARKL 


359 


84 


8332 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


103 


37 


8333 


AF026246 


Homo sapiens 


HERV-E integrase 


354 


78 














8335 


AB001517 


Homo sapiens 


KNP-I alpha protein 


126 


76 




X97490 


Mus musculus 


PNG protein 


469 


98 


8 J 37 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


159 


74 


8338 


V18354__cdl 


Homo sapiens 


17-NOV-1992 Human R20 seven 

ti'ansmembrane (7TM) receptor 
genomic DNA. 


379 


92 


8j39 


VI 83 54 cdl 


Homo sapiens 


17-NOV-1992 Human R20 seven 
genomkDNA^ {7TM) receptor 


402 


93 






Homo sapiens 


intestine-specific annexin 


385 


89 




G04072 




8153. ^ 








AC003113 


thaliana 














C protein (AA 1-159) 




— 


~§5?3 — 


D86549 


Homo sapiens 




"575 




8345 


Y73384 


Homo sapiens 


HTRM clone 2284580 protein 


1513 


100 


8346 


AP001745 


Homo sapiens 

m sapi ns 


human cDNA DKFZp586F0422, 
Accession No. AL050173 


ITs 


"40 


8347 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


121 


49 


~8348 — 


AF091090 


Homo sapiens 




T25 


"44 


8349 


AFl 06542 


Homo sapiens 


pregnancy-specific beta- 1 glycoprotein 
1 


145 


100 


8350 


AF090942 


Homo sapiens 


PRO0657 


252 


65 


8351 


L23116 


Homo sapiens 


galctocerebrosidase 


185 


85 


8352 


AJ243803 


coelicolor A3(2) 








8353 












8354 


U09477 


Homo sapiens 


p53-binding protein 


388 


100 


'^Mi — 






multifimrtional^cdc^ 









AF071569 


Homo sapiens 


dependent protein kinase II delta2 




"155 — 


8357 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 
heemoglobiti gene contained in pSSI. 


321 


81 


8358 


Y12916 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


759 


100 


8359 


AF038961 


Homo sapiens 


SL15 protein 


166 


100 


8360 


W75093 


Homo sapiens 


Human secreted protein encoded by 
gene 37 clone HFVGS85. 


328 


90 


8361 


R55326 


Homo sapiens 


Tumor necrosis factor receptor 1 death 
domain ligand (clone 2DD)- 


386 


97 



353 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 


8362 


AF026954 


iBoslaurui 


pyruvate dehydrogenase phosphatase 
regulatory subunit precursor; PDPr 


269 


88 


8363 


X07855 


Homo sapiens 


G-Protein 


289 


98 


8364 


ALII 0247 


Homo sapiens 


hypothetical protein 


265 


97 


8365 


AB046774 


Homo sapiens 


KIAA1554 protein 


3619 


99 


8366 


M10050 


Homo sapiens 


fatty acid binding protein 


97 


100 


8367 


M20882 


Homo sapiens 


pregnancy-specific beta-glycoprotein e 


1097 


99 


8368 


AB051901 


Homo sapiens 


VDUPl 


662 


98 


8369 


U72678 


Mus musculus 


EF-9 


176 


65 


8370 


ALII 7237 










8371 


M10617 


Homo sapiens 


L-FAB? 


104 


"84 


8372 j AJ2499S0 






^^^^ 


97 


8373 


AF003136 


Caenorhabdi'ds 
elegans 


contains weak similarity to an AMP- 
binding motif 






8374 


X91638 










8375 


W75093 


Homofaple'ns 


HumanT^reted protein encoded by 

gene 37 clone HFVGS85. 





"TSo — 


8376 


Z36714 











8377 


X86401 


Homo sapiens 


L^arginine- glycine amidinotransferase 


"555 




8378 


M12413 


Homo sapiens 


apolipoprotein BlOO 


1278 


99 


8379 


G00328 




Human secreted protein SEQ ID NO* 
4409. 






8380 


AF151860 




CGI-102 protein 






8381 


Y91577 


Homo sapiens 


Human secreted protein sequence 


255 


92 


8382 


AFO 17807 


Homo sapiens 


Aip2/3l:omptex fekD^ 


TTs 


— 


8383 


U28727 


Homo sapiens 


pregnancy-associated plasma protein-A 


196 


97 


8384 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


85 


8385 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


203 


69 


8386 


AF090931 


Homo sapiens 


PRO0483 


145 


77 


8387 


S63758 


Mussp. 


activator 






8388 


W89030 


Homo sapiens 


165^ ^ ^ 


174 


66 


8389 1 G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO:, 


93 


70 


8390 j G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO' — 
6613. 


TsT 


"46 


8391 1 V29245 cdl 

1 




1 5-NOV-1996 Nucleotide sequence of 
the human selenium-binding protein. 






8392 


D38112 










8393 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 


99 


77 


8394 


AK023563 


Homo sapiens 


uimamed protein product 


285 


75 


8395 


G01984 




6065. 






8396 


G00407 




4488. ^ 






8397 


P70494 




factor (BCGF). 






8398 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


57 


8399 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


63 


8400 


W88607 


Homo sapiens 


Secreted protein encoded by gene 74 
clone HHSEG23. 


146 


55 



354 
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SEQ ID 


Accession No. 


Species 


Description 


Smitli- 


% 


NO: 








VVatcrman 


Ideiitit 












y 


"8401 


(300407 


^ 

omo sapiens 


H c-et d rot in SEO ID NO- — 


g"""" 


69 









4488^^^'^^^ ^^^^^^ 








AFl 19851 


~H ^ 

Homo sapiens 


PRO 1 72'^ 


-J^ 


"7^ 


"8403 




G00407 


omo sapiens 


Hiunan secreted protein, SEQ ID NO. 


" 






TnZJTio 


— : 

Homo sapiens 


—77- — '■ — ; r-'s^ 

nbosomal protein S20 


— — 


— 

-^^ 


"§405 


G03714 


Homo sapiens 


Hkunan secreted protein, SEQ ID NO. 


-~ 
















~8406 




AF1I8086 


"h '■ 

omo sapiens 


PRO 1992 


— — 


— 


8407 


GO380O 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


-j— 5 

153 


81 














"8408 




AF090919 


~H ^ 

omo sapiens 


PRO0327 


"96 


— 




G0253S 


omo sapiens 


Human secreted protein, SEQ ID NO: 


127 


~ 

62 














"8410 


Y86248 


"h ' 

omo sapiens 


Human secreted protein HCHPF68 


—— 


-— 

00 








SBQ ID NO' 163 






"84Tl 


QQ^g^^ 


"Horn ien 

omo sapiens 


ffiunan secreted protein, SEQ ID NO. 


-— ; 


~ 














~mT5 — 


AFl 16715 


— -. 

omo sapiens 


PR02829 


— 


— 


"8413 


AF229067 


omo sapiens 


PADI-H protein 


— — 







G0379j 


omo sapiens 


Human secreted protein, SEQ ID NO. 


— — 


— 








• 






"sITl 






— = : 

Homo sapiens 


HDCivtB4:)r 


-pr^ 




~ 

54 





W34499 


Homo sapiens 


Obesity receptor C protein. 




J2 — 


8417 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




272 


57 








■ — jr— p : — Yxr-A- 






"84T8 


S79410 


"m i 

us musculus 


nuclear localization signal binding 


"m 


--— 

54 








protein 


-— 




1419 




AK025047 


: 

omo sapiens 


unnamed protehi product — 


-1^ 


— 

"4? 


8420 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1:>0 


46 









^^^7; — 








AL132841 


Caenorhabditis 




— 

1j5 


— 

56 


— — 




elegans — ^ 










— 
AF130051 


Homo sapiens 




PRO0898 


— 

164 


72 


8 


G00092 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


96 


-— - — 






—^-^ p„,„ f 








AP000616 


— -J 


similar to i\iNG-H2 ringer protein 


— 

108 


— 

100 








ivrl/via (^ArU/oD03^ 






— 1 




TJ ^ 

Homo sapiens 


Human secreted protein, SEQ ID NO: 




75 


— — — 














— — — 

G03798 


— ^ 

Homo sapiens 


Human secreted protein, SEQ ID NO: 




91 




48 








-^^^ — 1 u 1- 1 — uA r ht h • 






"8427 


AF194641 


— o ^ 

omo sapiens 







— 










"ari"w°re°'"™ ig c am 










— : 

omo sapiens 


— t A p m xj(-^. 


-— 


— 














"8429 — 


M64322 


—J —. 

omo sapiens 


' ' h h 


— — 


-— 





AP000616 


ryza sa iva 


similar to RR^G ^2 finger^protein 





— 








RHAIa (AF078683) 






"sZri 


AF09093 1 


— : 

omo sapiens 




— — 




— 


"8432 


J04495 




alpha-globin 




~Tr\ 


8433 


YI44S2 


Homo'sa^iens ^ 




T50 


— 








encoded by gene 17. 






8434 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


137 


48 








gene 22 clone HMSJWl 8. 






8435 


AJ272324 


Homo sapiens 


adaptor molecule- 1 


333 


100 


8436 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


48 








6834. 






8437 


Ml 8085 


Homo sapiens 


glycoprotein Hb 


154 


100 



355 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 




X92485 


Plasmodium 
vivax 


pval 






8439 


G0j136 


Homo sapiens 


Hiiman secreted protein, SEQ ID NO; 


126 


56 


~8440 ' 


(303062 


Homo sapiens 


miman secreted protein SEQ ID NO- — 

7143. 


T05 






AL359782 


Trypanoson^a 


protein rhala. 












6953. 












Human secreted protein #75. 






8444 


AB034633 


Mus musculus 


deubiquitinating enzyme 


129 


100 




U01849 


Trypanosoma 












norvegicus 


snRNP-associated polypeptide N 










Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 










Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 




100 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 










Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO.i63. 






8451 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


162 


91 


8452 


W78226 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 . 


804 


97 


8453 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO.'163. 


109 


100 


8454 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


88 


8455 


AF099505 


Homo sapiens 


colon carcinoma related protein 


100 


63 






Homo sapiens 


uncharacterized bone marrow protein 
BM034 






8457 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


155 


70 







LI 




Didelphis 


protamine 1 


— 

101 




80 


"8459 


YQ261\ 


Homo^s'ap?ens 


Human secreted protein encoded by 

gene 22 clone HMSJW18. 


T45 


"62 


8460 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


121 


75 














8462 


AF016371 


Homo sapiens 


U-snRNP-associated cyclophilin 


108 


70 






Homo sapiens 


SEQ ID NO 485 from W09922243. 






8464 


AFl 30089 


Homo sapiens 


PRO2550 


190 


65 


8465 


AF197913 


Helicoverpa 
armigera nuclear 
polyhedrosis 


basic DNA-binding protein BDBP 


159 


55 


~8466 — 


G02827 


Homo sapiens 


Human secreted protein SEQ ID NO* 
6908. 


183 


~1Q 


8467 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7867. 


97 


60 


8468 


AFl 18082 


Homo sapiens 


PRO1902 


72 


42 


8469 


AF217449 


Schistosoma 
mekongi 


NADH dehydrogenase subunit 6 


107 


40 


8470 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


75 



356 
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SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identit 










_Score 


y 








^339 






"8471 


AF090931 


H mo sapiens 


PRO0483 


"Tfil 

"T?^ 


-— 


"8472 


Y 14482 


orao sapiens 


^nmdedh'^^eaeTl ^^^^^ P"*tein 




~ 














"8473 




— : 

omo sapiens 


umansecr pro ein, EQ I 


"r>7 


-— 

50 

















AF0227S9 


~H ' 


ubiquitin hydrolyzing enzyme I 


—— 


— ~ 




— — 


rn^7^4 


TTin^^aTeiF 

omo sapiens ^ 


pot. ORF VI 


^M9 






8476 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


— 

65 









I"^^ P4T i 









R59842 


-■ 

omo sapiens 


_jo pro ease. — _ 


"Til 


~M 


8478 


G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


— 

112 


— 

56 








^^^Tk — 1"' 






"8479 


KOI 664 


15 hi 

rosop la 


- Ike pro em 


140 


—J 

44 







melanogaster 








~S480 




Homo sapiens 


— —3 . — 


— 


To 








6536^° ^^'^^^ protein, bEQ NO. 






"848i 

— — 


AF090931 


"H ^ 

omo sapiens 




— 


— 




Y27854 


Homo sapiens 


Human secreted protein encoded by 




09 








gene No. 101. 






— ^ — 


M69297 


— J : 1 

Homo sapiens 


5~H SFO ID NO- 


— — 

4zi — 


— 

-^i 


"8484 


G02532 


Homo sapiens 


uman secrete protein, bbKi MO. 












^ - • — = • gpo in NO- — 






"8485 


G03793 


"H ' '~ 

omo sapiens 























AF032906 


omo sapiens 


th - 7 

xfr^/f^^"' • / i 




^ 


8487 


X15917 


Paramecium 


ND2 protem (AA 1-193) (unusual stait 


94 


35 






— _ 


TTTi 









TJ93574 




putative pi 50 


— — 


""^5 


~8489 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NOt 


~T35 


72 








7888. 






8490 


X92485 


Plasmodium 


pval 


120 


85 














~8491 


Y02693 


Homosa iens 

omo sapiens 


Human secreted protein encoded by 


Too 


T? 








gene 44 clone HTDAJD22. 






"8492 


M15530 


Homosa iens 




T^e 


"64 


8493 


AF090895 


Homo sa iens 


PRO0117 


~Wi 


"60 


8494 


G02403 


"75 

orao sapiens 


6484. " ' Q 


Tfifi 


-— — 


"8495 


AFl 16661 


Homosa iens 

Homo sapiens 


PRO 143 8 


~i76 





"8496 




AF130089 


omo sapiens 


PRO2550 


"^98 


"To 




AF090895 




PROOl 17 


Tr^ 


"t^ 


"8498 


AF090930 


Homo sa^l^ns 

omo sapiens 


PRO0478 


"T3^ 


-~ 





AF126163 


omo sapiens 


HHLA3 protein 


~r9 

— ^- 


~79 





G02360 


omo sapiens 


Human secreted protein, SEQ ID NO: 


109 


— 

75 








^ J^-p : ,. ^. 






"8501 


S79410 


Mus musculus 


g 1 g 


"OS 


Tr 














"8502 — 


AL359782 


ryp OS ma 


^"^"^bl (hh 6') 110'> ' • t d 

possi e , y- ; u ^, varian a a, 


~T^7 


— 








complete virion genome. 






"8503 — 


AFl 19851 


Homosa iens 


PRO 1722 


"T7V7 


— 


8504 




Homo sapiens 


Human secreted protein, SEQ ID NO* 


"T58 


"6^ 








7795. 






8505 


AF090931 


Homo sapiens 


PRO0483 


128 


50 


8506 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


126 


56 








6619. 






8507 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


82 








7881. 






8508 


X92485 


Plasmodium 


pval 


83 


70 



357 
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SEQID 
NO: 


Accession No- 


Species 


nescription 


Smith- 
Waterman 

_core 


Identit 








>ivax 








8509 


AL390n4 


Leishmania 


extremely cysteine/valine rich protein 


144 


50 


8510 


AL 160493 


Leishmania 


probable {hhv-6) ul 102, variant a 


124 


86 


8511 


Y27868 


Homo sapiens 


Human secret prote^ by 

gene No. 107. 


"T41 


-jj 


8512 


AF229067 




P ADI-H protein 






8513 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


170 


68 


8514 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


157 


66 


8515 1 AF090930 


Homo sapiens 


PRO0478 


88 


80 


8516 G02832 


omo sapiens 


ftunan secreted protein, SEQ ID NO. 


134 


59 


8517 1 K01664 


Droso hila 

mdanofa^ter 


Bkm'like r te' 

- 1 e pro ein 


~T(V7 





8518 1 M31145 


Homo sapiens 


insulin-like growth factor binding 


460 


96 


8519 


M18193 


Homo sapiens 


totei^'^pha-^l^^in inhibitor heavy 

chain old gene name 'ITT 


118 


~4l 


8520 


AFl 16689 


Homo sapiens 


PR02168 


4i5 — 




8521 j AF130051 




PRO0898 




"13 


8522 i L27428 


Homo sapiens 


reverse transcriptase 


201 


72 


8523 S58722 

! 




terminal, clone XEH.8c} 






8524 


AF161356 




HSPC093 


.-l^Z 1 




8525 


Y02926 


Homo sapiens 


encoded by gene 101. 







8526 


AFl 18086 




PRO 1992 






8527 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


108 


50 


8528 


AB046048 


Macaca 


unnamed portein product 


154 


62 


8529 


AF130089 


Homo"sapiens 


PRO2550 


"150 


"70 


8530 


Y03000 


Homo sapiens 


Fragment of human secreted protein 


111 


55 


8531 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


108 


65 


8532 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


172 


59 


8533 


D38112 


Homosa iens 


n^kar locaHzattonTi^nal^bin^ 


~395 


"91 


8534 


S79410 


Mus musculus 


protein 


Too 


"45 


8535 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 






8536 


G03800 




7881. 






8537 


Z28407 










8538 


AF130089 


Homo sapiens 


PRO2550 


139 


71 


8539 


Q55625_cdl 


Homo sapiens 


22-JUN-1992 Human beta globin 5'- 
UTR-CDS-3'-UTR. 


274 


76 


8540 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


154 


61 


8541 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


69 


8542 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


149 


64 


8543 


G 02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


240 


61 



358 



wo 01/64835 



PCT/llSOl/04927 



SJEQID 






Description 


Smith- 


% 


NO: 








Waterman 
Score 


Identit 

y 








6953. 






8544 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 17 


85 




- 




6d71, 






8545 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


] 10 


95 









7124. 






8546 


AFl 19851 


Homo sapiens 


PRO 1722 


146 


80 


8547 


U49973 


Homo sapiens 


ORF2: ftmction unknown 


105 


86 


8548 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


71 








6613. 






8549 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


53 








6619. 






8550 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


40 








6913. 






8551 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


74 








6566. 






8552 


V00672 


Pan troglodytes 


reading frame protein 4 


142 


92 


8553 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


164 


63 








7795. 






8554 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


71 








6619. 






8555 


AB046100 


Macaca 


unnamed protein product 


144 


58 






fascicularis 








8556 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


322 


73 








7870. 






8557 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


62 








6913. 






8558 


U30888 


Homo sapiens 


tRNA-Guanine Transglycosylase 


276 


86 


8559 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


71 








7881, 






8560 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


72 








7437. 






8561 


G03136 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


57 








7217. 






8562 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


83 








4409. 






8563 


W34499 


Homo sapiens 


Obesity receptor C protein. 


188 


92 


8564 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


124 


56 








SEQIDNO:163. 






8565 


L07946 


Volvox carteri 


histone Hl-I 


108 


38 


8566 


Y36j66 


Homo sapiens 


Fragment of human secreted protein 


132 


77 




- 





encoded by gene 3. 






8567 




Human 


nuclear antigen 1 


100 


43 








herpesvirus 4 












Homo sapiens 


— — — 

Human secreted protein #28. 


136 


67 




8569 




X70343 


Nicotiana 


extensin 


92 


34 


— — 





sylvestris 










D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


381 


87 




V00672 


Pan troglodytes 


reading frame protein 4 


146 


93 





D38116 


Pan paniscus 


ATPase subunit 6 


116 


96 




W50193 


Homo sapiens 


Amino acid sequence of salivary 


132 


52 








protein CON-2. 






"8574 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


"217 


loo — 








SEQ ID NO:163. 






8575 


M58009 


Pan troglodytes 


cytochiome c oxidase subunit II 


271 


84 


8576 


D38112 


Homo sapiens 


ATPase subunit 6 


117 


96 


8577 


D38112 


Homo sapiens 


ATPase subunit 6 


180 


93 


8578 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


179 


91 


8579 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 




88 








SEQIDNO:163. 
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SEQID 
NO: 


Accession N„. 


Species 


Descriptiiin 


Smith- 
Waterman 
Score 


Identit 

y 


8580 


M15530 


Homo sapiens 


B-cell growth factor 


109 


71 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


127 


77 


8582 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


109 


47 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 




71 






Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 




97 






Homo sapiens 




Human secreted protein HCHPF68, 
SEQ ID NO: 1 63 . 


185 


80 


8586 


M28016 


Homo sapiens 


cytochrome b 


129 


92 


8587 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit 11 


153 


96 


8588 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 






8589 


Y08769 


Rattus ^ 
norvegicus 


microvascular endotiiielial 
differentiation gene 2 


109 


51 


8590 1 L00016 


Homo sapiens 


urf4 


176 


94 


8591 


AF150105 


Homo sapiens 


small zmc finger-lilce protein 






8592 


U15306 


Homo sapiens 


NFXl 


123 


95 


8594 


R44519 


Homo sapiens 


Adult human alpha-globin V107S. 




78 


8595 1 Y03000 

i 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


128 


70 


8596 1 S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


151 


90 


8597 1 X67688 


Homo sapiens 


transketolase 


123 


84 


8598 1 L00016 


Homo sapiens 


urf4 


130 


85 


8599 1 Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


109 


54 


8600 Y86248 


Homo sapiens 


Human secreted protein HCHPF58, 
SEQIDNO:163. 


159 


89 


8601 1 G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


54 


8602 


memo 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


115 


43 


8603 


AP000616 


Oryza sativa 


similar to R1NG-I12 finger protein 

RHA1 a (AF078683) 




95 


8604 


S79980 


Bos taurus 


nbosomai protein L37 


109 


84 


8605 


G03807 


Homo sapiens 


Human secreted protein, SBQ It) NO: 
^ 


12/ 


48 


8606 


D38112 


_ , 

Homo sapiens 


ATPase subunit 6 







8607 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


150 


42 


8608 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


58 


8609 


AF229067 


Homo sapiens 


PADI-H protein 


112 


78 


8610 


AF130089 


Homo sapiens 


PRO2550 




86 


8611 


J0307] 


Homo sapiens 


chorionic somatomammotropin CS-5 


710 


97 


8612 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


104 


41 


8613 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 




84 


8614 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 

Krirt. 1 a \J\r U / 5o5J^ 


116 


70 


8615 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


91 


8616 


M280I6 


Homo sapiens 


cytochrome b 


148 


93 


8617 


AF118082 


Homo sapiens 


PRO1902 


94 


66 


8618 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


182 


82 


8619 


X02585 


Xenopus laevis 


unidentified open reading irame 1 (166 


170 


72 



360 
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SEQ ID 


Accession No. 










NO: 








Waterman 


Idcntit 










Score 


y 








aa) 






8620 


AL3 59782 


Trypanosoma 


probable similar to ring-h2 finger 


125 


57 






brucei 


protein rhala. 






8621 


AC004450 


Arabidopsis 


putative extensin 


140 


48 






tlialiana 








8622 


Y86248. 


Homo sapiens 


Human secreted protein HCHPF58, 


195 


79 




., , 




SEQ ID NO: 163. 






8623 


AF130089 


Homo sapiens 


PRO2550 


150 


78 


8624 


L00016 


Homo sapiens 


urf4 


280 


92 


8625 


AFl 30051 


Homo sapiens 


PRO0898 


161 


76 


8626 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


123 


49 








7563, 






8627 


L00016 


Homo sapiens 


urf4 


201 


95 


8628 


D38112 


Homo sapiens 


NADH dehydrogenase subunit4L 


169 


82 


8629 


AF119851 


Homo sapiens 


PRO 1722 


171 


65 


8630 


X55654 


Homo sapiens 


cytochrome C oxidase 11 subunit 


J 20 


92 


8631 


AP000616 


Oiyza sativa 


similar to RING-H2 finger protein 


108 


56 








RHAla(AF078683) 






8632 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


107 


95 


8633 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


86 








7124. 






8634 


AJ224997 


Rattus 


huntlngtin 


73 


59 






non'egicus 








8635 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


155 


80 








SEQ ID NO: 163. 






8636 


M19061 


Ateles geoffroyi 


delta-globin 


100 


40 


8637 


G03800 


Homo sapiens 


Human secreted protdn, SEQ ID NO: 


102 


65 








7881. 






8638 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


50 








7124. 






8639 


AFl 16715 


Homo sapiens 


PR02829 


158 


76 


8640 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


62 








7344. 






8641 


Y09010 


Mus musculus 


Ser/Thr kinase 


193 


95 


8642 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


227 


79 








6908. 






8643 


Y028S6 


Homo sapiens 


Fragment of human secreted protein 


116 


95 








encoded by gene 90. 






8644 


AP000616 ■ 


Oiyza sativa 


similar to RING-H2 finger protein 


134 


88 








RHAla(AF078683) 






8645 


V46311_cdl 


Homo sapiens 


12-jrAN-1998 Human secreted protein 


197 


100 








cDNA from clone D157. 






8646 


Y17379 


Homo sapiens 


NADH:ubiquinone oxidoreductase 5 1- 


240 


97 








kD subunit 






8647 


Y36366 


Homo sapiens 


Fragment of human secreted protein 


142 


68 








encoded by gene 3. 






8648 


AJ3 88520 


Canis familiaris 


Ribosomal protein 


134 


100 


8649 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


56 








7124. 






8650 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


80 








7795. 










Mus musculus 


nuclear localization signal binding 


99 


36 








protein 






8652 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


64 








4573. 






8653 


AF229067 


Homo sapiens 


PADI-H protein 


134 


69 


8654 


AF090942 


Homo sapiens 


PRO0657 


103 


42 


8655 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 


184 


84 








SEQIDNO:163. 







361 



SEQID 
NO: 


Accessfon No. 




Description 


Smlth- 

Watcrraan 

Score 


Mentit 

y 


8656 


U37359 


Homo sapiens 


MREl 1 homologue hMrell 


238 


97 


8657 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


113 


50 


8658 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


83 


8659 


AF161356 


Homo sapiens 


HSPC093 


149 


64 


8660 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613, 


121 


44 


8661 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


107 


37 


8662 


U19098 


Lj'copersicon 
chilense 


proline-rich protein 


72 


48 


8663 


AFl 18082 


Homo sapiens 


PRO1902 


95 


75 


8664 


AFl 30089 


Homo sapiens 


PRO2550 

extremely cysteine/valine rich protein 


307 


79 


8665 


AL390114 


Leishmania 
major 


115 


40 


8666 


X55683 


Lj'copersicon 
esculentum 


extensin (class I) 


102 


32 


8667 


AK026841 


Homo sapiens 


unnamed protein product 


276 


100 


8668 


AL 109963 


Homo sapiens 


dJ1188J21.1 (FSH primary response 
(LRPRl, rat) homolog 1) 


597 


100 


8669 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


123 


58 


8670 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLTL 


103 


65 


8671 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


121 


61 


8672 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


82 


8673 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


211 


93 


8674 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


126 


61 


8675 


G00354 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO: 
4435. 


114 


66 


8676 


X55691 


Lycopersicon 
esculentum 


glycine-rich protein 


85 


38 


8677 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

SEQ ID NO: 163. 


188 


80 


8678 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


183 


75 


8679 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


137 


71 


8680 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


82 


8681 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


136 


88 


8682 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


137 


93 


8683 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


107 


45 


8684 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


141 


90 


8685 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7344. 


127 


55 


8686 


W64486 


Homo sapiens 


Human DR3 protein. 


139 


100 


8687 


Ml 8093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


134 


33 


8688 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


119 


35 


8689 


U36448 


Homo sapiens 


Ca2+-dependent activator protein for 


107 


91 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Mentit 

y 








secretion 






8690 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490, 


80 


43 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


147 


74 


8692 


AP000616 


Oryza sativa 


similar to R1NG-H2 fmger protein 
RHAla(AF078683) 


126 


95 


8693 


AF130089 


Homo sapiens 


PRO2550 


126 


35 


8694 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 




97 


8695 


Ml 1902 


Mus muscLilus 


proline-rich salivary protein 


98 


35 


8696 


W34499 


Homo sapiens 


Obesity receptor C protein. 


263 


83 


8697 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


181 


73 


8698 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


48 


8699 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


88 


8700 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


111 


51 


8701 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


165 


73 


8702 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


133 


62 


8703 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


92 


8704 


D38112 


Homo sapiens 


ATPase subunit 6 


117 


96 


8705 
8706 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


97 


58 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


115 


79 


8707 


AF1I8086 


Homo sapiens 


PR01992 


167 


81 


8708 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


114 


51 


8709 


LI 0908 


Mus musculus 


Gcapl gene product 


179 


38 


8710 



8711 


S79410 


Mus musculus 


nuclear localization signal binding 
proteni 


135 


69 


AF 130089 


Homo sapiens 


PRO2550 


399 


78 


8712 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


107 


95 


8713 


API 18082 


Homo sapiens 


PRO 1902 


123 


58 


8714 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


173 


81 


8715 


V35554_cdl 


Homo sapiens 


21-NOV-1996 Human tumour antigen 
protein (clone 3) encoding cDNA. 


258 


83 


8716 


AF130089 


Homo sapiens 


PRO2550 


143 


51 


8717 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 fmger 
protein rhal a. 


92 


52 


8718 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


147 


71 


8719 


AFl 53208 


Homo sapiens 


GC-rich sequence DNA-binding fector 
candidate 


115 


70 


8720 


AF090931 


Homo sapiens 


PRO0483 


145 


87 


8721 


X92485 


Plasmodium 




125 


79 


8722 


U72543 


Sus scrofa 


ubiqiiitin-like/S30 ribosomal fiision 
protein 


201 


97 


8723 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


223 


85 


8724 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


121 


60 



363 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Wentit 

y 


8725 


AF130051 


Homo sapiens 


PRO0898 


102 


72 




G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


185 


78 


8727 


R24750 


Homo sapiens 


hGH variant #38 - 174S 176Y lOF 14S 
18F 21L 167S 171D 175T 179T. 


1 16 


82 


8728 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


72 


8729 


X53375 


Helianthus 


anther-specific protein SF18 


114 


39 


8730 


S74221 


Homo sapiens 


IK factoi=cyto]dne down-regulating 
HLA class II 


159 


42 


8731 


AC003058 


Arabidopsis 
llialiana 


unknown protein 


166 


89 


8732 


AB027508 


Homo sapiens 


h VPS 11 


194 


50 


8733 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 

protein) 


144 


93 


8734 


AF266479 


Homo sapiens 


rectachrome 1 


116 


74 


8735 


ALl 32841 


Caenorhabditis 


Y15E3A.3 


125 


60 


8736 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


152 


65 


8737 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4435. 


79 


56 


8738 


U93565 


Homo sapiens 


putative pi 50 


190 


52 


8739 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


117 


57 


8740 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


109 


70 


8741 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


165 


61 


8742 


M58009 


Pan troglodytes 


cytoclirome c oxidase subunit II 


213 


97 


8743 


M17887 


Homo sapiens 


acidic ribosomal phosphoprotein (P2) 


106 
116 


74 


8744 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


49 


8745 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


134 


68 


8746 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


135 


71 


8747 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


94 


40 


8748 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


173 


76 


8749 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


166 


67 




Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


272 


57 


8751 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 




71 


8752 


AL451017 


Neurospora 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


123 


37 


8753 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


180 


55 


8754 


Y27868 


Homo sapiens 


Human secreted protein encoded by 

gene No. 107. 


105 


44 


8755 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3, 


103 


78 


8756 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


284 


87 


8757 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


190 


89 
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SEQID 
NO: 






Description 


Smith- 
Waterman 
Score 


Identit 

y 




G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 




40 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
7878. 




61 




AF 132984 


Homo sapiens 


nuclear pore complex interacting 

protein NPIP 


196 


94 




AFl 30079 


Homo sapiens 


PR02852 




65 


8762 


AF 155232 


Pisum sativum 


extensin 


177 


42 


8763 


AP000616 


Oryza sativa 


similar to RrMG-H2 finger protein 
RHAla(AF078683) 


128 


95 






Sus scrofa 


homoiogue of proline/arginine rich 
antibacterial peptides 


96 


38 


8765 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO."163. 


157 


96 






Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSAV18. 


137 


75 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 




95 


8768 


AL161543 


Arabidopsis 
tlialiana 


extensin like protein 


100 


38 






Homo sapiens 


salivary proline-rich protein precursor 


110 


47 


8770 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


178 


78 


8771 


D83407 


Homo sapiens 


a thyroid hormone responsive gene in 
human skin fibroblasts 


133 


100 


8772 


V00662 


Homo sapiens 


URF A6L (NADH dehydrogenase 
subunit) 


172 


73 


8773 


W03642 


Homo sapiens 


Human caiinabinoid GPR N-terminal 
sequence. 


107 


66 


8774 


AL390il4 


Leishmania 


extremely cysteine/valine rich protein 


117 


100 


8775 


AF09093 1 


Homo sapiens 


PRO0483 


175 


88 


8776 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


125 


58 






Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


126 


73 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


154 


70 


8779 


AL359782 


Trypanosoma 


possible (hhv-6) ul 1 02, variant a dna, 
complete virion genome. 


149 


50 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


170 


91 




G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


103 


63 




G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


184 


70 


8783 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


169 


97 


8784 


U55376 


Cacnorliabditis 
elegans 


F16H1 12 gene product 


141 


90 


8785 


U55376 


Caenorhabditis 
elegans 


F 1 6H1 12 gene product 


150 


100 


~8786 — 


AL132841 


Caenorhabditis 
elegans 


■ ■" ■ 3 






8787 


AF266479 


Homo sapiens 


rectachrome 1 


148 


61 


8788 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


106 


95 


8789 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


162 


96 


8790 


AC006693 


Caenorhabditis 


Hypotiietical protein W02H5.e 


180 


94 



365 
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SEQID 


Accession No. 


Species 


Dcscriptioa 


Smith- 




NO: 








Waterman 
Score 


Identit 

y 






elegans 








8791 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 


159 


80 






elegans 


RD protein (nuclear ma-binding 












protein) 






8792 




AF150105 


Homo sapiens 


small zinc finger-like protein 


505 


98 


879j 


AL390935 


Leishmania 


probable proO 195 


85 


31 














8794 


Y19767 J 


Homo sapiens 


SEQ ID NO 485 ftom W09922243. 


115 


55 


8795 


AFl 18082 


Homo sapiens 


PRO 1 902 


123 


75 


8796 


D38112 


Homo sapiens 


ATPase subunit 6 


438 


82 


8797 


AFl 18086 


Homo sapiens 


PRO 1992 


156 


81 


8798 


Y53037 


Homo sapiens 


Human secreted protein clone 


196 


100 








dx219_13 protein sequence SEQ ID 


















8799 


X92485 


Plasmodium 
vivax 


pval 


132 


63 


8800 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


65 








6613. 






8801 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


58 








4478. 






8802 


AFl 16909 


Homo sapiens 


unknown 


116 


63 


8803 


X92485 


Plasmodiiun 
vivax 


pval 


127 


82 


8804 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


140 


67 








7519. 






8805 


Z95114 


Homo sapiens 


bK212A2.2 (apolipoprotein L, 2) 


294 


100 


8806 


AF143369 


Mus musculus 


msgl-related protein 2 


348 


83 


8807 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


76 








4581. 






8808 


AP000616 


Oryza sativa 


similar to RIIslG-H2 finger protein 


129 


91 








RHAla (AF078683) 






8809 


AF230279 


Caenorhabditis 


SWI3-Iike protein; PSA-1 


126 


37 






elegans 








8810 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


168 


82 


8811 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


122 


54 








SEQ ID NO: 163. 






8812 


D38112 


Homo sapiens 


ATPase subunit 6 


207 


100 


8813 


W99376 


Homo sapiens 


Human fibroblast growth factor 2 24 


93 


41 








kD isoform N-terminus. 






8814 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


57 








'^.^2^- .... ..... 






8815 


X54978 


Bos taurus 


17,000 dalton myosin light chain 


133 


96 




M58459 


Homo sapiens 


ribosomal protein S4Y isoform 


260 


100 


8817 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


256 


90 


8818 


U23947 


Mycoplasma 


VsaDl 


132 


31 






pulmonis 










8819 


G03172 


Homo sapiens 


.... ... ... . 

Human secreted protein, SEQ ID NO: 


140 


69 








725 J. 






8820 


L00016 


Homo sapiens 


urf4 


205 


90 


8821 


X55683 


Lycopersicon 


extensin (class I) 


111 


43 






esculentum 








"8822 


- — 
AL 160493 


Leishmania 


r- r— TT q : 

probable (hhv-6) ul 102, variant a 


~94 


— 






major 


DNA, complete virion genome 






8823 


X66285 


Mus musculus 


HCIORF 


94 


53 


8824 


AF061340 


Artibeus 


Fl ATPase subunit 6 


97 


90 






jamaicensis 








8825 


M18093 


Phaseolus 


hydroxyproline-rich glycoprotein 


123 


36 






vulgaris 








8826 


AFl 19855 


Homo sapiens 


PRO 1847 


113 


52 
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SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identit 










Score 


y 


8827 


M28016 


Homo sapiens 


cytociironie b 


140 


93 


8828 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


142 


56 








SEQ ID NO; 163. 






8829 


AK024455 


Homo sapiens 


FLJ00047 protein 


145 


59 


8830 


038112 


Homo sapiens 


ATPase subunit 6 


194 


95 


8831 ; D38112 


Homo sapiens 


ATPase subunit 6 


184 


90 


8832 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


134 


60 








RHAla (AF078683) 






8833 


AFl 30079 


Homo sapiens 


PR02852 


115 


88 


8834 


AB003784 


Drosophila 


Histone H3 


135 


100 






siniulans 








8835 


Y91577 


Homo sapiens 


Human secreted protein sequence 


173 


82 








encoded by gene 2 SEQ ID NO:250. 






8836 


X70343 


Nicotiana 


extensin 


112 


37 






sylvestris 








8837 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


142 


38 


8838 


Y36366 


Homo sapiens 


Fragment of human secreted protein 


148 


74 








encoded by gene 3. 






8839 


W34625 


Homo sapiens 


Human C3 protein mutant FT-3. 


277 


100 


8840 


AFl 16638 


Homo sapiens 


PRO 1546 


78 


41 


8841 


W48351 


Homo sapiens 


Human breast cancer related protein 


138 


61 








BCRB2. 






8842 1 AF130087 


Homo sapiens 


PR02411 


131 


67 


8843 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


72 








5327. 






8S44 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


60 








4433. 






8845 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


66 








7795. 






8846 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


68 








7795. 






8847 


Y0267i 


Homo sapiens 


Human secreted protein encoded by 


146 


70 








gene 22 clone HMSJW18. 






8848 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


131 


68 








SEQ ID NO: 163. 






8849 


Y91577 


Homo sapiens 


Human secreted protein sequence 


273 


79 








encoded by gene 2 SEQ ID NO:250. 






8850 


K02120 


Bovine leukemia 


gag 


201 


54 






virus 








8851 


AF229067 


Homo sapiens 


PADI-H protein 


162 


57 


8852 


AF090931 


Homo sapiens 


PRO0483 


125 


82 


8853 


RMS 84 


Homo sapiens 


I'Gf beta 1 binding protein encoded by 


189 


100 








clone RPA 13. 






8854 


1J5TW 




Mus musculus 


poly(A) polymerase III 


183 


85 


8855 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


193 


72 








6908. 






8856 


G00549 


Homo sapiens 


Human secareted protein, SEQ ID NO: 


104 


66 








4630. 






8857 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


93 


64 








encoded by gene 17. 






8858 


Q55625_cdl 


Homo sapiens 


22-JUN-1992 Human beta globin 5'- 












UTR-CDS-3'-UTR. 






8859 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


152 


67 








gene22cloneHMSJW18. 






8860 


G02828 


Homo sapiens 


Human secreted protein, SBQ ID NO: 
6909. 


112 


72 


8861 


AF130079 


Homo sapiens 


PR02852 


160 


^64 


8862 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


61 



367 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

> 

















6 " 


. . ... 
Homo sapiens 








8864 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


127 


65 


8865 


G00092 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


210 


100 


8866 


AL137351 


Homo sapiens 


hypothetical protein 


327 


100 


8867 


AF090942 


Homo sapiens 


PRO0657 


105 


38 




AK023050 


Homo sapiens 


unnamed protein product 






8869 • 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


101 


65 


8870 


AF216751 


Homo sapiens 


CDA14 


282 


96 


8871 


AL133262 


Homo sapiens 


dJ319D22.1 (CDC5-like protein) 


158 


90 


8872 


V32779_cdl 


Homo sapiens 


1 1 -MAR-1 997 Novel human FO ATP 
synthase sulmnit encoding DNA 

sequence. 


267 


81 


8873 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


118 


58 


8874 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


89 


43 


8875 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


63 


8876 


AK001873 


Homo sapiens 


unnamed protein product 


373 


100 




X52138 


Homo sapiens 


L7a protein 


259 


82 


8878 


X52138 


Homo sapiens 


L7a protein 


273 


87 


8879 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


57 


8880 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


128 


57 


8881 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


71 


8882 


003263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


67 


8883 


AF130089 


Homo sapiens 


PRO2550 


134 


79 


8884 


U36898 


Rattus 
norvegicus 


pheromone receptor VN6 


143 


53 


8885 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


143 


64 


8886 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


138 


92 


8887 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


72 


8888 


R48296 


Homo sapiens 


Human PGF-2/NT-3. 


118 


66 




G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 






8890 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


120 


50 


8891 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


121 


77 


8892 


X53375 

— 


Helianthus 
annuus 


anther-specific protein SFI8 


109 


64 


8893 


AP000616 


Oryza sativa 


—. ; 

Similar to RING-H2 nnger protein 
RHAla(AF078683) 






8894 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6619. 


136 


68 


8895 


AF090894 


Homo sapiens 


PRO01I3 


114 


60 


8896 


AF161356 


Homo sapiens 


HSPC093 


148 


51 


8897 


AFl 16638 


Homo sapiens 


PRO 1 546 


91 


45 


8898 


U22231 


Felis catus 


ribosomal protein S3a 


201 


100 



368 



wo 01/64835 PCT/llSOl/04927 



SEQID 
NO: 


Access>onNo. 


Species 


Description 


Smith- 
Waterman 


Idcntit 

y 


8899 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


75 


8900 


AF090894 


Homo sapiens 


PRO0113 


114 


61 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 








AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 








R90288 


Homo sapiens 


Modified pigment epithelium-derived 
factor (rPEDF). 






8904 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO" 1 63 


155 


86 


~8905 


AP000616 


Orvza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


Tl8 


"66 






Nicotiana 
sylvestris 


extensin 










Homo sapiens 


Human secreted protein, SEQ ID NO: 

6467. 








AF130089 


Homo sapiens 


PRO2550 






8909 


AF220264 


Homo sapiens 


MOS'J--l 


113 


47 






Homo sapiens 


Human secreted protein encoded from 
gene 59. 






8911 


Y I 4482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


77 




8912 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO;250. 


162 


83 


8913 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


104 


77 


8914 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


64 


8915 


AF090944 


Homo sapiens 


PRO0663 


73 


58 


8916 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


76 


8917 
- ,.. 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


84 


47 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 








GO'^8'^7 


Homo sapiens 


Human secreted protein, SEQ ID NOr 

6908. 








AJ245416 


Homo sapiens 


G7b protein 






8921 


D90228 


Homo sapiens 


mitochondrial acetoacetyl-CoA thiolase 
precursor 


190 


100 


8922 


AF229067 


Homo sapiens 


PADI-H protein 


157 


65 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 










Homo sapiens 


Hum^ secreted protein encoded by 

gene 75. 






8925 


AL3 59782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhala. 


116 


63 


8926 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


65 


8927 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO' 163 


184 


59 


8928 


AF287892 


Homo sapiens 


sialic acid binding immunoglobulin- 

like lectin 8 long splice variant 


994 


76 


8929 


AF030131 


Mus musculus 


Plenty of SH3s; POSH 


181 


77 


8930 


J02459 


bacteriophage 
lambda 


E (capsid component;34I) 


814 


98 


8931 


U13866 


unidentified 
cloning vector 


non-ftmctional lacZ alpha peptide 


112 


59 



369 
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SEQID 
NO: 


Accession No. 


Species 




Smitli- 
Waferman 


Identit 

y 


8932 


U67540 


Methanococcus 
jannaschii 


conserved hypothetical protein 


98 


56 


8933 


Z74472 


Caenorhabditis 
elegans 


predicted using Genefinder-contains 
similarity to Pfam domain: PF01391 
(Collagen triple helix repeat (20 
copies)), Score=84.3, E-value=8. le-22, 
N=2; PF0I484 (Nematode cuticle 
collagen N-terminal domain), 
Score=34.2, E-value=9.9e-07, 
• N=l~cDNA EST yklOOdlO J comes 
from this gene; cDNA EST 
EMBL:D65979 comes from this 
gene-cDNAEST ykl00d4.3 comes 
from this gene; cDNA EST ykl23g7.3 
comes from this gene-cDNA EST 
yk58e6.3 comes from this gene; cDNA 
EST yk67a5.3 comes from this 
gene-cDNA EST EMBL:T01590 
comes from this gene; cDNA EST 
yk60cll.3 comes from this 
gene-cDNA EST yk89cl0.3 comes 
from this gene; cDNA EST yk85g9.3 
comes from this gene~cDNA EST 
yk83e2.5 comes from this gene; cDNA 
EST yk75f2.3 comes from this 
gene-cDNA EST CEESK90F comes 
from this gene; cDNA EST yk79f5.5 
comes from this gene-cDNA EST 
yk64bl2,3 comes from this gene; 
cDNA EST yk64h9.3 comes from this 
gene-cDNA EST yk74dl.3 comes 
from this gene; cDNA EST yk75e4.3 
comes from this gene-cDNA EST 
yfc83e2.3 comes from this gene; cDNA 
EST yk91cl2.3 comes from this 
gene-cDNA EST yk98f2.3 comes from 
this gene; cDNA EST yk60h4.5 comes 
from this gene-cDNA EST yk60cl 1.5 
comes from this gene; cDNA EST 
yk64h9.5 comes from this gene-cDNA 
EST yk64a9.5 comes from this gene; 
cDNA EST yk64bl2.5 comes from this 
gene-cDNA EST yk67a5,5 comes from 
this gene; cDNA ESTyk74dl.5 comes 
from this gene-cDNA EST yk75e4.5 
comes from this gene; cDNA EST 
yk75f2.5 comes from this gene~cDNA 
EST yk79e6.5 comes from this gene; 
cDNA EST yk80d2.5 comes from this 
gene-cDNA EST yk85g9.5 comes 
from this gene; cDNA EST yk89cl0.5 
comes from this gene-cDNA EST 
yk89c4.5 comes from this gene; cDNA 
EST yk90el 1.5 comes from this 
gene-cDNA EST yk91cl2.5 comes 
from this gene; cDNA EST yk96e2.5 
comes from this gene-cDNA EST 
yk98c2.5 comes from this gene; cDNA 
EST yk98f2.5 comes from this 
gene-cDNA EST yk98hl 1.5 comes 


101 


38 



370 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Menfit 

y 








from this gene; cDNA EST ykl07e7.5 
comes from this gene-cDNA EST 
yk313g9.3 comes from this gene; 
cDNA EST yk3 13g9.5 comes from this 
gene-cDNA EST yk3 12gl.3 comes 
from this gene; cDNA EST yk312gl.5 
comes from this gene-cDNA EST 
yk293c4.3 comes from this gene; 
cDNA EST yk293c4.5 comes from this 
gene 






8934 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


127 


63 


1 




Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 






8936 


X03741 


Homo sapiens 


myosin heavy chaiii (63 5 AA) 


1343 


99 




M29295 


norvegicus 


small nuclear ribonucleoparticle- 
associated protein 






8938 


D00097 


Homo sapiens 


serum amyloid P component 


188 


100 


8939 


W34499 


Homo sapiens 


Obesity receptor C protein. 


255 


83 




L00016 


Homo sapiens 




293 




8941 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


164 


94 


^ 1 


AB007148 


Homo sapiens 


ribosomal nrotein S3a 






8943 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


213 


95 




M10119 


Homo sapiens 


ferritin light subunit 






8945 


M74718 


Homo sapiens 


SEF2-1 A protein 


118 


84 


8946 


AL355178 


Homo sapiens 


dJ947L8.1.6 (novel CUB and Sushi 
(SCR repeat) domain protein) 


366 


51 


8947 


Y56033 


Homo sapiens 


Polyproline-rich domain fix)m dynamin. 


70 


51 


8948 


X06547 


Homo sapiens 


glutathione S-transferase (GST-Pi) (AA 


172 


97 


8949 


AF119851 


Homo sapiens 


PR01722 


154 


78 




G 03 043 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7124. 


123 


88 


8951 


X54802 


Homo sapiens 


cytochrome-c oxidase subunit IV 


116 


95 




AJOl 1580 


bacteriophage 
PS34 


gp23 






8953 


Y59441 


Homo sapiens 


Human deltaS fragment #5. 


271 


96 






bacteriophage 
lambda 


B (capsid component;533) 


807 


99 




AL132841 


Caenorhabditis 
elegans 








8956 


W34499 


Homo sapiens 


Obesity receptor C protein. 


166 


89 






Plasmodium 
vivax 


pval 






8958 


ABO 10340 


Mus musculus 


mszfSl 


122 


60 




AL359782 


Trypanosoma 


probable similar to ring-h2 finger 

protein rhala. 










Cricetulus sp. 


ORE (AA 1-286) (1 is 2nd base in 
codon) 








AF09093 1 




PRO0483 






8962 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


130 


72 


8963 


AF 170724 


Homo sapiens 


cell cycle checkpoint protein CHFR 


1364 


99 


8964 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


151 


43 


8965 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


250 


74 



371 
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SEQID 
NO: 


Accession No. 






Waterman 
Score 


% 

Identit 

y 


8966 


Y87233 


Homo sapiens 


Human signal peptide containing 
protein HSPP- 1 0 SEQ ID NO: 1 0. 


445 


100 


8967 


AF127085 




Mus musculus 


semapliorin cytoplasmic domain- 
associated protein 3B 


112 


80 


8968 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO' 1 63 


109 


46 


8969 


AF090894 


Homo sapiens 


PRO0113 


153 


61 






Homo sapiens 


SEQ ID NO 448 from W09922243. 




49 


8971 


U38544 


Mus musculus 


alpha-1 type 1 collagen 


102 


52 


8972 


X92485 


Plasmodium 

'^'"^F^ ... . 


pval 

.. 


113 


50 


1 Q 


AF241228 


Xenopus laevis 


Friend of GATA 


143 


100 




AF090930 


Homo sapiens 


PRO0478 


79 


41 




AF238374 


Homo sapiens 


mutant fibroblast growth factor 
receptor 3; FGFR3 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 






8977 


AB001838 


Homo sapiens 


recovenn 


179 


80 


8978 


AF 11 8082 


Homo sapiens 


PRO 19 02 


73 


70 




AFl 16715 


Homo sapiens 


PR02829 


102 


76 


8980 


AF090942 


Homo sapiens 


PRO0657 




61 


8981 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


158 


65 


8982 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


108 


84 


8983 


A00127 


Homo sapiens 


melanoma associated antigen P97 


366 


94 


8984 


G00407 


Homo sapiens 


Human secreted protein, SCQ ID NO: 
4488. 


158 


88 


8985 


Y59778 


Homo sapiens 


Human normal oyarian tissue derived 
protein 55. 


71 


61 


8986 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


128 


42 


8987 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


50 


8988 


W50193 


Homo sapiens 


Amino acid sequence of salivaiy 

protein CON-2. 


80 


51 






Homo sapiens 


Amino acid sequence of the human 

MMSCI protein. 


160 


100 


8990 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


130 


67 







Homo sapiens 


NADH dehydrogenase subunit 4L 






8992 


AL359782 


Trypanosoma probable similar to ring-h2 fmgcr 
brucei \ protein rlial a. 


111 


62 


8993 


M28016 


Homo sapiens | cylochrome b 


100 






Homo sapiens Human secreted protein HCHPF68, 
L SEQ lDl\iO:163. 










Homo sapiens NADH deiiydrogenase subunit 4L 






8996 


Y16430 


Mus musculus ; ribosomal protein L35a 


101 


65 


8997 


Y45318 


Homo sapiens 


Human secreted protem fragment 
encoded fsom gene 18. 


96 


76 




AL049659 


Arabidopsis 
tlialiana 


putative protein 


126 


42 


8999 


AE003500 


Drosophila 
meianogaster 


CG12379 gene product 


285 


67 


9000 


R37991 


Homo sapiens 


Sequence of a new cytokine which 
inhibites induction by gammainterferon 
of expression of Class II 
histocompatibility antigens. 


124 


48 



372 



wo 01/64835 PCT/llSOl/04927 



SEQID 1 Accession No. 
NO: 


Species 


Description 


Smith- 


% 
y 


9001 ' AF090930 


Homosa iens 


PRO0478 







9002 1 AP000381 


Arabidopsis 


gb|AAF57656.1~gene_id;K17E12.13~s 


193 


29 


9003 ' M17885 


Homo sapiens 


aci'dic rlbosomgrphosph^^^ (PO) — 


""^64 




9004 1 D63424 


Homo sapiens 


glycogen synthase kinase 3alpha 


126 


81 


9005 


AP000616 


Oiyzasahva 


similar to RING-H2 finger protein 
RHAla(AF078683) 






9006 


ALl 17637 


Homo sapiens 


hypothetical protein 


257 


92 


9007 


AE001381 


Plasmodium 


hypothetical protein 


141 


26 


9008 


W34499 


Homo sapiens 


Obesity receptor C protein. 


152 


86 


9009 


GQ2827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


199 


80 


9010 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

1^^' o, 


202 


94 


9011 


AF090931 


— ; 

Homo sapiens 


PRO0483 


... 


68 


9012 


M11717 


Homo sapiens 


heat shock protein 






9013 


G037S9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


75 


9014 


R59842 


Homo sapiens 


ApoE4Ll protease. 


86 


85 


9015 


AL160371 


Leishmania 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 






9016 


AF229067 


Homo sapiens 


PADI-H protein 


119 


54 


9017 


AF090944 


Homo sapiens 


PRO0663 


138 


54 


9018 


AL160371 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


103 


50 


9019 


X924S5 


Plasmodium 

j}y^^ .., 


pval 


JOl 


73 


9020 


G02532 


Homo sapiens 


— ; 

Human secreted protein, SEQ ID NO: 

6613. 






9021 


AF073519 


Homo sapiens 


small EDRK.-rich factor 1, long isofoim 


■ 


... 


9022 


AF229067 


Homo sapiens 


PADI-H protein 






9023 


M33112 


Homo sapiens 


amyloid-beta ja-oteln 


105 


67 


9024 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 






9025 


AF161361 


Homo sapiens 


HSPC098 


130 


55 


9026 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 






9027 


AF130051 


Homo sapiens 


PRO0898 






9028 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


132 


49 


9029 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


58 


9030 


AC002310 


Homo sapiens 


Unknown gene product 


155 


85 


9031 


X92485 


Plasmodium 


pval 




42 


9032 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 






9033 


G00487 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4568. 






9034 


GO0333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


61 


9035 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


111 


66 


9036 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


173 


59 


9037 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


100 


54 


9038 


AF220264 


Homo sapiens 


MOST-1 


96 


80 



373 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 











"71^^^ 


y 





AF 130051 


omo sapiens 


PRO0898 




68 




AFl 16715 


Homo sapiens 


PR02829 


— — 

"T§^ 




— — — 


L76200 


Homo — p_en 


guanylate kinase _ — — 




_83 




G03415 


omo sapiens 




— — 


55 








7496™ ^ 






—— — 


AFl 52513 


omo sapiens 


^— jr — : Tc-T. 5 

protocadnerin gamma Ao snort form 




100 


— 















AFl 30079 


omo sapiens 


PR02852 


— — 




— 


9045 




Homo sapiens 


Human breast cancer related protein 


128 


64 









BCRB2. 









AF090942 


—J : 

omo sapiens 


PRO0657 


— — 


— 

_&5 





AF2 84223 


Homo sapiens 


terra-like protein 


"t~q 

--'^^ 


100 


904 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


128 


68 










RHAla (AF078683) 










H m sa iens 

omo sapiens 


NADH dehydrogenase subunit 2, ND2 — 


-p— 


— 




9050 





Homo sapiens 


Human secreted protein, SEQ iD NO: 


-— 

123 


~ 

75 










— . _ 









AL034344 


, 

Homo sapiens 


dJH8B18.1 (forkheadbox CI) 


323 


100 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


106 


62 








SEQ ID NO: 163. 






"^^^I 


AF225910 


-— 

Mus musculus 


DAZ-associated protein 1 


— 


P ... 


9054 


AF130087 


Homo sapiens 


PR024 1 1 


147 


70 


9055 


W80400 


Homo sapiens 


A secreted protein encoded by clone 


858 


97 








dd71 2. 






9056 




Nicotiana 


extensm 


1 13 


33 






sylvestris 








9057 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


128 


57 






. 


KrlAia \J\T\i lb\>i>i) 






9058 


R95913 


Homo sapiens 


Neural thread protein. 


142 


38 


9059 


AP000616 


Oryza sativa 


similar to RrNG-H2 finger protein 


131 


56 








RHAla (AF078683) 






9060 


1 1 

002538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


50 


.. .. 






. 






9061 


X70343 


.. , . 
Nicotiana 


extensin 


103 


42 


... .. 


. . -. . . 


sylvestris ■ 








9062 


AJ271872 


Nicotiana 


extensin 


129 


33 






sylvestris 








"906^^ 


Y86248 


Homo sapiens 


Human secreted rotein HCHPF68 


151 


73 


— — 




— — 


"5 en rr^ mo- i fxi 








AF266164 




densin- 180 variant D 




144 


■- 
96 







norve^c^icus 








9 


Y8624S 


-p ■■ 

Homo sapiens 


Human secreted protein HCHr r68, 


— 


-— 

68 









SEQ IDNO:163. 








M28016 


— -. 




— — — 




— 

"5? 


"9067 

— 


M28016 


Homo sa^'i^s 

omo sapiens 


cvt'^hr°me b 


— 1 






Y86248 


omo sapiens 


Human "ecreted protein HCHPF68 




— 









SEQ ID NO* 163 






— 

9069 


D38112 


Ti ■■ 1 

Homo sapiens 


NADH dehydrogenase subunit 4L 




Ill 




100 


9070 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


72 




. . - 


..- .. .. 


^^27. 






71 


Yl 2 


Homo sapiens 


Fragment of human secreted protein 


147 


64 














9072 


AL390114 


Leishmania 


extremely cysteine^valine rich protein 


109 


69 














9073 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


171 


85 








8159. 






9074 


Y02671 


Homo sapiens 


Human secreted protein encoded by 




88 








gene 22 clone HMSJW18. 






9075 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


137 


62 



374 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 














9076 


L38941 


Homo sapiens 


ribosomal protein L34 


139 


88 


9077 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


164 


86 


9078 


X55684 


Lycopersicon 


extensin (class I) 


72 


35 


~9079 


Y86248 


Homo'sapiens 


Human secreted rotein HCHPF68 

SEQ ID NO: 163. 


T76 


— 


9080 






6953. 








AFl 30089 




PRO2550 






9082 


X92485 


Plasmodium 


pval 


125 


71 


9083 


AF2 16650 


Homo sapiens 


MTAP 


154 


73 




Y86248 


Homo sapiens 


Human secFeted protein HCHPF683 
SEQlDNO:163. 










lambda'^'^^^^ 


V. (capsid component*34I) 


191 


73 


9086 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

-ir-^ 3 : onr\ TT^ xt^ 


138 


61 


9087 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


TIo 




9088 


G03793 


Homo sapiens 


7874^" ^^'^'^^^^'^ protein, SEQ ID NO. 


144 


61 


9089 1 AK024455 


Homo sapiens 


FLJ00047 protein 


145 


82 


9090 


AF265575 




isoform 






9091 


AF090895 




PRO0117 






9092 


Y86248 


Homo sapiens 


Ifianan secreted protein HCHPF68, 
SEQ ID NO* 163 


211 


86 


9093 


AFl 3 0079 


Homo sapiens 




TST 


^ . 


9094 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO' 163 


180 


82 


9095 


Y08769 


Rattus 


microvascular endothelial 


97 


42 


9096 


X65551 


Hom?sapiens 


mtigeTortli "monod^ antibody I<i~ 
67 






9097 


W34499 










9098 


Y08769 


Rattus 


microvascular endothelial 


127 


63 


9099 


Y02999 


HonufsapLns 


Fra°menro'f hum^ secreted protein 

encoded by gene 121. 


T29 


"78 


9100 


AF229067 




PADI-H protein 






9101 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


157 


65 


9102 


Y60010 


Homo sapiens 


Human endometrium tumour EST 


69 


63 


9103 


R95913 


Homo sapiens 


Neural thread protein. 


124 


72 


9104 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.Sc} 






9105 


G03800 


Homo sapiens 


Human sscreted protein, SEQ ID NO: 


156 


88 


9106 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6619. 


170 


57 


9107 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


163 


89 


9108 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6S34. 


123 


41 


9109 


L41944 


Homo sapiens 


interferon receptor 


162 


66 



375 
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SEQID 1 Accession No. 
NO: 


Specks 


Description 


Smith- 
Waterman 
Score 


% 

Identit 


9110 


AK000496 


— : 

Homo sapiens 


unnanied protein product 






9111 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
-Ir^ A = — — 


81 


62 


9112 


G02538 


: 

Homo sapiens 


Human secrete protein, EQIDNO: 


— — 


— ■ 

52 


9113 


M26361 


Mus musculus 


LINE/Ig H-chain fusion protein 


105 


50 


9114 


AF073519 


Homo sapiens 




t4h — 




9115 


AF119900 




PR02822 




~ 


9116 


G00397 


Homo sapiens 

omo sapiens 


Human secreted protein, SHQ ID NO: 

4478. 


58 




9117 


L00016 


Homo sapiens 








9118 


AC006276 


Homo sapiens 


R28379 3 


138 


100 


9119 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 






9120 


AFl 19855 


Homo sapiens 


PRO 1847 






9121 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


105 


84 


9122 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 142. 


115 


78 


9123 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


53 


9124 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


40 


9125 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


96 


63 


9126 


AF130089 


Homo sapiens 


PRO2550 




78 


9127 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


98 


56 


9128 

"9T29 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 




, 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


49 


9130 


AK024455 


Homo sapiens 


FLJ00047 protein 


130 


78 


9131 j W88435 


Homo sapiens 


Disease associated protein kinase 


j36 


100 


9132 


U63332 


Homo sapiens 


super cysteine rich protein; SCRF 


144 


59 


9133 


Y94922 


Homo sapiens 


Human secreted protein clone pv6_l 
protein sequence SEQ ID NO:50. 


154 


90 


9134 


AF3 17425 


■n : 

Homo sapiens 








9135 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


174 


54 


9136 


AF116715 


Homo sapiens 


PR02829 


116 


71 


9137 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7888. 






9138 


AFl 30089 


Homo sapiens 


PRO2550 






9139 


AF161221 


Homo sapiens 


kallikrein-like protein 6 


204 


90 


9140 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 






9141 


U27486 


Pseudorabies 
virus 








9142 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


150 


96 


9143 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


"220 


~ 


9144 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


163 


84 


9145 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


180 


77 


9146 


U52077 


Homo sapiens 


mariner transposase 


365 


78 


9147 [ W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


143 


67 



376 



wo 01/64835 PCT/llSOl/04927 



SEQ ID 


Accession No. 






Smith- 


% 










Waterman 


Identit 










Score 


y 








clone HPMBQ32. 






9148 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


100 


81 








RHAla(AF078683) 






9149 


YD 8062 


Homo sapiens 


Human PR0245 protein fragment 


118 


66 








derived from DNA35638. 






9150 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


105 


54 








SEQ ID NO: 163. 






9151 


AL359782 


Trypanosoma 


possible (hhv-6) u 1 102, variant a dna. 


101 


70 






brucei 


complete virion genome. 






9152 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


66 








6613. 






9153 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


160 


72 








SEQIDNO:163. 






9154 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


106 


63 








gene22cloneHMSJW18. 






9155 


Y02926 


Homo sapiens 


Fragment of human secreted protein 


95 


80 








encoded by gene 101. 






9156 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


67 








7870. 






9157 


G03924 


Homo sapiens 


Human secreted protein, SEQ ijb NO: 


265 ^ 


80 








8005. 






9158 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


72 








4457. 






9159 


AF130089 


Homo sapiens 


PRG2550 


180 


61 


9160 


AF116715 


Homo sapiens 


PR02829 


184 


82 


9161 


KOI 664 


Drosophila 
melanogaster 


Bkm-lilte protein 


200 


69 


9162 


D88146 


Homo sapiens 


UDP-galactose transporter 2 


101 


100 


9163 


AFl 07406 


Homo sapiens 


GW128 


155 


60 


9164 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


51 








6983. 






9165 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


60 








7795. 






9166 


AF2 10651 


Homo sapiens 


NAG 18 


83 


65 


9167 


'V00672 


Pan troglodytes 


reading frame protein 4 


119 


95 


9168 


Y01155 


Homo sapiens 


Secreted protein encoded by gene 4 


225 


74 








clone HIBCW32. 






9169 


AF161356 


Homo sapiens 


HSPC093 


100 


75 


9170 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


174 


66 








7795. 






9171 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


115 


86 


.... 






RHAla (AF078683) 






9172 


AFl 19855 


Homo sapiens 


PRO 1 847 


178 


59 


9173 


AF130087 


Homo sapiens 


PR02411 


129 


48 


9174 


AF130079 


Homo sapiens 


PR02852 


149 


59 


9175 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


90 








7214. 






9176 


AF090894 


Homo sapiens 


PRO0113 


118 


65 


9177 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


105 


46 






bmcei 


protein rhal a. 






9178 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


52 














9179 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


70 








7795. 






9180 


AF116715 


Homo sapiens 


PR02829 


127 


64 


9181 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


71 








5327. 






9182 


AF229067 


Homo sapiens 


PADI-H protein 


164 


64 


9183 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


80 



377 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


% 

Idcntit 

y 














9184 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


106 


64 


9185 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


132 


57 




G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 




65 




AF090931 


Homo sapiens 


PRO0483 


1 17 


68 


9188 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


67 


9189 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


47 


9190 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


269 


85 




Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


150 


40 


9192 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4633. 


153 


73 


9193 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


137 


62 


9194 


Ml 8093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


103 


31 


9195 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


98 


47 


9196 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7437. 


107 


48 


9197 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


111 



164 


68 



83 


9198 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


9199 


J02459 


bactcriopiiagc 
lambda 


E (capsid componcnt;341) 


235 


85 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 




65 






Homo sapiens 


HS24/P52 




75 


9202 


X67640 


Homo sapiens 


HS24yP52 


231 


81 


9203 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7879. 


128 


66 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


107 


74 


9206 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


186 


66 


9207 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


136 


56 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 








AFl 16715 


Homo sapiens 


PR02829 






9211 


G03714 


TSomo sapiens 


Human secreted protein, SEQ ID NO: 


135 


70 


9212 


AF118086 


Homo sapiens 


PR01992 


84 


57 




G 03 043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 






9214 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


247 


92 


9215 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


74 


76 


9216 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


134 


74 



378 
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SEQID 
NO: 


Accession No. 


Species 


Descriptiun 


Smith- 
Waterman 


Identit 

y 


~92T7 


G02832 


"Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 












clone HCACJ81. 








AP000616 




RHAla(AF078683) 






9220 


L41944 


Homo sapiens 


interferon receptor 


141 


50 




AF130079 




PR02852 





70 


"5551 — 




Homo sapiois 


4462. 






9223 


G02872 


Homo sapiens 


6953. 


142 


56 


9224 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 

protein) 


104 


63 


q^^^ 


U80761 


Homo sapiens 


CTG26 alternate open reeling frame — 


"T^T 


-^1 




G03790 


omo sapiens 


^ mnan secre pro ein, Q 






9227 


AK027208 


Homo sapiens 


unnamed protein product 


138 


68 








encoded by gene 9 SEQ ID NO: 111. 






9229 


(300381 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


127 


88 


9230 


AK025047 


Homo sapiens 


unnamed protein product 


179 


64 




AF090894 




PROD 1 1 3 






9232 


AB046048 


Macaca 


unnamed portein product 


176 


54 


9233 


AB008227 


Adiantum 


Extensin 


93 


34 


9234 


AF 13 0089 


Homo sapiens 


PRO2550 


98 


73 




W88627 


Homo sapiens 


clone HPMBQ32. 








G00397 




4478. ^ 












7795. 








G03790 




7871. 








AE003499 


melanogaster 


CGI 2706 gene product 








G00361 




4442. 






9241 . 


X92485 




pval 


112 


61 


9242 


AF055985 


Onchocerca 


pyrrolidone-rich antigen 


69 


42 


9243 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


140 


80 


9244 


Y86248 


Homo sapiens 


SEQ IDNO:163. 


187 


92 


9245 


M22332 


Homo sapiens 


unknown protein 


117 


40 


9246 






encoded by gene 27. 






9247 


AL136173 


Homo sapiens 


dJ914B9.1 (novel protein (HSPC162) 
similar to Rattus norvegicus 
bithoraxo id-like protein) 


305 


76 


9248 


AF116661 


Homo sapiens 


PR01438 


141 


53 


9249 


Y86248 


Homo salens 


Human secreted protein HCHPF68, 
SEQIDNO;163. 


144 


76 



379 
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SEQIB 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


Identit 

y 


9250 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


245 


89 


9251 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


245 


92 


9252 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


128 


63 


9253 I G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


124 


72 


9254 . AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


110 


90 


9255 ( Y27868 

i 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


132 


65 


9256 1 X52164 


Mus musculus 


Q300 protein (AA 1-77) 


106 


62 


9257 1 AF217374 

1 


Acanthaster 
planci 


cytochrome oxidase subunit I 


259 


98 


9258 1 W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


64 


9259 1 G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


107 


54 


9260 AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


129 


66 


9261 1 M28016 


Homo sapiens 


cytochrome b 


148 


93 


9262 1 Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 




68 


9263 1 R55749 


Homo sapiens 


Extracellular domain of human high 
affinity IFN-gamma receptor. 


1 16 


83 


9264 j K02403 


Homo sapiens 


complement component C4A 


9025 


99 


9265 X92485 


Plasmodium 


pval 


161 


55 


9266 1 AK024455 


Homo sapiens 


FU00047 protein 


98 


83 


9267 1 G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


103 


51 


9268 


AF130O89 


Homo sapiens 


PRO2550 


124 


77 


9269 


AL359782 


Trypanosoma 


probable similar to ring-li2 finger 
protein rhala. 


94 


54 


9270 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


96 


81 


9271 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 




53 


9272 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 






9273 


AF229067 


Homo sapiens 


PADI-H protein 






9274 


Y36203 


Homo sapiens 


Human secreted protein #75. 


154 


77 


9275 


X92485 


Plasmodium 


pval 


157 


72 


9276 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 


132 


60 


9277 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


156 




9278 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163, 


138 


84 


9279 


Y20772 


Homo sapiens 


Human neurofilament-M mutant 
protein fragment 54. 


112 


70 


9280 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


146 


59 


9281 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


103 


63 


9282 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


89 


60 


9283 


X92485 


Plasmodium 




124 


79 



380 
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SEQID 
NO: 


Accession No. 


Species 


Oescription 


Smitli- 
Waterman 


Uentit 






vivax 






~ 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 




75 




AFl 30087 


Homo sapiens 


PR02411 


138 


78 


9286 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


140 


54 


9287 


AF004340 


Homo sapiens 


ATPase 6/8 


98 


100 


9288 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


68 


9289 


J02459 


bacteriophage 
lambda 


] (tail component;223) 


457 


97 


9290 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


177 


84 


9291 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


107 


87 


9292 


D38112 


Homo sapiens 


NADH dehydrogenase subunit4L 


311 


91 


9293 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


139 


61 


9294 


R289I6 


Homo sapiens 


Type III procollagen (prior art). 


117 


33 


9295 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


126 


77 


9296 


AFl 30089 


Homo sapiens 


PRO2550 


137 


52 


9297 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


93 


79 


9298 


U22376 


Homo sapiens 


alternatively spliced product using exon 
13A 


239 


77 


9299 


AL359782 


Trypanosoma 
brucei 


possible (hhv-e) ull02, variant a dna, 
complete virion genome. 


134 


69 


9300 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


105 


24 


9301 


D90282 


Homo sapiens 


carbamyl phosphate synthetase I (EC 


276 


96 


9302 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


231 


93 


9303 


M13077 


Photinus py talis 


Luciferase 


557 


100 


9304 


L00016 


Homo sapiens 


urf4 


303 


93 


9305 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


141 


69 


9306 


L46721 


Homo sapiens 


mucin 


204 


80 


9307 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


110 


84 


9308 


J02459 


bacteriophage 


E (capsid component;341) 


218 


93 


9309 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


78 


66 


9310 


AF130075 


Homo sapiens 


FR02532 


92 


69 




G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


97 


51 


9312 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


201 


79 


9313 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


95 


9314 


AK024455 


Homo sapiens 


FLJ00047 protein 


128 


68 


9315 


D3S112 


Homo sapiens 


ATPase subunit 6 


223 


81 


9316 


AL 1 60493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


113 


85 


9317 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


117 


88 


9318 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


257 


72 



381 
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SEQID 
NO: 


Accession No. 




Description 


Waterman 
Score 


y 


9319 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


61 


9320 


AK001116 


Homo sapiens 


unnamed protein product 


96 


40 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6616. 


174 


60 


9322 


AF037081 


Pan troglodytes 


ribonuclease k6 precursor 


109 


100 


9323 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


149 


63 


9324 


AF153056 


Fasciola hepatica 


tegumental antigen 


60 


44 




G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


200 


69 


9326 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


80 


57 


9327 


AF118086 


Homo sapiens 


PR01992 


139 


60 


9328 


G02360 


Homo sapiens 


Human sea-eted protein, SEQ ID NO: 
6441. 


110 


73 


9329 


API 07406 


Homo sapiens 


GW128 


142 


56 


9330 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


75 


9331 


X92485 


Plasmodium 


pval 


121 


56 


9332 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


179 


52 


9333 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7795. 


113 


67 


9334 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


108 


46 


9335 


M22332 


Homo sapiens 


tmknown protein 


158 


60 


9336 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

4409. 


95 


46 


9337 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


160 


76 


9338 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


150 


68 


9339 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6619. 


169 


59 


9340 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


154 


64 


9341 


AK024455 


Homo sapiens 


FLJ00047 protein 


117 


79 


9342 


G02532 


Homo sapiens Human secreted protein, SEQ ID NO: 
6613. 


191 


76 


9343 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


133 


82 


9344 


AF032457 


Homo sapiens 


BimEL 


123 


100 


9345 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4438. 


117 


46 


9346 


G00357 


Homo sapiens Human secreted protein, SEQ ID NO; 
4438. 


153 


75 


9347 


X53020 


Plasmodium 
falciparum 


erythrocyte membrane-associated 
antigen 


68 


75 


9348 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


120 


70 


9349 


AF130089 


Homo sapiens 


PRO2550 


116 


69 


9350 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


165 


72 


9351 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


94 


82 


9352 


AL080242 


Homo sapiens 


bA554C12.1 (RBXl orROCl (ring- 
box or ring finger protein 1)) 


184 


60 



382 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Hentit 

y 


9353 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


125 


64 


9354 


X71442 


Rattus 
norvegicus 


ORF 1; putative 


96 


40 


9355 


G03793 


Homo sapiens 


Human secreted protein, SEQ TD NO: 
7874. 


98 


46 


9356 


AF090931 


Homo sapiens 


PRO0483 


148 


76 


9357 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


144 


56 


9358 


AC005175 


Homo sapiens 


TA2R HUMAN, BETA ISOFORM; 
TXA2-R; PROSTANOID TP 
RECEPTOR 


153 


63 


9359 


AFl 16715 


Homo sapiens 


PR02829 


125 


66 


9360 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

gene 22 clone HMSJW18. 


143 


72 


9361 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


178 


75 


9362 


Y 14437 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27. 


90 


57 


9363 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7437. 


86 


76 


9364 


AF229067 


Homo sapiens 


PADI-H protein 


116 


51 


9365 


AF010144 


Homo sapiens 


neuronal thread protein AD7o-NTP 


248 


64 


9366 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


104 


76 


9367 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 


76 


9368 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


105 


74 


9369 


R86406 


Homo sapiens 


Human matrix metalloprotease 
MMPmla. 


108 


83 


9370 


AFl 81977 


Hepatitis GB 
virus C 


E2 protein 


95 


38 




AF090931 


Homo sapiens 


PRO0483 






9372 


AF090894 


Homo sapiens 


PRO0113 


133 


57 


9373 


AK002154 


Homo sapiens 


unnamed protein product 


226 


55 


9374 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMS3W18. 


140 


71 


9375 


U77494 


Homo sapiens 


RANBP8 


166 


100 


9376 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478- 


155 


52 


9377 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7870. 


126 


63 


9378 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


151 


78 


9379 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


97 


68 


9380 


AL 160493 


Leislunaiiia 
major 


probable (lihv-6) ull02, variant a 
DNA, complete virion genome 


117 


62 


9381 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


104 


47 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 






9384 


AB047936 


Macaca 
fascicularis 


hypothetical protein 


95 


41 


9385 


G02994 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7075. 


57 


64 


9386 


U04810 


Homo sapiens 




151 


100 


9387 


U79260 


Homo sapiens 


unknown 


97 


79 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 

y 


"t&s — 


U 183 39 


Variola virus 


"D4L 


-y^T^ 




9389 


G03S00 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


75 


9390 


AL390935 


Leishmania 


possible dopap 


99 


88 


~T9\ 


AB015633 


Homo sapiens 


type II membrane protein 


TM 


Too — 


9392 


AF113685 


Homo sapiens 


PRO0974 


98 


62 


9393 


AF064819 


Homo sapiens 


serine protease DESCl 


227 


69 


9394 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


219 


68 




GO 1400 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5481. 








AK024455 


Homo sapiens 


FLJ00047 protein 






9397 


AF033260 


porcine 

endogenous type 
C retrovirus 


reverse transcriptase 


155 


53 


T^98 1 


y^^2sn 




— . 

C protein (AA 1-159) 


T47 


— 


9399 


AB029948 


Homo sapiens 


mitochondrial seryl-tRNA synthetase 


1203 


100 






Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 114. 






9401 


G02455 

— -— — — 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


58 




M155jO 


- 

Homo sapiens 


B-cell growth factor 






9403 


X56932 


Homo sapiens 


23 kD highly basic protein 


114 


85 




AC018748 


Arabidopsis 
tlialiana 


Contains similarity to PI 1 protein from 
Drosophila melanogaster gb|X59691 . 










Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 




88 




AF220264 


Homo sapiens 








9407 


AF119851 


Homo sapiens 


PRO 1722 


157 


65 


9408 


X92485 


Plasmodium 
vivax 


pval 


163 


78 


9409 


X92485 


Plasmodium 
vivax 


pval 


165 


77 


410 


G00352 


Homo sapiens 


- . 

Human secreted protein, SEQ ID NO: 

4433. 










Homo sapiens 


Human secreted protein, SEQ ID NOr 

7795. 








AFl 30089 




PRO2550 






9413 


AF130089 


Homo sapiens 


PRO2550 


175 


65 


_9414 1 


L14848 


Homo sapiens 


MHC class 1-related protein 


225 


97 






Homo sapiens 


Neural thread protein. 


107 


74 


6" 


- - 

AB001684 


Chlorella 
vulgaris 


ORF49b 










Homo sapiens 


Human secreted protein, SEQ ID NO; 
6613. 






9418 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


107 


73 


"9419 — 


AL050285 


Homosa iens 


hvpothetical protein 


194 


Too — 


9420 


AF090944 


Homo sapiens 


PRO0663 


132 


59 


9421 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


61 


9422 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


140 


70 


9423 


M20030 


Homo sapiens 


small proline rich protein 


62 


41 


9424 


AF116718 


Homo sapiens 


PRO2900 


396 


100 


9425 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


102 


35 


9426 


X61047 


Hydra sp. 


mini-collagen 


92 


45 



384 
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PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 




Smith- 
Waterman 


y 


9427 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


71 


9428 


L 10908 


Mus musculus 


Gcapl gene product 


107 


44 






Homo sapiens 


SEQ ID NO 485 from VV09922243. 




87 


9430 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


61 


9431 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 

clone HCACJ8 1 . 


111 


80 


9432 


X12580 


Medicago sativa 


put. nodulin (soybean N-75 homolog.) 


87 


33 




AL160371 


Leishmania 
major 


probable (hhv-6) ull02, variant a 
DNA, complete virion genome 


106 


48 


9434 


AC003028 


Arabidopsis 
thaliana 


3 OS ribosomal protein 831 


79 


42 


9435 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


106 


75 


9436 


G02827 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


207 


75 


9437 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


163 


85 




G03786 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

7867. 








AF118086 


Homo sapiens 


PRO 1992 






9440 


AF229067 


Homo sapiens 


PADI-H protein 


140 


64 




M36913 


Zea mays 


cell wall protein (put.); putative 


107 


38 


9442 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6536. 


161 


59 




AF229067 


Homo sapiens 


PADl-H protein 


147 


85 


9444 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


171 


79 


9445 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7881. 


155 


57 


9446 


X92485 


Plasmodium 


pval 


97 


55 


9447 


AFl 19851 


Homo sapiens 


PRO 1722 


123 


55 


9448 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 
RHAla (AF078683) 


130 


88 


9449 


AL391688 


Homo sapiens 


bA524D16A.2.1 (novel protein similar 
to mouse granuphilin-a) 


258 


97 


9450 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


130 


91 


9451 


X92485 


Plasmodium 


pval 


123 


88 


9452 


X83000 


Myocastor 
coypus 


lipase related protein 2 


168 


43 


~9453 


M35603 


Mus musculus 


-fj 5-i ^ 

Hox-3-1 protein 


"447 




9454 


M35520 


Canis familiaris 


OTP-binding protein (rabS) 


271 


98 






Homo sapiens 


NADH dehydrogenase subimit 4L 






9456 


A13595 


construct 


immunosuppresive protein PP15 


111 


100 


9457 


X67788 


Rattus 
norvegicus 


ezrin, p81 


185 


97 


94 


U28068 


Mus musculus 




neurogenic dififerentiation factor 






9459 


AF155140 


Homo sapiens 


gonadotropin-regulated testicular RNA 
helicase; GRTH 


556 


88 


9460 


AF155140 


Homo sapiens 


gonadotropin-regulated testicular RNA 
helicase; GRTH 


379 


94 


9461 


AFl 69689 


Homo sapiens 


protocadiierin alpha 1 0 alternate 
isoform 


208 


95 


9462 


U73193 


Homo sapiens 


inward rectifier potassium channel 


236 


96 
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SEQID 


Accession No. 




Description 


Smith- 




NO: 








Waterman 


Identit 










Score 


y 








Kirl.2 






9463 


U83278 


Bos taurus 


neural specific protein CRMP-2 


157 


61 


9464 


W48352 


Homo sapiens 


Human breast cancer related protein 


74 


57 








BCFLTl. 






9465 


AL03 1 673 


Homo sapiens 


dJ694B14.3 (PUTATIVE novel 


444 


100 








iialoacid deha!ogenase-lil<e hydrolase 












family protein similar to (archaea) 






■ 






bacterial proteins) 






-|^|^ 


"X6 4 


Homo sapiens 


ribosomal protein S8 


519 


90 


9 


S794 ] 0 


Mus musculus 


nuclear localization signal binding 


115 











■■ — s 











. 

Tlomo sapiens 


CTG26 alternate open reading frame 


95 


84 




G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


71 




- 




4488. 






9470 


AF130056 


Homo sapiens 


PR01367 


98 


76 




X71j54 


Homo sapiens 


vesicular monoamine transporter 


102 


95 


OA ^ 




Bos taurus 


ribosomal protein S29 


120 


95 






Homo sapiens 


nuclear cap binding protein 


583 


96 


9474 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 


469 


91 




- , 




haemoglobin gene contained in pSSl . 






9475 


V49572_cdl 


Homo sapiens 


13 -NOV- 1996 Human stomach cancer 


562 


100 








clone HP10122 cDNA #1. 






9476 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


247 


88 


9477 


AB043820 


Homo sapiens 


platelet glycoprotein VI-2 


95 


47 


9478 


AB020236 


Homo sapiens 


ribosomal protein L27A 


437 


86 


9479 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


52 








7795. 






?^L^^ 


A nAf/^^n 

Al'20563i 


Homo sapiens 


ninjurin2 


364 


100 


9481 


D14886 


Homo sapiens 


TFnA-37 


196 


100 


9482 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


76 








7519. 






9483 


AF 178842 


Homo sapiens 


Rhesus blood group-associated 


128 


75 








glycoprotein 






9484 


AF134895 


Homo sapiens 


glyoxylate reductase 


838 


95 


9485 


AC004832 


Homo sapiens 


similar to 45 kDa secrefoiy protein ; 


562 


98 








similar to CAA10644. 1 












(PID:g4164418) 






9486 


Z35761 


Homo sapiens 


TEL/ABL 


230 


71 


9487 


.Y91577 


Homo sapiens 


Human secreted protein sequence 


149 


72 








encoded by gene 2 SEQ ID NO:250. 






9488 


AL03263 1 


Caenorhabditis 


predicted using Geneflnder 


159 


31 






elegans 








9489 


A J27 1 079 


Oenothera elata 


Ycf2 protein 


113 


36 






subsp. hiookeri 








9490 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


40 








4435. 






9491 


Y53005 


Homo sapiens 


Human secreted protein clone pm749_8 


124 




61 








protein sequence SEQ ID NO: 16. 






9492 


Y21091 


Homo sapiens 


Human p53 cellular tumour antigen 


85 


41 








mutant protein fragment 28. 






9493 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


76 














9494 


AF119851 


Homo sapiens 


PR01722 


94 


54 


9493 


R04932 


Homo sapiens 


Interferon-gamma receptor segment 


142 


100 








from clone 39 responslblefor binding 












the target 






9496 


U40952 


Caenorhabditis 


C03B1 . 1 0 gene product 


103 


75 






elegans 






9497 


M63838 


Homo sapiens 


interferon-gamma induced protein 


128 


92 



386 
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SEQID 
NO: 


Accession No. 


Species 


DescHptio., 


Smith- 
Score 


Identit 

y 


9498 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


147 


44 


9499 


AF108831 


Homo sapiens 


K:C1 cotransporter 3 


244 


77 


9500 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 






9501 


G0341 1 


Homo sapiens 



Human secreted protein, SEQ ID NO: 
7492. 


139 


63 


9502 


AE003499 


Drosophila 
melanogaster — 


CGI 2706 gene product 


180 


36 


9503 j X02344 


Homo sapiens 


beta-tubulin 


"tI? 




9504 D78014 




dihydropyrimidinase related protein-3 




~ 


9505 1 U59446 


Brassica napus 


myrosinase-binding protein related 
protein 


116 


40 


9506 


ABO 1043 7 


Rattus rattus 


cadherin-8 






9507 


AF229067 


Homo sapiens 


PADI-H protein 


178 


74 


9508 


AJ224819 


Homo sapiens 


tumor suppressor 


291 


85 


9509 


AFO 17275 


Mus musculus 


groM'th factor independence- IB 






9510 


AF177203 


Homo sapiens 


cerebral cell adhesion molecule 


388 


52 


9511 


AY007160 


Homo sapiens 


similar to Homo sapiens CGI-57 
protein niRNA with GenBank 
Accession Number AF151815.1 


1533 


99 


9512 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


104 


66 


9513 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 




59 


9514 


X80035 


Oryctolagus 
cuniculus 


cysteine rich hair keratin associated 
protein 




40 


9515 1 D86491 


Xenopus laevis 


Nfrl 


1777 


76 


9516 


Y70761 


Homo sapiens 


Human bcta-amyloid peptide (BAP) 
binding protein, BBP3- 


168 


80 


9517 


AK024500 


Homo sapiens 


FLJ00109 protein 


1714 


100 


9518 


AF008220 


Bacillus subtilis 








9519 


AF099977 


Mus musculus 


schlafen4 


163 


35 


9520 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


162 


72 


9521 


AL035068 


Homo sapiens 


dJ116G19.1 (novel protein) 


102 


76 


9522 


B24527 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 1 5 SEQ ID NO: 1 53 . 






9523 


AF129756 


Homo Sapiens 


MSH5 


182 


100 


9524 


AJ238706 


Drosophila 
melanogaster 


monocarboxylate transporter 1 

homologue 


126 


32 


9525 


¥76 198 


Homo sapiens 


Human secreted protein encoded by 


250 


97 


9526 


D86604 


TH 1 

Mus museums 




200 


,. 

92 


9527 


AB032918 


Hylobates 
moloch 


— ; 

dopamine receptor D4 






9528 


AB007830 










9529 


AC004472 


Homo sapiens 


PI. 11659 4 


449 


98 


9530 


AF273047 


Homo sapiens 


CTCL tumor antigen se20-7 


104 


29 


9531 


AF130079 


Homo sapiens 


PR02852 


153 


67 


9532 


X07881 


Homo sapiens 


pro line-rich protein Gl 








9533 


AK001845 


Homo sapiens 


unnamed protein product 






9534 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


94 


41 


9535 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


185 


86 


9536 


X99583 


Homo sapiens 


CHLl protein 


635 


98 


9537 


AL022170 


Homo sapiens 


dJ501N12.1 


111 


37 


9538 


X66179 


Xenopus laevis 


p70 S6 kmase 


120 


75 



387 
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SEQID 1 Accession No. 
NO: ( 


Species 


Description 


Smith- " 
Waterman 
Score 


Identit 


9539 


U68380 


Gallus gallus 






98 


9540 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


97 


90 


9541 


AB013918 


Homo sapiens 






78 


9542 1 U29380 


Caenorhabditis 
elegans 


similar to adenylate cyclase 


501 


42 


9543 i AF020312 


Mils musculus 


proline-rich protein 9-1 


95 


51 


9544 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7516. 


142 


96 


9545 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7795. 


142 


66 


9546 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 

clone HCACJ81. 


110 


47 


9547 


A3044546 


Homo sapiens 


mitogen-activated protein kinase kinase 

kinase 


407 


60 


9548 j U52111 


Homo sapiens 


Ca2+/Calmodulm-dependent protein 
kinase 1 






9549 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 






9550 
9551 


Y87852 
G02455 


Homo sapiens 
Homo sapiens 


Human FGF-5 protein fragment. 


146 


100 


Human secreted protein, SEQ ID NO: 

6536. 


109 


50 


9552 


D45131 


Homo sapiens 


basigin 


311 


66 


9553 


A14656 


synthetic 
construct 


protein antigen 




89 


9554 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 






9555 


X73458 


Homo sapiens 


protein kinase 


568 


85 


9556 


AJ131890 


Homo sapiens 


DNA polymerase lambda 






9557 


S56555 


Homo sapiens 


paraoxonase/arylesterase 


224 


85 


9558 


M22332 


Homo sapiens 


unknown protein 


90 


38 


9559 


B24598 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 47 SEQ ID NO:224. 


66 


44 


9560 


G 03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


95 


51 


9561 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


77 


76 


9562 


X55687 


Lycopersicon 
esculentum 


extensin (class II) 


69 


60 


9563 


AC002333 


Arabidopsis 
tiialiana 


putative SFI6 protein {Helianthus 
annuus} 


111 


31 


9564 


AFl 18082 


Homo sapiens 


PRO1902 


106 


61 


9565 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7874. 






9566 


U10323 


Homo sapiens 


NF45 protem 






9567 


AFl 00757 


Homo sapiens 


COP9 complex subunit 4 


686 


90 


9568 


AL035608 


Homo sapiens 


dJ479J7.2 (transmembrane 4 
superfamily member 6) 


293 


90 


9569 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


63 


9570 


U47856 


Araneus 
diadematus 


fibroin-4 


98 


44 


9571 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


54 


9572 


AJ006239 


Homo sapiens 


diliydropteridine reductase 


461 


84 


9573 


G00821 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4902. 


248 


75 


9574 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


90 


36 


9575 


V00488 


Homo sapiens 


alpha globin 


508 


84 


9576 


AFl 16719 


Homo sapiens 


PR02987 


483 


91 



388 
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SEQ tD 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Waterman 


Identit 

y 


9577 

— — 


L77967 


Ovis aries 


small proline-rich protein witii paired 
^TAT^Ti J 1. J . 


62 


38 


— 


j^2^^^2 


— ■■ 

Homo sapiens 


N ADH dehydrogenase subumt 4 





^2 


9 


P90387 


omo sapiens 


N-teniiinal of human serum albumin 
sto^r^oliuman albumin Swiss Prot — 


274 


67 


~9580 — 


U22961 


Homo sapiens 


Accession Number P02768; Method: 
author'^^' translation supplied by 


271 


~ 


~95fl 


Ul^j-jg 


Homo sapiens 


~p70 


"Tol 


Tf 


9582 


M19419 


Mus musculus 


proline-rich salivary protein 


102 


36 














9584 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


105 

141 


32 


9585 




Homo sapiens 


enco™^by^ffeinr 17 protein 


61 


9586 


U93569 


Homo sapiens 


putative pi 50 


232 


51 


g^gg 


AF 1440 54 












G03790 


Homo sapiens 


Himan secreted^i^m^^ SEQ ID NO- — 

7871. 








9589 


00337] 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


90 


61 


9590 


L27428 


Homo sapiens 


reverse transcriptase 


120 


46 


9591 


G03714 


Homo sapiens 


Human secreted protein, SEQ TD NO; 

'^1^^' . 


85 


62 


„ 


AJ133489 


— — — ; 




— — 





"9593 


G03790 


Heme sapiens*^ — 


Human"secreted protein SEQ ID NO- 
7871. 


^94 


68 




Ml 5386 












~9595 




Homo sapiens 


Human'secreted protein SEQ ID NO- — 

6619. 


309 ~ - 






AFl 16719 




PR02987 




9597 


AFl 16719 


Homo sapiens 


PR02987 


485 


94 




AF068294 




HDCMB45P 






9599 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


70 


"9600 


AFl 18082 


Homosa iens 


PRO 1 902 


ITi 


— 


9601 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


186 


68 


"9602 — 


G03790 


Homo sapiens 


Human secreted protein SEQ ID NO- — 
7871.. 


T76 


— 




Y86305 




Human secreted protein HLYES38 
SEQIDNO.220. 








AFl 94537 










9605 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


67 


"9606 


j^Q^g^g 


~U salens 


salivary proline rich protein 1 


— 


— 


"9607 — 


U49973 


Homo sapiens 


ORf 1 ; MER37- putative transposase 
similar to pogo eleiiieiU 


"T34 


— 


9608 


Y02749 


Homo sapiens 


sene 100 clone HNFIU96. 


129 


67 


9609 


G04063 


Homosa iens 

omo sapiens 


8144. 


loi 


12 


9610 


AFl 18080 


Homo sapiens 


PRO 1880 


237 


100 


9611 


AFl 19851 


Homo sapiens 


PRO 1722 


138 


81 


9612 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


178 


55 


9613 


AF090930 


Homo sapiens 


PRO0478 


88 


71 


9614 


AF068294 


Homo sapiens 


HDCMB45P 


121 


49 


9615 


GO 1657 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


118 


79 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 

y 














9616 


AF090931 


Homo sapiens 


PRO0483 


143 


63 


9617 


Y8647'? 


Homo sapiens 


Human gene 52-encoded protein 
fragment SEQlDNO"387 


92 


44 


9618 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


56 


~96T9 


AB04606 1 


^^^^^^ 

fascicularis 


— J = TH 

unnamed protein product 





~44 






Homo sapiens 


Human secreted protein sec[uence SEQ 

IDNO:155. 








AFl 13685 




PRO0974 






9622 


AF097178 


Equus caballus 


glyceraldehyde-3-phosphate 
deliydrogen&se 


119 


75 


9623 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


134 


62 


9624 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 12 


53 


9625 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


77 


9626 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


164 


73 


9627 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO;490. 


236 


71 


9628 


AFl 3 0079 


Homo sapiens 


PR02852 


138 


50 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 






9630 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


167 


61 


9631 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


87 


50 


9632 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


128 


41 


9633 


G02501 


- 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


1:.4 


81 


9634 


AFl 18082 


Homo sapiens 


PRO 1902 


114 


57 




G03793 


omo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 






9636 


L07543 


Leishmania 


MURF4 


85 


42 


^637 1 


U49973 


Homo'sapiens 


ORF2 function unknown 


"163 


"44 


9638 


AF090895 


Homo sapiens 


PRODI 17 


77 


41 




Y27868 




Human secreted protein encoded by 
gene No. 107. 








Y14480 


Homo sapiens 


encoded by gene 14. 








AFl 18078 




PRO 1 848 






9642 


AK024455 


Homo sapiens 


FLJ00047 protein 


131 


79 




W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 










Homo sapiens 


Human secreted protein. SEQ ID NO; 
7871. 






9645 


G02501 


Homo sapiens 


Human secreted protein, SfcQ ID NO: 


69 


62 


9646 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


69 


35 


9647 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


478 


90 


9648 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


155 


50 


9649 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


60 
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SEQ ID 


Accession No. 




Description 


Smith- 




NO: 








Waternian 
Score 


Identit 

y 








6619. 






9650 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


173 


38 








7214. 






9651 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


118 


36 








clone HPMBQ32. 






9652 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


64 














9653 


D00570 


Mus musculus 


open reading frame (196 AA) 


182 


51 


9654 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


54 








7870. 






9655 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


105 


74 








gene 44 clone HTDAD22. 






9656 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


82 


70 


9657 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
1 18. 


79 


39 


9658 


M24732 


Homo sapiens 


lamin-like protein 


76 


31 


9659 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


73 








6561. 






9660 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 




149 




48 








(hhcm) gene preoduct. 






9661 


\V88627 


Homo sapiens 


Secreted protein encoded by gene 94 


135 


62 








clone HPTvIBQ32. 






9662 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


126 


71 








terminal, clone XEH.Sc} 






9663 


U49973 


Homo sapiens 


ORFl; ivIER37; putative transposase 


141 


63 








similar to pogo element 






9664 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


153 


65 








similar to pogo element 






9665 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


142 


93 


9666 


AFl 13685 


Homo sapiens 


PRO0974 


112 


73 


9667 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


62 








6571. 






9668 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


47 








7214. 






9669 


B12310 


Homo sapiens 


Human secreted protein encoded by 


80 


41 








gene 10 clone HDPGP94. 






9670 


W48351 


Homo sapiens 


Human breast cancer related protein 


106 


58 








BCRB2. _ 






9671 


AF090942 


Homo sapiens 


PRO0657 


118 


66 


9672 


AFI61356 


Homo sapiens 


HSPC093 


94 


63 




AF090930 


Homo sapiens 


PRO0478 


167 


64 


9674 


GO:. 133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


50 


. 






...... . . . 






9675 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


70 






■ 


^"^21- ... ... 






9676 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


60 








. 






9 77 




Kattxis rattus 


flk protein 


614 


69 


9678 


AJ006770 


Cicer arietinum 


extensin 


163 


48 


9679 


Y19767 


Homo sapiens 


SEQ ID NO 485 fiom W09922243. 


120 


67 




G00494 


Homo sapiens 


Human secreted protein, SEQ ID NO: 












4575. 






9681 


U63542 


Homo sapiens 


FAP protein 


120 


68 


9682 


M15530 


Homo sapiens 


B-cell growth factor 


146 


76 


9683 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


180 


62 








4433. 






^84 ~ 


AF321379 


Callicebus 


gamma2-globin 


lOl 


86 






moloch 
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SEQID 
NO: 


Accession No. 


species 


Description 


Waterman 
^'^T'^ 


Identit 


. 




. .. ... 

Homo sapiens 


reverse transcriptase 






9686 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


77 


51 


9687 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


104 


58 


9688 


G 023 63 


Homo sapiens 


Human secreted protein, SEQ ID NOr 




. 

58 


9689 


X817I3 


Homo sapiens 


smallest subunit of TFIIA 


210 


91 


9690 


AFI9:.3:>0 


Human 
respiratory 
syncytial virus 


G protein 


100 


25 





API ^ni fin 


Homo sapiens 


small zinc finger-like protein 


— 









AF068294 
AF189307 


omosap^ns 

Homo sapiens 






~ 

54 


unknown 







9694 


G0j13o 


Homo sapiens 


Human secreted protein, SEQ ID NOr 


165 


50 


-— — 




AF130089 


-J- ^ 

omo sapiens 


PRO2550 


--— 




-~ 




Y87064 


omo sapiens 


ID NO' 103 




"SS 





AK024455 


"H '■ 




= 


63 


~9698 


M24732 


Homo sapiens 


ImifafMcepTotefa 


~92 




9699 


AF116715 


Homo sapiens 


PR02829 


119 


75 




AF130089 


Homo sapiens 


PRO2550 






9701 


V00662 


Homo sapiens 


ATPase 6 


255 


11 




LI 0908 


Mus musculus 


Gcapl gene product 






9703 


AF130089 


Homo sapiens 


PRO2550 


208 




9704 


AFl 94537 


Homo sapiens 


NAG13 


89 


51 
77 


9705 


U79260 


Homo sapiens 


unknown 


98 


9706 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


7S 


43 


9707 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

"^r— 3 : 


70 


92 


JTj 

9708 


— 

G00637 


. 

Homo sapiens 


Human secreted protem, SEQ ID NO: 
^^^•^^ 


— z 

136 




60 


'9709 




Homo sapiens 




— 


~48 


9710 


AF130089 


Homo sapiens 


PRO2550 


87 


58 


"ItM — 




Homo sapiens 


pot. ORF III 


1j>5 


41 




G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

8153. 











Homo sapiens 


PR0222 1 


— 1 






AF130051 


Homo sapiens 


PRO0898 




"51 


9715 


AF130089 


Homo sapiens 


PRO2550 


143 


50 


9716 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

, 


124 


49 








: 

Homo sapiens 


reverse transcriptase 


— — 


— 


"pyr? — 


G03133 


Homo sapiens 


7214. ^ ' ^ 


'W 


"35 


^^55 — 


|^3Jf^^ — 




reverse transcriptase 










M74009 


Tlrero ^thecus 

gdadT' 


cytochrome c oxidase subunit II 








Ml 0546 










9722 


AF130051 


Homo sapiens 


PRO0898 


96 


57 


9723 


U93564 


Homo sapiens 


putative pi 50 


144 


49 


9724 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


373 


76 


9725 


AK000496 


Homo sapiens 


unnamed protein product 


155 


55 


9726 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


94 


62 
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SEQ ID 
NO: 






Description 


Waterman 


% 

Identit 


9727 


AF130089 


Homo sapiens 


PRO2550 


154 


40 




AFl 30089 




PRO2550 






9729 


M19419 


Mus musculus 


proline-rich salivary protein 


98 


39 




D90053 










9731 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


50 


"9739 


AF090942 


Homo sapiens 


PRO0657 


~72 


Ts 


9733 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


44 


9734 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163 


159 


94 


97.5 


G03797 


Homosa lens 

omo sapiens 


7878. 


"Tso 


— 





L27428 








33 




G00985 


Homo sapiens 

omosap,ens 


HimanseCTeted protein SEQ ID NO- — 

5066. 









AF007064 


cerevisiae 




















9740 


G03140 


Homo sapiens 


Hiinian secreted protein, SEQ ID NO: 


109 


50 


9741 


AF194537 


Homo sapiens 


NAG13 


123 


54 






Homo sapiens 


A mature human. N-acetylglycosaminyl 
transferase protein. 










Plasmodium 


pval 








G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7881. 








G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6566. 








W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 








Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 










Homo sapiens 


7253. ^ 










Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 






9750 


U49973 


Homo sapiens 


ORFl ; MER37j putative ttBnsposase 
similar to pogo element — 


156 


71 


— — j 


G03925 


omo sapiens 


8006^ * 


— 


— 


"9752 


QQgjgg 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


97 







X92485 


vivax 


pval 


91 






Y07786 




Rfc-like protein 


92 




9755 


AF068294 


Homo sapiens 


HDCMB45P 


125 


63 








reverse transcriptase 


91 




9757 


AK024455 


Homo sapiens 


FU00047 Drotein 1 143 


59 


9758 


S79410 


iVIus musculus 


nuclear localization signal binding | 103 
protein 


41 


9759 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


96 


67 


9760 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


179 


58 


9761 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCAaSl. 


154 


65 


9762 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


78 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 








7888. 






9763 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


119 


60 


9764 


¥27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


111 


85 


9765 


R13556 


Homo salens 


Protein encoded downstream of lihc M 
oncoprotein. 


116 


36 


9766 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


165 


74 


9767 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


162 


74 


9768 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


1 17 


52 


9769 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4518. 


92 


50 


9770 


AF130089 


Homo sapiens 


PRO2550 


104 


31 


9771 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


154 


62 


9ni 


U52077 


Homo sapiens 


mariner iransposase 


194 


56 


9773 


Y87297 


Homo sapiens 


Human signal peptide containing 
protein HSPP-74 SEQ ID NO:74. 


345 


100 


9774 


S58722 


Homo sapiens 


X-linked retinopatliy protein {C- 
terminal, clone XEH.Sc} 


73 


36 


9775 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


96 


32 


9776 


GOB 133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


124 


53 


9in 


L27428 


Homo sapiens 


reverse transcriptase 


251 


53 


9778 


K02576 


Homo sapiens 


salivary prolinc-rich protein 1 


141 


40 


9779 


GOB 801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


111 


58 


9780 


AJ224997 


Rattus 
norvegicus 


hunting^in 


64 


64 


9781 


AF283769 


Homo sapiens 


similar to QenBank Accession Number 
AC021163 


H9 


53 


9782 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


81 


50 


9783 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


68 


9784 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

8148. 


85 


65 


9785 


AF144054 


Homo sapiens 


apoptosis related protein APR-4 


83 


53 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


75 


41 


9787 


M64792 


norvegicus 


salivary proline-ricli protein 






9788 


D90053 


Sus scrofa 


destrin 




43 


9789 


A18812 


Brassica napus 


extensin 


104 


34 


9790 


AF1300S9 


Homo sapiens 


PRO2550 


102 


66 


9791 


AF 130089 


Homo sapiens 


PRO2550 


326 


70 


9792 1 D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


495 


79 


9793 


M10546 


Homo sapiens 


cytochrome oxidase I 


343 


75 


9794 


Y85062 


Homo sapiens 


Interleuldn 1 converting enzyme 
homologue 0CEL) protein sequence. 


106 


58 


9795 


X92485 


Plasmodium 
vivax 


pval 


108 


48 


9796 


M15530 


Homo sapiens 


B-cel! growth fector 


119 


70 


9797 


GOB 133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7214. 


148 


65 


9798 


X61046 


Hydra sp. 


mini-cqllagen 


101 


55 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Hentit 

y 






Homo sapiens 






36 


9800 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


52 


9801 


X92485 


Plasmodium 


pval 


113 


45 


9802 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


63 


44 


9803 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


583 


89 


9804 


GO 1736 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


47 


9805 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHFQ29. 


132 


71 


9806 


X92485 


Plasmodium 




106 


53 


9807 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ IDNO:490. 


163 


67 


9808 


AFl 19851 


Homo sapiens 


PRO 1722 


341 


68 


9809 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7563. 


77 


57 


9810 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protem 
RHAla(AF078683) 


133 


72 


9811 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


65 


9812 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


1 18 


37 


9813 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


121 


72 


9814 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


72 


92 


9815 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


71 


71 


9816 


AFl 18082 


Homo sapiens 


PRO 1902 


93 


45 


9817 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


207 


59 






Homo sapiens 


Human secreted protem, SEQ ID NO: 
6953. 








Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 








LI 0908 


Mus musculus 


Gcap 1 gene product 






9821 


A00469 


Homo sapiens 


growth hormone 


254 


81 


9822 


U79260 


Homo sapiens 


unknown 


85 


39 




R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 






9824 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


103 


74 


9825 . 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


51 


9826 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


148 


44 


9827 


Y24789 


Homo sapiens 


Human secreted protein bG77_l. 


265 


100 


9828 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


72 


60 




G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 






9830 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


104 


51 


9831 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4518, 


118 


74 


9832 


AF151866 


Homo sapiens 


CGI-108 protein 


115 


31 


9833 


U93563 


Homo sapiens 


putative pi 50 


127 


61 



395 



SEQID 
NO: 


Accession No. 


Species 

. 


Description 


Smith- 

Watermaii 

Score 


Hentit 

y 


9834 


AE003859 


Xylella 
fastidiosa 


hypothetical protein 






9835 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


138 


65 


9836 


B01372 


Homo sapiens 


Neuron-associated protein. 




88 


9837 


G03115 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


57 


9838 ! AF068294 


Homo sapiens 


HDCiVIB45P 


90 


65 


9839 


S77772 


Homo sapiens 


aspaitylglucosaminidase, AGA {C- 
terminal, alternatively spliced} {EC 
^••^•^••^^^ ... . 


71 


66 


9840 


AF090942 


Homo sapiens 


PRO0o57 







9841 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
. ... 


75 


54 


9842 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


67 


9843 


AF 130089 


Homo sapiens 


PRO2550 


216 


75 


9844 


M15530 


Homo sapiens 


B-cell growth factor 






9845 


U93567 


Homo sapiens 


p40 


248 


75 


9846 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


151 


55 


9847 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


64 


9848 


AFl 18082 


Homo sapiens 


PRO1902 


58 


81 


9849 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


129 


65 


9850 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


329 


77 


9851 


AF090944 


Homo sapiens 


PRO0663 


122 


80 


9852 


X71442 


Rattus 


ORF 1; putative 


109 


48 


9853 


AFl 30089 


Homo sapiens 


PRO2550 


126 


60 


9854 


G04063 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
8144. 


98 


62 


9855 


U93563 


Homo sapiens 


putative pi 50 


136 


48 


9856 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


276 


74 


9857 


AF017777 


Drosophila 
melanogaster 


la Costa 






9858 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

"^^J^' — : 


99 


37 


9859 


K02576 


Homo sapiens 


salivary proluie-ricfi protein 1 




37 


9860 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 






9861 


AFl 16909 


Homo sapiens 


unknovm _ -t-tt s 


.. 




9862 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 






9863 


Y21811 


Homo sapiens 


CPF polypeptide 36PRO. 






9864 


U93567 


Homo sapiens 


p40 


252 


70 


9865 


AF090895 


Homo sapiens 


PRODI 17 


164 


60 


9866 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


47 


9867 


AF090930 


Homo sapiens 


PRO0478 


l06 


— 


9868 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


167 


84 


9869 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


83 


60 


9870 


AFl 19900 


Homo sapiens 


PR02822 


134 


48 


9871 


AFl 16715 


Homo sapiens 


PR02829 


111 


81 


9872 


U52077 


Homo sapiens 


mariner transposase 


332 


76 



396 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Waterman 


Idenfit 

y 


"9873 


Y^g^QQ 


Homo sapiens 


A mature human N acetylglycosaminyl 
transferase protein. 


^Mre 






U93570 










9875 


L27428 


Homo sapiens 


reverse transcriptase 


127 


54 






Rattus sp. 


reverse transcriptase homolog 




43 


9877 


G00403 


Homo sapiens 


Human secreted protein^ SEQ ID NO: 

4484. 


113 


64 


9878 


AFl 87823 


Zea mays 


transposase DOPA 


132 


38 




W40353 


Homo sapiens 


Human unspecified protein from 
US5702907. 


101 






G03415 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7496. 


133 


59 




U49973 ! Homo sapiens 

i 


ORFl ; MER37; piilative transposase ; 101 
similar to pogo element 




G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


148 




9883 


GO 1249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5330. 


57 


65 




X92485 


Plasmodium 
vivax 


pval 


129 


43 


9885 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


98 


37 


9886 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


89 


59 


9887 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8153. 


155 


54 


9888 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
8144. 


121 


40 


9889 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


346 


68 


9890 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


133 


38 


9891 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


143 


66 




X53581 


Rattus 
norvegicus 


0RF7 


81 




9893 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7348. 


109 


53 


"9894 


AE003859 


Xylella 
fastidiosa 


hypothetical protein 


99 




9895 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


157 






L26953 


Homo sapiens 


chromosomal protein 


107 




9897 


AL390114 


Leislixnania 


probable (hliv-6) ul 102, variant a 
DNA, complete virion genome 


95 


51 


^^9898 


U93569 


Homo sapiens 


putative plSO 


106 


~ 


9899 


U 15647 


Mus musculus 


reverse transcriptase 


118 


47 




U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


118 




9901 


U93563 


Homo sapiens 


putative pi 50 


116 


39 




G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


153 







X03145 


Homo sapiens 


potORFV 


148 






G03787 


Homo sapiens 


Human secreted protein, SEQ ED NO: 

7868. 


84 




9905 


AF066073 


Dictyostclium 
discoideum 


SP85; PsB 


113 


52 


9906 


AF090895 


Homo sapiens 


PROOin 


125 


58 


9907 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


102 


57 


9908 


M24732 


Homo sapiens 


lamin-like protein 


95 


35 



397 



/;/E:A^¥O01 84835 cpd 



WO 01/64835 PCT/USOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


% 

Identit 

y 






Homo sapiens 


Human secreted protein encoded from 
gene 12. 








AFl 1 8080 


Homo sapiens 


PRO 1 880 






9911 


AF116719 


Homo sapiens 


PR02987 


216 


72 






Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 

7784. 






9914 


Y 14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


72 


58 


9915 


G00233 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


71 


99 6 


S79410 


M I 


~— — i — r — -■ ■■ , , ■ 

nuclear localization signal bindmg 

protein 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 






9918 


AFl 167 12 


Homo sapiens 


PR027j8 


106 


69 


9919 


G025j2 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


77 


9920 


AF068294 


Homo sapiens 


HDCMB45P 


296 


59 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

7077. 








Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 




69 






Homo sapiens 


HPiVISRS 






9924 


AB002317 


Homo sapiens 


KIAA0319 


294 


75 


9925 

._. 


U49973 
.- ... - 


Homo sapiens 
_ „_ 


ORF 1 ; 1VIER37; putative transposase 
similar to pogo element 


169 


42 




U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


153 


59 






G00214 


— -. 

Homo sapiens 


Human secreted protein, SEQ ID NO; 
4295. 






9928 


Y 14483 


Homo sapiens 


Fragment of human secreted protein 


863 


76 


9929 


AF130051 


Homo sapiens 


PRO0898 


128 


60 




AF130089 


Homo sapiens 


PRO2550 






9931 


AF064597 


Homo sapiens 


LINE-1 like protein 


101 


48 


AF027144 


Homo sapiens 


zinc finger protein 




42 


9933 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


44 


9934 


U93569 


Homo sapiens 


putative pl50 


105 


68 


9935 


G0313j 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


133 




9936 


Y73344 


Homo sapiens 


HTRM clone 0258181 protein 


145 


46 


9937 


G02872 


Homo sapiens 


Human secreted protein, SEQ ED NO; 
. . 


123 


77 


9938 




Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 






9939 


D63643 


Homo sapiens 


clathrin coat assembly protein-like 


232 


69 


9940 


Y 14480 


Homo sapiens 


Fragment of human secreted protein 


75 


65 


9941 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


176 


71 


9942 


X04588 


Homo sapiens 


cytoskeletal tropomyosin (AA 1-248) 


379 


81 


9943 


G03683 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7764. 


58 


52 


9944 


AF130114 


Homo sapiens 


PR02459 


117 


67 


9945 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


297 


64 



398 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 

. . 


Smith- 
Waterman 
.Score 


Identit 

y 








similar to pogo element 






9946 


D00570 


Mus musculus 


open reading frame (25 1 AA) 


214 


63 


9947 


X03717 


Homo sapiens 


pot. unidentified reading frame 


76 


35 




G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4523, 










Homo sapiens 


high mobility group protein 14 






9950 


AF090931 


Homo sapiens 


PRO0483 


114 


75 


9951 


AF164612 


Homo sapiens 


Gag protein 


132 


63 


9952 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


112 


65 


9953 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


118 


59 


9954 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ8I. 


138 


74 


9955 


G00974 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5055. 


94 


38 


9956 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


124 


61 


9957 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
fector(BCGF). 


177 


51 


9958 


X92485 


Plasmodium 


pva] 


137 


44 


9959 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


142 


53 


9960 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 


174 


68 


9961 

~9962 ~ 


S79410 


IVIus musculus 


nuclear localization signal binding 
protein 


123 


38 


G 03 7 87 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


124 


56 


9963 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


80 


57 


9964 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


136 


82 


9965 


Y27908 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


92 


53 


9966 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


76 


40 


9967 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


38 


9968 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


101 


67 


9969 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


90 


83 


9970 


Y36156 


Homo sapiens 


Human secreted protein #28. 


170 


68 


9971 


U79260 


Homo sapiens 


unknown 


142 


67 


9972 


L27428 


Homo sapiens 


reverse transcriptase 


108 


63 


9973 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


60 


9974 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


78 


85 


9975 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 


79 


39 


9976 


AF090944 


Homo sapiens 


PRO0663 


109 


73 


9977 


AF116715 


Homo sapiens 


PR02829 


120 


56 


9978 


U93563 


Homo sapiens 


putative pl50 


132 


57 


9979 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


97 


80 


9980 


U42580 


Paramecium 
bursaria 


A658R 


70 


40 



399 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 




Description 


Waterman 


Ideiitit 

y 






Chlorella virus 1 








9981 


G03371 


Homo.sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


87 


61 


9982 


G02363 


Homo sapiens 


Human secreted'protein, SEQ ID NO: 
6444. 


135 


70 


9983 


X61047 


Hydra sp. 


mini-collagen 


90 


36 


9984 


G03564 


Homo sapiens 


Human seca-eted protein, SEQ ID NO: 
7645. 


98 


40 


9985 


Y41552 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 80. 


90 


42 


9986 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


132 


52 


9987 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


115 


41 


9988 


AF042832 


Homo sapiens 


forldiead-relatedtBnscription fector 
FREAC-9 


89 


36 


9989 


X03717 


Homo sapiens 


pot. unidentified reading fiame 


105 


67 


9990 


AL390935 


Leishmania 
major 


possible dopap 


118 


74 


9991 


AF201949 


Homo sapiens 


60S ribosomal protein L30 isolog 


109 


58 


9992 


AF130114 


Homo sapiens 


PR02459 


115 


68 


9993 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0:163. 


148 


83 


9994 


AF068294 


Homo sapiens 


HDCMB45P 


196 


43 


9995 


Y95028 


Homo sapiens 


Human clone vp7 1 0RF2, SEQ ID 
NO: 128. 


117 


54 


9996 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


74 


53 


9997 


Y 14480 


Homo sapiens 


Fragment of human secreted js-otein 
encoded by gene 14. 


132 


55 


9998 


AF 102771 


Homo sapiens 


cardiac-specific ras association 
(RalGDS/AF-6) domain family I 
protein isoform ID 


148 


100 


9999 


V00662 


Homo sapiens 


URF 2 (NADH deliydrogenase subunit) 


213 


88 


10000 1 
lOOOf^ 


D38112 
G02619 


Homo sapiens 


ATPase subunit 6 


279 


69 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6720. 


97 


39 


10002 


AK000496 


Homo sapiens 


unnamed protein product 


154 


74 


10003 


S58722 


Homo sapiens 


X-linlced retinopathy protein {C- 
tenninal, clone XEH.Sc} 


125 


74 


10004, 


B07702 


Homo sapiens 


Protein encoded by the endogenetic 
fragment of HERV-W. 


143 


56 


10005 


Y3 63 66 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


97 


65 


10006 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


154 


72 


10007 


AF021231 


Mus musculus 


acetylcholinesterase-associated 
collagen 


109 


41 


10008 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6444. 


69 


78 


10009 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


170 


78 


10010 


AF130051 


Homo sapiens 


PRO0898 


134 


67 


10011 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


100 


44 


10012 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6613. 


95 


78 


10013 


AF073519 


Homo sapiens 


small EDRK-rich factor I, long isoform 


90 


52 


10014 


AF130052 


Homo sapiens 


PRO0956 


104 


50 



400 



w o 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Waterman 


Identit 


10015 


X92485 


Plasmodium 
vivax 


pyal 




46 


10016 


AFl 30089 


Homo sapiens 


PRO2550 




56 


10017 j AF130089 


Homo sapiens 


PRO2550 


172 


58 


10018 G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


38 


10019 1 G00354 

i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


54 


10020 1 G02584 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


43 


10021 I G00588 

i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


70 


51 


10022 : AF090942 


Homo sapiens 


PRO0657 


113 


67 


10023 1 Ml 1759 

i 


Ly copers icon 
escuientiim 


cell wall hydroxyproline-rich 
glycoprotein 


77 


33 


10024 j X55681 


Lycopersicon 
esculentum 


extetisin (class I) 


95 


35 


10025 ; G03133 
10026" yW88627 


Homo sapiens 
Homo sapiens 


Human secreted protein, SEQ ID NO: 

7214. 


93 


61 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


153 


42 


10027 


Y73377 


Homo sapiens 


HTRM clone 1645941 protein 

sequence. 


1180 


84 


10028 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


108 


59 


10029 1 G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4447. 


1)2 


65 


10030 ! Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


163 


82 


10031 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


131 


66 


10032 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


74 


48 


10033 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


85 


10034 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


192 


60 


10035 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


55 


10036 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


128 


76 


10037 


AFl 18086 


Homo sapiens 


PRO 1992 


148 


68 


10038 


S58722 


Homo sapiens 


X-linked retinopatliy protein {C- 
terminal, clone XEH.Sc} 


125 


47 


10039 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


65 


10040 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


45 


10041 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


75 


10042 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


100 


33 


10043 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 






10044 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


135 


71 


10045 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


148 


46 


10046 


X92485 


Plasmodium 




98 


58 



401 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


% 
y 


10047 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


102 


87 


10048 • 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


63 


84 


10049 


U09202 


Homo sapiens 


ornithine decarboxylase antizyme 


202 


67 






Plasmodium 


pval 


77 


34 


10051 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


40 


10052 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


57 


10053 


AC005396 


Arabidopsis 
thaliana 


putative proline-rich protein 


100 


33 


10054 


G03600 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


94 


10055 


X73434 


Ovis aries 


KAP5.4 keratin protein 


76 


45 


10056 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


82 


62 


10057 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


66 


10058 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


55 


10059 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


130 


73 


10060 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


42 


! 10061 


L27428 


Homo sapiens 


reverse transcriptase 


107 


43 




G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8153. 


92 


47 






Homo sapiens 


Human secreted protein, SEQ ED NO: 

7193. 


93 


46 




AF220264 


Homo sapiens 




135 


48 


10065 


AF130114 


Homo sapiens 


PR02459 


87 


68 


10066 


D86853 


Catharanthus 


extensin 


115 


35 


10067 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


71 


10068 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


102 


54 






norvegicus 


nitric oxide synthase 




77 






Mus famulus 


reverse transcriptase 






10071 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


94 


67 


10072 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


51 


10073 


AK024455 


Homo sapiens 


FLJ00047 protein 


146 


66 






Plasmodium 
vivax 


pval 


121 


36 


10075 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
. . . 


130 


64 


10076 


AF055985 


0 h 
volvulus 


pyrrolidone-rich antigen 






10077 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


165 


75 


10078 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


106 


72 


10079 


AF130050 


Homo sapiens 


PRO0872 


102 


63 


10080 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


44 
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SEQID 1 Accession No. 
NO: ' 

I 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 




.. 


8 1 So. 






10081 : Y87156 

■ 1 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO; 195. 


285 


78 


10082 1 AF068294 


— : 

Homo sapiens 


HDCMB45P 






10083 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


98 


48 


10084 


X92485 


Plasmodium 
vivax 




pva 


. 




10085 


AF 118082 


Homo sapiens 


PRO 1902 






10086 


AF090942 


Homo sapiens 


PRO0657 


88 


53 


10087 


AF 118086 


Homo sapiens 


PRO 1992 




71 


10088 


Y01400 


Homo sapiens ' 


Secreted protein encoded by gene 18 
clone HNiEIF029. 


119 


68 


10089 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


1 17 


34 


10090 


W48351 


Homo sapiens 


Human breast cancer related protein 


154 


77 


10091 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4718. 


138 


56 


10092 


G03I19 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7200. 


90 


56 


10093 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


134 


66 


10094 


Ml 5530 


Homo sapiens 


B-cell growth factor 






10095 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


65 


10096 


M15530 


Homo sapiens 


B-cell growth factor 


87 


68 


10097 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


84 


79 


10098 


G00369 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4450. 


150 


66 


10099 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


157 


70 


10100 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


103 


60 


10101 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


114 


70 


10102 


AL359782 


Trypanosoma 
bnicei 


possible (hhv-6) ullOZ, variant a dna, 
complete virion genome. 


130 


53 


10103 


G03786 ' 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 






10104 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


60 


10105 


L27428 


Homo sapiens 


reverse transcriptase 


220 


38 


10106 


W90S34 


Homo sapiens 


Human lympliocyte targeted peptide 
#2. 






10107 


AF161356 


Homo sapiens 


HSPC093 


227 


61 


10108 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


64 


10109 


AF130089 


Homo sapiens 


PRO2550 


126 


77 


10110 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


118 


60 


10111 


M63274 


Plasmodium 
falciparum 


malaria antigen 






10112 


AF090944 


Homo sapiens 


PRO0663 


133 


70 


10113 


L27428 


Homo sapiens 


reverse transcriptase 


91 


32 


10114 


Y17221 


Homo sapiens 


Human secreted protein (clone ik3 17- 

3). 


112 


64 


10115 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


85 


44 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 


10116 


L27428 


Homo sapiens 


reverse transcriptase 


112 


44 


10117 






6582. 








G04063 




8144, 








AF194537 


Mm'musct^s 









ToT20 — 


D00570 




qpen reading fiame (196 AA) 




~ 


10121 

10122^ 


X77816 


Rattus 


PR-Vbetal 


122 


47 


W48351 


Homo sapiens 


Human breast cancer related protein 
H^^secreted protein SEQ ID NO — 


148 


55 


ToT23 — 


(303793 


Homo sapiens 


7874. 


— 


~66 




AF090942 


Homo sapiens 


PRO0657 


"Til 


— ~ 


I0I25 — 


G00442 


omo sapiens 


Human secreted protein, SEQ ID NO. 






10126 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

■ p^t^a^jvep]50 


113 


49 


"10127 — 


^jggggg 


Homo sapiens 




"94 


— 


10128 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


146 


79 


TOI29 — 


AF130089 


"llomo sapiens 


PRO2550 


"^08 


75 


10130 


M15530 


Homo sapiens 


B-cell growth factor 


102 


73 






esculentum 


extensin (class I) 


















10133 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029 


99 


73 


10134 


API 94537 


Homo sapiens 


NAG 13 


220 


48 




G02987 


omo sapiens 


7068. 










Homo sapiens 


Human secreted protein, SEQ ID NOi 
7492. 






10137 


Y86573 


Homo sapiens 


fragment, SEQ ID NO*490 


223 


59 


10138 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


178 


75 




G03172 


Homo sapiens 


Hilnan secreted protein, SEQ ID NO — 
7253. 


~Tjo 




10141^ 


Y14480 
AF072164 





encoded by gene 14. 












10142 


G0221 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


74 


10143 


M15530 


Homo sapiens 


B-cell growth fector 


91 


40 








7888. 












6613. 








G00379 




Human secreted protein, SEQ ED NO: 
4460. 






10147 


Y87064 




ID NO: 103. ^ ^ 






10148 


X71975 


Drosophila 
melanogaster 


put. homologue to S.cerevisiae GARl 
gene 


118 


39 


I0I49 


S58722 


Homo sapiens 


X-Jinked retinopathy protein {C- 
teiminal, clone XEH.8c} 


123 


49 


10150 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


163 


81 
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SEQID 
NO: 


Accession No. 


Spedes 


DescripUon 


Smith- 
Waterman 
Score 


% 

Mcntit 


10151 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


106 


83 


10152 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


117 


52 


10153 


AFl 13685 


Homo sapiens 


PRO0974 


133 


62 


10154 


U41038 


Caenorhabditis 


Similar to cadherin-type repeat 


129 


67 


10155 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


89 


50 


10156 


K02576 


Homo sapiens 


saUvary proline-rich protein 1 


131 


41 


10157 


AB011099. 


Homo sapiens 


KIAA0527 protein 


94 


35 


1015S 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


78 


72 


10159 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


139 


71 


10160 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


161 


68 


10161 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


162 


8S 


10162 


MI 5530 


Homo sapiens 


B-cell growth factor 


137 


63 


10163 


AF090895 


Homo sapiens 


PRODI 17 


93 


67 


10164 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


109 


60 


10165 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4478. 


126 


52 


10166 


AFl 18082 


Homo sapiens 


PRO1902 


111 


55 


10167 


AF090895 


Homo sapiens 


PRO0117 


112 


75 


10168 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


135 


69 


10169 


AF090942 


Homo sapiens 


PRO0657 


149 


67 


10170 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


145 


68 


10171 


AL160371 


Leishmania 


probable (hhv-6) ull02, variant a 
DNA, complete virion genome 


85 


51 


10172 


S58722 


Homo sapiens 


X-linkcd retinopathy protein (C- 
terminal, clone XEH.Sc} 


136 


62 


10173 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


158 


63 


10174 


AF090942 


Homo sapiens 


PRO0657 


75 


44 


10175 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


97 


47 


10176 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 
WO9901020. 


108 


75 


10177 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


168 


80 


10178 


M22332 


Homo sapiens 


unknown protein 


131 


62 


10179 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
sunilar to pogo element 


160 


60 


10180 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


65 


68 


10181 


AFl 19900 


Homo sapiens 


PR02822 


132 


72 


10182 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


114 


56 


10183 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


124 


59 


10184 


AF210651 


Homo sapiens 


NAG 18 


150 


65 


10185 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


109 


46 


10186 


X71442 


Rattus 


ORF 1; putative 


91 


47 
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wo 01/64835 
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Description 






NO: 








Waterman 


Identit 










Score 


y 






norvegicus 








10187 


AK024455 


Homo sapiens 


FLJ00047 protein 


122 


54 


10188 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


82 


72 








6571. 






10189 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


61 








4718. 






10190 


AF220264 


Homo sapiens 


MOST-1 


165 


81 


10191 


Y916I7 


Homo sapiens 


Human secreted protein sequence 


54 


44 








encoded by gene 19 SEQ ID NO:290. 






10192 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


80 








6613. 






10193 


X92485 


Plasmodium 
vivax 


pval 


128 


57 


10194 


X61296 


RattLis 


open reading ftame 2 


79 


72 






norvegicus 








10195 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


116 


62 








terminal, clone XEH.Sc} 






10196 


AF130089 


Homo sapiens 


PRO2550 


144 


62 


10197 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


56 






7214. 






10198 


AF090942 


Homo sapiens 


PRO0657 


133 


76 


10199 


S79304 


Rattus sp. 


cytochrome oxidase subunit I; COX I 


280 


66 


10200 


S61070 


Homo sapiens 


reverse transcriptase homolog=pol 


188 


49 








{retroviral element} 






10201 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


51 








7871. 






10202 


AF068294 


Homo sapiens 


HDCMB45P 


174 


65 


10203 


YI4480 


Homo sapiens 


Fragment of human secreted protein 


174 


To ~ 








encoded by gene 14. 






10204 


U93572 


Homo sapiens 


putative pi 50 


77 


40 


10205 


L27428 


Homo sapiens 


jx-verse^transcnp1:as^ 


133 


50 


10206 


J02621 


Homo sapiens 


high mobihty group protein 14 


94 


57 


10207 


AF 13 0079 


Homo sapiens 


PR02852 


144 


73 


10208 


R95913 


Homo sapiens 


Neural thread protein. 


134 


51 


10209 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


154 


76 








similar to pogo dement 






10210 


AB046048 


Macaca 


unnamed portein product 


162 


60 






fasclcularis 








10211 


AF09093 1 


Homo sapiens 


PRO0483 


128 


79 


10212 


AF130051 


Homo sapiens 


PRO0898 


144 


76 


10213 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


54 






7879. 






10214 


AF130089 


Homo sapiens 


PRO2550 


158 


70 


10215 


G03564 • 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


96 


55 








7645. 






10216 


Ml 5894 


Homo sapiens 


chorionic somatomammotropin 


116 


63 








precursor 






10217 


AK024372 


Homo sapiens 


unnamed protein product 


85 


56 


10218 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


118 


69 


10219 


M37679 


Mus musculus 


Ig heavy chain precursor 


64 


83 


10220 


G03806 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


91 


69 








7887. 






10221 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


47 








7075. 






10222 


Y36505 


Homo sapiens 


Fragment of human secreted protein 


102 


48 








encoded by gene 29. 






10223 


AF090940 


Homo sapiens 


PRO0644 


105 


76 


10224 


U49974 


Homo sapiens 


mariner transposase 


155 


76 


10225 


K02576 


Homo sapiens 


salivary proline-ricJi protein 1 


154 


42 
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SEQID 
NO: 






Description 


Waterman 
Score 


Identit 

y 


10226 


AF293639 


Homo sapiens 


dioxin receptor repressor 


819 


98 


10227 


L27428 


Homo sapiens 


reverse transcriptase 


137 


44 


10228 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 


106 


70 


10229 


U93566 


Homo sapiens 


p40 


206 


SO 


10230 


Y08061 


Homo sapiens 


Human c-myb protein fiagment. 


114 


66 


10231 


AF 11 8082 


Homo sapiens 


PRO 1902 


76 


52 


10232 


AFl 18086 


Homo sapiens 


PRO 1992 


99 


75 


10233 


X53375 


Helianthus 
annuus 


antlier -specific protein SFl 8 


95 


64 


10234 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


156 


72 


10235 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


100 


35 


10236 


L27428 


Homo sapiens 


reverse transcriptase 


244 


50 


10237 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


93 


70 


10238 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


142 


72 


10239 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


149 


80 


10240 


S52010 


Mus sp. 


1st Met is at position 21 


98 


42 


10241 


AL110147 


Homo sapiens 


hypothetical protein 


125 


35 


10242 


X98485 


Plasmodium 


putative 


85 


51 


10243 


S58722 
Y87064 


Homo sapiens 


X-linked retinopathy protein {C- 
teiminal, clone XEH.Sc} 


70 


65 


10244 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


125 


49 


10245 


U49973 


Homo sapiens 


0RF2: function unknown 


150 


81 

56 


10246 


K02576 




salivary proliiie-rich protein 1 


113 


10247 


AF130089 


Homo sapiens 


PRO2550 


139 


73 


10248 


G03790 


Homo sapiens 
Homo sapiens 


Human secreted protein, SEQ ID NO: 

787 1 . 


160 


59 


10249 


U93565 


putative pi 50 


109 


43 


10250 


AFl 30089 


Homo sapiens 


PRO2550 


104 


54 


10251 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8144. 


78 


36 


10252 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8144. 


110 


58 


10253 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


123 


54 


10254 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


95 


57 


10255 


X03145 


Homo sapiens 


pot. ORPV 


76 


63 


10256 


Y01405 


Homo sapiens 


Secreted protein encoded by gene 23 
clone HDPBA48. 


116 


65 


10257 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


113 


68 


10258 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


138 


68 


10259 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo dement 


135 


60 


10260 


G02528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6609. 


95 


59 


10261 


M86246 


Homo sapiens 


EHS-2 


96 


62 


10262 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6953. 


180 


66 


10263 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


62 
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SEQID 
NO: 


Accession No. 


species 


Description 


Srnitli- 

Waterman 

Score 


Identit 

J! 








7871. 






10264 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


313 


84 


10265 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


144 


56 


10266 


Y 14480 


Homo sapiens 


Fragment of iiuman secreted protein 
encoded by gene 14. 


150 


61 


10267 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:125. 


408 


89 


10268 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


74 


26 


10269 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


133 


52 


10270 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 


61 


10271 


X6I048 


Hydra sp. 


mini-collagen 


121 


40 


10272 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


150 


66 


10273 


L24521 


Homo sapiens 


transformation-related protein 


134 


51 


10274 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 


96 


47 


10275 


U59446 


Brassicanapus 


myrosinase-binding protein related 
protein 


115 


39 


10276 


G01480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5561. 


482 


98 


10277 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7881. 


143 


81 


10278 


U93564 


Homo sapiens 


putative p] 50 


130 


46 


10279 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 


123 


44 


10280 


AF090944 


Homo sapiens 


PRO0663 


156 


73 


10281 


AF 13 0079 


Homo sapiens 


PR02852 


146 


75 


10282 


X89401 


Homo sapiens 


ribosomal protein L21 


108 


48 


10283 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7645. 


65 


66 


10284 


Y25821 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 4 1 . 


TeoT 


Ts 


10285 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-N'fP 


308 


43 


10286 


AF090931 


Homo sapiens 


PRO0483 


131 


74 


10287 


Y48480 


Homo sapiens 


Human breast tumour-associated 
protein 25. 


203 


90 


10288 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


116 


44 


10289 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7888. 


160 


64 


10290 


X67863 


Mus musculus 


T2 


147 


37 


10291 


AF229067 


Homo sapiens 


PADI-H protein 


123 


43 


10292 


AF078844 


Homo sapiens 


hqp0376 protein 


125 


59 


10293 


G04072 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
8153. 


123 


37 


10294 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


64 


10295 


G03807 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7888. 


162 


70 


10296 


V00662 


Homo sapiens 


ATPase6 


268 


88 


10297 


AB007861 


Homo sapiens 


KIAA0401 


424 


92 


10298 


M24732 


Homo sapiens 


lamin-lilce protein 


183 


40 


10299 


D90053 


Sus scrofa 


destrin 


123 


48 


10300 


X61047 


Hydra sp. 


mini-collagen 


86 


43 
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SEQID 
NO: 


Accession No. 






Smith- 
Waterman 
Score 


Identit 

y 


10301 


U49973 


Homo sapiens 


ORPl ; MER37; putative transposase 
similar to pogo element 


180 


50 


10302 


AFl 18078 


Homo sapiens 


PRO 1848 


157 


63 


10303 


X55681 


Lycopersicon 
esculenlum 


extensuiCclassI) 


87 


36 


10304 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


105 


62 


10305 


X70944 


Homo sapiens 


PTB-associated splicing factor 


84 


32 


10306 


AF068294 


Homo sapiens 


HDCMB45P 


220 


42 


10307 


S79410 


Mus musculus 


nuclear localization signal binding 

protein 


96 


41 


10308 


AJ005559 


Mus musculus 


SPR2A protein 


64 


42 


10309 


AL353995 


Arabidopsis 
thaliana 


AtA0P4 


98 


38 


10310 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


213 


92 


10311 


Ml 53 86 


Homo sapiens 


gamma-globin 


614 


88 


10312 


V00488 


Homo sapiens 


alpha globin 


574 


90 


10313 


AL049730 


Arabidopsis 
thaliana 


pEARLI 1-like protein 


117 


38 


10314 


AL3 56299 


Homo sapiens 


bK3216D2.1.1 (S- 
adenosylhoraocysteine hydrolase 
(SAHH), isoform 1) 


256 


70 


10315 


AF194537 


Homo sapiens 


NAG13 


113 


63 


10316 


S58722 


Homo sapipns 


X- linked retinopathy protein {C- 
terminal, clone XEH.Bc} 


99 


62 


10317 


AF090931 


Homo sapiens 


PRO0483 


86 


68 


10318 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


102 


77 


10319 


L29219 


Homo sapiens 


clkl; putative j 341 


100 


10320 


V00488 


Homo sapiens 


alpha globin 314 


89 


10321 


AJ223953 


Homo sapiens 


hPTTG 


354 


89 


10322 


X79389 


Homo sapiens 


glutathione transferase Tl 


78 


75 


10323 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


101 


66 




AFl 16715 


Homo sapiens 


PR02829 


128 


73 


10325 


AFl 16619 


Homo sapiens 


PRO 1051 


125 


77 


10326 


Y14318 


Homo sapiens 


peroxisomal ABC-transporter 


249 


100 


10327 


U73844 


Homo sapiens 


ESE-la 


185 


90 


10328 


AB051901 


Homo sapiens 


VDUPl 


132 


83 


10329 


AL163300 


Homo sapiens 


human ubiquitin conjugating enzyme 
02 EC 6.3,2.19. 


720 


100 


10330 


AF117237 


Homo sapiens 


prefoldinsubunit2 


729 


100 


10331 


D42054 


Homo sapiens 


KIAA0092 gene product is distantly 
related to smooth muscle myosin. 


352 


84 




AF038965 


Homo sapiens 


26S proteasome ATPase subunit 


431 


100 


10333 


Y94960 


Homo sapiens 
Brassica napus 


Human secreted protein clone mil 17_1 
protein sequence SEQ ID NO:I26. 


399 


96 


10334 


X60376 


proline-rich protein 


114 


31 


10335 


M34427 


Homo sapiens 


T-plastin 


343 


91 


10336 


V00488 


Homo sapiens 


alpha globin 


445 


88 


10337 


M15386 


Homo sapiens 


gamma-giobin 


402 


85 


10338 


X61123 


Homo sapiens 


BTGl 


367 


90 


10339 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


278 


87 


10340 


A03992 


synthetic 
construct 


growth hormone 


145 


61 


10341 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


57 
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SEQID 

NO; 


Accession No. 




Description 


Smitli- 
Waterman 


Identit 

y 








451g. 






10342 


U02680 


Homo sapiens 


protein tyrosine kinase 


140 


92 


10343 


AF 180681 


Homo sapiens 


guanine nucleotide exchange factor 


1425 


88 


10344 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


123 


69 


10345 


V00488 


Homo sapiens 


alpha globin 


318 


93 


10346 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (PO) 


329 


98 


10347 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


100 


66 






Homo sapiens 


pigment epilhelium-derived factor 


359 


74 


10349 


G01790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5871. 


85 


66 


10350 


V00488 


Homo sapiens 


alpha globin 


361 


90 


10351 


U93567 


Homo sapiens 


p40 


203 


84 


10352 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


253 


85 


10353 


AFl 94537 


Homo sapiens 


NAG13 


196 


41 


10354 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


297 


81 


10355 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


144 


65 


10356 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


94 


54 


10357 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


153 


68 


10358 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


136 


53 


10359 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


95 


77 


10360 


X61296 


Rattus 
norvegicus 


open reading frame 2 


87 


40 


10361 


L24521 


Homo sapiens 


transformation-related protein 


103 


32 


10362 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


101 


55 


10363 


Y95028 


Homo sapiens 


Human clone vp7 1 0RF2, SEQ ID 
NO:128. 


89 


80 


10364 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5066. 


101 


58 


10365 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


322 


85 


10366 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
ferminal, clone XEH.8c} 


101 


59 


10367 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


125 


53 


10368 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


120 


59 


10369 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


60 


83 


10370 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


169 


81 






Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


80 


75 


10372 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


60 


83 


10373 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


89 


77 


10375 


AF130089 


Homo sapiens 


PRO2550 


139 


78 


10376 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


62 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Waterman 




Identit 

j: 








7874. 






10377 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


96 


51 


10378 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


132 


58 


10379 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


117 


71 


10380 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


125 


54 


10381 


U93569 


Homo sapiens 


putative pi 50 


160 


60 


10382 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


140 


77 


10383 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6561. . 


83 


71 


10384 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 

protein 


137 


63 


10385 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4413. 


67 


50 


10386 


X62691 


Homo sapiens 


ribosomal protein homologous to yeast 
S24 


117 


58 


10387 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


141 


45 


10388 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


153 


70 


10389 


U93570 


Homo sapiens 


putative pi 50 


112 


58 


10390 


AF090942 


Homo sapiens 


PRO0657 


126 


68 


10391 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


123 


64 


10392 


AF 130089 


Homo sapiens 


PRO2550 


144 


71 


10393 


U40739 


Homo sapiens 


cyclin C 


464 


86 


10394 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


151 


75 


10395 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


137 


55 


10396 


W88947 


Homo sapiens 


Polypeptide fi-agment encoded by gene 
118. 


165 


66 


10397 


AF090930 


Homo sapiens 


PRO0478 


133 


80 


10398 


AK024455 


Homo sapiens 


FLJ00047 protein 


153 


62 


10399 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


142 


69 


10400 


Y 144 82 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


73 


51 


10401 


AB030033 


Dictyostelium 
discoideum 


AmiB 


82 


33 


10402 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


113 


63 


10403 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


148 


77 


10404 


AK024455 


Homo sapiens 


FLJ00047 protein 


138 


71 


10405 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


128 


59 


10406 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


76 


10407 


AF090942 


Homo sapiens 


PRO0657 


105 


64 


10408 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEII.Sc} 


93 


54 


10409 


J02621 


Homo sapiens 


high mobility group protein 14 


140 


64 


10410 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


87 


46 


104U 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


56 
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SEQID 


Accession No. 




Description 






NO: 








Waterman 


Identit 










Score 


y 








7879. 






10412 


S58722 


Homo sapiens 


X-linlced retinopathy protein {C- 


79 


66 








teimfna], clone XEH.Sc} 






10413 


AF 13 0079 


Homo sapiens 


PR02852 


184 


90 


10414 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 


237 


77 








SEQ ID NO:220. 






10415 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


122 


64 








gene 44 clone HTDAD22, 






10416 


AFC 16099 


Mus musculus 


endonuclease/reverse transcriptase 


89 


78 


10417 


S80864 


Homo sapiens 


cytochrome c-like polypeptide 


184 


52 


10418 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 


131 


75 








human tumor suppressor gene ZAP-1. 






10419 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


90 








7068. 






10420 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


59 








6588. 






10421 


AF090942 


Homo sapiens 


PRO0657 


154 


57 


10423 


L78669 


Homo sapiens 


CoxII/D-loop DNA fiision protein 


106 


90 


10424 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


105 


75 








protein 






10425 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


130 


64 






brucei 


protein rha la. 






10426 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


122 


64 








gene 44 clone HTDAD22. 






10427 


AF068294 


Homo sapiens 


HDCMB45P 


256 


56 


10428 


AF194537 


Homo sapiens 


NAG 13 


182 


54 


10429 


AF09093I 


Homo sapiens 


PRO0483 


108 


79 


10430 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


68 








7795. 






10431 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma DI 


127 


87 








protein 






^432"" 


X67920 


Homo sapiens 


tryptophanyl-tRNA synthetase 


138 


84 


10433 


Y 86305 


Homo sapiens 


Human secreted protein HLYES38, 


294 


78 








SEQ ID NO;220. 






10434 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


127 


87 








protein 






10435 


G 02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


75 








6571. 






10436 


S79410 


Mus musculus 


nuclear localization signal binding 


117 


72 








protein 






10437 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


67 








4658. 






10438 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


140 


69 








RHAla(AF078683) 






10439 


AF130089 


Homo sapiens 


PRO2550 


106 


80 


10440 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 


331 


83 








SEQIDNO:220. 






10441 


W94294 


Homo sapiens 


Human phosphatidylinositol transfer 


143 


96 








protein ganmia. 






10442 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


121 


87 








RHAIa(AF078683) 






10443 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


127 


87 








protein 






10444 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


67 








7519. 






10445 


U93567 


Homo sapiens 


p40 


262 


69 


10446 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


112 


92 








protein 






10447 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


120 


83 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
_Score 


Mentit 

y 








protein 






10448 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


104 


68 


10449 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 

protein 


115 


80 


10450 


AF095770 I Homo sapiens 


PTH-responsive osteosarcoma Dl 


117 


80 


10451 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


104 


80 


10452 


AF130089 


Homo sapiens 


PRO2550 


315 


68 


10453 


G02530 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6611. 


110 


71 


10454 


G00689 


Homo sapiens ' 


Human secreted protein, SEQ ID NO: 
4770. 


86 


34 
T5 


10455 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


113 


10456 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7637. 


180 


83 


10457 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


119 


96 


10458 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


133 


58 


10459 


Y41710 


Homo sapiens 


Human PR0618 protein sequence. 


894 


71 


10460 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327. 


84 


71 


10461 


K02401 


Homo sapiens 


chorionic somatomammotropin 


471 


88 


10462 


G02n3 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6194. 


241 


73 


10463 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492, 


93 


68 


10464 


U 15647 


Mus musculus 


reverse transcriptase 


110 


47 


10465 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


97 


57 


10466 


G02754 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6835. 


145 


93 


10467 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


137 


57 


10468 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6571. 


62 


55 


10469 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


93 


86 


10470 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


119 


63 


10471 


AF068294 


Homo sapiens 


HDCMB45P 


117 


43 


10472 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 

fragment, SEQ ID NO:490. 


335 


72 


10473 


G03790 


Homo sapiens 


Hiiinan secreted protein, SEQ ID NO: 
7871. 


121 


62 


10474 


X79238 


Homo sapiens 


ribosomal protein L30 


126 


58 


10475 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10476 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


106 


61 


10477 


AC006693 


Caenorhabditis 
elegans 


Hypothetical protein W02H5.e 


119 


41 


10478 
T0479~ 


G02485 
^Fi30079 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6566. 


87 


66 


Homo sapiens 


PR02852 


97 


54 


10480 


AK024455 


Homo sapiens 


FLJ00047 protein 


135 


69 
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SEQID 1 Accession No. 
NO: i 


species 


Description 


Smith- 
Waterman 


Identit 

y 


10481 1 K0240I 


Homo sapiens 


chorionic somatomammotropin 


463 


80 


10482 j G00398 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4479. 


69 


83 


10483 1 G03438 


Homo sapiens 


Htmian secreted protein, SEQ ID NO: 
7519. 


132 


61 


10484 1 R13556 

1 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


119 


80 


10485 AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 

protein 


90 


54 


10486 1 U79260 


Homo sapiens 


unknown 


105 


90 


10487 j Y87156 

i 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


243 


53 


10488 j G03807 


Homo sapiens 


Htmian secreted protein, SEQ ID NO: 
7888. 


90 


78 


10489 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


120 


64 


10490 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


123 


60 


10491 ! Y48576 


Homo sapiens 


Human breast tumour-associated 
protein 37. 


471 


84 


10492 1 G00577 


Homo sapiens 


Himian secreted protein, SEQ ID NO: 
4658. 


176 


77 


10493 S79410 

! 


Mus musculus 


nuclear localization signal binding 
protein 


135 


64 


10494 1 G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


123 


44 


10495 1 G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


86 


46 


10496 1 Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


145 


40 


10497 j AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


111 


50 


10498 ! S58722 

1 


Homo sapiens 


X-linked retinopatliy protein {C- 
terrainal, clone XEH.Sc} 


96 


73 


10499 ( AL 1 3284 1 


Caenorliabditis 
elegans 


Y15E3A.3 


142 


96 


10500 i Y45382 

1 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


120 


49 


10501 : AF220264 


Homo sapiens 


MOST-1 


93 


83 


10502 ! G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


116 


57 


10503 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


100 


70 


10504 


AF090931 


Homo sapiens 


PRO0483 


88 


80 


10505 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


89 


72 


10506 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


121 


81 


10507 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4473. 


188 


82 


10508 


AF130050 


Homo sapiens 


PRO0872 


101 


74 


10509 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


68 


10510 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 

protem 


120 


96 


10511 


AK026249 


Homo sapiens 


unnamed protein product 


97 


68 


10512 


X95276 


Plasmodium 
falciparam 


ORF91 


75 


29 


10513 1 X17093 


Homo sapiens 


leukocyte antigen F 


397 


91 


10514 [ W48351 


Homo sapiens 


Human breast cancer related protein 


115 


66 
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SEQID 
NO: 


Accession No, 


Species 


Description 


Waterman 


Wentit 

y 








BCRB2. 






10515 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


154 


68 


10516 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


86 


47 


10517 


AF090944 


Homo sapiens 


PRO0663 


118 


48 


10518 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
temiinal, clone XEH.Sc} 


131 


77 


10519 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


230 


58 


10520 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10521 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


130 


40 


10522 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10523 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4413. 


139 


69 


10524 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


125 


100 


10525 


J04495 


Macaca mulatta 


alpha-globin 


144 


90 


10526 


G02535 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6616. 


105 


61 


10527 


AF130089 


Homo sapiens 


PRO2550 


132 


67 


10528 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
1DNO:1051. 


90 


53 


10529 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


112 


75 


10530 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


157 


79 


10531 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


59 


10532 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4409. 


121 


63 


10533 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


112 


43 


10534 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


131 


65 


10535 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4433. 


95 


60 


10536 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


157 


79 


10537 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4473. 


199 


90 


10538 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 


113 


75 


10539 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4518. 


120 


65 


10540 


AF130089 


Homo sapiens 


PRO2550 


195 


67 


10541 


Y60522 


Homo sapiens 


Human normal bladdertissue EST 
encoded protein 194. 


73 


72 


10542 


G01175 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
5256. 


83 


100 


10543 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


81 


54 


10544 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


147 


75 


10545 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


109 


66 
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SEOID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Wentit 

y 


10546 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


123 


46 


10547 


AF130089 


Homo sapiens 


PRO2550 


133 


63 


10548 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ IDNO;220. 


246 


80 


105'i9 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


122 


51 


10550 


J02459 


bacteriopliage 
lambda 


J (taii:host specificity, 1132) 


772 


97 


10551 


G00328 


Homo sapiens 


Human secreted protein, SKQ ID NO: 

4409. 


158 


80 


10552 


G00328 


Homo sapiens ■ 


Human secreted protein, SEQ ID NO: 
4409. 


123 


50 


10553 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


184 


71 


10554 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


107 


60 


10555 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


140 


69 


10556 


U49973 


Homo sapiens 


ORF2: function unknown 


9] 


70 


10557 


AF 15 1850 


Homo sapiens 


CGI-92 protein 


459 


63 


10558 


X66285 


IMus musculus 


HCl ORF 


92 


30 


10559 


S58722 


Homo sapiens 


X-iinked retinopathy protein {C- 
terminal, clone XEH.Sc} 


127 


67 


i 0560 
l056r 


AF130050 

"¥91577 


Homo sapiens 


PRO0872 


100 


71 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


335 


80 


10562 


W4835 1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


126 


58 


10563 


AF010144 


Homo sapiens 


neuronal tliread protein AD7c-NTP 


127 


76 


10564 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4413. 


158 


62 


10565 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


118 


77 


10566 


V00488 


Homo sapiens 


alpha globin 


464 


89 


10567 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


162 


81 


10568 


S58722 


Homo sapiens , 


X-linJted retinopathy protein {C- 

tei-minal, clone XEH.Sc} 


113 


72 


10569 


G03040 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7121. 


127 


47 


10570 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


125 


65 


10571 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


79 


88 


10572 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


158 


80 


10573 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


138 


56 


10574 


AF 119900 


Homo sapiens 


PR02822 


154 


55 


10575 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


87 


72 


10576 


G034I5 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


131 


66 


10577 


G03112 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


80 


47 


10578 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


121 


53 


10579 


Y91577 


Homo sapiens 


Human secreted protein sequence 


335 


83 



416 
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SEQID 


AccessionNo. 


species 


Description 


Smith- 

Wateiman 

Score 


Identit 

y 








encoded by gene 2 SEQ ID NO:250. 






10580 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


104 


47 


10581 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


324 


68 


10582 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670, 


129 


66 


10583 


AF090895 


Homo sapiens 


PRO0117 


137 


63 


10584 


L27428 


Homo sapiens 


reverse transcriptase 


98 


50 


10585 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4499. 


101 


68 


10586 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


108 


82 


10587 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


166 


76 


10588 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 

protein 


100 


70 


10589 


X03145 


Homo sapiens 


pot. ORF I 


120 


43 


10590 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


139 


70 


10591 


AF144054 


Homo sapiens 


apoptosis related protein APR-4 


92 


50 


10592 


AK024455 


Homo sapiens 


FLJ00047 protein 


111 


75 


10593 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


143 


58 


10594 


G03636 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7717. 


116 


54 


10595 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


196 


57 


10596 


AF130089 


Homo sapiens 


PRO2550 


140 


66 


10597 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


115 


52 


10598 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


79 


71 


10599 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


146 


79 

"67 


10600 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


10601 


AF090895 


Homo sapiens 


PRO0117 


106 


68 


10602 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
termmal, dmie XEH.Sc} 


83 


72 


10603 


AF090930 


Homo sapiens 


PRO0478 


119 


70 


10604 


AF095770 


Homo sapiens 


PrH-responsive osteosarcoma D I 
protein 


120 


83 


10605 


AF132949 


Homo sapiens 


CGI- 15 protein 


114 


96 


10606 


Y95028 


Homo sapiens 


Human clone vp7_l 0RF2, SEQ ID 
NO: 128. 


176 


77 


10607 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


155 


74 


10608 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


127 


63 


10609 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


60 


10610 


AF130089 


Homo sapiens 


PRO2550 


86 


90 


10611 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


149 


72 


10612 


L27428 


Homo sapiens 


reverse transcriptase 


311 


52 


10613 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


83 


84 


10614 


D00570 


ISdus musculus 


open reading Same (196 AA) 


116 


69 
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SEQ ID 


Accession No. 


Species 


Description 


Smitli- 
Score 


Idcntit 

y 


10615 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ IDNO:490. 


290 


62 


10616 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


125 


51 


10617 


AFl 16695 


Homo sapiens 


PR02221 


131 


56 


1061 S 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


iOl 


48 


10619 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


104 


65 


10620 


G00437 


Homo sapiens 


Human secreted protein, SEQ ED NO; 
4518. 


112 


58 


10621 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


336 


85 


10622 


AFl 18082 


Homo sapiens 


PRO 1902 


80 


73 


10623 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


136 


50 


10624 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


109 


67 


10625 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


92 


75 


10626 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


121 


62 


10627 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


104 


52 


10628 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


137 


66 


10629 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


129 


67 


10630 


U93569 


Homo sapiens 


putative pi 50 


104 


52 




AF090895 


Homo sapiens 


PRO0117 


136 


70 


10632 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


89 


57 


10633 


AFl 13685 


Homo sapiens 


FROG 974 


130 


64 


AK024455 


Homo sapiens 


FLJ00047 protein 


103 


63 


10635 


M24732 


Homo sapiens 


lamin-like protein 


80 


65 


10636 


U49974 


Homo sapiens 


mariner transposase 


116 


72 


10637 


AK024455 


Homo sapiens 


FLJ00047 protein 


158 


67 


10638 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


121 


64 


10639 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


57 


10640 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


72 


10641 


L27428 


Homo sapiens 


reverse transcriptase 


150 


54 


10642 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


68 


81 


10643 


U93565 


Homo sapiens 


putative pl 50 


282 


57 


10644 


AF090942 


Homo sapiens 


PRO0657 


153 


74 


10645 


Y48333 


Homo sapiens 


Human prostate cancer-associated 
protein 30. 


196 


90 


10646 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


242 


73 


10647 


U79260 


Homo sapiens 


unlmown 


95 


77 


10648 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


149 


64 


10649 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


158 


87 


10650 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


339 


89 
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SEQID 
NO: 


Accession No. 


species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


10651 


U15183 


Mycobacterium 


proline-rich antigen 


102 


37 


10652 


AF2 17536 


Homo sapiens 


truncated mevalonate kinase 


138 


70 


10653 


AF 119855 


Homo sapiens 


PRO 1847 


74 


100 


10654 ! Ml 5530 


Homo sapiens 


B-cell growth factor 


94 


52 


10655 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327. 


144 


54 


10656 


M15317 


Plasmodium 
lophurae 


histidine-rich protein 


124 


51 


10657 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613, 


143 


66 


10658 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


114 


77 


10659 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


172 


55 


10660 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


150 


78 


10661 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


160 


81 


10662 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


151 


71 


10663 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409, 


160 


81 


10664 


AK000496 


Homo sapiens 


unnamed protein product 


140 


77 


10665 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


147 


75 


10666 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409, 


160 


81 


10667 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409, 


149 


78 


10668 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


303 


78 


10669 


D00526 


Ratlus 
norvegicus 


L-gulono-gamma-lactone oxidase 


108 


73 


10670 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


143 


65 


10671 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


409 


84 


10672 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

4770. 


110 


65 


10673 


G00423 


Homo sapiens 


Human secreted protein, SEQ ED NO; 

4504. 


123 


43 


10674 


G03787 


Homo sapiens 


Human secreted protein. SEO ID NO: 
7868, 


85 


36 


10675 


U93565 


Homo sapiens 


putative pl50 


115 


45 


10676 


AF109907 


Homo sapiens 


SI 64 


205 


65 


10677 


G02620 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6701. 


118 


73 


10678 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


106 


62 


10679 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 

protein 


127 


87 


10680 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:195. 


248 


54 


10681 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


88 


52 


10682 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


126 


77 


10683 


S58722 


Homo sapiens 


X-linked retinopalliy protein {C- 


123 


72 



419 
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SEQID 
NO: 


Accession No. 




Description 


Waterman 
Score 


Wcntit 

y 






teiminal, clone XEH.Sc} 






10684 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Scj 


123 


72 


10685 


W54966 


Homo sapiens 


Syntlietic human type HI collagen 
SYN-C3. 


99 


40 


10686 1 AF068294 


Homo sapiens 


HDCMB45P 


182 


54 


10687 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


104 


53 


10688 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


277 


76 


10689 


G00233 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4314. 


118 


65 


10690 G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


143 


71 


10691 G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4425. 


97 


81 


10692 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


160 


68 


10693 


AF 130089 


Homo sapiens 


PRO2550 


133 


56 


10694 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


129 


71 


10695 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


123 


71 


10696 
10697 ' 


G03133 
~G00648 


Homo sapiens 
Homo sapiens 


Human secreted protein, SEQ ID NO: 

7214. 

Human secreted protein, SEQ ID NO: 
4729. 


161 


45 


88 


56 


10698 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


75 


71 


10699 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4598. 


92 


38 


10700 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6619. 


132 


55 


10701 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4718. 


118 


54 


10702 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


66 


73 


10703 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


124 


44 


10704 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


113 


78 


10705 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


119 


50 


10706 


AK025116 


Homo sapiens 


unnamed protein product 


160 


84 


10707 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10708 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10709 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


230 


76 


10710 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


96 


45 


10711 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


174 


70 


10712 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


127 


50 


10713 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


148 


45 


10714 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


151 


80 
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NO: 








Waterman 
Score 


Identit 

y 


10715 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4497. 


157 


67 


10716 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


109 


68 


10717 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


151 


52 


10718 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4718. 


84 


80 


10719 


M19973 


Rattus 
norvegicus 


cytochrome PB24 


104 


45 


10720 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


103 


72 


10721 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


161 


52 


10722 


L27428 


Homo sapiens 


reverse transcriptase 


159 


42 


10723 


S58722 


Homo sapiens 


X-iinlied retinopathy protein {C- 
terminal, clone XEH.8c} 


127 


75 


10724 


Y00311 


Homo sapiens 


Human secreted protein encoded by 
gene 54. 


84 


50 


10725 


AF090895 


Homo sapiens 


PRO0117 


170 


71 


10726 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


105 


42 


10727 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


123 


38 


10728 


1/93563 


Homo sapiens 


putative pi 50 


162 


64 


10729^ 


X14576 


Murine leukemia 


gag fusion protein 


172 


41 


10730 


AF090895 


Homo sapiens 


PRO0117 


151 


60 


10731 


AFl 19851 


Homo sapiens 


PR01722 


127 


54 


10732 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4658. 


152 


61 


10733 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4523. 


156 


76 


10734 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


135 


73 


10735 


G03789 


Homo sapiens 


Human secreted protein, SRQ ID NO: 
7870. 


151 


65 


10736 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


141 


75 


10737 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


74 


10738 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


124 


69 


10739 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


91 


60 


10740 


AFl 30051 


Homo sapiens 


PRO0898 


90 


63 


10741 


AF090895 


Homo sapiens 


PRODI 17 


129 


55 


10742 


AFl 19900 


Homo sapiens 


PR02822 


164 


85 


10743 


AJ 005560 


Mus musculus 


SPR2B protein 


88 


35 


10744 


Z79996 


Homo sapiens 


CB33F2.1 (PUTATIVE novel protein 
similar tO-C-terminal parts of APOL 
(apolipoprotein L) and TNF-inducible 
protein CO 12-1) 


185 


100 


10745 


G03140 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7221. 


94 


88 ■ 


10746 


X82208 


Homo sapiens 


beta-centractin 


134 


96 


10747 


D49744 


Mus musculus 


famesyltransferase alpha subunit 


228 


60 


10748 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


127 


50 


10749 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


62 



421 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 








Smith- 
Waterman 


Identit 

y 








4523. 






10750 


AF130089 


Homo sapiens 


PR0255Q 


147 


62 


10751 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4484. 


108 


71 


10752 


D 17652 


Homo sapiens 


HBpl5A.22 


416 


72 


10753 


AB002389 


Homo sapiens 


KIAA0391 


486 


86 


10754 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7174. 


238 


73 


10755 


S79410 


Mus musculus 


nuclear localization signal binding 


103 


57 


10756 


AF130089 


Homo sapiens 


PRO2550 


104 


72 


10757 


X75421 


Hypocrea 
jecorina 


actin 


197 


66 


10758 


AF298594 


Nicotiana alata 


arabinogalactan protein 


107 


30 


10759 


U94832 


Homo sapiens 


KSRP 


95 


40 


10760 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


179 


82 


10761 


Y86573 


Homo sapiens 


Human gene 9 1 -encoded protein 
fragment, SEQ ID NO:490. 


271 


80 


10762 


Z81528 


Caenorhabditis 
elegans 


predicted using Genefmder-contains 
similarity to Pfam domain: PF00939 
(Sodium;sulfate symporter 
transmembrane region), Score=2.8, E- 
value=1.2, N=l ; PF01757 (Domain of 
unknown function), Score=556.0, E- 
value=8.1e-164,N=l 


103 


27 


10763 


V00662 


Homo sapiens 


ATPase 6 


593 


83 


10764 


A06977 


Homo sapiens 


albumin 


610 


81 


10765 


X67863 


Mus musculus 


T2 


109 


46 


10766 


AF137030 


Homo sapiens 


transmembrane protein 2 


620 


96 


10767 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


101 


66 


10768 


X15324 


Homo sapiens 


angiotensinogen 


529 


89 


10769 


M64983 


Homo sapiens 


fibrinogen beta cliain 


604 


87 


10770 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


381 


83 


10771 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 

caldesmon 


133 
^46 


73 


10772 


M64110 


Homo sapiens 


10773 


AB014566 


Homo sapiens 


KIAA0666 protein 


145 


78 


10774 


AL138810 


Homo sapiens 


dJ179L10.2 (Similar to CGI-29 

protein) 


216 


90 


10775 


AJ223953 


Homo sapiens 


liPTTG 


353 


82 


10776 


U92288 


Human 
herpesvirus 6 


HNl 


113 


39 


10777 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


143 


78 


10778 


X12796 


Bos taurus 


HMGl protein (AA 1-215) 


435 


96 


10779 


U86782 


Homo sapiens 


26S proteasome-associated padl 

homolog 


477 


89 


10780 


M60047 


Homo sapiens 


heparin binding protein 


514 


81 


10781 


yi4480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


99 


48 


10782 


AF130079 


Homo sapiens 


PR02852 


175 


48 


10783 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


115 


64 


10784 


Mil 147 


Homo sapiens 


ferritin light chain 


439 


83 


10785 


AF090931 


Homo sapiens 


PRO0483 


145 


55 


10786 


AF072441 


Homo sapiens 


calcineurin binding protein cabin 1 


498 


90 


10787 


AFl 16715 


Homo sapiens 


PR02829 


127 


88 



422 



wo 01/64835 PCT/llSOl/04927 



SEQ ID 
NO: 


Accession No. 


species 


Descripdon 


Sniith- 

Waterraan 

Score 


Idcntit 

y 


10788 


AF201949 


Homo sapiens 


60S ribosomal protein L30 isolog 


631 


91 


10789 


AC005253 


Homo sapiens 


R26445 1 


403 


84 


10790 


M15386 


Homo sapiens 


gamma-globin 


617 


82 


10791 


Y95005 


Homo sapiens 


Human secreted protein vc57_l, SEQ 
ED NO:50. 


716 


95 


10792 


G01285 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5366. 


165 


52 


10793 


X67863 


Mus musculus 


T2 


128 


50 


10794 


Y82326 


Homo sapiens 


Human arginase I SEQ ID NO:17. 


507 


80 


10795 


Y00755 


Homo sapiens 


extracellular matrix protein BM-40 
(AA 1 - 303) 


559 


87 


10796 


AFl 19851 


Homo sapiens 


PRO 1722 


117 


71 


10797 


AF090930 


Homo sapiens 


PRO0478 


146 


63 


10798 


V00488 


Homo sapiens 


alpha globin 


509 


84 


10799 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8153. 


131 


42 


10800 


X83618 


Homo sapiens 


hydroxymethylglutaryl-CoA synthase 


422 


78 


10801 


AF132952 


Homo sapiens 


CGI- 18 protein 


341 


88 


10802 


M17375 


Gallus gallus 


type XTl collagen 


88 


42 


10803 


K02401 


Homo sapiens 


chorionic somatomammotropin 


436 


85 


10804 


K02401 


Homo sapiens 


chorionic somatomammotropin 


410 


86 


1 0805 


U6) 232 




cofactor B 


359 


77 


10806 


D86438 


Homo sapiens 


Ibal (ionized calcium binding adapter 
molecule 1) 


406 


77 


10807 


AFl 18082 


Homo sapiens 


PRO 1902 


105 


38 


10808 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


148 


60 


10809 


AF090942 


Homo sapiens 


PRO0657 


163 


52 


10810 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


116 


86 


10811 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


104 


45 


10812 


X61296 


Rattus 
norvegicus 


open reading frame 2 


156 


41 


10813 


U82303 


Homo sapiens 


unknown 


103 


64 


10814 


G01478 


Homo sapiens 


Human secreted protein, SBQ ID NO: 
5559. 


75 


77 


10815 


U52077 


Homo sapiens 


mariner transposase 


131 


85 


10816 


D38112 


Homo sapiens 


ATPase subunit 6 


251 


74 


10817 


G00397 


Homo sapiens 


Human secreted protein, SBQ ID NO: 
4478. 


103 


80 


10818 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6913. 


102 


52 


10819 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4442. 


94 


66 


10820 


Y21811 


Homo sapiens 


CPF polypeptide 36PRO. 


200 


62 


10821 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 


161 


60 


10822 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


163 


70 


10823 


U22961 


Homo sapiens 


similai' to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


337 


63 


10824 


G03800 




Human secreted protein, SEQ ID NO: 
7881. 


122 


61 


10825 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


122 


68 


10826 


J03634 


Homo sapiens 


erythroid differentiation protein 
precursor 


222 


68 



423 



w o 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


DescripHon 


Smith- 
Waterman 
Score 


Wen tit 

y 






Homo sapiens 


NADH dehydrogenase subunit 5 






10828 


V00488 


Homo sapiens 


alpha globin 


132 


86 


10829 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


131 


70 


10830 


AF090942 


Homo sapiens 


PRO0657 


122 


43 




AF09093 1 


Homo sapiens 


PRO0483 






10832 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


138 


63 


10833 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7887. 


88 


57 


10834 


AF118082 


Homo sapiens 


PRO 1902 


109 


62 


10835 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


44 


10836 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


152 


86 




G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 










Homo sapiens 


mariner transposase 






10839 


M10546 


Homo sapiens 


cytochrome oxidase I 


341 


83 


10840 


Y44361 


Homo sapiens 


Human cell cycle regulation protein-2. 


144 


60 


10841 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


56 


10842 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


565 


80 


10843 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


118 


100 


10844 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


157 


81 


10845 


AF014889 


Homo sapiens 


NADH dehydrogenase subunit 2 


416 


85 


10846 


AF194537 


Homo sapiens 


NAG 13 


177 


70 


10847 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


108 


40 


10848 


Ml 5530 


Homo sapiens 


B-cell growth factor 


92 


"67~" 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 




52 


10850 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla (AF078683) 


121 


76 


10851 


AF116712 


Homo sapiens 


PR027j8 


84 


57 




V00662 


Homo sapiens 








10853 


U01849 


Ti-ypanosoma 


ORFl 


93 


37 


10854 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


93 


61 


- 


AF130089 


Homo sapiens 


PRO2550 






10856 


AF130079 


Homo sapiens 


PR02852 


154 


75 




AK025 116 


Homo sapiens 


unnamed protein product 






10858 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


71 


10859 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7784. 


97 


56 


10860 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


207 


78 


10861 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


83 


50 


10862 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6588. 


126 


46 


10863 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


148 


63 


10864 


B01372 


Homo sapiens 


Neuron-associated protein. 


106 


38 


10865 


U38964 


Homo sapiens 


hPMSR2 


152 


68 


10866 


AF119855 


Homo sapiens 


PR01847 


142 


67 



424 



w o 01/64835 



PCT/llSOl/04927 



SEOID 1 Accession No. 
NO: ' 

\ 


Species 


Description 


Waterman 
Score 


Idcntit 


10867 j GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


99 


75 


10868 


U79260 


Homo sapiens 


unlcnown 


94 


47 


10869 


R07057 


Homo sapiens 


Smaller liepatocellular oncoprotein 
(hhcm) gene preoduct. 


15] 


52 


10870 


AJO 10025 


Homo sapiens 


unr-interacting protein 


100 


64 


10871 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7214. 


110 


44 


10872 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


136 


71 


10873 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


126 


61 


10874 


AF 130089 


Homo sapiens 


PRO2550 


115 


88 


10875 ,Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


114 


59 


10876 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


83 


48 


10877 


AF 118086 




PRO 1992 


145 


82 


10878 


AFl 16712 


Homo sapiens 


PR02738 


110 


55 


10879 


G03172 


Homo sapiens 


Human seca-eted protein, SEQ ID NO: 

7253. 


123 


77 


10880 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


111 


61 


10881 1 Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


106 


51 


10882 ( G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7881. 


107 


67 


10883 J G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


101 


65 


10884 i U79260 


Homo sapiens 


unknown 


111 


56 


10885 ! W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


153 


60 


10886 j G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


107 


57 


10887 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


148 


80 


10888 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4493. 


142 


58 


10889 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4658. 


156 


67 


10890 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


105 


52 


10891 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4433. 


187 


65 


10892 


A67510 


Mus musculus 


MUS MUSCULUS GENOMIC DNA 

CONTAINING N ALLELE OF FVl 

GENE. 


120 


37 


10893 


AJ223410 


Homo sapiens 


EBIl-ligand chemokine 




77 


10894 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


475 


83 


10895 


V00662 


Homo sapiens 


ATPase 6 


316 


95 


10896 


V00662 


Homo sapiens 


ATPase 6 


427 


82 


10897 


U09500 


Homo sapiens 


cytochrome b 


403 


92 


10898 


U93564 


Homo sapiens 


p40 


215 


91 


10899 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


69 


77 


10900 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6619. 


159 


60 


10901 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


93 


43 



425 



wo 01/64835 PCT/llSOl/04927 



SEQ ID 
NO: 






Description 


Smith- 
Waterman 
Score 


Identit 

y 


10902 


M10546 


Homo sapiens 


cytochrome oxidase I 


302 


66 




G00447 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4528, 




58 




AK024455 


Homo sapiens 


FLJ00047 protein 




74 


10905 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


64 


10906 


AF090942 


Homo sapiens 


PRO0657 


149 


74 


10907 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


167 


91 


10908 


D38112 


Homo sapiens 


ATPase subunit 6 


355 


68 


10909 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


145 


72 


10910 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


1 14 


72 


10911 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


144 


53 


10912 


V00662 


Homo sapiens 


ATPase 6 


289 


69 


10913 


X62677 


Oiyctolagus 
cuniculus 


retrovirus related reverse transcriptase 


93 


51 


10914 


V00662 


Homo sapiens 


cytochrome oxidase III 


509 


86 


10915 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


459 


80 


10916 


Z28029 


Saccharomyces 
cerevisiae 


ORF YKL030W 


96 


32 


10917 


GOB 102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7183. 


94 


56 


10918 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


114 


36 


10919 


¥86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


117 


88 


10920 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


103 


39 


10921 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


99 


95 


10922 1 G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


75 


63 


10923 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


125 


85 


10924 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


140 


73 


10925 


G037S7 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


77 


38 


10926 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


111 


58 


10927 


Y144S2 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


139 


75 


10928 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


128 


78 


10929 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


77 


57 


10930 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


93 


53 


10931 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


83 


54 


10932 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


262 


84 


10933 


G037S6 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


113 


72 


10934 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


120 


62 


10935 


AP00p616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


116 


69 



426 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 

y 


■ — 







— : 

Homo sapiens 


pla e e glycopro em — precursor 


~TW 

"t~^ ~ 





10937 


AB028893 


Homo sapiens 


nbosomal protein SI 1 


279 


53 


10939 


¥27868 


Homo sapiens 


Human secreted protein encoded by 


127 


55 




— 











10940 


Aril 9900 


— : 

Homo sapiens 


PR02822 


1 


5^ 


10941 


G03556 


Homo sapiens 


Human secreted proteinj SEQ ID NO; 


1-7 


62 






-^^T^ r — ^ ="n 






1094/ 


■ 

^ — % 


— ^ 

omo sapiens 


s ivary pro ine-ncn protem 


— — 





10943 


G02532 


Homo sapiens 


Human secreted protemj SEQ ID NO: 


164 


61 








nll^oQOn 






10944 


■ 

AF116715 


— : 

Homo sapiens 


PR02829 


119 


69 


10945 


M10546 


Homo sapiens 


cytochrome oxidase I 


303 


95 


10946 


L27428 


Homo sapiens 


reverse transcriptase 


112 


51 


10947 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


118 


90 




.. ._ 




P14 A 1 u f h.XlCn^f^Q'X\ 
KrlAia \^ArU/oooJ^ 






10948 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


121 


83 








protein 






... 
10949 


AFl 1 671 5 


Homo sapiens 


PR02829 


1j4 


67 


10950 


S58722 


Homo sapiens 


X-liniied retinopathy protein {C- 


139 


83 








terminal, clone XEH.8c} 






10952 


AF130079 


Homo sapiens 


PR02852 


119 


79 


10953 


Z38128 


Mus musculus 


histoneHl 


126 


35 


10954 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


1j5 


5ji 








4718. 






10955 


L22029 


Glycine max 


hydroxyproline-rich glycoprotein 


126 


37 


10956 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


69 








4581. 






10957 


AFl 18082 


Homo sapiens 


PRO 1902 


90 


5.1 


10958 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


53 






6566. 






10959 


W48351 


Homo sapiens 


Human breast cancer related protein 


82 


51 








BCRB2. 






10960 


U79260 


Homo sapiens 


unknown 


97 


52 


10961 


U49973 


Homo sapiens 


ORF 1 ; MER3 7; putative transposase 




66 








similar to pogo element 






10962 


W48351 


Homo sapiens 


Human breast cancer related protein 


124 


48 






BCRB2. 






10963 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


66 


73 















10964 


AP0006 1 6 


Oryza sativa 


similar to RING-H2 finger protein 


129 


87 








KrlAia \nx\) loOoi) 






10965 


Z28029 


Saccharomyces 
cerevisiae ^ 


ORF YKL030W 


99 


70 


— 


V00672 


Pan troglodytes 


T- — E ? 

readmg frame protein 5 




_107 


70 


10967 


\J\]0 I OO 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


138 


38 








^u^-^' ; - - 










— ; 

Homo sapiens 


chorionic somatomamniotropm 


478 


77 


10969 


- 

D38112 


Homo sapiens 


cytoclirome c oxidase subunit 3 


642 


86 


10970 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


57 
















10971 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


56 














10972 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


51 








4409. 






10973 


K02401 


Homo sapiens 


chorionic somatomammotropin 


491 


82 


10974 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


146 


93 








SEQIDNO:163. 






10975 


D38H2 


Homo sapiens 


NADH dehydrogenase subunit 4 


236 


85 


10976 


Y02785 


Homo sapiens 


Human secreted protein encoded by 


108 


50 



427 
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SEQID 
NO: 




Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 








gene ji cione JriUJsJoAcSD. 






10977 


J03071 


Homo sq)iens 


chorionic somatomammotropin CS-2 


435 


73 




G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 




74 


10979 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

3 : 


137 


73 




G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 








Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 






10982 


X92485 


Plasmodium 


pval 


119 


47 


109S3 


Y13913 1 Homo sapiens 


COI.lAl and PDGFB fusion transcript 


64 


47 




D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 






10985 


U49973 


Homo sapiens 


OKI- 1 ; MER37; putative transposase 
similar to pogo element 


309 


65 


10986 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


145 


87 


10987 


AF130089 


Homo sapiens 


PRO2550 


88 


31 




Y86248 


Homo sapiens 


Htiman secreted protein HCHPF68, 
SEQIDNO:163. 


127 






G00637 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4718. 




75 


10990 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 




76 




G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 




72 


10992 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 , 


69 


10993 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


120 


92 


10994 


AJ277092 


Oryctolagus 
cuniculus 


translationally controlled tumor protein 

3 


172 


49 


10995 


AF052831 


Tiypanosoma 


unknown 

._ ... . 


92 


37 




Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 






10997 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 








W48351 


Homo sapiens 


Human breast cancer related protein 

BCRB2. 








S52010 


Mus sp. 


orfl 5* of EpoR 






11000 


AF130079 


Homo sapiens 


PR02852 


131 


81 


11001 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


111 


75 




Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 







11003 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


151 


60 


——2 — 


G02532 


Homo sapiens 1 Human secreted protein, SEQ ID NO: 


-— 

9 


— 

58 


11005 


K02401 


Homo sapiens | cliorionic somatomammotropin 


"379 


~89 


11006 


K02401 


Homo sapiens chorionic somatomammotropin 


464 


87 


11007 


M15894 


Homo sapiens ' chorionic somatomammotropin 
precursor 


370 


74 


11008 


K02401 


Homo sapiens 


chorionic somatomammotropin 


467 


88 


11009 


K02401 


Homo sapiens 


chorionic somatomammotropin 


537 


91 


11010 


K02401 


Homo sapiens 


chorionic somatomammotropin 


475 


87 


11011 


K02401 


Homo sapiens 


chorionic somatomammotropin 


460 





428 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smitli- 


% 


NO: 








Waterman 


Identit 










Score 


y 


11012 


G03639 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO: 


101 


61 








V~^' ■■ . ■ 






11013 


AF266166 




Gillichthys 


ribosomal protein SI 3 

. _ . . .. 


2_3 


54 






mirabilis 






11014 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


61 















11015 


AF116715 


Homo sapiens 


PR02829 


121 


69 


11016 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


92 


75 








terminal, clone XEH.Sc} 






11017 


R96800 


. .. 
Homo sapiens 


Human histiocyte-secreted factor HSF. 


113 


78 


11018 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


40 








7871. 






11019 


S58722 


Homo sapiens 


X-linlced retinopatliy protein {C- 


101 


48 








terminal, clone XEH.Sc} 






11020 


G0313j 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 










7214. 






11021 


X14963 


Homo sapiens 


collagen-like protein (447 AA) 


107 


47 


11022 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


72 








7795. 






11023 


AFl 16715 


Homo sapiens 


PR02829 


140 


- 

75 


1 1024 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


125 


70 








SEQ ID NO: 163. 






11025 


APOOOOOl 


Pyrococcus 


58aa long hypothetical protein 


99 


52 






horikoshii 








„. . . 

...^^^■^^ 


D38I12 _ 


Homo sapiens 


- 

cytochrome c oxidase subunit 3 




601 


. .. , 


1 1027 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


434 


74 


11028 


Z97333 


Homo sapiens 


RHCE protein 


149 


51 


11029 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


48 








4l~ J = op^TTtXI/^^ 






1030 




Homo sapiens 


Human secreted protem, SEQ ID NO: 




77 






— 






.. . 


, ., ,,,, 
/ 


Homo sapiens 


cofactor C 








11032 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


116 


88 


1 1033 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 


584 


9j 








1 (NDF-1). 






11034 


L10910 


- 

Homo sapiens 


splicing factor 


108 


58 


11035 


M26361 


Mus musculiis 


L»INE/Ig H-chain fusion protein 


140 


48 


11036 


S58722 


Homo sapiens 


X-linked retinopafty protein {C- 


101 


79 








termmal, clone XEH.Sc} 






,. . 


AK024455 


Homo sapiens 


rL/Juuu4/ protem 


.. . 


48 


11038 


AFl 26 163 


Homo sapiens 


HHLA3 protein 


— 

. 


68 


11039 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


144 


79 








SEQ ID NO* 1 63 






— 


AK024455 


Homo sapiens 


FLJ00047 protein 




4^1 


62 


11041 




Homo sapiens 


rii^osojtnal protein S20 







11042 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


12j 


84 








RHAla (AF078683) 








11043 




Y01400 


: 

Homo sapiens 


Secreted protein encoded by gene 18 


125 


68 








clone HNHF029 






— — — 

"T^ — 


V00662 


omo sapiens 


URF 2 (NADH dehydrogenase subunit) 


— — 


— 


1045 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


62 














11046 


D38112 


Homo sapiens 


ATPase subunit 6 


297 


90 


11047 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


416 


73 


11048 


G00491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


73 


80 








4572. 






11049 


AF090944 


Homo sapiens 


PRO0663 


96 


70 


11050 


AF130089 


Homo sapiens 


PRO2550 


152 


71 


11051 


Y02671 


Homo sapiens 


Human secreted protein emcoded by 


108 


81 



429 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


specie. 


Description 


Smith- 
Waterman 
Score 


Identit 

y 








gene 22 clone HMSJW18. 






11052 


W89031 


Homo sapiens 


Polypeptide fragment encoded by gene 
167. 


254 


95 


11053 


AF 109907 


Homo sapiens 


S164 


136 


45 


11054 


AF2 17536 


Homo sapiens 


truncated mevalonate kinase 


117 


58 


11055 


X72004 


Halichoenis 
gi7pus 


ATP synthase sutranit 6 


160 


78 


11056 


V00662 


Homo sapiens 


ATPase 6 


300 


71 


11057 


AF228021 


Bos taurus 


cyclophilin I 


209 


73 


11058 


L27428 


Homo sapiens 


reverse transcriptase 


303 


65 


11059 1 U09116 


Homo sapiens 


ORFl, encodes a 40 kDa product 


464 


80 


11060 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


145 


68 


11061 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


301 


85 


11062 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


119 


64 


11063 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


128 


76 


11064 


U50188 


Cavia porcellus 


CGRP-receptor component protein 


152 


88 


11065 


G03107 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7188. 


104 


44 


11066 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478^ 


112 

Tl3 


64 


11067 


AP000616 


Oryza sativa 


similar lo R1NG-H2 finger protein 
RHAla(AF0:'8683) 


71 


11068 


AE003499 


Drosophila 
melanogaster 


CG12706 gens product 


145 


43 


11069 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


160 


65 


11070 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


416 


67 


11071 ! D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


220 


86 


11072 G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


144 


63 


11073 j G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7492, 


115 


60 


11074 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


117 


65 


11075 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


589 


87 


11076 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


588 


83 


11077 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


106 


86 


11078 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


90 


50 


11079 


AF 132200 


Homo sapiens 


PRO 1751 


97 


89 


11080 


AFl 16661 


Homo sapiens 


PRO 143 8 


109 


51 


11081 


AK024455 


Homo sapiens 


FLJ00047 protein 


93 


48 


11082 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7867. 


84 


73 


11083 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


108 


80 


11084 


AF068294 


Homo sapiens 


HDCMB45P 


229 


59 


11085 


Ml 0546 


Homo sapiens 


cytochrome oxidase 1 


137 


73 


11086 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


105 


49 


11087 


L26953 


Homo sapiens 


chromosomal protein 


122 


75 


11088 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


103 


46 


11089 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
termmal, clone XEH.8c} 


112 


56 


11090 


AF130089 


Homo sapiens 


PRO2550 


172 


60 


11091 


K02576 


Homo sapiens 


salivary proline-rich protein I 


107 


35 



430 
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SEQ ID 
NO: 


Accession No. 




Descriptton 


Smith- 
Waterman 


y 


11092 


AF090930 


Homo sapiens 


PRO0478 ■ 


146 


75 


11093 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


163 


82 


11094 


V00662 


Homo sapiens 


ATPase 6 


130 


87 


11095 


B01372 


Homo sapiens 


Neuron-associated protein. 


122 


82 


11096 


D38]12 


Homo sapiens 


ATPase subunit 6 


152 


62 


11097 


U18339 


Variola virus 


D4L 


96 


60 


11098 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


113 


60 


11099 


AF130089 


Homo sapiens 


PRO2550 


128 


75 


11100 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


218 


62 


11101 


U49973 


Homo sapiens 


ORP 1 ; MER37; .putative transposase 
similar to pogo element 


232 


79 


1 1 102 


AF090930 


Homo sapiens 


PRO0478 


154 


73 


11103 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


133 


65 


11104 


K03036 


Mus musculus 


alpha- 1 type 1 procollagen 


105 


43 


11105 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


142 


59 


11106 


L13129 ^ 


Mus musculus 


synexin 


119 


35 


11107 


AF280111 


Homo sapiens 


cytochrome P450 subfamily IIIA 
polypeptide 43 


122 


52 


11108 


JOGS 14 


Homo sapiens 


beta-tubulin 


606 


94 


11109 


X05196 


Homo sapiens 


aldolase C 


412 


89 


11110 


D00570 


Mus musculus 


open reading frame (196 AA) 


127 


85 


11111 


D50532 


Homo sapiens 


macrophage lectin 2 


165 


65 


11112 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


125 


74 


11113 


M15530 


Homo sapiens 


B-cell growth factor 


163 


82 


11114 


V00662 


Homo sapiens 


ATPase 6 


217 


91 


11115 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


165 


100 


11116 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


119 


36 


11117 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


132 


59 


11118 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497, 


103 


67 


11119 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


94 


42 


11120 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


127 


75 


11121 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


104 


64 


11122 


G0341] 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


90 


59 


11123 


AF161356 


Homo sapiens 


HSPC093 


119 


64 


11124 
11125 


J02963 
G00637 


Homo sapiens 


platelet glycoprotein lib precursor 


102 


70 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


149 


60 


11126 


AF068294 


Homo sapiens 


HDCMB45P 


272 


59 


11127 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


120 


79 


11128 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


101 


78 


11129 


AF130089 


Homo sapiens 


PRO2550 


124 


80 


11130 


AL3 59782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhal a. 


109 


51 


11131 


U40265 


Trypanosoma 
cruzi 


ATPase subunit 6 


102 


31 



431 
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SEQ ID 


Accession No. 


Species 


Description 


Smitli- 




NO: 

.. 


- 




- . 


Waterman 
Score 


Identit 

y 


111:>2 


X01677 


Homo sapiens 


glycera!dehyde-3-phospnate 


555 


88 








dehydrogena^ — ^ 








K0_576 


Homo sapiens 


salivary proune-ncn protein 1 


108 


37 


11134 


M81757 


Homo sapiens 


S 1 9 ribosomal protein 


232 


76 


11135 


U93565 


Homo sapiens 


putative pi 50 


17j 




11136 


AF061944 


Homo sapiens 


kinase deficient protein KDP 


369 


100 


11137 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


61 








4478. 






11138 


AB030255 


Homo sapiens 


serine/threonine phosphatase 1 gamma 


404 


83 


11139 


AL049610 


Homo sapiens 


dJ1055C14.2 (KIAA0026 


248 


77 








(transcription factor-like protein 










■ 


MRGX)) 






11140 


Y36421 


Homo sapiens 


Fragment of human secreted protein 


81 


53 




' 




encoded by gene 8. 






— 
11141 


1 


Homo sapiens 


alpha- 1 acid glycoprotein precursor _ 


— 

12 


41 


11142 


M16961 


Homo sapiens 


aIpha-2-HS-gIycoprotein 


543 


89 


11143 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


605 


88 


11144 


AF090944 


Homo sapiens 


PRO0663 


137 


67 


11145 


X92485 


Plasmodium 


pval 


102 


57 






vivax 


. 






11146 


AF130089 


Homo sapiens 


PRO2550 


113 


58 


11 147 


U63542 


Homo sapiens 


FAP protein 


154 


57 


11148 


W03988 


Homo sapiens 


SH2 domain from human SH-PTP2 


162 


73 









(amino acids 1-106). 






11149 


AF116695 


Homo sapiens 


PR0222 1 


173 


44 


11150 


M15530 


Homo sapiens 


B-cell growth factor 


105 


45 


11151 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


64 






8153. 






11152 


M15530 


Homo sapiens 


B-cell growth factor 


124 


75 


11153 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


67 








7437. 






11154 


AF118082 


Homo sapiens 


PRO 1902 


79 


52 


11155 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


72 








4414. 






11156 


Y27893 


Homo sapiens 


Human secreted protein encoded by 


266 


96 








gene No. 116. 






11157 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


148 


84 








gene No. 107. 






11158 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


154 


72 








SEQ ID NO: 1 63 . 






1 1 159 


G003 65 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


58 








4446. 






11160 


X60376 


Brassica napus 


prolme-nch protein 


111 


39 


11161 


S79410 


Mus musculus 


nuclear localization signal binding 


111 


37 








protein 






11162 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


53 








661 J. 






11163 


AF090930 


Homo sapiens 


PRO0478 


^46 


73 


11164 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


60 








7879. 






11165 


AF242772 


Homo sapiens 


mesenchjmal stem cell protein 


141 


51 








DSCD28 






11166 J 


X82629 


Homo sapiens 


Mox-2 


307 


83 


11167 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


168 


91 


11168 


X55684 


Lycopersicon 


extensin (class I) 


71 


66 






esculentum 








11169 


AC079041 


Arabidopsis 


hypothetical protein 


98 


45 






thaliana 








11170 


AF090930 


Homo sapiens 


PRO0478 


156 


50 



432 



w o 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


JDescription 


Watcrmfln 


% 

Identit 

y 


11171 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


99 


70 


11172 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96 


124 


54 


11173 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


~587 


— 


11174 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

. 


127 


66 


11175 Y02671 


: 

Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMS JW 1 8 


157 


68 


11176 1 AF090931 


Homosa iens 


PRO0483 


T46 


~S7 


11177 j M15530 J 


Homo sapiens 


B-cell growth factor 


153 


75 


11178 


AF090930 


Homo sapiens 


PRO0478 






11179 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
^^^•^ ^ 


98 


77 


11180 


AF 194537 


, 

Homo sapiens 




T65 


... 


11181 


AF068294 


Homo sapiens 


HDCMB45P 


146 


53 


11182 


AF090931 


Homo sapiens 


PRO0483 






11183 


W48351 


Homo sapiens 


Human breast cancer related protein 


141 


65 


11184 


Y13396 


Homo sapiens 


Amino acid sequence of protein 
PR0332. 


452 


92 


11185 


AF090930 


Homo sapiens 


PRO0478 






11186 


W48351 


Homo sapiens 


Human breast cancer related protein 


178 


79 


11187 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


237 


70 


11188 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


83 


11189 


AL080206 


Homo sapiens 


hypothetical protein 


103 


71 


11190 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


77 


11191 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


1 12 


65 


11192 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


54 


11193 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


150 


65 


11194 


M92046 


Trypanosoma 
cruzi 


sur&ce antigen 






11195 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 






11196 


G03224 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


98 


75 


11197 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
1^^^' — r — 7^—7 i — ? 


117 


50 




11198 


D88548 


"H ^ 

omo sapiens 


24-KDa subunit of complex I 





11199 


AF090852 


Antilocapra 
americana 


prion protem 


nT5 




11200 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 






11201 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


139 




11202 


U93563 




putative pi 50 






11203 


U93570 


Homo sapiens 


putative pi 50 


173 


45 


11204 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


166 


68 


11205 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


155 
87 


68 


11206 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


56 



433 



wo 01/64835 PCT/USOl/04927 



SEQ ID 
NO: 




Species 


Description 


Smith- 
Waterman 


Identit 

y 


11207 


W48808 


Homo sapiens 


Homo sapiens clone CG109_1 protein. 


123 


89 






Homo sapiens 


cytochrome c oxidase subunit 3 






11209 


W48808 


Homo sapiens 


Homo sapiens clone CG109_1 protein. 


125 


89 






Homo sapiens 


cjrtochrome c oxidase subunit 3 






11211 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


90 


76 


11212 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


547 


81 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 




74 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 






11215 


AF162149 




Mycoplasma 


variable sur&ce lipoprotein 

_ 


118 


34 




AF130089 


Homo sapiens 


PRO2550 






11217 


AF090931 


Homo sapiens 


PRO0483 


112 


77 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 








G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 








AF130051 


Homo sapiens 


PRO0898 


122 


63 


11221 


AF130079 


Homo sapiens 


PR02852 


171 


64 


11222 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


146 


55 


11223 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI8. 


130 


71 


11224 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


69 


11225 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


85 


11226 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


90 


11227 


M15530 


Homo sapiens 


B-cell growtii factor 


122 


55 


11228 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


121 


67 


11229 


M15317 


Plasmodium 
lophurae 


histidine-rich protein 


1 16 


43 


11230 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


131 


78 


11231 


W19932 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCS55. 


74 


76 


11232 


Y86248 

- 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63 . 


134 


67 




AK023045 


.. , _ 
Homo sapiens 


unnamed protein product 






11234 


AP000616 


Oryza sativa 


similar to RING- H2 finger protein 
RHAIa (AF07S683) 


127 


82 


11235 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


125 


65 


11236 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


109 


53 


11237 


AF029403 


Homo sapiens 


oxysterol 7alpha-hydroxylase 


144 


72 


11238 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


44 


11239 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 


96 


11240 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


127 


43 


11241 


L36529 


Homo sapiens 


protein p84 


343 


90 


11242 


M15530 


Homo sapiens 


B-cell growth factor 


122 


55 


11243 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


130 


65 



434 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Acc^sionNo. 


Species 


Description 


Smith- 


% 








clone HCACJ81. 




— 








libosomal protein S2 






11245 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63 . 


116 


41 


11246 


W75859 


Homo sapiens 


Human secretory protein of clone 


123 ■ 


37 


11247 

... 


G03800 
. .. . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
J^^^- ... ..... 


132 


73 






Homo sapiens 


Human secreted protem, SEQ ID NO: 
7870. 








B03628 


Homo sapiens 


Human phospliolipase 2 HPPL2. 






11250 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


72 


11251 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


115 


46 


11252 


AK021455 


Homo sapiens 


unnamed protein product 


146 


81 


11253 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


88. 


56 




S80916 


Homo sapiens 


parotid "o" protein, Po=saIivary 
proline-rich protein {exon 3} 




43 




G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4535. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 

6463. 








AFl 94537 


Homo sapiens 








11258 


M15530 


Homo sapiens 


B-cell growth factor 


130 


70 




G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437, 


131 


72 


11260 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


59 


11261 


G02922 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

- . .. . 


316 


92 


11262 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


81 


11263 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


71 


11264 


AFl 30089 


Homo sapiens 


PRO2550 


111 


66 




G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 






11266 


'^^^^•^^ — 


Homo sapiens 


B-cell growth &ctor 


158 


76 




AFl 30079 


Homo sapiens 


PR02852 


i_124 


68 




Y14480 




encoded by gene 14. 










omo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 






11270 


G0j787 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


63 


11271 — 


AF 11 8082 


Homo sapiens 


PRO 1 902 


~ 




11272 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


305 


74 


11273 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alptia peptide 


108 


53 


11274 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7874. 


235 


81 


11275 


U93570 


Homo sapiens 


putative pi 50 


182 


54 


11276 


X03145 


Homo sapiens 


pot ORF V 


144 


60 


11277 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


103 


57 


11278 


AL035681 


Homo sapiens 


dJ756G23.1 (novel Leucine Rich 


269 


100 



435 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 








Protein) 







11279 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


61 


11280 


G03356 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO: 
74j7. 


115 


70 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 




71 






Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


162 


81 




AFl 16715 


Homo sapiens 


PR02829 






11284 


AL3597S2 


Trypanosoma 


probable similar to ring-h2 finger 
protein rha la. 


111 


76 


11285 


U52077 


Homo sapiens 


mariner transposase 


198 


78 


11286 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


120 


62 


11287 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


103 


45 


11288 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


63 


60 




AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


123 


72 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


120 


70 




G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 






11292 


AFl 16715 


Homo sapiens 


PR02829 


130 


63 


11293 


AF090930 


Homo sapiens 


PRO0478 


134 


60 


11294 


AP0006I6 


Oryza sativa 


similar to R1NG-H2 finger protein 
RHAla (AF078683) 


119 


65 


11295 


Y86248 


Homo sapiens 


Human .secreted protein HCHPF68, 
SEQ ID NO: 163. 


102 


50 


11296 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 



141 


49 


. ... 
11297 


. .. . 
G00397 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


154 


72 


11298 


B08918 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:75. 


126 


54 


11299 


AK025270 


Homo sapiens 


unnamed protein product 


150 


54 




ATTl 1 

At 1 iSU/o 


Homo sapiens 


PRO 1848 






11301 


AF090942 


Homo sapiens 


PRO0657 


81 


43 




AFl 16715 


Homo sapiens 


PR02829 






11303 


AF090931 


Homo sapiens 


PRO0483 


121 


71 




W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 




82 




G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6913. 


122 


59 


11306 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


89 


11307 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


65 


11308 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


51 


11309 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4435. 


67 


55 


11310 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


105 


71 


11311 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


104 


73 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


88 


73 



436 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: 


Accession No. 






Smith- 
Waterman 


% 

Identit 

y 






Trypanosoma 


ATPase subunit 6 




31 






Homo sapiens 


Human secreted protein encoded by 
gene No, 107. 




53 


11j15 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhal a. 


113 


54 


11316 
. . .. 


G02493 

. . . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


90 


11317 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


74 


11318 
— — — — 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


144 


55 




Ar lol JjO 


— : 

Homo SEpiens 




— — 

12 


- 

56 




S58722 


Homo sapiens 


X-linlced retinopathy protein \C~ 


102 


53 


11321 


AFll 666 1 


Homo sapiens 


PR01438 


131 


74 


11322 


AK000496 


Homo sapiens 


uiuiEnied protein product 


137 


59 


11323 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


74 


57 


11324 


U63542 


Homo sapiens 


FAP protein 


151 


79 




M15530 


Homo sapiens 


B-cell growth factor 






11326 


AF116715 


Homo sapiens 


PR02829 


164 


78 




G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 








AF090895 


Homo sapiens 


PROOl 17 






11329 


AF090942 


Homo sapiens 


PRO0657 


132 


62 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


190 


72 






Cloning vector 
pSportl 


beta-galactosidase alpha peptide 






11332 


G00689 


Homo sapiens 


Hiunan secreted protein, SEQ ID NO: 


119 


38 


11333 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63 . 


131 


52 


13 


1 

G03807 


= 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


.-- - - 








Plasmodium 


pval 






^^^^^ — 


^^^^^^ 


Hortio sapiens 


B-cell growth factor 






11337 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAIa (AF078683) 


126 


100 


11338 


AF 11 8086 


Homo sapiens 


PRO 1992 


123 


80 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 






11340 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 




TI341 — 


AF130079 


Homo sapiens 


PR0285'^ 


T45 


"80 


11342 


M19419 


Mus musculus 


proline-rich salivary protein 


104 


44 






Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 






llj.44 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


120 


91 


11345 


U94832 


Homo sapiens 


KSRP 


122 


51 


11346 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


139 


56 


11347 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


188 


87 


11348 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


173 


59 


11349 


W48351 


Homo sapiens 


Human breast cancer related protein 


111 


68 



437 



w o 01/64835 



PCT/llSOl/04927 



SEQIO 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Wen tit 

y 








BCRB2. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 










Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 








AF161356 


Homo sapiens 


HSPC093 


134 




1^^^^ 




Homo sapiens 


neuronal thread protein AD7c-NTP 


... 


47 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 

6619. 






11356 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


12.1 


59 


11357 


AF130089 


Homo sapiens 


PRO2550 


114 


62 




AK000496 


Homo sapiens 


unnamed protein product 






11359 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


173 


81 


11360 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


128 


64 




AF130056 


Homo sapiens 


PRO 1367 






11362 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7496. 


116 


63 






Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 




91 




G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 










Homo sapiens 


chloride channel (putative) 






11366 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


68 


11367 


AF06S294 


Homo sapiens 


HDCiVIB45P 


281 


57 


11368 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


71 


11369 

._ ,.. 


G03043 
- 


Homo sapiens 
... . 


Human secreted protein, SEQ ID NO; 


136 


89 




G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 










Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 










Variola virus 








11373 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81 


100 


52 


11374 


Ml 5530 


Homo sapiens 


B-cell growth factor 


135 


61 






Plasmodium 


pval 










Homo sapiens 


Human secreted protein, SEQ ID NO; 

7795. 








W4835 1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 






11378 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


53 


11379 


G02567 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


83 


54 


11380 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


110 


85 


11381 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


156 


64 


11382 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


121 


79 


11383 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


104 


41 



438 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 




G00354 


Homo sa iens 
omo sapiens 


Human secreted protein SBQ ID NO" — 
4435. 









AP000616 


Oryza sativa 


similai* to RING-H2 finger protein 
RHAla(AF078683) 








AF068294 


Homo sapiens 


HDCMB4 5P 






11387 


B08976 


Homo sapiens 


Human secreted protein sequence 

encoded by gene 28 SEQ ID NO!l33. 


136 


46 


11388 


AF090931 


Homo sapiens 


PRO0483 


123 


74 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


156 


75 






Homo sapiens 


Human breast cancer related protein 
BCRB2. 






11j91 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

. 


117 


54 




W54966 


Homo sapiens 


Synthetic human type HI collagen 

SYN-C3. 






11393 


S79410 


Mus iiiLiscuius 


nuclear localization signal binding 
protein 


111 


46 


11394 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


49 


11395 


AF090930 


Homo sapiens 


PRO0478 


148 


70 




AF2I0651 


Homo sapiens 


NAG 18 


100 


62 






Homo sapiens 


Human secreted protein, SBQ ID NO; 
7879. 








AF063243 


Bos taurus 


ribosomal protein L30 






11399 


AF162149 


Mycoplasma 


variable surface lipoprotein 


104 


37 


11400 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


175 


71 


11401 


D38112 


Homo sapiens 


NADH dehydrogenase subunit4 


341 


92 


11402 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


137 


90 


11403 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


115 


70 


11404 


M10126 


Leistimania 
tarentolae 


NH2 terminus uncertain 


101 


41 






Homo sapiens 


— : — — — — 

Human secreted protein, SEQ ID NO: 
6566. 










Homo sapiens 


RNA polymerase 11 subunit 






11407 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


68 


1408 


Y14482 


H 

omo sapiens 


Fraginent or huiTian secreted protein 
encoded by gene 17. 


.- - 




. 


J0296j 


Homo sapiens 


platelet glycoprotein lib precursor 


118 


77 


11410 




Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


169 


77 


11411 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


169 


82 


11412 


AK025116 


Homo sapiens 


unnamed protein product 


133 


44 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 








AK0244'^5 


Homo sapiens 


FLJ00047 protein 






11415 


AK024455 


Homo sapiens | FLJ00047 protein 


87 


63 


11416 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
temiinal, clone XEH.Sc} 


94 


57 


11417 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


123 


58 


11418 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


69 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitii- 
Wateiman 


Identit 

y 


11419 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 




53 


11420 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 




68 


11421 


002363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


51 


11422 


GO 1249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


70 


11423 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


62 


11424 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


136 


72 


11425 


AK002154 


Homo sapiens 


unnamed protein product 


292 


75 


11426 


Y02887 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


98 


75 


11427 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


143 


64 


11428 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


131 


81 


11429 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


130 


81 


11430 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


172 


61 


1 1431 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


112 


74 


11432 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


52 


11433 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


128 


58 


11434 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


63 


11435 


Y02671 




Homo sapiens 
— 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


139 


83 


TT436 — 


X63220 


Bos taurus 


NADH dehydrogenase 


.- 




11437 


AF090895 


Homo sapiens 


PROOl 17 


122 


59 


11438 


J04655 


,Ascaris suum 


collagen 


90 


46 


11439 


AF090942 


Homo sapiens 


PRO0657 


166 


52 


11440 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


55 




GOO 63 7 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 








Ml jj jU 


Homo sapiens 


B-cell growth factor 






11443 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


64 


11444 


Y19730 


Homo sapiens 


SEQ ID NO 448 from W09922243. 


115 


67 




AF090895 


Homo sapiens 


PROOl 17 






11446 


AF 118086 


Homo sapiens 


PRO 19 92 


109 


63 






Homo sapiens 


Secreted protein encoded by gene 18 
clone HC AC J81. 


160 


80 


11448 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


97 


75 


11450 


Y02671 


Homo sapiens 


Human secreted protein oicoded by 
gene 22 clone HMSJWl 8. 


136 


72 


11451 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


89 


55 


11452 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


144 


71 


11453 


AF090931 


Homo sapiens 


PRO0483 


140 


78 


11454 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


129 


65 



440 
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SEQID 
NO: 


Accession No. 


specie. 


Description 


Smith- 
Waterman 
Score 


Idcntit 

y 


11455 


G003S3 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


63 


11456 


AFl 18086 


Homo sapiens 


PRO 1992 


151 


71 


11457 


U73158 


Homo sapiens 


partial CDS, human putative tumor 
suppressor (U2 3 94 6) 


121 


95 


11458 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


96 


73 


11459 


AK000496 


Homo sapiens 


unnamed protein product 


233 


63 


11460 


AK024455 


Homo sapiens 


FLJ00047 protein 


90 


73 


11461 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


132 


64 


1 1462 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


86 


11463 


K03207 


Homo sapiens 


salivary proiine-rich protein precursor 


135 


36 


11464 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


127 


88 


11465 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6595. 


104 


77 


11466 


AFl 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


141 


48 


11467 


AF090942 


Homo sapiens 


PRO0657 


89 


58 


11468 


AC005698 


Arabidopsis 
thaliana 


T3P18.6 


128 


47 


11469 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


101 


48 


11470 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


136 


81 


11471 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4669. 


91 


45 


11472 


G02532 ^ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


102 


77 


11473 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7868. 


145 


55 


11474 


AF210651 


Homo sapiens 


NAG 18 


1 17 


57 


11475 


Y 14482 


Homo sapiens . 


Fragment of human secreted protein 
encoded by gene 17. 


117 


62 


11476 


AFl 16715 


Homo sapiens 


PR02829 


124 


77 


11477 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


136 


45 


11478 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6909. 


85 


78 


11479 


X91940 


Homo sapiens 


WNT-8B protein 


204 


97 


11480 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


131 


71 


11481 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


117 


63 


11482 


X92485 


Plasmodium 


pval 


98 


52 


11483 


AF217518 


Homo sapiens 


uncharacterized bone marrow protein 

BM042 


895 


100 


11484 


U79260 


Homo sapiens 


unknown 


104 


80 


11485 


L27428 


Homo sapiens 


reverse transcriptase 


169 


52 




1V115530 


Homo sapiens 


B-cell grovrth fector 






11487 


U49974 


Homo sapiens 


mariner transposase 


156 


62 


11488 


AF090942 


Homo sapiens 


PRO0657 


110 


44 


11489 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


156 


65 


11490 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


115 


52 


11491 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


51 
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SEQID 
NO: 


Accession No. 


species 


Description 


Smith- 

Wateritian 

Score 


Identit 









7214. 






11492 


AF090930 


Homo sapiens 


PRO0478 


145 


58 


11493 


X63726 


Phoca vitulina 


cytochrome c oxidase subunit I 


556 


87 


11494 


AL021897 


Mycobacterium 
tuberculosis 


pra 


119 


56 


11495 


X98485 


Plasmodium 
vivax 


putative 


92 


35 


11496 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


156 


73 


11497 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


129 


72 


11498 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


115 


66 


11499 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


58 


11500 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


113 


36 


11501 


AE003568 


Drosophila 
melanogaster 


CG12566 gene product 


105 


32 


11502 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


141 


78 


11503 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


102 


51 


11504 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7868. 


138 


60 


11505 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


109 


85 


11506 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


101 


59 


11507 


M81321 


Macaca 
fascicularis 


proline-rich protein 


128 


48 


11508 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


110 


78 


11509 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7881. 


98 


60 


11510 


X07882 


Homo sapiens 


Po protein 


118 


41 


11511 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


133 


79 


11512 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


68 


11513 


AF000381 


Homo sapiens 


non-functional folate binding protein 


203 


93 


11514 

11515 


X92485 

S58722 


Plasmodium 

Homo sapiens 


pval 


115 


44 


X-liaked retinopathy protein {C- 
terminal, clone XEH.Sc} 


109 


51 


11516 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


141 


78 


11517 


AF090894 


Homo sapiens 


PRO0113 


108 


58 


11518 


M15530 


Homo sapiens 


B-cell growth factor 


93 


72 


11519 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7165. 


66 


73 




AFl 17065 


Homo sapiens 


male-specific lethal-3 homolog 1 


200 


61 


11521 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


335 


87 


11522 


M15530 


Homo sapiens 


B-cell growth factor 


104 


74 


11523 


AFl 16715 


Homo sapiens 


PR02829 


122 


64 


11524 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


145 


68 


11525 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


103 


61 
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SEQID 
NO: 


Accession No. 


species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 








gene 22 clone HMSJW18. 






11526 


X92485 


Plasmodium 


pval 


111 


42 


11527 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


101 


80 


11528 




Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


134 


58 


1 1529 


AF090931 


Homo sapiens 


PRO0483 


163 


76 


11530 


AFl 19900 


Homo sapiens 


PR02822 


139 


44 


11531 


G03714 


Homo sapiens 


Human secreted protein, SKQ ID NO: 

7795. 


126 


73 


11532 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


126 


50 


11533 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


107 


46 


11534 


AF090930 


Homo sapiens 


PRO0478 


120 


81 


11535 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


105 


64 


11536 


X92485 


Plasmodium 


pval 


123 


56 


11537 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


122 


72 


11538 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


108 


64 


11539 


AE003834 


Drosophila 
melanogaster 


CG8054 gene product 


456 


84 


11540 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4438. 


135 


53 


11541 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


99 


70 


11542 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


77 


63 


11543 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


123 


73 


11544 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


78 


58 


11545 


U79260 


Homo sapiens 


unknown 


128 


43 


11546 


Y95028 


Homo sapiens 


Human clone vp7_l 0RF2, SEQ ID 
NO: 128. 


122 


56 


11547 


X92485 


Plasmodium 


pval 


116 


46 


11548 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


111 


55 


11549 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


95 


75 


11550 


U67056 


Acantliamoeba 
castellanii 


myosm I heavy cham kinase 


111 


44 


11551 


Y14480 


Homo sapiens 


Fr^ment of human secreted protein 
encoded by gene 14. 


159 


60 


11552 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


50 


11553 


Z34975 


Homo sapiens 


IdlCp 


470 


86 


11554 


AF210651 


Homo sapiens 


NAG18 


101 


65 


11555 


X78677 


Homo sapiens 


ketohexokinase 


241 


92 


11556 
TT557~ 


G04039 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8120. 


254 


72 


AB032976 


Homo sapiens 


KIAAl 150 protein 


565 


75 


11558 


U93572 


Homo sapiens 


p40 


127 


37 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


11559 


U27488 


Pseudorabies 
virus 


glycoprotein gX 


88 


38 


11560 


R80095 


Homo sapiens 


Human superoxide-dismutase-4 
polypeptide. 


1 13 


63 


11561 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


83 


65 




AFl 30089 


Homo sapiens 


PRO2550 


111 


33 


11563 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


146 


60 


11564 


AF090895 


Homo sapiens 


PRO0117 


172 


77 






Homo sapiens 








11567 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


72 


33 


11568 


XI 6074 


Mus sp. 


L-34 protein (AA 1-264) 


117 


35 


11569 


Y45381 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


131 


77 


11570 


AK025395 


Homo sapiens 


unnamed protein product 


683 


98 


11571 


D78255 


IVIlis muscLilus 


PAP-1 


134 


80 


11572 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


110 


70 


11573 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 cloneHMSJWlS. 


131 


78 


11574 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


141 


59 


11575 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


58 


11576 


AC007842 


Homo sapiens 


BC331191 1 


409 


86 


11577 


Z22636 


Sinorhizobium 
meliloti 


Thi 


103 


36 


11578 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


83 


36 


11579 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7784. 


115 


60 


11580 


AF162149 


IVIycoplasma 
bovis 


variable surface lipoprotein 


167 


44 


11581 


AFl 3 0079 


Homo sapiens 


PR02852 


117 


77 


11582 


AF229067 


Homo sapiens 


PADI-H protein 


136 


75 


11583 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


132 


72 


11584 


Z70292 


Homo sapiens 


chemokine CC-1 


386 


93 


11585 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


107 


67 


1 1 >8(> 




Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


134 


75 


11587 


X05006 


Homo sapiens 


S-protein 


237 


100 


11588 
— 


Y21850 


Homo sapiens 


Human signal peptide-contianing 
protein (SIGP) (clone ID 1880830) 


365 


77 




o 




— f : 

Homo sapiens 


beta 4 adducin 








AF0301o2 


Homo sapiens 


inner mitochondrial membrane 
translocase Tim23 


-~ 










56-KDa protein (aa 1-478) 






11592 


AF151074 


Homo sapiens 


HSPC240 


151 


73 


11593 


AFl 16719 


Homo sapiens 


PR02987 


558 


89 


11594 


AF090942 


Homo sapiens 


PRO0657 


120 


41 


11595 


X55039 


Homo sapiens 


centromere autoantigen B (CENP-B) 


160 


32 


11596 


X98475 


Mus musculus 


vasodilator-Stimulated phosphoprotein 


101 


43 


11597 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


140 


73 
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SEQ lU 

NO; 




Species 


Desenption 


Smith- 
Waterman 
Score 


% 

Identit 

y 


11598 


U07151 


Homo sapiens 


ARL3 


113 


35 


11599 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


123 


79 


11600 


AL050084 


Homo sapiens 


hypothetical protein 


119 


100 


11601 


AF090931 


Homo sapiens 


PRO0483 


124 


73 


11602 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


132 


60 


11603 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


156 


82 


11604 


AF224494 


Mus musculus 


arsenite inducible RNA associated 
protein 


473 


79 


11605 


M60047 


Homo sapiens 


heparin binding protein 


597 


95 


11606 


G00933 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5014. 


721 


97 


11607 


W12091 


Homo sapiens 


C57S derivative of antigen non-specific 
human GIF. 


573 


98 


11608 


AF229439 


Mus musculus 


zmc finger protein 289 


535 


80 


11609 


M86637 


Brugia pahangi 


unknown 


104 


38 


11610 


G03789 


Homo sapiens • 


Human secreted protein, SEQ ID NO: 
7870. 


93 


70 


11611 


G02224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6305. 


508 


94 


11612 


002532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 




5l 


11613 


AK024372 


Homo sapiens 


unnamed protein product 


105 


77 


11614 


Y94918 


Homo sapiens 


Human secreted protein clone 

dd5 04 18 protein sequence SEQ ID 

NO:42. 


310 


62 


11615 


Y48547 


Homo sapiens 


Human breast tumour-associated 
protein 8. 


641 


96 


11616 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


108 


81 


11617 


L34081 


Homo sapiens 


bile acid CoA: Ammo acid N- 
acyltransfcrase 


356 


82 


11618 


G01399 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5480. 


570 


98 


11619 


AF020038 


Homo sapiens 


NADP-dependent isocitrate 
dehydrogenase 


272 


83 


11620 


AB051901 


Homo sapiens 


VDUPl 


247 


98 


11621 


Y48547 


Homo sapiens 


Human breast tumour-associated 
protein 8. 


691 


92 


11622 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method- 
conceptual translation supplied by 
author 


382 


71 


11623 


A06977 


Homo sapiens 


albumin 


562 


92 


11624 


X73460 


Homo sapiens 


ribosomal protein L3 


502 


84 


11625 


AL035398 


Homo sapiens 


dJ 796117.1 (PUTATIVE novel protein) 


373 


98 


11626 


D50310 


Homo sapiens 


cyclin I 


175 


100 


11627 


AE000950 


Archaeoglobus 
fiilgidus 


thermosome, subunit alpha (thsA) 


131 


25 






Mus musculus 


tetracycline transporter-like protein 


137 


81 


11629 


Y36204 


Homo sapiens 


Human secreted protein #76. 


406 


100 


11630 


Af'130089 


Homo sapiens 


PRO2550 


110 


63 


11631 


J02982 


Homo sapiens 


glycophorin B precursor 


95 


100 


11632 


AFl 16719 


Homo sapiens 


PR02987 


558 


93 


11633 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


137 


92 


11634 


U09823 


Oiyctolagus 


elongation factor 1 alpha 


483 


85 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Waterman 
Score 


Identit 

y 






cuniculus 








11635 


G01922 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6003. 


157 


73 


11636 


AFl 16719 


Homo sapiens 


PR02987 


491 


90 


11637 


D14421 


Rattus 
norvegicus 


b isotype of B regulatory subunit of 
protein phosphatase 2A 


208 


86 


11638 


AFl 16719 


Homo sapiens 


PR02987 


544 


93 


11639 


AF130079 


Homo sapiens 


PR02852 


127 




11640 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


150 


59 


11641 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4442. 


100 


66 


11642 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


119 


71 


11643 


Y86574 


Homo sapiens 


Human gene 9 1 -encoded protein 
fragment, SEQ ID NO:491. 


64 


100 


11644 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4658. 


82 


70 


11645 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6441. 


109 


74 


11646 


AFl 3 0089 


Homo sapiens 


PRO2550 


128 


52 


11647 


AK024455 


Homo sapiens 


FLJ00047 protein 


138 


61 


11648 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


73 


84 


11649 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4449. 


128 


60 


11650 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


83 


78 


11651 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 


64 


11652 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RI4Ala(AF078683) 


103 


62 


11653 


AFl 30089 


Homo sapiens 


PRO2550 


116 


69 


11654 


Y 14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


176 

129 


54 

"63' 


11655 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


11656 


AF130089 


Homo sapiens 


PRO2550 


135 


77 


11657 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


146 


93 


11658 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


123 


61 


11659 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4718. 


117 


47 


11660 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


92 


84 


11661 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4658. 


70 


75 


11662 


U00029 


Saccharomyces 


Yhr2t7cp 


101 


50 


11663 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


107 


88 


11664 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


110 


90 


11665 


U63542 


Homo sapiens 


FAP protein 


119 


69 


11666 


AF130114 


Homo sapiens 


PR02459 


78 


38 


11667 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


106 


64 


11668 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


66 
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Mentit 

y 








4478. 






11669 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7868. 


77 


63 


11670 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


116 


68 


11671 


AK021618 


Homo sapiens 


unnamed protein product 


91 


43 


11672 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


108 


55 


11673 


S75997 


Rattus sp. 


nucleoporin p62 homolog 


95 


40 


11674 


AF229163 


Homo sapiens 


natural resistance-associated 
macrophage protein 1 


104 


54 


11675 


M15530 


Homo sapiens 


B-celi growth factor 


89 


81 


11676 


iVr92357 


Homo sapiens 


B94 protein 


143 


100 


11677 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


132 


74 


11678 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


126 


64 


11679 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


66 


11680 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


108 


85 


11681 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


92 


11682 


M15530 


Homo sapiens 


B-cell growth factor 


127 


64 


11683 


AF090931 


Homo sapiens 


PRO0483 


152 


60 


11684 


AF130087 


Homo sapiens 


PR02411 


150 


68 


11685 


U18339 


Variola virus 


D4L 


94 


70 


11686 


S58722 


Homo sapiens 


X-linlced retinopathy protein {C- 

terminal, clone XEH.Sc} 


104 


80 


11687 


G00454 


Homo sapiens 


Human secreted protein, SRQ ID NO: 
4535. 


134 


68 


11688 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


131 


70 


11689 


AF090944 


Homo sapiens 


PRO0663 ' 


129 


73 


11690 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7871. 


111 


67 


11691 


U79260 


Homo sapiens 


tmknown 


98 


44 


11692 


W78226 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1. 


361 


64 


11693 


G00962 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
5043. 


269 

l32~ 


100 


11694 


AL3 59782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


83 


11695 


Y94526 


Homo sapiens 


Human lysine-rich statherin protein. 


136 


82 


11696 


AFl 16637 


Homo sapiens 


PR01489 


232 


95 


11697 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5066. 


108 


50 


11698 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


120 


81 


11699 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


150 


80 


11700 


AK024455 


Homo sapiens 


FLJ00047 protein 


148 


68 


11701 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4535. 


131 


72 


11702 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


105 


45 


11703 


AC005545 


Homo sapiens 


delta-adaptin, partial CDS 


127 


38 


11704 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
tenninal, clone XEH.8c} 


143 


81 
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■Waterman 


Identit 

y 


11705 . 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


67 


11706 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


153 


67 


11707 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


129 


88 


11708 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


89 


69 


11709 


AB047600 


Macaca 
fascicular is 


hypotlietical protein 


95 


63 


11710 


R96800 


Homo sapiens 


Human liistiocyte-secreted factor HSF. 


100 


76 


11711 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


112 


80 


11712 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7868. 


117 


58 


11713 


AF008196 


Homo sapiens 


bax epsilon 


140 


71 


11714 


R13556 


Homo sapiens 


Protein encoded downstream of hlicJVl 
oncoprotein. 


H3 


56 


11715 


G00397 


Homo sapiens 


Human secreted protein; SEQ ID NO: 
4478. 


125 


67 


11716 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


121 


74 


11717 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


125 


51 


11718 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


96 


81 


11719 


¥207 17 


Homo sapiens 


Human neurofilament-M wild type 
protein frj^ment 59. 


106 


85 


11720 


AJ243883 


Periplaneta 
americana 


putative transcription factor 


118 


57 


11721 


AF220264 


Homo sapiens 


MOST-1 


121 


74 


11722 


AF130051 


Homo sapiens 


PRO0898 


169 


72 


11723 


K02401 


Homo sapiens 


cliorionic somatomammotropin 


507 


95 


11724 


K02401 


Homo sapiens 


chorionic somatomammotropin 


461 


85 


11725 


U93564 


Homo sapiens 


p40 


340 


90 


11726 


AF130089 


Homo sapiens 


PRO2550 


107 


73 


11727 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4457. 


128 


79 


11728 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


109 


90 


11729 


Y51824 


Homo sapiens 


Human OSBH protein. 


182 


89 


11730 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


107 


80 


11731 • 


K02401 


Homo sapiens 


chorionic somatomammotropin 


459 


94 


11732 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


93 


42 


11733 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


111 


67 


11734 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


103 


43 


11735 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


98 


75 


11736 


AF090931 


Homo sapiens 


PRO0483 


123 


88 


11737 


K02401 


Homo sapiens 


chorionic somatomammotropin 


340 


92 


11738 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


73 


11739 


G02624 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6705. 


131 


57 


11740 


X55695 


Lycopersicon 
esculentum 


glycine-rich protein 


164 


48 


11741 


AK024455 


Homo sapiens 


FLJ00047 protein 


89 


77 
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Accession No. 


Species 


Description 






NO: 








Waterman 
Score 


Identit 

y 


11742 


Y91452 


Homo sapiens 


Human secreted protein sequence 


415 


90 








encoded by gene 2 SEQ ID NO: 125. 






1174^ 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


105 


94 








terminal, clone XEH.Sc} 






11744 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


1 16 


87 








terminal, clone XEH.Sc} 






11745 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


69 








4535. 






11746 


W13831 


Homo sapiens 


Human cyclin E delta 9 mutant. 


140 


52 


11747 


AB001431 


Mus musculus 


motor domain of KIF14 


179 


84 


11748 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


72 








7492. 






1 1749 


YD 1400 


Homo sapiens 


Secreted protein encoded by gene 18 


99 


63 








clone HNHF029. 






11750 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


64 








4535. 






11751 


AF09093 1 


Homo sapiens 


PRO0483 


118 


68 


11752 


AF150105 


Homo sapiens 


small 2dnc finger-like protein 


194 


68 


11753 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


69 








4508. 






11754 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


80 








6613. 






11755 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


65 








7870. 






11756 


K02401 


Homo sapiens 


chorionic somatomammotropin 


427 


87 


11757 


U52077 


Homo sapiens 


mariner transposase 


1 13 


82 


11758 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


54 








... 






11759 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


80 








6012. 






11760 


AF090944 


Homo sapiens 


PRO0663 


143 


76 


1 1761 


Dj8112 


Homo sapiens 


NADH dehydrogenase subunit 6 


529 


99 


1 ! 762 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


171 


58 














11763 


AF 132200 


Homo sapiens 


PRO 175 1 


127 


66 


11764 


AK025116 


Homo sapiens 


unnamed protein product 


141 


80 


11765 


G00354 


Homo sapiens 


Hinnan secreted protein, SEQ ID NO: 


75 


62 








4435. 






11766 


AF090930 


Homo sapiens 


PRO0478 


106 


59 


11767 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


80 














11768 


AF09093 1 


Homo sapiens 


PRO04S3 


164 


60 


11769 


S58722 


Homo sapiens 


X-linkcd retinopathy protein {C- 


102 


81 








terminal, clone XEH.Sc} 






.. 

11770 


... 

AF161356 


- -. - 
Homo sapiens 


^ . 


131 


68 


11771 


AF 13 0089 


Homo sapiens 


PRO2550 


160 


82 


11772 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


140 


79 








encoded by gene 1 7. 






11773 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


71 








4518. 






11774 


AF090942 


Homo sapiens 


PRO0657 


-i^l 


."^^ 




AJ223410 


Homo sapiens 


EBIl-ligand chemokine 






11776 


K02401 


Homo sapiens 


chorionic somatomammotropin 


395 


91 


11777 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


70 








6613. 






11778 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


62 








4518. 






11779 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


64 








7339. 
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SEQ ID 
NO: 


Accession No. 


species 


Description 


Waterman 


Identit 

y 


11780 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


68 


11781 


AFl 94537 


Homo sapiens 


NAG 13 


201 


65 


11782 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


139 


68 


11783 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


149 


59 


11784 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


112 


78 


11785 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


99 


68 


11786 


AF 13 0089 


Homo sapiens 


PRO2550 


108 


83 


11787 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


124 


88 


11788 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 

protein 


103 


100 


11789 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7075. 


146 


60 


11790 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163, 


165 


73 


11791 


AL035494 


Homo sapiens 


dJ635G19.2.3 (novel protein 
(PUTATIVE PARTIAL isoform 3)) 


106 


56 


11792 


Y3 63 66 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


66 


11793 


Y3 63 66 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


125 


61 


11794 


A20770 


synthetic 
construct 


(Aspl-Thrl61)-EPI 


95 


76 


11795 


S74221 


Homo sapiens 


IK factor=cytokine down-regulating 
HLA class II 


108 


61 


11796 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


96 


77 


11797 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenniiial, clone XEH.Sc) 


111 


65 


11798 


Y75995 


Homo sapiens 


Human skin cell protein, SEQ ID 
NO:173. 


324 


87 


11799 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-5 


120 


75 


11800 


L00693 


Homo sapiens 


carcinoembryonic antigen 


96 


85 


11801 


AL 160493 


Leislimania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


80 


44 


11802 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


125 


64 


11803 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


142 


82 


11804 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 


107 


85 


11805 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881, 


152 


80 


11806 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4518. 


122 


61 


11807 


U91985 


Homo sapiens 


DNA fragmentation fector-45 


585 


92 


11808 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


60 


11809 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


170 


89 


11810 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


191 


76 


11811 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


100 
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NO; 








Waterman 


jaentit 


11812 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


156 


55 


11813 


P92219 


Homo sapiens 
(human) 


CRl protein. 


121 


84 


.11814 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


107 


75 


11815 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


74 


78 


11816 


Ml 5530 


Homo sapiens 


B-cell growth factor 


148 


68 


11817 


AF090942" 


Homo sapiens 


PRO0657 


139 


60 


11818 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


117 


100 


11819 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


73 


11820 


Z72853 


Saccharomyces 
cerevisiae 


ORE YGR069W 


94 


42 


11821 


G03710 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7791. 


176 


65 


11822 


AP000616 


Oryzasativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


129 


60 


11823 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


104 


68 


11824 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


99 


51 


11825 


W90172 


Homo sapiens 


Human heart muscle specific protein. 


256 


34 


11826 


AB017007 


Homo sapiens 


PMS2L16 


248 


89 


11827 


AL16037r 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


127 


86 


11828 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


135 


71 


11829 


AFl 19851 


Homo sapiens 


PR01722 


129 


60 


11830 


AF 132200 


Homo sapiens 


PRO 1751 


120 


77 


11831 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


133 


83 


11832 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5066. 


128 


64 


11833 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


152 


100 


11834 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


156 


73 


11835 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


88 


Tl836^ 


AP000616 ^ 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 


95 


11837 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


411 


91 


11838 


AK000496 


Homo sapiens 


unnamed protein product 


113 


78 


11839 


AFl 19855 


Homo sapiens 


PRO 1847 


84 


68 


11840 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


128 


88 


11841 


S79410 


Mus musculus 


nuclear localization signal binding 

protein 


113 


69 


11842 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


151 


55 


11843 


AF090942 


Homo sapiens 


PRO06.57 


139 


65 


11844 


AF2202 64 


Homo sapiens 


MOST-1 


SS 


69 


11845 


AF184612 


Drosophila- 
melanogaster 


split ends 


95 


47 


11846 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


136 


82 
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Description 


•Waterman 
Score 


Identit 

y 


11847 


AB017007 


Homo sapiens 


PMS2L16 


310 


100 




G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 




79 




W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 






11850 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


110 


72 


11851 


AF090931 


Homo sapiens 


PRO0483 


164 


80 




AFl 16661 


Homo sapiens 


PRO 143 8 




67 


11853 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


119 


72 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 










Homo sapiens 


super cysteine rich protein; SCR? 


100 


84 


11856 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


54 


11857 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


68 


11858 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


69 


11859 


W48351 


Homo sapiens 


Humw breast cancer related protein 
BCRB2. 


94 


66 


11860 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


94 


66 


11861 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


108 


75 


11862 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


86 


11863 


AFl 3 0050 


Homo sapiens 


PRO0872 


81 


58 


11864 


AFl 19900 


Homo sapiens 


PR02822 


152 


90 


11865 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


T8 


45 


11866 1 J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


526 


93 


11867 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


80 


11868 


G00591 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4672. 


119 


58 


11869 


Y10830 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


246 


100 


11870 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124. 


101 


59 


11871 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 






11872 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 






11873 


D28113 


Homo sapiens 


MOBP 


458 


80 


11874 


D28114 


Homo sapiens 


MOBP 






11875 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


85 


11876 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


110 


64 


11877 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


57 


11878 


M15530 


Homo sapiens 


B-cell growth factor 


98 


62 


11879 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


100 


11880 Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 10 1. 


99 


70 


11881 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


101 


53 



452 



w o 01/64835 



PCT/llSOl/04927 



SEQID I Accession No. 

NO: i 

i 


Species 


Description 


Waterman 
Score 


Identit 

y 


11882 G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


219 


75 


11883 i G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 




61 


11884 


U93563 


Homo sapiens 


putative pi 50 


139 


75 


11885 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


154 


52 


11886 


AF090944 


Homo sapiens 


PRO0663 




80 


11887 


AP000616 


Oryza saliva 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


107 


94 


11888 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


138 


62 


11889 


L26953 


Homo sapiens 


chromosomal protein 


123 


75 


11890 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


105 


69 


11891 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


129 


83 


11892 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


126 


78 


11893 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


82 


70 


11894 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


149 


90 


11895 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


118 


75 


11896 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6532. 


167 


86 


11897 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


136 


77 


11898 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


126 


58 


11899 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


158 


78 


11900 


S58722 


Homo sapiens 


X-Iiniced retinopathy protein {C- 
terminal, clone XEH.8c} 


137 


71 


11901 


G00613. 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4694. 


85 


62 


11902 


G02558 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6639. 


242 


73 


11903 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


127 


59 


11904 


G03472 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7553. 


182 


85 


11905 


S58722 


Homo sapiens 

, 1 


X-Iinked retinopathy protein {C- 
terrainal, clone XEH.Sc} 


135 


75 


11906 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.Sc} 


128 


79 


11907 


U93574 


Homo sapiens 


putative pi 50 


169 


64 


11908 


AF090944 


Homo sapiens 


PRO0663 


128 


75 


11909 


002485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


100 


84 


11910 


AF090930 




PRO0478 






11911 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


496 


88 


11912 


JOS 071 


Homo sapiens 


chorionic somatomammotropin CS-2 


551 


92 


11913 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:I03. 


168 


76 


11914 
11915 
11916 


K02401 


Homo sapiens 


chorionic somatomammotropin 


450 


85 


K02401 
K02401 


Homo sapiens 
Homo sapiens 


chorionic somatomammotropin 


478 
454 


87 
82 



453 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO; 


Accession No. 


Species 


Description " " ■ 


Smitli- 
Waterman 


% 

Identit 

y 


11917 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


73 


11918 


X65121 


Mus musculus 


alphal (X) collagen 


112 


35 


11919 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 




62 


11920 


AK024455 


Homo sapiens 


FLJ00047 protein 


102 




11921 


AF 150087 


Homo sapiens 


small zinc finger-like protein 


181 


52 


11922 


U55376 


Caenorhabditis 

elegans 


F16H11.2 gene product 


122 


78 


1J923 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


256 


80 


11924 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

_ .._ _ 


127 


74 


11925 


AF090944 


Homo sapiens 


PRO0663 


144 


75 


11926 


AJ223475 


Escherichia coli 


InsA protein 


477 


100 




M15386 


Homo sapiens 


gamma-globin 






11928 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


105 


72 


11929 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


122 


92 


11930 


AF090895 


Homo sapiens 


PROG 11 7 


80 


71 


11931 


AF130089 


Homo sapiens 


PRO2550 


129 


68 


1 1932 


AK025116 


Homo sapiens 


unnamed protein product 


113 


53 


11933 




Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 






11934 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

4658. 






11935 


AFl 16695 


Homo sapiens 


PR02221 




11936 


AF090931 


Homo sapiens 


PRO0483 


133 


77 


11937 


AF130089 


Homo sapiens 


PRO2550 






11938 


L06237 


Homo sapiens 


microtubule-associated protein IB 


143 


36 


11939 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


124 


72 


11940 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

4508. 


158 


76 


11941 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


130 


53 


11942 


Y 14480 


Homo sapiens 


Fragment of human secreted protein 

encoded by gene 14. 






11943 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

J -■ imn.- 


106 


59 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 








D2S113 


Homo sapiens 


XTAT^^J I. tl ="T 


~5p 









Homo sapiens 


NADH dehydrogenase subunit 1 






11947 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


113 


72 


.. ... 





Homo sapiens 


NADH dehydrogenase subunit 6 






11949 


AJ006591 


Homo sapiens 


cysteine-rich protein 


148 


76 


11950 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tennina], clone XEH.8c} 


11j 


88 




r^nc 

AFl 16695 


Homo sapiens 


PR02221 


^705 


— 


11952 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO: 128. 


158 


75 


11953 


AFl 13685 


Homo sapiens 


PRO0974 


107 


72 


11954 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7339. 


107 


80 


11955 


AK026107 


Homo sapiens 


unnamed protein product 


129 


84 


11956 


AFl 09907 


Homo sapiens 


S164 


140 


96 



454 



w o 01/64835 



PCT/llSOl/04927 



no/ 




Species 




Waterman 
Score 


Identit 

y 


11957 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


99 


100 


11958 


K02576 


Homo sapiens 


salivary proline-rich protein ] 


124 


41 


11959 


K02401 


Homo sapiens 


chorionic somatomammotropin 


497 


90 


11960 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


155 


69 


11961 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


126 


85 


11962 


M15530 


Homo sapiens 


B-cell growth factor 


114 


86 


11963 


K02401 


Homo sapiens 


chorionic somatomammotropin 


472 


87 


11964 


AK024455 


Homo sapiens 


FLJ00047 protein 


95 


61 


11965 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


145 


78 


11966 


L27428 


Homo sapiens 


reverse transcriptase 


144 


50 


11967 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


187 


76 


11968 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


124 


81 


11969 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4718, 


112 


64 


11970 


AK024455 


Homo sapiens 


FLJ00047 protein 


120 


70 


11971 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


105 


61 


11972 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


106 


70 


11973 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4658. 


70 


75 


1:1 974^ 


G00491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4572. 


86 


60 


11975 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


161 


72 


11976 


W48351 


Homo sapiens 


Human breast cancer related protein 

BCRB2. 


105 


42 


11977 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


121 


58 


11978 


AK024372 


Homo sapiens 


unnamed protein product 


104 


80 


11979 


S58722 


Homo sapiens 


X-linked r^inopathy protein {C- 
terminal, clone XEH.Sc} 


137 


77 


11980 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


96 


90 


11981 


S79410 


Mus rausculus 


nuclear localization signal binding 
protein 


100 


56 


11982 


R59842 


Homo sapiens 


ApoE4LI protease. 


129 


62 


11983 


KOI 664 


Drosophila 
melanogaster 


Bion-like protein 


132 


53 


11984 


S58722 


Homo sapiens 


X-linlced retinopathy protein {C- 
terminal, clone XEH.Sc} 


105 


90 


11985 


Ml 5530 


Homo sapiens 


B-cell growth factor 


117 


43 


11986 


G03438 


Homo sapiens 


Human secreted protein, SEQ DD NO: 
7519. 


99 


65 


11987 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


117 


84 ■ 


11988 


L26953 


Homo sapiens 


chromosomal protein 


124 


79 


11989 


U93564 


Homo sapiens 


p40 


523 


88 


11990 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 
SEQIDNO:163. 


155 


82 


11991 


AF090894 


Homo sapiens 


PRODI 13 


126 


56 


11992 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4442. 


129 


63 


11993 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


144 


67 


11994 


U71363 


Homo sapiens 


zinc finger protem zSp6 


213 


68 



455 



w o 01/64835 



PCT/llSOl/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 






Waterman 


Identit 










Score 


y 


11995 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


82 


65 














11996 


AFI0OO89 


Homo sapiens 


PRO2550 


145 


68 


11997 


R95913 


Homo sapiens 


Neural thread protein. 


114 


88 


11998 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


56 






„ . .. ... 


6613. 






11999 


X62677 


Oryctolagus 


retrovirus related reverse transcriptase 


147 


81 






cuniculus 








12000 


AF09093 1 


Homo sapiens 


PRO0483 


152 


81 


12001 


AF090931 


Homo sapiens 


PRO0483 


145 


81 


12002 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


64 








^^^^ . . 






12003 


S58722 


Homo sapiens 


X-hnked retinopathy protein {C- 


103 


78 








tetminal, clone XEH.8c} 






12004 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


51 






7795. 






12005 


Y36495 


Homo sapiens 


Fragment of human secreted protein 


108 


63 








encoded by gene 27. 






12006 




LI 9527 


Homo sapiens 


ribosomal protein L27 


547 


93 


12007 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


127 


82 


12008 


R96800 


Homo sapiens 


Human histiocyte-secreted fector HSF. 


127 


82 


12009 


M55409 


Homo sapiens 


pancreatic tumor-related protein 


257 


98 


12010 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


64 








4446. 






1201 1 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


291 


89 


12012 


X03717 


Homo sapiens 


pot. unidentified reading ftame 


126 


43 


12013 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


131 


80 








terminal, clone XEH.Sc} 








AB011148 


Homo sapiens 


KIAA0576 protein 


135 


84 


12015 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 


1 14 


87 








tenninal, clone XEH.8c} 






12016 


AK024455 


, ... . .. 

Homo sapiens 


FLJ00047 protein 


102 


70 


12017 


L27428 


Homo sapiens 


reverse transcriptase 


144 


87 


12018 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


87 








4457. 






12019 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


137 


93 


12020 


AK001363 


Homo sapiens 


unnamed protein product 


490 


100 


12021 


¥27868 


Homo sapiens 


Human secreted protein encoded by 


111 


79 








gene No. 107. 






12022 


W00838 


Homo sapiens 


Tumour iiecrosis factor-related gene 


106 


87 








product. 






12023 


W80406 


Homo sapiens 


A secreted protein encoded by clone 


157 


58 








: 






- 

12024 




G03172 


-. 1 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


66 








y-^^^- - .. .... 






12025 


R9591j 


■ - 
Homo sapiens 


Neural thread protein. 


157 


^? 


12026 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


52 








4435. 






12027 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


60 








4535. 






12028 


V00488 


Homo sapiens 


alpha globin 


191 


100 






Oryctolagus 


retrovirus related reverse transcriptase 










cuniculus 








12030 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


114 


79 


12031 


G01246 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


142 


86 








5327. 






12032 


X01703 


Homo sapiens 


alpha-tubulm 


534 


93 


12033 


U09823 


Oryctolagus 


elongation factor I alpha 


557 


90 






cuniculus 









456 



wo 01/64835 PCT/llSOl/04927 



SEQID 


Accession No. 


Species 


Description 


Smitli- 


% " 


NO: 








Waterman 


Identit 












y 


:^ 


"xoTtos 


^ 


—r— r-j. 


1^ 

-^i? 


^4 ., 


~ 


AF255556 


0 0 enia 


pa uin 




85 






coniceps 


— — 






03 


X 


Paracentrotus 


a p a-tu u m (AA 1-45 ) 


— — 


87 






: 








. — 


M62810 


— omo sapiens 


'■ h fl • Itra =~P S i 

mi DC on na nscnp 'O" -tac or 


—r 

— 1- 


— 

_ir 


as — 




Homo sapiens 


Human secreted protein ^8. 


— 


-11 


^ — 




Homo sapiens 


alpha-tubulin 




97 


12040 




X01703 


Homo sapiens 


aipha-tubulin 


— 

517 


90 


12041 


S70154 


Homo sapiens 


cytosoiic acetoacetyl-coenzyme A 


533 


94 








thiolase, CT {EC 2.3.1,9} 






. ... 


. 


Homo sapiens 


alpha-tubulin 


- - 


85 




M23613 


Homo sapiens 


nucleophosniin 


498 




— 
12044 


AL03 1 1 74 


Schizosaccharom 


hypothetical protein 


102 


52 






yces pombe 










12045 




Y94653 


Homo sapiens 


Human netrin-lilce protein (NBL) 


152 


82 •. 








amino acid se(]uence. 






., ... 
12046 


. 

AFO 16507 


Homo sapiens 


C-terminal binding protein 2 




100 


_12047 


X05 196 


Homo sapiens 


aldolase C 


454 


93 


12048 


X03796 


Mus musculus 


aldolase C (aa 1-227) 


^51 


74 


12049 


X05196 


Homo sapiens 


aldolase C 


408 


88 


12050 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 


147 


70 








Clone rtJNrlrtJzy. 








12051 




S58722 


: 

Homo sapiens 


X-linked retinopathy protein {C- 




169 


55 








teiTninal, clone XEH. 8c} 









AF090944 


Homo sapiens 


PROO 66 3 


. ' 


59 






Homo sapiens 


variable region 






12054 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO . 


144 


74 








7492. 






12055 


AF09093 1 


Homo sapiens 


PRO0483 


121 


74 


12056 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


76 








698j. 






12057 


U 18339 


Variola virus 


..- ... .... .. 


100 


52 


12058 


R96800 


Homo sapiens 


Human histiocyte-secreted &ctor HSF. 


135 


62 


12059 


AB001684 


Chlorella 

vulgaris 


ORF41c 


81 


75 


12060 


Y02785 


Homo sapiens 


Human secreted protein encoded by 


128 


74 








gene 51 clone HUKEX85. 






12061 


AK024455 


Homo sapiens 


FLJ00047 protein 


70 


46 


12062 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


l.,0 


69 















1206j 


AF090944 


-— : 

Homo sapiens 


PRO0663 


- 

110 


66 


12064 


Y14480 


Homo sapiens 


Fragment of human secreted protein 


109 


53 








encoded by gene 14. 






12065 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


79 








7881. 






12066 


AF090931 


Homo sapiens 


PRO0483 


123 


70 


12067 


GO 03 65 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


75 








4446. 






12068 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


77 
















G03172 


Homo sapiens 


Human secreted protein SEQ ID NO' — 


^45 


^76 








7253. 






12070 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


81 








7720. 






12071 


AFl 16715 


Homo sapiens 


PR02829 


138 


63 


12072 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


127 


82 


12073 


U79260 


Homo sapiens 


unknown 


98 


73 


12074 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


132 


80 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 

Waterman 

Score 


% 

Identit 

y 








clone HPMBQ32. 






12075 


J02459 


bacteriophage 
lambda 


T(tail component; 144) 


555 


98 


12076 


AK024455 


Homo sapiens 


FLJ00047 protein 


127 


65 


12077 


X00911 


Rattus 
norvegicus 


pot. MSA-precursor 




73 




AF 164797 


Homo sapiens 


ribosomal protein L17 isolog 






12079 


A06977 


Homo sapiens 


albumin 


602 


87 


12080 


AF000198 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


118 


38 


12081 
12082 


AL359782 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


142 


86 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


103 


50 


12083 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


86 


12084 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


64 


54 


12085 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


53 


12086 


A 14829 


Homo sapiens 


preproapolipoprotein 


619 


82 


12087 


AFl 19900 


Homo sapiens 


PR02822 


107 


76 


12088 


U52077 


Homo sapiens 


mariner transposase 


277 


74 


12089 


AP000616 


Oryzasafva 


similar to RING-H2 finger protein 
RHAla(AF078683) 


126 


100 


12090 


AP000616 


Oiyza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 


92 


12091 


WO 527 8 


Homo sapiens 


Tumour necrosis factor-related gene 
product CL6,5.40. 


115 


42 


12092 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327, 


112 


81 


12093 


G03343 


Homo sapiens 


Human secreted protein, SHQ ID NO: 

7424. 


114 


64 


12094 


AJ237660 


Bacteriophage 
21 


Ren protein 


187 


94 


12095 


G 02 994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


135 


80 


12096 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
■spo ir* xro- 1 

oxi^s^ It-* INL'. 1 OJ. 


161 


49 


12097 


S58722 


Homo sapiens 


X-linlced retinopathy protein {C- 
tenninal, clone XEH.Bc} 


122 


92 








... 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 








AF090942 


Homo sapiens 


PRO0657 




70 




AF090931 


Homo sapiens 


PRO0483 






12101 


AF090944 


Homo sapiens 


PRO0663 


103 


37 


12102 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


76 


58 






Homo sapiens 


platelet glycoprotein lib precursor 






12104 


AL 160493 


Leishmania 

: 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


106 


84 


121 


R9591 


Homo sapiens 


Neural thread protein. 






12106 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


149 


75 


12107 


Y09561 


Homo sapiens 


ATP receptor 


188 


85 


12108 


AF090895 


Homo sapiens 


PROG 1 17 


107 


83 


12109 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


117 


65 


12110 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


65 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 

y 








6292. 








G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 










Homo sapiens 


congenital heart disease 5 protein 






12113 


AF220264 


Homo sapiens 


MOST-1 


142 


76 


12114 


Y 12077 


Homo sapiens 


Human 5' EST secreted protein SEQ ID 
NO: 390. 


109 


95 


12115 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


171 


63 


12116 


AF064869 


Rattus 
norvegicus 


brain-enriched guanylate kinase- 
associated protein 2; BEGA2 


294 


98 


12117 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


128 


36 


12119 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


102 


73 






Homo sapiens 


imkaown 






12121 


AF161356 


Homo sapiens 


HSPC093 


118 


67 




AF 130087 


Homo sapiens 




115 


SO 


12123 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


280 


75 


12124 


S48406 


Mus sp. 


alpha 1 (XII) collagen {triple-helical 
domain COL2} 


59 


41 


12125 


AB030816 


Homo sapiens 


H-REV107 protein-related protein 


420 


79 


12126 


R96418 


Homo sapiens 


Partial human transforming growth 
fector beta receptor type II. 


169 


100 


12127 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


91 


46 


12128 




Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 




83 


12129 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

. ... 


127 


57 




AFl 19900 


Homo sapiens 


PR02822 






12131 


¥86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


127 


70 


12132 


AF303828 


Mus musculus 


ubc-like protein MMS2 


131 


43 


12133 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 



127 


58 


1 ^ 


U49974 


omo sapiens 


mariner transposase 





. .. . 


— 




Mus musculus 


Q300 protein (AA 1-77) 






12136 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


151 


69 


12137 


AF192913 


Homo sapiens 


zinc finger protein ZNF 1 80 


581 


94 




X58907 


Homo sapiens 


steroid 21-nionooxygenase 






12139 


AF090931 


Homo sapiens 


PRO0483 


121 


70 


12140 


AK024455 


Homo sapiens 


FLJ00047 protein 


84 


60 


12141 


AF090931 


Homo sapiens 


PRO0483 


122 


71 


12142 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


60 


12143 


AF095770 


Homo sapiens 


PTH-nesponsive osteosarcoma Dl 
protein 


121 


100 


12144 


G02922 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


191 

257 


62 


12145 


U57092 


Homo sapiens 


Rab30 


12146 


AF130089 


Homo sapiens 


PRO2550 


129 


74 


12147 


AF090931 


Homo sapiens 


PRO0483 


138 


80 


12148 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


135 


60 


12149 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


138 


60 
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Description 


Waterman 
Score 


Identit 

y 


12150 


U51723 


Plasmodium 
vivax 


V-SERA 1 


107 


37 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 




68 


12152 


Y23884 


Homo sapiens 


Amino acid sequence of FK506 
binding protein 65 (F1GBP65). 


124 


69 


12153 


G02490 


Homo sapiens 


Human secreted protein, SEQ TD 'NO: 


93 


70 


12154 


AF229067 


Homo sapiens 


PADI-H protein 


162 


68 




R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 






12157 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


113 


39 




U89439 


Bos taures 


ubiquitin-like protein 




76 


12159 


G03725 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


677 


98 


12160 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


90 


12161 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


80 


12162 


AF090931 


Homo sapiens 


PRO0483 


125 


76 


12163 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


191 


64 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


96 


61 




G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4493. 


1 15 


68 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


219 


93 


12167 


AFl 18086 


Homo sapiens 


PRO 1992 


155 


78 






Mus musculus 


cytokeratin 8 (AA I - 489) 


174 


63 


12169 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


94 


48 


12170 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63. 


150 


57 


12171 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


141 ■ 


78 


12172 


AF068294 


Homo sapiens 


HDCMB45P 


184 


56 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


116 


50 




AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhala. 






12175 


X92485 


Plasmodium 
\ivax. _ 


pval 


108 


44 




Yigj^j 


Homo sapiens 


. 

SEQ ID NO 485 trom W09922243. 


"To2 




12177 


X83703 


Homo sapiens 


nuclear protein 


330 


68 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 










Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 








AFl 32200 


Homo sapiens 


PRO 1751 






12181 


AC005514 


Homo sapiens 


CTF5 


261 


90 




W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 






12183 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


137 


75 


12184 


P92219 


Homo sapiens 
(human) 


CRl protein. 


101 


74 


12185 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


156 


56 
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SEQID 
NO: 


Accession No. 




OescripCion 


Smith- 
Waterman 
Score 


% 
y 


12186 


G02213 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


626 


97 


12187 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


96 


51 


12188 


AF1180G7 


Homo sapiens 


PRO 15 78 


89 


73 


12189 


M63154 


Homo sapiens 




130 


96 


12190 


Af090942 


Homo sapiens 


PRO0657 


144 


72 


12191 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


76 


12192 


G02832 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO: 
6913. 


134 


63 


12193 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XBH.8c} 


159 


76 


12194 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 
RHAla(AF078683) 


117 


76 


12195 


AF090931 


Homo sapiens 


PRO0483 


134 


62 


12196 


AFl 30089 


Homo sapiens 


PRO2550 


117 


74 


12197 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XBH.8c} 


113 


57 


12198 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


126 


70 


12199, 


J02459 


bacteriophage 
lambda 


Fi (DNA packaging; 117) 


203 


95 


12200 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


135 


58 


12201 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


86 


12202 


AFl 19900 


Homo sapiens 


PR02822 


148 


68 


12203 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ1DN0:163. 


128 


59 


12204 


R59842 


Homo sapiens 


ApoE4Ll protease. 


118 


91 


12205 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


121 


72 


12206 


S75997 


Rattus sp. 


nucleoporin p62 homolog 


131 


65 


12207 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


65 


12208- 


S58722 


Homo sapiens 


X-iinked retinopathy protein {C- 
tenninal, clone XEH.Sc} 


145 


86 


12209 


AJ242956 


Homo sapiens 


El ftjsion protein 


96 


88 


12210 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


132 


78 


12211 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


81 


12212 


K03179 


Homo sapiens 


pro-alpha- 1 type-I collagen 


96 


37 


12213 


GO 1895 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5976. 


310 


57 






Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 










Homo sapiens 


Deleted in Colorectal Carcinomas. 




98 


12216 


AF132200 


Homo sapiens 


PRO 175 1 


109 


61 


12217 


AB013897 


Homo sapiens 


HKRl 


324 


64 


12218 


AF071172 


Homo sapiens 


HERC2 


123 


86 


12219 


L38593 


Homo sapiens 


integral membrane protein 


78 


62 


12220 


AF036233 


Homo sapiens 


cdc25B phosphatase 


204 


51 


12221 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


107 


60 


12222 


AL049795 


Homo sapiens 


dJ622L5.9 (eukaryotic translation 
initiation factor 3, subunit 2 (beta, 
36kD) (TRIP-1, TGF-beta receptor 
interacting protein 1)) 


163 


86 


12223 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


76 
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Species 


Dcsciiptum 






NO: 








Waterman 
Score 


Identit 

y 


12224 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


64 








7871. 






12225 


AFl 30079 


Homo sapiens 


PR02852 


103 


80 


12226 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


86 








4457. 






12227 


AFl 19855 


Homo sapiens 


PRO 1847 


96 


78 


12228 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


195 


87 








4473. 






12229 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


103 


90 








gene 44 clone HTDAD22. 






12230 


U49957 


Homo sapiens 


LM protein 


212 


100 


12231 


AK022759 


Homo sapiens 


unnamed protein product 


646 


100 


12232 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


110 


70 








clone HCACJ81. 






12233 


Y27S68 


Homo sapiens 


Human secreted protein encoded by 


109 


74 








gene No. 107. 






12234 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


123 


76 








sequence. 






12235 


AF220264 


Homo sapiens 


MOST-1 


104 


85 


12236 


S58722 


Homo sapiens 


X-ILnked retinopathy protein {C- 


113 


87 








teiminal, clone XEH.Sc} 






12237 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


84 








7881. 






12238 


AF090942 


Homo sapiens 


PRO0657 


103 


66 


12239 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


72 








4435. 






12240 


Y36495 


Homo sapiens 


Fragment of Iiuman secreted protein 


135 


75 








encoded by gene 27, 






12241 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


136 


95 








RHAla(AF078683) 






12242 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


148 


72 








terminal, clone XEH.Sc} 






12243 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


118 


39 


12244 


AL390114 


Leishmania 


probable (iihv-6) ul 102, variant a 


78 


100 






major 


DNA, complete virion genome 






12245 


AC005200 


Homo sapiens 


plasmalemmal porin 


131 


79 


12246 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


65 








6571. 






12247 


W48351 


Homo sapiens 


Human breast cancer related protein 


90 


55 








BCRB2. 






12248 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


77 


64 








encoded by gene 121. 






12249 


U63542 


Homo sapiens 


FAP protein 


108 


56 


12250 


S58722 


Homo sapiens 


X- linked retinopathy protein {C- 


101 


75 








terminal, clone XEH. 8c} 






12251 


G03807 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


139 


63 








7888. 






12252 


G00985 


Homo sapiens 


Human secreted protein, SBQ ID NO: 


122 


71 








5066. 






12253 


AFl 18086 


Homo sapiens 


PR01992 


73 


70 


12254 


AF083929 


Mus musculus 


ES18 


108 


44 


12255 


AF130089 


Homo sapiens 


PRO2550 


124 


88 


12256 


AF090931 


Homo sapiens 


PRO0483 


132 


80 


12257 


X95190 


Homo sapiens 


branched chain acyl-CoA oxidase 


166 


86 


12258 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


53 








7868. 






12259 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


88 








7124. 






12260 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


85 
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SEQIO 
NO: 


Accession No. 




Descriptfon 


AVaterman 


Mentit 

y 








4409. 






12261 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


133 


72 


12262 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


110 


70 


12263 


Y0806] 


Homo sapiens 


Human c-myb protein fragment. 


125 


85 


12264 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


126 


65 


12265 


AP000616 


Oryzasadva , 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


114 


95 


12266 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


63 


12267 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7879. 


166 


73 


12268 


AF090931 


Homo sapiens 


PRO0483 


108 


82 


12269 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


120 


68 


12270 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


120 


66 


12271 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


74 
60 


12272 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


129 


12273 


AF 13 0089 


Homo sapiens 


PRO2550 


134 


79 


12274 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


118 


80 


12275 


AF194537 


Homo sapiens 


NAGI3 


151 


52 


12276 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6913. 


75 


68 


12277 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


113 


60 


12278 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


111 


70 


12279 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


119 


92 


122S0 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


90 


12281 


U66464 


Homo sapiens 


hematopoietic progenitor kinase 


102 


83 


12282 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7519. 


118 


65 


12283 


WO0838 


Homo sapiens 


Tumour necrosis factor-related gene 
product. 


162 


82 


12284 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


130 


77 


12285 


AF116661 


Homo sapiens 


PR01438 


104 


74 


12286 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


98 


66 


12287 


R13556 


Homo sapiens 


Protein encoded downstream of hlic M 
oncoprotein. 


105 


74 


12288 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


129 


78 


12289 


X53581 


Rattus 
norvegicus 


0RF7 


111 


47 


12290 


AF090931 


Homo sapiens 


PRO0483 


147 


87 


12291 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


106 


69 


12292 


AP000616 


Oryza sativa 


similar to RING-H2 fmger protein 
RHAla (AF078683) 


137 


53 


12293 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


142 


56 
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NO: 




Spec.es 


Description 


Waterman 


Idcntit 


12294 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


117 


60 


12295 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


112 


66 


12296 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


140 


62 


12297 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


144 


70 


12298 


G02455 


Homo sapiens ■ 


Human secreted protein, SEQ ID NO: 
6536. 


95 


51 


12299 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


108 


70 


12300 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJWI8. 


122 


85 


12301 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


117 


73 


12302 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


115 


62 


12303 


AF130051 


Homo sapiens 


PRO0898 


116 


65 


12304 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


119 


60 


12305 


V00662 


Homo sapiens 


ATPase 6 


118 


100 


12306 


AF107406 


Homo sapiens 


GW128 


98 


66 


12307 


A05308 


syntiietic 
construct 


glioblastoma-derived T-cell suppressor 
factor 


376 


93 


12308 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


173 


71 


12309 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


140 


74 


12310 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


98 


47 


12311 


Y27571 


Homo sapiens 


Human secreted protein encoded by 
gene No. 5. 


130 


42 


12312 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


134 


76 


12313 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


98 


82 


12314 


U79260 _j 


Homo sapiens 


unknown 


97 


64 


12315 


L27428 


Homo sapiens 


reverse transcriptase 


214 


44 


12316 


D00570 


Mus musculus 


open reading frame (196 AA) 


115 


83 


12317 


AF090931 


Homo sapiens 


PRO0483 


104 


74 


12318 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 


123 


84 


12319 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
5909. 


107 


70 


12320 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


97 


62 


12321 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


95 


58 


12322 


G00362 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4443. 


104 


59 


12323 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


115 


75 


12324 


AF286472 


Homo sapiens 


retinitis pigmentosa GTPase regulator 


121 


53 


12325 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


162 


75 


12326 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


129 


75 


12327 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


125 


88 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 


12328 


AF090944 


Homo sapiens 


PRO0663 


122 


76 


12329 


AF090931 


Homo sapiens 


PRO0483 


156 


81 


12330 


W00838 


Homo sapiens 


Tumour necrosis fector-related gene 
product. 


107 


80 


12331 


AF11S715 


Homo sapiens 


PR02829 


127 


72 


12332 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


145 


49 


12333 


S80862 


Western equine 
encephalorayeliti 
s virus Western 
equine 

encephalomyeliti 
s virus, Peptide 
Partial, 259 aa 


nucleocapsid gene C 


108 


35 


12334 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.Sc} 


141 


83 


12335 


AK024455 


Homo sapiens 


FLJ'00047 protein 


133 


71 




AF090944 


Homo sapiens 


PRO0663 




84 


12337 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


124 
102 


71 


12338 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCS 1 1037. 


69 


12339 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


121 


81 


12340 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


149 


54 


12341 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


134 


66 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


102 


60 


12343 


API 18086 


Homo sapiens 


PRO 1992 


80 


63 


12344 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


104 


80 


12345 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


145 


72 


12346 


LR96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


143 


75 




G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4770. 


116 


37 


12348 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


176 


76 


12349 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


103 


48 






Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


101 








Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 






12352 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCS 1 1037. 


101 


84 


12353 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


42 


12354 


S58722 


Homo sapiens 


X-linked retinopathy protem {C- 
termiaal, clone XEH.Sc} 


147 


66 


12 


G00397 


Tf = 

Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 






12356 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


96 


68 


12357 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4435. 


122 


43 


12358 


ALl 60493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete vkion genome 


132 


38 


12359 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


90 
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NO: 








Waterman 


Identit 








4478. 






12360 


G03714 


Homo sapiens 


Human secreted protein, SEQ JD NO; 

7795. 


113 


56 


12361 


AB001684 


Chlorella 
vulgaris 


ORF49b 


92 


44 


12362 


U93564 


Homo sapiens 


p40 


257 


84 


12363 


U60269 


Homo sapiens 


putative envelope protein; orf similar to 
env of Type A and Type B retroviruses 
and to class 11 HERVs 


427 


84 


12364 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7145. 


102 


73 


12365 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


102 


68 


12366 


AF090944 


Homo sapiens 


PRO0663 


94 


45 


12367 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


150 


55 


12368 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 


105 


85 


12369 


AF130089 


Homo sapiens 


PRO2550 


112 


78 


12370 


AF090895 


Homo sapiens 


PRODI 17 


125 


64 


12371 


S58722 


Homo sapiens 


X-linl<ced retinopathy protein {C- 
terminal, clone XEH.Sc} 


121 


95 


12372 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


145 


72 


12373 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


144 


72 


12374 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


133 


79 


12375 


APOOOOOl 


Pyrococcus 
horikoshii 


5Saa long hypolhetical protein 


96 


51 


12376 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


105 


76 


12377 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


123 


78 


12378 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

gene 22 clone HMSJW18. 


126 


83 


12379 


Y53643 


Homo sapiens 


A bone marrow secreted protein 
designated BMS6. 


329 


98 


12380 


U64094 


Homo sapiens 


soluble type II interleukin-l receptor 


114 


100 


12381 


J02459 


bacteriophage 
lambda 


D (head-DNA stabilization;! 10) 


467 


100 


12382 


L10908 


Mus musculus 


Gcapl gene product 


100 


89 


12383 


AF220264 


Homo sapiens 


IVIOST-1 


113 


68 


12384 


U79260 


Homo sapiens 


unknown 


94 


60 


12385 


B01372 


Homo sapiens 


Neuron-associated protein. 


148 


52 


12386 


R13556 


Homo sapiens 


Protem encoded downstream of hhc M 
oncoprotein. 


116 


63 


12387 


G00952 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5033. 


125 


92 


12388 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


130 


65 


12389 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


150 


62 


12390 


G00454 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
4535. 


102 


38 


12391 


G03172 


Homo sapiens 


Human secreted protein, SEQ ED NO; 
7253. 


113 


56 


12392 


G00332 


Homo sapiens 


Human secreted protein, SBQ ID NO: 
4413. 


73 


92 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 

Waternian 

Score 


Identit 

y 


12393 


M69297 


Homo sapiens 


ORF3 


167 


75 




R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 






12395 


AF220264 


Homo sapiens 


MOST-1 


131 


57 


12396 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


139 


62 




AF130089 


Homo sapiens 


PRO2550 




62 


12398 


AK023117 


Homo sapiens 


unnamed protein product 


529 


97 




AF 123652 


Homo sapiens 






96 


12400 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


139 


87 


12401 


U49973 


Homo sapiens 


ORFI; MER37; putative transposase 
similar to pogo element 


306 


60 


12402 


AC00503 1 


Homo sapiens 


neuronal apoptosls inhibitory protein 


143 


96 


12403 


J03071 


Homo sapiens 


ctiorionic somatomammotropin CS-2 


585 


100 


12405 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


125 


100 


12406 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


150 


63 


12407 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7878. 


157 


85 


12408 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


120 


75 


12409 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


141 


50 


12410 


G03465 


Homo sapiens 


Human secreted protem, SEQ ID NO: 

7546. 


103 


51 


12411 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


147 


71 


12412 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


159 


70 


12413 


S58722 


Homo sapiens 


X-lmked retinopathy protein {C- 
terminal, clone XEH.8c} 


129 


79 


12414 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


156 


72 


12415 G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7075. 


74 


72 


12416 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


102 


50 


12417 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


171 


75 


12418 G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


137 


55 


12419 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


52 


12420 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


129 


53 


12421 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


55 


12422 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


126 


87 


12423 


U93564 


Homo sapiens 


p40 


350 


93 


12424 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (PC) 


484 


88 


12425 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


130 
J9 


69 
47 


12426 


X60376 


Brassica napus 


proline-rich protein 


12427 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


83 


88 


12428 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
traminal, clone XEH.Sc} 


117 


75 


12429 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


107 


67 
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NO: 




Species 


Description 


Waterman 
Score 


Identit 

y 


12430 


AK027208 


Homo sapiens 


unnamed protein product 


108 


64 


12431 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


154 


68 


12432 


AFl 30089 


Homo sapiens 


PRO2550 


118 


88 


12433 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


90 


55 


12434 , AK024455 


Homo sapiens 


FLJ00047 protein 


113 • 


75 




AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


107 


80 


12436 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


114 


75 


12437 


AC005498 


Homo sapiens 


R31665_2 


163 


75 


12438 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


12439 


G03116 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7197. 


M5 


84 


T2440 — 


AK024455 


Homo sapiens 


FLJ00047 protein 


136 


73 


^2441 — 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478, 


122 


69 




AF090942 


Homo sapiens 


PRO0657 


120 


72 


12443 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


159 


53 




AFl 30089 


Homo sapiens 


PRO2550 


no 


73 


12445 


AL02I395 


Homo sapiens 


dJ269Ml 5.1 (similar to peptidylprolyl 
isomerase (cyclophilin)) 


225 


64 


T2446 — 


AF090931 


Homo sapiens 


PRO0483 


155 


79 


12447 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


130 


81 


12448 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

4473. 


122 


74 


12449 


AFl 19855 


Homo sapiens 


PRO 1847 


88 


72 




G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4409. 


147 


84 




AK024455 


Homo sapiens 


FLJ00047 protein 


78 


68 


12452 


AFl 16712 


Homo sapiens 


PR02738 


118 


62 




G03787 


Homo sapiens 


Hiunan secreted protein, SEQ ID NO: 
7868. 


109 


67 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


144 


59 




AK024455 


Homo sapiens 


FLJ00047 protein 


146 


64 


"12456 — 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


166 


72 


12457 


AF090944 


Homo sapiens 


PRO0663 


129 


79 


12458 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6582. ' 


119 


70 


12459 


R22278 


Homo sapiens 


Human gp.-specific component 
phenotype Gel. 


250 


65 


12460 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


143 


58 


12461 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:195. 


111 


61 


12462 


G00328 


Homo sapiens 


Human secreted piwtein, SEQ ID NO: 
4409. 


147 


84 


12463 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7888. 


94 


38 


12464 


S58722 


Homo sapiens 


X-linlced retinopathy protein {C- 

terminal, clone XEH.Sc} 


134 


74 


12465 


AF200715 


Homo sapiens 


PTB domain adaptor protein CED-6 


332 


95 


12466 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


54 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
_Score 


Identit 








7867. 







12467 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


1 17 


55 


12468 


AFl 99023 


Homo sapiens 


phospholipid scramblase 4 


476 


98 


12469 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


104 


89 


12470 


AF108841 


Homo sapiens 


pol protein 


306 


87 


12471 


¥36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


111 


79 


12472 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4457. 


124 


78 


12473 


U1S339 


Variola vims 


D4L 


117 


60 


12474 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4458. 


124 


63 




AFl 3 0089 


Homo sapiens 


PRO2550 






12476 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


62 


12477 


M91242 


Rattiis 
norvegicus 


calcium channel alpha-1 subunit 


156 


68 


12478 


AFl 30089 


Homo sapiens 


PRO2550 


154 


79 


12479 


Ml 5530 


Homo sapiens 


B-cell growth factor 


154 


70 


12480 


U39904 


Mus musculus 


citron 


718 


97 


12481 
T2482^ 


R99364 


Homo sapiens 


Human REST protein DNA binding 
domain. 


229 


37 


AL 160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


97 


54 


12483 


AF161356 


Homo sapiens 


HSPC093 


94 


55 


12484 


Y69I66 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase protein. 


97 


75 


.12485 


AF090942 


Homo sapiens 


PRO0657 


103 


55 


12486 


U79260 


Homo sapiens 


unknown 


98 


74 


12487 


AL 160493 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


96 


53 


12488 


U37100 


Homo sapiens 


aldose reductase-like peptide 


400 


97 






Homo sapiens 


putative pi 50 






12490 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


104 


66 


12491 


R59842 


Homo sapiens 


ApoE4Ll protease. 


86 


89 


12492 


Y00876 


Homo sapiens 


Human LAPH-1 protein sequence. 


219 


45 


12493 


AK024455 


Homo sapiens 


FLJ00047 protein 


85 


53 


12494 


W48351 


Homo sapiens 


Human breast cancer related protein 

BCRB2. 


95 


63 


12495 


Y36722 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 98. 




60 


12496 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


102 


57 


12497 


Y77551 


Homo sapiens 


C-terminal domain of betalc integrin. 


132 


67 


12498 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4508. 


122 


69 


12499 


AF161356 


Homo sapiens 


HSPC093 


98 


51 


12500 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


81 


12501 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


97 


52 


12502 


AF132200 


Homo sapiens 


PRO 1751 


94 


69 


12503 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


76 


65 


12504 


P70494 


Homo sapiens 


Sequence of human B-ceU growth 
factor (BCGF). 


99 


75 
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SEQID 
NO: 


Accession No. 


species 


Description 


Smitli- 
Waterman 


Identit 


12505 


Ml 9651 


Rattus 


fos-related antigen 


132 


92 


12506 


G00354 


Homo sapiens 


Hmnan secreted protein, SEQ ID NO: 

4435. 


113 


48 


12507 


U94832 


Homo sapiens 


KSRP 


117 


~48 


12508 


W47029 


Homo sapiens 


Human N-proteinase (70 kDa short 


121 


62 


12509 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


124 


66 


12510 


GOB 807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


90 


80 


12511 


AF042384 


Homo sapiens 


BC~2 protein 


516 


98 


12512 


Y12950 


Homo sapiens 


secreted peptide. 


96 


50 


12513 


G03482 




Human secreted protein, SEQ ID NO: 
7563. 






12514 


AL 162044 


Homo sapiens 


hypothetical protein 


174 


48 


12515 


AF 130079 


Homo sapiens 


PR02852 


96 


73 


12516 


B03628 




a!"hrtCa^ner'^crece^r^ofo — T~ 











Homo sapiens 






— 


12518 


G 02920 


Homo sapiens 

omo sapiens 


Hunian secreted^^otdn^ SEQID NO- — 

7001. 


"99 


"85 


12519 


U00694 


Gallus gallus 


protein 


TsT 


~45 


12520 


AB030829 


norve^icus 








12521 


J04204 


Bos taurus 


32 kd accessory protein 


618 


95 


12522 


M36341 










12523 


AF090931 


Homo sapiens 


PRO0483 


163 


82 




AB007925 




KIAA0456 protein 






12525 


G02314 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


79 


12526 


AF178948 


Homo sapiens 


TALE homeobox protein Meis2a 


In 


"93 


12527 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


156 


85 


12528 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


57 


12529 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 


145 


68 


12530 


AK024372 


Homo sapiens 




Tp 


36 


12531 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


76 


65 


12532 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


132 


~66 


12533 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


146 


53 


12534 


AF090942 


Homo sapiens 


PRO0657 


83 


60 


12535 


W48351 


Homo sapiens 


BCRB2. ^ 


136 


65 


12536 


R95913 






106 




12537 


AF130089 


Homo sapiens 


PRO2550 


148 


84 


12538 


AF 102826 


Homo sapiens 


receptor 


605 


91 


12539 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


143 


52 


12540 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


87 


65 


12541 


GO3600 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7681. 


209 


92 
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Species 


Description 


Smitli- 


% 


NO: 










Identit 










slore™"" 




12542 


M31964 


Saimiriine 


collagen-like protein 


92 


52 






herpesvirus 2 








12543 


G03714 


Homo sapiens 


Himian secreted protein, SEQ ID NO: 


146 


76 








7795. 






12544 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


155 


72 








SEQ ID NO: 163. 






12545 


AC004912 


Homo sapiens 


similar to CRl 6, SH3 domain binding 


442 


98 








protein; similar to 2205340A 












{PID-.gl587070) 






12546 


V00662 


Homo sapiens 


URF A6r. (NADH dehydrogenase 


175 


91 








subunit) 






12547 


AF130089 


Homo sapiens 


PRO2550 


112 


40 


12548 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


48 








6613. 






12549 


X55110 


Homo sapiens 


neurite outgrowth-promoting protein 


479 


72 


12550 


X98296 


Homo sapiens 


ubiquitin hydrolase 


500 


88 


12551 


R94317 


Homo sapiens 


Hepatocyte proliferation substance HP- 


390 


92 








041V. 






12552 


G01623 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


90 








5704. 






12553 


D86966 


Homo sapiens 


similarto human ZFY protein. 


161 


56 


12554 


D00570 


Mus musculus 


open reading frame (196 AA) 


122 


67 


12555 


AF130089 


Homo sapiens 


PRO2550 


94 


88 


12556 


AF043184 


Homo sapiens 


T cell receptor beta chain 


607 


84 


12557 


1193 564 


Homo sapiens 


p40 


154 


100 


12558 


U42026 


Homo sapiens 


plasma membrane Ca2+-ATPase 


200 


97 








variant 4a PMCA4a 






12559 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


103 


50 


12560 


U90552 


Homo sapiens 


butyrophilin 


356 


98 


12561 


K02576 


Homo sapiens 


salivary proline- rich protein 1 


156 


37 ■ 


12562 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


138 


41 








clone HPMBQ32. 






12563 


XI 6454 


Homo sapiens 


carcinoembiyonic antigen subdomain B 


353 


98 


12564 


AL031186 


Homo sapiens 


bK984G1.4 (Ewing sarcoma breakpoint 


119 


100 








region 1 protein) 






12565 


M68941 


Homo sapiens 


protein-tyrosine phophatase 


223 


93 


12566 


¥14487 


Homo sapiens 


cytosolic serine 


145 


84 








hydroxymethyltransferase 






12567 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


61 








6913. 






12568 


AB005047 


Homo sapiens 


SH3 binding protein 


124 


52 


12569 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


53 








7492. 






12570" 


^68142 


Oryctolagus 


elongation factor 1 gamma 


678 


93 






cuniculus 








12571 


L20315 


Mus musculus 


MPSl protein 


337 


77 


12572 


AL3 59782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna. 


114 


50 








complete virion genome. 






12573 


AF130089 


Homo sapiens 


PRO2550 


124 


71 


12574 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


51 








7492. 






12575 


AF005902 


Monodelpiiis 


kinesin honiolog 


534 


82 






domestica 








12576 


Y56021 


Homo sapiens 


Human CD40 receptor interacting 


711 


96 








protein 4C4. 






12577 


Y99662 


Homo sapiens 


Human GTPase associated protein-13. 


634 


100 


12578 


Z48008 


Sacchaiomyces 


Soklp ■ 


225 


100 






cerevisiae 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 


12579 


AFl 12207 


Homo sapiens 


translation initiation factor elF-2b delta 


195 


97 


12580 


M22538 


Homo sapiens 


NADH-ubiquinone reductase 


224 


~82 


12581 


AB034730 


Mus musculus 


This gene is isolated by means of 
differential display method using ttw, 
an excellent mouse model for ectopic 
ossification.; similar to megakaryocyte 
stimulating factor precursor and 
cartilage superficial zone protein 


112 


44 


12582 


U93567 


Homo sapiens 


p40 


364 


93 


12583 


AY007233 


Homo sapiens 


phosphoinositol 3-phosphate binding 
protein-1 


475 


100 


12584 


L07592 


Homo sapiens 


peroxisome proliferator activated 
receptor 


113 


81 


12585 


AE003682 


Drosophila 
melanogaster 


CG8135 gene product 


, 191 


38 


12586 


AFl 19851 


Homo sapiens 


PRO 1722 


88 


64 


12587 


AF090895 


Homo sapiens 


PROG 11 7 


144 


59 


12588 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7645. 


144 


57 


12589 


U73199 


Mus musculus 


Rho-guanine nucleotide exchange 


111 


32 


12590 


J04615 


Homo sapiens 


small nuclear ribonucleopiotein 


383 


■ 97 


12591 


L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


60 


33 


12592 


y 02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


118 


78 


12593 


Y36495 


Horno sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


123 


83 


12594 


AF090944 


Homo sapiens 


PRO0663 


152 


82 


12595 


Y 15243 


Homo sapiens 


ULBP-3 amino acid sequence. 


852 


100 


12596 


X98235 


Drosophila 
melanogaster 


type I 


159 


42 


12597 


AF191309 


Mus musculus 


zinc finger protein 


407 


66 


12598 


W82841 


Homo sapiens 


Human cerebral protein-1. 


107 


91 


12599 


AF070664 


Homo sapiens 


HSPC008 


128 


89 


12600 


U03688 


Homo sapiens 


cytochrome P450 


257 


96 


12601 


ABO 15798 


Homo sapiens 


DnaJ homolog 


226 


73 


12602 


AC003O58 


Arabidopsis 
thaliana 


unknown protein 


88 


40 


12603 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7348. 




35 


12604 


AF200187 


cercopithicine 
herpesvirus 15 


EBNA2-lilce protein 


109 


28 


12605 


AF161532 


Homo sapiens 


HSPC047 


720 


100 


12606 


M21302 


Homo sapiens 


small proline rich protein 


60 


59 


12607 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


107 


47 


12608 


A J3 02650 


norvegicus 


RP59 protein 


599 




12609 


J05401 


omo sapiens 


kinase precursor (EC 2.7.3.2) 


763 


96 


12610 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6613. 


216 


87 


12611 


W78175 


Homo sapiens 


Human secreted protein encoded by 
gene 50 clone HSTAG52. 


337 


100 


12612 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


145 


66 


12613 


AFl 19855 


Homo sapiens 


PRO 1847 


157 


61 
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SEQID 


Accession No. 


Species 




Smith- 




12614 


U82303 


Homo sapiens 


unknown 


107 


62 


12615 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4718. 


146 


49 


12616 


AF130051 


Homo sapiens 


PRO0898 


254 


56 


12617 


AF037350 


Rattus 
norvegicus 


NF-E2-related factor 2 


807 


59 


12618 


Y02168 


Homo sapiens 


A facilitative glucose transporter 
protein GLUTS. 


452 


100 


12619 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


158 


84 


12620 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7893. 


246 


60 


12621 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


79 


43 


12622 


AFl 09907 


Homo sapiens 


SI64 


93 


40 


12623 


Y36156 


Homo sapiens 


Human secreted protein #28. 


114 


61 


12624 


AFl 16712 


Homo sapiens 


PR02738 


104 


48 


12625 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


81 


65 


12626 


AFl 13685 


Homo sapiens 


PRO0974 


104 


43 


12627 


AL160371 


Leishmania 


probable (iihv-6) ul 1 02, variant a 
DNA, complete virion genome 


116 


45 


12628 


AFl 07406 


Homo sapiens 


GW128 


133 


58 


12629 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7496. 


100 


57 


12630 


AB047600 


Macaca 
fascicularis 


fiypothetical protein 


172 


66 


12631 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6619. 


136 


63 


12632 


AFl 18082 


Homo sapiens 


PRO 1902 


138 


45 


12633 


G00397 


Homo sapiens 


Human secreted protein, SBQ ID NO; 
4478. 


151 


62 


12634 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


95 


58 


12635 


AF064597 


Homo sapiens 


LINE-1 like protein 


108 


46 


12636 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


116 


49 


12637 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


110 


52 


12638 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein ftagment 59. 


79 


70 


12639 


M29622 


Mus musculus 


open reading Same 2 


81 


60 


12640 


M26460 


Homo sapiens 


retinoblastoma 1 


100 


45 


12641 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


110 


63 


12642 


AB047936 


IVIacaca 
fascicularis 


hypothetical protein 


100 


52 


12643 


G03133 


Homo seipiens 


Human secreted protein, SEQ ID NO: 
7214. 


146 


41 


12644 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


143 


48 


12645 


AF090942 


Homo sapiens 


PRO0657 


157 


54 


12646 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 


130 


56 


12647 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


267 


65 


12648 


AF130079 


Homo sapiens 


PR02852 


109 


57 


12649 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


99 


61 


12650 


X55686 


Lycopersicon 


extensin (class IT) 


65 


35 
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Accession No. 


Species 


Description 














merman 












Score 








esculentum 








12651 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


84 








7795. 






12652 


AFl 19851 


Homo sapiens 


PR01722 


150 


70 


12653 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


133 


50 








gene 22 clone HMSJW18. 






12654 


Y45382 


Homo sapiens 


Human secreted protein fragment 


146 


70 








encoded from gene 28. 






12655 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


63 








4435. 






12656 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


52 








7870. 






12657 


U93563 


Homo sapiens 


putative pl50 


1046 


50 


12658 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


346 


70 








7870. 






12659 


Y12950 


Homo sapiens 


Amino acid sequence of a human 


104 


70 








secreted peptide. 






12660 


AF119851 


Homo sapiens 


PR01722 


98 


56 


12661 


S58722 


Homo sapiens 


X-linkedretinopafliy protein {C- 


90 


65 








terminal, clone XEH.8c} 






12662 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


59 


64 








7344. 






12663 


AFl 18082 


Homo sapiens 


PRO1902 


91 


66 


12664 


LI 0908 


Mus musculus 


Gcapl gene product 


113 


50 


12665 


Ml 8093 


Phaseolus 


hydroxyproline-rich glycoprotein 


124 


37 






vulgaris 








12666 


AF090895 


Homo sapiens 


PRODI 17 


76 


57 


12667 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


43 








7870. 






12668 


W48351 


Homo sapiens 


Human breast cancer related protein 


90 


61 








BCRB2. 






12669 


M15530 


Homo sapiens 


B-cell growtii factor 


90 


80 


12670 


AFl 98447 


Aspergillus 


60S ribosomal protein L3 


217 


69 






nidulans 








12671 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


51 








7871. 






12672 


¥207 17 


Homo sapiens 


Human neurofilament-M wild type 


77 


68 








protein fragment 59. 






12673 


AF055985 


Onchocerca 


pyrrolidone-rich antigen 


67 


40 






volvulus 








12674 


X79417 


Sus scrofa 


40S ribosomal protein S12 


377 


75 


12675 


AFl 07406 


Homo sapiens 


GW128 


74 


51 


12676 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


99 


62 








derived protein #153. 






12677 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


65 








6953. 






12678 


G00590 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


62 






4671. 






12679 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


85 








6065. 






12680 


AFl 94537 


Homo sapiens 


NAG 13 


142 


59 


12681 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


155 


77 


12682 


AF161356 


Homo sapiens 


HSPC093 


104 


50 


12683 


AFl 07406 


Homo sapiens 


GWI28 


84 


54 


12684 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


90 


39 


12685 


V40883_cdl 


Homo sapiens 


03-DEC-1997 Coding sequence of 


728 


100 








clone BG366 2. 






12686 


M24732 


Homo sapiens 


lamin-Iike protein 


94 


54 


12687 


X00824 


Gallus gaJlus 


collagen 


66 


42 
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NO: 


Accession No. 
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Smith- 
Waterman 


Wentit 


12688 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


184 


64 




AFl 18086 


Homo sapiens 


PRO 1992 






12690 


AL390114 


Leishmania 


probable (hliv-6) ul 102, variant a 
DNA, complete virion genome 


131 


54 


12691 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


180 


72 


12692 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


104 


72 


12693 


AF130089 


Homo sapiens 


PRO2550 






12694 


KOI 664 


Drosophila 
melanogaster 


Blim-like protein 


92 


52 

,,. 


12695 1 AL390114 

1 


Leislimauia 
major 


probable (hiiv-o) ul 102^ variant a 
DNA, complete virion genome 






12696 ; L39103 


Homo sapiens 


glycoprotein lb alpha 


61 


45 


12697 


M15530 


Homo sapiens 


B-cell growtli factor 






12698 


M36913 


Zea mays 


cell wall protein (put.); putative 


75 


40 


12699 


S58722 


Homo sapiens 


X-Iinlced retinopathy protein {C- 
terminal, clone XEH.Sc} 


101 


68 


12700 


M24732 


Homo sapiens 


lamin-like protein 


95 


35 


12701 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 




50 


12702 I AF298594 


Nicotiana alata 


arabinogalactan protein 


105 


30 


12703 I X92485 


Plasmodium 
vivax 


pval 


97 


38 


12704 J AF210651 


Homo sapiens 


NAG18 


89 


64 


12705 ! G03790 

i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


190 


53 


12706 j G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


78 


12707 U52077 


Homo sapiens 


mariner transposase 


451 


51 


12708 j AC003058 


Arabidopsis 
thaliana 


unknown protein 


244 


77 


12709 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6444. 


138 


65 


12710 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


82 


12711 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


104 


34 


12712 


AF242772 


Homo sapiens 


mesenchymal stem cell protein 
DSCD28 


106 


45 


12713 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 






12714 


AFI16715 


Homo sapiens 


PR02829 






12715 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


62 


12716 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


86 


54 


12717 


Z93891 


Hegeter politus 


cytochrome oxidase 


72 


48 


12718 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

-^J-— J : 


105 


46 


12719 


G00689 


~ -. 

Homo sapiens 


Human secreted protem, SEQ ID NO: 
4770. 






12720 


U30221 


Crithidia 
fasciculata 


NADH dehydrogenase subunit 5 


92 


39 


12721 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


131 


82 


12722 


AFl 07406 


Homo sapiens 


GW128 


72 


54 


12723 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


55 
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SEQID 


Accession No. 




Description 






NO: 








Waterman 
Score 


Identit 

y 








6720. 






12724 


AFl 16661 


Homo sapiens 


PRO 143 8 


145 


59 


12725 


Y53871 


Homo sapiens 


A human brain-derived signalling 


584 


98 








factor polypeptide. 






12726 


LI 0908 


Mus musculus 


Gcapl gene product 


95 


37 


viin 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


37 








8153. 






12728 


X55686 


Lycopersicon 


extensin (class II) 


66 


36 






esculentutn 








12729 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


55 








6065. 






12730 


L23852 


Homo sapiens 


Putative 3' end of coding region; 


83 


37 








putative 






12731 


Y13141 


Bromheadia 


extensin 


59 


48 






finlaysoniana 








12732 


GO 1 246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


69 








5327. 






12733 


AF090895 


Homo sapiens 


PRO0117 


130 


55 


12734 


R95913 


Homo sapiens 


Neural thread protein. 


162 


45 


12735 


U72355 


Homo sapiens 


Hsp27 ERE-TATA-binding protein 


972 


57 


12736 


AF247039 


porcine 


163R* 


108 


35 






adenovirus 3 








12737 


M22332 ^ 


Homo sapiens 


unknown protein 


148 


45 


12738 


AF090895 


Homo sapiens 


PRODI 17 


89 


72 


12739 


B08525 


Homo sapiens 


Protein encoded by a novel gene 


296 


86 








associated with colon disease. 






12740 


X90872 


Homo sapiens 


associated to Golgi apparatus 


116 


54 


12741 


M26460 


Homo sapiens 


retinoblastoma 1 


75 


37 


12742 


S52010 


Mussp. 


orfl 5' of EpoR 


104 


31 


12743 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


50 








6720. 






12744 


AB047600 


Macaca 


hypothetical protein 


111 


47 






fascicularis 








12745 


AF090944 


Homo sapiens 


PRO0663 


152 


65 


12746 


Y21106 


Homo sapiens 


Human bcl2 proto-oncogene vf ild type 


62 


64 








protein fragment 3. 






12747 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


76 








7871. 






12748 


X66861 


Mus musculus 


Hox-1.4 


92 


34 


12749 


AF166125 


Homo sapiens 


selenoprotein N 


978 


98 


12750 


W87504 


Homo sapiens 


Human N-methyi-D-aspartate receptor 


98 


29 








subunit encoded by clone NMDA24. 






12751 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


78 








6613. 






12752 


X72963 


Nicotiana 


pAP8 product 


80 


45 






tabacum 








12753 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


123 


61 








encoded by gene 17. 






12754 


AF146191 


Homo sapiens 


FRGl 


208 


85 


12755 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


75 








7795. 






12756 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


72 


48 








7871. 






12757 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


51 








7344. 






12758 


U83280 


Leislimania 


39 kDa antigen 


106 


60 






donovani 








12759 


AFl 16661 


Homo sapiens 


PR01438 


73 


48 


12760 


AF164615 


Homo sapiens 


envelope protein , 


257 


83 
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NO: 


AceessionNo. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


12761 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


135 


70 


12762 


AF073519 


Homo sapiens 


small EDRK-rich factor 1, long isoform 


111 


50 




M29622 


Mus nmsculus 


open reading frame 2 


73 


68 


12764 


L27428 


Homo sapiens 


reverse transcriptase 


111 


60 


12765 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


106 


45 




AK024455 


Homo sapiens 


FLJ00047 protein 


108 


66 


12767 


M36914 


Zea mays 


cell wall protein (put); putative 


78 


36 


12768 


L27428 


Homo sapiens 


reverse transcriptase 


159 


59 


12769 


AJ005567 


Mus musculus 


SPR2I protein 


55 


39 


12770 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6012. 


115 


82 


12771 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8144. 


76 


64 


12772 


AF217374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


126 


85 


12773 


AF130114 


Homo sapiens 


PR02459 


129 


50 


12774 


AF010144 


Homo sapiens 


neuronal tiiread protein AD7c-NTP 


220 


71 


12775 


AF287482 


Chlorobium 


Orfl22 


166 


68 


12776 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4658. 


84 


50 


12777 


AF130079 


Homo sapiens 


PR02852 


258 


58 


12778 


AF090931 


Homo sapiens 


PRO0483 


65 


76 


12779 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


152 


66 


12780 


AB047948 


Macaca 
fascicularis 


hypothetical protein 


59 


37 


12781 


AF 11 9900 


Homo sapiens 


PR02822 


132 


59 


12782 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 


97 


51 


12783 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


77 


77 


12784 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


93 


84 


12785 


L27428 


Homo sapiens 


reverse transcriptase 


183 


39 


12786 


AF11985] 


Homo sapiens 


PRO 1722 


128 


61 


12787 


AF116715 


Homo sapiens 


PR02829 


111 


72 


12788 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


77 


12789 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


84 


66 


12790 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 


155 


71 


12791 


AF124729 


Mus musculus 


acinusS' 


122 


50 


12792 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


86 


52 


12793 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


108 


74 


12794 


M62415 


Pseudopleuronec 


HPLC6 


82 


35 


12795 


Y36156 


Homo sapiens 


Human secreted protein #28. 


133 


48 


12796 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


74 


12797 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


170 


32 


12798 


G03790 


Homo sapiens 


Human secreted protein, SBQ ID NO: 
7871. 


114 


43 


12799 


Y76184 


Homo sapiens 


Human secreted protein encoded by 


90 


60 



477 



wo 01/64835 PCT/llSOl/04927 



SEQID 
NO: 
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Waterman 

Score 


Identit 

y 








gene 61. 






12800 


X62677 


Oryotolagus 
cuniciilus 


retrovirus related reverse transcriptase 


102 


38 


12801 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


98 


69 


12802 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID KO: 
6613. 


177 


69 


12803 


AF130089 


Homo sapiens 


PRO2550 


94 


74 


12804 


S77772 


Homo sapiens 


aspartylglucosaminidase, AGA {C- 
tenninal, alternatively spliced} {EC 
3.5.1.26} 


83 


68 


12805 


AF063866 


Melanoplus 
sanguinipes 
entomopoxvinis 


ORF MSV233 hypothetical protein 


82 


28 


12806 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


80 


60 


12807 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4414. 


122 


47 


12808 


X03145 


Homo sapiens 


pot. ORF V 


150 


34 


12809 


LI 0908 


Mus musculus 


Gcapl gene product 


111 


34 


12810 


Y95028 


Homo sapiens 


Human clone vp7 10RF2,SEQID 
NO: 128. 


170 


55 


12811 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


101 


62 


12812 


AJ233591 


Mus musculus 


reverse transcriptase 


287 


67 


12813 


U82303 


Homo sapiens 


unknown 


149 


52 


12814 


AF107406 


Homo sapiens 


GW128 


103 


58 


12815 


U82303 


Homo sapiens 


unknown 


93 


83 


12816 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


161 


54 


12817 


B01390 


Homo sapiens 


Neuron-associated protein. 


81 


38 


12818 


AF287482 


Chlorobium 
tepidum 


Orfl22 


174 


69 


12819 


AE003499 


Drosophila 
melanogaster 


CO 12706 gene product 


166 


28 


12820 


Ml 5530 


Homo sapiens 


B-coU growth factor 


124 


71 


12821 


U83303 




line-1 reverse transcriptase 


161 


53 


12822 


G 024 93 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574, 


150 


77 


12823 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4578. 


103 


66 


12824 


AF064597 


Homo sapiens 


LINE-1 like protein 


64 


45 


12825 


X71442 


Rattus 
norvegicus 


ORF I; putative 


113 


45 


12826 


U62039 


Elephantulus 
edwardii 


reverse transcriptase 


74 


46 


12827 


M15530 


Homo sapiens 


B-cell growth factor 


127 


54 


12828 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


94 


84 


12829 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


59 


57 


12830 


AF229067 


Homo sapiens 


PADI-H protein 


184 


61 


12831 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


69 


34 


12832 


AF 1440 54 


Homo sapiens 


apoptosis related protein APR-4 


83 


46 


12833 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


168 


61 


12834 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


142 


40 


12835 


U 19098 


Lycopersicon 
chilense 


proline-rich protein 


81 


38 


12836 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


229 


68 
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SEQ ID 
NO: 








Waterman 


Wentit 

y 


12837 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


81 


60 


12838 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


88 


71 


12839 


X72963 


Nicotiana 
tabacum 


pAP8 product 


94 


40 


12840 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


315 


56 


12841 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


94 


55 


12842 


AK025947 


Homo sapiens 


unnamed protein product 


124 


57 


12843 


AF159055 


Homo sapiens 


leucine zipper-like protein 


69 


55 


12844 


AF040257 


Homo sapiens 


TNF receptor homolog 


98 


50 


12845 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


94 


76 


12846 1 X88799 


Oryza sativa 


DNA binding protein 


94 


37 


12847 


AL359782 


Irypanosoma 
brucei 


possible (hliv-6} ul 102, variant a dna, 
complete virion genome. 


98 


48 


12848 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


164 


56 


12849 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


75 


59 


12850 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


122 


54 


12851 [ M31964 


Saimiriine 
heqsesvirus 2 


collagen-like protein 


79 


50 


12852 AF 118082 


Homo sapiens 


PRO 1 902 


94 


77 


12853 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


139 


54 


12854 


R59842 


Homo sapiens 


ApoE4Ll protease. 


108 


82 


12855 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


99 


66 


12856 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 


89 


56 


12857 


D29833 


Homo sapiens 


proline rich peptide P-B 


64 


52 


12858 


G03714 


Homo sapiens 


Human secreted protein, SRQ ID NO: 
7795. 


155 


80 


12859 


G024S2 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


88 


61 


12860 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


66 


33 


12861 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


75 


47 


12862 


M29622 


Mus musculus 


open reading frame 2 


74 


62 


12863 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


141 


70 


12864 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


117 


54 


12865 


AF090894 


Homo sapiens 


PRO0113 


108 


63 


12866 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


121 


54 


12867 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Bc} 


103 


44 


12868 


S68106 


Ascaris suum, 
Peptide Partial, 
100 aa 


type IV collagen alpha 2 chain, alpha 2 
(IV) {alternatively spliced} 


75 


35 


12869 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


97 


56 


12870 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


146 


62 
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SEQ ID 
NO: 






Description 


Waterman 
Score 


Identit 


12871 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4493. 


78 


75 


12872 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


105 


56 


12873 


AB030234 


Canis familiaris 


D4 dopamine receptor 


61 


58 


12874 


M22332 


Homo sapiens 


unknown protein 


128 


41 


12875 


U05313 


Trypanosoma 
brucei 


CR3 


67 


48 


12876 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


65 


12877 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


66 


54 


12878 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


78 


34 


12879 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


122 


51 


12880 


AF130079 


Homo sapiens 


PR02852 


125 


75 


12881 

12882 


AF090895 


Homo sapiens 


PRO0117 


142 


59 


AF181251 


Rattus 


lung Kruppel-liice factor 


93 


39 
74 


12883 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613, 


167 


12884 


M26460 


Homo sapiens 


retinoblastoma 1 


136 


41 


12885 


AJ277557 


Homo sapiens 


mitochondiial 5 (3')- 
deoxyribonucleotidase (dNT-2) 


280 


100 


12886 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


119 


62 


12887 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


43 


12888 


API 19851 


Homo sapiens 


PRO 1722 


94 


50 


12889 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


121 


39 


12890 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 


68 


12891 


AK024455 


Homo sapiens 


FLJ00047 protein 


77 


51 


12892 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


115 


83 


12893 


L27428 


Homo sapiens 


reverse transcriptase 


89 


29 


12894 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 
WO990I020. 


112 


72 


12895 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


72 


62 


12896 


AK023563 


Homo sapiens 


unnamed protein product 


260 


64 


12897 


G02832 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
6913. 


76 


66 


12898 


AC002304 


Arabidopsis 
thaliana 


F14J16.29 


111 


36 


12899 


Y86445 


Homo sapiens 


Human gene 42-encoded protein 
fragment, SEQ ID NO:360. 


83 


53 


12900 


AFl 16638 


Homo sapiens 


PRO 1546 


123 


57 


12901 


AB010361 


Mus musculus 


mszf47 


64 


43 


12902 


R59842 


Homo sapiens 


ApoB4Ll protease. 


130 


86 


12903 


L06498 


Homo sapiens 


ribosomal protein S20 


249 


56 


12904 


LI3635 


Rattus 
norvegicus 


growth response protein 


176 


69 


12905 


AFl 56961 


Homo sapiens 


gag 


183 


44 


12906 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


207 


67 


12907 


AFl 07406 


Homo sapiens 


GW128 


99 


50 


12908 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7881. 


93 


75 
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Species 


Description 


Smitli- 
Watcrmaji 


Identit 


12909 


AF263540 


Homo sapiens 


MIB006 


1809 


100 


12910 


Y48292 


Homo sapiens 


Human prostate cancer-associated 
protein 78. 


60 


50 


12911 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7339. 


179 


87 


12912 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


108 


72 


12913 


Z72499 


Homo sapiens 


herpesvirus associated ubiquitin- 
specific protease (HAUSP) 


5242 


99 


12914 


AF092I36 


Homo sapiens 


PTD015 


297 


100 


12915 


AF161356 


Homo sapiens 


HSPC093 


123 


42 


12916 


AF044670 


Homo sapiens 


33 kDa Vamp-associated protein; VAP- 

33 


506 


90 


12917 


W27087 


Homo sapiens 


Human transforming growth factor 
alpha mil. 


1243 


100 


12918 


Y59807 


Homo sapiens 


Human normal ovarian tissue derived 
protein 84. 


111 


43 


12919 


A.I388518 


Canis familiaris 


non-histone chromosomal protein 
HMG-17 


108 


84 


12920 


AIC023392 


Homo sapiens 


unnamed protein product 


119 


60 


12921 


Y02671 


Homo sapiens 
Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


104 


87 


12922 


S58722 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


124 


81 


12923 


X78444 cdl 


Homo sapiens 


2 1 -MAY- 1997 Human UCSP-2 cDNA. 


1090 


100 


12924 


API 16661 


Homo sapiens 


PRO 143 8 


89 


52 


12925 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4488. 


82 


80 


12926 


API 19855 1 


Homo sapiens 


PRO 1847 


120 


63 


12927 


AP090931 


Homo sapiens 


PRO0483 


92 


90 


12928 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7881. 


119 


88 


12929 


Y91429 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 21 SEQ ID NO: 150. 


108 


38 


12930 


Y12661 


Homo sapiens 


neuro-endocrine specific protein VGF 


2506 


99 


12931 


Y84546 


Homo sapiens 


Amino acid sequence of a fragment of 
human collagen type 1 protein. 


61 


50 


12932 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 


121 


53 


12933 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


104 


52 


12934 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


104 


60 


12935 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 


331 


77 


12936 


M15530 


Homo sapiens 


B-cell growth factor 


83 


71 


12937 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


64 


12938 


R59842 


Homo sapiens 


ApoE4Ll protease. 


112 


61 


12939 


AF090895 


Homo sapiens 


PRO0117 


94 


80 


12940 


X73459 


Homo sapiens 


signal recognition particle subunit 14 


549 


98 






Homo sapiens 




141 


45 


12942 


AF054178 


Homo sapiens 


CI-B 14.5a homolog 


238 


84 


12943 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


110 


55 


12944 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7868. 


143 


55 


12945 


D00570 


Mus musculus 


open reading frame (251 AA) 


166 


34 


12946 


U62039 


Elephantulus 


reverse transcriptase 


63 


50 
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SEQID 


Accession No. — 


Species 


Description 


Smitli- 





NO: 








Waterman 
Score 


Identit 

y 






edwardii 








12947 


AE003499 


Drosophila 


CGI 2706 gene product 


104 


31 






melanogaster 








12948 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


65 


100 









7795. 






12949 


X70343 


Nicotiana 


extensin 


102 


j8 






sylvestris 








12950 


G025j2 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


45 








6613. 






12951 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


85 


56 








7870. 






12952 


AF090895 


Homo sapiens 


PROG 117 


139 


68 


12953 


AF090896 


Homo sapiens 


PRO0131 


93 


85 


12954 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


75 








5327. 






12955 


M96982 


Homo sapiens 


U2 snRNP auxiliary factor small 


292 


68 








subunit 






12956 


X71087 


Homo sapiens 


monocyte chemoattractant protein 


408 


96 








(MCP-3) 






12957 


S80864 


Homo sapiens 


cytochrome c-like polypeptide 


591 


68 


12958 


AF 109907 


Homo sapiens 


S164 


188 


63 


12959 


W48351 


Homo sapiens 


Human breast cancer related protein 


173 


50 








BCRB2. 






12960 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


62 








7868. 






12961 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


56 








7882. 






12962 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


106 


84 


12963 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


44 








6613. 






12964 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


107 


57 








sequence. 






12965 


P70494 


Homo sapiens 


Sequence of liuman B-cell growth 


71 


65 








factor (BCGF). 






12966 


AC009853 


Arabidopsis 
thaliana 


hypothetical protein 


102 


44 


12967 


AF090942 


Homo sapiens 


PRO0657 


99 


62 


12968 


M15530 


Homo sapiens 


B-cell growth factor 


90 


65 


12969 


AF090895 


Homo sapiens 


PROOl 17 


83 


54 


12970 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


54 








7868. 






12971 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


57 








7519. 






12972 


L32558 


Homo sapiens 


sequence is e3q)ressed in human Tera-2 


426 


85 








clone 13 (embryonal carcinoma) cells. 












The sequence may contain mismatches 












(one strand sequenced only once). 97% 












identical in 320 bp overlap with human 












54 kDA prot; ORF 






12973 


AF008 196 


Homo sapiens 


bax epsilon 


78 


87 


12974 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 


96 


80 








protein fragment 59. 






12975 


AL031848 


Homo sapiens 


d.T202O8.2.2 (novel rodent HES2 (hairy 


176 


100 








and Enhancer of Split 2) LIKE protein 












(isoform 2)) 






12976 


AJ245905 


Oilorocebus 
aethiops 


HSBPl -like protein 


94 


100 


12977 


AF067519 


Homo sapiens 


PITSLRE protein Idnase beta SVl 


1923 


93 








isoform 
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SEQID 
NO: 


Accession No. 


species 


Description 


Waterman 


Iricntrt 


12978 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


168 


73 


12979 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


105 


39 


12980 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


53 


36 


12981 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


87 


77 


12982 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


83 


89 


12983 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


92 


46 


12984 


AF113685 


Homo sapiens 


PRO0974 


111 


67 


12985 


U31086 


Gallus gallus 


neuron-glia adliesion molecule 


52 


50 


12986 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


132 


68 


12987 


G00352 


Homo sapiens 


Human secreted protem, SHQ ID NO: 
4433. 


91 


71 


12988 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


135 


81 


12989 


AB029042 


Homo sapiens 


ATPase inhibitor precursor 


364 


100 


12990 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4523. 


174 


72 


12991 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

47!8 


174 


57 


~12992" 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

4473, 


158 


57 


12993 


AFl 18082 


Homo sapiens 


PRO 1902 


77 


78 


12994 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


76 


60 


12995 


MI5530 


Homo sapiens 


B-cell growth factor 


105 


76 


12996 


AJ005562 


IVfus musculus 


SPR2D protein 


104 


36 


12997 


Y48346 


Homo sapiens 


Human prostate cancer-associated 

protein 43. 


184 


77 


12998 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


82 


53 


12999 


L11645 


Homo sapiens 


alpha-tubulin 


196 


68 


13000 


AF113685 


Homo sapiens 


PRO0974 


123 


52 


13001 


S58722 


Homo sapiens 


X-)inked retinopathy protem {C- 
terminal, clone XEH.Sc} 


78 


53 


13002 


AJ005564 


Mus musculus 


SPK2F protein 


76 


44 


13003 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


101 


76 


13004 


AL159178 


Streptomyces 
coelicolor A3(2) 


putative secreted protein 


89 


37 


13005 


L77967 


Ovis aries 


sniail proiine-rich protein with paired 


64 


36 


13006 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7324. 


85 


70 


13007 


AF068294 


Homo sapiens 


HDCMB45P 


237 


52 


13008 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


128 


50 


13009 


AFl 44054 


Homo sapiens 


apoptosis related protein APR-4 


100 


75 


13010 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


148 


67 


13011 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


152 


66 


13012 


B01390 


Homo sapiens 


Neuron-associated protein. 


99 


32 


13033 


A27282 


Homo sapiens 


TGR-CL3C 


69 


42 


13014 


G00689 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


135 


46 
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SEQID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 


Identit 










4770. 








AF130089 


Homo sapiens 


PRO2550 






13016 
. 


AF266223 


Gillichthys 
mirabilis 


ribosomal protein L27 


122 


50 






Mus musculus 


nuclear localization signal binding 
protein 








U82303 


Homo sapiens 


unknown 






13019 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


70 


57 


13020 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

8159. 


169 


91 


13021 


V004S8 


Homo sapiens 


alplia globin 






13022 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


59 


13023 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


86 


13024 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7553. 


151 


84 


13025 
13026 


U82303 


Homo sapiens 


unknown 


86 


52 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


124 


61 


13027 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

4770. 


83 


50 


13028 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


74 


63 


13029 
"l3030~ 


Y84546 


Homo sapiens 


Amino acid sequence of a fragment of 
human collagen type 1 protein. 


59 


46 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


96 


68 


13031 


S71494 


IMus sp. 


SmD homolog {Gly-Arg repeat} 




55 


13032 


AFl 13685 


Homo sapiens 


PRO0974 


100 


48 


13033 


Dj8112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


242 




13034 


AL3901 14 


Leishmania 


extremely cysteine/valine ricli protein 


178 


41 


13035 


G01129 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


228 


81 


13036 


X66285 


Mus musculus 


HCl ORF 


103 


44 


1^037 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 




55 




U47924 


Homo sapiens 


RPL13-2 






13039 


AF090942 


Homo sapiens 


PRO0657 


111 


44 




AK025116 


Homo sapiens 


unnamed protein product 






13041 



1--042 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4484. 


92 


72 


- 

Ml 17^9 




cell wall hydroxyproline-rich 
glycoprotein 


100 


69 


13043 


AF130089 


Homo sapiens | PRO2550 


109 


45 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
6960. 






13045 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


162 


70 


13046 


L10908 


Mus musculus 


Gcapl gene product 


162 


40 






Homo sapiens 


glutathione S-transferase GSTM5-5 






13048 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


89 


60 


13049 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7075. 


153 


60 


13050 


U33547 


Homo sapiens 


MHC class II antigen 


69 


85 


13051 


AC006014 


Homo sapiens 


similar to mismatch repair proteins; 
similar to PID:gl304125 


652 


91 
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Species 


Description 






NO: 








Waterman 


Identit 










Score 


y 


13052 


A27282 


Homo sapiens 


TGR-CL3C 


65 


51 


13053 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 


106 


60 








transferase protein. 






13054 


M20030 


Homo sapiens 


small proline rich protein 


57 


33 


13055 


AF0221I7 


Balaena 


metallothionein 


119 


62 






mysticetus 








13056 


Ml 1759 


Lycopersicon 


cell wall hydroxyproline-rich 


78 


43 






esculentum 


glycoprotein 






13057 


R20305 


Homo sapiens 


Val(l)to Met, His(2) deleted, Ala(76) 


274 


98 








to Lys beta-globin mutant. 






13058 


X55686 


Lycopersicon 


extensin (class II) 


56 


39 






esculentum 








13059 


U10696 


Zeamays 


Ec metallothionein class 11 protein 


66 


26 


13060 


Y40417 


Homo sapiens 


A human N-acetylneuraminate lyase 


1183 


80 








(hNANL) protein. 






13061 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


41 








8153. 






13062 


AF002210 


Homo sapiens 


copper chaperone for superoxide 
dismutase 


702 


61 


13063 


AFl 18086 


Homo sapiens 


PRO 1992 


69 


84 


13064 


M24097 


Homo sapiens 


MHC HLA-C-alpha-2 chain 


1550 


96 


13065 


AC003113 


Arabidopsis 


F2401.6 


51 


40 






thaliana 








13066 


AF161531 


Homo sapiens 


HSPC046 


488 


97 


13067 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


50 








6619. 






13068 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


116 


46 






brucei 


complete virion genome. 






13069 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


145 


69 








gene No. 107. 






13070 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 


224 


71 








ID NO: 103. 






13071 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


52 








6913. 






13072 


AF2 16965 


Homo sapiens 


ancient conserved domain protein 3 


876 


100 


13073 


Y15917 


Homo sapiens 


COLlAl and PDGFB fusion transcript 


108 


35 


13074 


X77664 


Homo sapiens 


retinoic acid receptor bete isoforra 1 


53 


71 


13075 


AFl 56961 


Homo sapiens 


gag 


136 


37 


13076 


R59842 


Homo sapiens 


ApoE4Ll protease. 


117 


75 


13077 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


65 








7870. 






13078 


U23183 


Caenorhabditis 


gene lies in inverted repeat and exon 1 


100 


79 






elegans 


overlaps tRNA; may be pseudogene 






13079 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


56 








7795. 






13080 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


65 








6619. 






13081 


Y 12473 


Homo sapiens 


centrin 


359 


79 


13082 


G01518 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


375 


98 








5599. 






13083 


Y57891 


Homo sapiens 


Human transmembrane protein 


1030 


99 








HTMPN-I5. 






13084 


AB046765 


Homo sapiens 


KIAA1545 protein 


161 


93 


13085 


X55686 


Lycopersicon 


extensin (class II) 


66 


40 






esculentum 








13086 


AFl 3 0079 


Homo sapiens 


PR02852 


137 


45 


13087 


KOI 664 


Drosophila 


Bkm-like protein 


86 


33 






melanogaster 








13088 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


44 
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SEQID 
NO: 


Accession NO. 


specie. 


Oesenption 


Score 


% 

Identit 
y 








7893. 






13089 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


114 


63 


13090 


AL121905 


Homo sapiens 


dJ534B8.3 (novel protein similar to an 
aspartic protease) 


963 


100 


,13091 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7867. 


95 


47 


13092 


AB030236 


Canis familiaris 


D4 dopamine receptor 


65 


38 


13093 


AF161356 


Homo sapiens 


HSPC093 


119 


48 


13094 


Z14014 


Nicotiaiia 
tabacum 


Pistil extensin like protein, partial CDS 

only 


67 


50 


13095 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5330. 


124 


56 


13096 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


73 


13098 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


62 


13099 


AF 130051 


Homo sapiens 


PRO0898 


194 


61 


13100 


Y86445 


Homo sapiens 


Human gene 42-encoded protein 
fragment, SEQ ID NO:360. 


80 


70 


13101 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


111 


43 


13102 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


71 


13103 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4414. 


116 


63 


13104 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


90 


64 


13105 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


176 


56 


13106 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


101 


75 


13107 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


100 


65 


13108 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


137 


77 


13109 


AL390114 


Leishmania 
major 


probable (hhv-6) u 1 102, variant a 
DNA, complete virion genome 


94 


88 


13110 


AFl 19855 


Homo sapiens 


PR01847 


103 


49 


13111 


AFl 19851 


Homo sapiens 


PRO 1722 


142 


80 


13112 


¥003 11 


Homo sapiens 


Human secreted protein encoded by 
gene 54. 


107 


51 


13113 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


104 


54 


13114 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


144 


71 


13115 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7437. 


138 


50 


13116 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


110 


56 


13117 


AK025653 


Homo sapiens 


unnamed protein product 


188 


100 


13118 


AFl 30089 


Homo sapiens 


PRO2550 


129 


68 


13119 


AF161356 


Homo sapiens 


HSPC093 


131 


59 


13120 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7879- 


121 


45 


13121 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


82 


55 


13122 j G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


45 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 
Score 


Identit 








- 








j94 


Homo, sapiens 


Human secreted protem encoded by 




213 


100 






... 


gene No. 9. 







1 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


55 








. .... ... 








AFl 19855 


Homo sapiens 


PRO 1847 








R95913 


Homo sapiens 


Neural tliread protein. 


1j7 


47 


13127 


B01390 


Homo sapiens 


Neuron-associated protein. 


101 


47 


13128 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


122 


67 


13129 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


52 








7344. 






13130 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


72 








6613. 






13131 




Homo sapiens 


NADH dehydrogenase subunit 3 


500 


90 


13132 


AFl 38957 


Bos taurus 


type II collogen cyanogen bromide 


99 


38 








■fragment CBS 






^ 1 

! 


— 

AF084256 


Homo sapiens 


beta glucuronidase isoform d 


157 


69 




U92817 


Homo sapiens 


unnamed HERV-H protein 


113 


48 


131j5 


Y19743 


Homo sapiens 


SEQ ID NO 461 from W09922243. 


967 


99 


13136 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


52 








7492. 






13137 


AF090928 


Homo sapiens 


PRO0470 


133 


69 


13138 


AF09093 1 


Homo sapiens 


PRO0483 


150 


60 


13139 


AF116661 


Homo sapiens 


PR01438 


146 


57 


13140 


AL022318 


Homo sapiens 


bK150C2.2 (Phorbolin3) 


443 


56 


13141 


S79410 


Mus musculus 


nuclear localization signal binding 


114 


80 








protein 






13142 


AF055985 


Onchocerca 


pyrrolidone-rich antigen 


86 


43 






volvulus 








13143 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


62 








7795. 






13144 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


66 








6619. 






13145 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


71 








7795. 






13146 


AFl 07406 


Homo sapiens 


GW128 


120 


50 


13147 


LI 0908 


Mus musculus 


Gcapl gene product 


91 


42 


13148 


AP000616 


Oryza sativa 


similar to RING-II2 finger protein 


113 


66 








RHAla(AF078683) 






13149 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


119 


67 








terminal, clone XEH.Sc} 






13150 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


168 


86 








gene No. 107. 






13151 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


74 


58 








4499. 






13152 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


61 








4478. 






13153 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


131 


67 








clone HCACJ81. 






13154 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


54 




- 




6953. 








AP000616 


Oryza sativa 


similar to RING-H2 finger protein 












RHAla(AF078683) 






13156 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


167 


80 








gene 22 cloneHMSJW18. 






13157 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


125 


68 








encoded by gene 17. 






13158 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


514 


71 


13159 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


130 


66 
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SEQID 
NO: 


Accession NO. 


species 


Description 


Smith- 
Score 


Identit 

y 








SEQID NO: 163. 






13160 


AFl 16661 


Homo sapiens 


PR01438 


152 


63 


13161 


AB046048 


Macaca 
fascicular is 


unnamed portein product 


105 


53 


13162 


AFl 07406 


Homo sapiens 


GW128 


159 


62 


13163 


Y02999 


Homo sapiens 


Fragment of human secreted protein 

encoded by gene 121. 


132 


59 


13164 


M15530 


Homo sapiens 


B-cell growth factor 


97 


80 


13165 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


67 


36 


13166 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


126 


57 


13167 


AL359782 


Trj-panosoma 


probable similar to ring-h2 finger 
protein rhal a. 


95 


53 


13168 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4488. 


96 


57 


13169 


AL049795 


Homo sapiens 


dJ622L5.7.2 (novel protein (isoform 2)) 


250 


95 


13170 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4447. 


116 


56 


13171 


LI 0908 


Mus musculus 


Gcapl gene product 


75 


53 


13172 


S73853 


Homo sapiens 


NF2=neurofibromatosis type 2 
{alternatively spliced, form A4} 


160 


73 


13173 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


134 


76 


13174 


A52568 


Homo sapiens 


HMGI-C 


84 


44 


13175 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


91 


49 


13176 


AK025116 


Homo sapiens 


unnamed protein product 


130 


44 


13177 


AF090901 


Homo sapiens 


PRO0195 


103 


84 


13178 


AF090894 


Homo sapiens 


PROG 113 


92 


62 


13179 


AFl 30079 


Homo sapiens 


PR02852 


118 


38 


13180 


AK025047 


Homo sapiens 


unnamed protein product 


141 


67 


13181 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


87 


59 


13182 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


220 


74 


13183 


S79980 


Bos taurus 


ribosomal protein L37 


186 


94 


13184 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


113 


90 


13185 


AF155232 


Pisum sativum 


extensin 


95 


36 


13187 


U16359 


Rattus 
norvegicus 


niti ic oxide synthase 


90 


78 


13188 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


113 


79 


13189 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


158 


53 


13190 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


130 


59 


13191 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


131 


66 


13192 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


439 


86 


13193 


Z66499 


Caenorhabditis 
elegans 


T01B7.8 


127 


35 


13194 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870. 


126 


61 


13195 


Y13141 


Bromheadia 
fmlaysoniana 


extensin 


58 


43 


13196 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


95 


76 


13197 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


116 


60 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Waterman 


Identit 

y 


13198 


X58236 


Homo sapiens 


36/8-8 fusion protein with epitope for 
anti-lectin antibody 


98 


41 


13199 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ8]. 


102 


58 


13200 


AF220264 


Homo sapiens 


MOST-1 


141 


45 


13201 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 


112 


58 


13202 


Y17221 


Homo sapiens 


Human secreted protein (clone fk317- 

3). 


115 


57 


13203 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


91 


39 


13204 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


65 


13205 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


98 


68 


13206 


M15530 


Homo sapiens 


B-cell growth factor 


105 


76 


13207 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


84 


45 


13208 


L27428 


Homo sapiens 


reverse transcriptase 


205 


49 


13209 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XBH.Sc} 


77 


57 


•13210 


MB 1321 


Macaca 
fascicularis 


proline-rich protein 


104 


4S 


13211 


X55685 


Lycopersicon 
esculentiim 


extensin (class I) 


122 


33 


13212 


Ml 5530 


Homo sapiens 


B-cell growtfi factor 


159 


61 


13213 


A>- 13 0089 


Homo sapiens 


PRO2550 


126 


56 


13214 


AF2 10651 


Homo sapiens 


NAG18 


125 


68 


13215 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


114 


76 


13216 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327. 


157 


70 


13217 


R59842 


Homo sapiens 


ApoE4Ll protease. 


102 


72 


13218 


KOI 664 


Drosophila 
rnelanogaster 


Bkm-Iilce protein 


123 


37 


13219 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 5 1 clone HUKEX85. 


101 


55 


13220 


D14167 


Gallus gallus 


ribosomal protein L37a 


126 


40 


13221 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


178 


82 


13222 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7344. 


141 


55 


13223 


L10908 


Mils musculus 


Gcapl gene product 


114 


53 


13224 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


216 


73 


13225 


G03259 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7340. 


104 


76 


13226 


AF130079 


Homo sapiens 


PR02852 


158 


66 


13227 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


125 


60 


13228 


S77772 


Homo sapiens 


aspartylglucosaminidase, AGA {C- 
terminal, alternatively spliced} {EC 
3.5.1.26} 


100 


46 


13229 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


72 


13230 


M15530 


Homo sapiens 


B-cell growth factor 


162 


72 


13231 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


161 


66 


13232 


AL160371 


Leishmania 
major 


probable (lihv-6) ull02, variant a 
DNA, complete virion genome 


SO 


52 
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Species 


Description 






NO: 








Waterman 
Score 


Identit 

y 


13233 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


51 








7871. 






13234 


AF126163 


Homo sapiens 


HHLA3 protein 


131 


78 


13235 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


125 


85 








encoded by gene 90. 






13236 


AB001684 


Chlorella 


ORF49b 


55 


45 






vulgaris 








13237 


M15530 


Homo sapiens 


B-cell growth factor 


114 


66 


13238 


AL355929 


Neurospora 
crassa 


conserved hypothetical protein 


99 


45 


13239 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


114 


63 








RHAla(AF078683) 






13240 


S79410 


Mus musculus 


nuclear localization signal binding 


110 


40 








protein 






13241 


AFl 19851 


Homo sapiens 


PR01722 


167 


63 


13242 


S79410 


Mus musculus 


nuclear localization signal binding 


147 


68 








protein 






13243 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 


95 


65 






elegans 


RD protein (nuclear ma-binding 












protein) 






13244 


W48351 


Homo sapiens 


Human breast cancer related protein 


99 


84 








BCRB2. 






13245 


Y14437 


Homo sapiens 


Human secreted protein encoded by 


104 


62 








gene 27 clone HSAWA27. 






13246 


AF161356 


Homo sapiens 


HSPC093 


116 


67 


13247 


X61046 


Hydra sp. 


mini-collagen 


108 


49 


13248 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


458 


89 


13249 


Ml 1900 


Mus musculus 


15-kDa proline-rich salivary protein 


92 


37 


13250 


AB017362 


Bombyx mori 


fibroin H-chain 


70 


43 


13251 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


58 








6613. 






13252 


G00360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


58 








4441. 






13253 


AF2 10651 


Homo sapiens 


NAG 18 


151 


73 


13254 


AF224494 


Mus musculus 


arsenite mducible RNA associated 


299 


50 








protein 






13255 


AK026107 


Homo sapiens 


unnamed protein product 


102 


60 


13256 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


113 


61 








gene 22 clone HMSJW18. 






13257 


AK024455 


Homo sapiens 


FLJ00047 protein 


115 


59 


13258 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


155 


51 








gene No. 107. 






13259 


AJ251579 


Arabidopsis 


cef protein 


115 


39 






thaliana 








13260 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


360 


82 


13261 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


61 








6953. 






13262 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


413 


87 


13263 


D23661 


Homo sapiens 


ribosomal protein L37 


487 


94 


13264 


AF130079 


Homo sapiens 


PR02852 


153 


43 


13265 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


109 


61 








encoded by gene 90. 






13266 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 


133 


79 








WO9901020. 






13267 


L10908 


Mus musculus 


Gcapl gene product 


103 


43 


13268 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


70 








6613. 






13269 


.G04064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


51 








8145. 
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SEQID 
NO: 




Species 


Description 


Waterman 


Identit 


13270 


AFl 30089 


Homo sapiens 


PRO2550 


95 


47 


13271 


AFl 30075 


Homo sapiens 


PR02532 


90 


60 


13272 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


103 


54 


13273 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4484. 


77 


91 


13274 


AFl 30089 


Homo sapiens 


PRO2550 


104 


57 


13275 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6720. 


94 


45 


13276 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6834. 


122 


52 


13277 


AFl 13685 


Homo sapiens 


PRO0974 


98 


36 


13278 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


85 


68 


13279 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6596. 


139 


89 


13280 


G01129 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5210. 


243 


79 


13281 


AFl 16715 


Homo sapiens 


PR02829 


108 


56 


13282 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


105 


60 


13283 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


194 


78 


13284 


AL3 59782 


Trypanosoma 
brucei 


probable gianule cell antiserum 
positive 8 


104 


47 


13285 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


104 


72 


13286 


AF130089 


Homo sapiens 


PRO2550 


147 


58 


13287 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


133 


58 


13288 


AFl 19855 


Homo sapiens 


PR01847 


115 


45 


13289 


AF161356 


Homo sapiens 


HSPC093 


131 


50 


13290 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


104 


43 


13291 


KOI 664 


Diosophila 

melanogaster 


Bkm-like protein 


112 


49 


13292 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4458. 


150 


69 


13293 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


284 


76 


13294 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


444 


85 


13295 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


533 


93 


13296 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


164 


76 


13297 


AL3 59782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


147 


80 


13298 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


84 


38 


13299 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


139 


49 


13300 


AF130079 


Homo sapiens 


PR02852 


98 


60 


13301 


AF090894 


Homo sapiens 


PRO0113 


114 


52 


13302 


AF130079 


Homo sapiens 


PR02852 


161 


83 


13303 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


154 


76 


13304 


AFl 18086 


Homo sapiens 


PR01992 


120 


75 


13305 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


100 


73 


13306 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


108 


56 
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Accession No. 


Species 


Description 






NO: 








Waterman 


Identit 










Score 


y 


13307 


Y74110 


Homo sapiens 


Human prostate tumor EST fragment 


448 


100 








derived protein #297. 






13308 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


85 


65 








4669. 






13309 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


47 








7874. 






13310 


AFl 16715 


Homo sapiens 


PR02829 


124 


76 


1331 1 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


63 








5327. 






13312 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


61 








695 J. 






13313 


AFl 16661 


Homo sapiens 


PRO 143 8 


142 


67 


13j14 


AF161356 


Homo sapiens 


HSPC093 


121 


46 


13315 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


169 


52 








6613. 






13316 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


98 


90 








SEQ ID NO: 163. 






13317 


AF130079 


Homo sapiens 


PR02852 


109 


58 


13318 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 13 


80 








6012. 






13319 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


81 


61 








encoded by gene 121. 








AF130089 


Homo sapiens 


PRO2550 


106 


77 


13321 


AF09093 1 


Homo sapiens 


PRO0483 


143 


63 




D29833 


Homo sapiens 


proline rich peptide P-B 


59 


33 


1j>323 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


417 


78 


13324 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


1 19 


82 








RHAla(AF078683) 






13325 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


168 


59 








gene 22 clone HMSJW18. 






13326 


A3 1038 


Nicotiana alata 


PRP3 


88 


41 


13327 


AF090895 


Homo sapiens 


PROG 11 7 


87 


55 


1j328 


Y0269j 


Homo sapiens 


Human secreted protein encoded by 


88 










gene 44 clone HTDAD22. 






13329 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


65 








7878. 






13330 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


101 


67 






major 


DNA, complete virion genome 






13331 


AF090930 


Homo sapiens 


PRO0478 


93 


36 


13332_ 


AF1590S5 


Homo sapiens 


leucine zipper-like protein 


118 


71 


13333 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


89 


57 








gene 22 clone HMSJWl 8. 






13334 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


121 


80 








6613. 






13335 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


73 








6613. 






13336 


AJ005562 


Mus musculus 


SPR2D protein 


86 


44 


13337 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


169 


66 








gene No. 107. 






13338 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


57 








4609. 










-. 


microvascular endothelial 










norvegicus 


dijfferentiation gene 2 






13340 


AF090895 


Homo sapiens 


PRODI 17 


137 


47 


13341 


K01664 


Drosophila 


Bkm-like protein 


115 


46 






melanogaster 








13342 


G03790 


Homo sapiens 


Human secreted protein, SBQ ID NO: 


100 


76 








7871. 






13343 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


134 


65 
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NO: 


species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 






7795. 






13344 j AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhala. 


76 


76 


13345 Y86248 

j 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


111 


72 


13346 G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4633. 


143 


69 


13347 AFl 19900 


Homo sapiens 


PR02822 


93 


58 


13348 [ G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


169 


75 


13349 I W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


83 


68 


13350 ' AFl 16715 


Homo sapiens 


PR02829 


134 


64 


1335! 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


] 15 


65 


13352 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


118 


59 


13353 


Y02887 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


139 


77 ■ 


13354 


AFl 16661 


Homo sapiens 


PR01438 


112 


75 


13355 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


170 


73 


13356 


AF126163 


Homo sapiens 


HHLA3 protein 


116 


55 


13357 


ni83^i9 


Variola virus 


D4L 


101 


58 


13358 


AF130089 


Homo sapiens 


PRO2550 


127 


52 


13359 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


184 


82 


13360 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


78 


50 


13361 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


126 


71 


13362 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


174 


76 


13363 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953. 


176 


51 


13364 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


99 


41 


13365 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


118 


56 


13366 


U 12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


158 


50 


13367 


Y 19467 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


65 


64 


13368 


M15530 


Homo sapiens 


B-cell growth fector 


132 


77 


13369 , 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


119 


63 


13370 


AFl 30079 


Homo sapiens 


PR02852 


108 


61 


13371 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


150 


76 


13372 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


117 


65 


13373 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


118 


57 


13374 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


121 


63 


13375 


AFl 18086 


Homo sapiens 


PRO 1992 


147 


74 


13376 


AF130089 


Homo sapiens 


PRO2550 


149 


48 


13377 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


123 


55 



493 



wo 01/64835 



PCT/USOl/04927 



NO: 








Waterman 


Identit 

y 


13378 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


120 


53 


13379 


AF090894 


Homo sapiens 


PRO0113 


101 


52 


13380 


AFl 07406 


Homo sapiens 


GW128 


126 


51 


13381 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


156 


84 


13382 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253, 


119 


80 


13383 


Y14437 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27. 


84 


70 


13384 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


107 


53 


13385 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


107 


53 


13386 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870, 


141 


65 


13387 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7881. 


113 


63 


13388 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI8. 


155 


77 


13389 


AFl 30089 


Homo sapiens 


PRO2550 


159 


62 


13390 


AF161356 


Homo sapiens 


HSPC093 


154 


74 


13391 


ACO06135 


Arabidopsis 
tiialiana 


putative vicilin storage protein 
(globulin-like) 


171 


28 


13392 


AFl 13685 


Homo sapiens 


PRO0974 


226 


57 


13393 


AFl 18082 


Homo sapiens 


PRO 1902 


69 


68 


13394 


D23661 


Homo sapiens 


ribosomal protein L37 


498 


96 


13395 


AF130079 


Homo sapiens 


PR02852 


109 


54 


13396 


AFl 19851 


Homo sapiens 


PRO 1722 


107 


57 


13397 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


165 


68 


13398 


AF130089 


Homo sapiens 


PRO2550 


145 


68 


13399 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7870, 


134 


54 


13400 


J03798 


Homo sapiens 


small nuclear riboprotein Sm-D 


451 


93 


13401 


D29833 


Homo sapiens 


proline rich peptide P-B 


64 


39 


13402 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


99 


76 


13403 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


159 


76 


13404 


AF090894 


Homo sapiens' 


PRO0113 


152 


70 


13405 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


45 


13406 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


172 


72 


13407 


G00407 • 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


135 


84 


13408 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


63 


13409 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


138 


58 


13410 


AFl 30089 


Homo sapiens 


PRO2550 


101 


70 


13411 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


73 


13412 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


145 


48 


13413 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


95 


50 


13414 


AF090942 


Homo sapiens 


PRO0657 


171 


72 


13415 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


120 


64 
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SEQID 
NO: 






Description 


Waterman 


Itlentit 


13416 


M15530 


Homo sapiens 


B-cell growth factor 


121 


45 


13417 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6571. 


115 


58 


13418 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


146 


70 


13419 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


85 


64 


13420 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


77 


73 


13421 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6619. 


149 


62 


13422 


AF130089 


Homo sapiens 


PRO2550 


111 


39 


13423 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


106 


42 


13424 


AF119882 


Homo sapiens 


PR02492 


110 


70 


13425 


Y I 4482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


104 


59 


13426 


API 18082 


Homo sapiens 


PRO 1902 


129 


48 


13427 


X70771 


Chironomus 
tentans 


Spl7 


95 


56 


13428 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


66 


13429 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107, 


149 


80 


13430 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:I63. 


103 


63 


13431 


AF116661 


Homo sapiens 


PRO 143 8 


128 


54 


13432 


AK024455 


Homo sapiens 


FLJ00047 protein 


117 


69 


13433 


YI3141 


Bromheadia 
finlaysoniana 


extensin 


91 


40 


13434 

"13435^ 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


118 


64 


AF090944 


Homo sapiens 


PRO0663 


115 


50 


13436 


X55687 


Lycopersicon 
esculentum 


extensin (class II) 


73 


32 


13437 


AF090942 


Homo sapiens 


PRO0657 


77 


50 


13438 


AP000061 


Aeropyram 
pemix 


104aa long hypothetical protein 


82 


41 


13439 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4562. 


94 


51 


13440 


AF161356 


Homo sapiens 


HSPC093 


115 


53 


13441 


Y27868 


Homo sapiens 


Human secreted protein encoded by 

gene No. 107. 


125 


44 


13442 


Y59784 


Homo sapiens 


Human normal ovarian tissue derived 
protein 61. 


1096 


100 


13443 


U33547 


Homo sapiens 


MHC class II antigen 


120 


73 


13444 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


106 


71 


13445 


W54966 


Homo sapiens 


Synthetic human type HI collagen 

SYN-C3. 


115 


34 


13446 


G02256 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6337. 


486 


100 


13447 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


155 


60 


13448 


AF130079 


Homo sapiens 


PR02852 


167 


56 


13449 


AB047890 


Macaca 
fascicularis 


hypothetical protein 


171 


62 


13450 


AK025326 


Homo sapiens 


\innamed protein product 


111 


67 


13451 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


93 


76 
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NO: 




Species 


Description 


Waterman 
Score 


Identit 

y 


13452 


G02493 


Homo sapiens 


Hiiman secreted protein, SEQ ID NO: 
6574. 


109 


91 


13453 


U65650 


Arabidopsis 
thaliana 


blue-copper binging protein III 


103 


35 


13454 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


123 


69 


13455 


AF090894 


Homo sapiens 


PRO0113 


122 


56 


13456 


D38585 


Homo sapiens 


TSC-22 


710 


99 


13457 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


113 


45 


13458 


X55686 


Lycopersicon 
esculentum 


extensin (class U) 


64 


51 


13459 


AF 107406 


Homo sapiens 


GW128 


162 


54 


13460 


y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


145 


65 


13461 


X61045 


Hydra sp. 


mini-collagen 


107 


34 


13462 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

6913. 


130 


46 


13463 


Y35970 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 219. 


189 


80 


13464 


L47668 


Homo sapiens 


alpha-2 collagen type I 


64 


39 


13465 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6571. 


155 


65 


13466 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


76 


13467 


Y I 4437 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27. 


109 


70 


13468 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


144 


69 


13469 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

8153. 


121 


39 


13470 


M15530 


Homo sapiens 


B-cell growth fector 


144 


57 


13471 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7881. 


116 


85 


13472 


V00488 


Homo sapiens 


alpha globin 


704 


96 


13473 


AJ012166 


Canis familiaris 


brain-specific synapse associated 
protem. Bassoon 


56 


45 


13474 


AFl 16661 


Homo sapiens 


PRO 143 8 


128 


45 


13475 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


99 


78 


13476 


V00488 


Homo sapiens 


alpha globin 


278 


91 


13477 


W65762 


Homo sapiens 


Human HDCAP protein. 


490 


98 


13478 


AB033767 


Homo sapiens 


brain-selective and closely mapped on 
the counter allele of CMAP in cystatin 
cluster 


2184 


97 


13479 


AB033767 


Homo sapiens 


brain-selective and closely mapped on 
the counter allele of CMAP in cystatin 
cluster 


2184 


97 


13480 


Y42382 


Homo sapiens 


Amino acid sequence of fi{317_l] . 


613 


95 


13481 


D29833 


Homo sapiens 


proline rich peptide P-B 


61 


58 


13482 


AK023277 


Homo sapiens 


unnamed protein product 


488 


98 


13483 


X67703 


Drosophila 
melanogaster 


Mst84Da 


53 


60 


13484 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119, 


135 


54 


13485 


AF210350 


Sceloporus 
siniferus 


NADH dehydrogenase subunit 4 


149 


79 


13486 


Y 14544 


Danio rerio 


HoxcS protein 


98 


36 


13487 


AP002032 


Arabidopsis 




111 


27 
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SEQID 




Species 


Description 






NO: 








Waterman 


Identit 










Score 


y 






thaliana 








13488 


AB047846 


Homo sapiens 


gammal-COP 


4463 


100 


13489 


M37194 


Rattus 


clathrin-associated protein 17 


739 


100 






norvegicus 








13490 


Y59743 


Homo sapiens 


Human normal ovarian tissue derived 


138 


96 








protein 20. 






13491 


U01317 


Homo sapiens 


G-gamma globin 


198 


90 


13492 


W04180 


Homo sapiens 


Cellubrevin-1. 


459 


98 


13493 


AL121585 


Homo sapiens 


bA504H3.1 (SNX5 (sorting neidn 5)) 


2058 


100 


13494 


X53777 


Homo sapiens 


putative ribosomal protein (AA 1 - 1 84) 


970 


100 


13495 


AJ271881 


Homo sapiens 


bromodomain containing protein 


3378 


99 


13495 


AF132970 


Homo sapiens 


CGI-3 6 protein 


878 


100 


13497 


V73498 cdl 


Homo sapiens 


21-AUG-1997 Human SlOOPl DNA. 


273 


100 


13498 


AL121585 


Homo sapiens 


bA504H3.1 (SNX5 (sorting nexin 5)) 


2058 


100 


13499 


X82456 


Homo sapiens 


LIM and SH3 domain protein 


1420 


100 


13500 


AJ277932 


Homo sapiens 


RPBl la protein 


588 


98 


13501 


Y53569 


Homo sapiens 


Consensus sequence for human RADl 


633 


84 








binding proteins. 






13502 


W73434 


Homo sapiens 


Human secreted protein encoded by 


1903 


100 






Gene No. 2. 






13503 


A/005981 


Sus scrofa 


cAMP-regulated phosphoprotein 


577 


100 


13504 


AFH6715 


Homo sapiens 


PR02829 


143 


65 


13505 


U97553 


murid 


unknown 


84 


35 






herpesvirus 4 








13506 


X06323 


Homo sapiens 


put. ribosomal protein L3 (AA 1 - 348) 


1903 


100 


13507 


W82397 


Homo sapiens 


Human UBP protein #3. 


4264 


99 


13508 


M86246 


Homo sapiens 


EHS-2 


110 


71 


13509 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


121 


50 








gene 22 clone HMSJW18. 






13510 


AF073519 


Homo sapiens 


small EDRK-rich factor 1, long isoform 


91 


44 


13511 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


591 


100 








7893. 






13512 


Y04369 


Homo sapiens 


Human HUCEP-14 protein. 


1389 


100 


13513 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


60 








7795. 






13514 


AE000232 


Escherichia coli 


orf, hypothetical protein 


1480 


100 






K12 








13515 


X76717 


Homo s^iens 


MT-llprotan 


382 


100 


13516 


Y00919 


Homo sapiens 


Human Rab protein, RABP-2, protein 


1039 


100 








sequence. 






13517 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


67 








7795. 






13518 


AF090895 


Homo sapiens 


PRO0117 


109 


56 


13519 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


65 








7795. 






13520 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


71 








6619. 






13521 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


60 








4658. 






13522 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


72 








6619. 






13523 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


73 








8153. 






13524 


M15530 


Homo salens 


B-cell growth factor 


112 


67 


13525 


AFl 19851 


Homo sapiens 


PRO 1722 


135 


57 


13526 


G00541 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


98 


82 








4622. 






13527 


W48351 


Homo sapiens 


Human breast cancer related protein 


117 


41 








BCRB2. 
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NO: 








Waterman 


IrtenHt 

y 


13528 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


105 


100 


13529 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


79 


78 


13530 


AJ000536 


Arabidopsis 
thaliana 


COPl protein 


98 


32 


13531 


U50403 


Homo sapiens 


breast cancer suppressor element 
Ishmael Upper RP2 


101 


62 


13532 • 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4433. 


156 


56 


13533 


Z80779 


Homo sapiens 


liistone H2B 


628 


100 


13534 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


62 


13535 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-ricii 
glycoprotein 


82 


63 


13536 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


93 


56 


13537 


AP000616 


Qryza sativa 


similar to RING-H2 finger protein 
RHA]a(AF078683) 


139 


81 


13538 


AF130051 


Homo sapiens 


PRO0898 


90 


53 


13539 


XI 6832 


Homo sapiens 


preprocathepsin H (AA -22 to 3 14) 


1835 


99 


13540 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


100 


80 


13541 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


131 


82 


13542 


G00452 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4533. 


94 


65 


13543 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


141 


86 


13544 


AF130089 


Homo sapiens 


PRO2550 


107 


72 


13545 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


435 


89 


13546 


U63810 


Homo sapiens 


WD40 protein Ciao 1 


505 


100 


13547 


AF090931 


Homo sapiens 


PRO0483 


126 


73 


13548 

"13549 


Y 14482 


Homo sapiens 


Fragment of human secieted protein 

encoded by gene 17. 


143 


65 


U28971 


Caenorliabditis 
elcgans 


similar to RD tandem repeat region of 
RJl) protein (nuclear rna-binding 
protein) 


62 


13550 


API 13683 


Homo sapiens 


PRO0974 


126 


48 


13551 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7874. 


89 


53 


13552 


AFl 16661 


Homo sapiens 


PRO 143 8 


109 


69 


13553 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


101 


85 


13554 


XI 6832 


Homo sapiens 


preprocathepsin H (AA -22 to 3 14) 


1835 


99 


13555 


Y73435 


Homo sapiens 


Human secreted protein clone yd73_l 
protein sequence SEQ ID NO:92. 


528 


84 


13556 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


13557 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


67 


13558 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7174. 


189 


75 


13559 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6619. 


131 


53 


13560 


AFl 18086 


Homo sapiens | PR01992 


157 


71 


13561 


Y00320 


Homo sapiens Human secreted protein encodedby 

gene 64. 


101 


64 


13562 


AF159055 


Homo sapiens | leucine zdpper-like protein 


119 


74 



498 
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SEQ ID 
NO: 


Accession No. 


species 


Description 


Smith- 
Waterman 


% 
y 


13563 


AFl 18082 


Homo sapiens 


PRO 1902 


83 


86 


13564 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
5327. 


234 . 


97 


13565 


Y59778 


Homo sapiens 


Human normal ovarian tissue derived 
protein 55. 


66 


100 


13566 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5909. 


103 


60 


13567 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


152 


96 


13568 


X83703 


Homo sapiens 


nuclear protein 


1633 


99 


13569 


G02473 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6554. 


242 


45 


13570 
13371 ' 


G00521 
XP000616 


Homo sapiens 
Oryza sativa 


Human secreted protein, SEQ ID NO: 

4602. 

similar to RING-H2 finger protein 
RHAla(AF078683) 


105 

Tio ~ 


90 

"80 


13572 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


46 


13573 


X92744 


Homo sapiens 


liBD-1 


290 


83 


13574 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


107 


68 


13575 


AFl 07406 


Homo sapiens 


GW128 


356 


100 


13576 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 


101 


48 


13577 


AF220264 


Homo sapiens 


lVIOST-1 


95 


73 


13578 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


129 


68 


13579 


AK024455 


Homo sapiens 


FLJ00047 protein 


81 


76 


13580 


AF130051 


Homo sapiens 


PRO0898 


128 


64 


13581 


AF084256 


Homo sapiens 


beta glucuronidase isofonn d 


142 


58 


13582 


AFl 18082 


Homo sapiens 


PRO 1902 


85 


51 


13583 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


114 


52 


13584 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6536. 


66 


54 


13585 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


109 


64 


13586 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


109 


77 


13587 


AF090901 


Homo sapiens 


PRO0195 


114 


53 


13588 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


95 


66 


13589 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


117 


58 


13590 


AK000521 


Homo sapiens 


unnamed protein product 


1313 


100 


13591 


AL357374 


Homo sapiens 


bA353C18.2 (novel protein) 


421 


100 


13592 


AL357374 


Homo sapiens 


bA353C18.2 (novel protein) 


421 


100 


13593 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


139 


62 


13594 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


103 


56 


13595 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


109 


76 


13596 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


127 


88 


13597 


G00325 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4406. 


275 


100 


13598 


W03642 


Homo sapiens 


Human carmabinoid GPRN-teiminal 
sequence. 


108 


63 


13599 


AFl 16661 


Homo sapiens 


PR01438 


128 


64 
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NO: 








Waterman 


Identit 

y 


13600 


X55682 


Lycopersicon 
esculentum 


extensin (class I) 


58 


43 


13601 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


106 


45 


13602 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


122 


92 


13603 


AF149419 


Oryctolagus 
cuniculus 


eye sodium bicarbonate cotransport 
protein NBC2 


126 


75 


13604 


AJ005567 


Mus musculus 


SPR2I protein 


63 


40 


13605 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


150 


96 


13606 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


121 


60 


13607 


G02S67 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6948. 


85 


60 


13608 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


57 


35 


13609 


AFl 07406 


Homo sapiens 


GW128 


108 


42 


13610 


D82345 


Homo sapiens 


NB thymosin beta 


167 


100 


13611 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


59 


13612 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


125 


50 


13613 


Y32193 


Homo sapiens 


Human receptor molecule (REC) 
encoded by Incyte clone 044150. 


490 


80 ■ 


13614 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


57 


13615 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


58 


13616 


AFl 19900 


Homo sapiens 


PR02822 


95 


53 


13617 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


58 


13618 


S7997S 


Homo sapiens 


prion protein, PrP {octapeptide 

repeats} 


88 


42 


13619 


AF220264 


Homo sapiens 


MOST- 1 


130 


81 


13620 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


95 


65 


13621 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


168 


85 


13622 


KOI 664 


Drosophila 
melanogaster 


Bkm-iike protein 


118 


65 


13623 


M86246 


Homo sapiens 


EHS-2 


100 


63 


13624 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQlDNO:I63. 


104 


46 


13625 


AF067205 


Homo sapiens 


vesicle transport related protein 


198 


69 


13626 


U39529 


Echinometra 
mathaei 


bindin 


72 


47 


13627 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


52 


13628 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


123 


54 




KOI 664 


Drosophila 
melanogaster 


Bkra-like protein 


113 


46 


13630 


AF161536 


Homo sapiens 


HSPC05I 


582 


100 


13631 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. lOI. 


158 


87 


13632 


AF130089 


Homo sapiens 


PRO2550 


105 


66 


13633 


AFl 16661 


Homo sapiens 


PR01438 


116 


53 


13634 


AF116715 


Homo sapiens 


PR02829 


101 


60 



500 



w o 01/64835 



PCT/llSOl/04927 



NO: 






Description 


Waterman 


Identit 

y 


13635 


Y14482 


Homo sapiens 


Fragment of himian secreted protein 
encoded by gene 17. 


139 


63 


13636 


U33547 


Homo sapiens 


MHC class 11 antigen 


154 


79 


13637 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 


107 


75 


13638 


AF090895 


Homo sapiens 


PRO0117 


87 


62 


13639 


G02532 


Homo sapiens 


Human secreted protein, SEQ BD NO: 
6613. 


131 


69 


13640 


AF026689 


Homo sapiens 


prostate-specifi c tran sglutaminase 


128 


60 


13641 


S58722 


Homo sapiens 


X-)inked retinopathy protein {C- 
terminal, clone XEH.Sc} 


128 


81 


13642 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


210 


67 


13643 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


2S7 


76 


13644 


Y73483 


Homo sapiens 


Human secreted protein clone yll8_l 
protein sequence SEQ ID NO: 188. 


376 


88 


13645 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


130 


85 


13646 
T364T 


Y87212 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:251. 


423 


97 
86 


U50403 


Homo sapiens 


breast cancer suppressor element 
Ishmael Upper RP2 


103 


13648 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7881. 


128 


55 


13649 


X58521 


Homo sapiens 


nucleoporin p62 


2610 


98 


13650 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


69 


13651 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


111 


81 


13652 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


140 


60 


13653 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7344. 


121 


46 


13654 


U13066 


Nicotiana alata 


arabinogalactan-protein precursor 


92 


33 


13655 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


134 


50 


13656 


Y07766 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 23. 


156 


100 


13657 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


124 


48 


13658 


G01246 


Homo sapiens 


Human secreted protein, SEQ ED NO: 

5327. 


171 


80 


13659 


AK000017 


Homo sapiens 


unnamed protein product 


611 


100 


13660 


U90446 


Mus musculus 


RNAse L inhibitor 


3100 


99 


13661 


AE000882 


Methanothermob 
acter 

theraioautotrophi 


phosphoenolpyruvate synthase 


88 


38 


13662 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


65 


13663 


KOI 664 


Drosophila 
melanogaster 


Bkra-Iike protein 


97 


34 


13664 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


77 


41 


13665 


Y94890 


Homo sapiens 


Human protein clone HP02798. 


325 


98 


13666 


G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7795. 


139 


65 


13667 


AF090930 


Homo sapiens 


PRO0478 


119 


88 


13668 


AF130089 


Homo sapiens 


PRO2550 


120 


77 



501 



w o 01/64835 



PCT/llSOl/04927 



NO: 








Waterman 


Identit 


13669 


AC006693 


Caenorhabditis 
elegans 


Hypothetical protein W02H5.e 


165 


85 


13670 


W80293 


Homo sapiens 


Human translocation associated protein 
designated Gp25L-H. 


1003 


95 


13671 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (P1R:A41726) 


113 


91 


13672 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


160 


60 


13673 


AL 132841 


Caenorhabditis 
elegans 


Y15E3A.3 


178 


75 


13674 


AFl 18086 


Homo sapiens 


PRO 1992 


85 


75 


13675 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


132 


55 


13676 


AF220264 


Homo sapiens 


MOST- 1 


133 


85 


13677 


Y19730 


Homo sapiens 


SEQ ID NO 448 from W09922243. 


122 


51 


13678 


Y65416 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO: 1577. 


467 


98 


13679 


Y16589 


Homo sapiens 


A protein that interacts with 
presenilins. 


2286 


99 


13680 


ALl 60371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


104 


52 


13681 


AF090931 


Homo sapiens 


PRO0483 


117 


67 


13682 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


60 


56 


13683 


AF090944 


Homo sapiens 


PRO0663 


93 


90 


13684 


AL096770 


Homo sapiens 


bA150A6.2 (novel 7 transmembrane 
receptor (rhodopsin family) (olfactory 
receptor like) protein (hs6Ml-21)) 


178 


52 


13685 


AFl 16661 


Homo sapiens 


PR01438 


115 


60 


13686 

13687^ 


U62039 
YT3l4I 


Elephantulus 
edwardii 
Bromheadia 
finlaysoniana 


reverse transcriptase 


86 


53 


extensin 


77 


T6 


13688 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7867. 


114 


60 


13689 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


53 


13690 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

7795. 


107 


51 


13691 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


178 


85 


13692 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


94 


35 


13693 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


143 


78 


13694 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4497. 


98 


50 


13695 


D63163 


Rattus sp. 


cyclin C 


111 


86 


13696 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMS.TW] 8. 


118 


95 


13697 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


124 


80 


13698 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


129 


70 


13699 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7868. 


93 


58 


13700 


W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


100 


73 


13701 


M76744 


Homo sapiens 


BGP 


112 


52 



502 



wo 01/64835 



PCT/USOl/04927 



SEQ ID 


Accession No. 




Description 






NO: 








Waterman 


Identit 










Score 


y 


13702 


Y15155 


Homo sapiens 


phosphorylase kinase beta-subunit 


206 


100 


13703 


AF090901 


Homo sapiens 


PRO0195 


100 


70 


13704 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


79 








7068. 






13705 


R95913 


Homo sapiens 


Neural thread protein. 


106 


36 


13706 


Ml 5073 


Homo sapiens 


MHC HLA-DR-beta-1 chain 


74 


77 


13707 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 


110 


35 








SEQIDNO:163. 






13708 


G03789 


Homo sapiens 


Human sea-eted protein, SEQ ID NO: 


150 


56 








7870. 






13709 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


77 








6444. 






13710 


X70775 


Chironomus 


Spl2 gene homologue 


85 


38 






cingulatus 








13711 


X80265 


Hordeura 


structural protein 


96 


41 






vulgare 








13712 


AB007922 


Homo sapiens 


KIAA0453 protein 


147 


64 


13713 


G03043 


Homo sapiens 


Hiunan secreted protein, SEQ ID NO: 


124 


60 








7124. 






13714 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


108 


65 








gene 100 clone HNFIU96. 






13715 


AF116661 


Homo sapiens 


PR01438 


120 


77 


13716 


AF 11 8086 


Homo sapiens 


PR01992 


127 


64 


13717 


AK024455 


Homo sapiens 


FLJ00047 protein 


93 


56 


13718 


AF 116661 


Homo sapiens 


PRO 1438 


137 


55 


13719 


W48352 


Homo sapiens 


Human breast cancer related protein 


93 


59 








BCFLTI. 






13720 


API 19851 


Homo sapiens 


PROI722 


94 


58 


13721 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


45 








6619. 






13722 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


117 


58 








gene No. 107. 






13723 


AF16136I 


Homo sapiens 


HSPC098 


115 


50 


13724 


Y95829 


Homo sapiens 


Native human Tie receptor signal 


108 


100 








peptide. 






13725 


AF 118086 


Homo sapiens 


PRO 1992 


166 


75 


13726 


AF 116636 


Homo sapiens 


PRO 1488 


95 


70 


13727 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


70 








7795. 






13728 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


60 








6613. 






13730 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


63 








4754. 






13731 


U33547 


Homo sapiens 


MHC class II antigen 


123 


69 


13732 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


135 


88 


13733 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


75 








4438. 






13734 


AF026204 


Caenorhabditis 


C30E1.1 gene product 


102 


51 






elegans 








13735 


Y12950 


Homo sapiens 


Amino acid sequence of a human 


117 


63 








secreted peptide. 








G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


87 








7519. 






13737 


AL3 59782 


Trjpanosoma 


possible (hhv-6) ul 102, variant a dna. 


115 


58 






brucei 


complete virion genome. 






13738 


AF130089 


Homo sapiens 


PRO2550 


122 


70 


13739 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


74 








4478. 






13740 


AF130051 


Homo sapiens 


PRO0898 


85 


70 



503 



w o 01/64835 



PCT/llSOl/04927 



NO: 








"Waterman 


Identit 

y 


13741 


AF093748 


Homo sapiens 


KH type splicing regulatory protein; 
KSRP 


91 


51 


13742 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 


150 


77 


13743 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


124 


82 


13744 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


91 


48 


13745 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


116 


64 


13746 


AB001684 


Chlorella 
vulgaris 


ORF54d 


70 


66 


13747 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


140 


60 


13748 


AF090901 


Homo sapiens 


PRO0r95 


92 


39 


13749 


AF090895 


Homo sapiens 


PRO0117 


J 48 


67 


13750 


AFl 19882 


Homo sapiens 


PR02492 


91 


45 


13751 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613, 


109 


60 


13752 


G 003 97 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


129 


65 


13753 


AF161356 


Homo sapiens 


HSPC093 


99 


48 


13754 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


124 


70 


13755 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


120 


51 


13756 


AF130089 


Homo sapiens 


PRO2550 


142 


96 


13757 


AFl 30089 


Homo sapiens 


PRO2550 


132 


86 


13758 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


163 


50 


13759 


Y 19609 


Homo sapiens 


SEQ ID NO 327 from W09922243. 


92 


60 


13760 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6913. 


154 


52 


13761 


U62040 


Eiephantulus 
edwardii 


reverse transcriptase 


134 


51 


13762 


AB001684 


Chlorella 
vulgaris 


ORF49b 


100 


45 


13763 


AFl 18082 


Homo sapiens 


PRO 1902 


129 


50 


13764 


AF220264 


Homo sapiens 


MOST-1 


129 


66 


13765 


R59842 


Homo sapiens 


ApoE4Ll protease. 


135 


50 


13766 


AF220264 


Homo sapiens 


MOST-1 


115 


71 


13767 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


139 


73 


13768 


AF130089 


Homo sapiens 


PRO2550 


97 


34 


13769 


U80739 


Homo sapiens 


CAGH26 


564 


100 


13770 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


128 


55 


13771 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


102 


68 


13772 


AE004507 


Pseudomonas 
aeruginosa 


hypothetical protein of bacteriophage 
Pfl 


93 


44 


13773 


AB044885 


Canis familiarls 


dopamine receptor D4 


78 


52 


13774 


AF026246 


Homo sapiens 


HERV-E envelope glycoprotein 


108 


53 


13775 


S71805 


Homo sapiens 


RNA-binding protein=TLS/FUS-ERG 


126 


100 


13776 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


156 


82 


13777 


ABO 15 727 


Mus musculus 


truncated granzyme M 


86 


37 


13778 


G02832 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


74 


52 



504 



w o 01/64835 



PCT/llSOl/04927 



NO: 






Description 


AV;lftrlllail 




Idtnlil 
J. 








6913. 






13779 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


137 


61 


13780 


M14123 


Homo sapiens 


neutral protease large subunit 


246 


55 


13781 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HIvISJWl 8. 


99 


67 


13782 


AFl 32972 


Homo sapiens 


CGI-38 protein 


902 


99 


13783 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6550. 


147 


63 


13784 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


68 


13785 


AF130051 


Homo sapiens 


PRO0898 


138 


72 


13786 


AP000616 




similar to RING-H2 finger protein 
RHAla (AF078683) 


123 


95 


13787 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
IDNO:1051. 


92 


53 


13788 


U12206 


Homo sapiens 


unknown 


84 


42 


13789 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7874. 


118 


58 


13790 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


50 


13791 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


137 


76 


13792 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4435. 


104 


72 


13793 i AB006006 


Bostaurus 


neurocalcin alpha 


1000 


100 


13794 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


183 


95 


13795 


W34499 


Homo sapiens 


Obesity receptor C protein. 


120 


85 


13796 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


127 


67 


13797 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


182 


83 


13798 Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


112 


71 


13799 ! U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


91 


84 


13800 


G02902 


Homo s^iens 


Human secreted protein, SEQ ID NO: 
6983. 


105 


67 


13801 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


154 


72 


13802 


AF 11 9900 


Homo sapiens 


PR02822 


131 


50 


13803 


AK024435 


Homo sapiens 


FLJ00025 protein 


117 


95 


13S04 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6613. 


142 


53 


13805 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


48 


13806 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: ' 
4478. 


98 


66 


13807 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4478. 


98 


66 


13808 


AL3 59782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


150 


54 


13809 


M37679 


Mus musculus 


Ig heavy chain precursor 


70 


100 


13810 


AL451015 


Neurospora 


putative protein 


96 


55 


13811 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


88 


77 


13812 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


128 


71 



505 



w o 01/64835 
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SI- U 10 
NO; 




Species 


Description 


Waterman 
Score 




13813 


AF130O89 


Homo sapiens 


PRO2550 


93 


75 


13814 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


119 


60 


13815 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


64 


13816 


AJO 11435 


Blackstonia 
imperfoliata 


maturase 


93 


48 


13817 


S58722 


Homo sapiens 


X-linked retinopathy protein (C- 
terminal, clone XEH.8c} 


130 


76 


13818 


AFl 19851 


Homo sapiens 


PRO 1722 


141 


50 


13819 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
5066. 


74 


60 


13820 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6566. 


76 


72 


13821 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


135 


50 


13822 


KOI 664 


Drosophila 
melanogaster 


BlaB-Iike protein 


95 


52 


13823 


L10908 


Mus musculus 


Gcapl gene product 


103 


50 


13824 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


146 


62 


13825 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


125 


83 


13826 


L10908 


Mus musculus 


Gcapl gene product 


96 


43 


13827 


G02971 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7052. 


102 


64 


13828 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


135 


53 


13829 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


142 


70 


13831 


AP000060 


Aeropyrum 
pernix 


1 0 1 aa long hypothetical protein 


79 


50 


13832 


AFl 19900 


Homo sapiens 


PR02822 


94 


46 


13833 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


145 


40 


13834 


AF2 18028 


Homo sapiens 


unknown 


117 


57 


13835 


L10908 


Mus musculus 


Gcapl gene product 


79 


45 


13836 


Z26876 


Homo sapiens 


ribosomal protem 


129 


93 


13837 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


124 


76 


13838 


AF044311 


Homo sapiens 


gamma-synuclein 


603 


99 


13839 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7888. 


100 


82 


13840 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


154 


46 


13841 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


64 


13842 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


92 


62 


13843 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protem (nuclear ma-binding 
protein) 


93 


60 


13844 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


63 


13845 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


145 


79 


13846 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


102 


50 


13847 


G03469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7550. 


101 


68 
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SEQ ID 






Description 


Waterman 
Score 


Identit 

y 


13848 


G03240 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7321. 


107 


43 


13849 


U05313 


Trypanosoma 
brucei 


CR3 


98 


40 


13850 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


100 


65 


13851 


AFl 59055 


Homo sapiens 


leucine zipper-like protein 


103 


55 


13852 


R59842 


Homo sapiens 


ApoE4Ll protease. 


99 


71 


13853 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


142 


65 


13854 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


119 


79 


13855 


R59842 


Homo sapiens 


ApoE4Ll protease. 


93 


59 


13856 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


114 


76 


13857 


W48353 


Homo sapiens 


Human breast cancer related protein 

BCFLT2. 


94 


57 


13858 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


105 


58 


13859 


AF289022 


Homo sapiens 


fonniminotransferase cyclodeaminase 
formC 


467 


100 


13860 


AF078844 


Homo sapiens 


hqp0376 protein 


488 


100 


13861 ; AB032436 

1 


Homo sapiens 


brain-specific Na-dependent inorganic 
phosphate cotransporter 


2968 


100 


13862 G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6595. 


91 


72 


13863 AF 130079 


Homo sapiens 


PR02852 


131 


63 


13864 : Z29701 cdl 

1 


Homo sapiens 


29-MAY-1998 Wild-type human c-Src 
tyrosine kinase cDNA. 


2380 


100 


13865 L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


80 


33 


13866 AL050318 


Homo sapiens 


dJ977B1.5 (myosin regulatory light 
chain 2, smooth muscle isoform) 


90/1 


100 


13867 


U37690 


Homo sapiens 


UNA polymerase 11 subunit 


358 


100 


13868 


X13923 


Homo sapiens 


cytochrome c oxidase subunit VIb (AA 
1-86) 


491 


100 


13869 


L13848 


Homo sapiens 


RNA helicase A 


6669 


99 


13870 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


133 


46 


13871 


K02064 


Bostaums 


cytochrome c oxidase subunit IV 
precursor EC 1.9.3.1 


96 


72 


13872 


D31763 


Homo sapiens 


ha0946 protein is Kruppel-related. 


4606 


96 


13873 


Z14014 


Nicotiana 
tabacum 


Pistil extensin like protein, partial CDS 
only 


83 


43 


13874 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


117 


70 


13875 


Z52203_cdl 


Homo sapiens 


17-SEP-1998 Human PR0217 protein 
encoding cDNA, UNQ191. 


2135 


99 


13876 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


106 


64 


13877 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


143 


50 


13878 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


104 


60 


13879 


L25404 


Brassica napus 




124 


42 


13880 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


139 


59 


13881 


AC003113 


Arabidopsis 
thaliana 


F2401.6 


70 


64 
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NO: 




Species 


Description 


Waterman 


Ideiitit 

y 


13882 


AB000098 


Rattus 
norvegicus 


MIPP65 


905 


50 


13883 


Z95114 


Homo sapiens 


bK212A2.1 (TNF-inducible protein 
CG12-1 (similar to apolipoprotein L)) 


1639 


100 


13884 


AF 132984 


Homo sapiens 


Nuclear pore complex interacting 
protein NPIP 


551 


82 


13885 


AF121862 


Homo sapiens 


Sorting nexin 13 


1453 


99 


13886 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6574. 


83 


69 


13887 


KOI 664 


Drosopliila 
melanogaster 


Bkm-lilce protein 


109 


77 


13888 


AF217197 


Homo sapiens 


FBP interacting repressor 


2725 


99 


13889 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


187 


94 


13890 


API 18086 


Homo sapiens 


PRO 1992 


85 


62 


13891 


M58664 


Homo sapiens 


Signal transducer CD24 


371 


97 


13892 


AK023443 


Homo sapiens 


Unnamed protein product 


125 


45 


13893 


AB040972 


Homo sapiens 


KIAAl 539 protein 


2271 


99 


13894 


U03750 


Escherichia coli 


DeaD 


95 


48 


13895 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6619. 


73 


53 


13896 


AF078851 


Homo sapiens 


Secretogranm III 


2384 


99 


13897 


U62039 


Elephantulus 
edwardii 


Reverse transcriptase 


109 


48 


13898 


X07816 


Human 

herpesvirus 4 


Epitope C13 (57 AA) 


55 


53 


13899 


API 16661 


Homo sapiens 


PRO 143 8 


146 


48 


13900 G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


77 


60 


13901 Y28643 


Homo sapiens 


Human serine protease inhibitor from 
cDNA clone HETDK50. 


2191 


100 
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TABLE 3 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

locatlon 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Phenylalanine, 
G=Glycine, H=Histidine, T=Isoleucine, 
K=Lysine, L=Leucine, M=Methionlne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


h ' 


13902 


A 


1 


114 


434 


AI FKCVEGMFRIAMVNVCFVS SGSLL I * 
PLTY/GVYDEWTHFAYMTIDLLEIPITG 
SHPVVLNALFCLEAP\WISPNTGSFAYP 
VYPKSIiIAHDFAVEATMPYIRLS ST 




13903 


A 


2 


124 


466 


KSNI PNLGDCGWESIiFNKXQSWRSSLAV 
\WDTYSSKKSNAETFTFHADLCTLSDKD 
RPITIQTALAELVKHKPKRTYEQLIAVL 
DEF/ANFIiKKWWKAYDKEMLFCEEG*KL 
CAflSN 














EGFLELIjRTRNHSNSQLQLTTGIGLFLN 
EGLKLVDKFLEDV*K*YHSETFTVNFSD 
TE*AMKHINDYVEKGTQGKIVDLVKELD 
RDTVFDLANYI FFKGKWDRPFEVNDXEE 
EDFHVDQVSTVNEPIMKLLSXMLNIHPC 
FKL 


4 


13905 


A 


5 


1 


464 


KIKSFYASKDTIKRMRVEDWKKIFAY*! 
SDKELI FTLRTLKT* *K*GKQPMLKNGQ 
EI *VP1SPQDIQIAHK\HLEGWSTSLVS 
ELAPCEAPV/RHPLTGLTIAGLQGFGEA 
GRLVHGRWGC*W/VHPF*BOf I /WQFLSK 
LHISLPYDPTTPLLGTCSR 


5 


13906 


A 


6 


308 


3 


HFVIHSKHDLAIAHLGIY/PREMKT*VH 
TKTCT* IFTVALSVIARAWNQPGRPLCS 
EWL\KYMVHTME*HSAIKRLNYRYKNNC 
VHLFLGITLSEKSQTQNVI 


6 


13907 


A 


7 


587 




FLTRETGDPTGRSSSHANTQSRFFPDDP 
PG\PIiKNIiGKTH3CGRHAGRCPGTGPDG 
P\AGCGGPRCWPSGHLAATGD*GPSCGR 
LGMIRGEAGPAGFTACSPLSGCRTPYTH 
HFPASEMSCHLNCASPRTYRSQGNRGCE 
RVAQGSQGAGGERGAKSQVPVPAPARNK 
DPAKCRKPRNRRPGNSGPWRAYRRQR 


7 


13908 


A 


8 




474 


HI LNEEHGKYEGLHE * EVKWHLYI KS PA 
FTDLHLCYQKDMNGISTSASSPAVGTVG 
MDMDKDDDFS KWNF YYS PHSYPDK*LTI 
FKTESRVRESDEVTQIKVNWDEEVI SGL 
LTSLKDNVLKATGVLYDYAYKXYLCEHT 
RSTLKEESLKLERNLQNH 


8 


13909 


A 


9 


3 


539 


SQCSPFISPACSLTALEEETEALRVHPR 
LCLS PNLAPS SGP PRPPELAP CP P S S QA 
GLRTCHSWVKGLHQPLPVASGMKSTFCN 
KTYTCPYPPP/PPLCPNHSPNALTLPDS 
VTHAVPFE/'L*SPSAPPSSTA*ILGSPS 
\CGASPCNHPHSHPGICPTPPGLWPVCP 
CAPRAWQEDGTRQ T 


9 


13910 


A 


10 


2 


453 


RL*LGLEYALLVWGTPKV*H*GGFPIYY 
YIVLLLSYALHQVTEYSMYVS IMAFNAK 
VSDPLIVGTYMTLLNTVSNLIGKWPSTV 
SLWLVNPLTVKECV*TSYQNCCTPDAAE 
LCKKLGGS CVTTUliDGYYVESI I CVS lAF 
V/W*VFLVHKFK 


10 1 13911 

1 

1 


A 


11 


20 


475 


KMGVPPLLMSDPNRFLFPKNFLREKTIS 
PPKTF*PLKIWVKGQ!i7VLNFLGPPGFKI 
FFPVFKFFFFFFFF/RDRVSLYHPGWSA 
VSQSELTAALTSPGSGDQVILPSQPPK* 
ENHUJLGGRGCSEPRLPRAEFLDLRSFS 
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SEQm 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=l.ysine, L=Leucine, M=lVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















n 


J39J2 


A 


12 


392 


63 


KLGDAGSATQSLGGSQLSWQLSRREQEL 
EQARWSAQ*QVETLGRVAESKEAtAKEH 
AGLAVQLAA\AEHEGRTVS EEATHLQ 


12 


13913 




13 


18 


338 


APHPQYLQMPPMLLPPRTGPGQFSLPSS 
EHGGHLE /GKEHETSVTLCGGEPPPQTP 
PRDPDPGP*ARRAPCPERPT*AHPRALS 
RAAPQKPRALAGPRARHPI CPGSL 


13 


13914 


A 


14 


2 


371 


TTKQ* KDNP IQTGAKDLKRHFCKEDMQM 
ASNHT\KRSLTSLVSREMHIKTTRMAGI 
KKSDlffitKHW*GRGE IRT/LLRCWWDCKV 
VQLLWKTVWQFL*GLYQYIPHDPVISRL 
GISLILYIALRTF 


14 


13915 


A 


15 


443 


1 


SRTFLDTDMKSMRG FKAS *NRQTLLLGA 
NAAGD \ KL KAMLTNHS ENRKI LQN YVKC 
TLPMLYKWNNKAWMTAYLFTTQCTEYFK 
PTVETHCSEKKISF\KILLTIYIJAPGHL 
RALMQIYKEIHWFVPVNIPSILQPIVQ 
NVISTFKS 


15 


13916 


A 


16 


375 


38 


HCQNGPLKKTDPTLLdLQETHFR/CKDT 
KRLKVRRY/QPNSNQKRAR/VPILIEDK 
1DFKTKKIFMMIKDLTIINI*ASNTRVP 
KSMKQKLAGLKEEMDNSVIMVGGFSYPV 


16 


13917 


A 


17 


448 


189 


nrdrvslccpgms*tpglkrsfhlglpk 
cwdyrrdiy*ll\farhtqmisths*qt 
tntcsypaf*ksrp*epgyntthtphss 

DML 


17 


13918 


A 


18 


1 


426 


GMSHHARSLIINQLFKKCSTLFVL/REM 
QIKSLLASSSSSSSRNSVSGQGFETVGT 
HCGQKCKLVQPF\WKaVW*!aiLKLWVFI 

lfliynrnahledtcenvdralfvivkn 

WKLSKYP PGVKWRIKLWYSHSVESSTAV 

LPKV 


18 


13919 


A 


19 


2 


423 


YFETFQPLLRRLGYQGTFFFP \KPWS S C 
LGIEYNSGPDSCA*FFLQNQIRLVNSAN 
IRLIWITLKTNQVAIAQFLECKESDQQF 
CIGVTHVKARTGWA*F*SAQGCDLLQNL 
QNVTQ/GAKIPLW*GDFNAEPTQEA*K 
HF 


19 


13920 


A 


20 


10 


443 


LKVDSGDSEVRYVFILQHITLLMCSAYM 
NQLLNIFVRPSLLAVALHMTPGFTKEDV 
YSCFRFLRDVFADBPIFLPGNTL*DFEE 
SCYLLCKSEAIQVTTKDILFTEKGNTVL 
*FLVGLFKPF/VESYHIICKSLLDEK*A 
PFIEEP 


20 


13921 


A 


21 


11 


426 


VLVETNNLRMGQVTM*PELPDM/ SPDAW 
TLSDS PSQKIGHRQQXKYS I IKMKMYTE 
DWAQACLEDTSKYEQVTQIPMAPNDATL 
/ PSS/AHLAFTAS *GAPSDS *LRNTLGL 
ICDaST\KPSSTN*KQTVIALQSHftfiLN 
MKBC 


21 


13922 


A 


22 


1147 


1768 


QLGMSHGYSSKSMPQKLMCFLFNHHLQK- 
GHECLPKVLNSKPPPIIKYLALQDLMLL 
SQYSPSRRQEVFSLSQPGGHPHNWTAIS 
RECLNLLHGMTQKLILYQEAfiATWGRVS 
SSYPVEPKKLNSPEETAFQTPKSSQMPR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQJD 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

sequence 


Predicf- 

correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£=GIutaniic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
VV=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PSVPPLVKTSLFSSKLSTPDWSPFGTP 
FGSSVMNRMAGIFriVlirrCYGS*AT*HSE 
WVLLNLSLLYHVWLCGVFLLTTKreVSWI 
LFKiyATKAHVFPVQPPFAEGS*VPSKS 
VK*QSSP\LrKYLALQDLMLLSQYSPSR 
RQEVFSLSQPGGHPHNWTAISRECUILL 
KGMTQKLILYQEAAATNGRVSSSYPVEP 
KKUSrSPEETAFQTPKSSQMPRPSVPPLV 
KTSLFSSKLSTPDWSPFGTPFGSSVMN 
RMAGIFDVNTCYGSPQSPQLIRRGPRLW 
TSAS 


22 


13923 


A 


23 


421 


3 


KIIFRAFKS*KSPSMPGFKA*KDRLrLL 
LGTNADNDFKRKPM/LLYHSKNPRALKN 
YIKSIQPLYE* KKA* MTAHLLKVWVTE Y 

FKP ivgtyc*ki/sfkiiullnddapg\h 

PKALIEMYREIHIFMSANTLFILQPMEK 


23 


13924 


A 


24 


226 


2 


THERTHSKIIHVIIIKITHPLNPSILRP 
QTTA* IKWRDLGSLQPLP/PG/LKRFSY 
LILPSSWDYRCPPLRPANFCIF 


24 


13925 


A 


25 


2 


611 


FFFFbLLGLLHQI PDVSPTGKYTTLLPL 
MIILMISG1KEVIY/DHK*HGRQNS/VR 
NTKLL ■• QDSWDTFKWKEVNVGDTVKASN 
GELLPADTVPMC\YIATSNPDRETN/VK 
TRQALPETASV 


25 


13926 


A 


26 


1 


443 


ATQWRPS LVPASAENVNKARS FAAGIHA 
LGGTNINDAMLMAVQLLDSINQDELVT* 
GSVLLIILLTDGEPTAWETNLMSIQNNV 
REAVRGRYSLFCLGLGEDVCYAFLEKLa 
LDNGGLARRIHEDSDSALHMQDFY\QEV 
AHPLVTAV 


26 


13927 


A 


27 


2 

k 


359 


KAQSKQWLPRGGSGPVK/ARVDEESAKV 
IVRVYWEAQGILLIDFLKGQRRTIYAYS 
EKI LRKP E L* * RNAWGS FSRVLHHDRAP 
AHFSHQ/TRAIG/REF* *KTIRHPHY/S 
PNLHPLDAFCF 


27 


13928 


A 


28 




427 


WRKTVKGHFTDQCRKHKAM/LGN*IHFL 
EYHIHKHSIQYSGI QATQEKFGGLKTS Y 
CSKD*YITVLDLTLKISNKDVSN*HENP 
ISMYKN/IPTVIYGSKVIKDRKLKA*!* 
CP/HRNKKF*YFDYFYVLLNTMTFLKLW 
RIITEI 


28 


13929 


A 


29 


47 


412 


HCDVLLAS SRYTCIIiPYSDRDDGPQDQL 
KMSVDFRSRRT6S/WFPQNSWGHMGVC6 
WGGAGRTLDLIHLRI PMRGLRSGGFLCR 
RKLVSESYG*EPSPL*K*KGGWGSEPS/ 
LTTVPSQL 


29 


13930 


A 


30 


336 


419 


RLECSGMISAYCNLSLPGLSDPSNSASR 


30 


13931 


A 


32 


1 


408 


KIALKLRSNYSKISGY/MANIQKLI IFL 
YTSNEQIEFEIKNT1PFTIjAPPKIK*LG 

tnltkcWqelyekiyksvmknikelnk 

WRSSYGKGKSKIKIIKIHK*MDIACSWL 
RRLKIVKISVLHNLIYRFNIVPIEIP 


31 


13932 


A 


33 


412 




QGHLSLQKFL\LPFVQLCPAPRGGVYRG 
RQDSLSCGGLHPF*ASGLLCLPTQASAM 

VEPP3PGTGIDILLVCRLLRPLEKRSIRW 
KSAVGVARFSRCRPSWLPLARKKNSP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


32 


13933 


A 


35 


304 


33 


KVWGEKVWYWQKMTQIVQMDRTESPQID 
N*SLTKEIQWRKDSLENKIf*GNNWTAPF 
SS/RSLMLMKDLTAVTKIKSKWVTDLNV 
KHKTIKLL 


33 


13934 


A 


36 


2 


424 


SKTYS IGHFTYEGKGTITSLWGKVNVED 
AVGETLGRLLVEYPWTQRFFDSFGKLSS 
ASAIMGKP FVKAHGLKGLTSLGDALQHL 
DDLKG\TFAQLTELHCDKLHVDPENFFL 
LG*VLVTVLALHF/SRRFTP/ELQAS/W 
PKM 


34 


13935 


A 


37 


2 


433 


NKRL PG PGF * KR PKP PGEGGPARyPNP L 
GGPNRE I PLGPEV* THFGPQMRPPVFLK 
ILKISGAFWGPPVGPPTWGG*EGLNPRG 
PGYN*P/R/PPAPPPWG*SQAPFPKPAP 
PP/TGIKPNP 


35 


13936 


A 


38 


3 


425 


GAAQLLPVSLPSAQRAIDLQILTTPWEN 
WTSIASLQHKTTIGRSWLITGPISATSS 
TPTSCTVASP\TRGHV*RSASWKYDVLQ 
HS SHMWWHTAVYLGEAFHQ WIATGS CH 
HRVLSCCPITGRSGRDAL*QLLPL/HAL 
TEA 


36 


13937 




^9 




463 


LNE VRD I KLS SDHW PS KTNS FLHS PG PL 
SRFEPQPASVA/PRP*SQQQSLPGICAST 

slwppnpvpfvtsslslsalpglflwlp 
sipplpgspffspe*pevplfpgptyhn 
psppd/ptvleahpdqaplp\pgvptae 
qrptpap*ahrps/lplpp 


37 


1393S 


A 


40 


223 ' 


408 


rntv*nigtdkdfmt*tpqaiatkaqid 
kwwliepmsfc/tyretiigvtdrytrd 
kifesgr 


38 


13939 


A 


41 


3 


427 


NLKMKSRTSKEN I GEHI YDFD IGRFVNT 
*SKPERKRLSFKTSVQEKTP*L*GKHK* 
MKLEP\LPYPKINSKWIKDNVRSKAIKL 
LEENRSESSIWQRML/SIMPKAQAIKEK 
VDNLDFFKIKNCV3KDTIKKALDRPSER 
EKI 


39 


13940 




42 


30 


448 


FEMRE 1 1 VAFHS I L I RYHGL INLRKFQ * 
MDRRYSKEVQDLLETMKLFARIVGPL*H 
DKF I ESHALEFELLREI KRLQEYRTAG I 
TNFCSARTYDHLKKTR*EERLKRTMLSE 
VLQYIQDSNACQQWLRRETY\ lESGPNP 


40 


13941 




43 




92S 


LRQRMPPQGIjQPCPWPPRYP/ PDSPLGC 

pggla\ppnspswpklaslagsrgssap 
pgvwglvatga*plqrpsapppcpr/lc 
pgrepcmcp 


41 


13942 


A 


44 


2 


409 


skspdptqqplrggslthsapgpslsqp 
laqltppafgpstsslfnlqksslsarh 
pqrkrrggpseptpgshpqdatvhpacq 
i fphytp\vwhilapqrhtp*svd/hpg 


42 


13943 


A 


45 


3 


136 


LPMTLFTELE*ATLRFIWNHITIQIAKa 
ILTNQK\NKaRGITNIC 


43 


13944 


A 


46 


365 




AWGSLHAPRSAVPEPGTPPVMGNALflHR- 
QFSPCLDGLSCPS FLRG*NSPPHPGSPG 
LPKHA*APflAVSPWVDPDRTVQPPPPAP 
PST\D*PHCSPPCTPWCPRRLGSAPVMP 
GCPTASyPRR 


44 


13945 


A 


47 


1 


456 - 


AELSELYEESSDLQMDVMPGEQDLPKW\ 
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SEQID 
NO: of 
nucleotiae 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

26 


Predicted 

nucleotide 

location 

correspond 

residue of 
peptide 


Predict- 
ed end 

location 
correspon 

last amino 

peptide 
sequence 


Amino acid sequence (A°=Alanine 
C=Cysteine, D=Aspartic Acid, 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EEATGTRAAI PWVP PATGAQQLEEEGPM 

es\eeaqpmaapegkrslawgpwageqp 
gqpspgrrtsraedea\eefddwbddyd 
f\preepvkgarlrflpps*ktppsfwe 
nrntplwgglkify 


45 


13946 


A 


48 


116 


442 


PQNMGMERKPGPPKGLSWASGDIiEDNLG 
VMSVGFLLSSPDDAVIWRGPRKTGLIKQ 
FLRNGTG E KVD Y FMGDTQLGTS DEHLS S 
VATWAKAHLGEPGAFTFPQRVSLRE 


46 


13947 


A 


49 


434 


3 


FTVPGPLLTVIPIFISKVPFKFSLPQIF 
NFTPFGFPFQIRV*TI1PISLRFFRKKP 
LFFFF*NKVPLCSPYWNSIARSWVPaAS 
VFQV\KESFYLNLLSVPPQVPLNVFLTF 
FFFFFLGRDR/SLPLLPRMVWNSWAQAI 
LPLWL 


47 


13948 


A 


50 


■ 


482 


EKPYQCSDFGKAFNMKTQXLWHQGVHT 
GMNP YQ CRE CG KAFGRKEQLTAHL I AHA 
R*KPyGCSECGKAFSSKSYLVIHRRTHT 
GERPYECSSCDRAFCGKSQIillHQITHS 
TENHYECNECENTYPRKASLKIHQKIDL 
GKKPF / ECNEWGKALLK 


48 


13949 1 


A 


51 


1 


470 


REFLWQEGHRAFATVDEAGEEVLQILDL 
YADVYEELLAIPVANGTKTDKDKLAGGD 
YTATIEAFICASGRAIQGGTSHLLGQNF 
SAMSEIVLEDPEIPGENQFAYANSWGLT 
TRTTWVMTIGHGDNMGL/LLTPRVA*VQ 
/MVIIPGGIPKAFLICRQKT 


49 


13950 


A 


52 


26 


448 


spgtereyrigqqsvtgvtsvddsnsyw 
r irgksatvcergtp ik*gqp irlthvn 
tgrnlhshhftsplygn/qval*gdivi 
il*rrkqrlkgfteegiklrfkevsafg 

DEGEGDYLDDWTVLCNGPYWVRDGEVRF 
NT 


50 


13951 


A 


53 


3 


495 


AMEVICAFAETHIRGFTLNDAANSRLI I T 
QVRRHYLKEAATTLKTVLDHQHTPSRLA 
VTRVI QALAM KGDV\ DN I * VFHKMLNGL 
EDS IGLS KMD F INN I ALAQ I * NSNLDAS 
V*HrEMMLTS*NWVIEPQYFGSAYLFHQ 
VLEEQLEPTVEKISIMAERLAKPLQ 


51 


13952 


A 


55 


1 


428 


QERGTKKEMEDRMTLEETK\EQILKLEE 
KL*ALQEEKHQLFLQLKKVLHEEERRRR 
KEQSDLTTLTSAAYQQSLTVHTGTHLLS 
VQGS PGGHNRPGTLMAADRAKQMYGPQV 
LTTRRYVGIAAAFAGTPEHGQFQGRPGG 
VYG 


52 


13953 


A 


56 


2 


453 


EDGDLDAFSSDEGLTMAMSYLKDDIFRI 
YITETQECRRYHRPPCAQEAPCIJMVHP I 
VICYGCYGPEVGTRYTC3VCPDYDLSSS 
CKGKGLHRGHTKLAFPSPFGHLSEGFSH 

TSCA/GEARLGP 


53 


13954 


A 


57 


3 


435 


ELNTSIFRSRPIEGLGLNTVLTTDNSNL 
VI\NRIGIV\PSVTEKEYTDPSSDGTYA 
WKIFSHETITKAQILKLFLSYDYAVNWP 

WIAYPHYKSPEKCPSIILHDRLYYLNGL 
* CAASAMVMI AIVTYNVALLAYHRWNGL 


54 


13955 


A 


59 


409 


1 


LCCEHRGKTVCVPRGKFTTLNAS INKER 
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SEQH) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

26 


Predicted 

nucleotide 

location 

correspond 

seTete 


Predict- 
ed end 

location 
correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=°Alanine 
C=Cysteine, D=Aspartic Acid, 
£=Glutainic Acid, F=PhenyIalanine, 
G=Glycine, H==Histidine, I=Isoleucine, 
K==Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Cilutaniine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SKISVIiSFRLMKLKKKYVK*K*\EKEKI 
MniRIRIEKIHKAKS/WFL*HtfKIE*TIR 
MINHQKRENIQITDFKHERTELPPDSTD 
I KKAIRKY* K* FYPNI FYNPEEMNKF 


55 


13956 


A 


60 




393 


GHVSSCGDHPCEG*LCP\PDKVMS *GI * 
VPEEACTQCIGEDGVHHQFLESWVPDHQ 
PCHI CTSLNGRKDNCTTQTCPTGKAS TC 
GLCEIARLRQNADQCCPEKECVCDPES C 


56 


13957 


A 


61 


250 


1 


HKDr:KRVI,yKDKV55I,r:CAGPCSVA*SCI, 
KAASTFQ\VKHFSLQSSWVYRHAPPGLA 
NFFHFLQRQVLTTPPGLVLNSWVHAI 


57 


13958 




62 


365 


3 


IKKKL IWRLFTFPKPSWGKKTKKG I PFP 
*GPKKI PGI KFPQGI KKAFKGNR* TLGK 
GN*KKKKKKKECGPGGDIPCS*IG/RNI 
VKILIIPKAIYSFNTIHIKIPKTFFTEI 
ETTILKFL*NH 


58 


13959 




63 


245 


558 


FL PTQVIS YVKRALAEGAQ I * CGEGVDK 
LSLPARNQAGYFMLPTEITDIKDESCCM 
TEEIFGPMTCDVPFDSEEDVIEEANNVK 
YGLAATVW/SSNVARV 


59 


13960 


A 


64 


106 


532 


EEACQSGTSGQGGyPRRAIGAPR/E/DA 
FTGAVYiyHGUAGGiVPQYSiviHPKA*SL 
YPSGQ/SVAARGNLSGDSFNCHN*GLQG 
LLLASSViRPVMT.TiNDPIIYTTKN/lSG 
PKCHE 


60 


13961 


A 


65 


2 


548 


VQKYPQQNRS CVPPVAEWAVPQS SRLKY 
RQLFNSHDKTMSGHLTGPQAETILMQSS 
LPQAQLASIWNLSDIDQDGKLTAEEFIL 
AMHLXDVAMSGQPLPPVLPPEYIPPSF/ 
RRVRSCSGISVISSTSVDQRLPEEPVFR 
DEQQQLEKKLPGTFEDKKRENS*RGNLE 
LEKRRQALLEQQRQG 


61 


13962 


A 


66 


72 


952 


SSRTYTTSLPLLKKEVTRKIRKYFILNN 
N * NTT YHNLWDLTKWFS GKLLF * RSR * 
EYRKDKISDLCFYIQMLE/QRQVIKPKV 
STRKE INKSRTQ * SGKETMERPKLEDKF 
LVN I NN INN P * * LNL * RKKEGHH C I F YR 
H*KDNKGYIYANNFNILDQMDKFFERYV 
TKMD\QEQIVSLNIPIADKSNA*LNNFP 


62 


13963 


A 


67 


139 


422 


VNGNEPEPL*KGISRHEHRRQPHNGFRP 
KNKGGAKNQ*ASLGMKTPEAPAHSDKPR 
RRQHftAYSS*AMPFLGIClSFSQCWL/C 














NLKNKAVI TKTA* H * QKMTDQWNRI KNP 
EIN\HPFYGQLIFDKDAKNTM/WKR/DS 
NFNK*CWEN* ILTCKKMQLHP/SLTTNT 
KLNSQWIKDL 














KI IGEDGDT/PLSEMAKSQRQIFSKDLV 
KLDSTINQVDIIDIYKLHHP\KWTVYTF 
FSSH*GTFSKIDHIi:iLDKTYPHWL*RTG 
IMQHMLSDIIOTFKLEISTRKVTEKS*HT 
MSLCNKTLEHPStSNEDASAYLK • 


65 


13966 


A 


70 


57 


423 


KDKNSQVTSEEEEQEGKIKENLNSWRDT 
R/ CH/WIGR \WI VRLS FLPKLI FIFWTV 
PIKIRTQFFMKLNKPVLKFIY*TKRTRI 
AKSLLKKK/RQEGKIS PTNRRTYYEAYS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 

Gt=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TLESIGASMPSLDLG 


66 


13967 


A 


71 


394 


1 


KHFRCMLQEDCRIIiLLWFFSETGSPCAV 
S PRIECS SKI IVHS LKIMS SDPRASLS 
*VAGTIGVHHHTQP*GI/CNI*GSGRRR 
SEKQWIKEKNVSSEAKRRRTF*NE*SV* 
KGDYHKDQGLELGTVAHAYN 


67 


13968 


A 


72 


485 


1707 


salglsq pr p pgas daapds tpagv* ta 
sssapagekdanqmewarrdpgplhqaa 
paph* shlvrdbiraqerlrktsrgpphs 
hrsgpvdlsprsshsrdstpvhgpyfrs 
apd prcs apafrgp i atrg r / r prapra 
tashvplss s swke peeataprf* asac 

tatgekstpspftgvwrfgveact*ags 
ptahiqhsqrtwqpvppctngsrspwts 
s ypknoskvgtvpkrgwacqpi petavr 
qat i s q * arkgaarphhqarq vehstq / 
qhpsgdsgkfdqrs lceas tllqperpk 
vqgkickfrv/gerrrtaspnsavpeke 
trtqtmvgtafigcsgkwrrvysvrr/r 
sspripppgvgssXplktpppp 


68 


13969 


A 


73 


1 


462 


QDHRSKSSHSNKRPSIiASSLSBNFFQRA 
AKPLQVWKWKKLYS TPIiAI PTCMGFGV 
HQDKyHFLVLPSLGRSLQSALDVSPKHV 
LTKRSVLKVACRLLNAIJCSSLKMSt'IVyG 
RVT * KYT ,1 , * F / YRTTNRGLL * S MGFAFR 
YLPHGDNPAHLERGN 


69 


13970 


A 


74 


307 


436 


LP *VGCI TEGI PEQDMVRGKHKLLRQEK 
TRLIGPNCPGVINPGBCKIGIMPGHIHK 
KGRIGIASRSGTLTYEAVHQTTQVGLGQ 
SLCVGIGGDPFNGTEFIDCLEIFVNDSA 
TEGI ILIGENGA/NAEDNAAECL 


70 


13971 


A 


75 


1 


435 


E I SDSKAQLRAMAL 1 1 DTWERMWCFSE * 
NHE PLRTHCALAAS KLLKKPD \ QAE\RE 
HLCTSL\WS GTNTDKNGEE LHGGKRVME 
RLKKALI lAHQGMDPSLQVRVFIEILNR 
YIYFYEKEKDAVTIQVLNQLIQKIIiEDL 
PNLHSWK 


71 


13972 




76 


58 


366 


EFPDLVKDMNLH I QEAQCI PNKINLNKIi 
MNRHRII /RLIjNTTTKKRILKAAKQK*N 
I/T/lRGSSrCMMMDFSSEITEDRRKWH 
S I*KILK/EKTQNCLPRVFCPVKI 


72 


13973 


A 


77 


1 


445 


yhetgcflmgaivditltfntyvhfqgk 
mkgfslladpqefmvdnstsmsapmlsg ' 
mgtfqhwsdihXdnlsvthvpftdsacl 
lliqphyafdldkvegltfhqnslwwmk 
klssrtihltmpqlalqgsydl*dlldq 
a*lsdiltp 


73 


13974 


A 


78 


53 


444 


erggygagpvawqflvpstapmlqsppl 
gfaidhtppvpapaet*apcplpyaa*rt 
tgphhiahldttfgtgdtphps s pas pp 
stpwlpdafpcplptwd/rppclqplls 
slptprlpflcflllapyap 


74 


13975 


A 


79 




353 


HIRTEIiDYYGLTWIIYSDT*EAYNYIY 
IMVT*NVYKPQLWNIFDRATMHSQDVRH 
HLLCIRLMLKNPKHHAV*/VLNGHYAFV 
SR/SFKHALVQYVQAFRTHPDBPLYSFC 
IGLTFIHM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 

iocation 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A'^AIanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl{nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


75 


13976 


A 


80 


1 


350 


LYFYALLFLSSG/ CVAYVATRDNCCILD 
ERFGSYCPTTCGIAYFLSTYHIKVDKDL 
QTLEDILHQVEDKTS * FKQL I KAI QLTY 
NPDDSLKPNKIDAATLKSMKML* * IMTY 


76 


13977 




81 


76 


386 


paypqvkcpactpascirptnaevlstt 
prgksvaisahsvsfsahrgvtsviklws 
akrlh*yga\kvrfns\gctp 


77 


13978 




82 


1 


360 


ESILQEDITVLiSVGAPNINRASKYMRQ/M 
LVELQRQID3SIIICEDFNTLISEMDST 
RPKISKDIVGI,NGTIN*LDIIDIYRLL* 
PTTAEKTFFSSSEGAFT*MDHILCYETH 
lYKFKLHE 


78 


13979 


A 


83 


39 


344 


WGILEGGEYIWHVGGGKSLRASGGLWSQ 
PPRQPDDSRPLVLVPVCHLLPHWIGPTD 
LGHKRQW/MGPGAVAHVCNPS TLGG* GE 
WIT*GREFETSLAJSrMVKPC 


79 


13980 




84 


1 


427 


qqahlaghhrihtgekp*kceacdkvyr 
gkssl/mk/hrrihtgekaykceechkv 
ysr/n/sqtvkdrrihtgekp*kcks\c 
dkafghdshlaqharihtgekpyqcngc 
gkafsrqstlvyrqaihgvgkly*gndc 

HKVFSNAT 


80 


13981 


A 


85 




307 


twtqrrakl vrr i gwallp f pr p s gs kp 
pp*tppalppcvp/pqsspwtppqppap 
dseevskdwtqmesfken 


81 


13982 


A 




5 


375 


NSLLNTHSEGPGASHCTFWLHERASSRD 
LTGAESYGICRLRRLISLSIVFSKSIHG 
liflCISILFLFLFLFLFFVKTGSWLCCPG 

wpqapelrqssllsllsswnyrr*pphq 
lif\dffcrn 


82 


13983 


A 


87 


2 


342 


viknedhyimikcli*qedqtilnlhsf 
nktniniykphmtnlqbtavdkititv*t 
cgtslsiiavcrlklvkl/vedlsniin 
kldlm/dniyktlhlnirdytflkhtwn 

lYICN 


83 


13984 


A 


88 


1 


338 


CNEPRSHHCTPAWRQSKTPSKKKKQKKK 
LTTS / CI KASMKS RVQG*LR/ CWAQVMG 
* *GGVLWLFVK/REFFTLSINLSKEGHS 
RRVPCLGCLKKKKKKKKKKKKKKKKKKK 
KKFF 


84 


13985 


A 


89 


931 


2 


VAS PNALPE *LLP VSGHTHDLERVGVAR 
LGQPHHTELTACRARMESRDTCPGVTLH 
P*PPSFPSSSSPSPGGPRTRLTHQGAGL 
EGSQGPLQSQNPSAA/PLGACRGGWEWP 
QGPGSGS * GG* LMLRELRE /WQEAAVQL 
PTPG*PS*ESGRPLPSSASGPTPPGSP/ 
SP*PGTQGLCGCHPSGLCATAAGPQDGS 
GPTPTPH\PVQPSCRDSGPGQRLSPTPS 

S/PSPQRESPSPQLPCCTPGSFAGHPCP 
AP\AAPSSVACPLP/PDGFPRAPASSGI 
TTAPSPGPDP 


85 


13986 


A 


90 


66 


464 


LWVYSSFPRLPLHSGAESPEGDILWPGL 
TLFLSPSPSPLS\CHSHLSPTCRRPRES 
PRFSr,CWRPHPPSKPPPG*VKRDCCPSP 
DPVRLSVSENPSAGPR/VFLRPP 


86 


13987 


A 


91 


197 


2 


tIAEIIPAPQKVGPFIKIWVINFFFPFFF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, I>=Aspartic Acid, 

G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^ProlinCj 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaUne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














FFF* DRVLLCHPGWSAWQS * LTVASNF 

*f\k*ssclt 


87 


13988 


A 


92 


317 


2 


SRTPDFR*STRLGL?KCWDyRCEPPCPA 
TSSF**HLSLC*QGNL/WGPPDSIRMG 
AG\TRKTKCWEGCHFLFFLRRSL/NSVA 
QAGVQWRDLCPLQSLTPRFKRIiS 


88 


13989 


A 


93 


1163 


0 


FAFGFEM*YCSVAQAS\VQWYDLTLLQP 
P LPGVQSDS PAS SLPNKLGLQGTSHHAW 
LNFL 


89 


13990 


A 


94 


3 


363 


VLWPCRPAGPSLGLNFPLYSWLRLQTFA 
AIRPGSTGRRLCLPNWVFTRNSLPFH*L 
VC/CSSRHNTYLQECTGHREPTYQLNIH 
DIKLLFLRFAMEQSFSADTGGGGRESNI 
HLIPYIIHT 


90 


13991 


A 


95 


17 


353 


PEPR YVGF I SGGKVDIKKTWKTQGRLER 
TVYYTGMYFINCM\VLKNTDSSWGTSP* 
* IQQHaGKRD]Sn!IYQLL\KLQQ\QFSEAY 
TKCYSNPDSTHDVRKVYQDCPLLAPLND 
TS 


91 


13992 


A 


96 


3 


355 


ANYPRDLCKVTDEGGYTKQQI FNADQTA 
LYWKKMP VSRTLMEREEKSVSGVQASKD 
RLTLLLGGS VAGDVK/ LKPMLI YHSENP 
RIC*SYSLPVLCKWN/NKTWMTAHLFTT 
WFTECLNPL 


92 


13993 


A 


97 


367 


2 


SLHRARRGKGVGVRMGERLPSFQSWRLR 
L/RRRKLCRRWGQGPRGN* Q* QKPDGGA 
RASQTDGGGERRRGVQTGSQAERDTSRE 
TYSPRGIQREI VEVS KGGKTGRQWOS OR 
QIQREKRVGR 


93 


13994 




98 


115 


359 


LNFFFYLLNFMLLFHRYYALKVSYFKSS 
LDRKLLELLWNKYWVNTLSSSSLLTVS/ 
DYTCKECI*ILSKLHSRLISETLFHRK 


94 


13995 




99 




490 


CVDPRVRTIFKKDKVGGLPLIjNLCIYYK 
AT VT * T VWYWHKD / RTDQWDRIKSLEIN 
PCIYGQLIFFNYYFFSQ*CQDSSMEGNS 
LFNKWC/ CRPLTL YTRIGSKWI KYLM* E 
LKLYELSLHDPEFGNGFLEITSEVEITK 
eeigkld\laklktfc 


95 


13996 


A 


100 


195 


3 


SRCCKQRHLVQWNRTEKPE/YGRLIFNK 
GVKTN*WGKNNLFNKWYWDN* ISTCKRM 
WLVPYLSPYT 




13997 






1 


371 


FSLIKISMMLLMKMEK*NLQFIWN/HRR 
LQIAKARLNKKNKTEGITLPDFKIYYKA 
WWYWHKKRHIDHWNRLENSNINRHICS 
QLI LTKVPGANTKDHPFNEWSWEN\ VCT 
KMKPDP/YLSSYTK 


97 


13998 




102 


6 


370 


KK3TI PNY? * RS * TDRSKPN*NYATKEISI 
YI/PIS*NKKIL* KLAETKIQQHI KKKPD 
NRLFYKIQFfiSII,'?'^?ITINQISKPKEK 




13999 


A 


103 


29S 


24 


KCWTSIYLLLFFRDEVSLCCPGCAQTPR 
LK*SFCLSFPKCMDYSREPLVPGLDQ/L 
FLESNRSVSVLKKNVP*WFSNSSGSKSG 
mmAFCGIS 


99 


14000 


A 


104 


2 


352 


GTIADFTQCCQAADKAACLLPKLDELP \ 
QWGRLRSAKHRLKCaSLQKSGKRTFRAW 
T*AGLIHRFPIAEFAEVSNLPTELTTVH 
TECCHGDLLECfiDHEADtiAKS ICENHDS 
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SEQID 
NO: of 
nucleotide 
sequence 



SEQ ID 
NO: of 
peptide 
sequence 



SEQID 
NO: in 
USSN 
09/515,1 



Predicted I Predict- 
beginning ed end 
nucleotide nucle- 

location | otfde 

correspond I location 

ing to first^ | correspon 

residue of ! last amino 
peptide j acid 
sequence residue of 
peptide 
sequence 



Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 



361 



KHTEAF*MPCAKYY\LFGGLNQICALPE 
KTPVSDRKTKCCTESLVNRRPCFSALEV 
DETYVPKELNAETFTFHAK 



ELEMIMLSEBVMLKAKIRQKLGLL/QPV 
S * WNAKEKFLKEIKSATPVWTQMI \KN 
SLIAD/MEVVWIDDQTSHNVPLSQSLIQ 
S / K/ALTLFS SMKAERGEEAREEKLEGS 
RGWFTRFKER 



QGIMMDTVEYVGKGEPVRCWWKCKLVAIi 
L*KTTWKI,LRKLKI * \ YDPAVPLT.DIHP 
KEIKSIYQRDSCISMFTA 



LGYSGVRAPLEEAVCRFSDLiaRAGRW 
TLFKADRQGHLSLQRFLLTFDSLCP/AP 
RGGVYRGRQASLSCSGLHPVGASWPRCL 
PTQASAMAGAPAAASLPPCSLISDCCAS 
NE * GSVSVGPSE PGTGHN 



KWGEKWYWQKMTQIVQWDRTESPQID 
N* SLTKEI QWRKDS LFWKW* GNMtJTAPF 
SS/RSLNLNKDLTAVTKIKSKWVTDLNV 
KHKTIKLL 



IMSIILYYGSICYCYYWLID*LILRWTL 
ALVAQAGVQKRELGSLQPLP/PCLSLSS 
SWDYRPP 



RSFFWVFFFEKKSLLGIPGGKKGPPLN* 
LEPLAPGPKBFSGLNPLEKTGNWGPGPPC 
RGDFGFF*KKTPFPF\GPGGVKTPNPGE 
TF/EPKPPKGVGFPGETPGPGPGENLYQ 



ASIWLHI FLFLSYFLEKBPRYVAQSGLW 
LLGSTGPLSaSRAAGTTGaLHCTQL*TY 
LLLPKHGKAVFF\QETLLLRSTHPFPLK 
HVYTPTYELLLVWDRDSLSHPR 



GCCPCLLFPGSPTVFGSLYS IRLVAFW 
AVPPVWKAVPG*PPVKM*FFHLFIYLFL 
KQ/VSLWHP6WS*LAQSQVSRFFPRSLP 
SNMDYRGAPPRLTLFFFFFKQSQGIEflM 



RMVS IS* PRDPPAFASQRAGITGVSHRA 
RPVYSFLLLSNVPVMDVPQDIYPFLVDG 
YLSLPLVCC:V*V*VLLCYPTWPPGLKRS 
SCT.SLPSSWDYRHEPLKPA/SCCVLLS3 



PSYPGDLTEIMDE66YPNQQIFSIDDTT 
FYWKKMPSRTFLTTEKSIS /GF*ASKNK 
LTFLLEANVAGDFKLKPVFTYHALRNPR 
TLKKYATSTLPVLCKWliIlIKR*VT/AHVF 
TrWFTEYFKLSVETCCSE 



CAAGFGLZj* TPPRT*TRKPREHGTYKP I 
SLMNINAKILKK/ILANRIQQHIKTK\L 
LQSDQVGFI 



FNK*CWHKWIFTCKGMNLEPYFTPNTKI 
WAIKAIQLLEENKEVNLC* LGLGHGF/S 
R1.PKAQMAKETIDKLDFIKLKALCSK\N 
TIKKVKKSEELEKIFANHIHGTS IQNI 



AADSSTHSLTKPAYLEKQNFFFFQMEYC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














L I PLAE GP WPNLA* MQPPPTGFKQSSCL 
SLPSTWD*R/HWSPRLaN 


115 


14016 


A 


120 


1 


371 


PSyP*DLTEIMDEGGYPNQQIFSIHDTT 
FYWKKMPSTTFLTTEKSIS/GF*ASKNK 
LTFLL*ANVAGDFKLKPVFTYHALRNPR 
TLTKYATSTLPVLCKWNNKA*VT/AHVF 
TTWLTEYFKLSVKT 


116 


14017 


A 


121 


82 


367 


YSKRSNTIVAGD/FTPLSALDSSSRHRI 
NKKTSNCTIDPMDVIGIYTVFHTVSTEY 
TFFSSANGPFSKTDHLLGYKTSLKTF*T 
KLK*HQ/CIFSDHN 


117 


1401S 


A 


122 


27 


253 


MKTENILGENIGEVPGVGKDSLDMSPKA 
* TIKEKNDKLFF I KVKNFS YS KHT I * KI 
KNQATVWQ/KLLTVHKSDR 


118 


14019 


A 


123 


3 


305 


GTRQGCPLSPL*FNTVLEILVR*ISQNK 
*INKT/SLTADP/MVLHIENPKGSIK*V 
LELINKFSQVAGYKINM/QKTVAFLYTN 
N*LSKKEIKKTIQFIIASKRT 


119 


14020 


A 


124 


1 


327 


QVIFCLGLPKCWDYRHEP/RMPGLRG2VD 
FFSLI *ILWEHMF*VMCILPHP.CGDFWA 
MIjNF*EREGMFF/CIiKRWKSHNVSQA/G 
PQTPRFKRFSRLTLLSSWDHRHAPPHLA. 

K 


120 


14021 


A 


125 


74 


334 


I FEFGVLKVITCLSVSSHEVBKLCPFFF 
FFKNGV* FCCPGLQGYNGS *L\TLELLK 
QSSCLSL*CTQDHRCLPPCIiAISIFNLINS 
SIH 


121 


14022 


A 


126 


381 


60 


tvsqvlahtcspsylkakaggslepgsc 
ypgcsel*schctpawit\sqtpsl*ki 
h*kki1vkraivecvcvyvytyiwllil 
kfsdsipfaqiwgfheefpywnqc 


122 


14023 


A 


127 


323 


2 


rgflglsdevsgqqlfs ilse lenqfkn 
nlegcggscii/sipkcwdyrcqpsrlay 
a*lifvflve\tkntgfrhvgqaglell 
tssdlpValasqnawltgishhaepy 


123 


14024 


A 


128 


2 


354 


EHCQINNLREHLRKPKKEEYTKPTASRR 
K* IVNIRIERNGIKQK\T1KKINELKSW 
LFEWINKIDSHSGKWMVGEE*CVFVCVC 
VRERQSDRERETRLIKI*NERGDITflDL 
TEIK*LLrLQKLNNY 


124 


14025 


A 


129 


74 


327 


GEIifiMLPRLIIiNS*\PCNPPA\SASLVT 
GTTDTPLYPAHYYY\ I I I I I I I lEMEFH 
S\VAQAflMQWCDLGSLEPL\PPGFRRFS 
CLSL 


125 


14026 




130 


83 


382 


YEFHRSDGGSRLSSQHFGRPRWKDYLSL 
EVEGQPGQHSETPSLSLSLSF/CFLEME 
SCSIAQAGWQWPYLGSIiQPLPPRFRRF 
* CLSLP KCWNYGHEPQL 


126 


14027 


A 


131 


749 


2 


RQSLALS PRLECSST ISAHHNFCLQGSS 
DSPASASRVAGITGVHCHAKLIFVFLVE 
TRF\THVGQ \AGLELLTSSDLPALASQS 
SGITGMSIIHVQARCVrF/CGPTFICNQDM 
KY*NKKEKNQSTFLEHLQNQQENRHAAH 
TLHHHTAS/l,KSSEFLFAIHTSFLrS*K 
RSGTTGMF*YN/WTIGNGVEDRFVLGPP 
FGLGVQWYHHSLLQPQSPGLKQSSLFSL 
PSS*DYRHVPTPSFLKFLRRWGrAILLR 
L 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 
last amino 

residue of 

sequence 


Amino add sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


127 


14028 


A 


132 


371 


1 


KKNLNLNLTL * VKI NTNW IMDL T I KYKI 
IT*KKQEKVGNLEQGKEFSDLTPKV*SI 
KGNTDK* DV* KP FSL*KPM* GNYRQATD 
* KKMSAKHVSNKYP\LSRIYKFNDKAKN 
KPIRKWAKQMN 


128 


14029 


A 


133 


2 


318 


iihkiilkdakkafdkm*hpftmkilnk* 
gieqyat/diikamcdrptadi ilngek 
l:<afplrsgrrpll\fh* iw*vlahtvr 
prkrnkgsqewwlmalipstlgg 


129 


14030 


A 


134 


3 


362 


WSELLGSSDPPALASQIAGI IDMKHYAQ 
PGTDF/STSVYFTLPTQIRL*AL,KRKFL 
L/EFK* I FAS I *QFAKCSCMSRNFLPAQ 
PYHHWHFPNEETEAQGNFSLLWAC/LRE 
DWIPTD 


130 


14031 


A 


135 


12 


398 


KCSTS I VIRE IQVKTREIYHYTPTKMAK 
IKKPDNAKC/WSGCGA'rKTLIHCW*NN\ 
SLEIWLFLMKFNICLPCDPEIVPLDLYL 
REMKTYIHENTYTKVSIAPLFK\QPKSP 
STGESINCSI I I PRMECI I 


131 


14032 


A 


136 


309 


1 


QSEAPSQKNTKKQKTKVGGLTS *FQTYH 
NATVIKTVtra/W/YIGIDIDQWNSIQSP 
EINFHIHGK/LISNKAAKTNEWGKNSLF 
NK*HKDNRrFTCKRMKVDSQPHI 


132 


14033 


A 


137 


2 


251 


QCGKAFRAASVLEMHGRTHPEDKPYECK 
Q*GKAFRSASHL*MHGRTHNQEK/P/HE 
CKECGKPFESAQNLRIQ*RTQaHIRMHS 
G 


133 


14034 


A 


138 


396 


3 


LELLTS * SACLGLPKCWDYRREPPHPAK 
MIL*SKSS\LPFLCSKCL]SmWALU[.LL 
CSLVPLLTLPLKEFYAaH*SPDSLKYPV 
liQKLSKLEMLVLHETVQRKEQLTYSLYK 
PIFGFLVGYSFPLFCSYKTS 


134 


14035 


A 


139 


3 


384 


LDFIKMKNCCSSKELCGKYEKP*T/MEK 
MFAKHISNK*FIFRI*K*LSKLTKKK*A 
KDLNRYSLKEDIQMMKCM/KKCSI*LV 
IREKQTKNTMRYY*NGLS*RS /GQTK* * 
QGWGGTGPLIHCWWEYPMl 


135 


14036 


A 


140 


63 


38S 


FMYLFIY*LER/CLTLaQTKVQWCDHSS 
LQPQTPGLKH\PPASASGVRTTTIMPG* 
FLFFF* KRGPCWVAHRRTT/WLNL6SF 


136 


14037 


A 


141 


370 


3 


GPGHFPALFFKGPGLGPWVPPI IPALLE 
G*GGGFPGP /RGSGPPGQPCFFLKFQNF 
PGFVNGAPWPGFPGNLGRGTALNPEAH 
GSINLKGPPGLQPGGKTKLFFQKKKKKSf 
QHFQKTQLILCF 


137 


14038 


A 


142 


363 


2 


REKATEENFATS * DWLMMFKEGSHFHS 1 
KVLGETPSADVETWSCPEDLAKIMDED 
GYTKQIFNVGET\PSRTLITRKEKSVPG 
FKASKIiQRIG*IiLIiGVlTAaGDFKLQSVI. 
iVIDYSENHAE 


138 


14039 


A 


143 


99 


375 


KNKNQNISPPLWYTLLSPLLLPFI INS 
FFFFFFSF/CFF*KKSRFBPEVAFQGGN 
LS*LNPRPKGSKEFSCLSLQSTWITGGP 
PPHWADFDF 


139 


14040 


A 


144 


39 


399 


LQIIYNYIWLYIWLYIFWLVSKIYKELL 
QINMKETITKNGQKT* IDVSSNHMYMNG* 
*LHEKIHDIVIR*MRIKTTVTYHVMSTR 
MVIIKETENKYW*GYGE\KILIYCWWEN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last^amino 

residue of 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryplophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FEK/SVWPFL 


140 


14041 


A 


145 


358 


3 


rehtaedkyeasrswfprfkek\splrn 
i ivqgeaattngevavs y/pedlaknid 
eggytqqif\nadetf*ywkmlfrtfia 
reks\rlpsrdrltlllganaagdfklk 
pmfiyhsdsc 


141 


14042 


A 


146 


3 


376 


AVMGMYKINWFVPTNTS ILQP id*gvi 

ltfrsy*lint\fcntigaigsdssdgs 
gqsklktfwkgftipksxikwtcdlwee 
vkistlmgvwknvipalidlferlktll 
eevtaymvnigrrl 


142 


14043 


A 


147 


374 


161 


EFHRVSQDGLDLLTS*STCLSLPKCWDY 
SKWQRRI*TQ/DLCDSED*SSIKPIQLC 
QELMS *MAEMAHGQP 


143 


14044 


A 


148 


373 


2 


FFFFFFFCDLLLFPKQPRSHASFSLPSV 
VPFPEMPSSSLTLFYLFVYNLLR*YL/N 
SVTQAGVQWHDHGSLQPRP\SLLSSWDH 
WCTTPCPAN* IFFL*RVGLAMLPRLLLtJ 
SWAQAILPLWSPK 


144 


14045 


A 


149 


3 


282 


ILKRIIHPDQVFFIPAMQGWFNI*K*IN 
TV\NHIIRIKGK\HRMIISLDTVKAFGK 
IQQPFM1\*KLGI KGNFLNL I KGT YENP 
QLAGHSGLCL 


143 


14046 


A 


150 




360 


DII*ESKFQGHHTTGVQKGr,QyGIILFI 
I *EVFFFAGFFWAFYHSSLAPMPELGGH 
XYPPTGIFPLKRLEVP/LLNTSVLFASG 
VSVTSAHHSLIEGNEKQIILALSITITL 

GIYFTLLQI 


146 


14047 


A 


151 


349 


3 


kkssgshletkswrlpgpsqgpwckeir 
kgslawqekswppykpfllevgggrvir 
gsrmtrhghlirssktliichhvpk*tvq 
fnfclfifleiviesrswaqagvqwcnig 

SLQP 


147 


14048 


A 


152 


500 


0 


MPSRLECSGA/ISDAQOSILQNSGAPSEP 
SSALSNPCG*DYEDTLPLPGYFLKFFVK 
TRSRyVTPAGL*LLaSS\lPPTSA/FSK 
HWDYRHE 


148 


14049 


A 


153 


373 


2 


QAGFTLLTS SDPPASAS \QSAGITGMNH 
RTWPGNF*PQKSCDSFVTKLMCTCHKKH 
IYAQSLVT*LGHSYIFF*DSLQPSPPRF 
KQFSCaiSLPSSWDHEHTPPCPARTQLHF 
GYEYFSRHFtRRR 


149 


14050 


A 


154 


332 


3 


THKFQRLTSKSKKLKRKLIFHQGAQKSN 
SLNPDIKKNLKRSQENME*EKIFTDHLS 
SRIYRELLKLKAK/INLIKMGKGYFYEG 
DT*MADKHIKRCSISPIIREMQIKTT1R 


150 


14051 


A 


155 


337 


21 


LGLLPFFORCPAKEGGI *RGSLAAVALL 
NCGELHPV*ASWWLCL\PVRGKLPTEAS 
VMGDAPPPTKLECPESTSDCCDGSKNFE 
SVDLSLLGSVGVGSVDLDPVAP 


151 


14052 


A 


156 


21 


m 


HPGSRGCSEPRSGHCTPAWGTKVKTPAS 
TKTKQNKKKK*RTVCLTPWHTLL/RVWC 
DGGDYSSSLL 


m 1 


14053 


A 


157 


3 


394 


REDLLS PGFQGCCEL* LCHCI PAWVT/ S 
ETLSQKKKK 


-153 


14054 


A 


158 


36 j 354 


LSAAFTSQLI.GRriRQENGVW\CSEPRLH 
HCTPAWATE *DSVSKKKKGKP FWI PNPL 
SSLSPAFISGQGPWESHLKNPGRALQGG 



521 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

correspond 

residue of 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 










Ts 


340" 


GFQPLICPRPGRTHMSLKSPWRP 


154 


34055 


A 


159 


WPHKAORGILLAELYPSGSAGHSIEPK 
TITLFFLGNKVLLCCPGWSAVA*S * S TV 
ASN\FGPK* LLCLS PPSSSDSRYARPHM 
ANFK 


155 


14056 


A 


160 


2 


362 


HLSPLSIPQNRHCHHGPFSVSCWAHLPD 
GWAGQRGSSLP/M*GRPGRGAPSPPRR 
GGWPGGGLrPHLPSGRGGWPGRGTP\PS 
Q*GRPGRGAPSPPGRGGWPGGGLTPPPP 
SQTGRLAGRR 


156 


14057 


A 


161 


293 


2 


PHLAIRPPTTVDKSALYWKKMPSRTYKV 
EQKSMPGFKSSKDRLTLFLGAK\AAANF 
KLKPMLID\HSKNTKALKNYAKSILPVL 
CKWR\HKA*MSPHLF 


157 


14058 


A 


162 




136 


LIVPILIAMAFLRLTERKILGYIQLRTG 
PiSIWTPPTGATYRLLS/P*QP**LFTKE 
PVLEPVTCTITLYITDPTLALTIALVL* 
TPLPIPNPVGNLNLGLLFILATCSLAVY 
SIL*SNRNGIPKTYRTKNSRLYTTTHRP 
QRLDAAYWCYVQTPIA 


J 58 


14059 


A 


163 


1 


464 


RQGL/DSVTQAGVQGRNLSS\LQPL/PP 
GLK*SSRFSVSNSWDYRCAPPHliANFFN 
LFFVQRG\FTMLPKIj*TRS\GPGD 


159 


14060 


A 


164 


1 


353 


FNYS S S LGNKS E TP PQKKKKKKKGRG PG 
ENPGTPTFGGAKGGGNPG I KI LKTTGPK 
GGNPCFKKKPK*PHRGGRP/HGNPVYRG 
GPGKKNFWPPGGGGSKG*KEPIKPPPGG 
KKGDFF 


160 


14051 




165 


45 


389 


FFVCTFCRERSLLCCRSWSGTPGLK* PS 
CLSLPKFSDYRHKWPCSARNFLPACLPA 
CLPPSLPSFFPPS/SPSSSPRQGVTVIQ 
VRVQWRDYGSLQP* PPKTKRSFTrAFQD 
LGPQ 


161 


14062 


A 


166 


330 


2 


WDYRCAPPHPVLYLKM/STFNIFFL*RG 
VTMLSRLVSSDPPI SASQVARI IDVSHW 
AQL*RSV/CYVFETGSGSLTQRGVQRYN 
HGSVQPQHSRLS'SSHPSLWSrYRYPPPC 


162 


14063 




167 


2 


396 


YSNVLLGIYPKELKTLVHWMFICV*NTC 
IWiyiFlTALFLa\ES * CSSVGE * IGKTWY 
IPTMKYYSVMKRMAISSHEKTWTRLKCI 
LPSERSRCDRLQTV 


163 


14064 


A 


168 


486 


2 


lifgkiykinnssktdnkrgktqiqilg 
mktgdimtyXqqtskeyyahefmnleem 
dqlfkkhkrp/rihqyekhhltgpmtin 
efefinvktpknqspgwddfigkfyqmf 
eeflsenktgelil*s*yysltnsfyea 
sity*pkpdrq*kkqscgp1sfm 


164 


14065 


A 


169 


200 


3 


GRVDLPPTQESRPPGPPPSDPGVQAPRP 
SIVRNCGILTRGSPGPD\PSPLSS*AQL 
CGSPPPPSFS 


165 


14066 


A 


170 


2 


327 


PGGIGCGELR*HHCTPTCaTERDSV/'WN 
KIIHYIIMLftSPNELILPLLNICLEIL* 
AGPLTAICISMSMATL\FTIAQMGKQPK 
CPSTNEWIRKMWYIHIMEYYLHFKMK 


166 


14067 


A 


171 


2 


343 


PGWSQTPDFR* STCLGLPKCWDYRR/AA 


167 


14068 


A 


172 


367 


3 


FIRDVQFISALRYLLTPERMA/MIKKSK 
NMRYRCGCGK/RG/rLLHCW*KCKLVQP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro/ine, 
(3=Glutamine, R=Arginine, S=Serine, 
T= Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














L * * SVWRFLKELPLHSVI PLLGI YPK/ D 
KEVIYEKDSCILVFIAVQFAIA/RNIHP 
TYCPINE*IKKMWYIY 


168 


14069 


A 


173 


102 


345 


YQLQNI PRWVYSKLQNLSFAFVLL*RDR 
VFLCHLGWSSLL/QLKRSFHLyLASSWD 
YRQVTPRPAN*ILFRD/RGLAL1.PR 


169 


14070 




174 




401 


STRLGLPECWDYRCESSCPATYLPF*HS 
IPFLSQLKVQVHHRRYFHLFPELSWLPL 
LSKNVS*TY*KSFLS/WPSLRVLFCFVL 
FCFVTGFHSVQARVQWGNHGSLQPQPPS 
LK*SIHLSLPSS*DCR 


170 


14071 




173 




388 


SDLQLRAGRTTALFKAVRQGHLTLQRFL 
LSF\V*LCPAPRGGAYRGRQASLSCGGL 
HPGRAYRLSCLPKQAWAMVGAPTPASLS 
SCSLISDCCASNHRDSVGVGPFEPCAGC 
NLTEHRFLSPSGAWSC 


171 


14072 




176 


334 




KLLELIN* FNKVAVYKINTQNQLHCYTL 
IRTTQKGSLKTILLTIAPKRIKYLGI/Q 
/LTKE VKDLHSE S YTLLKE I K* D INK* K 
D I PCS * MGMVFFGVCLFVYFCFEMAFRS 


172 


14073 


A 


177 


339 


2 


KNSPYNFPKGKNPGNL*SLKSRIF/CWG 
STFAHIGELFFFKMGKKFPAI/RPFFPF 
EDRVLLCRPGCSGWRSWLIAAST/CLG 
WDYRCTPPCLANFr,*RQGT,TMT,PRT,I,SN 
SGLQ 


173 


14074 


A 


178 


3 


303 


DLRRAACLILPKCWDYRHVPPRPADNSG 
FUILI 1 1 *VCLPLLCHAQLVF I FFI ETG 
FCHVAWAGLKFLGSSSPLTLA/FSKCWD 
YR 


174 


14075 


A 


179 


342 




INELNIFTMAIFSTLIYRFNTIPIKIPA 
GFVEMDKLILKLVWKFKKHGIGKTTLKK 
NRVPNSKAYYNATVSKT\YWNEDRNINQ 
WN* IDNLKINPCVCGQLIFNKDAKTNEE 
R 


175 

__ 


14076 


A 




2 


323 


S TS YPI TGSHAFL * PQNWDAETNS *HI 
NHWLRLKI I KLLEENTEKNCHDLGLAT 
Dyy/SVTPKa*ATTTKIDKLBLIKIKWF 
CTSKDIT*KVKRQLIGENSCKSFM 




14077 


A 


181 


325 


3 


RKRKKKRREREPKKDEiERKRLYGKDIlK 
KREKDTM/DWEKILQKGVRQRMCI *KI * 
RGVTQELKftNPIRKGGlSNLNKVHQRI*M 
ANKHMKRCPKS *VIREI* IPTIMRYH 


177 


14078 




1S2 


5 


326 


TKTIEKAMI* IRRSRRKKIKFKPEIKNR 
KTTTKINESKSCFYKimJKTDKLLVKLI 
RKKKTKKI1TKDEKNHTI\*DCTEVKGM 
TECYEQVYANKFDNS\EKMDNFLER 


178 


14079 




183 


3 


378 


TVI.CSFIVWGLDFFG*IHTIYS*VFCYK 
WNCVHNFLFRFFSAYI *YLLIFVY* S* I 
MHLCLICY+LLL/CSCIFFWIFYMLYFS 

FWL/CFLYFYFYILCPFIFF/CI FI CliL 
FFVYLFLLFIFFFIFLF 


179 


1 4080 


A 


184 


365 


3 


mqlkcimks isvqagvqwhnfgsmqgpp 
pw\lhhfpasaslvtgttwarhqgrvi f 
/ ifffflmrrsl/dsvdqagaqwhdpgs 

riQPIiliPGFK* FA/ CPCWDDRRAPQCPAN 

fvfla\rhgftilar 


180 


14081 


A 


185 


51 


293 


PYVNQEASVLTNSFYQAS/ r*KKSNYRP 
ISLMNIDAKILNKTQQIKLNKVHDSRQT 
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nucleotide 
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ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 














GEDQIPKYCTVSDGRMMRDYFPLFSS 


181 


14082 


A 


m 


326 


3 


TKICQWILNLGKKREFGFKKKKKIKCPN 
KRHRLANGIKK* DPTTGCLQDI CFRSKD 
AQRLQI KGWKKI FQKTGI *R/GVAVLTS 
DNIEFKIKTVTRDRVANYIIKGSIHQ 


182 


14083 


A 


187 


17 


330 


KDNEM*IKRTMRCYYTSFTMAKINLKIN 
NIKC+QG*GTTGT/L/LYC*WKHKIVQP 
FSKTV*QFLMRSNKYLRyNPAILLLGiy 
SN*VKI/C*KTYMRMFIVFLITKDWKQP 
NYSS 


183 


14084 


A 




124 


357 


SLWGGDLEGGGKKKERGENPGGPKKPGK 
KKKKGGNLGARG/VFKKRGRPQKEKPGK 
KGEKGGP*GAGI *GGGGNGVKKK 


184 


14085 • 


A 


189 


333 


124 


RLRQENRLNP/GSRGCREPRSRHCTPGW 
ATE*DSVSKTNKKMYKSI lARDCHAENS 
IVSFLSKRGCVFLF 


185 


14086 


A 


190 


1 


327 


KSWLIGDNIPSSQSLIQKKGVNPSFKSM 
KADRGKEAAEEKSEASRSWFMREKERSH 
LHN/1K*VQGKAASYPEDLASIIDEGGY 
TKQ* IFNEDY/MWKKM*FRSFLTREKL 


186 


14087 


A 


191 


53 


400 


VNILGPFHLFFTFVQNALSPNIYMTHML 
TSLGPLHMSSLQTGFPKESYLKFQPSPH 
SVPVGF/ IPGMQR* FS IHKSVNVI YHVI 
RMNDI SHMI I SKDT * KAFDKI QHIGS LF 
PALSL 


187 


14088 


A 


192 


65 


321 


RARTEIYLIHTLLGVYAGETKTGPYTDC 
M*MPTTAPTTLVKTWI*PKSLSIGERIN 
KL/WILICTYMEYYSAIKSVLTHLTTWV 
NPH 


188 


14089 


A 


193 


127 


241 


I IKNDSRRI CNLTEKLPLRKINS/WPGA 
VAHaCWPSTLGG*GGESRITYIJESI?NKG 
EQTQNNK 


189 


14090 


A 


194 


1 


337 


DLPASASQSAGIIG*ESIEN*NKLVKKT 
LKCSQKDPKEDLNK/WKDIVCSWLGRLS 
LTKVSILPKLiyKFAKIPTKLyYD*KKN 
RPGAVAQAVNSSTLGGRGGQI 


190 


14091 


A 


195 


2 


333 


NFNSLFFFVEIEKLILKFIIJ!rCK\VARR 
ILQRKNKVGRLTLPNEWTDYKaTVIKTA 
WH*HKDTHMDQR]S[RTESTKIlSrTYKGAKT 
I *WRKNSLFNK*CWDK/MITTCKRKKVD 
P 


191 


14092 


A 


196 


299 


2 


PHPQRDWQPSPPSICRIFSPSSIPGIQ 
SWFNIQKSINVFYHIVIUITQHRSSSSS 
SSSSSSSSSSY*I,MIKF/LNKLGIEGNF 
LILIKAIHGKPTVIIH 


192 


14093 


A 


197 


2 


199 


LALSPKLECTGAITAYCSLELLGSNDPP 

TLASKVF/GITGMSH*AQPQWFLLLYY 
FIFFFSSSSVL 


193 


14094 


A 


198 


226 


2 


KKFFFPNPNLKNFPLGRVFF/CSPG*KF 
FAPLGFFLVSFLRQKVPHLGSPGKTFFF 
FFFFGDRVWTCHPCWSAVA 


194 


14095 


A 


199 


338 


93 


PQHNGSCL* SQLFERLRAEDPLNLRVQG 
CNEL*LYLCTPSWVNPVSEKKSIQEKNA 
NLDFREY/RDKRRENWTCV*NSIERSSE 


195 


14096 


A 


200 


1 


361 


PRSHLSQLRVRLSQ/ IKKNQKTIGKGFR 
AIG/MGKKF*GKTLKVPFTRGiCVKQWKF 
LKLKSFLPRKKKTE1TL*TLGKHFRKWE 
KFFATYPSTRE* ISKI CRDLNHFTGKKI 
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SEQBD 
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nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 

0(1 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 
location 

ingto first 

residue of 

peptide 

sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, y=Tyrosine, 
X=lJnl<no>vn, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














IIPIFQEAKGL 


196 


14097 


A 


201 


2 


358 


ENKTTMRYH* IPrRMAEN*VIAVGTL7ir 
QPV7WEYKRV*PLWKTNSIAVTLLNINLP 
CNPT I LLVG I YPRERKTYVYTVACTQMF 
LrVLFTVAPNWKQSQGPSTGE*KKK*WH 
IHIMGHYS 


197 


14098 


A 


202 


110 


357 


IWVFIWCHFLLPEKTSFFPGFPQCLDIS 
* KTLFVHSVLFFFKDRVLLCHPGWSAVA 
Q**FTAASD/FQESSCFSLPGSWDYKRM 


198 


14099 


A 


203 


2 


374 


ILKLHAFFMAETPGACKTPCDLRQVF IF 
IFIYLF/MRLSLTLVAQAGVQWRNLGSL 
QPPPPGFKEF*GRCLFYPKDGLVCPGIiA 
GS*TGQIGKGIRTLSPDLCLSDGFCGSK 
PKS /ASASQGHVG 


199 


14100 


A 


204 


375 


3 


LSSILIRGSPNLSTLITIQEHL*LLLPS 
W VPIAI 1 * F I S PLiAE TNRTP FDHTEGE S 
KLVSGFNIEYAAGPFAILFIABYTWl 11 
INTLTTTIFLGTTYDALSHELYTTYFVT 
KTIPLTSPALMNS 


200 


14101 


A ■ 


205 


393 


3 


SARLGLPKCWDYRR/ASTVPGLVSLMLL 
LKS YVSKMKKCE * LVKPLKAKL *DS CKE 
L*VGNNLIMPSTYDQENDRVDSSLTWLC 
VLLLLLLLLLLL FWRQS FALVARAGMRW 
RDLSSL* PLPPGFRQFSCI 


201 


14102 


A 


206 


2 


377 


FRAVLQGRWSLQKFLLPFVQLCPAPKGG 
VYM*RQRS/SLSCGGLRPVRASWLLCLP 
TQASAMVWAPPPARLLPPRS I SDCYTSS 
EQGSVSMGPAEPGVGYDLLVCCLLRPLE 
KHSIWVRVSCFSRY 


202 


14103 


A 


207 


327 


1 


i ltgn fkq i rml i yhsvnpralknyaks 
tlpvnykwnnka*mtahlfi:awptgcfk 

PTVEPYCSKKKKX IPFKVLLLIDNAPGH 
PRALMKMPEEISIVFMPANPASIL 


203 


14104 




208 


310 


2 


FCPTOaPGMGFPLGEWGLILGPGSPFFLK 
EGFPGVGQPGPPKRWGLKB * PPGAHWKG 
F/I FFFADRVSLCHPCWSAVAQSQLTAN 
PASQVQRILLPQPPDYRIR 


204 


I4I05 


A 


209 


374 


3 


gvrdp/leewc:pf*dlki.varrt/slf 
fkavrqghlslqrpllpfvwlcpaprgg 
vcrgtqaslscsglhpvraswplclptq 
asamvgapspaslppcssisdccasner 
gsvgmghsetdvgy 


205 


14106 




210 


3 


196 


LENLKF/LDKFLETYNLTRLNQEETEIL 
NRPMTSSKI*SVIKISrLftRAl«IAHAYNPST 
LGGKGRRIA 


206 


14107 


A 


211 


34 


339 


IIKSKRINYMSCELYENINIDNFFFFET 
VFHSVI\KLKCYGAFMVHCTLNLPGSTN 
LSTSASQVSGAINRGYPGQF/IGLEK* F 
LVKT6FCHVA*AGLEPLGSG 


207 


14108 




212 


200 


312 


HYGQFHVLFCSLF*MESHSVT/QAG/VQ 
WP/SLALQTPPPGFKQFSCFCLVSSWDY 


208 I 14109 


A 


213 


300 


124 


NHIFFYFQIHRVCVCVCVCVCVCLCVCV 
YRYIYCVSPW/RN* * SL* KECLSLVLLT 

PTC 


209 1 14110 

i 
1 


A 


214 


1 


357 


QVDHTSDRKANLNMF" KI KVI QTMFSD Y 
NGVKSEKMREKTGKLTNIQKLWST/L* I 
SNETNEKPQEIRKYFEVKINENTTYQDL 
WDEVSIVAQAHNPSTLEAEAGRLLEPRS 
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NO: of 
nucleotide 
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SEQID 
NO: of 
peptide 
sequence 
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od 
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09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unloiown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 






A ^ 


215 


^360" 


1 


LRSAWTTE 


210 


14111 


LTiETYCSEXKKTPFKILLFIDNLAGHP 
RALMEM*KEMNVFMPAKTTSIMLPLI \S 
SFNSYYLRTRIHRTFVWAQSWLQCPTT 
AISDIQIGPVAMGLLLIPSAFVLCWLSI 
QQVSKLKH 


211 


14112 


A 


216 


3 


347 


ssrspadglffyqck* qtlyw* hlwykt 
qkekkktqmkekkqrinkarXgdklpec 
eavcgkpknpanpvqrilvghldakgsf 
pwqakmgshhnlttgatlineqwllttg 

KNLL 


212 


14113 


A 


217 


75 


347 


PPIFKRTARGKVTPGWEYPAKG/CGGTF 
LGNRGLFPNPGFSGPFPKKQVWEGALCA 
PG* 3CVCTLKF* FI \SLAIPGNGSKNFFT 

LFEGPSPIL 


213 


14114 


A 


218 


3 


397 


GGQGYSGTGHHGGIWHKMEH*R*PLPPK 
KKKKKKKKKKKKKKKKKKKGGP PKKKPR 
GGQNLKGGGKE KP P P QKGVKKKTLSGR I 
IKKDGREKHTRGELWKKTFIW/EGEKIG 
EKPPKKl *DHEGKKKVLRGKG 


214 


14115 


A 


219 


3 


401 


DSWATLHGNIMK*SAAVL*ALGLV\FGF 
TVSGPTGIVLSNS*LDIELHDTYYWAH 
FHyVLSIGAVFAIIGGLIH*IPLFSGYT 
LDQTYAKIHFTIIFIGVNLTFFPQHFLD 
LSGMPRRYSDYPDAYTT*NIL 


215 


14116 


A 


220 


162 


3 


KPAQRNL*SNPEKEDINILKRNQS/WPG 
TVAHACNPSTLGGQGKQIIRSGV*EQP 


216 


14117 




221 


374 


2 


WCRDRERERERATERERERERERQTDRQ 
TDRHREEA/EGWAAWAIN*GEAPGTSLE 
AflLECSCPKRPPQPAPPEQGPFPRTTAR 
GQPRPPKLLQPEAPSQTRPHGYPWPLRV 
liPQSGPEVRPRE 


217 


14II8 


A 


222 




264 


DHMRPKVRGCSEIi*SCHCTPAWAT\SES 
LSQKKKKKKKKRKKIGEKI *GAFFKVAP 
PFFFFKKKMNLKSPVGIAGVAKNTRWIL 
KHRG 


218 


14J19 


A 


223 


2 


410 


ArSPIIEELIT*HDHAI»INILLM*FLGL 
HM,FVALTTTLTNTNI*HAKETQT* *TI 
LPAIIIiVLIRLPSLR\ILYISDEV3SIDPS 
ITIESMGHQWy*TYEY\TDYSGLILNCY 
ILPPLFLEPGDLRLLDGDDQVALPI 


219 


14120 




224 


399 


3 


GVGKPGFPWAKKWALPPYSKGYPK*VQN 
LILKVKLLLLKKNPGENSLEATGTWGLP 
PKARAPKLKPNW/DSPKRKHPGAFiCKKK 
KRQPTSWEKIFiUlHaSDKRLTSRIYKEL 
QQ*KSKQP1FLIPIYHLNHFKG 


220 


14121 


A 


225 


360 


71 


NRTTWVPFKTPPLLYHI]SITKKK*HTGNK 
KHKTKRAKKRKKTKNAKK/EKEKNPHPQ 
RPQKRKK/EEQKPKRGKKKRGKKKKPFK 
EYSYL*K\YIHTNLQ 


221 


14122 


A 


226 


3 


386 


PSTHVS1NLAMGIPL*ADAVIIGPHSKI 
KDALAHFLPQGTPTPLIPILAIIETISL 
LIQPIALSVRLTANITAGHLLMHLIGST 
TLTVSTIlSn:.SSTLIlFriLILLTIL*IA 
VALIQAr/VCPIiLVGAY 


222 


14123 


A 


227 


3 


372 


YSLDSPSLTRFFTFHFMLPFISAALAAL 
HLLFLHETGSNNPLGITSHSDKSTFHPY 
YTIRDALGLLLFLLSLMTLTLFSPDLLG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ 10 
NO: in 
USSN 
09/515,1 
26 


Predicted 
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niicleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

Smino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=]VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














dpdnytgsrnapvkqphphi \kpe*yfl 
fa/ytildppykl 


223 


14124 




228 


2 


263 


PRVRSRWEDCLRPGA*DQ/LCAP*RDCF 
FKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKGGPFKKKPE 
GGQNLKGGKKKKFFPKRGE 


224 


14125 


A 


229 


314 


1 


LIKTIFLYQVSLCCPG*SQTPGLKQSSC 
LSLPKCWDHRHKPPCPAQSFYFTFTERA 
LRVI Y* FI *RQS /LRSVAQAGVRWCGPG 
SLHPPPPRFKQLSCLSLLSSW 


225 


14126 


A 


230 


86 


405 


VSYMSIQDHENGHVSGYFELVGQFCLRV 
GIVGGSTFESSNSHSYIYFFSFYLFIFF 
EMGLSVTQAGVQWHKHGLLQPQPPG\LT 
*SSHLSFPRSWDHRPVSPWPA 


226 


14127 


A 


231 


386 


98 


FFPQRR*FPFGNFPPFFPPPKLG\PSQK 
SPIGDFPPPPFFNPGPGPPFFFPPPFGK 
GFSFPPPL*FGPPQGFF*EAPPFFFFFF 
FFFFFFFFIRLLV 


227 


14128 


A 


232 


2 


362 


TYVLFVIPYVETDLVQ*AG*SYSMDYSN 
LTRLQTUnjNVPFNIAALATLRLLYLHE 
TGSmrPIRITCHSDEVTEHPYYTIKDAL 
G*L\LFLLSLMTLTLFSEDi:iLRDPDNYT 
liABTPLNTPG 


228 


14129 


A 


233 


5 


365 


KNVTPPVNSSQHDLTDIYRNLYUViaTEY 
TFFSN/AHGTYTKIDHMLSHKENIiNKF* 
RLKF*RLKLNSK6VLWPKWNQTTNQ*QK 
DNRKLSKHLKTKNTLUmiP* IKEEVSWE 
ILKMYLREG 


229 


14130 


A 


234 


3 


367 


NRIVPNHDIPYFSLCVCVLFFFDRVS \T 
LSL*GWSAGAGSLLTAM>TSCVQAILCL 
SLLSTISIJYRRIIPLRPAYIi/SYFKSSKSF 
TL/LPKGIL 


230 


14131 


A 


235 


107 


2 


QEAYE*DKERSSRIYMELK/SIQPKPPN 
NRVKKWTKEINR/HFSKEDTQTDKRHMK 
KC*SLICISLVISD*QRN2iN*TPNEIRL 
IP 


231 


14132 


A 


236 


349 


0 


NNL*IFLIFLFTgVFLSLISYLFYlfIFI 
RSLPICGLFTYFFSFSYIFLKYLMTLAI 
YYSF*\FFSr,IQVILFALIFI 


232 


14133 


A 


237 


3 


317 


TRMHISDKKTYNTNIKRIFRIHLMRQTT 
QLKIGKRYEQTLYKSIFEWTINTYKEVK 
QIjNSKRKPTNTVIKK*SKDLYKHFSEED 
IQITTRY\LKKC*ASLIIKEM 


233 


14134 


A 


238 


346 


1 


AAAQMPAYQELVEEAIAYGRKLGGSQED 
QTMNAHDQRFVMSAADILS * I PDWSTE 
VHMRLSFDKYAMSARA\RRLI ELS KE YG 
LSKGRIVI \KLSSTWEGIiOaAKELDEQR 


234 


14135 


A 


239 


1 


279 


LLLPPLPSLLIiPPSPSSCLPPSPSS/SP 
P SP / S /PPPSLLHPPPPSLLLPHLPSLL 
DPPPTSHLLPSIP*LDIENAPT\NSSNP 
SCPCRPRSTEYFK 


235 


14136 


A 


240 


98 




RPRRPRLVLNS * \LKLSAHISLPKCWDY 


236 


14137 


A 


241 


342 


1 


DTASLLQPMDQGAISTFNVYYL/RNIFH 
EAIAAIDSDSPDGTGHSKLKTF*KGFTI 
LDAIKNI CDSW/EEGVKISTLTGVWKKL 
IIiTIiMDDFEGFKTIVGGVTADWETARE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 

26 , 


Predicted 
beginning 
nucleotide 
location 

amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 
otidc 
location 

ding to'' 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=M6thionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LEL 


237 


14138 


A 


242 


3 


339 


RKDSRKRERKHKEKRKGRKEGRKKPIRM 
TVHFSSKTMVAK/ARRQ*NPVFRMLEE* 
KN\FPPRILYSEIIFFGSESQIKKFSGD 
GNPKGSFPELFFLRIPMEVFNLKEAYLR 
KS 


238 


14139 


A 


243 


216 


3 


NSTGTQTQSQKFMLSLVKJCLISFIYL*D 
GVLLFAQDGVKWRDLGSLQPPPP/GPTL 
FKQFSCLSLPSS *GSRI 


239 


14140 




244 


408 


2 


VLTS*SNIAST\WYGLYQ/TgLH/KIIaA 
YSST /THIG/WNWPVLQYNHNITILNLT 
I YI ILTLNA/FLVLNENSSIGTILLSRT 
*NE I T* LPSLIPSTLLSLGGLPPLTGFL 
PKWAII EEFTKINSLIIPTI lATITLIJI 
LY 


240 


14141 


A 


245 


3 


243 


I IMLKAGQMTVFPNEDVWMAYKHTDRYS 
TSLVMN1SM\QIKTTMGYHYAPITVAKLL 
N/SNTRYWP*RG/CSRVIHCWRKCR 


241 


14142 


A 


246 


2 


337 


FLIMDLQKYINPKIKEPQGETNHKTVSF 
MNINAKILNKILVIQI**FPKKNNVSKF 
G/FIP*/SQGCFNINQCYSLDERRKYMI 
ISKNIEKAFNKIQHSSFMIKTKNRKKLP 
EL 


242 


14143 


A 


247 


347 


1 


HYTPIRMTKIQNTDNIKCW*GCGNP/GT ' 

LLHCW*ECKMVQAL/VfL*TVWQLFTKLN 

LLLPYNPITFFGIYPNEGRTHVHTKTCT 

LIFIIAAIiFIIARTQKQ/PRCSSVDEWI 

NKLWYIQ 


243 


14144 


A 


248 


319 


2 








SSSLGLPKC*DYRRESS\PGLtIVIL*CL 
KFHLRVAMLFYVFEV* \IVIFFFLRQSL 
/DSVTHA6VQWYMLSSLQLPPPGFK 


244 


14145 


A 


249 


329 


3 


KFPFFHT6EAKNYAVFVINKRI KKKQHI 
HIIYNYGGHQKRECKEIEAHVHCWWEYK 
MMQ/SL*KTAW*FLKK*TMELPYDPVIL 
LLRIYSKELKAGTRTDVCTSMCSFLFL 


245 


14146 


A 


250 


316 


3 


KTHKTIMGTNHIPNEKIDST*1:iKNLNIR 
AITIKLL\KNHRVSLHDLRIGHGLLDMT 
PRAQAAKKKIKH*IDEFIETKNFCASKD 
NIKRV/NRQHKE * DTMFANHI S 


246 


14147 


A 


251 


152 


364 


QLTLSNHINNQIKYK*TRHS \NHHHHHH 
HHHHHHHHQ*KRLPE*SyKSKSQEHYTY 
GMDITGPKNLRTIPL 


247 


14148 


A 


252 


230 


1 


PCWICEFIVSSNFCWFVLETGSCYVMa 
RVQWCDNRSL *PQTPGLKEYS CLSFRSG 
WD* RH/VPIfiNF*TLPRGGVL 


248 


14149 


A 


253 




345 


DVGLAGLELLTSGDPPALASQIAGITGV 
SHHSWPLLFFFFFFF/CFFEGGFFFFLP 
V* SQGGDLGSREPWPYGFKG I SRLRFPN 
EGKIGPQPLAPDMFCFFDKTWLSTWPG 
WFH 


249 


14150 


A 


254 


194 


1 


GRVDTKWANTHERFSKCS/TTVD*WIFK 
SWYLCTMEYYSTINKKEILSFVTTWMHL 

EDIMLSEVS 


250 


14151 


A 


255 


327 


3 


VKTAEFVWKWQKNSTKLWNSQAQIDKKK 
IVNQINDLRQTEIWMGDRIMNLESRIQM 
QCDWNTS DFCVTP Q \ YHETEH* WKKVKR 
HLEGREENLTL* I VKLKEQDFEASQ 
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SEQ ID 
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sequence 
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NO: in 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


251 


14152 


A 


256 


314 


3 


KNFPSLINFLRQSFAFNGQGGVQRVDLG 
SPQPLDSRVSDRIRLCFKTQKKKRFSP* 
QTILKLSiCV/E/MTERVLKTAREKHLVT 
YK/GKPIRLTMDFSAETLQARRE 


252 


14J53 


A 


257 


2 


308 


mwm\ EEEEEE VDCGGE V* FFPFCFFF 
FFFCFYVFFFYLFFFCFFFFI FFYFLFF 
LFLFIFFFFFFVFFIFFFFFDCFFFFFF 
FFFFFFFFFFFFFFFFVFF 


253 


14154 


A 


258 


52 


186 


CFGKLGWDDCLRSGVQGCSET*ACHCTP 
AWAT\N3TLSLKKRRRG 


254 


14155 


A 


259 


1 


269 


T.KRSSH' ,.SLPRSWMyRCMPPCPAlMF*TF 
FVETGSHHTAQAHLKL\PPASA/FPKCW 
DYRHESLCPALD3PFTNINSATIITILQ 

GKCWHYC 


255 


14156 




260 


387 


3 


PLLQVAKINPiCHILDLNVKPKTIKCLQE 
NTGENCWDFGSGKDFLDMTPKMQSTK*Q 
ISKLIKIYNFSSKTQ/SFCTTUHENFLE 
DIVKRIKIQSHKLEKRFVNHIPDKRLIS 
RIYQELFRTQHEKPHTIK 


236 


14157 


A 


261 


62 


72 


LRIIKFSEESMLKAEMG*KLGLFHQAVN 
KCNSLTVNEWNAKKRFLKKMKSATPVK 
THMIRK\DMEimiEVWIDQISHNISLSQ 
SLIQRKVLILFSSMKPARGEEAV* *KNY 


257 


14158 


A 


262 


2 


193 


GGRGGSELRSCHCI PAWVTRAKLSLKKK 
KKKNSKF/RELGNKGQFMGP * LRKGFTC 
YKKRSPLIF 


258 


14159 




263 


3 


330 


QLKKKYYEELYAWKLGNPDEMDKFLEIC 
: LQKLTQHKN*KFFKDSRRKSKI>1NRP IT 
IIRLIQ* PKELPKRKTLRPDGFTGELYHK 
HFFFKLTKPLHKFF/AKIKEEEPFPNS 


259 


14160 




264 


326 


43 


TQETEAGESL \DPGGRGCN*LGSCPCTP 
AWATGMKLPFKKKKKKITSLPELRFLSY 
TCSS SQKFLYLNDDTI IHPVAQAKNLGD 
LDSSPTDPJQ 


260 


14161 


A 


265 


311 


1 


FPPFP/SFPLERKEKFKGRKKRGGTKRG 
PFPPFFPFSSLPLGPP/SPQRA*FLLFR 
FREFLKFFFPFYLPPKPGLGKFFFFFFF 
FWDGVSLCRPGWSAVARSRLTV 


261 


14162 


A 


266 


5 


323 


DSTK&QKNTWSIEPGEVGTLIHC/WVQ 
PL*KTVWWFLK/DVK/LELRYDPAIAHL 
Gr/YP!CKNKT*TCTQVIiriTALFrMTKM* 
K*PKD/PIlSnSIEWVT/KLWYMHTMEY*ST 

IN 


262 


14163 


A 


267 


3 


293 


GSLQPLaERFK*FSCLSLL/ SAGEWHEP 
WRRSLQRSCHCTPSSLGYRAGLHLKKNH 
NNNNKSEVSHICTIGIFLLFIY/CIYET 
ESHSVT*AGVHWLDF 


263 


14164 


A 


268 


3 


332 


HLS IINLNVNQLNS PLEAYTLPVW I * KK 
KWPNY\CCLQEIHFASKNTYKLKVKELK 
KKFQAHEKEKHADKTGFKSKTVKXKKNG 
HYIMIKESVQKEWIYIYMFLIADPDIC 


264 


14165 




269 


3 


326 


KVLERHDVLKLTQ/DKICNLNM/PYTKE 
MES IVNNLPKKKTPGLHSFTGEFYQTFK 
NEMILTSYWFSQKFEAEEILNSFYEASI 
LL\IPKLDKDI IRKEN*R/PNAKILNKI 
L 


265 


14166 


A 


270 


1 


318 


PYPAKLSSLSKGEIGTF\LDKQMLKEFV 
TTRPALQEILKGALNVLI */LERKDHYQ 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, P=Phenylalanine, 
G=Glycine, H=Histidine, Msolcucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QMQKH 


266 


14167 


A 


271 


1 


318 


LKLLTSGNLPASASQTAGITGINHRAQP 
HLSFSL*LWNET/CVPG 


267 


14168 


A 


272 


108 


322 


DSEVI FLLCGNLTVFVCLFLRWGL/DSI 
THPGVQWHHHNSL* P * TPGLK* SYHLSL 
SSI WWYRRTPLRLAFFL 


268 


14169 


A 


273 


341 


3 


GFFFPFPFFRQGFPLLAQVK/VPWGNFR 
SLQPLPPGVKQFSCLNLPKN/WGFSLKG 
FFKKGCHFPF*Y*IFFNGVKKPDIKFSI 
FTNPFFFFFETESCSWQaGVQ*HDLGS 

TRP 


269 


14170 


A 


274 


41 


328 


fwlltpgqeterereseteresererea 
geaergretdgidsliysklysktihlk 
ileyiXatyfkrlpynwel^t^ak/tri 

lEKPSVRHQCQQRS 


270 


14171 


A 


275 


3 


322 


EAQELL\DPGSGGCSELRSHHCTSAWAT 
KAKLRFKKKKKKKCPGRYLGNIVLQ* FN 
I lAKCDKYHIVKLI F/CFSSAYTASIQK 
LIQVSRTVTHRKNKKTRQYVYILD 


271 


14172 


A 


276 


175 


313 


STLISYFRDRVLLCYPGWSAVGQSQFTA 
A*NFW\VK*SSSLSS\PSSW 


272 


14173 


A 


277 


254 


3 


RPRRQFGIEGSFLNQIKNIYKKSTANVI 
LYVDRLNAPSLI SGTRQGCPLSRLPFNI 

\rL*LPVIIIIRQ/EKVI*GMQIVKEELNIj 

SL 


273 


14174 


A 


278 


186 


1 


PETMQARRQ*SEIFKVLKEKN/LQHRIL 
YPEKLSFQSEREIKTF\QRLKKFITSRP 
ALQRMLKE 


274 


14175 


A 


279 


288 


3 


GRLRGKKGFNPEGENSKEFFKPLPSGLG 
AKTQPVFKKKKKKKKTRNPIKK/WAKDP 
NRYLTKEAI *iyiASRHMKHCSTSCVIHEL 
*rKTTMRYDYVLI 


275 


14176 


A 


280 


3 


329 


lkessrlsllcswdhrfalppmlsglvw 
nscpq/cdpsasaslsig:tgmshhtwl 
*wlf/c*etgshpvtqagmqwydpsplh 
ptkkgvylt/rrfwrlgspsaw 


276 


14177 


A 


281 


233 


3 


IITKDKBEHFIMIKESlflQQGITlTiS/ie 
VPYNRGSK*MK*KLTKLMGEKR* /HSI I 
lAGDFNTTFSIMDR&IRBCSAKNG 


277 


14178 


A 


282 


32 


309 


LPDITPRDHLSPGTMDFIQETGCSKCWR 
GCGTPVCC*WECKL/VIVQSLSRTVWRF 
LBCKLKIGQVRWLMPVISALWDVEVGGSP 
EARSSRPAWP 


278 


14179 


A 


283 


1 


302 


CKRNNKAWMTVHLFTAWLSEYFTPTVET 
YCS*KTIPFTIL/LLDDNAPGQPGVLVE 
MHKDMNVAFRPANTASILQPMNQGGIST 

FNSYYLRNTFPKAIVAIE 


279 


14180 


A 


284 


155 




PRRFFFFFF*DRVLLCHPG*STAT*SCL 
TTKSaSQV\RQFSCLEPPSSWDY 


280 


14181 




285 


131 


15 


DRSNPGRFLWTSWSSLYERPREIRPSSQ 
APPPVNDPI*T 


281 


14182 


A 


286 


1 


359 


FFFFFFSWFWCFSFFFFKKKGGAPGGG 
GPPPPKKTPLFSPQKKNFFLQKKPPGKG 
FKGPGLPPPNFGPG/RNGGPLKPGDPHP 
PDFTPPRWLFKPPF*KEGBPFFPPPGFP 
KNLFLKKVP 


282 


14183 


A 


287 ■ 


52 


419 


LEERAGTSWPKKENKKQHFCMKKKKKKK 
KKKKKKKKKKKiaCFFKKGAQIFSF*GGV 
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correspond 
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Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucinc, 
K=Lysine, L=Leucine, M=]Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W-'Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 














NL/H/ SGPGPVLKTGGGGRHSPPPPPSP 
PSSLFFPSSSLPSLPFLSLPLPSFLPSS 
PPLLSS/CPPPLFSS 


283 


14184 


A 


288 


236 


325 


LNIRTPGSWLXAVAHACNPSTLGGRGRR 
IT 


284 


14185 


A 


289 


218 


409 


KVKGNKADL IRSANGS *N/ VKGVLCHQY 
LEKSFCLETEFHSCLPQLVHWPDLGLLQ 
PMPPGIAHF 


285 


14186 


A 


290 


3 


426 


HEETGSPRLECSGMITAHCSLDLPKLK* 
SSHLSLGS/RHVPPCLAHFLY 


286 


14187 


A 


291 


382 


1 


ANFyVTLVQ\QGFTMLRSWRPA*PCDPP 
TWASQSAG I TGVSHHAWPKMST I TLGVY 
S FGESEVFS I FKFYFLETGS SFLPQAGA 
**CNHSSLYP*TPGLKQSFHLSLLSTSV 
YRYLPPCPLCPVLV 


287 


14188 


A 


292 


3 


309 


HEVPyPTLNQQLEMIKLSKGGLVKAKMG 
XISQIWAKEKFLKEIKSVIPVHTRMIK 
QNSLIA\ETQEVSLVWIEDQTSHNIPLS 
♦YLIQSKFLTLFNSPKADM 


28S 


14189 


A 


293 


2 


415 


ardqykkstkk*akdlrrhfkneai /hm 
asnyk/krcsis*virkieikssmtscy 
thlqn/alklksdnfkc*qkctamgi il 
swwecnlvqslwtwVqnllmlnthkpy* 
sai/ptsglyptemyrnieqaslrmfwa 
wrgg 


289 


14190 


A 


294 


335 


lOS 


I PTPK* PLSSPPIHTPYTT IPKP/RPPP 
P*HPPHIHPPPPPLTP\PLPTHTLPTPT 
PTNLPHIPPLYSIPPSSPKIS 


290 


14191 


A 


295 


266 


2 


gfpkkvfsfpigppklgfpnsvfs/lft 
ffnpglclkffqffpfsrflplgfsfpf 
ffffffff*deavlchpgwsavvrswlt 
atlv 


291 


14192 


A 


296 


347 




DSLQP *TRGLI!I*FFHLSFLTRWEYRCAP 
QCPMIFC/VFL*RWGFAMLPKLVSSDLP 
ALASQSAGITGVSHCAWLVFLLFSLYFI 
HFSKASETPWAQNQSQTPSEYPATYNFK 
lYSC 


292 


14193 


A 


297 


284 


2 


CRQGFVLCRLVSNSCa3QVLCVRLC:V*VL 
GLQACAT*LGaEGVSrEFSCFHKeRGCL 
CRHRSS/HSMFSAWC*PHTHTHTHTHTH 
THTPSPMQLV 


293 


14194 


A 


29S 


1 


96 


GTRLCLGAITTLFAAVCMiTQKDLTKIV 
DFST*SQL6LILVTIG1NQPHLSFLHIC 
THALFKAILSMCYITIIHIIjnroQDSQT 
IRRLLNTMHINSTS\LTIDSLS*L/6KP 
FLN/GYY*QKDLTKIVDFST 


294 


14195 


A 


299 


3 


134 


HEGRD*PGNHGDTLS]:JjKIQKLAGSGGV 
CL\KLRKLMCEIS1HLNP 


295 


14196 


A 


300 


2 


333 


GVGREGGSREGGEEESGWNGGIRERRSG 
KREEEGKEYRDEVDRKERGEEGGRERER 
GRGWEEEQKSAS* C/EEGRVK 


296 


14197 


A 


301 


354 


1 


TENELLSRIYKNFYQIN/RYKQKWSIKK 
*RK*QAI/TTETIQTVWKHLKKCSISLI 
IRELQTKTR * I LYT* LVE KKLKTNG I yc 
WECCRELGSC 


297 


14198 


A 


302 


1 


2S2 


GTRGMVAGAYNLGLPSSWDYR/ * CLPPC 
IiANFFVSBVERGFTRLTTLVLLS *LYDS 
PASRSETAGIPLIHTSCIVYWSLRIOTAD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=Fhenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=l^ucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=l]nknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














lAILCKAQAV 


298 


14199 


A 


303 


361 


3 


L PKWWDYRHE P LCLATFFFF * AFP PVP L 
CMYVPPN/ RYVAFFFSLSSF * I CCTFTL 
YIFPQLMDG*AVSSFLFLQPVGVKFKKK 
FFLSFFK\KRRSLGMLPRLILNSWAQVI 
HPPWPRPHA 


299 


14200 


A 


304 


2 


352 


ARiy|VSVS*PCDPPA*ASQSAEITGMSHH 
AQPIIIS*TIAYVSFSLSSLGT*ISSVR 
SFTKSMSQLLYFYYFVSL/PSFW 


300 


14201 


A 


305 


334 


1 


KJMSFFFFFKRSLGLLAQI* /VQWGDPKT 
LQPLPPGVKQ ISRLNLLKKWDY* RGPSG 
LGKFW/IFL*KQGFPQLFQWL1IFFFF/ 
CFFETESRSVAC3AGVQWHDLCSLQAPPP 

ssc 


301 


14202 


A 


306 


270 


2 


DWCAGGGGDGVPRRQVIFVILVETGFWR 
VGQAGFQLLASSYLPALfl/FPKCrjDFRH 
* PPHSALKALFFF*DGVSLLLPRLECNG 
ASSPRA 


302 


14203 


A 


307 


306 


2 


H I FQCVCVCVCVCCWCVCVVWCVCVCV 

KT,VISQT*T.,SPT,CSGP*A/CTFFSVCVC 
VCVCCWCV\CVCVCVCVCKVSDr,TNT,S 
TLCSGP-A*HFFSVCVCSC 


303 


14204 


A 


308 


117 


332 


SPPVLLRCSLSLSVQIiNGKTIRFLK*LK 
MEI.*EHSAIPLLGIYPKGiCKFLYQKDT/ 
CHSIFITALFIIAK 


304 


14205 


A 


309 


1 


351 


GTRKTNN* KWAKDLSRHFSKED IRNGQ * 
V* /HMKRCSASLI IRIM* ISITV\RY1IL 
TCIIM 


305 


14206 


A 


310 


1 


352 


PSPPNSPPSPLPSRPPQPPPYPPPPPRP 
PRSPL*YTPPAPPYW/PSPILTPSP/PS 
DPPEPRSSRP 


306 


14207 


A 


311 


1 


405 


FKPSP*PLTGftLLGLLMTSGLflM*FHLH 
SIT\LLILC\LLTNT\l.riYQ*WHDVTR 
ERTYQGHHTPPDQKGUIYGIILFITSEV 
FFFAGYF*aFYHSSIiAPTPQLGGHWPPT 
GMTPriNPLEVLrjjNTSVILAS*VSI 


307 


14208 


A 


312 


3 


176 


HEILGSNF/CGETQVSIHCI>G*S*TREL 
KQSSHLNI.PKCWHLRA*TTVPGLTTIFH 
LGKPKHKRVR 


308 


14209 


A 


313 


298 


327 


SLTLSPRL*KIKK*RKK*NKCWQGCKEI 
GAFIHCWWGCBayrVQPLWKSXWiliPKKVK 
*KLSYDPAIPLV 


309 


14210 


A 


314 


344 


3 


HSTSLVIREM* IKTRMTYPFTTTKMAI I 
* KNRS * \ WLGCGKTGSL IHSW * KC* MVQ 
SLWNTFGQFFI KL I * ELTIPLLDLYLRE 
MKTYDHTKTCI*MFIVALFIIAQWVKMS 
LV 


310 


14211 


A 


315 


18 


321 


WALFVYCLEKKBFGQPPRrACFFFWDRV 
SLCLPGWEAVMGSWVT/VRPKLLAFN* S 
SHLSLPSSWDYRLIPPCSAKDSICEKNR 
VSTVDGADKIOTRRGKEGD 


311 


14212 


A 


316 


289 


52 


ETSCDVPSKGILHFGKRWTHIMPEKWAK 
GMR*YFH\KEDK*MAHIHVKHC*TSLV1 
MKMQIKSSTPYHFIPPRLTKPLKSAMTK 
CW*TCIWAIYLSSSMKIL3HALCPFFWH 
DVCISLTEVKNAFTGHITRRFS 


"312 


14213 


A 


317 


57 


324 


ycvtffffgkqsfvlsprlncmgpfjjvn 
cslsllstwhyglvpgtqpffXeflvqr 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
etit 
od 


SEQH) 
NO: in 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

tocation 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C^Cysteliie, D=Aspartic Add, 
E=Glutaitiic Acid, P=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=l,eucine, lVI=jVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutaitiine, R=Arginine, S=Serine, 
T=Threonine, V=V aline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GFHYVAQVGLKLLTS GDLLP* GS KGAGF 
TGLTHC 


313 


14214 


A 


318 


65 


341 


QWLLKSRSCYFFFL*DRSWAVAQA*VQW 
R/NPPPPGFKQF/SCVSLLSSWDYRCLP 
PHLADFWIFSRDQPDQHSGTSSLLKKKK 
KREIPNLPGLG 


314 


14215 


A 


319 


2 


182 


ARGDYRHMP P YT \H * LETGS C FVIQAGL 
ELLDSSHPPALTSQCWDCTCEPQSWSAP 
CVS I 


315 


14216 




320 




3 


mpprqahfWffiemgfhhvaeaglell 
gsinpptsasqsswdhr*sqsawitgvs 
hhaqlgtitsyhillflkkgeahac*sq 
hnpttclhvnlccslllaisrpptttnl 

PRA 


316 


14217 


A 


321 


122 


466 


QSFSTYC*KMWVTI/CFLKSIDSYLVPY 
TKIDSKWIIDIiNVKPKTIKLLGGNMGEN 
LCHLMLSKDSSriVTQKA*SIEKQINKFN 
SHGGTRLRSQPLKHAEVGRVRLS POORS 
YSDL 


317 


14218 


A 


322 


3 


345 


HEDAVSAPCNLHLPGSSNCPASASRVAE 
MTVPA\PCPANFFFFFFLEREFGFCCPG 
/CK*K*VFQKKPKGKKKSRIEGKKiGVWG 
PKLKERERKKRKKEERKRQKERKEVRTE 
MNGO 


318 


14219 


A 


323 


206 


446 


GWLHQCPYKWTHLAKXMCEKQDS IC* IL 
KECYQKPLLEKSHLIAQVENDEKPADW/ 
PLLRGEYVEWND 


319 


14220 


A 


324 


62 


327 


VERLLRDUlEISIFCaaSrPDGSEAPWCFTLR 
PGMRAAFCYQIRRCTDDVRPQGEAQAWG 
LQSRAGSLEPEGRGEVSA*W/SARTRHR 


320 


14221 


A 


325 


441 


! 


RKJOC* ILQSL/EERLEHWaS PHRDHRV 
WPRQEOKLFSEOKNKATRMTVLYPEE/S 
SKKLGSRS*GSE/CTSVFRITLSVGYVA 
YTGAPVS ISERPSAAIYHRMDIKW*HQHR 
RHriGFSBQBPVQGVEPLQGSSQGPKEP 
QPSPSNPAPRA 


321 


14222 


A 


326 


2 


358 


ARARTLRIMVNLNYAKSTLPVLCKGKNK 
VWVTflHLFTASEAEYFKPAVETYCIiEKI 
FFKILQLISDAPCHPRPLMDMHKEVNVF 
VPAKTSIQQPMD\ ISPFKSYY*EITFQA 
GCC6SCL 


322 


14223 


A 


327 


1 


430 


AREEMPGHLHRCI lEQDWRIHMLIPKLN 
TQMIKIV*YWYKDRYVDTE/YNNIESTE 
VNLYICGELIFSRSAKIIQWWGKRIVLN 
iCWCWDD/WNL/SCKHM*MDSYLTLCr\K 
ITPWI INPTGRAKTMRLLP /EKTGVS FC 
DFG 


323 


14224 


A 


328 


146 


374 


KCLMLTKP YMIMRLT* P YEI R* / CGDTF 
P * S YYA* TGTAVRTWGLTP VI PALWEAE 
AGGSHGQEMETILANTEKPRL 


324 


14225 


A 


329 


2 


322 


ARASHTFIVRKTQCLASKDKXTLVRG*C 
SCDFQLKSMLIDYSENPRALK/NMLYKR 
HNKAWMTLGLFTAWS TE \ KP TVET Y CS Q 
*\KITLKIWL 


325 


14225 


A 


330 


350 


47 


emgflhagqaglelltsgdppasafqsa 
gttgvshrarpan*kkkf/clkt/rvpl 
ccpgws * tpglk* s s cfgls shwnyrhe 
pprpaqfiilidfylrhyees 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, P=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arg)nine, S=Serine, 
T=Tlireoniiie, V^VaJiiie, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


326 


14227 


^ 


331 


753 


458 


FFFVE*GVL\HVAQAGLKLLTSRDLPTS 
AS*VAETTSTHHHA*LHLFNPEKYFCKE 
QILLCCPGWSRTPALKRYSHLCLPNSWD 
YRHEHCNEPEEISLI 


327 


14228 


A 


332 


256 


3 


TLVLMQS CLS LLS S WDYRHKPP / *LSNI 
FNLPMGLSGHNPIINGGISVSLDRVWL 
FLFC/LCFETGSHSVAQAGVQWIIDHGSL 
HSC 


328 


14229 




333 


25 


358 


TPDLR*STCLSLPKCWDYKHVPPHPANI 
FI*QKLFb'ISJJGQYANVLTWKEGGTFHT 
*A*RPMHHSYKLRKDQTIIFIYLFF/LR 
WSL/NSVAQAGVQ/MHNLGSLQSLPPEF 
K 


329 


14230 




334 


3 


329 


HEDWSPFWP6WSQTPDLR*SACLGLPK 
CWDYEREPPHLALNAFLT*NI FNL*W/C 
LSGSNPIIRSICIRFLGKGG*LLFIFIF 
VWFLTQSLTHSVTQAAMQWHDLGSLQ 


330 


14231 


A 


335 


26 


330 


SQLLGRPRQENHLNTGGEGCSL/CKIIL 
VCFWLGHLLPVWFRC/ILYLF/CWSLG 
FNFCFSLVL/CWFLLCFFCAFMFCLFY* 
WFVICHFLFFFFLFYFFVMFLFF 


331 


14232 


A 


336 


3 


348 


DEVFKS/YH*HPMLKC*GDYSLLFHGES 

DL UTTQ T LTHPSTTAMyFVHYCQS P * I B 

Y GT I D T * ? _J V I HRN P I HIRT P Y P CLQAS 
TAINLQLSHMWCNSKATPHSLGYHQTYP 


332 


14233 


A 


337 


2 


329 


ARAARAARVTGIAWAYHLIGKGLEPFFF 
MALLFLR*GL/NSVTDARVHWYHHGSM* 
A*PPGLRCMSLCPA/NFLYFLVE/IGFA 
TLPRLQAICLGSSDLPTSTSGSAEITGV 
S 


333^ 


14234 


A 


338 


60 


354 


NGKECKKLIYLSIYLSIYLSIYLSIYHL 
C*yii/SCIiSr*LSN*IiAVyYLSVCLSVS 
VCL\VAS Oi* ISI YLSIHLSFYS IDRIL 
AHHCVPPTCSLPHATH 


334 


14235 


A 


339 


1 


351 


RDADVMLLQILVI FQALALGVQTEGMaG 
VTHHDRVLVHWVRFPQCL/HRV*SNGT\ 
P*VGTSASERPGLKQSSHPSLPSSWDYR 
CEPPCPVITF/CFFV 


335 


14236 


A 


340 


190 


358 


FGSWLVFFF/CFLETDSRFAP*AEMQGP 
NFG*I.NPPPP6LRGFFSISLPGTGDYGP 


336 


14237 


A 


341 


306 


119 


LRWENHLSLGN*GCSEP*FRHCTPTWTT . 
E*DSVSKQKQKQNNLELKKPGSS \ CWF 
LPETLFT 


337 


14238 


A 


342 


1 


122 


GTRGCGEPRSRHCTLA\W*QSKTPSKKK 
KKKKKKKTGPFFF 


338 


14239 


A 


343 


92 


316 


VCWDIPHEKSRTICILFYPLVIFLFBTK 
ICSSFSF*KGCM/WLGAVAHACNPSTLG 
GQSGWIT*GHE/FKTSLA 


339 


14240 


A 


344 


83 


329 


ATAPAPCYFLI LCI CMCVCVCMHTS VCT 
/CVILCMCMGLCHIL*TCVA/CTCLGiyiR 
FLLPAHII ICVBCRFCLMCGSGNSGSLCL 


340 


14241 


A 


345 


127 


357 


YGVLSRGRVLPCWPG^SRTSGF/SSPEA 
NSWPPCLANILYLVETGFCHVAQAGLEL 
LDSSHPPTSASQSAE ITSMN/HEA 


341 


14242 


A 


346 


312 


1 


LMNLNRSVGALLTRGVKCHKAFRSTISL 
LIGALK**QSJOSDLGISLSKWKDLKND 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 

sequence 


Predict- 

location 
eorrcspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, »=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Va!ine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion. Impossible 
nucleotide insertion 














IYRSLLRQSEEDTS\KWKDIPCS*TGTP 
NIVKMSI LTKAIYRFNASC 


342 


14243 


A 


347 


350 


243 


FHHVGQDGLNLLTL* ST/PCLGLPKCWD 
YRCEPPRIAY 


343 


14244 


A 


348 


165 


338 


HTLDPASHEGPTFLFSL/PVEQCLK*LT 
LKVAT IVLFCFCF *DRVTLCHPGWSAVM 

QSQ 


344 


14245 


A 


349 


328 


2 


SVCPHGSINLSPAETTGACH/ RRLA*NI 
DSRV* AKTFKGK* TNCS YNNIIHIINIINISIN 
NNNKPFLLLRLAGDRHPDQLCPPPHRCH 
FMGATPAGPPSCLRRSHHSSAVCSC 


345 


14246 


A 


350 


3 


338 


HEIEELITFHDRALIGNILICSLVLYAL 
FLTLTTKLTNTNILHAECEIDTVRTILPA 
IILILIA\LPSLRILYITDEGNDPSLTI 
KSIGHQWY*TYEYTDYGGLIFNSYILPP 


346 


14247 


A 


351 


124 


382 


NTFKS *NSNKTKITAIRT*AKDTKQHFTG 
EDI QIANKHMKRCLI PLGKR\ SUSIQE I S 
PRVRINKIRPGMVAHTCNPNTLGGWGRQ 
IT 


347 


14248 


A 


352 


3 


368 


RDRASLFCPGWS*HPELK*SSCLGLPKC 
WDYRR/AATAPGLLCL 


348 


14249 


A 


353 


390 


161 


FSRDRIS PRWSGWSP /DLRLS P / CLSLP 
KCWDYR/R*ATMPGTFFHFNLKNQDARN 
Ql ITCPNLTNKYIYMTSSFILVHS 


349 


14250 


A 


354 


123 


345 


LAPQHFASLRLARP/PAGPPCDRPLAPP 
RPS*FSPAPLPPGADRSVPLSPSVPP*S 
LPPPAGAPPR/PPPPRDSPRR 


350 


14251 


A 


355 


37 


393 


TKHFVSTCYVPEFIAGTNQRKRKRISVL 
ANVKDKQIILLRVRTAITEGYRNYHVLF 
LVFVCFFSPPKQGL/DSAFQAGVWWHNH 
NSL*P*TPGLKLSSHIiDLQSSWDYRRVP 
/RMSSYF 


351 


14252 


A 


356 


2 


335 


ARETSKDRLTLLLGAHAGGDF XKLKPVL 
LYHSENPRPLKNYAWST/LPVL*KWNKA 
*MT\HVFTEWLAEYFKPTVQTYCLKN\K 
FSILLINNPLSHPRALMDRFKETIGVLM 
PA 


352 • 


14253 


A 


357 


199 


377 


VGSSCFLKVCQ/C*AVCVCVCVCVCVCV 
CVCLRCV/CVCL/CV\CVGFFFFLVLCG 
CGCVCVC 


353 


14254 


A 


358 


1 


335 


GTRTTCKYHVPPRSANF*LVLERWGLPM 
LPRLVANSPPQTILLLQPPTVLPWPERT 
SALT/YRPLLDLSPTSLSPRIPSSPLTS 
SLSIPAPPRSHLAPPPPRPPPHPPIRPS 
A 


354 


14255 


A 


359 




369 


KP S P * P LTGALSALLMTSGbAX M * FGFH 
SITLLILGLLTNTLTIYQ*WRDVTREST 
YQGHHTPPVQKGLRYGIILFITSEVFFF 
AGB'F*AFYHSSLAPTPQL*GHLPLIVII 
LLNSL*FPTLY 


355 


14256 


A 


361 


337 


121 


LWSQLLGRLEQEDCLSPRSQGQNKP *CH 
CTPAWVTE *DSVSKKGKWYIHNKKSSL 
KKK\NSCNMQQFGGT 


356 


14257 


A 


362 


256 


375 


EKKTIVQYPHTrGINV*KNLPAKKTPGP 
DGFIDKLY1ITFRGEITTSPHILFH*FKE 
VAVLLNSFSK\AaSITLTPKPNKDIMRK 
ENYSPI SSYIWUQCSFFLEQIFPLSFKL 
NCPGLIIPH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
f>K;iutaniic Acid, P=Plienylalanine, 
G=GlyciDe, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^TIirconinc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


357 


14258 


A 


363 


170 


1 


SQLLRRLRSQGACHHAWLIFVFSVDTGS 
HHIGQAGLICLLS\ *AIHPPCPPKILflRA 


358 


14259 


A 


364 


330 


3 


RIMSASASQSAVISDMSHDVRLIDI*HE 
P*FYQA1YSFEQ*ITSSPSYR*SYLLSY 
KWQDEAEC/AFVTAWQAEAGGWLEPSRL 
VLDS*PRDLi>TSASQSAGITDMSHHSC 


359 


14260 




365 




341 


SQYGLDFHDLSDLPTMASQSAGISGVSH 
RAWPASG*FYVSRHVICSYRL*GYTPSL 
LVLESGSSSVTQAGAQWHNHS ILYPHTH 
GLK*SSCLSITTR/WDYRHESP 


360 


14261 


A 


366 




338 


ARMVS IS* PRDLPTMASQSAGI TGVSHR 
AWPATGKFYVSRGVI CFII *KDTHLFF/ 
CFLETGSSSVTQAGAQWHNHSLL* PQTP 
GLK*SSCLSIPKC/WDYRHESP 


361 


14262 


A 


367 


2 


337 


ARAGI LKH* QRKCKS IR\PLWNWVGYYI 
LKLNIGITHDPAIPFLVLYPSKMHTHAY 
* KTSMRMCITTLFIMaiN*KFKCS*SEE 
E /IGKFWYFT* *LLTSNENEHLQLHTTW 

IN 


362 


14263 


A 


368 


59 


348 


SAFGIQENEMPAGRGGSHL*SQHFaRPR 
RVDHLRSGVPD*LGDR/GETPS KKKKKK 


363 


14264 


A 


369 


350 


3 


GWSQIPDLRWSTHLSLPKC*DYRRGPHA 
QL*TLMYKYLFEHLLSVLLGICPEVEWC 
/DHMAILCLISLGTVILSSKALHHYTFS 
PVTCKCSNFSISLSTLVTFHCFLNACYF 
PLFSRA 


364 


14265 


A 


370 


908 


30 


ECBGTILMCKLPLPGSCHSPASAS*VA 
GTTGVCHEARLIFLYFLVETGFHRGLNL 
LTS*ST/SASASQSAEITSVGHHAQL/C 
I.E]:.ELK*STCrjSILPKCWDYPA*ATVPSQ 
FGSVFLRIF*PMFrsNlGQLFSCSAFVW 
P*HQGNAGLIE*VRKYFIiLFKFLKEFA\ 
RTGINFR*MSGRTDW*IIHPVRGISLLKG 
F*LVl*CSY*L*VyPYFLLF*EVBLLSP 
RLBCNGVHSESAGITGGSHHAQLVYPYF 
FFVCDSVLVGFVFIiElCHFlWLFNLLVY 
LCLWYYLIITLIAVKSQ 


365 


14266 


A 


371 


164 


1 


SCRQVESYGIKRITCISLASSWNYRHAP 
PRPAiaF\*FPflEMG\FLHVDQASLDSC 


366 


14267 


A 


372 


3 


383 


MVSTPAEDAVNIVDMTTIJ/D*EYSLNLV 
DKAVAGLERIDSNFEISSrVGKMLPNSI 
ICYKEIFHERKNQLMQQKSLLPYFNKLP 
QSIQHSAITTLISHQIjSTIWQS/PPPTK 
/RFQLTEGSDD 


367 


14268 


A 


373 


-T 


349 


TLLGOT)H1YMVIVTAHAFWIFWIVIPI 
IIGGSGN*LIPLlIGAPVMAFAErNKIjR 
LRLLPTS\ILLL\IAYAILEAGARTG*T 
V*PALTRliy*NPBAyGHLSMFSIiHLTGV 
SSISR 


368 


14269 


A 


374 


443 


2 


SSGSRSECRFLPLLFHWLEILAGAIRQ 
/EKE/IKGIHIGKVKVSIKLTEVINKFI 
KVA*YKIN\QKSWCLYTNHEQLEKKFK 
IPFKiaSKRIKYIiEIMH*GKDI.YtfE]SlI.K 
TLLKEMKENKWINIPCSWIRRLNNVNIS 
ILSKVTYKF 


369 


14270 


A 


375 


3 


201 


LWIKKLDIIPIIPS*LFFNTEIiDKr,rr.K 
LIWKFRGPR/ITQNSFEKPIVGGLILSD 
FJCTVYKATVLK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 

otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


370 


14271 


A 


376 


11 


324 


DKFLEHPPLLVLQAVAGK3SPLHCLSTL 
PTSLF I * DH.VLLCSPAWSAVAQTWFTAT 
ASWAQ*SSNHSPSD*R/PHTTTGHFFCR 
DKVL/TMLPRLVSNSWTQAI 


371 


14272 


A 


377 


85 


364 


YQHNYSFCFISQYVYRLFFFF*TFFKKK 
GFWF*RLNFQGGNMG/SMEP*PP*IRQF 
FCLNPLN/SPDYRGPPRGMSIF*IFLKKK 
GFQVFPSLSKIP 


372 


14273 


A 


378 


1 


382 


gtsgtsyst i fagtl italssh* fft * v 
gleinmlafipvlhqkyt\p*ythaaim 
hfltdsga/svllieilllysnslge*s 
s i lelqli s i flsmmlvi pnkfgaayys 
ihalctdrpqllhilli 


373 


14274 


A 


379 


24 


323 


I pglkrssrlgllkcwdyr*dnfqhmcl 

KNFNNWAAYYFVCKI CCaVDPHGCS IKV 

SFFLS f/cffftksrtvpqaevqggdlg 
*i.eplppglmpfsglsl 


374 


14275 


A 


380 




317 


agwfqtpdlr*spclslpkc*dfrg*pp 
cqklflcpikif*trlk/cylntl*qsl 
plmhfkknviyfilykaalfffflrrsl 
/hsvaqagvqwhdlg/s/lqtpspqfk 


375 


14276 




381 


1 


323 


VKRQPrBWEKI FA/ TLYPSGKGLI TRI Y 
KGLKQLEGKN/KSNNr.1 LKWAKED I QTA 
NRYMKRCSTSLI IRE 1 \MQIKTTTRYHL 
TPVKMAFIKR* /GNNEC**GYSEKRTLI 


376 


14277 


A 


382 


2 


248 


TQPLILRCLPPRS lYRFNS IP IKIQVNF 
F\*EKEKSLLKF1QIILKGP*ITKTILRK 
KKVDQKTFPDFTMYYKATVIKTVSWYQ 


377 


14278 


A 


383 


178 




HNPFLAISFCFFPFFFFIfiCWL/NSFAQ 
ArVKWYDHDLLQP*PLGLKRFSHLSLLS 
SWDH 


378 


14279 


A 


384 


312 


25 


FLRGVFFFFFLRRSL/DSVAQAGVQWPS 
FGSLQAPPP6FMPFSCFSLL/SFFFFFL 
VE/MGFTMLARKVS1S*PFGPPASASMP 
VEITGMSHCLANMFF 


379 


14280 


A 


385 


1 


849 


FFFFKQTKFUOiSKYKNIIKKS/SAFLY 
ISNYLKMKFKKIPST*L*FEVNLTKKLK 
HLTFySKEHYTN*VTHKWNNITHS*TGI 
FNS * IFVIHKMI CRYMATS IKI PVTYFI 
DIF/BKAYLKFIWYHKTP* rAKAIKTKE 

GQWSRRKREQKyiSVFTAN*F*IQVTFF 
FKGNNSIFNK*CLENFMSTCR\KKK*DP 
HLTPYVKINSK*ISHIiNVRPKTLKLL\H 
QKIE*KPHNIGLGSKFFDLT*ISQDTKG 

RTSQSDHF 


380 


142S1 


A 


386 


3 


318 


LREMQNALESLSNRTEQVEETTSELKDK 
AFE*TQLNKDK\KKRI*KNE*RLQEVWD 
CVK*PDriRIIGVPEEEGKSKYLENMFEE 
1 1 BQNFPGLARDLD I QI QVAQR 


381 


14282 


A 


387 


1 


382 


FTPTRTAWKK/SKDWCWRGCGS IGTLR 
HCSWECKMAQLLW*TV/WTFPRKIRQPS 
DVCCDMVIGLPYDPAVLLLGICPREMKT 
YVHTflDMSVITSVSLVIl\ADSEIINLNV 
PSADEWINEMWYIHTVDY 


382 


14283 


A 


388 




341 


HKLENLENIVKFL* TH/TLPRLNQEKI Q 
TLNRSITSSKIECSLIKNTPTRKEKKK\N 
PGPKGFPREFFPHA\KKQGVPTLGNPPQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 

dinger" 
last amino 

residue of 
peptide 
■ sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, ]VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KFRERGSLLNYFYKPGLFRl PKLGKNPR 


383 


14284 


A 


389 


168 




KKFFFFSFFF*GDRVLLCHPGWMAVMQT 
RLTAAS \ T PG LKQS SHFTLPS TAG YTG 


384 


14285 


A 


390 


337 


I 


TGITPLWPLEV\ PLIWTSVLLASGVS IT 
*AHHSVIESIGNLIIQALLITILLGLYF 
TLVQASE*FESPFTISDGIYGST*LVAT 
GFHGLRVIIGSTFLTICSIRQLIFHFT 


385 


14286 


A 


391 


1 


235 


LNFSyESSMYFALFTIVF\WVFLNF*KF 
FMN* lYHLCYVL* YFLLV\FVCLLTWFM 
VFCFFFFFFFFFFFFFFFCFFFS 


386 


14287 


A 


392 


1 


258 


S CDRLFANHLSNICELVSRKYIYIF*KSQ 
DST IRKQTDKK*AQEIJreRFSTKDIjQMG 
NKHMKRCSTPLAI/REMQIKTMLRYHCI 
PIG 


387 


14288 


A 


393 





317 


LAYCNLCLPGSSDPPTSSSRVA6NYRG\ 
HHDSVF*RAEDINMHEigFISFLFLNRD 
EVSLCCTGWS*TPGI.KRaSCLDLPKCWN 
YRHEPLCLAFFFNGSWFQCQ I 


388 


14289 


A 


394 


91 


408 


LGAEFDVRAYLTSGRLTGPGVPFLTGFY 
SKDHll\ETMSriSYTl!lA*MiSlTLIATS 
LrSAySTRIILLTLTGQPRFPTLTKnNB 
DNPTLLNPIKRLAAGSLFAGFL 


389 


14290 


A 


395 


3 


165 


Ri!imKisfljbHMtK3fl*EK:dTLiacaJwcs: 

LVQPLWKAVW*VLR*/LKTELPFDPAIP 


390 


14291 


A 


396 


661 


1 


LCPLSSFYRKCABLGFWFVCLFVFQIRV 
SLCCPRWLQTPGLRSSSHLSLPISWDYR 
HLPPRLA1GAVFLIFVLMRTCLR*FSDP 
ISLSFE*QQRN*miLISLL*MVDICLT 
QFTKWIFL*PKNS**LKRQCKLPKITEL 
/PK1SIRIQIYPTD*MSSIPHSLSFYLVCL 
FVCLF*MESCFVT/TMaAVQWHDLGS\L 
QPLPPGFKRFSCLSLLRSWDYRRPSPC 


391 


(4292 


A 


397 


320 


I 


PDSKQQIFWVDETVFC/WICKMPSRTFLA 
REEKSMPGLKASKDRLILLVCJRNPAGGF 
KLKPVLTYH/SEHPRAIKNYAKSVTVL* 
KWNSKVWMTGHLFTALNVLSPRRAL 


392 


14293 


A 


398 


1 


158 


CIGPMWENRLILGGRGCSEL*SCHCTPA 
WAT\SKTLSQKEKKK*NMRKVMIQC 


393 


14294 


A 


399 


264 


1 


LINEFSQVAGYKINKQKSLVFLYTKY \ Y 

skiisek* ikkaipftipakkkeikylgi 
nltkdvkdlynenykilkkikdtVkkwa 

DTPC 


394 


14295 


A 


400 


3 


343 


heqkrqskvrevr3lsqglnmdrkrwsq 
dlnpgcrtvalspyhhtrlalncpgrqw 
fvgvnff*rkkiffffkfwdrvllchlg 
wsawpswltaal isw\vk*s frlglls 

SW 


395 


14296 


A 


401 


1 


345 


gtrkntdhtkcr*vceetgyfihcwwgy 
kmvyplgklvwhflkkvsihlpygtsal 
lslmiekltft/chtktctqm/fhwlf 
pi vkkwkqpkclp vgellnklwyis iht 

lYSAI 


396 14297 

1 


A 


402 


102 


355 


DRVIRLAHFCIFGRDRVSPCCPSWARTP 
GLKRSTSLSLPKCWDHT*ATAPG/LRAI 
LLYFWDYRCLSPDLVNFACEHFSVELVA 
FF 


397 1 14298 




403 


156 


3 


NWKIsrGFP*SMLFQPSNGLVSI *NI \PLK 
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.minoMcid ding to 
res/due of ! last amino 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoleHCine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serjne, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 










1 
1 


* Q KW FS FFFE ERVS LFHPGWSAW* S WL 
TACSLDLPAQVILPPLSLPNSS C 


398 • 


14299 


A 


404 


I 


364 


GTRSEVIYKLLLQTPWV1*HMRTRLLLQ 
ACGAR*MWMIL\MIILGLVTCRLTIYC* 
WRDVTRESTYQGHHTPPVQKGLRYGIML 
/FITS*WFLSCVI \WVPF 


399 


14300 


A 


405 


3 


365 


heqtshni plsqspi*s kilmllnsmka 
ergeeaaeekleasrgwfmrfkkrshvh 
s \ kvqgeaanadveatas yp edlvrt i d 
edgdtnkqifXnvwktafcwknmpsrts 
trreekqclp 


400 


14301 


A 


406 


3 


356 


HERSDQLYAN/KLDMLDKRSKFL\EGYK 
LQKLTQKEIQNTNRPTTRI/EISKKKKK 
KKKKKKKNFS PGGFTGE I PPTPKGQFKK 
I FKKFFKNLGGEKTLP I * I YGAG/ I KLL 
PKIKK/DPSKKK 


401 


14302 


A 


407 






YKYSLTPQKLYNHS YI KKKHWKQNNRMH 
STTTTTK2PPPPGFKRYSCLSFPLS*DY 
RCTPS CPVNLF/ CVFLVETGFHHVG 


402 


14303 


A 


408 


3 


360 


HEVRLMDPAFERNEGGGENEEKVDWLNY 
RMWFHLIF*AECYSICRLHAYYSCILGP 
VLLLLFVLIILLFCCIi/SFFDYF/VFMF 
FIFFYSFFFLLFSLFFSLFL/CFIFLYF 
FIF/CFFLYFFFP 


403 


14304 


A 


409 


388 




ALLKFPFPKVLKKPP*GSFFP*GF*LLS 
/LI FPPYFGRETLFFFF*NKVPLCPPDW 
SSiyiflESRVPAGSVFPVK\ESLSLSLLSV 
PPQVQVNGFLTFFFFFFSW*RQGLPLLP 
RMVWNSWAQAILPLWPH2VS 


404 


14305 


A 


410 




386 


VFNAEESAY/YWEIMPQRKFIR/EEKQA 
P*FRQERIS*YCRNAWFTIRTTHICKP 
AISIPQALKKKKKKKHQI.TVFWLYTKKS *T 
MRTLFIiDWFCQCFVPKVRKYLASKTLPL 
KVLLILDNAPGHLKPHIEN 


405 


14306 


A 


411 


2 


417 


AHHIFTARIDVDALAYFTYVTIIIDIPT 
GDEPFS*LATLR*INMK*SGAVL*ALRL 
I FLFTVRGLTGIALPDL*LDI VLHDTYY 
VGftHFHYGLSVGAVFAILGGFID*WPLF 
\SCYTLDRPYAKIHFTIIFIG/VDLAFF 
P 


406 


14307 


A 


412 


295 


397 


WQWPGTVAHRCNPRTLGS*GGRV/TLRS 
GVRDQP 


407 


14308 


A 


413 


437 


3 


PGF6SLI6NLIPASGNGRKSKSCVCVCV 
CVSVCFKSCT*SLCEHLFTCLCPQICVR 
LQLMV/CPQNCVCWCPESGLFLDESVCV 
RLCVCAWMfl.VWMGGSGSG*VCGCGC\MC 

ICVGPLLDSELCLCVC/LLGQSKCDCGL 
RCRLPAWCV 


408 


14309 


A 


414 


2 


392 


HLQFI FFWI LKI FHYL/ FLWFWFYLTSM 
ALFYLQ*QKRiSrCMYEVFNRGLFF/SCGE 
GVSGSPASSSSSPSCSSSTRGGGAVGGG 
GLGFVCFLLLFWGFWFVFLFCWFLVCVG 
WVWCVWWVFFLLGV/ CCWCFFF 


409 


14310 


A 


415 


376 




GFQASKDRL/LLLG/ATAAGDLKLKPML 
I YNSKHPRVLKWFAKCTLPVLYRLYR* K 
NYAWMTAHLLKSWFTEDVKSTIQ*KISF 
EMLLI INNVPGHPRTPMEMYKELNFFXM 
PANTS IP*PMDQGIVL 
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Predicted 
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ing to first 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=GIycine, H=Histidine, I=lsoleucine, 
K=Lysine, U=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


410 


14311 


A 


416 


409 


2 


FFPIKKKKFAGALVFFFFFRKIFFFFPK 
GEGKRGVLVSLNLLPLC*KDFFSPPPPG 
EGVLWPPPPGPIFFFFFLKKGFCLFCQG 
GFYNPSLYLIPLSPFKIHtG+TRGFT/LP 
PPALFFYFFFF-' DRVSLCHPGWNA 


411 


14312 


A 


417 


425 


3 


EE LLAFWQNFKLRPQGPFFS P \GPGGEK 
KR*SLPKTPPKGGPPGP/SPNGRLPSGG 
QLPPPRGG/SPQGPPPLPKGPQGWGPPF 
P*RSPPCPPHRIGVPQVSSSPTPGLVFP 
RGPLNPGNLGGTKKKKKISSKAARDLEL 
VRTRG 


412 


14313 


A 


418 




346 


LLPDRWLNTTFFD\PAGGGDLILYRHLF 
* I FGHPEGYMLMLPGFGI ILHIVTYYSG 
KKEPFGYIG^^V*AMISIGFLGFIV*AHH 
I FTV * I DVYTRAYFTSATI 1 1 AIPTGVK 


413 


14314 


A 


419 




382 


LFSTNHTDIGTLYLLFGA*AGVLGAALS 
LLIRARLGQPGNLLGNDHIYWXGTAHA 
FVINFFIVIPIII\GGSGN*LLPLIIGA 
PDMAIARINNISFRLLPPSLLLLLASAl 
VEV*SRTG*IIYSSL 


4J4 


14315 


A 


420 


26] 


378 


kkkfjfj'*klnff*kffliffppkkkif 
fkkkkkfffykiff/i*kniffspqkni 

*PFLFFFI* YPFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFTM 
SLYFSGRVGGRV 


415 


14316 


A 


421 


124 


428 


KPAATHACATIFMCLDQEAIISN*H*AT 
TQTTEVSL\S FKLDYFSI I FI PVALFVT 
WSIIEFSL*YINSDPNINQFFKYLLIFL 
ITILILVTANKLFQLFIG 


416 


14317 


A 


422 




355 


PVFSYKHSTLLrFSLSLSFFFFFFAPGK 
KGEKNGKPGPFRGGFFPKKIKKPFPPP/ 
GGPKKPPGPGFPQKPHFPFKGGPWPSKT 
LFGRGGKPNFGGPQKPPLV*NPPSQGP/ 
HGPSGGG 


417 


14318 


A 


423 


356 


3 


KPLGIDLTKKVKKLSKKNYKTLMKKIED 
DSN*KDNSCL* IR/MTILPKAI YRFNTI 
SVNIPMRFLPDAMADAW 


418 


14319 


A 


424 


2 


338 


PSVRLGGHWPRSGITPLNPVE /VGLLNT 
IRLLASGVS ITGAHHRLIECNRHQI IQA 
LLITILLGLY\LTLLC3ftS*YFQAPFTIS 
DGIYG*TFFGSTGFQGLHVIIGSTFLTI 

CF 


419 


14320 


A 


425 


55 


342 


GPFTPWSLC»GDLQR*P/RAVKFFLKKK 
KKKKEGNP I HN*RD 1 SS *FLKNLETAVR 
NFWTFFSYFKSKKLIQKSDHPLKNVKKI 
LQNDKRYLKIDWG 


420 


14321 


A 


426 


2 


367 


DRRRFCTYHKDlGaLYLLLGA*AGWCT 
AVSLLIRAELGQPGtrVLDGNDHVYlIVIV 
SAHAFVIMFF\MEYPIINGGFGN*LVPL 
ISGAPDMECSGINNISPWLLPPYVLLLL 

ASAIVEAGSRT 


421 


14322 


A 


427 


359 


74 


I CADYTRKPPYMGAPIEFSCATCVL*LT 

STQRECVCVCVCVCVCVCVSM*VALIPD 
SN/CIPLSAYVCVCVCVCVCVCVCVS I * 
VALLLDSKSYH 


422 


14323 


A 


428 


52 


361 


nypqlsei* **vs*lhy/lfsflffyff 
*lvsrfvarlecsgawhUiahcsldll 
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Predict- 
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correspon 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^=possible 
nucleotide insertion 














GSS /DSSPASASLVAC5TTGMCHQNQI IF 
LFKKVFVGRVQWLTSVIPATWE 


423 


14324 


A 


429 


105 


361 


SRLFFFFFFFKTNFGFVPKVGVK/WAFF 
GLMEPFAFRVK* FSGLTHPRTWNYRNVP 
HCPVNLEF* VKTGFNLVDKAGPKLLT * K 
DFP 


424 


14325 


A 


430 


1 


379 


HAYH I VKPS P * PLTGALSALLMTCGLAM 
**RLHSITLLILCLLTNTL\SIYQ*WRD 
VSRESTYQGHHTPPVQKGL*YWIVLFIT 
S E GVF FAG I F * AF YHS SLAP TPQLGGHW 
PTTGITPLNALEGPL 


425 


14326 


A 


431 


1 


390 


KKVKKWKNLIFMIGRLNNVNMSVLHKVI 
YKFRAICLR/TFHFHRKKKNLTWA*KHK 
KALIPKTILKKNGKSGGITLSDYKM*YK 
TTITTTIRFWWKF/DINKYNKITTHEIY 
FHIYGQMYFFKIAKTFQGIK 


426 


14327 


A 


432 


1 


392 


TRTSGRTQWD * T * LRPI /WTKKKKKKKK 
GKKRKKKKGVFFFYFFLGF*FFLWVFFF 
FFFFLGFFFFFWFFFFFFFFFFFWGIFF 
FFFLLIIYNFFCKF*FFFFFFGFPNFFF 
FFFWFFFFLDFFPYPPFPRV 


427 


14328 


A 


433 


382 


1 


EGGKFQNQKQKRKVSPPPPKKGFFPPAA 
PKNIKGGGKK*TPPKKGGVS\PPPPKKR 
KSPPHKK/IGNFFPPPEGKGGPPBCWPKK 
PGPPFFFFKNPPPFFFFPFFFFLFFFFA 
HKDGLLARECTQAEVKT 


428 


14329 


A 


434 


375 


1 


KYSQLIPDKTAKAI K* SKD CLLKEWFWN 
KWTSTHKSLDL/SPFTKINSK* ITGLNL 
KCRIITLLENNIEENLDRLGFGNDFLTT 
LPEAQSKTELISY/DFIQTNFCSANATV 
K*MKEC2ATEMEKIF 


429 


14330 


A 


435 


1 


459 


PTRPPTRPSTRTLGFTMLAKLGSNARPY 
/DYPSYSASHSAGITCVKHCARAVIHDF 
NGVHEPLLIW*EFLVEISFIFDFFLETN 
PGFITKGELQGHNLGSLQVSPPGLTLFS 
CLSLQKSGFYG/HLAKYLVKF/CPGIFS 
KRGF*GGYPGGAQSPBP 


430 


14331 


A 


436 


14 


382 


MARNTSQKDIRIDPlMKCLWLVKKBaCKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKRKNYFHVI \LKNKAVltNGSTITS 
TNVMSPGLHIRLI IILSLMIYKKSTTE* 
VEKQTCYYILM 


431 


14332 


A 


437 


3 


192 


TPGLK*SPCLGLPKCWDYER/AAAVPGL 
*DIIiaiFFHHILSESCLFLLVFIHCEIJA 
NLFLRWSS 


432 


14333 


A 


438 




364 


DRIALGTVr)DLPGRPTRP\AHPIILKVS 
LADRDAI *1ILWQ I P XVAS* YI PLGF /WS 
KAMPCSVDIYSSFEKKKKTVFFDRGQGP 
TPEIQVLGKaEGVKPFEPRGSKPSWRIM 
AKPKFYKRFKL 


433 


14334 


A 


439 


335 


2 


GMIPEMKGNAGP/SG/PRKPGFFLG*KG 
KVKAMLNFKPPFEAFWPKKKILLPLGFS 
GILGPGPKGNLSRPGPPPQAFPLGFCQV 
LGP/SGPGVGFPKPGQIGWSNNPCLTLQ 

KNK 


434 


14335 


A 


440 




333 


HKLEHLENIVKFL*TH/TLPRLNQEKIQ 
TLNRSITSSKIKSLVKNTPTRKEK/SKK 
PLVPKGFTIEFSQLCKKLRVPVFMKPFQ 
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correspon 
ding to 
last amino 

residue of 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KYGERGLFLNSFYEPGLTLI PKFGKNPG 
G 


435 


14336 


A 


441 


320 




WGINIJWLSP*EKKGRPFLL\SHPKINSK 
WIKDLNVKaKI*KPLEGQIG*YLMIFGV 
EKGLFKGGPKYTLYKRKY*NRELLFIKR 
YFSLTEKSPAEICHIKKLAKKK 


436 


14337 


A 


442 


8 


391 


ERILRHDQVGIIPGM/QGWVRI*ESVDE 

I /qhv/nqlpkithtitsvsvgkardkf 

NIYS*FKLGKTEKERNFmWIKSICTSP 
AAS ILLEGKESPF\ PFRSGTRKD/CLL* 
P/LGSGMVLEALGTAERRTRKEK 


437 


14338 


A 


443 


363 


2 


HHVGQACLELLTSGDPSNLASQSAEITG 
MSYRAQLSIVTFSAYFLW*KLSHAHGL 
ML*QSIII/HVLHFHQVKKAFHVS/PEN 
SQPLHNVEPEDWI F* EHQRKTVPP IHTA 
RKLQDLEPCLV 


43S 


14339 


A 


444 


1 


900 


DSSAGIT/GICHHAQII/LFVFWETGF 
HHVGQAGLELLTSGDPPASASHTGGDYR 
HEPPLI.ASLSFLNKELCTWPERRRKPIF 
SLPKLDPNKRKFSFPP*LF/IGS*TLIW 
SLFCFRWVCTQLADPTKSIAYQSSLMKP 
FGQKSICFGSEKFPDKVYLFSSDRKAKE 
QVLWWWEY* I * DMSACFQ * ALGF CY 
CLSIWN/YELPEKFEVIjACSLPSRNNDL 
I LSLKKKSQNSFFVCVFFFFKTKSNFVP 
QAEGQGPLFG * LKLPLPGFRWI SCLNLP 
GSC\QTGAVPPPPVWFGFLKKNRVSP 


439 


14340 


A 


445 


3 


346 


QIGKVKKLNK*VPHEPSKNKL\FLEASS 
LILCMNNEPFLSRIVTWDENWILYDNH* 
QPAQLLD*EAPKPNLHQKKKKKKKGLAP 
FWGAFSRGNPFYFFNPQQKLHF*KFSSQ 
KRG 


440 


14341 


A 


446 


2 


246 


FKCGKTRRLMHCWWEGKMLRQL\RKVWQ 
LLAMLNMELPREPAAPLLSISPRERK/S 
TTAKTCA*MCIATVCITVKKWJaiRLLG 


441 


14342 


A 


447 


38 


393 


VILHRQGLSL/VTQAEVQWYDYNSL*P* 
TPGLKQSSCLSFLKSYCHG*LFVWWF 
KWMXGFTMLPGLILNSWPQ/CNPPAVDA 
QIAGIRGFHS/VGQAGVQWHDLSSLQPL 
PPGFKQFSHLS 


442 


14343 


A 


448 


54 


540 


ripfehasgflqsshqkphcllhplsgq 
vssdgqXfrkfgisrlgwsgiykaafpl 
hdckfrrqsedpscpneryllyrewahp 

RSIY*KQPLDLIMKYYGEKIGIYFAWLG 
YYTQMLLIiAAWGVACFLYGYLlSraDNCT 
WSIEVCHPDIGGMIIMSAQRDRL 


443 


14344 


A 


449 


2 


310 


FFFLRQSL/DSVAWAGVQWRDLGSLQPP 
PPGFK*FSa.SLPSSWNyRHAPARPANF 
FLYF**RQGVTMLTRMVLIS*PRDLPSS 
ASQSAGITGVSHHAWAKISL 


444 


14345 


A 


450 


2 


466 


KQKIFSVDETAFYWKKMPSKTFHS* REG 
TASKLHTAG*ILLGPNaaSDLKLKPVFG 
DDSGNIiRALKNYAESPLPVLYKMNNKAW 
MTAHLFTAWFTEYFKPSLRS\FRKISWK 
I*LFMDNGPHHPEALMEMCKE/NAVFMP 
ANTTSILQPMDKGVILI 


445 


14346 


A 


451 


670 


212 


SSSSP*GS*YQNRTNTIPSPKKKQTKK\ 
KNNHYKS IFFTDAEILSWlUUSaiQQS I 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lyslne, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glntamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KR FI RHEQLG KILWLGWVNIQTPTNAT 
CYTNRLKtCRNHMTILINAEVLYKIQPSV 
I*KLFPLRGAF 


446 


14347 


A 


452 


407 


219 


PLISLRWENHLGPGVQGCSEP*LCLCTP 
AWMTE*YPISPSQNKTKKQ\HTRKQNQH 
KNKCVKN 


447 


14348 


A 


453 




395 


wflwfrershlh/ rvqneaasadveaaa 
s ypedlaki idag\akqivsveetafhw 
kkvpsrtfitrekttsai,kg/rc*lslv 

DNAAGH F* VEA\ MLIYHSDNPRTL/KW/ 
YAESTLPVLYKWNNKAWMTACLFTS 


448 


14349 


A 


454 


424 


38 


EETEPL/HRPISSAEVELVIKNLPT/KH 
KSPRPDGFIAEFYKMQKE*LVPILLKLC 
QKIKEEGLLPNSFYEASISLIQ\NSGRD 
TV/ RKENFRP IFPMNI FAKI LNKILANC 
I QQH INKL IHHYQVLFSSLHSR 


449 


14350 


A 


455 


2 


309 


prvrsqtpghkrstclglpkcwdyrhep 
lhpatasflvaavpgm/fadpp/cnmhl 
he * mme * mngddas e i ls femrshsvtq 

tggqwcshsspqpqppgppk 


450 


14351 


A 


456 


3 


441 


dawglvldrerpffifffffflgnggyf 
rgpggrpggegp pmetsvspvkki lrgp 
gqgggkarkphplggprgpnhktgg*kr 
ayptgenpvltekpkftgpggngpetpv 
igkagagkplnpgXmggsrnpkwahcp* 

TGGKKGK 


451 


14352 


A 


457 


1 


234 


ptrpnhlelgdrgcreprsshctpawat 
erdwvskkktkg/ekf*kgrtk*ms*ve 
qggsggkgi ftlai snsplp ff 


452 


14353 




458 


42 


470 


KRIPQLKKSPLPLKWPPGEWVGKINFPP 

PRERPKBrFF*KKKTLKTPPKQKPLRKKS 

PFKKPHFLLYSKATKKQKRGKKKRAPPP 
KKKKKKENPKIVIFRTEITVASPVLSWA 
VKPIIHI?FPRBKKPHA\KPPPP 


453 


14354 


A 


459 


2 


393 


DRPIEQWNKRERPRINLYIDGQMIFSKG 
TTAFE/WK*NLCNKyVWNAWIC\NRRKN 
PYLTSSEKINLR* I IDIKVK/PKftlKLP 
EQNIRCSL*VLGVGRDFLENM]!JYTRKK\ 
IGKLDFI KI SQLGMVADTSNPI 


454 


14355 


A 


460 


338 


33 


GDEKIEKSFFFPISSLCFVP/AYIi*QQI 
RlilFTRNSRLLFIY/CaFLEMGSHFVT 
QTGVQWCafflSTLQPRPLGLKQSSHLSLL 
SSWDYRHLPWLKWTEACAV 


455 


14356 


A 


461 


77 


435 


AKEVEASLSTLAEPISIQQQQQQQQNEK 
KNKN* PRI AKAI \LS *KGE ITLP/ELQL 
CYRaMITICrAWY*HKNHHIDQWlJRRENP 
ETNPHTYSELIFDKGPKSI \SLFNK*CW 
EYWIFICTR 


456 


14357 


A 


462 


157 


3 


NGRVDLKIQKIiRRCGGAC3:.QSQL/RQEN 
HLNPGEKGCSES*LHHCTPD*VTKQ 


457 


14358 


A 


463 


363 


3 


PSVAQAGVQWCHHSSLQPQPPRLR \HPP 
ASA/ SQ* LGLQGQATALYIHKYI LFPCN 
VLISFIFPFFFLEMRSHSV/SQARVQWH 
DGSPQPPPPELKRFSWEAELAVSRDRAT 
ALQPRRQ 


458 


14359 


A 


464 


3 


396 


LKEIAEKVKKI IKFKKQLRL*MKK1 INR 
YLKEN/ LNQLLE I KGTLRELQNAVE S FN 
NRL*QIEEGISKLEDKAF*WTESGKIRK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

Ctil 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucin6, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, l=possible 
nucleotide insertion 














INK * TK * TDY I KE PNLRI IDVPGGEEKA 
KSLKHLFEVIIEENFPGLAR 


439 


14360 


A 


465 


399 


1 


ppgvllngppffffflilgpgwvfgqpl 
ktplvffpnfqyhifpfkisrif/lptr 
fpfqvpplcp*pfsspwglkwfillsf 

FFFL*DRVSLCHPGWNAVM*SQLTAASN 
TW\VK*SSHLSLLSS*DyRCVLS 


460 


14361 


A 


466 


279 


1 


TTNIFNPHRVGSTDAGPLNMKEKEKENV 
KRRQATGWEKIVAKDTSDKGLLSKIYEE 
LLKLKNEKTNK*/IIiCLKWSKGFNRyFA 

KEYIQMAIvlK 


461 


14362 


A 


467 


2 


436 


RGELPQLDKYLKKTTAIILNGEEL/E/A 
CPLRSRTRQGCSLS PLL*KKFFPPKSHI 
ESLLFENIILEVLYSAVRQENEIKIVEI 
GKEEMKMYLFTDGNHLCKNSERIRN/YS 
KVAGyKVNV*T* ITFLYTSREQVEFEI K 
KTLSMNRFF 


462 


14363 


A 


468 


2 


420 


RTTALF*AVRQGRLSLQRLLLSFC\CLC 
PAARGEAYIG\RQASMSRCGLQAVQAS C 
LLCLPKQAWAMAGAPPSASLLPCSLISD 
RCRSSQPDSVGVGPSEAGVGYMLWRGL 
LSRSEKR\NIRLGVTRFSRCV/LSPLSL 
TRR 


463 


14364 




469 


-3 


416 


PSP * PLTGAI,SALLKTCGLAM*FHFHS I 
TLGir,CI.LTNTLTIYQ*MRDVTRESTyQ 
GHHT\QAVQKG\liRYGIILFITSEGSFF 
AGLV*AFYDSRLPPTLQLRGHWPSTGI T 
PLN\HLQVAVLGEPVLLAS*VSIT*ADH 


464 


14365 


A 


470 


211 


398 


I FFFPKMGGGLS IILCWRRKRHPQKLK* 
STCLGIiPKWWEYRCEPP/VPGQKKNFKP 

* KTKESPPLVSGGASSASNIKL* lALPP 

FLEKKKFKKQGFKDPPFPLFFTQMPGQG 
WFF FGKI * y CP P 1 FFFFLRRS FTLVAQA 
GVQWKDLGSLQPPPTGFKRF 


465 


14366 


A 


471 


62 


424 


TLMHCWWKWKMVHLLWKTVLNLLIKIN/ 
RTLKHT* PC3SIPAIPLLGVSLREMITYVH 
KKPCP *MFIVALFMVTKNSKH]:jKCSLTC 
ECWJNKLWSLyTMKYYlATKRKELUIHtlR 
SCRNYSSDMS 


466 


14367 


A 


472 


427 


20 


LGLRLPFVEQEENKPyyPLAPFSPPEPP 
MSLYKNSPiyPSKGFLPPSKNHKIPPLN 
PLKMSNLF*/FALLLGNKDFHLLLLGGP 
IGIMGDMDJaTTSQMFYSRVPIFF/CFFE 
TGSHSVAQAGVQWCNLSLLQSUPRIiK 


467 


14368 




473 


424 


98 


NWyLyYyYYYHLLF'RQGL/DSAAQAGV 

qsjhhhcslqprrhrlkqilppqllNtsm 

nyrhkpprlspyvaqagii*llgssdlpd 
svsqsagiigneplhlasl*flfafp 


468 


14369 


A 


474 


415 


75 


KHFLKFFPQTQANGPGGKNKFFFLKTRF 
CFFPPGKEPWAYYKSLQPPNSGGQQIXS 
APTP*IKGAPKGGPPTRVKFFFFFLF/L 
RWSLRSVAQAGVQWHDLGSLQAPPPGFM 


469 


14370 


A 


475 


3 


413 


PVQKGLRYGI \ILCITSEV\LFFAGFF* 
AFYHSSLAPTPQLGGHWPPTS ITPLDPL 
EGPLLITSVLrASGVSIT*AHHSLIEN\ 
NRDQIIQALLITILLQLYLTLLQA*EYF 
ESPFTISDGMYGSSFFVATGFHGLHVI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iVl=Methionine, 
N=Asparagine, P=Froline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=X)n]inown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


470 


14371 


A 


476 


1 


440 


ITLTDRELSYY*A1IRLL*LAYTITFIV* 
I PLYGLHL* L\ PKAHVEAPIAGS I VLAA 
VLLKLGGFGVIRLTLILNPLTKHIAYPL 
LVLSL*GIIITSSICLRHKDLKSLIAYS 
SISHIALWTAILIQTP*SFTGAVILII 


471 


14372 


A 


477 


2 


397 


LFVYNHI DVGALYLLFGA* TGVLGTGLS 
LL I RAELGHPGNLLGNDH I YNVI VTAHA 
FVIILFIEIPIIIGGFG\N*LVPLIIGA 
TD^4flW PR INNI 3L * LLP P YLLLLLASAI 
AEDGAGTGGTDYPPLTGNYS 


472 


14373 


A 


478 


442 




SSSSPFTPPRGKFFFKKTPRKKFFS/SP 
GNKGFFSPiiSP*KFFFFFNPFFFFGGFF 
PNFPPPKKNFFFKKSPGFFFFPPLKKKI 
FFFPPPLNFAPPKVFFKSPPPFFFFFFF 
FFFFFFFFFLRGHPWGREGI 


473 


14374 


A 


479 


1 


439 


PTRSPTRPLVLDRERPPFFFFFFKKGPP 
S VP PAGGGGANLG * GNP PP L/ GVKKFFG 
PKPPKIGE*RPWPPPPGQPECSF1''LKI«I 
GLPHGGQGR*N\PPPRP 


474 


14375 




480 


420 


0 


YSPPSPPK/YRAPGKKFF*KKPRKEKF* 
KKKILGFFFPLSPLKFFFFPKAFKFFGG 
VGPNCPPPKKRFFSKNSPGGFIKPPLKG 
Kl^FFF PAP VKFGP PRGF FKGP P S FFFFF 


475 


14376 


A 


481 


371 


2 


NKI*NKFSKLNSKKSSSSSSSSSSSSSS 
SSSSSSSSSPSVNK*IKI<:CSTSLTISSS 
SSSSSSSSSPT*IAKIKNIDSPKCWQGC 
RETGLTV\LVC!PLWWQFLLKLSMQII*D 
SANMILSMHARG 


476 


14377 


A 


482 




335 


HASGKDRHTDQRNRI KNPETDT* lYSTF 
*QKC/RLI*WRKDSLVNIOMC*SNWA\SP 
MKKIKIiDLSSSSSSSSSSSSSSSSV*lIV 
KLLGN1IVGGNLQYRGLRVHTVDVKA.QHI 


477 


14378 


A 


483 


1 


418 


GVH*FSPL1MPPSRWGPKHGPPNL\LNPF 
FFFFVFLVETRFHPVGQAGLELLGSRAP 
PASA/FPK 


478 


14379 


A 


484 


_ 


356 


FCSHAVRSiyilKTVLIDKATKP/RRWKGN 

ykyhlpvfncktgrtVdsgnplnwfyqc 
fvpeirkylarvqlpfnvfliiidnapgh 
pephepnte6frwfltpdmpliqpleq 
gvlrtlta*yqcfvpeirkylarvglpf 
nvflildnapghpephefntesfrwfl 
tpdmpliqpleqgvlrtlta 


479 


14380 


A 


485 


166 


406 


f i fwgs vs s saegggws ws s lprvtvr 
pdetvdvtihllkk* crpgavahacnps- 
tlggrgqri/tlrssvqdqpsqhg 


480 


14381 


A 


486 


107 


429 


fwvtqtfgffwlnppgglelwppppcps 
nfggnfkkkggpplwpqgvqtpqprgit 
ppgpprggk*rggpp/spgpgplgkpkg 

GGP 


481 


14382 


A 


487 


399 


2 


GSPAPGLPKGWGPRfiGPPGPeKFVFl^ar 
PKGGFPGAEIFLGEPFQRGKKGLGPIFP 
PVFLKTEE6GTLS/NSFFKGRVFLVFKP 
KAGPKKKKKKTNYKP lYMHTDTKFFNN 
MLANRI *QCIKEViyiQYDRTRG 


482 


14383 


A 


488 


419 


2 


PPPPPPPREKKGGLYFNPKIFLGKSFPP 
RNPKDPS* KPRVGIFFEGPKPPKKKKF* 
FGPWGKPPGDG*RK1IPFK\PFFFFFFFF 



545 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 

correspond 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucinc, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














* DGVLLCRPGW lAMAQSRPGLQEWNS I S 
KKHSGPCVLGISSKEVPDAWADAWADAW 


483 


14384 


A 


489 


629 


0 


SSSSR*SLLCKCFK1I*GEKIRNPIEKWA 
KVMHRLFTKRCINI * KDVRSTSLMVHET 
*ITTTLRYHFSPIRLSKT*KLNRTLYG* 
DCGETGIFIHCWWECKMF*VL*RE/VWQ 
YLIKLL/LHIPVDTA 


484 


14385 


A 


490 


2 


351 


KNRPEMDPGM*GY*VNDEGDTTYQQGNE 
ELRS/WCWDDMRLAKVKLDLYFPPHITX 
E INSKWI *DLNIia(IPPIHVLEEMm/ 1 * 
F*HVGIGKHYLTI*LKI*NP*K**IWLH 
ENKMYCL 


485 


14386 


A 


491 


42 


507 


NLAK* IQ** IQTIMHHDQVSFIPGIQGW 
LNILKSINIIYYTNRL*DKTHITISIDA 
EKAFDKI *YLFMKGK\KPLSQLGrEGII/ 

YLKLIKGIH 


486 


14387 


A 


492 


18 


417 


REGKKSKVHHFNIKQGKVMSTKH*KTQL 
/ SHSNAHSNKAQISKSQPHGLHDFFKKK 
KKKKKKKKKKKKRGEKKKKKKRGKKKKK 
KKKRGGGPP*KGVREALFCFFWRKKFFF 
VGGGGGKTPLQCLQjyDTPLWQA 


487 


14388 


A 


493 


413 


82 


FFS SP PP PLPFFFPHLFPP PPNFFFFFS 
PPPFFFF/PPP*KTKFPPPPPIiFFPPPP 
LIfFlf'SPPPPPFlf't'Flj'SPPPPi'FFFFFFF 
FFFFFFFFPPFFFFFFFFFFFFFFFML 


488 


14389 




494 


1 


413 


PTRPPTRPPTRPPTRPVLDREHSPSNLP 

KKKKKKKKKKKKKKKKKKGGGPLKKKPG 

GAI<:ia<:G/EKKKKKFF*KKGEKKKPPGKF 
GKKKI-:iVJGGEI-CRAKT?Qia<KTp--»GKKKI 
LKGEGGKKNPKPRGGKKFFSGEKKKKK 


489 


14390 




495 


274 


1 


lYRIDCAYMKKVERSKISILSFHIRKLE 
N/QLNPK* /HRRRE I KIGAEIHE lEHHK 
* IBKINETKSYFXWKrSKPLAKIiIKEKT 
QITNTRNRAY 


490 


14391 


A 


496 


336 


1 


VFQYTYNKLVSMFYYCFFL*RWGLTVLL 
RLVSNSQ\PSD\LPSSAS*VaQTTnflHR 
HTQLCFTTLSDLLEPYFHLGSFIFSLL* 
FFFIJSTESHSWQVBV*WHNLSSLQPLP 


491 

^ 


14392 




497 


445 


382 


PRVLIFFFLTPRGPPPPPPPNKVFFPPP 
PPQNFFFPPPPPPSWGGFAPK/ PPPPPK 
SFFPPKPPPVFFSPPPBCEKKFSFPPPPH 
FAPPPVFF*PPPPQ 




14393 


A 


498 


430 


2 


3PPPKPTRM*\KVKFGR*KKGGGGGGGS 
PPLSPPLWGSKPVVP*VPKGR/PLPG*A 
GKPPFFLKFQPFPRPGGGPRCPPLFGGL 
GQKWGFTPEVVLLL*PKFSPFPSSLVDR 
RIYCLLKMLEYKSVMMILLFASCLNLYT 
FPIV 


493 


14394 


A 


499 


363 


3" 


KKLVTPAEVLGDI I PENRFLQMPQQREN 
FLCQVWMTKPPTTIFVKTKTGKWYLISL 
KRFCPAKKKT/IKILWRVNRQPPEWEK 
IFANYASDKGLISSLYKDLKQVYKEKNN 
PITK*VKGMY 


494 


14395 


A 


500 


145 


435 


VFMCINSSFLLFIPWYEYTTVWPVTCRR 
T*MFIAALFVIARHm*SRCPSTG]SW/L 
KLWYIHTME 


495 


14396 


A 


501 


162 


1 


FYNYTIIFIYLFIFIFLRRNL/DSWHT 
GV*WRHLGSLQPLPPICFKPFPRLSLP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted ' Predict- 
beginning 1 ed end 
nucleotide | nucle- 
location , otide 
correspond j location 
ingto first j correspon 

residue of j last amino 

peptide acid 

sequence j residue of 
1 peptide 
1 sequence 


Amino acid sequence (A=AlaniR ' ) 
C=Cysteine, D=AsparticAcid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, l=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptop[ian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


496 


14397 


A 


502 


343 




I KKFRWGP KPPLKEPPGV/ S PDFP I I /N 
FGNPGPVPGKDFKFPNFPKKKKWGGEAP 
PWPTTPGG*MGRFP*PPAFGVPRGGDG 
SPPPGAPKEGPLSKKKKKENEKEKERKE 
KSARL 


497 


14398 


A 


503 


23 


405 


kgrnylwekkkknmlvegkigggpppri 
pkf*kvifktppgppffffgpplnffff 
ktdapi.ff*nhppkskiwalappkkkif 
lnpkkpppfffpphffkknfpraffknf 
\afsprggaspsppp 


498 


14399 




504 


418 


247 


PPKWGF\FPKPPRGFFFPPPKGKKFFFP 
PPG*FGPPQGFF*RPPPLFFFFFFFFFF 


499 


14400 


A 


505 


26 


426 


GCTGLLHS*MYAKWCCTDHPIT*VLSP 
AFISRISYSPTHPQLILFF* IFFLFLVE 
TESCHIAQAGLK/LPASSSLPVWRSQSA 


500 


14401 


A 


506 


337 


2 


IPNLKESA/CPNLPKGWGFKD*PPPPSQ 
ISLIFKYPKFLKFEI*KKKPPLKKGLFW 
WV*RKKAPVGF*G\EMflLGEKFFFKFFF 
FFFFFFFFFPRRSFAFVJiQaGVQWRDLG 

S 


501 


14402 


A 


507 


454 


2 


TSKTGQPGRRGSPFPI*WAaGQKRSSPP 
RRGSEAEALLTSQTGWPGRDTPHFPDDG 

RPGRGAPLFPDGARGQRRSSLPRRG/VP 

GRGAP\QSQMGCQLGRGAPKFPDGVAGQ 
RCFPFPRJiGSQVEALPTSQMGQPDRGTP 
HIPDGVAAGQRR 


502 


14403 


A 


508 


390 


13 


RIPPPKSRWKGKGPF*VS*NPRPQr*KN 
FLPPPP*K/YGDPRGPPPPPIKFLPLKK 
KGAPPICPGCFEIPAPRESPPLAPPKS* 
NSRGNPPPPPFFKKKNPLFWGGKTKKLK 
IFFFFFEAGS CYVa 


503 


14404 


A 


509 


2 


282 


WQFLTKLNILPYEPVIMLCIYENESKT* 
IYTKTYA*ILLAALFIMaKTWKQGVLKK 
V\TDKVWHIHMTEY 


504 


14405 


A 


510 


400 


250 


LLSVTQAGVQWHDHSLLQPQTPRLKXHP 
PTSAS*VAGTIGAYHHAWIIFFF\IFiy 
CRDAVLLYWLGWFPKGLLKCWDYWCEP/ 
RMSGLFVFWSDLGFRSEVS * ILSSWDYR 
CVPPCLDNFFF 


505 


14406 


A 


511 


99 


242 


VTKEKE6HFrRrK*LIHQEDTTTINIGV 
PNNRTLKCMKQKLTELKKE I 


506 


14407 


A 


512 


139 


374 


SIiWGKRIFFFFGADPHFVPQAGGA\WGN 
HG*LQPTPVGLKKSSLLrFPISWYYRLG 
TPPPANFKI FCKNGVKPCCPACF 


507 


14408 


A 


513 


23 


40) 


STCLRLPKCWDYRRESPLPALCMLFL*M 
KGKYVYMGFFI \LFFFLFLHKI * LLQEN 
FVYVCLKKNKTKLKA*TKKQKKNETG*N 
ILVDUSrVLS*KKWKCLLWGFILF^F/lIF 
LRQS L/NSVAQAGVQW 


508 


14409 


A 


514 


7 


386 


FFVFSTHITLLFLFLFFFFLFPRPPGFF 
LAGKKNTPPAPPEKPPPP\PPKKKRGPP 
FFFFGPPQKKPPNSPRGGRGPPPPRPPR 
KMEKGGPPPPREIPPPPFFF*PPKTPPP 
PPGGGGGGKKTPPIOl 


509 


14410 


A 


515 


370 


2 


FVFFQWLPE*NS FLFSPF*KGLC*RDL 
AFKRFFPWPPNKKLWPSLF*GFLKNS* K 
FNFSLKSLEVPFFF\CFPKFFPWI*AFP 
RFFPFFPPPP*RG*1FFF\*DGVLLCHP 
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SEQIP 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ingto first 

residue of 
peptide 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GWSAWQFWLT 


5J0 


14411 


A 


516 


i 


163 


NTREKLSGGGGTQIiS * /LLGRPRQKKHL 
NLGGGGCSEPRS\HCIPAWMTE*DPVSK 


511 


14412 


A 


517 


1 


390 


RIWVYYKHLC/Y*GS'TOrPREKIDNFLD 
TFNLSRLNQEEIETLSTPISSSIIETVi 
KSPPT\KQKSPGPE 


512 


14413 


A 


518 


47 


312 


EYTGLII.RLYIE /LFTTQH* KQKALNQA 
K*IiAK6LNKHFlKRGCMNDPKHMKKSSA 
SLAIRKM* IKTMMRCHYILT*MAKIENK 
TESTRP 


513 


14414 


A 


519 


320 


1 


QNP INKQ* KKNNKKYRLV*HKKPPPFTT 
. QPHQSNQHQ\PK* * PQKHT* TKNHHTPP 
PPP*SPPPPSLSPSPPPPPSPPSSSHPP 
LLSPSPPPPSSPPSSPPPRTSP 


314 


14415 


A 


520 


1 


368 


LKTQQEANNLILKWAKDIiNRYLIQK/VY 
RNVKHTKRCPTS \RELQIKTRYYYIS IR 
*RVKYKKRDNTKC* *GRETTGTLVIHYS 
*EYKMV*PLWETV*QFLTKL/E*SY/P* 
DPAITLLGIYSKELK 


513 


14416 


A 


521 


2 


264 


GKKIPMLYFAEIEKCILKFV\KRS*KAK 
* ILPKNNKAGGLKFPDFKTYYKAAaSKH 
QVGGLWKELRSTLQAQQTWVPREEIJIST 
SPS 


516 


14417 


A 


522 


1 


408 


LEKMSTSrAl/RVMQSKTTMKCHYIPIR 

kaki*i]1idnikcwq*cretgpliycc*e 
ct/lvqpl*ktv*qflikvmm\dpafvl 
ifipkk*khmfthkknckhtfraalfvm 
antri ipni fqfvewiinkls ykhi vdy 


317 


14418 


A 


523 


320 


3 


TYIYIKTCPRMFT/AVMLTITKKQKQPK 
YPPTDE*NV\YIHSNAYYSEMKRNEVLI 
HATTWMl!rPKNIMLWKaSSEKRPin:'I*LH 
I YEMSHI IQYMVLCNCIiLLPSMF 


518 


14419 


A 


524 


426 


9 


AELPASPTPGTCTPQPLGSGRDQVP\GA 
VGGTHPGGSGLaGSPP*G6LGMAGCKS* 
ALPRGEVTEAWRQFKCX3ERRQASSTGGP 
GAPSAAAGPGJLKPLTAWGQHHQPAAPSA 
GPAEPLPTQNWCWPASNPGSRTRLSLHT 


519 


14420 


A 


525 


13 


461 


ICIWRKMNILLPYTKLNSRWIADLYVKG 
TTIMFLEVSVREHLHDSQKKIF* *DAKL 
MNQKE*LDILDSINIKSFVH*KAPLREN 
KGKLQSAIHIYEKGLVSRIYEELLQ\TY 
KETLIcaWKKKIDNPIEKWAKirLWRHFTR 
GCPSIYKHVK 


520 


14421 


A 


526 


288 


570 


ATGSLCCPGWS IVAIHGFNHSAL*LLTP 
GLKHPPASASQVAGTAGMSDCTQLCKIN 
FFVFVFL/RRQSLAVlQAEVQ 


521 


14422 


A 


527 


3 


43S 


AVSHDCTTGLQPG*QREILYQRKKKGTF 
SPLPPPSPGIGGKPALPFWGPKTWPPPG 
I PPFLFS PPPPSQS GGGPRPLETLAGS F 
SQNPWNLRDGAPPPPGGGFFG/CPPFGP 
PGGLG/PPGAPPPPLKNFPKKKGGGGGF 
LFPPQAKG 


522 j 14423 

1 

1 


A 


528 


2 


616 


fffdtewrsvaqsgvqwrdlgslqappp 
gftpfsclslqstwdyrrppprpanffl 
yf* *rrgftvlarmvsis*prhlparjas 
qsagitdvshralqvcfittll*lsk/h 
qfkkagvtl/phlqclheigldcvlhkh 
/wsvhhspp/hktnvc 



548 
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SEQID 
NO: of 
nucteotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
36 


Predicted 

nucleotide 
location 
cori-espond 
ing to first 

peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIalanine, 
G=Glycine, H=Hlstidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


523 


14424 


A 


529 


57 


485 


RHS S LGNKS E TQS QMKKKKREKKKGKE I 
S FDPAI SLS I * PKENKSLYQKD/TC I CM 
FI TALF 1 1 AKTQNQPKCPSTDE * INKRV 
\IYIYIYIYIYIMKHYSPIKKNEIM/SF 
AATWMELEAIIISETMQKQKAKYCLFSC 
I 


524 


14425 


A 


530 


399 


3 


FFFFFFFFWENPPPPPK/WRGGGKKKAP 
FFKFFKYKK*LFFF*GGGGGHQQKKKKK 
KSSSFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
F* FKGFLLGTTHASAHASAHAS 


525 


14426 


A 


531 


491 


4 


SKWIKDLNLRTK/TTKFVEET*VKIFM1I 
LGLGNDFLSMTPKA/LKIDKSDFMKIKN 
/FCSSKD/TIKEVKRQIITEWVKIFTSYI 
FDKGSTSRTCKEL/RKQQQNKPIQKCSK 
VLNRFFFREDIQLSNKLMKGCSTSLNH 


526 


14427 


» 


532 


184 


509 


pqwpahse'lpalgssgtgpy*wrqifd 
skdkessq/wshetsdrpkpadherrsr 
pslatsp\prlephpslpnhsglpilss 
lpwgavalapthfsalawp*rplpcnsq 
gekfffwfgff\eresiisatqagvlgiid 
lgs lkpppwgskgffcpsysgg/wnqkn 

HLSP 


327 


14428 


A 


533 


401 


3 


VIREMQIKPLHIQ/PGWL*SKSVKKY*Q 
GCGQS/MVLIHY/WLKT* IMGPFWKIVW 
QFLIKLPYGTVILLSGIYILWRMEKKPP 
KFCTQIFI2ULFFIIQ*PKSENNPNIHQP 
KNQ*DVHMYIHTMEYYLAIRMNIH 


528 


14429 


A 


534 


419 


93 


SLAIRQM*IKIPPRYTRVAK\IEKSHrrr 
KYWKRHGATGTI.IHCWWECKMA*S*KIW 
*FLIKSlTlNLPFNPVILL*GIYPRKMICr 
CLYKD\CM*MFMaVSFIEQNGKQPKCL 


529 


14430 


A 


535 


414 


2 


NFLARGYINCGPQFFFFPPRKNLKWGVP 
LSNFPPKL*KGGIFGEGPKKVKKNFFFF 
FLFFFFF* KKPP CSPRLKAK/WKIFGPL 
KPPPPGLKKFSFLTPKKNGDKRGGPPRR 
GNFFFFFFLVFLVKTEFHYVGQAGLK 


330 


14431 


A 


536 


32 


435 


DRATJUiQLS*KSETSSYKK2CKKKKKKKK 
DGGAPLKKNPGGMKKPGEKKKKFSPKR 
GGKKNPPGNFEKKTNFGGGKNGAKPPQK 
KKT/AWGKKKNLKSKRGKKNPKTLGaKK 
FPPRG*KKKKKPPAaRPGKASS 


531 


14432 


A 


537 


348 


1 


EYIKSTHQMGKNYPLKNTVSSGQVQWLS 
PVISAQFAIAKSWSQPKCPSUSTEWIKKL 
WCV/WCVCVCVCVCVCMMEYHSAIKRNE 
LMAFAVT*MRLETI I ISEVTQEWKTKHR 

MFSL 


532 


14433 


A 


538 


523 


66 


TDQTSHNIPLSQSLIQSEVPTLFNSVKA 
E*CEEA/AEHKFEASRGWPTRIiKERSRL 
RNNSV\GEAAGSPEDPATV/ IWGGGRTQ 
PQIFSVAEATLNWKKTPCRPPIGREKSM 
PGFRASKDSLTLCRDFKTHLCLVFHYWN 
AKHVQVYILLKVTAKI 


533 


14434 


A 


539 


413 


2 


RDKV* PCCRSWSQTPELK* STRLCLPKC 
*DYR/R*APAPGPFIiI,*VHHHV"SIFKSG 

plscrcsnfqd/hdsikpsfptiqyrhg 
nkfql*tpdtlifflkwsftfvaqtgvq 

WHNLSSLQLLPPGPKRSDAWADAWADAW 



549 
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SEQID 


SEQ ID 


M 


SEQID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidinc, I=Isoleucine, 








26 


correspond 


location 


K=Lysine, L=Leucine, M=Methionine, 










ing to first 


correspon 


N=Asparagine, P=Proline, 










amino acid 


ding to 


Q=Glutamine, R=Arginine, S=Serine, 










residue of 


last amino 


T=Threonine. V=Valine, 














W=Tryptophan, Y=Tyrosine, 










sequence 


residue of 


X=Unknown, *=Stop codon, /=possible 












peptide 


nucleotide deletion, V=possible 














nucleotide insertion 


534 


14435 


A 


540 


385 


2 


MftS KHI KT/ CAEQLAI RE VQ r KTTQ YHV" 














iptrmakvkkt/dnecwhgc*ntlincw 














wdckmmqllwkk/svwhssrgkmyvqlp 














CTPAIPLLGIYTELKTCSHKN/T/CT*T 














FIAALLVIAE KWK\ KCPSAEEW 


535 


14436 


A 


541 


44 


398 


RPPFFFFFKKRPLWKKKQGFPPPA*RGE 














I PSAKK\ PPLGPVSGPGNPPGP * KP/ PQ 














NPPPLGGSPFGKKPPLDIPFPGGPIKGK 














KWGGPPLAGGATTKN/PPPGGNFWKGGK 














APPFPSQKF 


536 


14437 


A 


542 


370 


10 


FFLRTKVSLFPRVEG\KVQSQLIiAAPPS 














WGQVIPPLQSPE*LRQKDPFSPGG*GCS 














EPCSCPCLPAWMTEPNSVS KS * KKKRKK 














EKN**KKKKSFINTTGKIRFRI IH 


537 


14438 


A 


543 


1 


370 


FLLRHILLCHPGFCSVATAYCSRDPGSS 














D/PLPPQAPLPDQ* PRLQA/WHRLAPPH 














SflWFF * * RQGFTVLARI/IVS I SQPCDPPH 














WASQGATATKADDYQK 


538 


14439 


A 


544 


50 


395 


I PGLTRQWLLDPCASPSTPPYT/P* VQP 














STPQNSSPSPKTHNQKGLPMPLSPTPKP 














STAWKKAILEHTSSSSSSSSSSSTCRNR 
NGYTYTVPEHEPARGHTASQTRKQVTAA 














THKP 


539 


14440 


A 


545 


1 


370 


LCSVTQAGVQEHKLNSVQP PPPRFKQVS 














CLSLLFFFFF*KRVFTLFPRLEGRGPI I 














PNGSPTLRG*GDPPV*ASKELRTKaGPH 














OGQLINLF/CLGATGPTYGAQGGFKSPG 














LRRWAPLGPPRA 


540 


14441 


A 


546 


178 


365 


YKKTDATKTKMDK*DSIKLKSFSTVKET 














INKGNR\QPTEWEKIFASHTSDMGIISQ 
ICKELKQ 


541 


14442 


A 


547 


11 


236 
















WHKYK\HIDQWNRIQNPEINPHIYGQMI 














FKKRAKNKQWKKDGLHKKR*WSNDFQET 
CQBQTMEKGRSSQEKMM 


542 


14443 


A 


548 


355 


3 


IKKKNIiGRKRFCFFQEEKTWP*TPKKKP 














LENPGGVFPNPGFPPFFKAKI PEGPFPG 














V/SFF*RGGFKGYPPPLFFFFFFL*DGV 














SLCHPGCSAVAQSRLTASSASQVHAILL 
RHVDEGR 


543 


14444 


A 


549 


1 


373 


CFI,ICRHTSHMWRIWFC3TTAINQIWQ*K 














ESQKWSQ\LYI* KLHLLYEVQ*HFV* K 














YTQ*LLLENADTK*AHAVGKMVLrDLLK 














iyU^SLIFNL*KNK*imiaOTPGMVaHACai 














PSTLGGQGGWLTS 


544 


14445 


A 


550 


446 


2 


NIDKAPTVLGKMVS IFPTSRSPPLGPPK 














CWGHRREPPRPAYFGI*ILFLTPP*NPL 














NWSPQHLSGKISGIFSDPSLSVFFSSFF 














FAPPPKKFMLCF*PFFFFFF*DRVSLCH 














PGWGAVERSWLTAAPISQ\IR*SFHLSL 
LSSWDHHHA 


545 


14446 


A 


551 


342 


26 


WAPPIFFFPPYKRSPQKPFPPPGENNSR 














GF*IFG/RGGFFKKEGI,SQFFFFFLKKV 














FFFSPGGEPQGYFPPP3CASFLKRIFFPP 














PPIKKGDPGRGSPPRGKNNPFF 


546 


14447 


A 


552 


3 


167 


QPHLQDCL*QQNPVAVG*AFSSHPRDSL 














NNPW\WPGAVAHACNPSTLGGRGGRII 


547 i 14448 


A 


553 


3 


413 


TL ITALS SH* FFT * VGLE INMLAF I PVL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
lastamino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iVI=IVIetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thi eonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














TKKINPRSTEAAIKYFLTQATASIILLI 
AILFNNILSGQ*TrTNTTNQYSSLIIIM 
ArAIKLGIAPFHF*VPEVTQGTPLTSGL 
LLLT*QKL\APISF1YQISPSLKKK 


548 


14449 


A 


554 


441 


1 


SSPPPQARGEIFF*KKPGEEISQHNKKN 
EVFPPPPP*KFFFSPKAFIF/RGGGGPK 
RPPPKKKFFPKKTPRVFYKPPQKKKKFF 
FPPPVKFGPPRIFFKSPPPLFFFFFFFF 
FFFFFFFFFFFFFFRMGCDRWHGKFWIL 
RDGF 


549 


14450 


A 


555 


3 


455 


NTNSSMYVCIYVFILRQSL/DSVTQAGV 
QWRDLSSLQPLPP\GSWDHRHLAQQ*YL 
FHVLLLEV\ FVYERGTIMVP IL* DCNVN 
CRVPGTRSLSVYCVMGIFYRC/HIESML 
CDRCF I FFDI GSHSVTQAGVHWCNLGL \ 
MKPLPPG*RNFSGLN 


350 


14451 


A 


556 




299 


RRIVIRQENRWNPGGTGCSEPRSHHCTLAW 
ATKHDSVSKKKKKIRSSLGKKT* FYHMI 
LKAMG/CITGIHKGPEGAWQREGHFTPP 
QGPKKAPQRGQTYPDP 


55] 


14452 


A 


557 


3 


391 


PVWWNSFEASSGWLIGFICKRSCICNIKV 
QGETASASVEAGVSYPEDLAKITDEDGY 
IKQQIFNVDKTT/Yy*KKISSRTFIVRE 
KSMSGFKASEDRLM* FLGAIiIR2^NF*LK 
PILICHSKIPMTFKSRAKS 


552 


14453 


A 


558 


337 


1 


TRFPPFLGGFSTKFFYRPLVRFPLL/RE 

RKIFPLPPVFVGGFPPALGPFWGFLFFN 

FSKICEPFGFPWGFPF/CPPKFLPPKPFL 
GFB'PSFHRGFFFFFETVSLCQPS'^SAW 
Q 


553 


14454 


A 


559 


117 


419 


IPELLLQVGI.FFFFF^RKKRGVFSPRWR 
GGGSNFGLLETPPLGIDPFSGLTPPKSW 
ELRAPPPPPIKF*KFFLKKNGF*WVSPG 
GLEISALLVFPA/SASQ 


554 


14455 


A 


560 


347 


1 


XYKELKQLCRKKSNYLIKKWAKNPNRHF 

seeiqmrnhqmkrcsXttpvi IREMLIK 

TTLRYPASPVKMAFI * DR* * *iyiF/WQRC 
GEKGT/H/CWWECKSVQPL*RAVWRFLK 
KLKILLP 


555 


14456 


A 


561 


2 


375 


rPPPLQNGEKIFMII+TGAEKaFYKIPF 
PFP IK/TLNKVGIKGNFPHI IRAL*EKP 
PAYIEHGEAESFPVRSGTRLCPLLLLLF 
HPVLEVLVRAICPLK/E/IKGTQIGKEE 


556 


14457 


A 


562 


391 


2 


ASGSKGKKKYPPFLRGSPPLPGNPLFFL " 
EGGEGKFP*P/RNWGPPPKIFPQKGQGN 
PFFFFFLKGPKGGGFFPPQGEKGVGFPL 

PGEKSQRPF*GEKGGVPPP*RKKPPPFF 
FFFFFSETVLLLPKLECS 


557 


14458 


A 


563 


384 




ISDSGVHPLGLPKRILLCQLSYHVWP*P 
KCFC/PHLCLFMTLFQPP*GLPSVCAPS 
KQEEHGCF P I SVARQE C * PL FPKNLDT * 
HFVGNFLKFFFFFF*DGVSLGHPGWSAI 
LTHCNLCLPGSSNSLPQ 


558 


14459 


A 


564 


522 


120 


SLFLPTLECNGTISAHYNLHLLGSSDCP 
ASASLVAGITGMCNLCLPSS IDPSTSAS 
*VAIDLRK* *GARGNCPVS/VLEKGWPE 
AEMGRGESASWWDPRLGRPGVAKRICRR 
HCCPVQQPQVSMEPSGPASPN 



551 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 

correspond 
ing to first 
amino acid 

sequence 


Predict- 

location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, i=-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl<nown, *=Stop codon, /=possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 


539 


14460 


A 


565 


73 


264 


KDRHIDQWWRIESPQIKSMTIHLWSFDF 
FLQGY**KDRHIDQWNRIESPQI\NL*L 
YIYGHLIFFYKGTE/IQWRKKTLSSKWF 
WDNWIFKCKQ 


560 


14461 


A 


566 


413 


1 


EKKKKVPSQVLTPKPQVLGNFPRFSEEQ 
*LSLVPVSSLPLQSLSGKNQGP*AKRVA 
LCFGKSPGSQQIWF/RLP/EDIVTTVQA 
SYS KKRLFLS LLDFQYVLQRREGAVWS A 
NLSIAPW 


561 


14462 


A 


567 


397 


-j 


FLGQDLTVAWARVQWCSHGSLKPRLPGL 
KGSSRSATG/SASPYLVPMFLSNLFWHV 
Y IL * LYKTMTGI * MEM* FTPS CPMYSW 
RDV* IFFSFETEACPVAQAGLQWRSLGL 
PQPPPPGPKRFSCLSLLTSWD 


562 


14463 


A 


568 


437 


1 


KFSEDPAKLIGEDGYSKQQVLNAN/ETA 
LYWKNMLSSTFLARKETSMPSKLQGTG* 
LLLGAMAAGDFQLKSMLTCHFKNPRALK 
NYATS PLPVLHKWNNKAWMTVHL,MTAi.L 
TEYFKPTIKTYTYHNTTGSLTTPHASAH 
ASAHAS 


563 


14464 


A 


569 


234 


1 


FFPPFPFSASSPPQGTSSSRGVF/PPFF 
PPPKKGFFPKIPPGSSSPPPF*EKTYFR 

FPPFFLAPPGVFFR 


564 


14465 


A 


570 


2 


396 


FS*AFYHFGLALTPQLGGDWSPAGITQV 
KST,QVPWWT\SGLM.SGEEIT*ADHKV 
I QNNRNRR I QALL ITVLLGLYFTLLQAS 
ENFKIPFTISDGIYGSTFFECAGLHGLH 
VIIGSALLTICFIPQLTFDC 


565 


14466 


A 


571 




403 


HASGLPSSWDYRRPPPRPANFFVFLVEM 
GFHHLNKai IKSFACNEIQPLSAVSVAG 
LVGCV*VCKCLFPVL*Q*LFQF*/S/HS 
IAN1SMHEWPLRLSLF*LIC*GERMSGFA 
TQSRRDPCSLPGPLYQVLSLAKFR 


566 


14467 


A 


572 


100 


371 


YKSNDPYVYGIIiHFLTTINFFFFFPKKK 
KFPPVFQVGGQGGNLS*U!TPLFLGLKGF 
SCLKLPRSWNNRGAPPPPPNFGFFSKNG 
VSPCNPG 


567 


14468 


A 


573 


371 


1 


REQARES/TWSSSHTPVQKGLRYGIILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
HWPPTGITPLNPLEVPLLNTSVLLASGV 
3IT*flHHSLIEN 


568 


14469 


A 


574 


353 


103 


SLQLQPPGLN*SSHLSLLSSWIYRHKFP 
CPANVRFFVFFCRDGVLLVAQAGLRA/S 
ASQSAGITGVSHLAQJjVNPLLFYFSRAG 

D 


569 


14470 


A 


575 




206 


FCIKKIIGFIPGK*GWFNISQPV*SLTN 
RIKEKIHISMTAEPIISIITEKVFDKIQ 
HPLILQKPFGQK*EYKFIiNLIKGICEKN 
/PIPNI lYNGEILKQGCLLSPFLFNIVL 
EDlVIQSVKrKEGINIETQELR*SLTNR 
I KEKIHISMTAEP 1 1 S I ITEKVFDKIQH 
PLILQKPFGQK 


570 


14471 


A 


576 


157 


255 


YISPTPPFPPDLPF*1IYLPNPKIjTSPAP 

NFQIY 


571 


14472 


A 


577 


365 


3 


PPHPSVFFF*KSPPPPPPLFFFPPPPFL 
FRGGPLYYS PP/S PPLFFSRRGKDTNPP 
PYSGLSVAGQTPPPPPRRTPPLVSPPPP 
PFFFF*RGGPKWKKAKKQTITFKSPPGT 



552 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

peptide 
sequence 


Predict- 

location 
correspon 

last^amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=lJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 








"57s 




323 


NKPKHKNDRDI 


572- 


14473 


VGSWEPDEKVSHRELLLSFLFYPFFSHI 
SS*ETLIDFALTSTDIWALWHDAENQTV 
VKYINFE/HVWVT*FCHPTSPQAISC*E 
DERPGAVSHflCNPNTLRGRGRQIT 


573 


14474 


A 


579 


369 




AHLKGTLGGFPDFPFINLEFRGGLTSCP 
NPLNNWE *R/RPPPPPGKIFGAFFFFFF 
FKVETGFHRVDLLVLIS* PCDLPASASQ 
SAGXTGVSHRARPTPG*FKKKIMTF/CL 
'KWWSHYVAQIG 


574 


14475 


A 


580 


394 


125 


AAEGEQGREGWRNRPRGRERASDRERER 
W/ERGEERERGEEEAPQ*SERWR/E/R 
SRE RE RVHE LWSD S DRALEREALLR 


575 


14476 


A 


581 


416 


1 


KGQDLYGEV*KVLLKVIKQVLNKWKDMS 
CS\KINILISFPIRIALKFFF/ELDEI/ 
CSFV* *NKC*R1AKEVLKKKQ* CLGGHL 
/TSPHIRRSYNTTFSQSIWCCYRTKQID 
*WTRTQNPEKDLRI*GDLIYDLRCCSSS 
RA 


576 


14477 


A 


532 


3 


403 


NCFSQFNVIIMEIPAKFFIDINKLILKC 
FCKGRSILKKKR*SDS/QRRNFLVIKTV 
WY WQRE CH I DQWDR I NHPE INLHKYS KL 
I LDKGAKAI R* TKE S / 1 /YS K* C*NWKF 
CM^EK/IDPRTiNPLYK 


577 


14478 


A 


583 


402 


121 


QST,IHSKAT,TT,FNSMKARKAAEGKVEAS 
RGWFMRF*ERNHLHNIKVQVEATSAGVE 
AAASSPQD*GKTTDE/GGYTTQQI5WD 
ETAFYWKTMPS 


578 


14479 


A 


584 


255 


517 


iyifgfifrgrvll/cthps*svvlqs* 
ltaalnfwaqvs/lpsswdyrctppcla 
fffffcrkggstlcprlvgnsnplkfpp 

LGTPCF 


579 


14480 


A 


585 


1 


414 


SSHSCCSKA*SSMGPSPAFYKLVPLFLC 
LFIWDGVWLCHPGWSAWQSLLTAASTS 
Q\VHPSS\CLSLRNSW/EYRYVP 


580 


14481 


A 


586 


392 


43 


cwdyrceppcla\tetgssyisqaglel 
lassnppmvchpkcwdyshetmhsaqnf 
flkgis*f*lccshfihliihllwlgkvih 
tythshtglekyktqcldvkciysdlld 
gaik 


581 


14482 


A 


587 


1 


394 


GTR/YailSrLFITSEVFFFAGIF*AFYHS 
SLAPTPQLGGHWPPTGITPIEPLEGPLL 
NTSPLLaSGVLIT*AHPSLIENNRNQGI 
QALLITIVLGLYFTLLQASEDFEDPFTI 
CDGIYS*TFFVVTGLHGLHV 


582 


14483 


A 


588 


190 


3 


DGINLKIPGGIFFFLKARSHSVTEAGMR 

W/P/1<IYGSLQPRPPGLK*SSHLSLLSSW 
DHRHaPPSC 


583 


14484 


A 


389 


157 


387 


TKKRGRVGGWWIKILGINLTKEIKDLY 
SENYKTSLKEIKER/DTNKWKNIPCF*T 
ERVNTVKMS ILPKAIYRFNVIP 


584 

~585 


14485 


A 


590 


891 


0 


PPPPFFFFLPALIFFFPPPPQ/PKTPKK 
KKNPPPPPPP\PPPFSHIP*QFLSPPPQ 
HHHFIHHPPPQPPFHPFYYWPFKFFFF* 
NINVPPPP*K*IYFF*TKFFFFIPNIFF 
F\TKKKIFLSPPPKKFPPLLTWLSLPIL 
IKNLLSKPPPPQI 


14486 


A 


591 


1 


496 


GTRAYQIVKPTP*PLTGALSALLMTSGL 
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E=Glutamic Acid, F=Phenylalanine, 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Senne, 
T=Threonine, V=Valine, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PMRVHFHSITLLILRLLTNTLTIYQRWR 
DVSRESTYQGHHTPPGQKGLPYGIMLFI 
TSKGFFFARFL*AFYH\SSLTPTPQLGG 
HWPPTGMTPLNTLKDPLMTCVLLASG/ 
VSIT*AHHSLIENNR1IR1IEALLMTIV 


586 


14487 


A 


592 


175 


2 


PFFFFFLLRHSLTLAQAGVQWCDLCSLQ 
PPSPGFKRFSRLSLP/H*PGMVANFCIF 
SK 


587 


14488 


A 


593 


107 


733 


aaaaaskvlm*p.egqlpgat\gtggvqa 
*apgsva/aegasvegpgfgdtapahqg 
lsptrshgqggagras/ssqqgspggrg 

DGASEVWSGAL/SPGGGKDGASASVPRG 
PYAEAEKGGWALRGLGGVAAPGPPSRAG 
QAP S GS / YTG PNARPAPWP I PGQGGGLR 
RDQAG*VSSWTGSTEPGAHTAHRAPGHG 
GKGGSPQQPHPQGPGQIPT 


588 


14489 


A 


594 


10 


435 


FKWLLKSHAICFWTRS*SYCDNVCVPSL 
WAHHLGIRTEIPEFFLSKFLCTSIIPHF 
TYRRQLRLIQGSTE*EA*EDKLEQK*AL 
GAAQFTLPGMDVFVCFVPCF/CLFEMES 
HSVT*ARVQWCDLGSLQPLPLGFKQFSC 
LGL 


589 


14490 


A 


595 


437 


3 


DEPKKWKTIPCSWIERTNI/VLKMAILP 
KRIYRINA/VPI KLPTSFFTEVGTFSQN 
* KTT I LKFRWNQ \KRA* I AKATQSRKNK 
ASSIT/PDVASNYKTTVTKTARHWYNNR 
HVDQW1ITIENTEIKLHTYSQ*ILSKAGT 
SKQWGKEHV 


590 


14491 


A 


596 


2 


498 


FFPFLGKTKPPTHLFFFPPPPFPIFFPK 
IFFGPLFKKKNPPK/PFFKRPPSFPKNT 
FFFSPPPPFNPLFSKAPPPIFFFFFKKl 
FFFPFPPLFKKSPPKTPL*ILGFFPSFP 
FFPPGFPFPSLP+GGVSLCHPSWNPWL 
SRPSSKLASAFRMPPVEG/SPFPPSP 


591 


14492 


A 


597 




311 


RRVSSESRWRSLESRGRSLEIQGRMERP 
RCQRFRENSCVAPRHCKGPGGKGLQALF 
♦VPVAQLGEPGAQLGDPGAHGEATVPEV 
QGE/PAaLLPGTAK\PGGEGSPGFIPPR 
HCRGTGEEGSPGPTQGRETSKAQQRTGC 
LLSCRGCLCLSVSGDS 


592 


14493 


A 


598 


302 


2 


FLRFFHHKKFTKMJSTRHFSKDILILLSS 
NHMKGCSMSLVIRD/MLLKIT/MKyQFI 
PTRMA/IIKKTDRYWQKISI/CEKI*TFIH 
CW*\NEKMVQPSWKAVWQFLEK 


593 


14494 


A 


599 


3 


386 


HTWPPPPRSPTAPPARTQPPSLQSALPA 
PQPGKKKKaijENEKG*HGSKKG/RGQPR 
PPPLRGPNGRTRSPAGICKGGGGFCPGV 
SKAQGAPRPGGRETWPGFFGRRPPr.PR 
GPGPPGVLWTSRHCPLR 


594 


14495 


A 


600 


374 


53 


EGFFFFFPRWVGGQRALFWSPHFPPPGV 
KIFFPPPPPGWGGLRGLPPPPFNFFFFL 
KKKGFSPFCPVFSQLPPAGVPPPPPPP / 
TGLDLRG*PPGPPPFFFFFFFFP 


595 


14496 


A 


601 


393 


1 


VSPPKGCVSENKIPPPHNYYFAKTFVHV 
KLFGVAPLFLQIL?*GPFLTSIRGLIL\ 
WIPYSPGY\TLTLLIAPRHFLRTIIPVF 
RSVLPKTYLGLSGMPRRYSDYPDAYTT* 
NIGAFRIARESWGEKLDL 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nncle- 

Sion 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleudne, 
K=Lysine, L=Leucine, M=Methlonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threon)ne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


596 


14497 


A 


602 


1 


155 


FCFVT*SGVQWRKrHDSLQPLSPGFK*FS 
/CSSWDCRHAPPHIiASFCIFRSTRP 


597 


14498 


A 


603 


170 


455 


KPNASLKRVLILCIQLQSSRQQKNNPT/ 
VKKWAKDIjNILFSTEDL* TVWRYIKKYL 
TSLVIKKYHFIPTRLSKMKKTEHSKCWE 
SCGEIETLTCSW 


598 


14499 


A 


604 


3 


449 


TLLSNLEEAKKKKEDALNETRES*TKLK 
ELPGVC2JETMMALWEECKPCLKQTCMKI 
YARVCISGSGLVGRQLEEFLNQSSPFYF 
WMNGDRIDSLLENDRQQTHMLDVMQDHF 
SRAFSI IDELF\QDRFFTREPQDTYHYL 
PFSLPHRRP 


599 


14500 


A 


605 


169 


435 


KyENTKINRR/VAPN*rSLSPEKKKKKK 
KKKKKKKKKKKKKKKSASSSKASPSSSR 


600 


14501 


A 


606 


453 


29 


TSPPPPPGENFF* KKPP* K/ HFFPPPQF 
RFFFPPFPLKIFFFPPSLFFFWGVFPHF 
PPPPKKVFFPKSPRGFFFPPPLKKKFFF 
SPPLFFFPPPGFFLSPPPFFFFFFFFFF 
FFFSPFFFFFFFFTVKNFYLLCYFYKN 


601 


14502 


A 


607 


170 


408 


NEYDHFSIIKERRDFIVCVFFFIFSR*S 
FTWTQAGLSS LQPPLPGYKRFSCLS FL 
CSWDY/RWHPQGSANFFVFLVEMG 


602 


14503 


A 


608 


2 


448 


SLHPVIYSEGIKSRQSPCWR* * KTVKKK 
KKKNPNQKPKKKKN/RPKNPLGGGKKFF 
*PEKKPGPGKKILKEKGKKICPFPPQKK 
NFKILKKKKKGAPLKBCHP*GAQNFPGMK 
KINFFP*REVKKNPIGIFKKKPLFWGGP 
IGANPPPENL 


603 


14504 


— 


609 


44] 


"28 


GGTFLKGIKSAPKI *KGFPGWGGARFPA 

RDPPQFGG/PKGGGSLSPRGLNPPGPPG 

tl?ffj:kpkt,tggggppplipalrgvr 
p*nslypqrgggaqippgppppggkrap 

CFQKKKKPRRKEKKK 


604 


14505 


A 


610 


445 


1 


LWLKKNTGGGGGSPPLFPPLGEPKRD/G 
FPRGRGFGPPPPPIKNPPFFLKNQNNPG 
GGAQPGIPGPWGG*GGEFPLPP\ SPGFH 
* PRFGPFPPPRGTKREPPFQKKKKIANC 
FLLSDKSLLLEEAWGQWGPSPLEPTVA 
PKPNPRGKAQ 


605 ■ 


14506 


A 


611 


253 


437 


KKKKVQDMFSENFKML/NEIKATLNKWK 
DISYS*VRRriCWKMAILPT/lQSYHLN 
IIPIKI 


606 


14507 


A 


612 


186 




KKKKVQDMFSENFKML/NEIKENMIKWK 
DISYS *VSRLCVVKMaiL\PNKQSY2LN 
IIPIKIQA 


607 


14508 


A 


613 


37 


453 


KTPPPGENFFLKKPQKKNFFPPGNWGVF 
FPPSPLKFFFFP*TFrFFGGVWPNFPPP 
KKKFFFKNSPGVFF/SPPP*KKKIFFSP 
PRYFWPP/HGFFFKGPPP 


608 


14509 


A 


614 


1234 


0 


KKPPNTPN* *WAKDLNRHFIKDSSIKDD 
QYY*SLGQCKLISQYNFTCTEIAIIKKT 
WNNKSW*GCGETGNLIH/WIECKMVQ 


609 


14510 


A 


615 


327 


188 


FFSSSLLIISIPKFHHVTSLLENLK*AM 
LGSSHL*LQLLRRLRREDHLSPGVSGCS 
KL* SH\THCTPVMVTE*DGRKGLAGDGS 
HSVTQTGVQ 


610 i 14511 


A 


616 


103 


446 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
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C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanirte, 
G=Glycine, H=Histidine, I=Isoleucine, 
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N=Aspar2glne, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \==possible 
nucleotide insertion 














DGGGALKKNPGGAQKNRGEKKKFFFF *G 
G\KKKTPGGFWKKNPFLGGGMFAPPPPK 
KKKPLEKKKNF*GGRGEKPSPLPCGKEK 
FSHKKK 


611 


14512 


A 


617 


3 


429 


GLLSIIYKELLQINKISNLVGKWTKDK/ 
NKQFLKKEIHRLAK\yMKRySTSLVTIE 
M*LKTRYYFHPLNKIKHDNNIHC**GYK 
EVGILHILLEA* IGKPFWKAITVTVLNA 
PVFCL*FCFEMEFHSCCPSLKCRAPTSF 
IGAS 


612 


14513 


A 


61S 


427 


6 


WGGPPPPPP IFFFFPPPP \PYFPFFFLQ 
ALFS*RVFFFLTPPPPKNFPPP*GPPPP 
PPPPPPFCV\ FFFPPPPPPPPPPFFFFP 
PPPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFLVENCVFINSITSDIEQKISI 
SC 


613 


14314 


A 


619 


68 


434 


YSKD* PINTKKKKKKKKKKKKRGGAFKK 
KPGGGKNKGGKKKIQilFF^KGGGKKNPRG 
NFGKKTLFGGGKKGEKPPQKKKSLEGKK 
KI*RGKGGKKT*NP/CGGKKFSFGGFF* 
KKFPPGGGGKY 


614 


14515 


A 


620 


454 


90 


NFPTPEKFGPPQGNL*KAPPFFFF*KGI 
PFFFPG * KQRGGFKS PQKPP P QGKT I FR 
PHPPKKGGPQGPPPPGGKIFFFLEF\KK 
KGGFPG*PRRFFFPGPGKPPPRPPKKAG 
IQGETPRPGP 


515 


14516 


A 


621 


432 


2 


PEAFLSSLLHPAPGEECFFLKKTPEEKFL 
TAKKYRVFLPPFPLKIFFFP/LRALIFL 
GRFAQIFPPQKKGFFPKIPRGVFFCPP* 
KKKNFFFLSG*1FAPPGIFFKGAPPFFF 
FFFFFLDRVMLCYPGWSAVARSRTS 


616 


14517 


A 


622 


422 


2 


GGGPPLPPPRGGLPPPK* * KA/ VGGGKK 
PPPPAPL*NPPPPKKKIGGGGGGKNPPP 
PPLACFPFGFFLPPPPPSFWGGEKFFFF 
FLSGGPPPPQKKKKKKKKKKKKKKKKKK 
KNKKKNQTKKKKKKKRAAARDPRVRPRV 


617 


14518 


A 


623 


176 


401 


kfsmfi ilwkayytlcvcvcvcvcvcvc 
vcvimylfissq/eklcflgegkicsis 
ltvuirgsrvcv*sqdlpvc 


618 


14519 


A 


624 


1 


355 


HSSGLDNLTAIiflHSHLCOSV/ CLCFLVL 
VCVCVCVCVCVCLCVCVOi/CLYPGPQV 
PK*SKNSSTSCLLPHFTSARTICP/CC/ 
CWFCFCFFA/CYQTCL*ILTSCL/AIC 
TASGVCL*AISKySM 


619 


14520 


A 


625 


2 


400 


fsfsflsi.fffflerdpgshgp\vgggg 
pnlg*wkfwplg*kqfsfltlmrrgnt\ 
rgppppppglvfwp*kktgfcfvaqagl 
eqrpfgdqpvwasqgagitgvp 


620 


14521 


A 


626 


307 


3 


VFFRIiPLTQGPGLFACFCPPPPCRFFFS 

ffpgsnspffp/l*klnfwafepppfff 
ffpffs ilfkdrvllchpgwwaw* s * * 
p*tpgln*sshltlpsgrv 


621 


14522 


A 


627 


1 


391 


nppplggakeggppeargsnppypywet 
pfff*npkifpgggghpvipsfpgg*gr 
kfplpreggvpl/rpnfpp 


622 


14523 


A 


628 


191 


375 


L I L I FKT I FNFLI LNFNF* SGGATQAGG 
QGRllLG*IiQPPPPWLKRFSCLNLLSSW\ 
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E=Glutamic Acid, F=Phenylalanine, 
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X=Unknown, *=StDp codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DYRGWP 


623 


14524 


A 


629 


3 


3S2 


HHCQIRLFILLITASEVLFCFVLFFFET 
GPCSVTEAGVQ*CN/HLQPLSPG/PPTS 
AA* VAWTADVHHHAWLFVCLLI * DGSWS 
VTQAGVQ*RDPSSVQPPTPGLK*ASC\P 
AFQKCLDYR 


624 


14525 


A 


630 


396 


2 


CRIENVLKKKKLEMSNLSEEGMLKANIG 
QKLSLIAPQQWNAKESLLKEIRSALPL 
NTQMIRMQNSLSADMEDIL/WWIR/DP 

tnfniplsqsiiqsnr/l/tlfmfmkp* 
rgeeaa* ekfdai sg * fttfkers 


•625 


14526 


A 


631 


47 


392 


LHSFFFLFLGKKIFTPGGGGGGQKPFF* 
NPPPPG*RNFFFLPP*KTWFMGPAPPPR 
KIFGFLKKKGFPLGGNL/SIGTPPPRGT 
PPPLPPRGGCYGGGPPPPPFIFFFRTFK 
NFSN 


626 


14527 


A 


632 


39 


382 


LFFSPFFFKNFFFFFGRFFFWGGVAPIF 
PPPKKIFFSQFP/LQVFFFSPP*KKKFF 
FFPP*IFAPPKTFFSIPPPPFFFFFFFF 
FFFFFF 


627 


14528 


A 


633 


2 


213 


LDKPGKHSKISSLQKI/HNGVCLQS*LL 
GWLRQEDCLSEGS *GCSEL * FYHCTPAW 
ATK*DPVSKKKKGFLK 


628 


14529 


A 


634 


2 


206 


QENGMNPGGRACS*PRLRYCTSSWATER 
DSVSKKNE/TNKTTLLREI *HFVGGPNG 
KKGLLKTVKGGLT 


629 


14530 


A 


635 


205 


2 


KRSLGLLAQI*/VQWGDFKTLQPLPPGV 
KQISRLNLLKKWDY*RGPSGLGKFW/IF 
L*KQGFPQFFRWFN 


630 


14531 


A 


636 




399 


QVQQTPASCPLDSDLSEDEDLQLRMaSM- ' ' 
TFPKKKKKKKKKKKKKEGGAP*KKTHGG 
PKK/IRGKKK 


631 


14532 


A 


637 


122 


373 


VSNILWTQSLLFFFFFLKGSSLFVPQPG 
GQGLDLG*TKFPPRGLKEFSCLTLRrSG 
NYGLAPP/HPG*FCFFIKKGVFPCCPGW 

F 


632 


14533 


A 


638 


390 


3 


LLVL&LtQDewSPFXPPHEQKLfEVSPK 
ANAAMLPVQPflKP*AHPTFFFy*FPSFR 
FFFAHMG*SS\YFVPNLTKLSEILKGPr 
NDMREVFFPIFRLLILFFFGDRVLLCGP 
GWSSWQ*L*LIAASA 


633 


14534 


A 


639 


392 


42 


PSGPGKPGGKTFLRZPPFQEACKRQGFY 
PLFPPKPLKNPKAPQNRKNLGPICPPPK 
P/QGPLRISPQNSQS6SL*6PNKGNCFP 
A*TWGPPGAP*RGPPKGPPFFFFEKVME 
AMGLA 


634 


14535 


A 


640 


390 




LSFSEEGMSTAKAGQKLGPLCQ/TSQVI 
NAKETFLKEIKSVrPVNIQMIRK*NSFI 
ADKEKILW*I*DQTSPN/rPI,/SQCLr 
QSNVLTLFSSKKAERGEEAAEEKFIMFS 
*RLRKEAASADREAAESYPEDL 


"633 


14536 


A 


641 


368 


3 


KKGGYGVPPPFSPPPGFLGEGSFLPPKI 
KVQKTL/SWPPPPPPGGKKKTPFFKKKK 
RKWFI.TGFFFFP * KKGPFFFFF* CP W* 
KFWGGGFKPIFFFFFF/RDRLSLCCPGW 
SAW*SQFTPRV 


636 14537 




642 


134 


413 


QKDQHNRIESLE/IR/PHTHNTLIFDKG 
VTGFQYGKDS LFNKWCWDN* RE/ 1 TQNL 
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Amino acid sequence (A=Alanine 
C=Cysteine, IMAspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucinc, 
K=Lysine, L=Leucine, lVt=!VIethionine, 
N=Asparagine, P=Proline, 
Q=GlBtaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














SLDPAHTTTNVNLKWTRCLKARP/VKTL 
VENMGEETLRD 


637 


14538 


A 


643 


2 


436 


GRVESINLPLGCRLFKKRDNIKCGQ\G* 
RAGGSL1HCW*EC/EWQPLWKRIQQFL 
IKLNIHLPRDSTILLLDIDIiDGHPKTSA 
LFIITH1'ICCFIH1SIHP*LEQPKCPSTDVW 
MHKPWPIHSMEHYSAIKSAAVWDNFKGI 
LL 


638 


14539 


A 


644 


446 


75 


LDLLTS*SACLGLPKCWDYRSEPPRPAF 
NF*/STTK*SLWSSHKQDYICRFLLSYT 
ATQKFST11jTTY*SHI,KTF*NAQMPRLYL 
QSLILGS IF-'-KSNLPR^FQCAAKAENH* 
*3DLYCQIMLFSFM 


639 


14540 


A 


645 


318 


1 


KGVPHKCKNSYVGPPSGPPIK*ILFF*I 
F*DRISLCSPGWNARVFS*LPV\T*TTG 
VKKWWPLSLLNI*GyKGVPPRPGIFFFF 
FFFRDR/SLTMLPRLVTWVQVI FQ 


640 


14541 


A 


646 


3 


348 


QFSSFFVLIYFTLPFMVSFCF/WCFSNL 
VLFCEIIFFLIFYSYYSLVLSFFYCLLV 
LFLSKLSFF/C*Yril(riTVFICSMTSFF* 
RIL/CCLNFIKTS FNFIjL* * FGMVLE S C 
FFFFFFWP 


641 


14542 


A 


647 


2 


553 


AILI PDKIDLKK\ VTGDKE *IJVTMI KGS 
IHQEDVTT INIYASNSR/APKYMKQKLK 
GKEKQISP*K/WGIFNTPFSI/DRAPR 
QKINQEIEDLNNIIRQIDLKDIYRTFHP 
TSEYI FPSSAYET/ FSKIGHKLKHKTS L 
NKYK*TE rMQSMPSD/HHGMKLENNE /N 
VGKLTNMWKSDTLVSDWLGAVAHA 


642 


14543 


A 


648 


388 


1 


GPPR*SPPFGKPRGAVPQ/GGGGLKPPG 

pqgenpfffkkpkitlgpgwg p/ rnp pp 
lgg*kgkip*/ppggc-gsn:<pnfspsgp 
pgkqkqnffsqkkkkkkekkk3kpes* c 
pgtspqpvtdgsqainasiss 


643 


14544 


A 


649 


387 


25 


pgmrggfppfplknfffppgp * / msggg 
ggpngpppkkgffpknppgvfpsppkkk 
kiffspppenlgpprdflkgpplffffl 
qknpgvkkragqgnppqkiakifplppg 
ggggavrhi 


644 


14545 


A 


650 


918 


2 


lglkgltiykilhstiadhtffssshgt 
famtdhilghkihlstfske*ei ipsts 
fq/hhsrnln*k*hinicvnwkipk\fwr 
lnktllnkt*tke/glkrhknils*tkn 
ityqilwdagkavlrgki>'ivltt*irke 
erskXttsfnirkrnkkliinirteine 
ienrksekiln\tkswffekkksiksik 
tlarlnkrkrketqipkhqk*rrgittg . 

NmNKKlKRWIHEQPYTHKLQNLDNIGQ 
FLEKRYLPKLKQGEI*SGWAyiN/SKEM 
ESIINTLPKRKAQDLKMFSQSE*YQTFK 

EKK 


■645 


14546 


A 


651 


282 


265 


GVFFLKKRPTVF*KKKI*LGPHPQLKGP 
PGVFQ/HFPI*NFGISGRGDLFFFFFFF 
FFFFFFLFLRQGQVSVaQTGAQWHNQGS 
LQPPGLK*FS 


646 


14547 


A 


652 


6 


223 


LYAHKFOTII.DEMDQFLQRHNTPKLIQEE 
/HRLTVIK* IKSIVINLPKQKa*GPDGF 
SGEMYQLLKEEI lYNVF 



558 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQED 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQU) 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 

correspon 

last^amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteinc, l>=Aspartic Acid, 
E=GIutaniic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=L!nl;nown, *=Stop codon, /=possible 
nucleotide deletion, V=:pos5ible 
nucleotide insertion 


647 


14548 


A 


653 


128 


337 


FXFFFXXFFFXFFXXXFFFXXFFXFXFF 
FXXXFFFFFFFXXFFFFFFFFFXFFXLI 
FFXFFLFFFFFLFF 


648 


14549 


A 


654 


US. 


2 


MAAASTTLLALVI I STTF* LPQVNGYI E 
KSTPEECGFDPXSPARVPFSIKFLLVGS 
PIQTHALDIALFLSHTHALPHTHT 


649 


14550 


A 


655 


325 


3 


CSEHKKNVF*LRAKKTNNPNLKWAKDLN 
RHFSRENIQVAEKHI*ICBASLGLREMQ 
IQTTKC/WHPVWKTV/WTILQN/LHTRL 
PYD/PAILLLGIYQRELKTCSHKNMYTS 


650 


14551 


A 


656 


3 


1655 


FFLGMESPSVAQAGVQRCDLCSLQALPP 
EFK*FSWLTLLSWDYRRLPPCIjANFVPL 
VETGF\TMLSRMVLIS *PCDPPTSASQS 
GGITGVSHRVQPALLFFLSCLCVKMIVH 
APLVKTCRP S TVGS CL/SVPAFWE 


651 


14552 


A 


657 


519 


84 


SRVRCQSA\QMGGASHLGYSGVRDPLEE 
AVFPFSDLKLHAGITTTLFKAVRQGHLS 
GFAKDF*RPAEQ/GT*YTSKriA/STKQN 
ILHEE* ITFVLLLPKHEGIWMPSVPPKP 
FLFFS*GMTNSGSAGILDFVENHPSKDG 
*HYVLITSH 


652 


14553 


A 


658 


38 


383 


EKCIFFFFFLERGFTFGGPPGGRAQFWF 
NKTPPP/HKKREFPALPP PGGGNWGLG P 
PARENLVF*KKRGFPLGGKRGLNPRPQG 
NPPP*PPKGG/GKNGSGPPPRQKK 


653 


14554 


A 


659 


98 


394 


KPPGKKRETPSQ/NKKKKKKKKKKKPFF 
PPRC-GKKNPFKKKRGG*OTP?PRGGGKG 
NFPPVI-:G*RGKr:LPPPRKPL?PPPQGGG 
KS*KK/RjA*KWSPQNi:'F 


654 


14555 


A 


660 


214 


567 


GFSVIKIGLKMi^MMIiriLRDSLTVTQI 
GVQWHNI,SSr,QPPP?GFK*FSCLHLLSS 
RDY*HAPPCPWFCILS/ERQFHHVGQA 
GLKLSASSDPPTLAPQRAGITVPSHQ 


655 


14556 


A 


661 


463 


392 


S*HFHPSSSSSSSSSSV1NYY*/FGLLL 
C/QVAHPSNS IDQQMIQSQRPMMLQSAL 
AFEQIIKWrGLLILL 


656 


14557 


A 


662 


205 


242 


GTGWQGGGLDG*QMTLGWPGSGVGQSA& 
piGl/GPGLPGIiPAPPAPSP 


657 


14558 


A 


663 


63 


351 


QSETPSQFKKKTQIEIiLYEPAVSLLGIY 
QKERKSVLKKHYTPMFTftALFTlXvWDQ 

pr/cpkymkr*nvmy/hvyththtmeyy 
satekneilf\fattw 


658 


14559 


A 


664 


1 


358 


FFLRQGL/DSAAQAGVQWCNHSSQRAPP 
LGLGRSSH\PASLAGWSAVA* SRLAATS 
TFEVQVILLSQPPE*LGLQSCSITQVGV 
QQWRDLGSLQPPLPGFKRLSCLSLPSN* 
DYKRIiAPPP 


659 


14560 


A 


665 


10 


257 


GMNEEGWITKEIKKIIWEYYE*IiGTSQL 

dtl/elnkfperykl/prtnsqsienmn 
rtitsdyisnfkrlpkkspgldgftgef 
I 


660 


14561 


A 


666 


360 


2 


enkkifnsapppgspflfpplweirpeg 
flkprvlnppwppn*appfkppmgppff 
plfrgvrvgdpls p pglrlp * pglp p \g 
ppg*pwgqnptffskkkkkswldssgfl 
epnisvnf 


661 


14562 


A 


667 


328 


1 


nlfaenyecpenkkknfqkkkihqiwqn 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C^Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leiicine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=passible 
nucleotide insertion 














SE FMKVTE YKLNYQN/ SQLYLYMPEKNT 
*KV/ILEECHPL*PQITKYFGEILKKDVL 
NLYTDKYKI S LREVKGEQNKWNNVSS 


662 


14563 


A 


668 


405 


566 


IHCQWECKTEQILWKTVWWFFTKLNILL 
PGDSAIMLLGIYPKELKMYIHTETCT 


663 


14564 


A 


669 


218 




LLMLESLMFVPPFPSFEKWAKL*IY/CH 
GAHANFLPSFFPSPPFFSLSFFFFYNRV 
FL C C PGWS AWRS QLT 


664 


14565 


A 


670 


2 


210 


NFGQEENSEMNSLCSYLHNLEKGEQTRP 
KftSRRKEIIKI*AEI*K\VQRSNRENK* 
KKWFFEKINKIDKPIiARLTKKWRT* I TI 
VHKETGTITKDPADTKRIMKEY/YKLLY 
MHSF*NLSRNLKRYREVIEKINERSGSL 
KRSIKLTNL 


665 


14566 


A 


671 


200 


3 


SLCHLPHVASKaTLEr\GLVEHMW*DFF 
CFFETESHCIAQSGVQWCNLGSLQPLPP 
RFKLFSCLSL 


666 


14567 


A 


672 


94 


374 


PKFRPQETTEQTSQFLQQINCKGRKRQG 
KRTYRLGEIQKQSQPMATFELYLD/H*F 
* LAKETL *LGAVAHTCNPS TLGGQAGWI 

TRSGVQDQPGQ 


667 


14568 


A 


673 


312 


3 


WKMGQLPGRPQLPQLSQEKILSLNSPNV 
FKKKKLK* * *NILPKKKRPDCFTHKF YQ 
TFKEEIIPF/L/HKLT*EFEKKEILLKS 
F\YEGSW*AAKIil!TADINKKTTH 


668 


14569 


A 


674 


34 


438 


QLT*PD*HFIKYSTQQQKTFFSSTHRVF 

AKLGHNLVY\KANLNKFKWT,QVIRWMFL 
DRDRIALKINKKK/PSSPLKYLQTHLLL 
NDPRIKEGSKREI IKDFALNDNATY*NL 
WCL*NGTLRKVYTTKCLFRKEGPQM 


669 


14570 


A 


675 




349 


Q KDWKSVK PSS *WP PL.A VEK I PI / LKQN 
PHIRFNGPAFFKIAHGLTSSLVFCLANSN 
YERTHSRIIIIiSQGI.QTGFAlKTFRKLL 
ASLANLALPPTINLLGELS I 


670 


14571 


A 


676 


346 


1 


K>1VKLL*NIVWKLLKSGLTEVPHGSA1F 
PFFIPIFPKEMETNVHIKTCTWAFPaAF 
FPMaKKKKQSGNNPTTDE*IQKK\W*TH 
IIDYYSAIKHNELLTHATSCNITLSKRS 
QTKK 


671. 


14572 


A 


677 


357 


6 


AAGDSSLKPMFIYCFQKPMPL*IMLKST 
LIiVLYKWNHKAWMTACLPTAWFTE/HCK 
PKLETYFSEKRFLSNIT*LMKASGHPKA 
LTEMYKEINWFKPSNTTCTLHPMDQEV 
ISTFNSY 


672 


14573 


A 


678 


401 


155 


YYSVfl!QAWQWRDLSSLQAPPPGPE*FS 
C\PASWDYRHKLLTSGDLPASASQSAGI 

TGVSHCAQPLSLFVFFLTNQRFVAALS 


673 


14574 


A 


679 


292 


395 


Q*KISFEMLLIITNVPGHPRTPMEMYKE 
LWFF\MPANTS IP* PMDQGIVLTFKSYY 
A/ENVFCKAIAV/DSDSFDGFGQSKLET 
FWKGFTI*DVIKNIDDL/WRGVKIPILT 
GVWEKLI PTLINNFEVFKASLKEI 


674 


14575 


A 


680 


275 


3 


WPRQASLALNT*S IHIYIHTDMRTHAHT 
YI FGMKSVTQAGVQWHNHGSLEP * S PGL 
RRSSHPSLPTSGEHRRTPPR/LADFLHC 
L*R*GFTML 


675 


14576 




681 


330 


507 


GSV*PVNLIRNCQPFVQSGCA/SLHSHQ 
E\WYMCSPHPRHYLVASVI FILAILPFF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etl) 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 

dingt^ 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LRRSL 


676 


14577 


A 


682 


390 


43 


HHLP IRYICLS3 SS ILCLSFIMYLS/ lY 
LS I YLS lYLS lYLCQRCQIPLVFLCLLL 
S*LS*IILILNFSCFFVGRTIVLS 


677 


14578 


A 


683 


396 


63 


ADPTEIQRIISDYYDQRHTNKLENL*KM 
DKFLKT-fNLPRLN* EEVETLIRFIAMHE 
IALIIKSLPIKRSPVLGGFSVGF*HIDK 
EELTPVLGLFQK\IKKEEILANSFYESR 


678 


14579 


A 


684 


1 


396 


EETLPLFADDMIIiYIIKKKQKTiOCTTKK 
STKRY*N**M\NVFGKVAGYKV]SITQKSI 
N*QYTI*KVKI,A.S\FSSSPQKNKIGINL 
TKEIQNVYSENYKTLKE1KDLNK*ESIP 
CSQIRRFNIVKMTVLLKLIYR 


679 


14580 


A 


685 


283 


3 


NCVESSNTIILWEPSQKPLSSVSLTDVCP 
PMLSAALFAIA\RSYTLPTRSSIDE*IK 
KMWYIHTMEYYSAFKKKRIMSFLMPRME 
LKIFRVWKIR 


680 


14581 


A 


686 


53 


324 


HCFCDRIRAAFCFYILDCPFFISITHKL 
I*IFFPFFLKRESIFVPGVGGGGPNFGS 
LNPLPPRLRRFSCLTLPRGGDYGLGPPC 

PT/NFCVF 


681 


14582 


A 


687 


39 


208 


NIFFCREGFAMLaRLVSNS*NWPGKVTH 
IHOT/STLGGLGSRITQTQEFETSLVDA 
GA 


682 


14583 


A 


688 


47 


341 


SSGRVFVLFCFFETGSHS / VLFRLAYSG 
GISAYCSCHLLSSGDSPISATR/GTTGM 
CCaAQLGFCVCVCVCF/lFVF*VKIGFC 
HVAQAQLELLDS SNPPTS 


683 


14584 


A 


689 


229 




GRVDGRQIACQEFKARLASQSAGITGVS 
HRAQPFFCFC/LLFWFEMGSHSVTQftG 
VQWCAPGSLQPLPLRFK*FSC 


684 


14585 


A 


690 


339 




KKGPPGGPTKPEGGRFPRGPPGGGKEVF 
PPPPPGGKGEKKRGGAPPPFFFLKKKRK 
SGKGG\N*FFPPGGGKKGGQKKKKRVFF 
PKKKEKEKKKALSLSLSLSIiSLSIiSLLG 


685 


14586 


A 


591 


372 


0 


YDYLyiKKL*N/IjREKIDKFLDTYNLSR 
LNQEEIENLNTPITSNKIETVIKSPPT\ 
KQKSPGPE 


686 


14587 


A 


692 


117 


297 


APLLESSSASYi:iHFSN*DLQMAH/KHTK 
RCSTSLAFREN* I KTTRRHHFTPTKMAT 
SKRHT 


687 


14588 


A 


693 


122 


322 


EQTNSWS IDFFFQRCHSNAMGEKNCLFN 
KW* WDNYVAI \AKKMKLSLFKPYI KTNS 
K/WIKDLNIRAKT 


688 


14589 




694 


21 


342 


RSHFRHLNEYYATQQGNSDVHPLLIHSN 
I STAI IWQLQRQERRKEKRLKEVKNEKA 
LKE IFFLRGWVLLCHLGWS PWGS * FI V 
TLN\FGLRQSSHLSLPSSWDYRPV 


689 


14590 


A 


695 


2 


343 


KFVPLHS GLGNRS QKQTLS QRKKKKGGV 
FVAWAKVQWHNHG* QGP*TGLRQS S CLS 
L*GGWVKL/PGGRHHAWGFPIHVKKVGLE 
LLTSDDAPSSASQKAGITGMSHTAPGPK 
NLV 


690 


14591 


A 


696 


336 


3 


IPPVKSPKFPILLKKKKFSLFCFSGAPF 
PCYSPPRAQKKGW*KIFSPPPK\QKKPG 
ALFFSPKPAFFPPPFKKKKKKKKAKEGH 
YLMIKGLIQQEKI ILNI YASNIGARRRG 


691 


14592 


A 


597 


374 


2 


AFY*KKMVSRTFIAREKSMPRFKAS KDR 
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nucleotide 
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SEQ ID 
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peptide 
sequence 


M 
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SEQID 
NO: in 
USSN 
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26 
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nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LTFLLGAGNFKLKPVLVYHSENTRALKN 
YAKSTLSVLYQRTTKAWMTSHLFTEWFT 
EFFEP\TVETYCSEKKIPSKILLLTDYA 
PRHPRTLMEMYSC 


692 


14593 


A 


698 


361 


3 


PQMCPPPPR*QGIIFL*L*THCFFCTVK 
ARLFLFPPF/S *RKGGFFFFE *RKFFKY 
FGWELFQIRFKVFFFCDRVSLCCSGWSA 
GART*LTAASTSPAQASLLPLTLPSSWD 
YSHVLPRLV 


693 


14594 


A 


699 


75 


362 


KDCaiYGICSKKQKGFCLTQTGAWQQYEE 
I*GLET/GF SRSVHS MGQRR YWD YRHAL 
THPANFVFLVETRFLHVRQAGIELPP/S 
SSQSGGITGVSQ 


694 


14595 


A 


700 


2 


277 


FFLYTSNEQSEKKI/RETNWFIITSK** 
NRNVNKEI *DLYNENSKTFLKE IKEDLN 
KWKGILSS*TEKLNI IK/PTAI * * FNAI 
LIK/IPMVFDK 


695 


14596 


A 


701 


) 


407 


gtrkhfskedrqyhkslrkqikt/tvsy 
cftptklttvgktansk/cwagckvigt 
lihcq*ecemvwllwt/svqqfl/rnvk 
lyetaisllgictreiktplhiii\iia 
kkwqqlrclstdewinnm* y ihtme yys 

A 


, 696 


14597 


A 


702 


209 


3 


SLNRRLS FFFY * RQGLALL I / E PGWGAV 
ARSWLTIPASN\F£LQHSSCLSLRSSWDY 
RRVPLHRVSLRFSC 


697 1 14598 


A 


703 


308 


3 


TTGLKR FARVGP PKCWDYRCEPP GLATS 
SVLKflFQPIGSVPPRLSRILSFM*NQ\ * 
YRMLITSTKYFPRMVSISRPCDLPATjAS 
QSAGITGMSHHAELVPLV 


698 


14599 


A 


704 


603 


2 


PLPP\GLQVESP/CVSI,PSSMDYRHAPP 
RPAWFSVF* *RRGFTIflLAKM/VLIS*PC 
DPLASASQSAGISGVSHHAWP KQT*LLD 
TDKTEGIFLTNHLLRIPPPMFYQYILIK 
VLESVKETKDKRKSVSSCTLYRCL* ILF 
IS*AYIKFTTFFFF\ETDSRSATQaSVQ 
WHGLSSLQPPPPGFK*FSCLSLLSSWDY 
RSMPPRPA 


699 


14600 


A 


705 


3 


366 


ARVTYYSGKKEPF6YLGMa*AMISIGFL 
GVIV*AHHI FTVBIDADTRAYFTSATI I 
MAIPTGVKIFS*LATLHGSNMK*SAAVL 
*ALRFIFLFTVGGLTGICI.SNSSIiDIVL 
H/DTYYVGAHF 


700 


14601 


A 


706 


358 


3 


GiTlHVeQjiGLKLt,TS*SAliLRLPKdWDY 
KREPPHLAPDRKYFS PLVFAHDNFVPQG 

p* Khvwiniclsqvkaatgiypgeagal 

RWRLHPH* RI FWPQRPRVP I KSPWTLRV 
ISTLCSC 


701 


14602 




707 


358 


67 


qlsstiwy* * kn/ridqwnkiaqntsvh 
rklisdkvtkpiqwskdslfnkecwkn* 
ts i ckk/lsldsaltlftktns k* itep 
gsvahtcnpttlgg 


702 


14603 


A 


708 


34 


363 


RTVFFFFFFFFFFFPKKTPFFFPGPRGG 
*IPPPEKKIFPPVFRGKFKKPPHLDPGG 
6NFPRKPP FFFLGP \ S FFFLGKTLGGGG 
WIPFPPPWPMLFFKKIFWGAGGPSPF 


703 


14604 


A 


709 




380 


STCKRMTLDSYFTPYTKINSKWNKDLTV 
RGRTIKLIEENIGVNLHDLGFGSGFSDM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aliinine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TPKAQAKRKIN/WDFIKYLKFCA*EDSI 
ERV/N/RRSTEWEKIFA!S!H1Q*GN*N1Q 
RILKLSKL*ARLKWAEDL 


704 


14603 


A 


710 


382 


30 


PGTTGVHHHTELTFVF/CGRAKISICWL 
VWNS* AQSACLSLPKCWD* E PQHPARFV 
SFNTRSGIPIISTNPSLSLSAFPSQGFV 
FLFPHFPASHPETALAEFFLGTFSLSPP 
GPFTPAS 


705 


14606 


A 


711 


1 


377 


LLV I CGMQM K I T VRCN * TLNKI AI I E KK 
TAKCC*HYGGLKTLIHUWCKSNMVQPLC 
KTV*QF*KKLNrQLT.YLLK*KKAYIHPS 
NCTQM FI A/ AL FS VLAPNWKQSAY 


706 


14607 


A 


712 


1 


390 


lkykimt.gmz^wacwpnysggpgnita* 
tw*aymaltrlepgrgdhtaalqpgrqs 
ttpf* kkkavpfea/ rpvkmrere i * kp 
fspeXrtyscaqegpgrtfgsaqdleaa 
ggrghhkmgavwqk pheillg 


707 


14608 


A 


713 


2 


393 


KVRRQIIWVDTATDSIGIKWIIWKHYV* 
LYRJSTKFNYLAEMDTYVERYKLP \ KEIDV 
MNSLVPAKEIIWVKITJiTKTPDPHIFT 
NEVYQTFKEYY/PVLYKLFQKIEDERTF 
LSSFCEAGINLISICKRFFF 


708 


14609 


A 


714 


219 


1 


PRPGAVAHACNPSTLGGQGGEI * TQSS I 
I KACCLIHDLAIQTQKQMTXWPGAVflHA 
CNPSTLGGQGGRITRSGREDQPSQHSKT 
PSLLKIQKKHiASWAGACNPGYFENYIQ 
KLLRGPGT 


709 


14610 


A 


715 


381 


126 


IIflGGGGVCT*SQLFKRLRQEDHLSPGV 
RGYKEL* SYPCTLAWVTE*DPVF* KKHY 
/TLNTSPKETNKK*GVECIKTGKVLIFV 
AAG 


710 


14611 


A 


716 


417 




CYCCWGATNKILLLTDNASGRQIVLMDM 
YKKNNWFVPVNTRSFVQI.MDQGIILNF 
KSYLRN/TF/HKCIAVINSNSS*ESGQS 
KLK2%I.WKGFTVLYAIQNICDSGQVKMPT 
LTEV/WKEVILPLMGDFVRNRTSEKQNY 
KLS 


711 


14612 


A 


717 


406 


122 


MFSRDAVSLCCLGWS*TPELKLSSSLSL 
PKCWDYRGEPPCSAAXDF* I IRKR/HSC 
R1IIIRNSDAWQR*TTLVSYDRNPP*FPS 
PPPNSPLCPAA 


712 


14613 


A 


718 


535 


1 


HlSnJ!IKIV*NLHVENYKMLVKDKKT* ISC 
LWVRR\LVKIPVLPKIiFYSVIVFPIRSL 
VEYFIEVCTFTLKCIPKEKDLKQPNRF* 
KKKKKVHLPFIKVYY\KLQ*YIATVIKr 
VWY/W/YSKIDKAKWYKTEKAEV*PHEY 
DQLILAEVQINH\NLFNKCY* SN*TTIG 
KNMNLNLSFTSYT 


713 


14614 


A 


719 


255 


1 


GKLQPNKGLI S LI Y\NL *KLVRKSNPKE 
KWAKGMNR*LTEKGIQIAFRHMKRCSPR 
FIMKAMHIITTWSYHFSSKRKTRIQRLT 


714 


14615 


A 


720 


381 


0 


KFIAVKEYI IKEIDLPINNLTLYLKEL/ 
EREELTLCKASRAEVI\KIRAKI lEELT 
PIRKTDNQSWFFLKIHKIGQ/PFKLD*L 
QKKKHVREKVPITKLRYENRVITTNFTE 
/IKRIIREYYDH 


715 


14616 


A 


721 


3 


381 


lYREWRDLTRDGTYQGHHAPPGEKG/VC 
RYGIILFITSEGFFFAGFF*PFNHCSLA 
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Predict- 
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correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=JVIetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PTPQLGGPWPPTGITPLHPLKVPLLDTS 
VLLASGESIT*AHHS\LIENNRNQLIQA 
LLITILLGLYFTLLQA 


716 


14617 


A 


722 


56 


455 


KTIINIKPPAIIQYQTPLFV*SVLITAE 
\LVLLSLPGLAVG I T ILLTDRNLNTTFL 
DPAGGGDPILYQHLF*FFGHPEVYILIL 
PGFGIISHIVAYYSGKKEPFGYIGMV*A 
MISIGFLGFIG*AHHIFTGGIA 


717 


14618 


A 


723 


398 


3 


HQLPVFWQ YMKKAWTTRPLFVDWLH * CL 
VFEVSKYPASKGLPFKWLMLDNAPD/H 
PTQNSMSS/DSKGIEVIYLPPNTOSLI* 
FLDQGVIRTYR/RHXPQYSMQRSIRPMQ 
EISNKENI IKVWKKSTTDDAIVA 


718 


14619 


A 


724 


318 


14 


tppfflnipnlfflkfpsppppwfltks 
lffplklkpkvfgk*slskf/cpfpkfw 
pkpqkknfffppppfffffffflrhnl/ 
vchpawsalapsqltatsas 


719 


14620 


A 


725 


2 


371 


APRCKRFSCLSLPSGWDYRCMPPRPVNF 
F/SIFSRAGGLES*PPMIPPA*A/FPK/ 
CWDYRHE 


720 


14621 


A 


726 


430 


97 


PKiaTPFWKRA*NGC5AQKDVPLS]:iP/SGW 
AGGSPWGPGF*PPLGPKGGPPFSKKPPG 
GVGPFLMPPTPEGGGGKLA* PWPPGFNP 
PVSIFLPRPPRAGGKKGTPFPKKKKKEK 


721 


14622 


A 


727 


406 




GGAPPPPFFFFFFI KKGVFPF*QEGFFF 
PGLKISr/LPP*PPKKVGVKGGSPPPGGFF 
FFFKKKVFFFFPGLRTRAQI KVPSTLVG 
QG* KILPPQVSQEGGVKEEGPQAQLFFF 
FFFFEMESHFVTQAGLQWRDIGSP 


722 


14623 


A 


728 


98 


368 


EMGTL/LHY«WEGNIEQPWE /T/VWQLL 
EKLKTELPCDPATPLLGIYPKTLESRDS 
NRFFIFFFSF 


723 


14624 


A 


729 


19 


404 


VCVISICLATEILFFLHTRPCVCIYCVC 
VCVCLaVWGTLCVYVCIESCVCVCFHLT 
GVLC/V*VCVISICIiATErLFFLHTRPC 
VCIYCVCVCVCLAVWGTLCVYVCIESCV 
CVCFHLTGV\CVCVCVCIESSVCVCPPT 
CGTVCVCVRPPGVI.CVCVCVCO?!AWVAV 
SMCLTP/ CPC/VCVCVCVCV 


724 


14625 


A 


730 


432 


12 


FFHKPNFPAftRRGRLFFPIYPSKCLISP 
KAL*F\FGGVGPFFPPPKEGFFPKVPR* 
CFFRPLIRKKQILFLPPLNLAPPGVl *M 
/PPPPIVFFFFFFKRDRVSLYCPGWSAV 
A*S*L/TAVRTRGLK*SSCLSLPKCWEy 
KCEP 


725 


14626 


A 


731 


332 


7 


NKTRKEKFLTRSISRF/CFPPFPLKIFF 
FPLRA* FFWGGLPQ I FPP P KKVFF/P KI 
PKVFFKWPPIiRKKIFFFFSPLILGPPRV 
LLKGPPLFFFW*VFFF* DRALLCHPY 


726 


14627 


A 


732 


70 


294 


FLRFCVETAINKIAIHjflNF*NFL*RWG 
\FTMFPRLVSKL*DSSDPPTSVSQSVGI 
TWSHHARLNFFYYQNSRR 


727 


14628 


A 


733 




355 


FLLRHILLCHPGFCSVATAYCSRDPGSS 
D/PLPPQAPLPDQ*PRLQA/WHRLAPPH 
SANFF**RQGFTVLARMVSISQPCDPPH 
SGPQGA 


728 


14629 


A 


734 


50 


393 


IPGLTRQWLLDPCASPSTPPYT/P*VQP 
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peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Hlstidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














STPQWSSPSPKTHNQKGLPMPLSPTPKP 
STAWKKAILEHTSSSSSSSSSSSTCEMR 
NGYTYTVPEHRPARGHTASQTRKQVLAA 
THKP 


729 


14630 


A 


735 


451 


0 


gpsssptniptpvswnsfx+ssqvagat 
gtcyhawl i fvfsvemdfy/hpva*asq 
saqlcqyfmi flrqgvslcppgwsavaw 
skltt/clclpgswdhrc7uahpanfly 
fwwr/hsc 


730 


14631 


A 


736 


2 


477 


P3HVSNKRLISKIYKELIHLNSKKVSNP 
I*K*GQSLNRDFSKASFLPP*KC\QMVN 
RYIKRCSTSLIIKEMQIKTTMRCHLTPV 
RLAIMKKSKDNKPSLVWDC* IRVLHLF/ 
CK/WICQYVQL/FWLLLSNFLVKQPF\Q 
LPAPPNSPTLG 


731 


14632 


A 


737 


3 


2344 


AAGGPTAQSPAQLAGRALRIAP.WHAVAV 
GACRPGAGSPCSVQGGAASELSPRPQTW 
IGSLKP*TFGAAAG*AHRGCGGSAI ,LN* 
ATPRPAPGLPASPTSSQALPAPLGAWGH 
SDHQPRAFP* SPQASTAIRKEKKQRAQP 
GRASVCPASNPFISSRALPVLQHGPPAI 
SGAGSAVASQAPGSS/GSHAESGSPALA 
HTP * GS * EPHSLI VESTRKS \ELPSSSQ 
GRLLLPLLTPGVAS /PTOTKLPGATAAT 
AGALHQPGLRLS SLQGVGGAKNKQTGCC 
CLQLPTTGLPQAPGALRPLGRLGPAAAP 
GEHRQRTSPQGTVPYGGIRQGWEQPQRL 
RAYGTALPPPHTPPGSSGP/RQAPGCRG 
SGAGEAAGIRDTGGGGPGRRAPCSQASP 
GRGGWQAQVGCETCRGCAQS S /GGGAVQ 
PGLPRKPYPHS/AR*E1ILWPFPCSPRT 
RAQEPQTQGEE/GVKGPQGSPCAPGAVR 
GRG IQLS SEPGKLARQG/PASGDGP * EG 
TGQEPSQAFSSARHPWPSEAKMPDTAIC 
RNQAQ*LQLTSSQPSMGPQLRKSLPATP 
QPNSYWDGGNSATLGRTTNTRRHCGMSN 
FGARGDLLGTVPT* QPLMQRRKEKPRVG 
GEPVQSHTVCG* \ PAGVSRGWPLRPMLP 
ERWRPLSASSGSQGRPGLHPPSLA\CGP 
S SS PHRTCSP /GLDPGLPDAGS IKPPS L 

vgagqgagst/gld/gpglsllspgksl 
lppsspatglsglgwaqsaf*sllrva* 
wlnpvpngpsdtadctpaqaptapamle 
nqanksdfffh 


732 


14633 


A 


738 


37 


450 


NQKK?7PPPRSSKPARPLFFFL*QKYIKD 
P*KTYQGCFFFSPPPKKKKPPPPKGGGF 
FFFF*KKKGGGPPP/QAKKRGGGGPPQK 
GGAKKNPPRGFTRVFKGPFKKRGPP 


733 


14634 


A 


739 


2 


426 


QEFVLGSAPGCGCWVGGEGRVSSPWGDR 
KVGGPSNSRHWNKGAGPSCPPGPSLAAG 
LPGRRAGSWAPVLW\PGPP*GLAQLPSP 
IiW/PPSPPQEREPPLPD 


734 


14635 


A 


740 


21 


392 


AQEFKSSLGNMERPLPYQKKKKKKRPQG 
GA/RPLETQPPGGPGGGEKKGGKKKTSR 
P/IRGKPPFL*KKKN\CLaWGGGPAL*P 
QFFGGVGEKKTFWPGGGGLGKPKPPPFT 
PTRGKNQTPQKKKKK 


735 


14636 


A 


741 




40S 


PSSQGG* EAGG*LEPRSSRPAWAI*RDP 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=jLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \'=possible 
nucleotide insertion 














/ GS PEKGKKKKKKKKKKKKKKGGG6FKK 
TPGGAKIKGGGEKKKFFFKGGEKKNLGG 
ILEKKLFFGGGKNGEKPPKKKKGLREKK 
KF*RGKGGKKAEIIPGGKKIFQKKKK 


736 


J 4637 




742 


394 


1 


RDPLEEAVCPFSDLQLRAGRTTALFKflV 
RQGHLSLQRLLRLFVCLCPAPRGGAYRS 
RQASLNRGGFHPVEAYLLLCLPKQASEM 
AGAPSPA*LPPCSLISDCCASNQ*DSVG 
/VKPSEPGEGYNLWRRFLS 


737 


J463S 




743 


384 




FTGPYLKIOTGYSFLFLYPPPSVLGNWQP 
PHPGRVFPP*DVNFNQWIFVSL/ISSLK 
DEPAFSSTLYSF*QHFFFITYLPPP/CF 
FLIESCSVAQGGVQWCNFGSLQPPPPGY 
KRFSCVGLPSSWDYRCV 


738 


14639 


A 


744 


2 


410 


TPLNPLEVPLLNTSVLLASGVSMA*ADH 
SLIENNRNQIIQALVITILLGLYFTLLQ 
ASEYFESPFTISDGIYGSTFFVATGFHG 
LHVIIGSTFLTICFIRQLIFHFTSKHHF 
GFEAAV* YWHFVN/VRGLFL YGS I YW 


739 


14640 


A 


745 


393 




PPPPPRGRGSPPPPFSRKGGPKGKPPPP 
GNFFFFFKKRVFPF\CPGGFKS*V*GVP 
PPFPPKIFKFKGGTPFPGLWSPLKSKKS 
QFFWVPKPGERKKLFPPFFFFFFLRPCL 
SLS PRLECSGAISAHRNL 


740 


14641 




746 


2 


290 


KNLCRENYKYE*KKLKTMKDLCKEYYKV 
* MKKI KEKM *KDKLMNFK\NN IKIS TL I 
KVIYKFNAIPIKIPMTFFAE\LK*ILKC 
V*NyKRP*IVKAI 


741 


14642 


A 


747 


59 


364 


CRFFFFFGLGGGGGGGGGGPPPFF* KNP 
PWPPKKIPNWGFFFRGPGFAPKKKKGPF 
LGEPPPWGAPLYNPPPPFGGGGAKFPPI 
FFPRAGGPQKRVG/RPPPPPPPPPPRPK 
KKKNLH*LECVEGTEEPPTRPPTRP 


742 


14643 


A 


748 


2 


371 


SSCLDLPKCWDYRHELLHLAHWLVLGIF 
LLLLFCFVLRW/RSCSWQTGV*WSDLT 
SLQSIiliP 


743 


14644 




749 


339 


198 


IADMEKV*AF*TEHKTNHISLNQN* I*S 
KVLTLFNSMKAERD* EGPEEKFGASTGW 
FMRFQE I SQLYN/ IDV*6EAASflDGEAA 
ATCaEDPAKIPDEGGSPK*YIFNVD*AT 
FIRDLSWIFCTSCCSFSISTCCFTLHVM 
L 


744 


14645 


A 


750 


249 


2 


KIFKPPAFPVSPPPFPL*DSSSPPEP*I 
SGGRVGPI\FPPPKKSFFPKNPP6VFFS 
PPLRKKILLVPPPLNLGPPKGPLKRPP 


745 1 14646 

i 


A 


751 




288 


VVNHMTDEGLVSRIY*QPr.QL\KRKTEN 
PVQKWTKCPNRRFSEEDEQMAEKHKQGW 
PASRAIRGTQSKI IMRCHPTHTRMaRIK 
KEKEAGPGGSCL 


746 


14647 


A 


752 


494 


2 


ETGSWFGPQRLEVQVVWIKSSLQP*PPW 
APVILPPQ/LSSG*DHRHVPSRPG*LKK 
KFFL*RWGLTiyiLPRLLLraStLPTSASQNA 
AITGVSHCARSASVFLILRQGLWVWEE / 
GAQW*SALSHLPV*/LSR*WCQP*PI.G* 
DSGSGSPLWRRLAYLPLCaLWKEVTMCS 


747 


14648 


A 


753 




533 


YI,SPRLECSGGITAHCN/LRPPGLK*SS 
QLSLQ\SSWDCRPAPPRPAARLIFFCRF 
EEGRKSHYVAQACLKLPGS\SNPPTVA/ 
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Inst amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G^GIycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FPKCW6YRCEPPHTGLCS/GLSEFGFCL 
WWGACSRSSHKNPGCGTA/CPAVFLHSV 
GYASHPSSTSRTAFTLGWVLVFVFRDKG 
LTVFSQAGSTVGVILG 


748 


14649 


A 


754 


1 


346 


ELSKSTLPALCGGNDKAWMTAHRFIAWF 
TDYLKPTVENYW*EKKIRLKVL1,LIDKA 
PGHPKALLEMYTEIHWFMPADTTSIL* 
PTHQGVISTLKFCYL*MTFHKVQ\CYID 
SDSS 


749 


14650 


A 


755 


1 


323 


EDQTSHNVLLSUMLIQSKALMLFNSMEA 
E/R/SEEAVEEKFEASRE*FMRLKERSC 
LQNI KVQGEEASADGEPPACDPEDLAE I 
TDEGDC IK* QI FIVDKTAFYSKKMPS 


750 


14651 


A 


756 


1 


344 


ARQQPGDPARGGGRGRRL \QPHQAAS FC 
TGTLARAEHTCEGAGVLRRPADKLASLN 
*HFSQKKKKKKKKKKKKKKKKKASSSSS 
KKKKDSSYSSSSSRGGSSSSSSSSPPPP 
PPP 


751 


14652 


A 


737 


119 


317 


NRYFPFLRVKK?PWVEGPFFFLR*SLTL 
LTLVAQAGLQWRNLSSLKSPLPGFKPFS 
/PSA/LSRWDYRR 


752 


14653 




758 


2 


363 


cii:aivksainriak*yi*kcsmsiitk 

RQA/ I KRKNKTRRYQL I PVRMTL I KKKK 
RW*RCEEKGRLAHCWF\ECK*RQPL*KT 
K*RFLKKLKLP\FITAIALLDIYPKQIK 
SE/CKKH/CAILFIALFTIAK 


753 


14654 


A 


759 


358 


1 


KADFCFPPTFSPRVKKIFPPPFPKKWG* 
KGP/LPPPGENCVFFFFKKKRGFSPLGG 
GFFYFFPPGFPPPWPFKKLGFKGLTPPP 
GPPQKILVFFFFFFFFFFE/DSVLLCRP 
GWSAVAQS 


754 


14655 


A 


760 


316 


2 


katrsqdirriqvklkeiepqkplqktn 
ksrsvff* kh* *ne.tltrvmenkrek/n 
eidpiqnhkedittdpteiqtirkyykh 
lyahklkkleevdkludtntl 


755 


14656 


A 


761 


3 


321 


fplglwgslgakpegqaplyppti.skrv 
vplc/dkgrsavtfntvyssgssrwfp 
ppcm*iyrlcikgrssdpeqkkkkkkkk 
kkkkkkaskkkkapkdssspkt 


756 


14657 


A 


762 


315 


2 


KTERW/ VFEKINKMGQS *DGLTKKNREN 
/V/QINKIQNERG6LST6STEIQRVIRG 

fcepl*aqildnlekmd*flqpfhlprq 
nykkkkii *kkpiskvier.viknl 


757 


14658 


A 


763 


226 


329 


npttlegqggriskgpef/csrdevspc 
wpgwfqtpdlwrsvrlglpkwwdp/rr* 
apppglnliffrmlsthgsw/cqhpcfkf 
ptsafqyiyiyif/sflsrsftfvaqav 
vqwhdlgspqppppqfkq , 


758 


14659 


A 


764 


335 


1 


qsqerpdsfqvmnltlprrpsmflvspr 
telsksp* \ppaflcpcvwphstllsqt 

LKY* IKYP*SPTPFPSPVLF*RDRVLLF 
HPGWSTLAGS *LTAaSNSWAQVTLL 


759 


14660 




765 


100 


310 


HFGHPKRENCLSP*V*DQTIjHTHTHTHT 
HTHTHTHSRSENGGiyiRLTPHIPERVRWX 

Girc 


760 


14661 


A 


766 


315 


3 


IFPNPPRIFIQNPHPFLFGFFPKKKEGE 
KNWAMETPRFSPPPLGGKKAPTGFPPPP 
IGF*GPKPLIKGRGP*/PP*KNTKIFPF 



567 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nudeotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliion!ne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \-possible 














FFFFF* DRVLLCHPG * SAVARW 


761 


14662 


A 


767 


1 


309 


KEPYKLTELHNEIRVACGHLSKKKKITW 
VSDS*ASCPSKTRGNFIGPSAGSEAPSE 
KLEAHFHGCGSVNRVHLT/CK*HKRRPL 
GVAHLEVSDQKSGRTSIALD 


762 


14663 


A 


769 


1 


316 


GRAPPPPPPPPPTPRVHTSSKSESEPER 
/DGRE * EPTRSLERL/LFFAS YFCLEAR 
QSTSALPPLPLSPRSGPT\VLLCPSPPL 
PGERPPNPTKLSREKQTKQTHRAR 


763 


14664 


A 


770 


2 


366 


AREIIILTF*QMITFLNTKGRT*SLILA 

SL r ifiattn\llgllpysftpttqlfi 

NLTMAI PL*AGAWIGFRSKIKNALGHF 
LPQGTPTPLKPILPIIETIILLIQPIAL 
AVRLTAHITA 


764 


14665 


A 


771 


2 


364 


NAFPVISGAWTEYPLSHFIQRCRQKKE/ 
LNGLQIGKEEEIFLKKI3aML*IESPKES 
VQKRLDVIYKFSRIAVCKINIQKSNIW 
YVWNTQFENEVKTI *DPQ/DIKHWGIIL 
SQRKKE 


765 


14666 


A 


772 


3 


376 


HEPLGKLKLSLLFI LATYSLTVYS IL*S 
G*ATNSNYAL IGALRAVAQT IS YEVTLA 
IILLSTLLRRGCFNLSTLMTTQEHL*LL 
LPS * PLAI I * FI CTLAETNRTPFDLAER 
ESELLSC/FNIEYA 


766 


14667 


A 


773 




350 


HEF FFFI LKMYLGQAQYFIC FLFFFLGF 
HLK/HKSCSVTQAKVQRBHLGSMQPPPP 
GFMQFSVAEIKDVHHHAQLIPYIFRIFI 
F*F/NFLRHSLALVAQDGVQWEDI,GTLQ 
PLPPPG 


i(n 


14668 




774 


1 


359 


GTRYAAMLSALGFIFLFTARGLTGIVLA 

NSSLDIVLHDTDYVGAHFHYVLSIGAVF 

ALIP\GFIH-*FPT,FSGYTT,DQTYAKIH\ 
FTIIFIGANLTLLPQHFFGLSGMPRQYS 
DYPDAYTTW 


768 


14669 


A 


775 


2 


369 


ARGS I CLRQTELKT VIAYS S X RHI GL W 
TAILIQTP*SLTGAVILIIAHGLTYSLL 
CCLMrSNYERTHS\RlIILSQGLETLLP 
LITF* *LLARLWSriiALPPTINIiGELSV 
LVTTFS+ANIT 


769 


14670 


A 


776 


2 


353 


ARGTGA* VDS *LTTLHGSIIMK* CAANLL 
XWTLRCKILFTVRGLTCailAITNSTLD lA 
LHDTYYWAHFHYVLS IGAGFAI IRGFI 
H*FPLFSGYTLDQTYAKIHFTIIFIGVK 
ITFFPQ 


770 


14671 


A 


777 


3 


353 


HEGLHL* LPKAHVETPMDG* lALDAGLL 
RLRGYGIIHVTLILNPLRKHILHPFLVL 
SI*GIIITSSICIiRQTDLKSLIAYSSIS 
Hr\SLEVTAILIRTPL\SFTGaDILIIS 
HGLTCS 


771 


14672 




778 


367 


2 


FCPiy CPNQKFMGVGAVGPALYPNPFGG 
LGGLV/TPGAGVU!IPAGPPG*TPPPPKN 
PNLGGGPALFFPPLKGLGWKIALTPEAK 
GSINPNSPPALPGGGPNQTFSKKKKRKK 
LMLVYSIELTSRA 


772 


14673 


A 


779 


3 


432 


HEPIHWLSSCVCPACVCVCVCQ/ CLWLQ 
VGVQ* CaJYSSWQHEPLH*VHTRVYVCVC 
LCHSVAQAGVQ*CNYS/CTAACNSW 


773 


14674 


A 


780 


213 


466 


DDILPVWNYSISFLFFFENKAFFLPPGW 
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ing to first 
amino acid 
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last amino 

residue of 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=l,ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GAGTQL*LPAASN*GPKGPSRLTLL*W\ 
DNRHGPPC/LDNFIFLQKKHLTLLPRLA 
LN 


774 


14675 


A 


781 


304 


373 


R*VFFFFFLKKEFHFFPQAGGQNQEMG* 
LHPLPPGAG/RF*CLTPPHSWDNGEGPP 
RPPNFCFFRGNKVSPCGP 


775 


14676 


A 


782 


389 


1 


PFPKN* CRVNSRPNVKHKTI/QLLEGNL 
GASVDDnEFGDDFLDTIPKVQSMKERS/ 
WDFIKIKNVCFAKDNVKERGPATNWKKI 
SVKDLSNKGLLPKIYEELLKLNNTET+R 
LT*KWSKVLNRQLTRETHA 


776 


14677 


A 


783 




383 


GTSP * PLTGALS YLLMTCGLAM* *HFHS 
ITLL ILGLLAWTLTIYPGRGDVSRQSAY 
QGHHTPPVQKGLLYRIILFITSEAFFFA 
GFF*SF\YHSSLSPTPQI.GGHWSPTGIA 
PLNSL*VPLIiNTWRLL 


111 


14678 


A 


784 


1 


389 


GTS I VI PTQVKVFN*LATIiIGSNMK* SA 
AEL*ALGFrFLFTGSGLTGIVliANSSIJJ 
IVLHDTYYGEAHFHYVLS IGAVFAI IGG 
FIH*FPLFSGYTIiDQTYAKIHLT\lIFI 
GVNLTFFPQHFLGLSGNA 


778 


14679 




785 


374 


1 


ELKAYWIIVMNLQNLIWNAQPLSIMQIFQ 
I L I KSQ IQNTLWS I SDTGYLPGIDKWN 
CI*LKILCIVKVTINR2UIMPVIDWENTF 
STYTMDKGLIPKYKELKHSNKQTNlILrK 
KWAKGIflSHSHA 


779 


14680 


A 


786 


.1 


363 


GTRLYHaNTN*KKLRVAILISEK\TDFT 
VKKIRNKEGHYIM1KRSIL*EGITILYV 
GrPSKRWNYIRQKLIKLPGEIDESTII 
LRDFNTPLSVIDASTRKKISKNIVESNN 
IISQLDLID 


780 


14681 


A 


787 


1 


361 










GMLKAETGHKLRLLQQ/TSQWMAKEKF 
LKEIKSATPVNTRKTRK*DSLIiADTGKV 
LVACIED*TSHNVLFS*SLIQSKALTLL 
NCMKPERQE 


781 


14682 


A 


788 


1 


352 


GTRKYAKSTKSKLYRWNyKAWMTAYLFT 
AWCTEYEKPTVETYCSEGL/SLKILLL I 
DNASSH* RALMEMYKQ INWFMLDNR I S 
IiI,QPVDQ*VILTFKSYLRKTFHKALAAR 
DNDSSD 


782 


14683 


A 


789 


365 


128 


PLDQHGETPLLLKIQKIjAERNGAHLYSQ 
LLGRLRHENLLHPGGRGC/SHCTPAWVI 
E*DCLK1TQTKTKAPRRVSISHSA 


783 


14684 


A 


790 


217 


257 


WSGGVAHPVIPATQEAEAEELSLGGRDC 
SELRSCHS\CTPA*VTQAGVQ 


784 


14685 


A 


791 


2 


363 


LAILHQTVS *FVHAKEKFWKDLLVKSAT 
PVAI*MIRNLNSLYNLIMETV*W*KED 
QTSHLIPLS*SLTQSKALIFFKAMKTDR 
GKGAVE*KFEATRGWVMRLKEKFCLS/H 
HIKVQG 


785 


14686 


A 


792 


2 


362 


gqeclgllhqtvskfvna:<ekfwkell\k 

SATPVDI*MlRNRTS\LISnMETV*W* 
KEDQTSPLI PIR* SLTQSKAVNLFRAMK 
PVRGKGAVE*KYEASRGWFMQYKEKSC/ 
RMCMIKIQG 


786 


14687 


A 


793 


2 


360 


AEAGSTMAFKNYAKSTLPVLYKWNKKA* 
MTAHLFAARVTKYVKS/YCLNKKIPFKI 
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Predict- 
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ding to 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














*VFIDIVPSHPRALIGIYKE/ INVFMPA 
NTTSTLHPTDQEVISTFKCYYFKNRFRK 
AIQLPYSNSY 


787 


146SS 




794 


2 


359 


AREKLYLS ILTPLQRSLVIAFAATELTR 
YYILFETTLIPSlaaiSTR*GD\QPERLN 
AGT YLLFYTLVGS LPLLIALI YTHNTLG 
SLGILLLTLTGQELSNS*ANNLI*LAYT 
lAFIVKIP 


788 


14689 


A 


795 


157 


365 


GIIEEKGYLPEQIFNAKCSGMISAHCSP 
RLPGSSYSTSA*TDP*CQVQWYDLSSLQ 
PPPPRFKLPYLSDPSTWDYILAPP/RPS 
* *FVFFVEMGFRQTPE 


789 


14690 


A 


796 


224 


372 


IPFIFIFIYLFTEWRQSFALVA\QAGVQ 
WRNLGSL* PTHPPPRVKRLSCL 


790 


14691 


A 


797 


482 


^ 


NSFFFFFFLFPEAKDHKDFPSPPFPVFK 
KLKGFC*APP*VQKPVFYLKSF/CLKDY 
MLSFPESKVNFPSPPPL/SPSPLFMVGD 
PPLPPETYPSFCF* DPHS INFFLPPGPF 
LPHFFFFLRWVLLaiPG*STAV*SLLTA 
TS\VK*FFHLSLPSSWVYRCTSC 


791 


14692 


A 


798 


1 


415 


NLGGGGCSELRS YHCS PAWATE * DS I S K 
QTKTL/NKDHTRAGWERA 


792 


14693 


A 


799 


2 


401 


VQTGPHHVGQAGL* LLTSGNP PASAS / Q 
SAGITGMSHRARPQ 


793 


14694 


A 


800 


73 


307 


PMALEHHGCGMCLDPLPTFGKSHCFVLR 
CAEMETRSFLPSWSAGA*Y\CLLQPPPP 
RFS*IJlLPSRWDYRHLLPCPA]Sr 


794 


14695 


A 


801 


87 


401 


SLIEIWTLKKSTCHNLYIWVLS *HSLKA 
I VLREHS /VESLTHVLKMCLV*NFVPAF 
S*RSLCFFLKISRPVWAHVaiPSTLRG 
*GGHITRSRDRDHPSQHGEIPS 


795 


14696 


A 


802 


3 


354 


LRHYTP\PG*QSETLPFKKKKKKKKKKK 
KLSFFPPPKFLKKKKGLFKNPFKKKGKI 
FFNPPPPKKKGFFLIVWPPPKKKNPPPL 
GGGGPPKKIYF*KTFFFAPPPKFENPFF 
FPSPRF 


796 


14697 


A 


803 


1 


830 


vetgfmvrqaglklltsgdlptlayqs 
agitgvshcawlffff**clavtqteva 
ps*ltiasn\pglklssfftlphharli 
fkifsrnevllfsrXpsqtpnlmqsscl 
slpkcwdyeceplypaeslsf\fiklsc 
mlkclevkc]!rovcnll*ntlikwinewi 
egwigrltcvkqieknvicri *vmlkat 
qpvgteps* esrstnsgeamw * skdntp 
fpvfvsvfvlrqs / lcfvaqagvqwshl 
s slqpls prlk* s sclsllsswdlrara 


797 


14698 


A 


804 


389 


3 


fiwpfkgapensffvfmwikghkcpkkg 
fgrkknqlgpkipf*kkkkkgafplwlk 
rfqgpfffffewefhs\iaqagvqwcdl 
sslqplpprfkrfsclslpsswdyrhap 
prpanf\cvfsrdevspcw 


798 


14699 


A 


805 


161 


21 


rgggclqsqfr/rrlrhenrlnpggggc 
teprsyrctpaware*dsvsk 


799 


14700 


A 


806 


405 


82 


fflrwtl/dtvtrggiqwcnlgspqpp/ 
pprfkefsclslssswdyrrpppcpanf 

LYF* *RRGFTMIiARLVLNP*PRDPPASA 
SQSAGITGVGHRAWPMPIFENRFDL 


800 


14701 


A 


807 


938 


2415 


kitfwetfwittvhphlckereaiagil 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=G!>cine, H=HistiiJine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlirconine, V=V aline, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RRWS*NNW/MIHTHTKNLNPYLTPDTKA 
TFK*IIGLT*RAKTMQL/LESFCQKENT 
GENLSDLGVGKDFLRHKKHGS IKGEKIA 
KLDFI QVKNF* SLKDTFKKMKKYALGWE 
KIFAERVSDRGCVSRRYKEL*L/IELKD 
NPIRKGGNNLWKVHQRI *MAIfKHMICRCP 
KS*VIREI*I*TIMRYHCILPRMAVMK/ 
SD* SHGDKNGGSSGTLIH\CR*ECEMDQ 
LLWKTIGQFLSK*M*RCHTDSSIPFLII 
QEK+NCISTKKTCTQTFTAA/MYLLVIA 
KNWKQLPYPSSVWI / QQIWC I YT \ME YY 
SAI krtvdtlnnXqmdlki IRFNE * S/H 
NLRRVHCMY 


801 


14702 


A 


808 


414 


1 


KPKaKLFVPVRQWM-AKEKFLKEAKSAfl. 
PLN\RMIRKQTSLTADTEQV*VIWIEY* 
TSHSI PLSQSL IQSRVLTL \SMK2VETGE 
KAAEENLEDSRG * FTRLKBKSHLHNIKV 
QGEAVSADGEAAAGYPEDIiAKIIDER 


802 


14703 


A 


S09 


399 


1 


TGPPPRNQPFNKPAKGNFWLFLFFFL*F 
/CNFFFFFFFTGAFLSLFGI\LV*CLML 
VWGNFSP*PFKYCFTFSFFLSFIFLIFL 
WLLLNTCLYICYTFCSYLTVL*CSFLFS 
S I FFLLYF* FWRLLLIHPPPTRP 


803 


14704 




810 


1 


392 


ILLAGAIEDAEPGSG*TGYPTLTGSYSH 
PGAYVYLTILSLHLAGDSSILRAINCI \ 
TTLINIKPPAVARYLTPL.FV* PVLDTAM 
LLILSLPVLADGVT I LLTDRNLYSTFFD 
PAGRGDPILYQHLL*LLRH 


804 


14705 


A 


811 


66 


411 


RETRAGAQGLSRFSPPWGNPSQKIFLGP 
GFKKNPGPQKKGNFFFFF*KKFPLVSPG 
GRGGGNFRSLQGPPPKVPPFFCFNPPRK 
GG*/MGAPP 


805 


14706 


A 


812 


3 


392 


LSVPQFPPFFLKKTFRNCPPFCFSPPNI 
LGFPLKGSKK*GSFFPFFPPPNYSFFWF 
PSYSF*NQPCLTRGPF*NQKFPRL*KGV 
PIIiRL*SPLFFGPPPHSF/IFFFFDRVS 
LCHPGYSAVAQS/LLTAASS 


806 ! 14707 

1 
i 
1 


A 


813 


690 


153 


IISIDAKKAFYEIPPPSVISSSSPTDSL 
YYNLGFLKDfFKKGREVKYLNI IKYI YEK 
PTANIIFISGKLKAFSLRSGTRQICPLS 
PLQFNKVLEVLARATWQEKEIRNIHIEK 
EEVKLSLFVSNILCIENPLKMP*KKrSSN 
* * IQRVAGYKTNI *KIS \AFLYTKNKLF 
CCLSHPIYDIPL 


807 


1470S 


A 


814 


32 


376 


LFGI>BRSyiTBGGRLPENPTIPH6*REF 
WELCNKCD\TMRPKPSLHCSRCGHCVTR 
MDHHCPWIKWCVGEDKH*LFLQLCFYTE 
LLTCYALMI SFCHYYYFLPLKKRTLVRN 


808 


14709 


A 


815 


2 


423 


YPLYPFKI FI FPKGFNFCREVGP I CPPP 
KIKVLSKNSQVGPYTAPYKEKKNTLPAR 
VNFGPPKDSLKRPPLFFFFFF*RI)GILL 
CYPSWS *TPGLKQSRCLGLLKHWDYIC* 
PLHPASF*KKHySLRILP/SLLADAW 


809 


14710 


A 


816 


404 


54 


SLAHFFPPPKKGFFPKIPRGVL*PPP*K 
KKFLFS/HPPVNLGPPRDPLKGPPPSSS 
SSSSSSSSSSPGG*MIFy*/P**FGPAL 
VQPMRCYCKASCKVKAPFFSTHSGLMVA 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=iVIetliionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arglnine, S=Serine, 
T=Threonjne, V=VaJine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=posslble 
nucleotide insertion 














PPTTAQS 


810 


14711 


A 


817 




410 


DAEVGGSLEPG\SC3GCSEP*SCHCTPAW 
VTG * ETVS KKKKKKKKGFFP PF FFKEGP 
FFREIEKKFGKKFFSKKKKKKKKKKEGG 
PFKKNFWGGKNLRGGKKKFFFFLGGEKK 
KPRGFFEKKTFFWGGKNWATTTQK 


811 


14712 


A 


818 


1 


404 


I PINSLTSKINKLLKRHKLP /HLNQEEI 
DNLTRTIPI *BrlMF*IYKQN'TGPDGS IS 
KFY*TIRKT*Q*FYTSFQK1KNEEVLPN 
S F YEASTP LAS P KKKKKKKKKK 


812 


14713 


A 


819 


421 


2 


LPPKRRCFSPKPPRGFFFAPS*GKKNI/ 
CFPPR*NLAPPGVFLKGPPLFFFFFFFF 
FRG RVG * S WVPAAS PLQGQVFL PPQ / S S 
*VGGSPGPPEHARVNFFFFFFFFF*F** 
RQSLTMLPSLVSNSWA*TILTPQPPKVL 
GLQ 


813 


14714 


A 


820 


84 


418 


ILTCHNARLGEKSQIVTMQASQNNPKLV 
NICEVFLVFFFWKPIFFGTQVGGQYFNL 
SSL*VRLPGYTHFFCLTLPKSWDYRPIP 
PS\P1FFCPFNKKGVSPGYPGGCNILVF 


814 


14715 


A 


821 


290 


3 


FSPPPEKLGPPRNFLKGPPPFPFFFFFF 
FFFFFFFMPCVLIY\FIFFCTNLLFLDF 
CIT**T*NWFISSKAVCLFLFLPLQPAS 
QKTKKRCRKSFF 


81S 


14716 


A 


822 


416 


2 


ITKLTEEKKIWSGKRKWDYVKYAGASQN 
YTAWLRPQSMITLKDSSERKAPHWGEL 
NW\HVHHFIWKBR*PEV* lYIES *AID 
GGLASWSGPEKNKIGKLGAiaCV*RRDMK 
TDPLECTQWINIFGFVCLFGDRVSLCW 


816 


14717 


A 


823 


409 


45 


PPPPFLFFFF*KKK!aiPGGGSQGPKF*P 
PPPPKGKPPFFKKKKKKPGGGGGP PNPP 

ppkgggqk/qffppqggnfnkpkspppq 
pprgqkkkpppqkkkkkkkkkkrlnvtg 

PQFLQLESGTY 


817 


14718 


A 


824 


1 


408 


IPSBHNGIKL* INTKENPRNCTNIWELS 
NMLLH1IH\*WKEN*NKKFIKTNE/NKN 
ITCQNL*DTTKWLREIYSKKCLLQKKK 
KKKKKKKKKKGGGALKBrYLGGGQFYGGE 
GNFIFFFWGGJUCKPCGWIFRRRPFFW 


818 


14719 


A 


825 


259 


274 


YL* YLLLYRRYSA* I EL/LGYYLKKRKL 
LYQRDTYTPMFIAALFT1AKMWSRP*SP 
W 


819 


14720 


A 


826 


422 


105 


FPFSKIPWGNFFPPPPKRIFFPPFPPKl 
FFFPPPPFFFWGGFSPFSPPPKKVFFPK 
SPPVFFFPPP*KKKFFFP/HPPLIFPPP 
GFFFKPPPPFFFFFFFFFFFFFFW 


820 


14721 


A 


827 


15 


402 


I KSljGKNIGINIiGDLVLNNGF*YTLPKa 
QATKEKIDN/WDLAKAKNI CA*KDTIKE 
VERQPKK*QQTFVNHLFDKGLVSKHV/N 
EL*LINKKTNH\HL* *WAKD*NRACSKE 
DVHMTDKLMKDVPSLVIRE I 


821 


14722 


A 


828 


420 


3 


KDAHTCPPGCKGGFSGHRALKAI CRNQG 
FLLTTTE FLTHKCPLPGQE PWGQHQG * R 
DSPRRVSPAPGTWQPPCHRERACLAPSA 
VE/GPS* IQEQEKSLLFFFFSLLRWSL/ 
NSVAQVGVQWRSLGSLQPPPPRFKQFSC 


822 


14723 


A 


829 


1 


269 


IKIKNLARHGDSCL*SPLLLEMLRLEEN 



572 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
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od 
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Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 
cling to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleuclne, 
K=Lysine, L=Leuclne, M=iVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LS PGSRGCSEL / CQPARQGETLSQTNKQ 
TNKKPHQVQLPCFEIINPKEFITCLVLR 
LPGNSW 


823 


14724 


A 


830 


432 


1 


GyNNQQIFDVDQTAFY*KKIPSRTSIAR 
EEKSMLGFKASKDRLILLLGAKVAGDFK 

lkpmlvdpsii:tphelmn*arctlpvl/v 
kww/kawmtaylftv*fiqyfkptvdty 
c/fkili:,lidnapglpravmdr*eesnv 
fmpvnttl 


824 


14725 


A 


831 


1 


233 


KKLQ IKYLG INLMKYTIHWSI CQKLLM 
KEINEDLNKl'IRDVLCLWI*RLNNMSILH 
KLIN/RINLLQLRI 


825 


14726 


A 


832 


3 


477 


yqtckedltqmllkifqkikgegiipns 
lydasiammpksdkdrtkkf\nyrpisl 
msidakilnkilpnqiqqhiqkfifipe 
mkg*fnirk*inl1hlitkmktktpmii 
sigtekvfdkiky/pfpittltklgidg 
rsldaimvars 


826 


14727 


A 




3 


402 


ryqtpllv\*tvlittallgl,slpvltg 
gitipltdreldttffdpagggdpilyh 
hlf*ffgrpeahililpgwgiishigty 
ysgkkepfgytgmr*aiisiaflrviv* 
ahhiytv* idrdtraylpslhn 


827 


14728 






7 


395 


dpqrvscmalssnsfffffwsrkslffp 
gregrgqiwvngtpplqgkrbtspasppg 
gggitkkapklhppkkgwggglekpp*n 

QRGGFGI *SPGGKKGFAGG* KQGQKS KG 

ggkkkgra\dp*ayiplk 


828 


14729 


A 


835 


41 


444 


dprvrkkerenlyrsvsikeiesvi*nf 
ptkki*gldgitsefiqii,pkkkpk\ik 

kertfsnpfdktni TLI/SKPETTFTKK 
EN/VRPVSLMNI /DCKIIiLKVLANCTL\ 
HI *REIHHDS/INFIPAIQlSFNIQKTl!r 


829 


14730 


A 


836 


3 


415 


hayyivkpsp/wipkgalsallktyglt 

m*mqfqsikdlrlglltntltiyq*wrd 

varkstyqgrhtppvqkgl*ygiilfit 
sevfffagif*afyqssl.aptpqlgghw 
pptgntplnpledpilntsgllgigvs 




14731 




837 


378 


0 


TPPKGPGGKIFLKK/SPGRKIF*PPGNG 
/PFFSPLSPLKFFFFPKaFNFWGGGGPQ 
GPPPKKGVFSQNPPPGKKRPPQKEKKNF 
FP/PPGKMGPPQGFF*RfiPPP 


831 


14732 




838 


383 


38 


GPGEKI FLKKPREKKLSPPGEKAKFFPP 
SPLKBTFFFPQGVFFLGGGGPKRPPPKKK 
GFFQKTPRGF*IAPQKKKKKI!irFPPRGKF 
gppq/ri F* KGPPPFFFFFFFFFFFFFF 
WSL 


832 


14733 


A 


839 


59 


418 


TKISSKDQKQY\TTPGW*MAEISTTFED 
AEIIDJUiIFPFNLPIMLHKKPDCSWKVS 

VYCKLK*WSLISATGLDI INT\LLEEM 
KHVSGTWYIATDLAKTFFYI / pdqkefa 
F3WYG*KDIF 


833 


14734 


A 


840 


5 


335 


irhlf*inkfenldkmdtt*kltyegie 

SLN/RII *NF/PWKSLGSGSFTSEFYS 
LFKEE ITAIYKP IWRVEKGGl \LPTCFA 
TRITLIPKPETCIMRKKNCGPVLFMNGH 
F 


"834 


14735 


A 


841 


1 


38 


RRLKLEDHLSLY*KINSKWIRELNIR/P 
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peptide 


Predict- 
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last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
B>=G!utaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ETIKLVKKDKG1JV\QDTDPGKDFMVKTT 
KTKI DKSDYI KLQS FYTANETVNRVKLQ 
PIE*GKIFKMYSSGKGLLSMIYNELKQQ 
HRNWML I * KLEDHLS L Y 


835 


14736 


A 


842 


2 


424 


GRVGSIPLSQSLIQSKALTLFSFMKAET 
GQEAAEKKVEflKRGWFTRFKERSGLCNl 
KVQ/GAGGDTAASSPADPAKINDEGGHS 
/KQQ IFSGNETAFY*KKMSSGTSIAREE 
KSMPIPKaSKDRVTLLIC3ftNAAGNP\KL 
KPMP 


836 


14737 


A 


843 


281 


3 


fffffffffffffktflffgywfs iffp 
llfs isllls fsgs fvtfafqpff * vsr 
yhyqhfnstf*fl\kl*fif*ifflisi 

LFLFHGYNF 


837 


14738 


A 


844 


409 




RCGGTCPWSPVLRRLR*KDHLSLRG*GC 
SEP*LHHCrPAWATE\DSVSKKNKKKGl 
SCRQHIVGSFFFHLKMOiIJIGLLSFFTL 
NVI I YWEFKFTILLFLLYS ICSLSHLF 
SFPAKSWINSKFESLFVCFNSFS 


838 


14739 


A 


845 


63 


451 


knqeqbseetlpnlfykasitlipkldt 
qk*kkrkekkrkkkk\en*kpithrwiy 
akilnkilahqiqqyigkiihhdqvgci 
p/gafdkiqyrcmir/tlqkmgiegthl 
ni i km yirptdsi lenrekp 


839 


14740 


A 


846 


60 


460 


RNNEPFLDQIVTYFEKWILYDNR* * / PT 
QWLDGEEGPKHLPKPNLH/QKKWVTVW 
WSASLTHYSFIiNPXETITSHNySQEIDE 
MH* ECLQSLQLRSVNRKGQIPLHDNPRLQ 
VAQPVLQKINELGFEVLPHPPYSPR 


840 


14741 


A 


847 ■ 


344 


2 


MFLKFHPGGETFLQKGYTGY FSPFTP * K 
FFFSLKPIiNFLGRVGPFFPPPKKGFFSK 
IPQGGLISPPLRGKVFPPPPRLNLGPPR 
VFLKAP/HPFFFFGI\PHGLQPRPMI *K 


841 


14742 


A 


848 


439 


132 


PJJVAAPPPSKNIFFPPGSYNCGGVWPQK 
SPPPK*GFFPKP\SGVYKNPPQIEKVFF 
FPHARIVPPPGDPIKTPPPIFFFCFNSI 
NFSLRGAQLYVWDFLGSGC 


842 


14743 


A 


849 


94 


470 


LNHFLSFFLSFFLSFFLSSGSLSLSLSF 
FFFWKGSPPPP*GTK/QKPCLKKKKKKK 
GGGGGQNFFTPPGGKKNPPPQGGGGGGS 
PAQKKKP /EGGGPPGSKKKNPGGGFKKK 
KPPKKPRGGGGKPFFP 


843 


14744 


A 


850 


2 


396 


FFKKENIQMANKHVKTYTTLLVIREMKI 
PTSMRYHFTSIRVAKILKTDNTRC*QEG 
RTIRILMLC*LQNKMVLLL/WKNWQFLT 
KIKIFLF\FDPTISLLGIYPR3MKTYVH 
S SF IHGNS KHKRTGWNLN VHM 


844 


14745 


A 


851 


369 


36 


skgasdilekpplf*evglptpspplge 
kmp/fnykggpgkgisllifpkkkfipp 
gghfpfpfp/pfffmdrvllchpgwsav 
avsqltttsasrl\k*fsclsyseg*dg 

RIS 


845 


14746 


A 


852 


1 


367 


ppppkikspsgpgppskrgspgapprgg 
kfwflsqkgwggp*fppppgvkpe/nip 
*pprgrvpl/hlnlapapppgppk/pkp 

PSP 


846 


14747 


A 


853 


1 


378 


AMLATLI SNS * PQ / CD P PALAPQ S AG I T 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 

amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

residue of 

scqucn ce 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GMSPCARPTSCSFYP* ILLTVRTFIKIR 
NFFFFFDTGSIIFVAQPGGEGHDHG***P 
QPPGPMKSFNLSLPSRWDYGGAPPCPVI 
LCI FCKDEVSSCGSG 


847 


14748 


A 


854 


369 


1 


SACFGLPKCWDYR/R*ATTPSLEIIIYI 
ER* EKTTS PDNAHRFDVTKLHH YDCSAM 
DF*KAL*N6DDFPVLMSKMHMQSrjNII\ 
FVFCFLRQGLALSQ/AGVQWRDHGLPQP 
QLPRLQPSSHLSL 


848 


14749 


A 


855 


380 


335 


npggrgcselrlchctpawat\Setlsq 
tkiqpkkqev*ll 


849 


14750 


A 


856 


373 


1 


LRIKILNKKGLANLIP**R*KIIHSQVE 
FIPEMQA*FIIRK/TPSIKWPIVRTKKK 
KKSQMLISIDTEKIFDKIQHLFLVKPVI 
KIiGIEGTFINFIKSIYXKKSTTATIISG 
LNASPLRLKTEQDF 


850 


14751 


A 


857 


369 


-T4 


KVQNLYFEKKLtNKIKVDLNKW/NSLCT 
RFRKLNIAiniMFKLIHHYN/SKVKIPI 
KIPANCFGEIDKLILKFI*KFKRPQIAK 
IILKKNKDGYILPDFKTYYIYMQYHLC 


851 


14752 


A 


858 


I 


348 


QWHRPAS /IjAEPPPPRFK* FS CLSLPNS 
WDYRHAPPRLflNFVLFCFVG*GGIHri 


852 


14753 


A 


859 


1 


366 


CrSPTENQKLEMIKL/EEGMLKAETG*K 
LCI.LCQKV/QWNAKEMFLKEI*SASLV 
NTTMI \RKQSFTADMEKV* W* IEYQTS 
HNIPLSQSLXQSKALTLYSSMKAERQEE 
AAKEK/LEASE 


853 


14754 


A 


860 


12 


366 


PSTLGIJIRASCLSLLSEWD*EHMPPHPA 
N*KNFFCRDG/SLTMLLKLVL\NSWPQA 
IHSPWPPQ 


854 


14755 


A 


861 


424 


78 


NPRPLEGQTASFSIiAPNF*TTLQKKVNP 
FF/SLKNPPPPPPPPKIWAPQGPFGPPS 
PGGLNGGFFLPPRVKAPIIHKGAPPPQP 
PPQKKKGPRVFPKKKKKKERKKEKGMTR 
AFCYP 


835 


14756 




862 


327 


3 


SHWFFAAV\GREISM*AMAPDQTKKICP 
RSAEDAIKYFLTQATGSIILIHAILFNN 
RLSEQ*SITNTTNQYSSLIIIMAIAIKV 
GMAPFHF*VPEVTQGSPLTSGLLVL 


836 


14757 


A 


863 


1 


341 


YDRNKl'JKDIPGS* lERLNIVKLSMLPTV 

lYKFHVILIKIPM'i'FFAE'rEKSIVKFSR 

dye*pkqs*keqnwkthtp*fq/qffta 

T VI KMVW Y* NKDRY 1 Y/DQWNR I * S LG I 
NPCI 


857 


14758 


A 


864 


32 


324 


lvwpffffffff* kkkffffppf*rkgk 
nfffftppppglnlff/cphplkkvjekr 
apppppn*fffflkktgfffftqkglnf 
ppletppltppkgln 


858 


14759 


A 


865 


350 


3 


RVKKPRPFWGF*MVLKPLSFFSKKTN*I 

lfplkifsppktvpwgkiflgal*npff 
cfkhplwffgf*kl/sfffpplyff*kp 
laplkrrfsffffffl*dgvslcrpgws 
avarsr 


859 j 14760 


A 


866 


342 


118 


gs vtqaevqwcdhgslqpetpgi k* s /h 
pptssyqwgttgvchpglamlsrlvls 
swpqailspwpptvqglqv 


860 1 14761 


A 


867 


1 


354 


vkpsp*pltgalsallmtfgltm* ihfh 

SITLLILGLLS]SITLTIY\Q*W\RDVTPE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

correspon 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lso leucine, 
K=Lysine, li=Leucine, M=Methionine, 
N=Asparagine, P=Prorme, 
Q=Glutamine, R=Arginine, S«=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














STYHGHHTPPVQKGIJRYGIII.FITSEGV 
FWAGSV* TFYRSSLSPTPQLGGHWPPTG 

ITPLN 


861 


14762 


A 


868 


3 


344 


QIKNPDNISC/WIROSATGMFLHCWWEC 
KLLQPLWKTT*HQ*SRR*AVPLSGMFP/ 
NRYSSTCTPASI*KTFLSALFMMTPL/C 
LSRVEWIK*SWCNQKL*y*SVMKRSEEF 
SLKNA 


862 


14763 


A 


869 


345 


1 


KQAGLKNVKITG/EGASVNQSR*VS/YL 
GTIKKITEE KGYLPERVFNAGASAVF * G 
KKLPQRTFISK/EEKQAP/GSEVGKDRL 
TLLFCANMSRFMISTALLYKAADLQSLK 
GKDKHRL 


863 


14764 


A 


870 


340 


45 


FCSFCRDRVSLCCLGWSRAPGLK*SSCL 
SLPKC*DHRC/AA/VPAWFQRCILEKKN 
QIY*CAERIL*SERPHTRHLDSSTVNIL 
PCPLSKVIFWKKKNQIY 


864 


14763 


A 


871 


209 


72 


KLWDIHTMEYYYSAIKRNKPLKYTTWMD 
LKDIMLCEKANLKRSHTV 


865 


14766 


A 


872 


3 


353 


SFFLGPPPFFFFFFPPPQPLPI FFWRFF 
FCFSPPKKPPP/PFFL*GGSPPPPF*RG 
FFFF*KKKGGVFPPPFFFFFSPKVSPPP 
PKKGGGG/PPPPFFF 


866 


14767 


A 


873 


3 


267 


DFTMLARLVSnS/WPGWAHTCDPSTLG 
G*GGWITRSGVRDQPGQHGEI 


867 


14768 


A 


874 


1 


346 


PRRPFFFFFFFLIVFIYFKCVYLF*DKV 
LVCHPGWSAVPHHGSLQP*PLRrKQSSH 
LSLSSS/WDHRH&WLVFVSFYRDEVSPC 
IANFCIFL*RQDFSMLPRQ/VLKAICLP 
WPPSL 


868 


14769 


A 


875 




172 


ICLLSSGSPPASASQSAGITGVSHRARPG 
LLNFFC*CAFSVPGPCLGYPDTFTH/LC 
PPSFHQSVIASLDFSCLS*L*MSP 


869 


14770 


A 


876 


1 


348 


LLFCFNNYSFFHGVKPKIPAFFVFGPGC 

LPHFFPPITPAPFFPLDRVLLFPPGWRE 

MGP?*APPTFSPQG*GVFPPPPPQ*LGP 
-■iGPPPQPGVFFFFCIFGRDSA/IAILPR 

L 


870 


14771 


A 


877 


i 


203 


GFHYAGQASLEVLTS*STCLSI.PKSWDY 
RRGPP/* PGLSYFLYPSLRS * FI CAMS I 
HIPFIKKKKASD 


871 


14772 


A 


878 


343 


102 


EWEDCPSPGGRGCSEPRSEffiCTPAWVRE 
TIiSPKKYLFLQP*KLKirrKIYLSISIP* 
KLET\ ILKimFEMMNPIflSYHFFF 


872 


14773 


A 


879 


2 


359 


RDITKGDMQMETICHM*SFSTSLTSGI \H 
NSP IRPS KMKNIDSTRYWRGCRAIRAL I 
HCWWRC*MVQPFGE* ISSFSKKLNMYLS 
YDERPTFRYEK*K/P* VHPKI CV*MFMA 
AFFLISPNW 


873 


14774 


A 


880 


205 


1 


FFRGVTEGL*EPPYVESVI /AGGTTARR 
PLFFFFFFFFFFLRWSFALVAQAGVQWH 
DLGSPRPPPPGPK 


874 


14775 


A 


881 


150 


2 


CRARVDGVPWRNPGSLKPPSP \aSSDPP 
TSASQECGI TGAHHHTRLT*VF 


875 


14776 


A 


882 


345 


1 


KGNQPWKTEKRALFC/TLKKKKIFKNSP 
PRGNF*KFFVKKKIPLKKEKGFSPTWGK 
KKEFFFKKKKKKKKK**AKDLNENFSRE 
DVQMAKKHMKRCSTSLirREM* IKTTIR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

sequence 


Predict- 

otide 
location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutainic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, lj=Leucine, lyi=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyroslne, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YHF 


876 


14777 


A 


883 




355 


DRLLFSASHLDLGTLYLLFGA* SGVLGT 
SF\SLLIRAELGQ\SGCLL*NDHIY1JGI 
ETAmFVMDLFIVRPI IIGGFGN*LGFP 
NKKGADMAFPSINNISF*LLPPSLLLLL 
ASAIVEA 


877 


14778 


A 


884 


262 


2 


PTCEQSEREIKKTIQFTTESK*IKFVGI 
NIAQEAKDLYTETIKYKTLLKEIKDTNK 
*KN1 IY^/\SWTRRFNNFKMLGWAQMLMP 
VI PA 


878 


14779 


A 


885 


16 


318 


ILT^AnrAnLFFFWFLGGKKGFCFPRLKG 
RGETLLN*TLIFR\VKGNF*PKLPKIW/ 
DFKGWPYHL/AENFVFFLKKGGLTLLPG 
LVLNS* IKEFFHLWPPKGVA 


879 


14780 


A 


886 


298 


322 


KRRTP*ypPGEPHFPPPPPQG*KARGAP 
PPPPKKGFF\ PEKKKKPGVMVQGTTQKP 
PKKTKVRVD 


880 


14781 


A 


887 


353 


3 


FPFWGFLJCTLFGKALWLFKGFFFFAHQK 
FFFLPSLPPPP\LGFFLRGLT* FKIFSP 
LF*KGAPQKGISHPLFFFLISPKPPPPF 
FPFFFFPPPPFFFFFPFFFFS\RDRVLL 
YCPGWGA 


SSI 


14782 


A 


888 


22 


341 


I PCT CLKLHGKVHDHDQS QLWW PKPQEE 
MRTLENWLAVLQS I LQWLD I ELP YDKAX 
ICAREIKTYV\QQNCT*TFMLALCITAK 
KW/KQLKCPSTDEWVSRMWYTCTR 


882 


147S3 


^A 


"889 


51 


338 


ERSQLQWLMP I I PSLFFFFETQFLFCCP 
GGRAWAIFKI FEFLA\PK*KQFSCLTLQ 
TI *VYGLNPPPHKNFVFLKERGLFHVGQ 
SGRDLPPSGDPP 


883 


14784 


A 


890 


1 


225 


GRLRPENHLNPGGRGCGELRLCYCTPAW 
VTERDTIS/RHTHT*NYFY*6Sr*QLAN 
S CCCnSTMIjNKFNVI CILRWC 


S84 


14785 


A 


891 


357 


164 


GKGCSDSRLCHCTPAHAT \TKTLSQKKL 
ILKKQNKFRS *LDD* INMRCTVWPCFHV 
FLFIRftAPLFSDWLYNK*MNRNT 


885 


14786 • 


A 


892 


207 


302 


EPFfiGlItNESlHOEeilVLNVYTSSNR 
PSKYMKQTLIELKGEKVKSTIIVGD/FY 
THLLVIDRTSR*KR 


886 


14787 


A 


893 


86 


332 


VMRVSCCCLKD* ISLCHPTWSAWQS*L 
TVASNSW\VKQSSFLGLPALWEAELGGS 
LEVRSLEPVWAT*TP\FCKNKKL 


887 


14788 


A 


894 


2 


336 


FFFWPPPQTFLKNPGGPGPKREEGPGGS 
PRGQKQGPPGLHLTGFGGQPGG*KHPEP 
GPGGEAPKGETRGEGGGPTRPQIP/QLI 
NGKSGKPPKVTLNGAWGTIKIFLIKTPV 

G 


888 


14789 


A 


895 


1 


214 


ARESLASFLPPSLPPFLLGLRFLPSFLP 
FYFSFLPSFLGFGSFLPST* VPS FLPS F 
LLFFPSFLPWLWFLPS/FLPSFLPSFLP 
SW 


889 


14790 




896 


131 


352 


tlhesdsesvprdfkisdalaveddqrs 
pgtlnaaels/ssvrerkkkek/kpepg 
l*dqs ikesdsymvsggriq 


890 


14791 


A 


897 


3 


184 


CSVAPAGVQCHDLTSLQSPPP\SSSNPP 
TSSNPPTSAF*VAGTTGMCHHAWLIFVF 
LVDAA 


891 


14792 


A 


898 


327 


264 


nravslmnldakvlxklisasqiyvyik 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first^ 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaianine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=VaHne, 
W=Tryptophan, Y=Tyrosine, 
X=LInknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














KMIH/HWQIRFTMRM*GQFII*TAINIY 
HKLIK*GRKHMIPLIYAVKIFEKI*YHN 
IMIKTLHKL*IKEPPGSFIYSL 


892 


14793 


A 


899 


3 


313 


TKAASHSQ\RA1ILQFLVGRIH*HLKSRT 
ISPGRVGATAAVNSTAILEYLTAEVIjEL 
AKKASKDFTVKHTTPHYLQLAIRGDKEL 
GSLFKSTIAGKGVIPHIHKS 


893 


14794 


A 


900 


340 


1 


HLCPHKEHLY/ LLQTSCPLAI I * FISTI 
AEHTKAP FDMAKVESELVSG FN I E YPAG 
PFALFFIAEYPNIIIINTLTVTIFLGPT 
YVATiSPELNTTYFVPRKIWVRVFIIIM 
FV 


894 


14795 


A 


901 


1 


252 


LTFFPQHLLGLSGMPRRYSDYPDAYAT* 
NIL3SVG/SSFSLPAVILIIFMI*EAFA 
SKRKVLIVEDPSINLE*LYGCPPSFPSF 


895 


14796 


A 


902 


326 


1 


LGPPPPLKEPPGVFPIFPF\*FWNFGPV 
GPF*iCIIjIGPl,Ii,DFPQFFFGGVSQIVF 
PPF*GKIPFF*KNTPPFF*GIGGSIPHP 
PLFFFFFFL*nRW,T,rHT,GWflaVA 


896 


14797 


A 


903 


1 


352 


KG - FPWS C3 * KGTQKKSGr.SQGGQRNV 
P?CGIHFVKEFLGFPGAOLLIS*GGRLI 
F*KGPKKGFLP*TVFG/RRAQFEAPR 


897 


14798 


A 


904 


325 


2 


^Lh^KU 'II^T M<J\T /Pf.NLPVE 
D IQ KH PDQTAVQ CDACLKCR K* P YGMDQ 
HLEK\WYWCKNHDSQFRYCMVPKDPEDM 
DLVHPNYGKPYKKPSKETYWFSQMP 


898 


14799 


A 


905 


I 


329 


IGLAIHGAERIVRGQTAKSLAVHN\VCE 
QKKIKKKKQGPGREFQNLKWENPKNPGG 
SF*RGLGPQSNFFYLKQ/RLGFFF*KKP 
PKI * ILAMEPPNKKIF*KI!IKI!IKAHFFF 


899 


14800 


A 


906 


148 


2 


DVDKLFLLRSLPTL*RPEYGSYMIEGT\ 
TGQ PYGGTMS E FBTTLEJiNMR IR 


900 


14801 


A 


907 


384 


1 


ESKRSIFGPPNPPPPGFKGFSRHAPLWL 
RSREPPHKAGVKKGFPRFRPGS*MPIP* 
SFLCJNCIPQRVGFPSLPPPGAPFLKISL 
FLRQESRSVTQAGV* \YGLLQCRPPRLK 
SFSCLSLPNSWDYNR 


901 


14802 




908 


3 


297 


TKIKSL* INHLIRAKTVKLLEENMGINL 
HDLSSGRQQPFRYDTFQ/SMINTAIKEK 
IMKLSFVKMGSFISQRTPLRKERQLTK* 
EKIFADVYLIIGRRG 


902 


14803 


A 


909 


169 


373 


ASNILSATDISNTFGPPGSQGFSGREAY 
VEAGTYYTNPS aLiGQVKVF*YWMQLIVS 
IVLL*LL*VTQKL*L/GPRQEKFV*YVP 
AST*ASLPLKPCDPGGPKVFEMSVADS / 
DV*SLWLALIGESKFRCLVF*SKFLPSS 
AYTYSPLEKfCLFILLGCFVMICFCFCFL 
RRVL 


903 


14804 


A 


910 


132 


368 


GRI FLFVGQEKGARVSFLFLF*DRVSLC 
HPGWSAVAQSQLTTS/TWTQNNSSHLTP 
P * VAENHRCA/PHTPNVLFLC 


904 


14805 


A 


911 


2 


339 


NS WAQE * AG I TGS CHHTQLTLEF * AHDR 
QMMFFLFFLKTNFTFCPPA*RPWPGLGL 
TE\PLLPGLKHFF/CLTLPSNWDKGHLP 
P/HPYQI/CGFLRKNGASLISG 


905 


14806 


A 


912 


3 


381 


LNFCDTHPLTPRPVSIQRQCLPLVEAGI 
RWRALISPQLHPPT/YSLLSSWDYKHAA 
PHPANFFFFFFKKGSLTMFPRPGGS * I F 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predlct- 

location 
correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienj'IaIanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=^po$sible 
nucleotide insertion 














lYLLIYLERESY/AVAQAGGQGPKFNSG 
QPLSPRFKQFSYLSLLR 


906 


14807 


A 


913 


394 


1 


MSKGKIG*KIjG*KFAQKVSQFVNAKEKL 
LKEIKHFTLMNT*MI*KENS/LIADLEK 
ILWWIjE3!IKTGYSIP*TQNRIQSKALIL 
FYSMMTERGEEA\KKFEARRDYFMRFKE 
RSRLQISIMKV*DEARNADGEAA 


907 


14808 


A 


914 


326 


200 


HNWLSIWK3CVKSNCFLMPHTKINSRWIK 
DIK1/R/LHRIKILQENVGNHIYKI*RQ 
KRS * PRHNFLHKHCKCLVWHKTNK*1SIH * 
KIYL*CI*KTFAVFM 


908 


14809 


A 


915 


3 


334 


LWRVKGPKFKFWLHHQLGLYC *VIS FF 
INKMKEYVLIHRIK*DHK1SI1 F*MGYKKL 
*NV1JXW*LVLFGFSSKNHL/WSGAVAHT 
FNPSTLGDRGRR ITQGQE FQ I SLTNMVK 


909 


14810 


A 


916 


14 


299 


YQKLPQKKSPEPlirAFTDKFYKIFK*LTA 
ILQ*LFQKI/E/KETPLPK*SNESSNAL 
I AKPDKDIT/SKNYKPISVSNIHAKILN 
KI *QYQVQQKQLYS K 


910 


14811 


A 


917 


3 


339 


SLQPQSCRLKCSSCLSLSSCWDYKHKPP 
CLA/NFF/VFF*EKGICTVKH*NGDYPL 
GRRVTKRDHERGF*NAGKVLLVGW/CGF 
ETQSRFVPQAGGQGRNLGSLQALFPGLM 
PFSC 


911 


14812 


A 


918 


2 


321 


giisgiykelsqpkmtdssikkv,kd]:.nr 
nftkeyvqmackhmkrcsaslvireikX 
ikstmr*h* \tptrmtk1kd\dkntkcw 
*gygavgmlihcqw\ntkmvqplwkisr 


912 


14813 


A 


919 


51 


326 


FFVFCLFSLVFISLNCGLF/PTFFRIPS 
L/PYLQLI * IYLFFY*LKPI FLKFFRDR 
VSLCHPGWSTVA* S *LIEASKYW\VQ * S 
SHLSLLSS*DY 


913 


14814 


A 


920 


239 


2 


DSLILSAVQ*HDLGSLQPPHPGLKQ/ FL 
PSNWHYRCMSPCLAYFLLFLVETPFCHV 

TQASLELLGLSNLPASASQSAG I 


914 


14815 


A 


921 


1 


242 


PRPRRRLLKYFILFNFLRQ/RSRSATQA 
EVQ *HDHS SL * PQTPRLK\HPPASET 


913 


14816 


A 


922 


249 




ALFCALKKTLCGTLMF lAALF I IAE\ SR 
HNQTQPTCLSTDEEINKMWHRHAMGNYS 
AIKRNEVLIYAKI*MYLENIMLSEII 


916 


14817 


A 


923 


404 




SRPQKKLGELKCS *RPKGLNKKNQRKPG 
TFFFFETKFHSFCQAGVQWVYGC*SHPY 

llhiifp**vf'-me'lilitv/psrnkdl 
ignse:<qihn/wsslyfyf*lfflrmsl 

TLVPQSGVRWRNLGSjQPPLPRFK 


917 


14818 


A 


924 


6 


358 


ffcsllghggetpnnsplphpwl*etks 
ppkpphplkkkkhslffkifilkkkkng 
ppffppagpnprg*nppppq/rppkgpp 
ppkekinpppppqgknfpkgkrpplgpf 

PKKGGGY 


918 


14819 


A 


925 


268 


2 


hmyaf vkidltahlrpsagkke i pagkl 
yf/l*rkkpknqignqkkkkclphagrm 
vahawnpsnlggqgagtt*sqaletsmg 

NMGKP 


919 


14820 


A 


926 


74 


338 


IAGITGVSH*ESKNSY*KQFL/WPGTVA 
HS CNS SNLGGQGE W I T * GQEFET SLAMM 
VKP 


920 


14821 


A 


927 


339 




LRLFACPPLPKWWDYRGEPPRQ I FFSWH 
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C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
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N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 














FFKKQGFICPIEKGRELSNCCFQKEVLI 
*KKVILKI*PFFFFFLMESCSVA*AGVQ 
WHDLQPLPP\NSWDYRCPPSHPAKFCIF 


921 


14822 


A 


928 


3 


416 


RCWMENKIN/ PFWK/S /VWPCVTKVNIH 
S P YDLTISLQYLPYRDE YLCSYKWLMFL 
IVLFIITKNYKQPKCASVG*WLRILWYL 
/YMENYSAIKMNK 


922 


14S23 


A 


929 


344 


69 


YKRSPLL\KSA*DGPANJfKSLPRS *YKR 
SPLLIKAKPDQI ISGWEASPLl I \PRAL 
LQKLKPAASKF*W/NPKKKRALIS /KAI 
PSKKITKftEGIMFPNPKV*YEDTATKTA* 
WWYKNSSSL 


923 




A 








HGKKIDPRFIPYTKI/NLKWFIDLNVKP 
KTIKFIjE*NIGE/TIFVIDHKK*NP*KK 
KLVN/WDFIKIKSFCFVH*KPAVGK*KE 
/QP*TRRNYVQTHTSDRGLISRICKELF 
♦LNSKVINSIRKWAQSLN 


924 


14825 


A 


931 


479 


83 


SRKGLGRGLECRGGFPPPPPPKGLPGGVA 
PPGFEGFFeSL*XG/DRG*PPIiKGVKGE 
KGKRI*EGG/EKGGGFLPPPPPQKGFSK 
NFLGFyAGKKENP*GGSPPPPLFGPAQK 
ISLGKGENKVPPPPGNLFYLFIF 


925 


14826 


A 


932 


36 


440 


ttyasldeaqlprakfnaflttf*hi ik 
ngpilgkigrkymiadqNghrmknhhck 
ltqvlnthyvgpkrilltgtplqnklpe 
lwjillnfllptifkscstfeqwpnappa 
mtgervhlyeeetiliirrvhkvl 


926 


14827 


A 


933 


8 


398 


CLQGKEETLTTKWHLRYFLPFFFFFWPK 
KGARKNGGRPPGGSPKNQRAPCPKPFEN 
PGEKTPQKRGFFPPKPCPGPGPGPTGGQ 
GKPP/HPQRPQILPPVPQAGPQGRKGG* 
WNPPPPG*KGFPAPTPPRM 


927 


14828 


A 


934 


1 


441 


THSHPRALKEVYTE INIVFMPANATS IL 
QPMD»GVII.TLKa.yYLRSTFCKAIflAVP 
SDS/SSDGSGQSQLKTFLKGFLILDAI* 
N IGDSMEEVKI STLTEMWKICLNP IFMND 
FEGPKTSVEEVTADWKITEEVQVESED 
GTEFLQSH 


928 


14829 


A 


935 


3 


383 


PGFKASKDRLTFSSGNNAAGNFKLKPVL 
lYHSENPHVLKNYAKSILSVLYKWNNKA 
WMTAHLFTAWFTE\PYVKTyCS\EJ<NIF 
FKIIiLLIDSVPSHPRALREI*KQMTIVY 
MP/STTTSILQPMNQG 


929 


14830 


A 


936 


417 




CFFSRDEVSPCGPGFSPSPD/HHDPPPP 
PSQCWDYRRDPLWVPPHICFLIHKKKRS 
SHNGSSMYDP*KPPHKWMKSPPPVSVLY 
GSIPVQVQIAPPPETNPVYFFSPPPPFW 
GGGRVCVCVCVSSEVCDFFGGDENVIV 


930 


14831 


A 


937 


412 


3 


KPTRVKKINPPFFEOITKNKGGRVGGSLY 
S PF FGGLGQ KHGFT PEAKGSINLKFPTA 
PKSWGTQQKSLFKKKKKKKSILKFIMNQ 
KINPKQIKQS*GITLPE/FKLYYKEATV 
TKTAWYRYKNRPIDQWNGTDRNKATH 


931 


14832 


A 


938 




416 


KNEHIDQCSGIERPEIDSHKYSQLIFDK 
GAKVI * * RNDNLFNKW YFWNWMSTC\RK 
INLDTDLTYLFQNGSQT*P*NIKLLENS 
MGGNLGDL/GMSQ*VSSSSSSRRIHDKK 
LVS/WDFIKIENICSEKGIIiCHMKSQTT 
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X=Unknown, ^=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














D 


932 


14833 


A 


939 


^ 


443 


DFDYKHDHDyAELGTRPGSVGQQSPDPQ 
FTPSRMGREGEGTHSIL*CSSLGMGVIA 
DLSTDPTELEKRALEVAGPDGQASAISP 
ASPRRKAGDGCffiRRALPGCTSLTGETTG 
KSGEAGQDGKPPGD/GPIGPYSLPGSGP 
GSGESMMG 


933 


14834 


A 


940 


3 


404 


LFMFLEFFF *KPRS C* VSQAGMQHC ^LK 
ALQAPPPRFTP/SLLSS*DYWSLPPPLA 
NFLYF*ETRGFTGr,TRMISlPQPTEMPG 
IjflSQTAKLIPF*K/HRVLVES/HG*SAE 
AVHRGDLHILEP*TPGLK*yS\CISL 


934 


14835 


A 


941 


397 


] 


FVPWSQVftHaiCKKFLKEflESTTPVLSGI 
RKQm"FARD)«E*V*MVWIKDQPSYSIPL 
SQSVIQ/SRflLTLFDCTKaERNRRGKLQ 
WEPAV\EGSKGWLMRFKERSHLHNIKVQ 
DEAVSYPEDLDKMDAVNTKHQI 


935 


14836 


A 


942 


441 


29 


SVTLCKHTVHVPTFLRRGKRCPLFGQCQ 
PWPSPG/PHPLSMSPV*PRPQ\PIPLHS 
APPIALGPPIPMLSMAGLPFPFQGSGLR 
NWKPPFPQPSLLQ* *HFPKPLKPIRGLR 
PFPPSTLSIPLPGTRPRAKLGVMRSRL 


936 


14837 


A 


943 


2 


397 


ARDAAPEPGEHLLQGLSAHHGLRPPRDS 
RPGPD\PP\SPPHPLPLPAVPTFVFSFG 
DSWLL*SPFFPAPWASEGGACAPNPHaL 
VPS PSGQASGLRGGAPSWKAGLDTDGQQ 
AGRQSPGPAPPSPPLPPSQDC 


937 


14838 


A 


944 


3 


449 


LRQVWHEGEMPNKTTLIYHYTPI* ITNI 
KNTDNTKSLMGCAEARSFTHC/WMRIKM 
VKPLWETVQ*YLI**KLQLPFNPAVALW 
\SICPRKVKTY/CHRKNKTRS*MFIV/A 
LFVTAQTKN 


938 


14S39 


A 


945 


380 


175 


RAPAVPATQEAEVGGL/H*TWEDRLNPG 
GEV/CSEPKWCHCPPAWATKPNCVSKKK 
KKKEKRINHQHDEK 


939 


14840 


A 


946 


21 


331 


VAPLKYGPSKETl / IQS VRQS TECKKI F 
AYYPSDTGLITRlYEEL*RtNRKEKLNS 
3VyKYAKDLI*PPSKEDTQMANRR/H*K 
KCSTSLIIREMRFKTTMRVDAV 


940 


14841 


A 


947 


365 


1 


GGGPFWFFFFFFLGGFLKIGKIWAKKKT 
GPSFPQKKKRGPKKKKPPGFFFFFFF*K 
GFVAPGGGQWEGFGSLQPLPPGVKQFFC 
PRFLRKW/DSRFFFFFLIKSNAEQFPTL 
YAGVRMRYIK 


941 


14842 


A 


948 


357 


137 


I SAHCELRLPGSHHS PALLCS PGLSRTP 
NLKQSSLFSLPKCWDYR\LATVPGQLLL 
F*RRHCI S I DPSLHLNNM 


942 


14843 


A 


949 


53 


352 


REDNHHKCIRNKMINRKWANEVNRCFSE 
E I QMVKEHE KLLAS P P I RS MQ I KRYY\L 
NSIA* KKEKKSD\NTKLWQGFGETKS V* 
RYIIN/PYDPTIPLLGIY 


943 


14844 


A 


950 


937 


3 


KVSPYKINIHK*VAFLCTNNIQAEKK/M 
NMVLFTIATNKKI /SYLVIH* IKEVKYL 
YNKNYK\ LLKKNRDNANK*KGTPCS* I * 
RINIIKMSVLPKATYRLSAISLKL*MQL 
FMELEKE\TILKVIWKNKTAHLAKATLT 
\QSNPGGLTLPGLYYKATVIHITW/HCY 
lOITHVDQ * /WRRES PEI SLHF YTQLFS D 
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residue of 

peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutaniine, R=Arginine, S=Serine, 
T=Thrconine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=tlnknown, *=Stop codon, /=possible 
nucteotide deletion, \=possible 
nucleotide insertion 














KVNRS KQ / WKNCLSNKWYWESWLI ICRR 
RKLKLYLS PYKKCNS K* I KNLTVKLKAI 
KI VEHQKS TVLVTGLWKGFMTNT I KVNA 
TKIKVKN*HLIKLKWFCTPKET/T*QIC 
RQQRVWNKLFSNF 


944 


14845 




951 


246 


1 


AASTKTGVQTKTCT* MLIAALVITVKRE 
KQSKCSSVE\E*TMWYIHTMEYYSVTKR 
NDVL I RTTWMNLENS MLS KKDSHKG 


945 


14846 


A 


952 


2 


255 


QGCGEKGTFLHCWWEYQPLWKTVW*FLK 
DPETDIPFDQAIPLLGIYPKEYQSLYYK 
DT\AKTVraQPKCQLMVDGIKKMWYIVDA 
A 


946 


14847 


A 


953 


343 


3 


GSIYTKEMEPKVNNLQK*KASCTDGL*G 
EFY*TFENEIPILYNLFQKTEAKGTLTN 
SRDEASIILIPKPEKCTRRKENDK/P/I 
SIiMSIDAKLLNKILAN*IQQSRIYSRHT 
RLIQ 


947 


14848 


A 


934 


1 


349 


AQPPFTLFILHP\IYCIFSS*S\R*QKT 
LKTT*FQGFIFCFFFFFK/C2AKSCSRIK 
GRGPI lAKGTPSLPGLRDPPTS PT*VAG 
TKGGGPHTQLKPIFIAK*FyiSF*HNGK 
FCSRRGR 


948 


14849 


A 


955 


2 


351 


GLKNyAKSTLPVLYKWlSIN\KA*MTTHLF 
TAMSTE*FKPTVETYCS/EKKIPFKILQ 
LMDNAPSHPRALMEMYKEINAVFMPTOT 
ISILQPMDQGVISTFKSYYL/HKTFCKA 
IGAIDSDS 


949 


14850 


A 


956 


336 


3 


PTKENFEPDGF\TG*PYQTFKELIPPQT 
FliKTSREYFSSSSSSSSSSSSSSSSSSS 
SSPYRPl/SLDMTD/AKIMKILTNQVE 
IjHIKRNTHHDLVGFIPKXWFWIRKTDQS 
NN 


950 


14851 


A 


957 


181 


1 


RWDLTMLPWLVSNS\GLRRSSH/LPKCW 
DYRFEPPCrA*GWPLrGPHWNVSTWGS 
PRISRQ 


951 


14S52 


A 


958 


3 


293 


ggltsphvktyyqatvikrawng*rgvc 
mdq/ynktenpetdpckysqlmfsevtk 

AIQ*RKr)Sr.VNIENWl(IN*MSIHKKSSSR 
iCHLNQYLTPYTKL 


952 


14853 


A 


959 


350 


3 


nkkkgqslvfrsptlfffffl*iemg\f 

IMLftRLV*NS*RRNMTTSGSQSV3IT6V 
SKHARPKRKLFLFSI ITTIGWAGV\WWL 
TLVISALWDYRREPPRPAVLLPFHIQES 
RMPLSN 


953 


14854 


A 


960 


349 


1 


ggfppplfffpf/ PPPWGKPPPPFFYQ \ 
S FPPKKKGGPKPPFF7GSPWGVHKKQFL 
VKSPLPFFWKKKKPPPPF*KKFFPP\VF 
QKQPGRVKKPPYERNSWLLGPPTKKSPK 
KSTGS 


954 


14855 


A 


961 


379 


160 


PGPQGEPPFFLKNPQKLVGQSGRLF*AP 
liRGVRQKNSLNPGGGGALKPGS /HLWP 
PSWGKKGDFLF*KKKKKKK 


955 


14856 


A 


962 


184 


2 


tffspkkifpffffskkffpkgtffspr 
enffffffff*dgv*l\chpgwsawps 

RliTATS 


956 


14857 


A 


953 


1 


328 


arlvlks*pq/cdlpasasessgitgvs 
hcasasqsagitgmshhirpkwislylg 
fwsfnknvlhffcasslegesmnkells 
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ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
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Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, '*=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKMWE FLDKKLHC3S TKLGGNHVCVNF 


957 


14858 


A 


964 


1 


334 


HPLGGHVGGVPLGP / WLNPPPPQ*GTP 
FFLKKPNNPGRGAfCPVIPAPRGG*GGKF 
LLPPRGG/PSMNPNCPPAPPP 


958 


14859 


A 


965 


206 


360 


QKSMSLHQQ*QDIHSSQSHKKrSSW\PGA 
VAHTCNPSTLGGQGMQITRSGDQD 


959 


14860 


A 


966 


10 


332 


NFFSPGAPPPPFPFGGGFFLKKGPPFF* 
KKKILGGPPPPKKNPPGVFPFSP/AKNL 
GFPPPPPPFFFF 


960 


14861 


A 


967 




282 


RKCAKDLNRHS TKED I WIS TEC I * HC * S 
LRKSTLKPRDMPHIY*MD*LYKKNDDIK 
CW*GYGVTGAVFHFHWVSRMVQPLLNKW 
TVPRNVKFTL 


961 


14862 


A 


968 


339 


195 


FFFCRDGVSPCCPGWSRTPRPKQSTCLS 
LPKCWDYR/RL*ATVPSLDYLF 


962 


14863 


A 


969 


3 


342 


IKKGPHPQLKKPPGVFPVFQFKKSYFPP 
PPPLFNNPFFFR\DKASLCHPGWSAVML 
SYFTAAYTSW\VRSSSHLSLLSS*EWRH 
TQPCLFF*F*FL/CRSK 


963 


14864 


A 


970 


384 


6 


GRKNVAGFPLSPLYTPPHGHGLGPPQTF 
GAGPPAHKSHQKVGRQKRGPGFVPPRPP 
APLFFFFFKE*SVLEKKENNLYNSLFAY 
KILNK7QGETQCEGEflHIC\VCVCVCVC 
VSVCTCVHVC3VLAIC 


964 


14865 


A 


971 


427 


109 


LNSQFELQEKRISNLEni!JEKII*SEKES 
KNRMARN/E/QSLRETWDTFGYTNIGIM 
ECPEGEEK/GKRKRKTIKKKMAANFPHL 
MKNINLHIQKaL*TPTRITCTGPYGG 


965 


14866 


A 


972 


389 


145 


SLQP*TPRLMLSSHLSLLSSWDYRCVPP 
HIiANFLYF* *R*GFAMLGSSNSSA/SAS 
QS GGI TGVSHHftPWFTFKKKLVADT 


966 


14867 


A 


973 


170 


419 


VSFLFFFFFLERQFYFFSPGGGEGGEFG 
LIEPFP\P6FGWFSCI.TLW/RRWD*GAP 
PPLPAYFGFL1K\TGFPLVGRGGLDDRP 
3R 


967 


14868 


A 


974 


212 


3 


QNSMIRYSRSMSRSCLCTTILILSLNRS 
LKSPR*/WPGVVAH&aiPS'n.aGRGGRI 
TRSED*DHPGQHGETP 


968 


14869 


A 


975 


28 


356 


VFETSLFRNKKEIVKGTSLPDFRTYFKP 
LI I ES I * CLHGIGQNHAWNRVGNLQ IDA 
/ECAENLIYEKGGISIQW/SLFI!IK*CWS 
ITIHKEEKKIRPLHCIKIKGTKDFKKLS 


969 


14870 


A 


976 


350 


i 


lfckekvsllcpgrfeppgmkkstllsl 
pkmrgyr/rlapppqleiffhffltska 
tpl/ clgrs lkrlns qmfs pffs egvs l 
csqagiqwrdfg3lqpailwfk*fscps 

ILSSWN 


970 


14871 


A 


977 




343 


plhsslgnl vrphllnnsnkkdttkcm * 
gcekpgflihciweymmv*llwktdwqf 
likpnihlpydsavahlgiypr*mktfl 
qkl/cp*mfiaavfliakngkqlrcpsv 

SK 


971 


14872 


A 


978 


3 


424 


KRMRRQATDWEKIFAKGISDKELLSKIH 
KELLKLNKKKSPKQVACFKMGKQT*TAH 
LH*TIYR/CRRVTHRKDHSTRSWLLREL 
QILNT 


972 


14873 


A 


979 


114 


325 


QQQQNN* I KKLAEDSNRD FSN/EDI QMA 
NK*RCSTSLVIMEMQVKVTVRYYYLTLI 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleueine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serire, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop eodon, /=possible 
nucleotide deletion, \=:po$slble 
nucleotide insertion 


T73 












SIAIIKQKKKQWPKK 


14874 


A 


980 


339 


2 


GPTGFKFNQGGPPPKGSFFL*KISPPS* 
NPPFFFGPQKGFKPFFPPPYLNPGKNFF 
SKKPLFFFF*KKGFFKPPPFFFFFFFF* 
GRVSLCCPGWSAV\A*SRLTVTSTSHVQ 

T 


974 


14875 


A 


981 


2 


377 


qmankhmksfstssafmkm* iknkmrsh 
dtcirmanintndt\cw*gcrsnkilmh 
sw/weckmvqll*tmv*kfliky/dtyh 

DPAISLLGS I KKKSKFCT* /t/CKILYL 
NVLFIMNPNWKQSNWYSMG 


975 


14876 


A 


982 


1 


457 


INK I EAENTTEKFNKTELTS SSSSSSSS 
PLVGLTKRKKSQN\NEGDITTDTIEII/ 
TIIKDYSE+IYA/NKIEQASSSSSSSKT 
YNLSGPHHEGTESLNRPIMSEEIESLSQ 
NLPTNRSPRflDSFTGKFL*TSKDELPPT 
LLK\FQKTEQERTLPR 


976 


14877 


A 


983 


332 


2 


TPKAGQMQVLFPQYGSKVAEAP FS WAL 
PPPPPFFHFFCFPQPHCSLPPQIPPPFS 
RFFLQVKSSQQGPPPSLLGWQEQ/NTLP 
FFFFF+SHSLALLPRLECSGAISTAA 


977 


14878 


A 


984 


365 


2 


KSSKLFNYPPLFFFLKNFGPKKK/SGPF 
FCPFKNFFNPRVPGPNFNFKGPFFLKKG 
PTVF * I KKFWWGPLPPLK* PSGVFHF FH 
FLIWDFRPRPPFFFFFFFLVERGFHHVG 
QASLELLTS 


978 


14879 


A 


985 


1 


153 


GVQWC3SrYNSL*PRPPG\SGDPPTSAFQV 
AGITGVHHHTHLFFFFVGFFFNF 


979 


14880 


A 


986 


225 


i 


LYIYRERERERDRDRDRDVSHTHLYIHT 
RTHLDNILFCHPGWNAVa*SWFTVALNS 
WV\KSSSRPSLPSSSDSS 


980 


14881 


A 


987 


2 


375 


GTDEYIliIALIWMVSWVCLFFFFEKKA 
PFCPPAGKTGANFGLRAPPPPGIKEI FW 
PP*PSEEG\EPRSPNPSRGNFWFFKKRG 
GSPr.*PRLFG/HPRP*GNRPP*/PPQRG 
GNNKGDPPPPPGGIFP 


981 


14882 


A 


"988 


55 


335 


HI Y IDVl! VSGS WLLTV/ IS LLELTVFCY 
NVGAL*ASG\QRSENRISVSDFLLPSFY 
LPKAGL*SFPAFMMMGHKILIPEKVIPH 
MLEEE\TCTERP 


982 


14883 


A 


989 


295 


369 


ILVTRNSEL*VLPEQRTRFLEKftMALRS 
IPLLKR/NGRPGTVAHACWPNTLGGQGG 
RIKRSGV*DQP 


983 


14884 


A 


990 


2 


240 


KKISMWLEHEMI.SERSQS*KTTYYMIIF 
T* KLQNRQI YK\TERIYIYIYIYIYIYL 
FFFFFLERKFLFLFPRGEGGGPF 


984 


14885 




991 


463 


0 


gfrqlscls \lpsswyyrhms phlanpf 
\ tf * vergfrhvgqaglellt pwds p s s 
asqsaaih/gmshctqp 


985 


14886 




992 


82 


365 


EICHQLYRSFLCSLFC/DHPSWKSH*DT 
MNCVHFI IRLLNFSFFFFEKKFRFVPQV 
EGQGSNLG* LKVLPPRLKPFS CLTLRRS 
WGYRGPPPPPVN 


986 


14887 


A 


993 


3 


317 


GLKQSSCLGLPKCWDYRHKPPCPTHIIF 
NTH* I IKVLKVSFSFCI /PLCWS lALSD 
HVQ/ P V* L YNMLVSS FLLLLL FFE I E W 
ILACLFCPVGSPILFFF*YILTF 


987 


14888 


A 


994 


350 


139 


LFRRLRWEQHLS\QGGQCCSEP*SHHCT 
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nucleotide insertion 














PAWATE*DPVSKIQQNKNQDTFSHTFIL 
ECRTSKGVSAFLAL, 


988 


14889 


A 


995 


219 




RPRRPLIQKTKAKNHHFTPTRMPVIKDN 
NKC*QGCGEV/Wir/CRMVQPLWKT/IQF 
LKMLNTELP*DRAIPLTGGKE 


989 


14890 


A 


996 


3 


498 


CLPWAAVARQPPSAHPFPQPGP*LRTLA 
PDTATAVNQALQRQES *TGC* S PSGWPA 
VPTPVAPG/PPSAG*GVVDPRAWPHHNG 
P PGQH P KEKT YQ PVPAP * VQLS PTRQTP 
APMACSSLLLPALPPPAPAARPPCPAPP 
FLPSRTSTLPPRRLSSTIKSKTPPGP 


990 


14891 


A 


997 


2 


419 


ADTEAAASYPDNL\ANDESSYTKQQIFN 
VDKTALYWKKTPS RTS I AREGKSKPGFK 
TSKDKLTLLLGANAADDFKLKPMLTYHS 
KNSRAHKNYAKSTLCFRN*NKAWTEHLF 
lAVJF/EYFKVTWTNCPEKK/VLLLTHN 
APG 


991 


14892 


A 


998 


388 


405 


G*FFPLSASKFFFCP*ALKLWMEGCHIR 
PPPQVRFPLQSSQA/GFISASLKGKGFP 
ASMPANIGPPKGYL*PAPPFFFFSFLRQ 
SHSVTQAGVQWRNSSLQPQPPG/LRRSS 
HLSLPGSWDH /MPP CPANFCI F CTGG 


992 


14893 


A 


999 


1 


423 


WILHD/NLRQPAQWLDQEAAPKHFPKPN 
SH/QKKVMVTVCWSAAGLIHYSFLWPGE 
TSTLEKYVQQIDEMH*KLQCLQPALINR 
RGPILPQDNARPRVTQPMLQRLNKLGCD 
VLPHPPYSPDPLPTDYHFFKHFDNFLQG 
KHF 


993 


14894 


A 


1000 


2 


406 


SDLFRAI*ETTSVIVYDVSMDYLEIDSE 
MIRDLLNTFPV\HLELQQDS SGVIQGAG 
I TEAS TINAKE IMHLLMKGNRQRTQE P T 
AANQTSSRSHAVLQVTVRQRSRVKNILQ 
EVRQGRLFMIDLAGSERASQTQN 


994 


14895 


A 


1001 


290 


3 


VTQLPETVPHFLRDRVSFCCSGS/DHSS 
LQPQTPGLK\YPPTSAT*VGGTCHHAQL 
I FIFLRDGVS\SVAQAGVQWEDLSLL* P 
PPPGFKRLSCLENR 


995 


14896 


A 


1002 


143 


408 


IKKKIKEEI*KYI.DISKNK/DTYQNLWN 
ATKAVL*GKFIGINVY1FKNRKN*NK*F 
NSTSGKLKKLEKKEQTKIKISRKETIKI 
RAKINE 


996 1 14897 


A 


1003 


84 


409 


DKLAaLPSSWTLHPGSPLGVTRAPPPPP 
PPPPPPPPPRVLEPVPRSLYPGLAVPW 
PRAIiH*PPHPGSLPACPAPGLLGGTRQC 
SXQTILPKKKPPPLDADPAKEPPPP 


997 


14898 


A 


1004 


392 


2 


EKNAVHFDQKNLVSIWEPLQVPTQVCIQ 
NKTNSWGIAGYPFHFLKRNQRFFNALKV 
PGPFSK*TIHr,VNLKNLFFF*DRVSFCC 
PSWSAWKS*FTJ\ASTSI.\IKQSSCLNH 
PS S *DCRHTPLLFFVRMES 


998 


14899 


A 


1005 


2 


421 


FVSLLLLITSAIVAAGAPTGRTVYPPLS 
GWYFHPGAFVHLTIFCLHLEGVSSILRA 
INFIASIISIIPPAI TQYQTPLFG* SGL 
ITAGLLLLSVPVLAAGITI \LLTEPSLF 
TTFFDPGGGGDPILyQHLF*FFGHPEVY 


999 


14900 


A 


1006 




428 


DIHYG*IIRYLHflDGASIFCICLLLHIG 
RGLYYGSFLYSET*SMGIILLLATIATA 
IIGYVLP*GQISF*GATEITYLLSAIBY 
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X=Unk.nown, *=Stop codon, /=possible 
nucleotide deletion, \=po3sible 
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I GTDLVQ * I *GGYS VDSPTLTRCFTC YF 
ILPVI IARLPALHL\LFLLETGSNKPIG 

LAS 


1000 


14901 


A 


1007 


1 


419 


SPIVPIVMAMACLMLTERIILGYIQLRP 
GPNVGGPYRLLQPFADAIKFLSKEP\LK 
PATSTITLYISGPTLTLTIGLVL*TPLP 
I PNPLVHLlSrLGLLFILATSSLAGYS IL* 
SG*ASNSNYTLIGALRAVAQTISYEVTL 


1001 


14902 




1008 




27 


GQPGQHRKTPSPLSIFYF/ELAGGGDAS 
T*FQLLRRLR*EARLS1,GI*GCSEL*LP 
PCPPAWVTK*DPVFG6RKECKERLSQLRK 
LRPREVKSLVQHYVAHKGGNWDPNPGPM 
ATQPGHSCTTLCVPLCHR 


1002 


14903 


A 


1009 




377 


MSLGGQQDSAQTTRS PQFAGFFGQHS IP 
DRTGLGFHLCVCKCVCACVQVSVCKCVH 
MCACACMCTRTPMCECVQVCECVQVCVR 
VSMCVQVCACVRVCACVCSGACG/CTCV 
T*MSLACG 


1003 


14904 


A 


1010 


1 


412 


HKfiNTSTYLLTLTOTFSG*VKACPTTHK 
TAEWASTLIEQIIPRF/GPAFISKIVK 
QVTTTLDVHMKEHTPYHPQSSGKVERfilSr 
SLVKQHLIKIiALKTRQSWVTLLPFALAW 
LWAAPQSPTGINPFELLY/SAPLPLSN 


1004 


1490S 


A 


1011 


1 


410 


FRAVaGaSRQENGA\TVILRDIARAREN 
IHKSLAGSSGPGASSGTSGDHGELWRI 
ASLEVENQSLRGWQELEQAISKLEARL 
NELENS S PVHRSTAPHTQQVS P I APVE P 
PAEKPATPT*DDVDDDIDLFGRDND 


1005 


14906 


A 


I0I2 


269 


32 


NrSRIYKELLQLNlIKKT\SISiCEAQYLN 
RITFTKDI *MAKKPMKRCPTLLVIREMQI 
KTTMRYYFTPARMATIKKTDNNEY 


1006 


14907 


A 


1013 


21 


443 


RIRKKNSYPHYVKS IVAYTFl IRLFPTT 
IFMCLDQEVIISN*H*ATTETTQLSLSF 
KLDYLS I ILr PVALVATWAIIQFSL* YM 
\NSDPNIKQCLKDLLMFLMTILVLSTAN 
NLCQLLIG*EGA*MISFLLrS**YARAD 
A 


1007 


14908 


A 


1014 




309 


ssraaaihggaclwlqiile^elrweg*ls 
vggqgcskp*lhh*tlawaterdfvsqk 
kkkkpqkanp\hletsckkkkkkpmlgp 
pksqrkegpkapqgrtlwp 


1008 


14909 


A 


1015 


3 


256 


EKNQI LKWTMDLSRHFSKK \NMQVGNRH 
MKKC*TSL1IRE1Q1 KTTMKYHFTP VKM 
All WPGAGl TGMS HRTQPS PTLLLTHW 
A 


1009 , 


14910 


A 


1016 


397 


3 


LFFSPPPKWGFFFFPPPRFFAPPPFFFF 
PPPPFFFFFCEKPFFFFFFPPGGGAPPL 
SPPFFFFFFFFPPPPPPFFFFK/SPPPP 
PKKK* KKK* KKKKKKTPNTPKKKPKKPK 
KKKKRAAARDLELVGRVGGRV 


1010 


14911 


A 


1017 


2 


154 


EKERRE*KREE*KERK\RKEGEREGGRE 
RERERKKKRKRERKKDRCMFIVHFRE 


1011 


14912 


A 


1018 


1 


406 


NS S PPKNKN* NK* KQHT* * EKNFANYI S 
D I EL VFRTYWELLQLNKTGNP IKNS F * K 
NQFKNKGK* PNFYITQSF* RNKPI * KF/ 
AKDLNKRFSKEDM* lANKRIKTCITPLA 
IRKRQ I KTT VKSS FKA/ 1 YMVWKE T 


1012 


14913 


A 


1019 


302 


2 


IVEPLFLGWLCLFLQNBRVKGPGPYFSK 
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nucleotide deletion, \=possible 
nucleotide insertion 


i 










GKANPWFPTV\SKQPFKKPAFPVYQKQ 
LPLF/PALVDKKKVL/RFFFFF*DRVLL 
\CCPt3WSAMVQSWLTTTSTS 


1013 


14914 


A 


1020 


47 


3S2 


AKIALLHSSLGDRVSHTHIHTHKRSNEI 
ELVIKSSPKKKSSGLNGFPIEFYCSFK* 
ELIPIFLKLFQQ\VEAVGILPNLF*EVS 
ITLIPK/PRQEQNSNKTIYRPISLMNIG 
GK 


1014 


14915 


A 


1021 


273 




QS KERRQRTQEEMGKTFHNHEEMGTLFG 
FSQSCLKDE II II IFETGFRS ISQAGVQ 
W/P/NHGSLQPRFPRLKQSSCFSLPSSW 
DYMC*LPH 


1015 


14916 


A 


1022 


327 




KFLKGWP lYLKKGFPFVPQGGGQWANYR 
SLVPQ I YGVK\YPSAFASKINSTMGMCP 
PAWVTFGMAQLINGD* I F/ IFFLRQSFA 
LVAQAGVQWHDLRSPQPPPPGFKRFSC 


1016 j 14917 


A 


1023 


3 


407 


THLTQR*/HNIQTPP/WV*TGTLSTVLM 
NFYLWM* FHFQSlTLFIIiRLLTTKnTMY 
QGWLDVTQQRSYQGHHTPPVQKGLRYEI 
ILFITINTFFLEGLF*ALYHSSLFPTLH 
*RGDWT/PTGI IPLKPLEDPLLNTFE 


1017 14918 


A 


1024 


3 


425 


LPPQKKRGFPKIPPREF*KTPPKKKKII 
FPPPGKNWPPTK/RFLKGPPPSRFIPFP 
FFPLPSSPSPP 


1018 


14919 


A 


1025 


430 


1 


GGFPPFPPKNFFFPLSP*IFG/RGVCPK 
FP PPKKRFFS KNPQRGFNNPPQKKKK I S 
SPPPVNFAPPRDLLKRPPPFFFFFFFFF 
FFFFFLFMHPSFPTSVPQWQITSFLCLL 
LILQCCVCVLIGMWGDQSESSPSHHLQ 
PRWL 


1019 


14920 




1026 


463 


494 


rktgfprlt*hldl.dfffffffwkgifv 

ap^;\gg'rgrni,g*IjKPPplgwapfsglt 
p p ks yln * g ap p p p p fnf /c vf * rktgfp 

RLTR 


1020 


14921 


A 


1027 


606 




AGePGPPKWPGAFGWG/SLNa*S*LKG 

kgskgapipqsgptvfpggippkesplr 
ppggmqgrrgprwpgasgc/vg*dsvlr 
qp/gvtgeapcpgasspqrsqaarrgsm 
gkggafhswggcvlsaawpgwqppgsh 
g*/pgrrggqsgcscpks*awwrpclag 
shssakhkgmssegpaprgrais pvsag 
sigpiaargf 


1021 


14922 


A 


1028 


436 




GGAPQGVLKRflaPFFFFWKQNFNFVARI 
IKGKGGVLS PRQFPPLGFKKFWGPTPL/ 
SRWGFKEGLKPPGSFWFFKKRWGFSMWP 
RWV* \IPGPKEL\PPPFSQRGGIQGGT/ 
HRPPIIPPSLF/CFFEMETHSVaOAGVR 
WRDLGSLQPL 


1022 


14923 


A 


1029 




436 


PPKKIIFSPKP*IFWGGGGPNFPPPKKS 
FFSQNPPGVFFSPP*KKKKIFPPPPENW 

GPPKIF/YKRPPPPFFF 


1023 


14924 




1030 


28 


366 


EDHLSPGGWGCNEL*LCHCTPA\WRQI>IK 
TLSPKEKDHK 


1024 


14925 




1031 


3 


142 


FDCSALQEFGTRLYCPSWSQIPELKRCT 
HLSLPKCWDYR/R*ATTPGLRIVLEI,QK 
KLRR*CRELLYTPRSVTPNINDI*HWGS 
TFVTINEI ISIH*YIALAGLRFLSSSDV 
LTSASQNVGITGMSHHTWP 
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nucleotide deletion, \=possible 
nucleotide insertion 


1025 


14926 


A 


1032 


7 


361 


PALRPSARGQQYYDETKKKTLCRNAQND 
SYLPDPNPFSRFSTL/DHSWHQLEAPDA 
* KAP FGLYWNCGARVCR\QGI SAK* TEA 
CGLGTIKPSFFLIPLKQGELLGYPAYNE 
NLKKKKKK 


1026 


14927 


A 


1033 


472 


3 


TKQQMLNVDRTAFYKKMSSRTFIANAEK 
SMFGFRASKEG*LLGANTTGNF\LOjyiL 
I YHS KNPRTP KNEAKS TLLVP * K*ilBrKA 
LVAAHLFTA*FTEYFKPPIEIQI ITADW 
APGHPR\MYKEMNVFMSAN\TSIL*PMY 
QGIISCRIPAARIEVKGA 


1027 


1492S 


A 


1034 


488 


0 


PLR* ILAQRSGIHSMKT*SGEAftAE*KS 
EARSSWFLRFKEKSHLHNIKAQDKAAST 
DGKAAESYLED\TD*DGNTK*ICNGDKT 
AFY* KEMPSRMFPAREELMPGFKASKDM 
LTLLLRANAVGDVNLKPMMIYYTENPRA 
LKNYDKTQLC/ PVLYKW 


1028 


14929 




1035 


114 


491 


NYFKNFTFSNKLTL/CTjL*LFLTj*KLFN 
F*TF*LLC/I*HIA*NTNTLYSCTKIFS 
FFIFLFYKHPLKFIYFFyFLDFII*GAY 
*SLCYY*WYFSCQFCQVLFHVPWGLCC 
HVLVRLLYFQTDEPFY 


1029 


14930 


A 


1036 


114 


496 


LLRAILTYLKYKISAMNLVSA*FIYLHL 
TYHCVP/DHPVQGR*LLNK* INELFCYR 
SFGF*WVFSYSHLSED*flLEEKYLRERS 
RWK/DLNVTSETVKILEEILGKTLLDI 
GLGKEFMMKTSQANITKP 


1030 


14931 


A 


1037 


381 


45 


KNRGRKKNS * LGGP \SDKFYKTF* KL IQ 
I I YKLFQKNEKEGTL\ *MSRPI SLINIE 
ANIQN/RAVLAMI IQKHIKKI IRHNQW 
F/IPGR*G*SNTCKSI\NVIHHIKQRR1 


1031 


14932 


A 


1038 


134 


394 


SFCDKFTNCISESLCQ/SYSPT5riIKIF 
VLFKIRSGSITQPGVQWHDHSSPQP*TH 
SWAQTI FLLSSQWGTKGMCHHTWL/ IF 
LFLFVF 


1032 


14933 


A 


1039 


483 


30 


YDGGS E I IMYVLESRLIGTEKFHKVTND 
NLiLSRKYTVKGLKEGDTyEYRVSAVNIV 
GQGKPSFCTKPITCKDEL/AYVSTTIYT 
SETCTPVDL**DINKNDLPL*LQILAPP 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, I^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AHIISQEKEIKKIQI*YKEEKLPLFTSS 
SLFI *VKNLMEFAKKLLELINEYNKVER 
YKINIKNILLAKNTWTLKF 


1036 


14937 


A 


1043 


50 


457 


TKYQLANKNMKICSTLPIIRE/MQIRPH 
EI SLTP IMMALSEEQKNIIKYWGGCREVE 
TLVHCWW*CKTVQSLRKTVHRFPK*LKT 
ELP*ELPPK/PGSQRDICTPMFIAALFT 
IAKRFWKHPKCPSTDE*IKEMVJYIQS 


1037 


14938 




1044 


2 


469 


LKQSSCLGLPKCHDYR/R*ATVPG*FIL 
*NTTEDPNLTKN* I LNTNH/WCYSVQLI 
LGESNSTIKTSLSFSQASSHQNTTVQFL 
STSPSFKNCQWLAI LPRK*GLTLS FCFA 
FERQGL/NSVAQAGAQWLHHGSLKAQPP 
RLKQ/FLPNCPTLMECYRLQP 


1038 


14939 


A 


1045 


42 


200 


LICT*MLIVALLVIARNffKQIRC/SST/ 
DEWLNKLW*IPMEYY*AIEKEPTIGTC 


1039 


14940 


A 


1046 


475 


36 


LNW * LNHP I KRQKLAG / YVRKHD PT I C 
CLS\QQSRFIFKDTNRLNIKGWRK\TFH 
AWSNQKIA/GVAVyVSNKTN\FTRNKE* 
NYILIKGSIH** CI II INICATNS *NLK 
I*NKLTEMKEEIHCSKITVED/YNTLF* 
TMDRQKEFHRIHQ 


1040 


14941 


A 


1047 


3 


326 


FFFFIIIIIFETERDSVSKIIIIIKWKK 
DLF*FLVTQVTPEITKQYISTL?LQSKT 
KWDRPGTVAHSCNPSTLGGRGRRI /TLR 
SGVRDQPDQHG 


1041 


14942 


A 


1048 


403 


15 


EVKNSSFSYRLMTEIMPFGYVSiCLYNCD 
SGS*LS*SENVCKMNRY*Y/LEERGNTS 
FKSLLLLMWKLLHSAIFTSA*TPRGL*I 
YFIYMYVLETWSHSVAQAfiMQWHNQGSL 
QPSPPGLBCEFHRTDIVYAK 


1042 


14943 


A 


1049 


25 


457 


YMWMWSHHSCMFTLLFDSSFPPPSLCFI 
LRSLFLLYAELPLAFLLVKDSFILFV*I 
MSVFHLILKLQ*WIHNIYFTILTI FKCT 
VQWH* VYSHFSVAKTAT\ ISRTCLIPSS 
WDYRHMPLCLANFFL*RQGIALLPRLVL 
TSWLQ 


1043 


14944 


A 


1050 


14 


478 


FEPYCRH*TTRWN\CWWGYELVQSLWKA 
TEQYVLKPDICVF/LEPGVSLLGMYSKE 
LCTLC\ YQKTRMFIATLFALVKS * *LPK 
SS*MVE*ITKL*YFHTMEYYTAM/KNE* 
ITI *TTIWMNI/TI IIJIERSQTQK 


1044 


14945 




1051 


2 


465 


GIDQPSKPIPL/TQSFTQTKA/LTFENS 
VKaERGEEAAE/ER/VEASSD*FMRPKQ 
RSHLHNIKVEGEAATADGETAGS\SYPE 
DPAITDEGSHTKQQIFSVDBTACCWKIS 
SRTFAAKEKSMSVSKRSKDRLTLLLRAS 
AAGDLKEFPRLV 


1045 


14946 


A 


1052 


392 


24 


DFAPRRKKRETRRS KINSLTS * * KELEN 
QVKTNQKGSRKQEITKIKVELREIETLK 

TFPKKKVNKSRSWFYEK\INKLDGTLAR 
QINKKRKENQ/DTIRNNKGDIITDPMEI 
*TTIRGIPPLWR 


1046 


14947 


A 


1053 


204 


488 


CAVCYTQISLTEWVDL*SLDPGFETVSH 
LFDRVWQP * DCRRVTDMLMVF *NLHFVI 
LFFLFWF/DFVFVLRQSFPWTQAGVQW 
RDLGSLQPLPPG 


1047 


14948 


A 


1054 


498 


2 


S VWCLFLS PGITSHT YVPMI FKIGAKK 
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NO: of 

sequence 
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od 


SEQID 
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USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
Iftst amino 

residue of 


Amino acid sequence {A=Alanine 
C=Cysteine, 0=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=ProlinB, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VHWWKS ILFNKS CWRNLIS TCI RMKVDP 
M/LTPGTKINTNWI KDLTPSAES 1 1 CLK 
ENIG/THFHDIRLGNAFWDMTPKA*ATK 
EN* IPWITSK*QT\FCAAKNTVN*VKR* 
PMD*E/RIFAKHISEKRLICVIYKEQLE 
LN 


1048 


14949 


A 


1055 


28 


209 


IDGRSRGLWW\ CWM.SQLLWEAEAGGS 
LEPKSLRLQ*AMIM\DCTPPW 


1049 


14950 


A 


1056 


163 


479 


TSFLGSQSAGITSVSHCPSREVFFLKLI 
HWRQGGQVALLVATPHSPCCPQYRIiaPI 
PRARHDFACASLIFVCILLVHVLLMPRS 
VAGRGVGSGLLG/CQAGR*L*TC 


1050 


14951 


A 


1057 


2 


1258 


ELFPWHFQPSRAASLWAKAIRDKVPGT 
RRWPVPTDGRPEGFALSEPHHS CLHGRH 
QGPERCRH^aAASP*EAGNK*PKDKDGHP 
GERTETAAGVHTGCGGKGPAAAGGR*AQ 
GGSGEAGAAAGGAGAAVGGAGAAVGGSA 
GGRWPAGLLGQHGAG* GEGP CRQHGPP P 
GVSAGQAAS PAKAAGQPGPGT*GAAGQP 
GRG * G PAG P RGGAAAERAGAGAMPAQGP 
AGAAAEPAEGEAR PGAGDYG PAANHPGA 
RTGAGGAEGAGAAAGGLPRPAQAHR\AQ 
I HGPVPLGTGGRSS S/ GGI PDNMSHRLW 
QRHRSHGEASAGQRHPHPGPTGGER\GL 
QSMLSKIREVAQQGGLKVGLRGRALiGDQ 
EEAPIQQQVFRLCPGNLWR/RPPHHMRP 
*AVLL*NIFHISRRREDVMDPMPSSPIC 
PL 


1051 


14952 


A 


1058 


2 


437 


ERSVRTACCKQSRGLSSLLCPPPAPRLS 
*TGSPVTGSSALEPRSPHPQP\PSLRPR 
SPGPQPLHPLGPRSPSPSPEQTVPSHPA 
RLPSLSPERTQG\PKAQ*SQLPPPSLAL 
AQPAPAVGGREEVAS PACPSWKDKS RLR 


1052 


14953 


— 


1059 


47 


456 


TRCYGTLLRLnAPGKLWTRKM*WASTy 
ARTDRKS SASFRPLLPLPRGFQLSLHFS 
LTS PS CLAFSANTHRGLRGENYHITK/C 
DMAPKV/HN*TQAWQWHDFGSPRPPPP 
GFGRSSCLGLPSGWDYSPLWHHEQIL 


1053 


14954 


A 


1060 


484 


13 


FPTSASQNTGTTGVSQHAQPIIF/LFVE 
MGS PQV/GPASLKLPTSRDLPTPASKAA 
ESVSAHHH*FLHFL*RWASGFVaPSWS* 
TFLL\KQSSCLSLPVC*DCRCYNHWAWP 
HSVFSSQFLSLFPFNFSFLFFLSFSFFK 
DRISLCRPGWEFHQTVQWSR 


1054 


14955 


A 


1061 


1 


494 


gspghpvcgrrrsalenpgqpcslqpgv 
vsgatgaivrvqrtssawataaamgayv 
e/ttrpryt*kagvgcdrlwgawlkadg 
lgetkaehtlhdg/ ppedalygli egdd 
tnftiqgewhcmytgtlpdgtwdtni 
qtsaqkmm/nakplsfkv/gigkvsh/g 

WDE 


1055 


14956 


A 


1062 


3 


395 


SDPSPCGGIRFDEMEKFLEirarLPKIiKE 
/DKINDPTPVK* leSVMKI PHTVEP /PS 
HKCFTSEPY*TPKE* IAPISHKLP*SRG 
IMGTATPPVMFHLQENVIPK\PTEKK*T 
YRPVSLMHlHXKIIiNKILTYK 


1056 


14957 


A 


1063 


402 


260 


WYCPPKV/ LLRFSVYSSPPEVWAVGS I K 
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SEQ ID 
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nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=iVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /'=possible 
nucleotide deletion, V=pos$ible 
nucleotide insertion 














AELYMLRPLFPRTREVD* I F * I CPV/ LK 
GSPKNKDFCP*GAKTFEYFYPAT]:.GLNL 

KI*IFRQRW 


1057 


14958 


A 


1064 


66 


379 


EFGRGCGEPKLRQCTPAWVTEEDPVSGG 
KKEKRKEREKT*REPSVSRPESH/PPSQ 
EAFWWQALSRPVLPNFISGRSASGLGK 
S EA I KS LFLYIiAVCS TTEEQA 


1058 


14959 


A 


1065 


223 


437 


kscmscdsnkifflrqgikScstiTQagvq 
whylgslqplpqgpk* *shls llskwdy 

RR\PPR 


1059 


14960 


A 


1066 


414 


65 


NKSPDRHELAS*KCTQ*THTSIARDFFA 

kt1qwgknslfniwc*dnwistskrikl 
dpyiipytkNtnskwirdlhvkaktikl 
lekngpgawcpcstpsysggllearslr 

PAWAT 


1060 


14961 


A 


1067 


418 




LWVFFFSRQSL/DSVAQAC5VQRCBLGSL 
QPPPPRFKRFSCLSLLRSWDYRCRPPHV 
ANFVFLVBT\GFTMIiARMVLNS *PCDLP 
AAASQSAGITGVSHHAEL*FCGF*YIHR 
VAQPSSQSI*DDHPIKKFCIL*QPTPFR 
S 


1061 


14962 


A 


1068 


24 


440 


E V*NL YS EINYKMLLKEMRGDLNKWKS I P 
*S/WI*RLNIVKMAVLPKL1YRFNLIPI 
RIPDEFFVKSTS*L/CKF1WNCKRU11A 
KTIF*K*RTK/SGDLILTDFKTYYKTMV 
IRTI*Y*QKDRCID*WDRIBS/PEMNPY 
lY 


1062 


14963 


A 


1069 


442 


26 


LRNQVSTPSSKTPHFFFFFFLKRQTGAQ 
RHI!1HSSL*PRTPGLKQPSTSASGVSRTI 
GASHHTPLIFFYFFN*GKEGWGGCSCFV 
AQAGLQL\RLQMILPPQPPLKCWN*RHK 
PPSLTYPIFLKRTP*SCFSKEKATRIPP 


1063 


14964 


A 


1070 


294 


46 


KNKLECRKERKENNKKKAKIND IKNKSML 
EKIHKDKN*?FERtniKIDKLLATN/RIT 
KKTQIVIFMNKIEDITTYPTDIKNVKR 


1064 


14965 


A 


1071 


141 


469 


PKKQGVQLTQNAPLPFRVGGKHGLCPKK 
KKERBRDRE IIS I I I I I IKLPTKKTPGP 
DGFTDELYQTY K* LTS lliKVFLKKRRE 
CFL/PYSMDPAVPAIAUIPPP 


1065 


14966 


A 


1072 


49 


472 


EFQIIKK*NSFFADTEKALVWIENQNSH 
NMPLSQSLIQSKflLTLFNSMNAERVGKD 
AEEKLEASR6FWFiyiRFKKPSP*HKVQGE 
AANIDVEATASYPEDLPEI IDEGGHGKQ 
/QIFNMDEIVFYWKMPSRTFPAREERSM 
PG 


1066 


14967 


A 


107.3 


82 


419 


LTVGFRETRSSSWVCSSSPKEALHALRF 
TPALKF*FFFWSFSRLFFFFLRDRVLLC 
CPGWSPTLGLK*SSCLGLPKCWN\YRYE 
PP 


1067 


14968 


A 


1074 


1 


407 


LEDGFFKITQSDKKEKKRI\KKCK*NLQ 
EVW\DyVKHSNLQVIEIPE*EVKCLENL 
FEEVIEANl /PHLAS YLDIQIQAVQRTQ 
RGYIiiRQTS PKHIWRLSKVNMQEKILN 
LPKEKHL I TYKGNS IRLTAKPS PKPT 


1068 


14969 


A 


1075 


3 


396 


LHAYHIVKLSP*PLTGALSE\LLMTSGL 
TM*FHFHSITLLILS LLTNTLTI YQ * WR 
DVTRES TYQGHHT PPVQKGLR YG 1 1 LF I 
TSEVFLFAGFF*AFYHSSLAPTPKLGGH 
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nucleotide 
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SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline; 
Q=Glutainine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WPPTGITPLNPLEVPLLNTS 


1069 


14970 


A 


1076 


250 


47 


VAACQNAFCTFSCYWL*NLLCRK\MDLD 
IS LLP * TKVNSRW I AGFMVRALTI KILE 
ENIGSAFGNRKSQ 


1070 


14971 


A 


1077 


2 


393 


PLTGALSiSrLLLNSGLAMRDDLHS ITLLI 
LGLLTSTLTIYQG\WREVTQESTYQGHH 
TP PGQEGLR YGTMLF I TS EGFLYTGF F * 
AFYHSSLAPTTQLGGHWPPSGITPLYPV 
ESPLLNTSALLA*GVSIT 


1071 


14972 


A 


1078 


2 


500 


LHVIGVSEGEEN\GMKQNKIFEEIMGPN 
F/PNLVKYINA*/IQEMQ*TPNGIHLKK 
TVHRYI IFQLVRTJfNKERILIVAREEWH 
GIFGGTML*MNDDFSSEFIKARKIEMMF 
LKYKRGISSSAKHC*SRILCVAQ1SFVS 
KG/BITTFSDKRKLRAFIISRPAHRNAK 
GK 


1072 


14973 


A 


1079 


419 




LRWSFAIAAQAIVRWENLGSLQPLPPSS 
SNSPGSWRLQ*A1ITAPLRSSLGNKNETL 
SQKKRK*KDNSEWEKIFANHASDKGLVS 
K* I KNS YNS ITKN\DI *KWTKYLNRHFS 
KKDIKMANKHMKRCSTS 


1073 


14974 


A 


1080 


468 


2 


KLPPPRGAGSSAP/PLFFPPTKKGFFSP 
PPP*KFFFSPNI'LIFLGGFFPIFPPPKK 
NFFLKNPKGFFFSPP*KKKKIFFFPP*I 
FAPPKFFFKS PPPLFFFFFFFFFFFFFF 
FFFFFFFLRTRGRTRGRTRGRTRG 


1074 


14975 


A 


1081 


467 


1 


PTYKFNAIPIKIQ**FFKKIENCILKFL 
/WITLKEPQIAKTVLK*1SIKVEGFTLPDFK 
\LTVTKTVWS*HKDRHRDE/WK*TESLK 
IKPYKYDQFFFFFF*FSFL/HFF*YLTY 
SQHHKLIFNRMPKPFNKERLFNK*CWEN 
WIS ICKI IKLDHYLTPYTK 


1075 


14976 


A 


1082 


387 


3 


LKQPPPHKS FVKNKNGVSLCCPG* F* TP 
GLKRSS\HFDLPKCWDFRCEAPVLSLLN 
NIKL*L/CGRGLIKPQFP/SVK*K**AP 
3WSAHLAETFFFFRQGLTLLPMPGVQWH 
DPGTVQPQTPGHK*SSHFSL 


1076 


14977 




1083 


2 


435 


FHPPTNWGGFSPPPP*KFFFSPKAIiNFG 
GGVGPNFPPPKKRFFPKNPRGGFIYPPL 
KKKNFSFPPPLKLAPPRV/S FKRPPPFF 
FF 


1077 


14978 


A 


1084 


447 


47 


PSRKFFFLAPPKRGFFPPFPLKNFFFPL 

GGLFFG/RGVFQFFPPPKKGFFFKKPRG 
VF*PPPKKKKNFFFPPRENLGPPGVFLK 
RPPPFFFFFFFFFFFFFFGRRFVK 


1078 


14979 


A 


1085 


438 




AKNHPKGFSPFKKFSPPFGGGGIFIRGQ 
L*KSFFFFFKKPKFLV*KPPFKIFFF/I 
FFFKQRVLAL/CPPGGNKGVPQGPFPIF 
FSRVGKPLFFNPPKNWGKKNPPPTPGKI 
FFF?CSFFL*RQGLTTSHRLVLNSWA*A 
ILPLQPPK 


1079 


14980 


A 


1086 


43 


265 


ikhkleyikiknfcasnsnr/terqpme 
wek\nfanlisdkgliprihkqll*ln/ 
tkqqnsp r * kwvkdqlqwpyl 


lOSO 


14981 


A 


1087 


11 


412 


IASGLHDFFNKKKKKKKKKGGGPLKKTP 
GGPKFNRGGKKKIFSFKGGEKKTPGGFL 
EKNPFLGGGKMGQNPPKKKKPFGEKKNF 
*GERGEKKPKFPGGKIW1SPSPQE/WKAP 



592 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

otlde 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=C5^teine> D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RDDPRPPE 


lOSl 


14982 


A 


1088 




292 


RSLPASADSSSLVAASLAGVRDRVSTCW 
PGWS /AV/ VQTL * SAELGLP KCWD YR CE 
PPCLAEAPSFMRSGKASCTLETVWEDKH 
KYEEAERRFYEH3AT 


1082 


14983 


A 


1089 


376 


3 


HIGLYIASCKTLLKEIKGS*IDILCSWI 
ERLNTVKKAILLKLIYGFNTIAI/KILS 
GLFIETERMILKITGKCNRSQTANTILK 
PNKVGRLALPNFKTYYKATVIQTVWSWH 
KDRWYGFFCVPTQI 


1083 


14984 


A 


1090 


277 


407 


QI KAERSHHKQ I TSMRNVKHCS SENYDS 
*RKPAPGYIIIK\F*K*LIKRKIIJ!JSGR 
LKNRPIKEE*RYAYQDISETIKUER*EN 
DIFKVL\KEKKSCQPRLLYPLKIS*IEV 
KYFFDKSKLKEVITNESPV*EMLNIVLQ 
/INYDSPYEP 


1084 


14985 


A 


1091 


421 


2 


GHPGVLFKG* *A*NEKSL*NCSFFNY*S 
SFSNMQTRIKNV/WPSTVAHACKPSTLG 
GRGGQIT*GQEFETSLANTVKP 


1085 


14986 


A 


1092 


369 


3 


PLKE IRSVTP VNT* 3NL IAD/ 1 GEKVLV 
VWIENQTSHNIPLGQSLIQSKVL/TFNS 
MKTER* /EKLQKFHASRGWFMRFKERSH 
LH/NIKVQGEAESADVEAAGVYPDPAKT 
IVKGGYTQ*QIFNVX)V 


1086 


14987 


A 


1093 


71 


506 


FAEDNGL*LHPCSCKRHDLAVFYGCTSF 
VLTFGr.*PWFLTQS/FLNPLEFS 


1087 


14988 


A 


1094 


118 


385 


SDLGKDFMAKTSKAQAAKTK\ I YTWDYI 
KLRSPFSAQ*TKQSTE *RRQSTE/WKBK 
IPADYSSNRGLISSRQETQTTQWLKKIN 
KNALCTL 


1088 


14989 


A 


1095 


419 


■ 


DAKI PGQMVAR/RI PRPIKKX E'CPDQVW 
FIPGMQGGFHIGKSINVflPHIQMG*KSF 
IilNIQHPFMI*KKKlILSTGTTHEGDITQH 
/R*MIfiEHTTNaiLNREKLKAFPLRTGI 
RQGCLL* PLLFS IAVKT*P*AMRQEKEI 
W 


1089 


14990 


A 


1096 


34 


464 


NSSKKEKRKlPHDL6LRFLDMTPKT*ST 
NGMMD/KDFIKIK/FCaSKDTINKVKRE 
*EKLQIIYL/DKRLVPRIYKDSYYPIRQ 
LKKWVNNLNKHCFKEDTQiaNKHKKECS 
T\SIRE*QTKTR*RSHFISTRVTKIKMS 
G/N/NKYWQGC 


1090 


14991 


A 


1097 


458 


28 


RI IKVDLKMCTHNFDSLEEMNQFFKKSR 
WPR*ISIR* FE* SYHH*HN* IHNLKVSKSS 
GPDAFTGEFYQTCEEEIAPILQNLFQKM 
ESIRSNSFYKVRITLIP*GniSrGS/TKKE 
NYSWSFMNLDAKVLIKIIjAN* I*VFIK 


1091 


14992 


A 


1098 




500 


FWIRHFLSRHKQRKNR*MDFIKrKNLYA 
SKDTVKEVKR*ST*WE1!JYFQI/RMFDTG 
LVPRVYKEFI *LSWK\DNHI * QWGKKAV 
SRGFSKGNTQMAKKHMQRCSMFFVIRKM 
*I\KPKMRYHFTPSSMT*EK1IKDNTCFK 
SDGEYGILIYYYWGI 


1092 


14993 


A 


1099 




484 


RRAPFFFFFFKRSFPFWARAP/LQWAFF 

glvqnppprqipppglhpf/scpnppkk 
wgykgppptpgkilgf**kgggphlnkk 
ifisrpcdppaptp\qkvgiqggtpppr 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
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09/515,1 
26 


Predicted 

location 
correspond 
ingto first 

residue of 
peptide 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=PlienyIalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
P=Threonine, V=VaIine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PFFFFEIWSCSVAQAGVQSCDLRSLQAL 
PPGFEQFSCRGMPLKWIGSHPSPHP 


1093 


14994 


A 


1100 


144 


260 


I LILDSALGHPEPHEFNTKGIEWYLPL 
WTMYHQPLDQ 


1094 


14995 


A 


1101 


265 


457 


DPAAPLLVTYTREI \*MYITKRYTAALF 
IKTKNWK\RPKCPSKGE* INKLWYICTM 
EYC3AIKNE 


1095 


14996 


A 


1102 


387 


40 


RLEADILRILVYMNFLQRLKKQRYLFTG 
PQYRFLFLELWLCVCVCVCVCVCVRftHF 
*IISRDKVLLCCPV\CFCFLNPPCVCVC 
VCVCVCVR&CAIINltLLVETRSCSVAQS 


1096 


14997 


A 


1103 


465 


208 


QNMTKTFQDYRP I SLMKI YARKQQYRYR 
y *RMLH]SrYQMGFTLGLRGWFNI *K/S 1 1 
IHIN* *KBaCIMIILIESEKAFDKC*QSL 
II 


1097 


14998 


A 


1104 


"57 


250 


MTCSRMQ/RNIKLLRYKSEKL*EENSTS 
RNEITILSEEDSFTNVKLEN*MKrVKQK 
KKTSKKTGVXKMMKKQISEFKGKTSNF 
Y*EQINT*EKKNFKENWSL 


1098 


14999 


A 


1105 


2 


378 


YVUPRQCGGILKGVLKGEFTISNEKaWP 
GRGSPSSVSiaD/LtCQNIKTIESKKTCL 
IQKLIKVKAE I YERESRKTMEE INETK/ 
SQFFEKI /NKID* / PIAHLTKKKKKTQV 
LKSEK\GNITTiniTEIKRV 


1099 


15000 


A 


1106 


291 




VSIYVSYNLDSCKGQIRIKSCDDQYIFK 
RLYFFEITFLKFLN*K*VliKKIRDIWLL 
C/YPGWPQTPGLKQSSCISLPSSWDYRH 
VPLHLAPINYFLW 


1100 


15001 


A 


1107 


24 


397 


DIDHVWEFETVFYHVGQAGLKLLTSDDP 
TALftSQSAQITC3VSHRTRPSSGHFNNTP 
EPPEASSLSSHPKLHKSPVT*NGAQL/Y 
GSSKLLSCVUSGP ISLVHSTLRLRKG6P 
QGNISQISLiyiaPP 


1101 


15002 


A 


llOS 


64 


414 


AQLTPTQPQGEflALHKGHV*RKTAPTCL 
FMAEKNQAASFFLFPFPSICINKE/HFK 














KQPGGGLLISKILWAPPPaFFLTREQP 
PPIFF 


1102 


13003 


A 


1109 


62 


475 


FEPLFYLMCLLNLFPLQLPRHPFLFLTV 
DLVWTWGCPLPSSPQ*EWLLAAPHRSTP 
PPLSSGFPARRQLEPGAGARGP/ HHTQA 
LHLSFFFVFLRRSL/DSVAQAGVQWRGL 
GSLQPLPPGFV\ILSSPLSLPSLTY 


1103 


15004 


A 


1110 


76 


477 


EEWKKLYRWRENISNLISDKELIC* IN/ 
NQTFNIQPLKTKNPTKDVNDLKTFTNED 
TQKYTyiHQ*R\SHMKR*SP*LFI\KMQ 
MSTALRFLYPPITMDRIQIPENIKFWRG 
CGILIHSW*TRKMFHSL*KAFWPF 


1104 


13005 


A 


nil 


1105 


3 


KEERSETLAKGKLIAAGA/YN*KEERSQ 
IHNLTLHLKGGKKrNDKPSKR/QE/l*Q 
IRAEISRIRPRKKKEKNNEFVFLKINKN 
FKTHSYIKKKREISNTKIINKRGDSTTD 
ATEMKKITRD*CEQPSSSSSSSSSSSSS 














SSSSPPKKKIPGPDNFTGEFYQTFQEEL 
MPALCKIFQ\NVQKPEGDI/PNPFYQVW 
I YLVPE PDE /QP FVI KTLKELVI YGW/ Y 
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nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
IISSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tltreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














L/K*IKriHEKLTANIILNGEKLKSFPL 
GSGIR*QHLFLPLLFITVLEFLLRIISQ 
EKVIKRMQIGKEKV\KILFTDWRM*WV/ 
ENP /VNSQNIGKIMKENKVSGYSNTHKS 
VAFP*TmEQSAlirKILK 


1105 


15006 


A 


1112 


37 


466 


DRAAEFPTENPLELISKASKVEEFNVSI 
QRPIVFLCISNKQLENEILKIQFH*/PI 
SVASKNSKyGGINLRKYL*DMHIDNK/P 
LNKNRGGLCSWSATSVLPKLIDNFSAIS 
GNYSRFFFFGEIEKIiNLKPIWKYK*VRV 
AKTVSY 


1106 


15007 


A 


1113 


33 


436 


PGLN**CWKNRISTCKRMKLDPNLAPYT 
KITSKWIQDINIRPVTIKILKENRGKGL 
QH*MWQ*FF*I*PQKQE*TNRTKSDFIK 
TAQQRKQQNERQSVEWEKI/FANQISNK 
ELLSKIKFRLLPLDNNKQLT*KWA 


1107 


15008 


A 


1114 


2 


391 


PLTHSRMAIKKETK\NYKCWQ*SGGIGN 
LGNCW*ECKMF*PLWKIVLQFLEKKKEN 
TELP*NPI:iIPILGIH/YICRNVCIQMFTE 

alfiitk/swkqpr 


1108 


15009 


A 


1115 


3 


458 


irdpleeavcpfadlbclhagrtttlfra 
vrqgplsii*klllpfvqlshvprggvyr 

gsqas ls cgglhpvqas *i,lci,ptqasa 

h.:udap\ppa3lppcssisdccaasergs 
vgmgpsepgkgynllvcrllrplekpsi 
rvgvslfsryhm 


1109 


15010 


A 


1116 


204 


13 


P KMKTDSRW ILGLFF * FLRQS L / SSVI Q 
AGVEWRNPGSLQPPPPRFKQFSLENQGI 
PPHGLVYS 


1110 


15011 


A 


1117 


34 


454 


IHISWEFLTSKLYGTLTSQYNKKTNIL 
IK*MGKEFNAHFT*EGI* IVNKHMKRLP 
TSSVIREMQTKTTVGFRSVLTREAGIKQ 
TDS /NQAWVRKWSNSTLLCCWWKHRVLQ 
PTWN\TAWQLLIYKMIWQLHFYAQKDIY 
VN 


nil 


15012 


A 


1118 


83 


451 


CFLKFFLYRELISPFLL*Q\FVRPSPAF 
RRKPPPWVAFFFSDPS/FSFLVFFPKGI 
VFFLGDALKKVLT*KKNFFFFGRD*VLL 
CCPGWSRTPALKHSSCLGLSKCWNSTTR 

P 


1112 


15013 


A 


1119 


316 


441 


FIFFFFLKTNENFFAQVGGH]SrR]!ILG*LK 
LPLPGLKQFSCLTL 


1113 


15014 


A 


1120 


67 


312 


RQI DQWTRI ES * ETDPREYS QL I FDKGA 
NPIQWRK\L FNKWFWHNWTSTCKQ IKKN 
LDIVLASFTKTSTSHRLRENLYRIHI 


1114 


15015 


A 


1121 


27 


490 


GTQLHSREKKNSPFNK* YWEN* ILTCKR 
MKLDSYFIAYTKINSKWIKDLNVSLLGL 
LAKIK*S PKCKIQNY* TSRRiC* KGKDLG 
Q*FLGYDTKCKATE*KNK/LISrKWDYIKL 
KNFCIA\NKTINRMKRQPTVWEEI lANI 
ISDKVLISRICKKTNR 


1115 


15016 


A 


1122 


490 


48 


ehtnnkrdtltvdigkvlvi*tedqtsc 
n/ipsnqsliqskaltlfnsl *aergee 
ate/ekleaskgwfmr* kershlynikv 
qgeaasadeeatasyqedlaeiideng\ 
kqqifnsdkialhqkkm/preksmphfk 
vskdrltllref 


1116 


15017 


A 


1123 


2 


259 


iygqlifnkgvktiq*rkhsvfnrcywd 



595 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO; of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

cor res pen 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T="Tlireonine, V=Valine, 
W=Tryptophan, ¥=Tyrosine, 
X^Unknown, *=Stop codon,/=possiblE 
nucleotide deletion, \=possible 
nucleotide insertion 














N*ISIWKRM\GLDPYLTPHTKINSKWIK 
DLNIRTKTLICSSWRKKKRENFLKKKRK 

KL 


1117 


15018 


A 


1124 


206 


24 


CSIFFCIYLFiyLFIYLFIYLRQSL/DS 
VTQAGVQWHYFSSLQPLPPGT\GLF*RT 
SYSWDF 


1118 


15019 


A 


1125 


3 


351 


RRGRGFTILVWAQTPDLM/HPPALASQS 
AGITGVSHCARPICLFFKRQGF/tfSVAQ 
ARVQWWDHGSLQLQPLTIiR*rYHLS\SW 
DYRRM/PAMLGFFVFYAYPGITLLPRWP 
WISGLNELP 


1119 


15020 


A 


1126 


2 


349 


SEKHKDIGTLyLLFGA*ASVLGTDLSLL 
I RAELGQPGNLLGNDHTYNVI VTAHAFV 
IIFFIVIPIIIGGL\GIWILP 


1120 


15021 


A 


1127 


362 


2 


RVSLHCPGWSQTPELKLSSCLSLPKC*D 
YRCEPPHPAYLTSFITEDSKNFEYVTTL 
F*NNIY*TiyiFIEIPP*VHYIGjMILF*DQ 
LSTFF * S F*N*FFVCLFFRQSHSVAQAG 
\VQ*CDLGS 


1121 


15022 


A 


1128 


1 


349 


I YFLTRSHSVSRAGVQWHS SGS LQP* PP 
RFIOISSPPLSLLSSWDHRTWG/HMPP/H 
SSSFSSSLFLRSGWGGKGVLWS 


1122 


15023 


A 


1129 


367 


24 


IHCVFYPCKGVTKKPWYT/HF*RKGFII 

inifhgpfkltfvcpeggknsetlypfc 
flplsiju:kksrplgvflv*kgplsppl 
qgtfygvgwvgffffsafvifikklvlsp 

YYKN 


1123 


15024 


A 


1130 


61 


299 


IPRVDNYCGYVREIILFSFLFFFKRSLV/ 
SOTQAGA»VJHDLSLLQPLPPGFKQF\SC 
FNLLKNHCYRDVPLLLNTLCI IFKT 


1124 


15025 


"A 


1131 


2 


261 


FFFFLVFSFFLLRLFLLPSPSSPPPSVF 
FFLPSSFSL/CLVSEDLLTLSQTKVQWR 
DLGSLQPLPSRFKRFSCLSLPSS*DYGR 
TPLW 


1125 


15026 


A 


1132 


2 


381 


EFYFFF*SSQMESPSnMaGVHWRDLG/ 
SIAPYASWVQVIILPQPPPPGFESFSCL 
RFPSTWDMEHAPPRLAKKKNKI 


1126 


15027 


A 


1133 


2 


221 


PSLLKKi:iARHSGRCL*SQIiLRNLRNE]SIH 
/CVPAWVTK/GETPPSKKKKKNPPPKGP 
GAQTFETPGFGKPKGKIKV 


1127 


15028 


A 


1134 


331 


2 


R6PPPPLKEPSGVFRIFQFKNLESRGGG 
ERGKQSPPLGPPIRRGLSPKPPPPEKGL 

TVFFFFFFRCKDGVSFCCPG/WISRTPG 
LK*SSCLGLPKCWDYRHEPPHPAPAL 


1128 


15029 


A 


1135 


3 


391 


AVNTQMMRK* KSL lADVEEV* VWI EDQ 
TSHWIPLRQSLVQSKAL/T*RAKPSMKA 
ERGKEAAKGKLEASRG*FMRFKENN\RL 
HNI KVQGDTASADVEAAASYS *DQAKIT 
■ GGCGYTKQQIFNTDETAPY 


1129 


15030 


A 


1136 


107 


350 


tys i vaivmrfyffiifyflrwslawi 
qaccchpgnwhdgslqplppgfhkpscl 
sllss*gyrcpppcqa\ififlvemg 


1130 


15031 


A 


1137 


35 


297 


TLMHYTWRVKEKNIIISIEAEKAFDRIQ 
HPHM/IKYLNTIKAVYNRPTASIILSG* 
NLKAFPLKSVTQQGCPLSPLLFNIILEV 
LSTRP 


1131 


15032 


A 


1138 


378 


1 


PFTKIWLKWI IDLWI KHKTIKLLEDI IG 
ENLDNIiGHDYDILDPTPKA*AMKKIIiIN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

IISSN 

09/515,1 

26 


Fiedicted 

location 
ing to first 

peptide 
sequence 


Predict- 

nucle- 
otide 

correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleueine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine, 
W=Tryptophan, Y=Tyrosine, 
X=lInknown, *=Stop codon,/=possibie 
nucleotide deletion, \=possible 
nucleotide insertion 














VALC*NHESFCSENDTVIRMRK*ATDRK 
KIFAGDMI KDCySKC/NEKAEVNNMKKN 
UP lYKWAKDLNMLL 


1132 


15033 


A 


1139 


271 


361 


KW/ YWNPIHI ISQVCL*GPE lY QHVYGQ 
MTKQRCQGNSM/MAKDSLFNKWC*NKWI 
FF/C/RI*NLNS YLTPYRKI K* KWIEDQ 
NITVTTKLLEENMRPSFAFGIGKD 


1133 


15034 




1140 


28 


340 


LLYFiyKRGFTMLSRLVSNSRPQ/CDPP 
TSASHTAGiaDGSHHARLLFFLEK\EPA 
PGPPAGRKGOTFG*REPSPPGFRGIPP/ 
LPPPGNWD\YGGPPPPKPNFGFF 


1134 


15035 


A 


1141 


94 


128 


GSQMPRHLVD*MTRHLATLRES \CYSR\ 
VYPRFIEFLHFDIQSTGQKSHR 


1135 


15036 


A 


1142 


85 492 


VWVGLLSLEGSPSKFGNFIEFGVLLSSG 

GFSAWRLFFFVYFLRQSL/NSVAQAGVQ 

QWRDLSSLQPLPPGFE*VLKQRGVCLFV 
C?ETE3H3IAQAGTQWCDLGSLQPLSPE 
PXRFSCLRNSRSLRRDIEPSEGNQC 


1136 


15037 


A 


1143 






kmnr*pisikeidfivknlpk\kfgpdg 
ftgilyXhhfkkei iqtvcnlfqkteke 
gmls\iykasiaqipkpkkdkqtkatd/ 

YKPSGIEAKI IKRILaNGIQQYl\HDQV 
RFTPGMQGWSNIQI 


1137 


15038 


A 


1144 


56 


482 


TMKTLLyWQKNG*VDQWNGlESPEIDPy 

k\yiqlvfdrkaktcnv/rkdslvnkwc 
tyakkkktlnlfftpftkitsktylknw 
n/svwfusrtplgktlgdlgfprdfkfln 

TPKnKSLEKH/MDTlJ!irFKjGKICS*K]SIM 
VKKNKKT 


1138 


15039 


A 


1145 


190 


482 


RQGLALSPRLECSGVUAYRSLKLVGSS 
DPPTLaSQIAGITGIRHCPWPKTLFLAS 
VIMPJUniS*PSLSPVPSP\SLPSPPLLA 
SQSRRSQ 


1139 


15040 


A 


1146 


138 


485 


IFFPSV*TIFLLLALFFS**MyLIFVKS 
SLSIYSVYQF/SFLCFWSPISDPKSQRF 
SLLSFIVCFFVFPHHRVLLCHPGWSAW 
* S* LTAVSNS\LLKQSS 


1140 


15041 


A 


1147 


101 


426 


GDWKKFyiYKaSESKSPLILFEKKKSVL 
EEYSSFDI**AIK:ViyHISRKRK\KNQL 
ANLTDAEKAUStKI *HPEMRKTTQQFS]SrR 
SFLHLRKGIYKKPTANMIIKEQIIF 


1141 


15042 


A 


1148 


458 


15 


VFVCI.KI CKHRKGTAKKWYYNFMGLPLY 
MRSWDQNGTM*KMTIYAPNIGAPKYIE 
QILMGANREVNSNT ILLGDFS I PLS TL/ 
DKELPELNFT*NKTDLTDIYRTFHPMAA 
KYTYFAGTHGTFSKMCPVGFDIRVILVE 
FRRSTYGLDW 


1142 


15043 


A 


1149 


1 


507 


KGP PAPP PYYKKKFQGGPKKFPP * SHPF 
KRPRGEDP/YKPRNLKPPWAKKKNPPFP 
KKKQRGKGP KNP P PWKVKPE KS F * P SP K 
KTKIWPPPPPGGQKKKKKPPPPKKKKKK 
KSVSSSPKNI/YRSLFCTNLKKSFPT*F 
*NLMP/TKLPV*EFPPPPYNSF*SAP 


1143 


15044 


A 


1150 


436 


32 


nlcmyvymyliyvetyvciysiyvsiyl 
i dlslyhhhyls i cmcvs i yp i yls li y 
lcisliyhlfnlciyisiyliyls/yvc 
m/ssi*smyvciyliyvcifsislyyly 
lylsiitiiylylcnyclfylssief 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Piienylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 


1144 


15045 


A 


1151 




657 


QDCKIQNEHKIQNTLFLYTRNAHIKIKF 
HLHS KKI LAVNLTKHV* NFYAENYTTLM 
KEKI*IER1IKDLHRKQ\NLNPCIWIVKL 
NIVKIPVI\PHR*RSLTIPIIIPAK\FF 
GDKEKIILKFIWKGKG1R*LRF*K1RIK 
GGLNLPN\LRLTWTVI KIV* Y WWKERD 
R YLNQ \ WNKI ENS /E KL KPRKYVQLI S F 
VFDTSG 


1145 


15046 


A 


1152 


430 


438 


TFWVKKFFLLNLPK* INPGFKKKKNKRG 
G* KRP /SGSQVLKKLRGKIALTPEGKGK 
I RDCFCPPPPPLRKKKKPPIRP* KKKKK 
KRKRDPPPGNHYSNSYRHTSCYFTLLHL 
TLQIFFTSRSPVAMSCRINL 


1146 


15047 


A . 


1153 


104 


478 


ELLLEGSPCLRAESKAGPAGRLVPTLLS 
WRVQSPGVLCAWEEECEEPPSTGEDAPS 
LAIRETQIKFITRMHLSPLSMAIFQTNN 
DN\RTKRK*CWHGCGEIG/T/LLHCWW3 
/CKLTFP 


1147 


15048 


A 


1154 


171 


'176 


NS SDYYYYFLRQSFTLVPQAGVQWHDLG 

SLQVTSDnPDLR*?ARLSLPKCWDYRRE 
p/RMSDSl<rYFLK*VPWI*KARA*T,LFlP 
S S C QAG / ASAGQTLSVS 


1148 


15049 


A 


1155 


3 


562 


pdyqtarrrqapadrcqpgpaaticpr* 
pkarsssmnrsnptvtrragp *pswktg 
aasksqtgdgdrkelippnrtpnn*vst 
aaik*qvrpyltqkvpaa*tfcphqklq 
gvrplcrgpp\rpfgspavtdnrqaklk 
tptq/hpisjdppeqkrpparppvgrhaev 
kqpgppfpagprpapstdgl 


1149 


15050 


A 


1156 


67 


417 


swarplyfpapahftarggpvftskrvp 
sppnrrgwrpppXqfoi&mnpspkflap 
kippt/gplppnkepskenvnpg/plcp 

L 


1150 


15051 


A 


1157 


446 


87 


pdcinviscfltkiyhvpffl* rqgltk 
lprlvsns*aqailpp/sflsqsagitgv 
slhtrpimyfssyyiifgdtvlfXetqs 
hpaaqagvq* cnlgslqppppgfkrfsc 
lglpsrhma 


1151 


15052 


A 


1158 


1 


938 


FFFFIFATYLFNKGK\LTGKRQSD*SIC 
KHLKGYYIKKDILTASKH/V/KKCPTSL 
AIREMQILTGIIKCC*RCEGNGIFTHYW 

* ARALAQLLWKNV* QHI LKLSMC I T YNP 
TILPLGISNRKVYM/CCPK/DPCSRMSR 
AAQPVI 


1152 


15053 




1159 


370 


3 


afkftaaegaamsvtggqpvlnswwlgg 
vtttt/cpttfimpdntv\rgdileltr 
\plgaqvamavhqwlr)ipek*nkhk/lv 
vteedveuiyqkalmljmvqnnkkaagim 
ytenahraidefhr 


"1X53 


15054 


A 


1160 


55 


471 


SPPPGLPKKI FFWPLSFYVWP* PQRFCP 

lfapfkqesnplkvwppfgalqnkeskg 
cspsm/sarqes*plfpknletppfggk 
flkffffff*dgvslghpgwsai/idsl 
qplppgfkq/slclslpas 


1154 


15055 


A 


1161 


3 


467 


gytnqqifpvdktafy*kmpsrtfmare 
ksmsgfkpsk/dragdfklrsmliyhse 
npralknypkstlpvl\khkawmkhfcl 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
cth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

correspond 
ing to first 

residue of 
peptide 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cy3tcinc, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=IVIetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl(.nown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














QPPVKTYCSGKKIPFKILLVIDNVPC3JP 
GALMEMYEEINWNTATNTTSIL*P/VT 

SGVIS 


1155 


15056 


A 


1162 


216 


464 


S SEGPGS PGELPYGAQEFVMKAPQGI LV 
IR*T* FFETESHSVAEAGVQWCDLSLLQ 
LPPLRFNQFCLSLP \SWDYRRPERHPAN" 


1156 


15057 


A 


1163 


29 


433 


AVEFGGDSCSPQHPLNPPLGSPQHSPPP 
LIGICT/REE*GWGGGLPVPCPPALPYP 
S TP SWGG I CVCYGRGGVAP PP*HPPLT* 
LGGGSTPTSALPAPGAPRHPGHPLMVLQ 
TLPWGPHTPARKPYINKVLS CVDF 


1157 


15058 


A 

— 


1164 


486 


11 


stclglpkcwdyrrepprpalrcpffln 
illflkftlseiniptpapf*lvfawyi 
ffs lfs f * p /dleh yi * sgi cvdtm* vg 
sscffnpt/wsissf*lvcldhlyfflf 
ffflrqsl/dsvaqagvqgqdlgslqpp 
ppwfkrfsclsripahfewsra 


1158 i 15059 

1 

1 




1165 


467 


131 


nrslewakmyktydeiklvtnlpt/rks 
pglckitakfyhiykdglvlllitilqei 
qvgfhpns*yq\iilipkysegttkkgn 
crpiflv1iietkilheirai*vhkqirt 

LE 


1159 


15060 




1166 


455 


32 


EEGVLKAKRGCKSGLLHQTVNQWN/AN 
LEKFLKEIKSDTPVKI*MIRK*iraLTAD 
KEKVA*IQDQTSNH IPT.TQSLIQSRALT 
LFNSMKTERGEEAGEEKFEAS RG *FMGF 
KEKSHPHKHKSASKAASADVEAVASYLN 
S 


1160 


15061 


A 


1167 


26 


418 


KIWDYVKQTIIL*ITGIPERGGEKVNNLE 
NIFEAIVQ/ESVPS IFKEVDTQEQEIQR 
ASSSSSS 


1161 


15062 


A 


1168 


64 


313 


KWCKGNSYISKDLKELmiGV/QLK*VQ 
/DL*SENYQILLKEIKEDLNKWKDIPCS . 
*IRRLNIKMVIFSKliIYRFNAIPIKILA 

A 


1162 


15063 


A 


1169 


464 


36 


QQaEAAESLXDPGGRGCSELRSCHCTPA 
WATEQS INK* SINQS IK»KSKLC/CWEN 

tlvkhifhkrltsriykelXqqlnkktn 

NSL*K*EKDVMRyFIKKiyEI)r*MANEN 
ILMKLVIREIQINIiKEWLSWFCCKICCC 
FVLFVF 


1163 


15064 


A 


1170 


290 


487 


GSLPHHTPKR*PFLF/CVFETGSRSVT* 
AGVQWHNHGSLQP* PPGLKRSSHLSLPV 
LIFLFSVEMGV 


1164 


15065 


A 


1171 


2 


413 


GKVFI*FSTLH/GCNMK*SRS\LL*ALR 
FIFIiFTARGLTGIVlANSSIiHITLHDTy 
YM6THFHYVLYIGAEFAI IGGFIH*LPL 
FSGYTLDQTYSKIHYTIILIDINLTFFP 
HHSLGLSGRPRRY+DYPDAYTTRYILS 


1165 


15066 


A 


1172 


381 


2 


PPKLQKNFFFSSTGKFFLGGGRAFSPPP 
KKGFFSQI PERFFFPLPICKKKI YFCPPP 
CFGPPPTFFLRAPPFFFFFFLLLFCSFL 
VGVKFILFL/FYLFIFRDTV*LCyPGWS 
AWQSWLTAALTPRP 


1166 


15067 


A 


1173 


9 


405 


NLDKKGRNRTPQSWFQABTPMASMTFS KK 
KKKKKKKKKGGGALKKKPWGAQKKPGKK 
KKWFFLKBGXEICKTPRGILEKKPFFGGG 
KKGPNPPKKKKPLREKKKF*GEKGEKKP 
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SEQ ID 
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26 
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beginning 

location 
correspond 
ing to first 

residue of 

sequence 


Predict- 

location 
correspon 

last^mino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionin6, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, y=Tyrosinc, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 










4T0 ~ 


0^ 


*KFFFKKKFPPPSKKTP 


1167 


15068 


A 


1174 


FSYYPPPTRGGCPSSPPQIFLPPPLLGV 
FFPFSPLKIFFFPRGFKFFGGWPFFSP 
PKKKVFFKNPRSVYKNPPKKEKK*SLQP 
PG*VWAPPGIFKR/PPPPFFFYIFS 


1168 


15069 


A 


1176 


319 


3 


KEARSVFRVEGRG*KNFSASGPPGGGIIP 
GGGRPGRGNFVFLIKKGGPPPGPKG/SP 
FFDF/GGPPPPPPPSGGGSGVNPPPPPF 
FFFFKMKSRSVSQSGVPDAWADAW 


1169 


15070 


A 


1177 


385 




SAS FGLPKCWDYRHEPTRPASVSVTVTS 
SRLS GA*ARSGKGTLVFWAQMVFK\R I P 
LTKYSDHSREPSSLQLCMQSTPSKAEFT 
VAKADKRLCCTAKSSTAKSIAQIKCQND 
AGRSPQESLHVSGRV 


1170 


15071 


A 


1178 


147 




KLKEIKKLLEENAGINLYDLRLGSGFLD 
MT P KAKQQ KKENLKWDV IRMKN SCAS KD 
TITFYN\YRSDKGLV/SKKYKELNSITE 
RQPNF*KDLNKDFSK/EQMAKST*KTTE 
RLFIREMKTKTT/ME/YHFLSTRMAKI * 
KDSNR/RLCRKTGTLIHCW* GTAPANLL 
KNCQPLFPF*GTVWQFLKRLNLELPDDP 
AIPPLVICPKEMKT/C/CYTEICTQMFT 
AA*III 


1171 


13072 


A 


1179 


1 


398 


SRSRHCTPAWWVTLVLKKKEKEKEKEE 
KKRCPCYIVSHQSL*KPAWQFL/RNVK/ 
LELPYDPAI / PLLGKCLKEI * KYAYTKT 
CM*MFIALFIIAKKyKQLKRLSTDEWIN 
KMWYIHAI 


1172 


15073 


A 


1180 


3 


389 


GYDRVIPNP*PLTGA\LRALLLTCGLAM 
*FYLQSMWVLIliGLLTDTLTIYHRWSDV 
ARKSTYQGHHSPPAQKGIiEYGIMLYMTS 
EVLLLGGLF*AFYHS\SLCPTPQLGGHW 
APTGMTPLDPVEVPLLNT 


1173 


15074 




1181 


354 


2 


ANSSDACILDGHRALPLQLPPCGYSISP 

AAQSSTVPMPLLLIPPPHCNRTPSLWHY 

SPASNPTNPSPY\-'TAPSPPPPIPRNNP 
FS * KFPYYVWVY-SLT'-CSQPPVSSPNF 
TQPSVL 


1174 


15075 


A 


1182 




3S4 


GANVSG/DL/KLKPVLIYYSSNPR/ALK 
NYARS ILPMLCKQEKKEKKKK\AWMTAH 
LFTAWFT*YFKSTGET/CAKEKI PFKIL 
VLIDSACSHPEALMEMCKEIHWFMPVN 
*HS I / LFCMQPMDQGALST YKS 


1175 


15076 


R 


1183 


1 


410 


PPLP*KFFSPGGVQS*EGGRFWPPP*VG 
VLPSSSP/I IPFTPGNWGCFSPFSP*KF 
FFS PKGFIFCGGVGPFFPPPKKRFFSKI 
PPLVFFS PPFKKKIFFFPPFVNFGPPRV 
FFKRPPSIFFP 


1176 


15077 


A 


1184 


406 


62 


PPI INMLCSLAPPFFSPPPLRGVLPPFP 
LKNFFFP*GVYFWWGGGPHF/SPPPKKG 
FFPKFPPWVENPPPLRKK/YHNFPPPGK 
FGPPRVFFKAPP 


1177 


15078 




1185 


337 


383 


RSYISFQK*VKDLNKHFSKDDTQMA1I/K 
HIBCRYSTLVTREMKIKSMTRYHFTPTRM 
AVPF 


1178 


15079 


A 


1186 


3 


471 


LAPSDK* ENRLPGKRPL* EVR SPSflRQP 
PHLRSEE PLRPAATPSGK* GASPPSSHA 
VEEGGGGQPPPGQPPRPGGEGRLCPAAP 
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beginning 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

Glide 
location 
correspon 
ding to 
last amino 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoIeuclne, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TGK*GAPLP/EPPPGLGG 


1179 


15080 


A 


1187 


2 


406 


FLVETEFCYVGQAGLELLTSRDPPASAS 
KjGAGMTGVSHQVQPQ**S*LWT*/PSSV 
EAGTSPGLSFLSSSWALSAQEGCLAVPS 
/SGSRGLLVGliLLLWTKPSPQLSPVPAS 
QRLSSLSLMPPLPQPQHLTHTSIET 


IISO 


15081 


A 




160 


459 


NFMTIDMLCSAWIHFCSSGIJ3FQL*KL 
FKS Q * ENNLINKWAKDLNSFTI FS YG*K: 
LAHEKMLNIIREIYFTAIMS*YTLTRTV 
KVKKTDTKCW*GCTATGNLIHCCKNV*P 
LRNTAWQ\FHKILNIYLPYHPAIPLSDM 
NSRE *Kin /CIHAKMCftWIFI 


1181 


15082 


A 


1189 


232 


2 


GI *W/PGV*WHHH3SLQPQSPRLK*SSC 
LS LS SSWDYRYVPPHLftNLKK 


1182 


15083 


A 


1190 


2 


402 


PRVRHaSGSPSPPPPPPGL\SHTSPSQ* 
VFSWPSY*TPCLSALTLRSVLSLL*QRS 
PRTLPITNKC/DP/PASHSSCRIPAGL* 
ALGRQQLFSCFFCPFETESHSPAQAEVQ 
WYNLSSAQPSSPEFK*FS/CLSLPSS 


1183 


15084 


A 


1191 


19 


390 


WCVPAVPATWEAEPRRSRXRSKPLTGRQ 
SKSLSY/NKKKKKKKKKKKKKKKTGGGP 
* KKLLGGPKYKGGKKKKFFFFKGEKKKS 
LGGI LKKKTFFWGGKKWPTPPKKNKALK 
GKKKFLGGRGGNPP 


1184 


15085 


A 


1192 


485 


112 


QRPDPR\SAEflAIKYFLTQATASIILLI 
AILF1SINILSGQ*TITNTTNQYSSLIIIM 
AIAIKLGIAPFHF*VPEVTQGTPLTSGL 
LLLT*QKLAPISIIYQISPSLNV 


1185 


15086 


A 


1193 


55 


385 


THAFADAWADAWGLFKGILTENFPNLQK 
YIHIQVQGQ*RTSSRFNTKKTNSRDLI I 
ILPKVKDKLSKKKKKN\AENKKILKYRG 
GPF* ♦KPGGAQFYGGGRKSFFFFFGG 


1186 


15087 


A 


1194 


408 


3 


CQSA\LLGGASQLG \SRGSGVRDPLEEA 
VC PFSDLQLHAERTTALLKAVRQGHLSL 
PRLLLSF\VCLCPAPRGGACRGRQASLS 
CGGLHPVRASRLLCLRKRAWAMASVPPP 
ASLP'PCSLISDCCVSNQ*DSTGRV 


1187 


15088 


A 


1195 




371 


LFSTNHGDFGPLYLFP\A*AfiVLGTDLS 
LLIRAELGQPGNLLGNDHMYKGIVTAHA 
FVIIFFIVIPIIIGGPGN*LVPL1IGAP 
DVAIPRISNISF*LLPPSLLLLLACAIT 
EAGAGTG*TVY 


1188 


15089 


A 


1196 


76 


404 


PTPLRTHDQSSKVSRYKVNIQKVGAFLY 
MLSTRTTGI * \ IKSTSFPFASPKVKYLT 
INLTKCVQ/DLWKEKKL/NEIKEDQNK* 
/WPFSWIKGPNIVN 


1189 


15090 


A 


1197 


50 


334 


I LHMVS I P S I S I Y* HLLPAG * AGTHIG* 

LPPA* ffwvmg/rdrvllchpgwnawq 

S*LWASNSW\VK*SSHLGLSKY*D*RH 
EPSYPASGTKLN 


1190 


15091 




1198 


250 




qyylvssnt*siidflqlpqkciftvgf 
sk**cicgfifkni*lff\fclfesesc 
svaqagvqwqdlss*qplppvfkqfs 


1191 


15092 


A 


1199 


2 


402 


TDARHHTWL I FVFLLEMG FHHVGRAGL G 
LLTSSDPPRPAKLNFL*R*GHAVM\CPG 

WPHE Vtclglpkcwdyrce /htapkphh 

FLFFFFFSEKNFIPVPRRGGG/WKNLN* 
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sequence 


SEQID 
NO: of 
peptide 
sequence 
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od 
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NO: in 
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Predicted 

nucleotide 
location 

amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last^amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MEPPPPG*KGFFCPGPPKYWN*KAP 


1192 


15093 


A 


1200 


50 


437 


STLYLEREKEQISLSVIVSSPLPSTSTG 
PTAAAQSSISGPSLPLPPHNGAGDAPAG 
LG/YGQGPSGPPWGPS/SGPSPR*ALVC 
PTDPSGAARGGRGG \ SRGSCCAPAGPAG 
LGDHRPGGMGEGPAAPPKSS 


1193 


15094 


A 


1201 




313 


PPPPGGYPHFSFXXXXXXXXXXXXKSLL 
XXXXX*NPPHSPPRAPGGPP 


1194 


15095 


A 


1202 


105 


485 


VQQTTMARIYVCNKPARSTPRFTKQILL 
DVL K/YIDTQTIILRDFNTP /LNS VMS W 
RQKTNKDNLDLNLMLGQLDLIDISRILH 
VSTR/YIFFSSLHEIYSKSGHMLSHKAC 
HNNF* KIEIILII LLYHC 


1195 


15096 


A 


1203 


1 


476 


PHFGPPPPPQDPRF*G\GPSTRKGFFPQ 
PPGFFPQAWPRAGQPPPGGT/EPVFPPP 
KHP*RRNPYPKFGSPPLPGKIHQGSPGP 
SGQFR\PPRWGKKRAQPTFWAHPPPIFF 
FF*GVLLCHPDWGTVARSR\PPQPPPPG 


1196 


15097 


A 


1204 


1 


269 


WEFETYNLGIK/WSKDMKRNFTKEDLM 
MADEHPRRC\*HHLTPPRTAQVKLELEP 
NE/WQGCGEAGPATPCWGDAAPVQPLLQ 
IGRRFLKK 


1197 


15098 


A 


1205 


459 


42 


KLKPMLIYHSENPRAFQDYAKSMLSVLY 
K*KNEAGLRAHLFTAGFTENFKPAVETY 
CSE*E ISLQI *LL I /GHPR/SLMEMYKE 
MNWFMPANTTSILQSMDQGVALTYEYH 
YISNILYEAIDNTENPLIDLGKANAWVG 
PG 


1198 


15099 




1206 






EMKTLTEMSSPGMPTEKVSELED/DIN/ 

EMHRKLQCREKR / 1 KRN/E IWVQEL * YH 
YmCMHir/MRMPEKEGRKEEIFKVIMaE 
NFPNLATm-TKSia'KEAQRTPERIKTIYI 
YIYIS ^-LQItAKEKESILKETCQKTSKPK 


1199 


15100 


A 


1207 


7 


383 


ldigcfwwedkysscyfslatsgklkyf 
sfsclihcflvcvcvrahk\ci* seas* 
cvcvhk\ci*ssas*cvcvcartngi* / 
cqcflvcvcaqv/cn:*ssas*cvcvcts 

Gl * /CQCFLVCVCVCVC 


1200 


15101 


A 


1208 


405 


214 


AEAGRSLCPGGGGCNEL*SPPCPSAWVT 
\SETLSQKQNPI1HHKKKTGQVRGQSSYL 
PLWEVHE 


1201 


15102 


A 


1209 


384 


"2 


GVTHH/ARARFFC/LLDTGFHSVFQAAV 
Q*A\NHGSLQPQPPGLKPSSCLSLPSSR 
DNK 


1202 


15103 


A 


1210 


403 


2 


vps iqtngksltyfdfffkqlfktklkl 
dasfvbfvcvnvepfrtfalsktqsllcl 
slfllif*tincnfcflrqn/lrsvaqa 
gvqwrdvgllqplppefkqfc 


1203 


15104 


A 


1211 


156 


405 


eskqmliqcksqmlyc*ri*tv/clcvc 
vcvcvcvcvcltcgticrvcivflfiwl 
nck/plcepcc*cnrqgwapwllpvrst 

RW 


1204 


15105 


A 


1212 


70 


408 


KAFS LLP PGVS PG PLCNPRRPNF I GGGP 
KKKPPPPPQRWTQGFFITPPPFFFFLWG 
G*GQD/PPPVAQG*MDPPPRPPK\GRGP 
RHEPPPPAF 


1205 


15106 


A 


1213 


410 


3 


KKSMPGFKASJCDRMTLLLEAQAAGDF\W 
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nucleotide 
location 

residue of 
peptide 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 















LKPVF I \ YHSENPR I LKiSIYAKYT / PVLY 
KWSNKAWMIANLFIA*FIKYFKPTVETC 
YSDKKIPFKILLLIENAPDHLHALMEMY 
BCEIHAVFMPANTTSILQPMDQGVILT 


1206 


15107 


A 


1214 


3 


340 


KIRVEVNKIVIYNRKLLEEIKKIKSLFFEN 
INKIDQFLPRLRKTRQKMQINKIRNEQG 
GIS IN IMEI KYSYKEML*AI I CKKIiNNL 
HEMNKFL/EHRVPKLTQVE lENLNYI YK 
K 


1207 


15108 


A 


1215 




57 


RGLKI QNVNE I KS * CFEKI KI DKFLARQ 
NFKRKRFKFRD\*KGDIMADITEI*GII 
RAYY*QL*GNyLENLEEMGKJ'LVTYNLP 
KLNH/D1ENLKXPVTT/3EFKSVIKSLP 
LKKSPIiHDSFTAQFYQTFEEE*VSVLKR 


1208 


15109 


A 


1216 


350 




QSSFFSIYPNFHLLSFLFCFKDFLFFYA 
SVLATNSLPLFFPQI SRXFPX PFERERE 
RERENWL\CCLGWSAVMQS*LTPArJ!IY 
W\VRQSSIiLSLQAS*SYRCGPPHPANIF 
HFIFCR 


1209 


15110 




1217 


139 


358 


KNTFFVWEAICRLFVPQAGGQGGNLS FK 
FPAPP/NLRG*SASSPSSGDYRPTPPCP 
ANFFFFKKNRI SPWGSGW 


1210 


15111 


A 


1218 


361 


50 


WGDHGSLKPQPPLRSKNPPPLASLVTGT 
Q*r.G/HPVaPFPIQLIFFFFFLRRSL/D 
S lAQAGAQWCDLSSLQAPPPRFTPLSCL 
SLKKKQ!TPVFVITWMNLEDIMLS 


1211 


15112 


A 


1219 


1 


361 


LKQHVSNVEKTAFYWKTQSKrFVARE*K 
GEILSSNRGECLLQSFKGQPIiLLGANAA 
GDFKVK\PMIJ,DHSEN/PKAI.KNYTTS 


1212 


15113 


A 


1220 


292 


362 


HIPWPAIQQAEVGGSLEPRRSRP*PPR 
LK*SSHLSLLNSWDYRNVLP/RLfllIFCI 
FLCPVRX FKLLGSNHQNASVSQSARITG 
VSHHAWP ILKLKKLFVCLFEMGSHYVAR 
AGLK/PPGLK*SSRLSLQNCWESRREPP 
HPDTIH 


1213 


15114 


A 


1221 


372 


3 


RLFFLLP PRPKGDFFPTLL IW * RHGFS P 
PHVFKPPPLNLILGALKKKFFLPS/LPY 
VKFYFFKEAPLFFF/IFFFRDRVSLOJS 
*PPGLKQYTHLGLPKHWDYRHESACLAN 
NHNNNHTNFFFETE 


1214 


15115 


A 


1222 


3 


291 


RSGDQDCPGQHGETLSLLSLTFVNLSLI 
aiL*TLSLIFVKIQKLAWHGGVEL*SQL 
/LRRLRQENHLNQEG\DCI PAWATEKDS 
VSKKKKKKKGGPF 


1215 


15116 




1223 


379 


19 


HMQILTIMRYYTPIRMAKMKKIGYTK/C 
W * GCAATGYI \ WECKMVQSL/WQNTWAD 
S*KLNTHLSYDIAIQFL/GFYSKEIKAY 
IHTKPCT*M FTATL I IKSQ I/MKQYKCL 
STDKWIKQNRGIYI 


1216 


15117 


R 


1224 


387 


40 


KKTLSTP \PEKHPTPPLFHPPPQKGGKA 
PPSFKPPPPKHPPPPKGISPPPP*1PPP 
PIYPPPPPAKPPPDFFL*SPPPPPPFSP 
PPPP*TPPPFFSPPLFFFFFFFFFLLIR 

LLV 


1217 


15118 


A 


1225 


21 


477 


IVSRLITVHLQKPRLDPRVRPRVRRKEN 
YSSV/SFINIDRNVLNKIL/SSQIQQYI 
KKLIRHE*FWF/IPGIQGWFNILKSVIV 
TYVNK\QKWKKHTI I SVDTEKAFDKIHY 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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od 


SEQ ID 
NO: in 
USSN 
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Predicted 
beginning 

location 

ing to first 

residue of 
peptide 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon,/=possible . 
nucleotide deletion, ^possible 
nucleotide insertion 














LALILIKKKKKKKRALPQSFIPGDLFKK 
PNGGFPGLKMMGRAPGE 


1218 


15119 


A 


1226 


1 


398 


ERINHTLFFLAEAQKRLPTGISGRGRPA 
IPHNTPQRAPDHB.YLPAALAaQHRAGGQ 
QAPPPG\SSPSSPYDEVKDREGDVTASH 
GLRGNGWGSP*ATSLVLIilNLMYMTAKYG 

DEVPGPSMENAWNAIAHWEKW 


1219 


15120 


A 


1227 


3 


23S 


nAVn^AGHnG/R- ■P*SQLT,H\'L,RWRHRFS 
SGDRVSDP*SCLCTLAWVAE*DSSSKKK 
KGGPFKGTKFNSRGGGRKYFFYGA 


1220 


15121 


A 


122S 


207 


2 


NRVSPCCPVQWHDHSSLQ/PRTPGLKGP 
\SASAF*VAGTTGVHHHBQLIFHFFFFY 
*DRVIQAGVQWRNL 


1221 


15122 


A 


1229 


15 




RKSVNVllHHSEILKKKNXMIISIDTKK 
SFDKI*YPFMTZLSEasr*DSUSrLIHNIY/ 
AKP/VANT riiHSKEUIAFPI KQGCLQGC 
\PPFLFNIILEVIiASTKGRKEIKSLQIR 
VEE1/KPLSLFADNM/IVYIE2IPKES 


1222 


15123 


A 


1230 


363 


1 


AADPFACEPVSPHPPLYLVTSSQSLLTS 
VSPKKQPPSAGMQLGVGDSLSGWGWGRT 
KKTRLFFFKTRSLSAAQTRRQWCEHRSQ 
P*PPGLIKRSSHLSLPSS*NHRHMPPHL 
AN\LCLFV 


1223 


13124 




1231 


3 


402 


QANSCIFSRDFGSVGQAGLEPLTPGDPP 
ASASPRCRDYEC\6PHRQLLLTFC* I S I 
LILV/CIS*MKDYFITCIYFFNYS*QII 
FYRRASDFFPFLRQGLALSPMHDHGSLQ 
PHPPRLN\HPPTSAKGVTGT 


1224 


15125 


A 


1232 


3 


396 


FLSQHGFLFLFFAGIDKLILKFIWKDN* 
NNYEKED*RGVITLP/QYKAYSVATVI K 
MVWHWQSDKHVDQWNREPENRPNYYMSQ 
*FPL*YVSQENENLPLHKNPYMNVHNGF 
I CNSRKQSRYPSTGEWVNKL 


1225 


15126 


A 


1233 


3 


395 


LPKCWDYRR* PPHPALF / LFF * KHPKF P 
KMQVK/WRKH*TENRLVMYFSEKFGD*L 
KDQHKLYYLDMAFQFIFTFTYVMEYS IL 
FKFNIR*LTLSSVKMAVFLVETGF\TML 
ARMVLI S* IiRDLPTSASQNAG I 


1226 


15127 


A 


1234 


416 


2 


KIDKLILTFIWKCKRPLLAKIVLKKKNK 
I*RLPLPSFKTYGKGMV/TK/TKWGWPK 
NRPIKGTGF*VQKKKTPHIYGQLVFDSG 
ARTITN/WY*DN*IST*KRMKLDLYLTI 
YKINSKWIKDLNVRAKTMTFLEEWIWVI 
LD 


1227 


15128 


A 


1235 


384 


22 


FFQEI*NAIPVKrTQMVRKQNSHPANIEK 
VL/WWIEGQTSNNIPLSQSLTQSKALT 
LFNSMKA/E/RGDKAAEEKMETCRGWFM 
RFRERRHVHNIKV+GEAARSCGSLAAGS 
PLKHLLLHNTLIC 


1228 


15129 


A 


1236 


379 


2 


SPYTKIHSKCLKGLNVRPLTFKPLEENP 
G/VMVPDLGPGKKFIS *APKAWATKTKI 
IQWGTIGLK/AFCPAKEAIPRVKTWPSE 
S EG I PAGHAF YEGF I FQI FKGLQPFHSK 
KKKDLILKLDLLSADIS 


1229 


15130 


A 


1237 


1 


407 


YRVGQAGLKLLTL*SAHLGLPKCWDCGR 
EPPCLTIIAL*SVFSLVIjPAVLrKLINF 
F\CRDGGQTMLPRLVSNSWPQMIC/LPW 
PPKVLGLQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


iM 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 


Predict- 

location 
correspon 
ding to 
last amino 
arid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Lcucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1230 


15131 


A 


1238 


3 


380 


elsqrtccmd imsytkaivitmawy* fk 
dkqrvqqne ve s pktll yi yrlw i yyeg 
dttdkg*tf\nkwywdswi/st/ckkm* 
fdyhftqcikin/sqwm 


1231 


15132 


A 


1239 


1 


396 


FADDMIVYLENLKDSSKKLLELVNQFSK 
VSGYKSKV/NVHKSVALLYANSNQAE/Q 
IKNPTPFTIAAKKQK/YKQK]SffirLGIY*T 
KEVKDLYKENYKTSLKEIIDDANKHKYI 
PCSWMGRIDIVKMTILPKAI *RFS 


1232 


15133 


A 


1240 


392 


58 


SFSMLAIRLVSNSR/ PSRVLPASftSQSAE 
IIGVSHYAWPSKLSF*LTIDQTHLSCNL 
FI YVFFERRS CSVTQAGGQW YGHS S LQ P 
*TPGLKQSSCFGLPKCWDYRLEALPRLM 


1233 


15134 


A 


1241 


500 


204 


SLSLSFS*DGVLLCLGWSSTPGLKRFS C 
LS mSSWDYRCVPS SQTNF \ VFLVEMGF 
HHVGQAGLELLTSSDNARLGLPKCWDYR 
RVPPRPAAFFLFFKG 


1234 


15135 


A 


1242 




397 


NFMINNLPKKKALCPWFTGEFYLPFKE 
ETIP*VIRISLSL/IFSIIFQKIETEGI 
LPNSFYEGCIILrSKPYKDI*\ENYRLT 
SVMHIDARFLDSIIiRISrPIQQCIKVrHCS 
HVAFV/SGTQDWFNIQKSM 


1235 


15136 




1243 


12 


362 


AGFYHVGQDGLKLLTSSDPPASASQSAG 
IKA*ATVPGLSPIiNFCEVRFMESWSWKR 
P*RTS\VCSALAKCWDRNASQWDKLPS 
FPT/RM*SPCLHPAVSAFCGSASVLPGI 
IVTTSV 


1236 


15137 


A 


1244 


183 


383 


KQAGRG6SCL* / LPRCWDYRREIiHLAF 
MP6* LKK/ FFCKDGGLTMWPRLVLNFWP 
PVILLPRPPKVLG 


1237 


15138 


A 


1245 


410 


2 


FSRDRVSSCWPGWS /PNS * PQSDQLALK 
CWDYRCEPLHPAFC*MNFLKVSC*HGTS 
/DSKY/CQHVSPKNKDISLOIYNirilP 
KKFNIL*YIWFLDFFFFETEFCSCCW*D 
LGSLQPPPPGFK*FSCPSLPSSWVHRHV 


1238 


15139 


A 


1246 


440 


46 


KTEGANINKNTTYQ/NLWDADKAVFRRK 
FRALNAYTRK*ERA.QIjNLSSTLKTRKKE 
QNKLKADRR/QIMQI* *KLGKVENKQTK 
TIQQIH*TKSWFFEKISKIYEL*QA* *R 
KKGEKTTITNVEUKRYSGRVG 


1239 


15140 


A 


1247 


266 


340 


RKCW PGAVAHTCNLXTLGGQGGRI T 


1240 


15141 


A 


1248 


3 


391 


DAWADAWGSRSRAVALFFFFWGFLGGGG 
LKTGFYFI PQVERRGLNFD*WWPPPPGL 
RGSSPPT/LPKQWEPGGGPPAPSNFWFF 
FEKRGFPQVTQAGFKLWN*GDPPAGPSK 
GVGITGGTPSPHPLFLKKR 


1241 


15142 


A 


1249 


3 


323 


MHHHA*LVLVFFCGDICVSLCCPG*S*TP 
DLKRLSHHGLPKHWDYRC/RATTPGFFS 
LFFFFFFNPPEFLLPWAPPSYSLKQNKI 
FLKR FLE PLAPP KGLP LKRAKGRI 


1242 


15143 


A 


1250 


3 


405 


QPDSYSPQPGRLSAPPEQEGGPWLILPH 
ACAP S Q I CVGPHGAS I LS YQERKGTKVL 
SCEGHCKLSSPVGLVGQSFCWQQPDAVQ 
WVP FRRRTQTPGTVAHACNPSTLGG* GR 
RITS/RPGVRDQPEQH 


1243 


15144 


A 


1251 


407 


2 


P FLKKTKKELP * GPAI PALGVPSFFP-FF 
FLPYKRKEIKGIRTPPMRIAALFTIAKI 
LNQPKYPSVni*LHKRHTSNTTLLSRMK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

iociition 
correspon 
ding to 
last amino 

peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G)ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vallne, 
\V=Trypfophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \^possible 
nucleotide insertion 














H*WIKKT*/HTDMMEYySAIEKHGILSF 
AATGMSPKDIMLSEICQAQKNRY 


1244 


15145 


A 


1252 


1 


253 


VADVRESLDPGD*GCSELRLCHCTPAWA 
T\SRTTSQQQLKKQNEMTT*KTYGIQLM 
MCQIANYCSKLLYFRKMSNLQSWLKKKK 
K 


1243 


13146 


A 


1253 




372 


KKKAQKSGTI VLP CNPS YSGG * VGENA\ 
WARSQPGLI S* KQQ *QQQNQQQTTSKTD 
PYIYG/YLIHDRGGK\NTQGWDELLNEW 
GWGIWLMTQRKIKLELYLTPYTRTNSTW 
IKDLNVAGCGGSCL 


1246 


15147 


A 


1254 


2 


410 


KNIiTPIVGnKftATN*ESIPPQT/SRKKK 
KKKKKKKKRGGALKKKKFKGGGGGKKNF 
FKGGKKKKLGGGVKKRGEGKKPGGKKKK 
RFGKKSFFSRGGKKKKNRSSSSSSSSSS 
SSSS 


1247 


15148 


A 


1255 


61 


487 


GRPGPTHAFWIDSFFMGNEARFINHS C 
DPNCEMQKWSVNGVYRieLYALKDMPAG 
TELTYDYHFHSFNVEKQQLCKCGFEKCR 
GIIGGKSQRVNGLTSSKNSQPMATHKKS 
GRSY/RE*KKNKKKRGGRFKGSQFSSPG 
MQG 


1248 


15149 


A 


1256 


118 




MFIAELFIIA/RRWK*PKCPLTDEWINK 
M*YSHTM*HY 


1249 


15150 


A 


1257 


390 




TQKNTPTPKPKPTSQQHHQKNKPTPPGF 
FFFSPPGK\GGFFPSPLFWVPPGFFPPP 
VFKTRPPEF IFGAP * KKFFFS PPRSLNF 
FFLRGPPSFFFFFFF?S*VGKEGSSPSH 
ENPLFVPTEGW 


1250 


15151 


A 


1258 


378 


3 


GAFFFFFPPPKKGFFFTPFEFFGPGFFF 
S PPFFIPPPQIFFFGPKKKKKFLPPPPQ 
KIFFFLRPPPLFFFFFF?FFFFFFFFFF 
FFFL\DLD*NVLSGTSAPEKNQELLGMV 
AYACNPSTLGGRGG 


1251 


15152 


A 


1259 


462 


287 


RDGXFHHVGQDGLHLLTS* STRLSLPEC 
WDYRCDLPCPAIPAVTLYQIQYRPLGLE 
SKA 


1252 


15133 


A 


1260 


446 


34 


NVRAET/ iktlevntgvnlhdls*/gk:a 

FLDARPKAQVAKEKSRQPSLHRRLR/FC 
ASKDPIKEVKRQPTE/WEKINHVSDKGL 
EFKIHKELLEKLS]SHPI*KWDKDIJ!IGHF 
S K/ ELQIASKHMRRCSASLVIRKCNEIV 

DPD 


1253 


15154 


A 


1261 


2 


383 


GSQRKWFLEMESTPGDNAIDIVEVTTKG 
FNYYMNLVDKAVAGFERIDSNFERSSSV 
S KI LSNS I ACYR* I FCKKKSQLMQ/QTS 
LWFYFKKLPQPPQPSAATTLISQQLYTL 
RQDS P PAKRL * FTDGL 


1254 


15155 


A 


1262 


476 


50 


FFFFFSDTGSHSW*AAVQWHDHDSLQP 
*PPGP\SDPPTKSSTREF 


1255 


15156 


A 


1263 


143 


472 


TGAVPIRPSWN/RPPAMIFF*NAQGILF 
AE FLRS QRAI LWE FFE KAYHES VLRKS A 
KGLAEKCPGKLHQRVLVQYDNALAHFSH 
QTRTTKSSTRQFR 


1256 15157 


A 


1264 


179 


489 


YI FFFLSLFFFLWPRPEYRGAITVHCSL 
NLPGSSDPTASASSVAGTKPH*YIFFFL 
SLF/CFSVAQA*VQGGNHSSLQPQPPGL 
K\YPTASASSVAGTKPHPLIFFFF* KKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ]VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codan,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FL \ C F VP QAGGQGHDFG *L*PPPPSLKQ 
GGGLTLSRIGDYRGPP 


1257 


15158 


A 


1265 


1 


536 


FRGGWGSVRRAPGTASCYLAIiAGGPPGQ 
CPPAAISCPSPLPWGS*TE/PYVPRRPG 
DPSAAP/PSW*VPRPGVPRNVPGRGRVG 
QDG CGQAQD I QAAASLMADTAPDSPGSA 
• GSVRALPACVPEISGS/SGLPPGAALP* 
VAA/RPTPGRHVDTHPRQHSFCGGQEGD 
IRAFPTLYLEVYPGPP 


1258 


15159 


A 


1266 


403 


2 


TSLHPRGYMRLLRQGFISAPCGYMRLPR 
GSSLHPRGYMRLLKTGVHRCTPWVHAPS 
QTGVHLCTPWEHTPS+RFISAPLGAASH 
VT/GSLSIQHIYIFVFCLLRQGL/NSVT 
QAGVQWCNLGSLQPKLPQAQVI LTK 


1259 


15160 


A 


1267 


2 


401 


FVLNPGGRSCSEPRSLHCTPAWATRAYL 
QLGKKKKEKKKERGEENKNQGP PRPLKE 
EVGTPGQKKP L/WGGSKGAGQKQPA/QK 
KGGKKKAnHKGQRGKN*KRKREGGGSTS 
KNNSRGTGAKA*NPTIWGGGGKKI 


1260 


15161 


A 


1268 


49 


416 


LRGRALDPRLLRECGDLGAPPAPEVALE 
AGTCWTR/CTL*APPRGa/DRSPWPPRS 
PMCKAG/DECQDGIPG\MKAWSCGLRTR 
QCPWP*PKIjPCGPGHPAWRT*PLPQTA/ 
CGPGPAAPCRG 


1261 


15162 


A 


1269 


420 


47 


GPPGWASFRrj!IFPKARR*GEWKTPGESG 
GAFFSPPGKNF\AGQN*GRPPNPPPPGP 
GKGGIQTRQGAGLGKNPFRFWGGFPNPG 
NK*GGGTKKKEGPPPPLFFFFLNTDFCN 
LIKRDHLGVGAHL 


1262 


15163 


A 


1270 


404 


1 


AEILELKNVIDILKNVSESLNSRIDQAE 

SRISELEDRLFENTQSEESK*KEI*KNE 

ARQQDI,ENS?KKACT.,RVIGI,KEEVEREM 
GVESLFKWIIIENSPNLEKHIHIVQEGY 
RAP \FNSNKKTSRHS I INSHTK 


1263 


15164 


A 


1271 


387 


2 


KKEKKKEVI II FVLLLPLK*FQNIQVWL 
VRELEKKFTGKYW\FA*RKILPKPTQK 
RCTKNKQKRPRIHAPTAAHVAILEDSVF 
PGEIVGRRIRVKWTQQLTRVHSDKAQQN 
NVERKVQTFSGIDKKLR 


1264 


15165 


A 


1272 


1 


393 


FRM6RLPGGGAPHFSDGVAGQHRSSPHR 
RSRGRAEALLTSQTGRLGRGAPHISDDG 
RPGRDAP\PS*TGWQPGHGAPHFPD\GQ 
PGRGRPHIPDDGRPGHDAPHFPYGVAAG 
QRLQSRHFGRPRQAAGRWRL 


1265 


15166 


A 


1273 


3 


317 


SSYETKGVMIASFSSREADNHTAFIRIK 
TWASDSTEFI IIiPVEVEVTTG*WKTDE I 
ESCFVFSIAYSGICVKNNC*LFFFP\APG 
lYSSTEMUJFGTIiRTQGKKIF 


1266 


15167 


A 


1274 


92 


368 


LCPPGGRAGQGQNLN*GVPSPPGPKGFW 
GPTYGGGGGKRTTRPGGGKPEKSFD/GP 
PGPPPGGTKGHPAPKNFFAQKERGPFFQ 
NGGPGNKLF 


1267 


15168 


A 


1275 


401 


3 


DMCDWFKKEFSDTTPKA*S ikekqlts f 
ikiksscsps\kntikgltrqeklwgki 
fanhmsdkglvshiyeelskln\k*kln 
f * kwakq * dhllkkth* *qickeryslg 
kgkitmryhstsiimciiktihs 


1268 


15169 


A 


1276 


298 


426 


GATMLVLLF*LAFDLRQSL/NCFPQAGV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, I=lsoleucine, 
K=Lysine, lJ=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QWRDLGSLQPSPPWFK 


1269 


15170 


A 


1277 


421 




VSFKDQKL*INKENHPIKKWKKICNQPL 
PQIRYMDANKHIKIFSTSIAARETNMKI 
TT/RILWKTVWQFFKKVSIYLPLEPAIS 
SYLFTQEK*KHIDGSCSPSRVATAKLLA 
LQWRRPSWNCMHQRAGSWKQIGAPPFSK 
L 


1270 


15171 


A 


1278 


1 


218 


TRSGVQDQPNQHGETLSLLKIQKLASYG 
GVCV*SQLLRELRQKKCL\NGPRSHHCT 
PAWAT\EQNS I * KKKKG 


1271 


15172 


A 


1279 


88 


396 


TFLFSSSFFFFGGGVPPCPPGLRPG\PN 
FaSLHPPPPGFGGFPSLTSPRA*NWGPR 
PPPQATLGAFGLEGE*IFPPGRPPEP*L 
WGPPPPPPQRGGMAWPTPL 


1272 


15173 


A 


1280 


94 


399 


TDFLFL*TDFLFLCLCSLKNKIWVNEFR 
YGGFSLGVSNTQALPPSQEVNDAIKQMK 
KHLKLAK/DKIS IVRCITKMGM*LLGYR 
SSWQVKRITWQLIVLSIMLAYRAVFGI 


1273 


15174 


A 


1281 


236 


2 


TQS RLVFF YMTGPAVYLNHHALRTHQGS 
HLCFFFCFEMESHSVTQfiGW\VMWRDLS 
SSQPPPPRLKRFSCLSFPSS *DY 


1274 


15175 


A 


1282 




410 


KKKTLKKIiOCLCPPGVKGEFNPPFYVLS 

KVFPKKHRGT.FFKVTPVSLTPFFKKKWF 

KIGGGSGE'PPPL.WS'B'FFFif/RDRVLLCH 
PGWKAVTRS * LTTS VDSSDPS/ CLRLPS 


1275 


15176 


A 


1283 


404 


232 


LSSWDYRCLPPHPANPLYF* *RRGFTVL 
ARMV/GPRDSPASSSQSAGITGMSHCAQ 
PG 


1276 


15177 


A 


1284 


3 


402 


MQIKITMRNYCTPVEMP*KKK\SRTKPR 
C*GGGGTTKILIPCWGDYKIGE/PLW/K 
SVWQFLIK*N]!lHLLYDPAILPLrFYTRE 
MKTYVHTITKYCNHNEK/CMQGFTEALF 
/TLPKTE/SQPKHSST 


1277 


15178 


A 


1285 


1 


253 


LRGKFMTLHS/SILKRG*SQINN*TATL 
IK*KKKGKIKPK/RSRRKVI IM1SIRHK*D 
* KQNMNKFNETRSWFFKNMKKI YKSLVE 
VT 


1278 


15179 


A 


1286 


400 


105 


FNFKKKKKKKNREVSE*QY*QAKYLKEN 
FTKNI*VAEKHMKNAQYVIKEMLI*LTM 
RYYYTPIK\MAKIKLTD1SIHKCQQLKPSY 
MVGKEFGKFL I KLNM 


1279 


15180 


A 


1287 


400 


119 


TVFHHIGQAGLEIiTSSDPPVSASQSAG 
ITGLSHW\PASSSFHSNHSCPVPLEPHQ 
VAQLDSFGYEKVCSFSN*VPGAQDKNDL 

SEN 


1280 


15181 


A 


1288 


157 


386 


NDLQFHPFYCLF/DLFCQSLKKAGMQWH 
DLGSLQPLLPRFRRLLCLSLPSSWDCGH 
VS/PMPG*FCIYTHIYFLVEMGF 


1281 


15182 


A 


1289 


400 


124 


FDAPAKGQGGDFGSPPPPPPG/AQKIFP 
PHFSQ*IG*KEGAPMGQPIFVSFSKTGA 
PPPGQGGFQIPTPCGGPPQNFGIQGGAP 
APGPGGGFF 


1282 


15183 


A 


1290 


419 


3 


KNF FFLERGFFFFP PGGRAGGG F * FP P P 
FFFQGKKISGPPPPKKRGPPGAPQKL/R 
K IFLVLKKKGGPPLWPGGFLNPAPKNFA 
RANFPKRGGSRVGPPSGGPNIFNLGGGW 
FFRFFFFFFFFFLVFKTEFRFCCPGWSA 
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SEQ ID 
NO: of 
n ucieotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/513,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptpphan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, \=possi We 
nucleotide insertion 














M 


1283 


I5I84 


A 


1291 


397 


39 


WGQTOAKDW IHGLGPKTQGCPPPLGVFG 

CP/SLIPPNLFFYGIPGGLKIFVHKNPP 

FPIKKE^7vNVIIATFFPFGVFLFFL/CFF 
ETRSHSyTQAGVQ*RDLVSL*PTSGFKR 
FSCLSL5SSR 


1284 


15185 


A 


1292 


399 




LNFFFLHRQKLCNILGSDDKVPAi'S lYP 
EPRICYVFSLLFEIIFSFDFNKKIRIYK 
KHVSRIYKDLLKFNNKDNPVKKWAKDLN 
RHFSKDGIQKVKWHIENC*TLAIK/EMQ 
NTTTMR*HLMPNRMAKIRKTI 


1285 


15186 


A 


1293 


412 


1 


ARMVSISEPLGTPIWESQKGGNKRVSPP 
TRPNPLFLKNPKKCPKSED*NKPPLLKK 
KKKWSGPCPAQ/SCPGQNPHP/SPPKQL 
PPSPSTPPSRPS PTLAFPMAS* LAIS PA 
HTHCCWRELSGTHTLSIPCLKPLRGP 


1286 


15187 


A 


1294 


229 




FFFETESHSVTQAGVQWCaSIPGFKRFS CF 
GLSSSWDYRYMPRP\MJF\*FLVETGF 
YYVAQAGLKUIjSPGDLPALAS 


1287 


15188 


A 


1295 


401 


2 


STLPVH*KWTTKAWMTAHL/ FTAWFTEY 
FKPTVQNYCSEEKI PFKILVLIDNAPGH 
PQALMGMNKEMSWFIPV/NTTSILQP\ 
ADQGVIFTLKSYCIKHLFQHQCTPAWVT 
QLDS VS KK* INI FQ PGHVAHACN 


1288 


151S9 


A 


1296 




336 


KKTPERKTHKEHHNGEK\LRAFPLRSGI 
KQ/GCaF*P/YILNIWWliAKEYGKE 
I KAS FRKEELKLCLF IDMI I FVEI PENS 
PQKL\IN*FSKVAG**VSTQNSVAFLFF 
FFFF 


1289 


15190 




1297 • 


3 


384 


HTNMQKVLV/'VKTEDQPSHNIPIJSQILI 
QNKSLTLFNS IKaERSEEAA*EKCEDSR 
GWFMRFKKKK/RIS ITKMQGEAASAWQ 
AGTSYPKDLAK/DBGSYTKQQIENVNET 
VL Y * KM I / PTEDF WRKDKSM 


1290 


15191 


A 


1298 


3 


395 


SRSVTRAGVQWHDLGSLQSPPPGFKQFC 
LSLLRSWDYRLS P PHQQMGGI FL/ VFLF 
FFFPGKGVFLGGPQAGGKGPNLG*l\INPP 
PRGLGE FFGLNP PRGWGFQ/HP PPHPQL 
F/CCFFRGKGGSPRRAGRAPNP 


1291 


15192 


A 


1299 


260 


4 


GFIVSSAVFPLLKCLLDILVS3SVLLYC 
LWNFKIQKSFRFYNLSPLVSSFKNSH*N 
SN I /WPGAVAHACNPSTLGGRGGRITRS 
GD 


1292 


15193 


A 


1300 


91 


219 


LSRLSAWDY*HVPTHVFFVEMGFHRWQ 
ASLELLS *GDPPAIiA 


1293 


15194 


A 


1301 


277 


441 


TVEF*GVQSMKEEI \ELCLFEDDMSVYE 
ENLKELTKN\LLK* I SNYSKVAGYKUNI 
*KAIAV1HTSNEQ*NFEIGNT1LF/TIS 
TPKYLGINLTKYAQGLYEENYSNLMNE I 
KELNTW/RDILCS 


1294 


15195 


A 


1302 


256 


482 


YIS1VYFKICIGPGTVAHACNPSTLG\G*G 
GRI /TLESGVSDQQVQHG 


1295 


15196 


A 


1303 


182 


460 


VGREFLDMTGKEFIYKFLYIRKIN*SSS 
KLKTFVLPKTLLRD*KAKLQTGRKYL* I 
TYPVKGLVSRLCKKLSKLNSKKTTQLKK 
WAKDMNR.HF 


1296 


15197 


A 


1304 


1 


468 


FKQFSSISLLSSWDYRHPTGNFCELAPY 
KQNAPSCSNVFTDNVPVIST*QGLQALS 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalaninc, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknoyvn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














pnweqspwretkktavdlcgviwkpcq 
vflrkkpktktgq/heqqifgf/cffse 
teshsvtqagvq*hhlgsl*plpppfkr 
nsttrtsdydgp/hvpp/hpa 


1297 


15198 


A 


1305 


880 


1026 


EEHGAGLSGSQDAAGGVPAG*GGWAQLW 
VTRRASLFLDKTHWPVDEQNLGSLYTIE 
ATAYGLMQKLELGRYNETHAIAKWLLEK 
QELGGGFRS TQ /S DGDPRETTWALEAL 
TRFREAVPFKGIQDLHVQIRAPKTALNV 
NWYIDHSNAYQQRSAKFLAQDDLEIKAS 
GNGRGTISILTMYHKSPESREDNCNLYH 
LNATLHSALEENKKGGETFRLRMETRFQ 
NN* EATMTIMEVSLLTGFYPHQDDLKQL 
TSDVERYAFQYKTKTSTSDSTWLYLEK 
LSHEKNTELGFRVHRMLQAEFLQAALVT 
lYDYYEPSRRCSTFYNLPTEQSSL 


1298 


15199 


A 


1306 


3 


726 


RTDHYQFQSLKHCLTGGEALNPDVREKW 
KRQTGVELYEGYGQSETWTYKWELSYE 
DANTYIVX* KTL* TQKEEGM ICANPKG 
MKIKSVSMVKESLPyyVHIVDDEGNVLP 
PVEERNVTVRIKPTQLLCILNCYLDKCE 
KTAVS*QGDFYITGDRARMDKDGYFWFM 
GRNDDVINSSSYRIGPVEEESALVEHTA 
VLESTWSS PYP IMGEVGKAYIVLTRAY 
S SHDT * ALTRVLQEHVI K 


1299 


15200 




1307 


230 


486 


NAICPNGSKGSLGLGSCSVTQTGVRW*D 
HSSLQPRPPGLK\YPPTSAS*VA\GPQA 
GVQWHGLDSLQTPPPG/FKRFSCLSLPS 
SWDY 


1300 


15201 


A 


1308 


463 


440 


SAIPVHT*RRRKQTSLNADMEKV*WWI 
EDQ I SHNI PLMKS L I QGKALTL FNGMKA 
KRSEEAGEGKFEASRGQVMKFKERSHLY 
NVKVQGEAAGADGEAAASYPEDLAQSTD 
EGGYTKQQWFS IDV/TTFYWKKMSSRI F 
lAEFHHTD* GDNGSVP 


1301 


15202 


A 


1309 


463 


59 


EVISTLTKMGKILKTKNAK/CWQDCGTK 
RTFIHCWWGYKLVQPLWKRV\YTFPYNL 
AIPLIGI YP */NMKI YSHKRTCTKMF/I 
NSLFI ISKNWKHLRYTFTREWIK/M* YS 
HTKDYYSAIKRNKLDMFSNMHESQKHYG 


1302 


15203 


A 


1310 


438 


29 


pggkggggptpapfpfpppppkvgvfw* 
gfprpflfkppppnkapfpgaphff/sl 
ppfprvfkktfptkkkffppggggkkkp 

PPPLQGGKKMGSALKFPSGGKKPWGISP 
QI 


1303 


15204 


A 


1311 


2 


310 


HNQKNEFGPYVTS YVKVKEAK* lAK'i'RK 
LLEENIGVNrHDIELGSGFLftMMP\EA* 
AIRLKIEKLDFIKIKNSCASKDTINKVK 
KTGRPGMVAHAYNPSTLGG 


1304 


15205 


A 


1312 


19 


348 


RGQQDVLPG*RGQQDVIiPGRGTYSEYGL 
IFSAR]SrPSMEV*SVMHRPFHHHGVp/HQ 
WCSDSGSYFISKHCGN/WVSACGIH*CC 
LVPCLPKASDLGE *WNGDFSLSGKEKKK 
EFYLIMTFSQ 


1305 


15206 


A 


1313 


2 


424 


ENKLTNHGKTGNGGAQSQ/PPECEPRTH 
LQRGr,EGRGGGEPWGQGQPD\PPSNSSL 
KNPQAGVPPFSSLKGKVKRDRSVSVDSG 
EQREASTQ*TFVN6KKKKKKKKKKaKKA 
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nucleotide 
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SEQID 
NO: of 
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sequence 
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od 


SEQID 
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USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AlaDine 
C=Cysteine, D=Aspartic Acid, 
E=GIutaiiiic Acid, F=Piienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Mefhionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tiireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AYSPDSKA/PSSKSSPKSS 


1306 


15207 


A 


1314 


375 


3 


KKRTLLIPLMNiMDAI/MNKILAH*IQQH 
I KE I IHNDQVQFVPGMQGWFNI QKNINV 
IPYY*LFY*CYIWRIKDQTLriISIDTE 
KND KIQNS F I VKTLRK\ I KENFLTL I KR 
IY*KSSNNKKN1D 


1307 


13208 


A 


1315 


403 


3 


IiEGVSFFFPRVASQGSILGSCNPPLPRF 
HHFS/CPHLLSKWGYRFPPSPPA\FFFF 
FLKTGFFFFCKNFAIjCSFP*KQASPPPP 
LTVFFFS /HHPFFPFFFPLFFFUIWS FA 
LVAQAGVQWHDLGSIiQPPSLGFKRF 


1308 


15209 


A 


1316 


38 


427 


PEXPARPTRPRPSAWQPPRLRSEEPLRP 
AAAPSEK*6ASPPGSHPIWEVRSVSARQ 
PPRPGGRWWGVNPPPGQPPRPGGEGRLC 
PAAPTGK*6APLPGQPPRPGGRWGGYPP 
ARSAASYGR*GAPLSSRP 


1309 


15210 


A 


1317 


43 


399 


LTFFFFLGEGARPPPPGWGPGAHQGITA 
PLFWGGQGNPPP*PPGEGEPS/QGPPPP 
GGNVFFWKKKGSPRAPGGPWTGGPKGLP 
RPPPPKGCE*RGWPPPPTPKFLGFTTFQ 
KKGKNSGP 


1310 


15211 


A 


1318 


437 


54 


MNEQKGERGLSSLGT.PAR*D*GKGKVKE 
GR/EGVTFSPREPKEESLRWSTPQKE IV 
GVINHPRGQRVDRQGAGSGCEGFALRAQ 
TGGPPA*EITEA6EQKaGGGSKGAQTLQ 
RETRPPRGQRGGGRSASL 


1311 


15212 


A 


1319 


351 


133 


GGGWAEVPLTSQVGRSGRGTPQLPDGAA 
RQRRPPPPRRGSWAEAPTSQTGRPGRQA 
PHLPDDGRPGRDAPHttiP 


1312 


15213 


A 


1320 


474 


11 


KINSPSQKKKKKKQCSSIRKLA*/DQNR 
HFIEEETQIPNKHMKRSSIS/LAIKEMQ 
IQITKSYCIIIRLCK*LIKNSDSIKCWQ 
GCREI/GSLIHCWRKLKMNSTALQMVYK 


1313 


15214 


A 


1321 


485 


33 


gpasrvagitgahhhawlifvflvemgf 
hhvgqagle/lpvswgkgghl/rlcllf 
gycgqcxtykps *rrffcvs f yira* k* g 
fvclfv*mvsccvtrlefwthrc1ihsk 
LQP* ipglkcs\pasgf*varttglyhg 

VHHVSNSTALTSGLQ 


1314 


15215 


A 


1322 


451 


145 


THPFGRPRGGVFKVRKLNPPWLKKENPL 
FrKKKKKKPGPGGGPFIPTPSEG* /GKK 
KGPWPEKEASNKPKFAPSNPQTGLGKKK 
KPPSLKKKKKKKKVYSLIF 


1315 


13216 


A 


1323 


470 


151 


GKAEERHDPGSRACSEPRSCHWTPAWAT 
\SETPFKKKKKKLCISWPGTVESCFSVR 
T**I*LRQYVLYLFLFQPKNVIYLFICL 
HDKiaSMVMFLDFKYYFCFLFLD 


1316 


15217 


A 


1324 


64 


367 


twegegswltsqdrttaplh PSLDNRVK 
lrlkkkks vgfls ps illaknqi kkpap 
ftma/sk/rikylginltkdvkdlynen 

Y*TLMKNLKGGAMCSFTSL 


1317 


15218 


A 


1325 


193 


381 


AQLFKTSMGNSETLSLQKKKKEKS*LTL 
PSSWDYRHPPPRPV]SrF\*FLGEMGFREV 

AQAGLEL 


1318 


15219 


A 


1326 


410 


7 


GFFPYP*LPK/SLRC*GKDFYNQGSGQA 
EWLPPVIPPPLEAQAGGSP*GRSLRPTW 
PAWSNPSFLKKPTPLFLKFS*GRRIALT 
PKAKVSVWWDSPPALQPGGPSKTFFPKK 
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NO: of 
nucleotide 
sequence 


SEQ JD 

NO: of 

sequence 


M 
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od 
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NO: in 
USSN 
09/515,1 
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Predicted 
beginning 
nucleotide 

location 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 
locntion 
correspon 
ding to 
last amino 


Amino acid sequence (A=Alanine 
C=Cysteine, »=Aspartic Acid, 
E=Glutamic Acid, ItePhenylalanine, 
G=Glycine, H=Histidin6, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metfaionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKLRI IGPG WAHACNPNTLGD 


1319 


15220 


A 


1327 


295 


3 


VCHFGIYFCVCSLYFTCLYFPFLIFL*V 
TLTFLVFYFDFTVLFI ISL* YFNDCSRD 
CNIHM*LLSLPVLIFHCCC\FVFRDRVS 
LCHPGWSALASSLL 


1320 


15221 


A 


1328 


2 


392 


RQASRPP * SAPPAPAGKEGSGE * PPS PK 
/PPPPPPKWPPRGFPFPPAGPPPPPPP 
KKSPPPKTNPPPPPP/IKPPPPPPPPPP 
PPFFFSPPFP 


1321 


15222 


A 


1329 


378 


7 


TDDLILKFI*ERYKPNQF*KRTKLKDSL 

P:2FENYYKAKWKTRW**YKE\KiIIDQW 

rWlH:SK»QIFNIASIl/HIEKEYLCNKW 
CYETTKHEEKI\'>1NLKPLP*TIKFNSK*1 
TIDLNIKTITIWLKL 


1322 


15223 


A 


1330 


403 


2 


VETGFRHVGQAGLELiIjTSGDLPTLASQC 
WDYKHEPLRLGELRSLTRJiWAMQQDPVS 
PNK*IKIK*I/P*SEQFSGI*SIHNWQ 
NRHFYPDPTHFHHSKVKPLTHYWSPHS 
FLSPTPGNHQCLLSVSMDFSIL 


1323 


15224 


A 


1331 


1 


389 


KMKSQAIDLEKIFTKHISDK6L6Y6LYK 
ELL*FN/R/RCQTTQLKMGER*TNSSTM 
GMSFEWMVSKHMKRCSCAFPSLVTREMQ 
VH/TTSLFVERYHYTPTRVAVI KQSDQV 
\W*ECGIRTLIHC*WECKMIAT 


1324 


15225 


A 


1332 


392 


3 


NNFMPSSA/PPPPPPFFFGGPRVFSPPP 
FFKPPPPFFFFGPQKKKIFSPPPPLKFF 
FFLRPPPPFFFFFFFFFFFQKNLGKGFF 
F*PRPEKKKKPPPGGVFFFSLFFLRCSV 
ALSPRLECSGAISAHCNLCI 


1325 


15226 


A 


1333 


394 


1 


SQWGPTAIHYCAWLVFKFLGHFSKTTI 
SAPLLK*TSTRAETSKSFS\CPRKMERA 
EGRRMFLGKSLLKQIVLLF/LISGSCVT 
AS*FL/CFFQTESHSVAQARVQ*HDL6S 
LQPLSSRLKRFSCFSLPSRTRG 


1326 


15227 


A 


1334 


416 


1 


ltllpkptp yntrkenyrp i sli nidak 
il/nilagkipqyikvhnXnmgltpemq 
qlfni*k/rksv1ivichinrkeekkill 
ina* ivfdknptmi * kns wq \ ge yke i y 
snillngkmlkachl\ktrin*gcqlsp 

DAW 


1327 


15228 


A 


1335 


391 


57 


wqrvcnwdknlyqkekkqdimvrkk/wat 
dli inftheemqsvhnlqkdthphslvi 
remqikttsy/hllakilnsdnssyw*g 
cggmgtl*ccwwvskeihalweivwfiy 

SK 


1328 


15229 


A 


1336 


285 


2 


ifffpwrgeii*hlslinkgsrqnerah 
\kdngdfsqiisklklmykykilnqtti 
kymffpkkhrts iki ehilgskgslnnc 

QRISVLQTGRV 


1329 15230 


A 


1337 


34 


391 


akiaslysslgneansvskqnkktknqn 
lkn/iktagglgwahscspstlgglhX 
dpgv*gcrel*wcrctpawasg\etlsq 
K* inkgkerkkkmkncwwmmjsliqlts 
fsftktdwss 


1330 15231 

1 
1 


A 


1338 


292 




gggisktpggegainpkkppalppggkk 

EAFSPPQKKKKGEEA*RDISSEDI *MAN 

khmkscppllmiremqikstmryrfipi 
rme\ilkkqkns 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

locittion 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=PTOline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thi eonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=lInknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1331 


15232 


A 


1339 


369 


16 


DPWQNEQLSRVTCPGPRRASMFRIT*YK 
VI 1AALMAYSVGPRAVSCIKALWTTYG\ 
ir4DNYKRHERRFLKPL*WPGMVAHACNP 
STLGGHGRHERVGRVRSIPDCWRVAAG 
VKASI 


1332 


15233 


A 


1340 


394 




TQSGVQRHDAGSLQPQPPRLKQSPHPSL 
PS,SQDHRRVPPCPARPCSSHDSYVLREG 
PCGRW/HESRGRVFLVLFS***LSLMRS 
GGFKKGEFPCTSSL/SCLPPST*DVTCS 
SLPSAMIWGPLQPHIFMHGF 


1333 


15234 


A 


1341 


400 


161 


KIGPNI.PCFSTKRPWKIDKIQFWPPGKV 
F*/SPPPPKKILFFPPPPSGPFF\PPLP 
PLPKSFFPPKNPNLPPSFGFDIPPP 


1334 


15235 


A 


1342 


377 


3 


NRHFPKAETLICASKHLKRHLPLLVIREV 
QIKTTNICHNIATRLAKM*KADNTKCWE 
SCEi>VLTFy\C*WEYKLVQPL*KMVWQK 
PKVCVPY YLAI LLLQHI PEGTCAS /C Y* 
DIHKRILTAMLFAM 


1335 


15236 


A 


1343 


170 


369 


RCNENKITKEVYFGEIFVELDNKEKNTS 
FFYFYFLFY/CWR*GLALLPRLVMrS *R 
QWLLPWPPKML 


1336 


15737 


A 


1344 


385 


2 


SDIiRRSTHLGLPKCWDYRC/RATVPGRL 

FIFLMVSLKHKSFKF * * RP I YLFFLSS L 

MLL VS YLKNH * L IQGHKDLLPMFSSKS F 
MILVLTSRSILSLLFCFLFFVLRQCL/N 
SVTQAGVQRHNHGST.QPQP 


1337 


15238 


A 


1345 


1 


281 


HMATKHMKRC* / SEMCTLKPQLNTTTPD 
HQNG*N*K*LVNFKGWQ*FGAVWNLIHC 
W*DCKIiAQPFWKIDLSIKAKISMSYSHV 
VRKIFKLKNK 


1338 


15239 


A 


1346 


176 


1 


SVIWNIKEP*IAKIILKKKTIVGGLTVP 
DFKTYYKaMVl/KTV*SWLKDRQHNQWY 
RED 


1339 


15240 


A 


1347 


398 


2 


PRPPGPVRRRCPS/LTATSGSSSPPSSP 
FYLGYPQQFPLPAVLNRGPQILFWGPHK 
10n:TLPARGR*IGSS*TAPPFFF/SLFS 
FLLFSFLLFDTGSYSVPQAAVQWFNHGS 
LKPQPPGLKLSSQLLHRU3W 


1340 


15241 


A 


1348 


9 


395 


GLQNPCVGFLVSGFFFFFFFPPKKGLGV 
PPKKK/RGPNPNGPVGEFIiATGPFFWTG 
PLKKNPAPPRAIJFWGPPPPPPWGG*NPP 
PPIiARGSPFFKKFFGGPDPFFYRTNPRG 
PNBOIGPPWPLKWGNPPPK 


1341 


15242 


A 


1349 


119 




ENVFRNM/WPGMMAHTCNTSTLGGQGEW 
ITRSGV*DQPGQ 


1342 


15243 


A 


1350 


58 


396 


GIRVGKVCFIIFFFGFFFFFGKGVSLPP 
GRETGGPPL\LIKPPPFGVKGIFLPPPP 
GGGG\CGPPPPPRVI FVF* GKGFPP *GP 
GGF*PPAPKGSAPPPPPLWKNWG1JRIFG 

GR 


1343 


15244 


A 


1351 


396 


1 


GPTKGPGPFIjDGQGFPPPTLKQNFFFPA 
LFFPPLGKRLILGGF\PKPGGLLQPPT* 
KPLRFKPGGESDCFPKFF*GRIVCFQQS 
LCFFPPPKKKKTAFGKRFKKKTLLFFFF 
SDGVSLCHPGWSTVAQCRL 


1344 


15245 


A 


1352 




250 


RRL* SQLLGRLRRQNGVKLGSGACS ERS 
\CSERRSRHCTPAWATKRDSVSKKTKCR 
RHQHVGSLTVRLRPGYWI HHCGCHWWP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


1345 


15246 


A 


1353 


249 


3 


CGATLCPKRYMKITPSFFVEIDN/AILK 
FIWKFKGPRIAKTTLK*KQG*RTHTFHF 
HNILQKAIIVKTMWY*YKHRQSPGRVAH 


1346 


15247 


A 


1354 




416 


lltlssslkitihtnkgrt*sfirenii 
i fiattnllgllph * ftpttqlfinlam 
aipl* a6avi igfrskiknalahslpqg 
tptplipilviietisij:.iqpiala\ar 

LTANI TAGHLLMPLI GSATLTI 


1347 


15248 


A 


1355 


2 


416 


I KYLWVRPETLKLL* / ENIE *NPHNIGL 
GSDFFNLTSNSQGI KEKIDESDYFKLKS 

cctesdttneirqlicnerkclqitcdkg 
lif*kklk*ly/kqkthnh*k/wskrlk; 
yfsrkdihmakry/mkkcsts i irenkp 

K 


1348 


15249 


A 


1356 


426 


85 


HaRLVL*FVF/LFETAYLSVAQA/GGAM 
AQSAHFSLELPQSSDTATSTSQVCYHRN 
T/RLLF* I FCX3YG/RLCCLGWSRTPRLK 
QSSHLSLPKCWDYRWEPLYARPYLSCFP 
ENARLG 


1349 


15250 


A 


1357 


2 


301 


GGLLEVKVQDQLGQHNKTPSL/ IKIYIL 
P I YTQKET* KNliTRHGGMCL* SQLIiaRLR 
*ED*SSPQI*QCREP*WRHWTPVWTIQQ 
DSKSRGEKKQVIFTHYT 


1350 


15251 


A 


1358 


2 


389 


FLHVGQSGCiELPTSVbLPASASQNAGIT 
GVSHRSWLKFSLLDVPSPLTP/VSSAVP 
LISYLATGWRQAAIAASPIFLHQLftRPA 
QPAREAAADS*LPADSAFPPKQWTC* *M 
FTGALFKKKKKPCFVAKK 


1351 


15252 


A 


1359 


309 


3 


KWDHIKLKEIFCTAK*TINKVQRQPMEW* 
K/IFVNYPVDKGLITRIYKELKQLYRKK 
KS11NLIFKMSKS\SSLMREMQIKTTMR 
YHLTPVRILVYVLP KHARS 


1352 


15253 


A 


1360 


300 


2 


KRAFH*KKMPSRTFTAREGKSMPCFK/A 
SGWATPVIPAL*EAKVGESLEPRSSRPA 
WATRRDSCLF*K* * INK*KLTLLLGANA 
AGNLTLKLMLICHSKS 


1353 


15254 


A 


1361 


396 


60 


FQPGGPRGTPLP/PYKPPPQKTPKKTGP 
GGGGLYSPPFGGGGRGNPFFPGAQGFF* 
PPSPPPPPPPGGRKKI FLPKKKKKSLLF 

FF 


1354 


15255 


A 


1362 


37 


385 


ALFSFSFFFGFFWGKKIFFPGQGGRAGG 
HSNLPEPPPPGGGAFPRENLSGGGE*RG 
PPISNGEIFGFLKKTGVPPGGRGWFKPP 
APKEPPPPAPPRGGISGQDPLPPPVLPG 
WGKKN 


1355 


15256 


A 


1363 


12 


421 


EPWEPQTLGICTHLQTLFHDYQVLMKML 
VVTVRYHLTPVKMPFIRKTPDTAGM*KK 
KKGCYQGSBCKGGMLI /HPPWGAL*MSIV 
KKKTWGTFFKKIQKELPWDPAIPWLGMY 
PKEGNSVFQRGMGLPSVIGTLFPIAG 


1356 


15257 


A 


1364 


278 


3 


CCTGEKLETFLLR*ATRQGCPHLFDNVM 
EVLANAVRYEKEIKGIQNGKKKVKT\SL 
FPGDEIV/YAENPHKS*PKNSL/KLKSD 
CSKVNIQKPIAF 


1357 


15258 


A 


1365 


265 


330 


WPGAXAHACNPSTLGGRGERIT 


1358 


15259 


A 


1366 


379 




KRKLMYCWWECK* IQPGWKTVWHFLKKL 
KLELPYDSVILLLYTfLKES/PVYARDI 
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nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

nmino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Gliitamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CTPKPIATLFI IA\ECSLTHD*TKKMWY 
LSIMEYYSAIKNE/YLPFATTQMNLEDN 
I FSEESQA* KHGVF I FP 


1359 


15260 


A 


1367 




376 


HLPGAESQPPPV/DNSWDRPAGRTQLLW 
TPA/DPHSYG*GGAGPHPCPSQPGCCAP 
VQSCS*APSE2iQSLGaaDS\GPAATLPA 














KKKKKKKKDDSGKG 


1360 


15261 


A 


1368 


2 


409 


EEAMPKAKMG*RPFAPNSQII»NAKKKLL 
KEIKSVTPV1II*MIRK*NSLIADMKEV* 
VIWIEDQTSHNIPLSQSQI QSKVRMLFS 
SMKMRREEASEEKLEVSR/GWKSAGFM 
EFKERSNLHNIKVQGEATSTDGEGA 


1361 


15262 


A 


1369 


220 


416 


PQPLFDWQMYTYTLHNDILVNDGRLS PG 
GQGCSELSLCHCTPI*Ml\SETIiSQiaaC 
KKMGQARGLS 


1362 


15263 


A 


1370 


310 


289 


AFR*A5TISKGSK* *H*EKKVN/WNFTKT 
*NFCTAKDIIKRMKRQPTKWEVIFANHI 
YLTGILIS\KiyKELETTQ*PETQSLK 


1363 


13264 


A 


1371 




322 


HASERTHHRGKSTETTTPAWATERGSVS 
KQKTNKKTKK/HNT6SSIHNMVSDFBKQ 
VTQTF'SSMQMSNKPLKRYLTSVI lEMN 
IKPEYHFTPIEMAI IKKTDNTKC 


1364 


15265 


A 


1372 


3 


332 


GKEVS *ED IMMtHVYAPEOTEAS KHMKQN 
LIEMQGEI/DE/HPVIIGDFS/TPLSII 
DRSGQKIIKDIV/BLE/STVKQLDLI/D 

iyrtlylkrveyvf\ftslsktfatinc 

ILGHKV 


1365 


15266 


A 


1373 


381 




cs vaqagvkwcasihs slqp * tpgvk* ss c 
fslpshsdyrheX palffkffvqmgs \t 
ml*fs*escikgsrqpcsytylhsppif 
sslfpsiiapesikrqepsfffemescsv 
tqaqvqwrhlsslqa 


1366 


15267 


A 


1374 


1 


385 


ykkginakifnki**v\esaishdqvgi 
vlvlqgfls i/rnks isvthhvkrlkn/ 
hmli*idaekafdkiqqsshgkigikgn 

F FDLLKS I YRKPMAKVILNS E KM KCLP C 

kc\ktsqgcqlsplfflfff 


1367 


15268 


A 


1375 


1 


357 


csgvisphcglkflgsndlpalasrvag 
I igmt p \haql iknfcccwwr * clafgg 
*rfktmcrpivk*nksink*nrtvcmsg 
cgds / rqllgrlrw s pglhnvqgcs e p * 
whhctsacl 


1368 


15269 


A 


1376 


1 


375 


HRPKFKS rSLLEENMGESFHDLGL/GSD 
LLDMAP IVQSIRDKTSDFLEIKKSCSSK 
GTV* IMKKQATDWE/RTFVKHTSNKGS I 
MYEEL*NL*KL*NNPr*KWAREFSRHL/ 
EDTQMATKHMK/KCSTAL 


1369 


15270 


A 


1377 


260 


400 


KRGLGKTVFPWPKKKNGPFFFFPKTTSK 
GRKTLMVNPE* PKYYSVKKKKKKKGGGL 
PFPCKIFFPNLTIQTTFFWPKTKPLTLK 
KRKKPPKKTPTFWGHFFFERGTKKTQGG 
KKMPFKKGGWEKLFFLGQKKKMAPFFFF 
QKPPQR/WRKTLMVNPEP*NLLKKNM 


1370 


15271 


A 


1378 


403 


1 


TGVSQALSLFFFKYFKPRMVEFLVEPSR 
*KGPSVYGLQIFFSIPia/PFPCPFWAF/ 
SLV*SSPPCLSLHLLPAFGVLPKRSLPR 
PMLQSFSPMFFARIG*FQILLIiWFQSLL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 

location 

correspon 

last^amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














S*FFFFFGDRVSLCRPGWSAVAP 


1371 


15272 


A 


1379 


195 


381 


AERYQTS RSTADRI S CLFI * SAEI / VFP 
DSGKKTGGNNimNHNNNNTNNKDNNNNK 
ISLSLKP 


1372 


15273 


A 


1380 


361 


39 


PPVKETAKDVNRRFPKQPMTNST* SDA* 
VDEWCL*FQLIGRLK*SRRIA*AQEFK 
AS/HGQHSETLSLKNYLKKKRFSTLLFL 
EEMQIKTTTRYHFTPIRMAKK/SHHTKC 
*QGLAMLPRLALNSWAQAILLLHFSLPI 
SWNYRHTTSSTQASLHVLFVIGCLGNRL 
LTSFAVSFTGG 


1373 


15274 


A 


1381 


400 


90 


lpsswdyrcappl]:j«jfcvf/m*sqgft 

RLVSNS * PQDPPTSASQSAG ITGVSHHA 
WPAFFIMWFSSS YSR/TMH* *QHPHV*K 
LHICPN**KS\LSCPQVPTIIW 


1374 


15275 


A 


1382 


■ 


431 


DNIPKKSAVYYWITLFKKG*DNVEDEAy 
SGTLAISIC/EENIHLVCSLSDEEQHST 
AQTTANTIDITVGI.AYTJLTAKSKLNKT, 
CT* *MPKLSYPLF*KIL*KrlKTKrJIV\P 
NQLQIICAELPVEILIINCDQDPETWLGGV 


1375 


15276 


A 


1383 


2 


432 


ELSADVSFFFITTPLSGVL*QNWGIAAF 
IPIELRSPTEVTFSFDVGNGPFEISLQS 
P TH FS DHQWHHVRVERNMKEASL/QVDQ 
LTPNTQPAPADGHVLLQLNSHLFVGGTA 
TTQRGFIGCIRAL/RMNGMTLDLEERAQ 
VTPEVQ 


1376 


15277 


A 


1384 


1 


421 


NPPALASQRAGIADICHCAWPLLRLSKP 
QFSNL*SKKFALGDV*SSNIL* YI YI FF 
EMESRSVTQAGVQRHDLGSLQP\LPPKL 
SLLPPKLSLLPPK\ SASCLRLAGSWNYM 
HVP PRPAWFCIFS GDGVS PCWPGRTRTP 
G 


1377 


15278 


A 


1385 


2 


416 


I FS VDETALYWKMMPS RTWYL/ RKEKSM 
SGFRDSEERLTLFL\GLNAPGELELE IL 
LI*HPEILGPLICrYVKFTLSVFY*WINEA 
LITflHMFTAWLTECKKPSVRTYCI.GEEM 
PFNILLLIGNAPGYPRAPMEIQEINIIV 


1378 


15279 


A 


1386 


170 




DGVFLIFGGQNEKLNKNKDG/DLTKLPR 
LA*NSWAHQSSCIAFSKCWDYQREPPCL 

A 


1379 


15280 


A 


1387 


47 


41S 


FMGFFFFFLKKKKaALGPWTPPPPGQGG 
PPPPPPEGPGKGGPNPPRQIIYGFWGQR 
GPPPGGGGEPGPPPPGEP\PAGPPQTGG 
PQNWPPAPGP*KGF*GASKGAP*KGVDP 
GGERSQVPNROA 


1380 


15281 




1388 


58 


503 


RPTRPGKYIMIKEFI FQKDRTI *NVyAP 
KNIMiKYIKQKLINLKGKRDKLTIiTVAN 
ISTCP*VTDITSR*KICKaTEELF*QPA 
\DLIDIYRTLyPTAA*YTFFSSAYlTYSK 
IGNSIGHKTFSN/CKHMDNIQWLFSDHN 
GIK*EINIII 


1381 


15282 


A 


1389 


2 


401 


LVSQ WNS KNRI LKKI KNS TP VHT /RQM 
IRKQtmLIADMETVLVVWIEDQTSHNl\ 
SQSMTQSKALTLFNSVKftERGEEAA\Efl 
GRGWFTRFKEERQL*NLQVESEPPS\AD 
VEAVASYPEDVAKI IDKDGYTKQR 


1382 


15283 


A 


1390 


294 


3 


KIVIFDIOTVKNC!SS*ItEVIFDFltoVKN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

location 
correspond 
ing to first 

residue of 
peptide 


Predict- 
ed end 

location 

last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Tsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CSSKTVIEGKVSDCKKIFSKHISDKYLI 
SI IYKELSNLNNWQQPN*KWAKDL/KNM 
SSKHEKICSISFGKMQIKTQIRNHYIPT 
DAW 


1383 


13284 


A 


1391 


3 


422 


PLFKKEEGTNFPFPPFWGRFFPGGKSFS 
PFFFPPPPFIKGGPKVSQKGPRFFFFFF 
F*DGVSLCEPGWILFPPQVTEALLRGFL 
ALSSLCFRLDKFFFFF*R*GHLLSRWDY 
RHEPPHLRIS*RKTKE\KGLIMLTRLVL 


1384 


15285 


A 


1392 


3 


400 


FLYIYkDIYR*YKTFQINNKLKllYFK* 
EKDLSKLFTKDVQMTNMHI*MGSTSLI I 
KQMQIKTTMKHHSSLPECLHFNAANITC 
WKRLGPAGTLILRW*ECKSVKPL/WETF 
♦QCLIQBSMQENYH/DPATPLIG 


1385 


15286 


A 


1393 


409 


2 


IVRHFSKEDIHSTNEHiMKKGFLSLVKEV 
QIETTMGHHHITVRRVKIKTDIFASA\N 
TKC**GYGKTETLIlSrS/YWECKIVQPL/ 
WKNILAVAYKI KHTPFPPRIHLSGSQKS 
HFS lYSKEMKTLRQHKDIFMATLSQN 


1386 


15287 


A 


1394 


409 


24 


KSPFFFFEKGFPSPPPGGAQWGGFPPPE 
PPPPWK*FSPPTPPNKWGNRPPPSW/P 
G*LFFFCSPGGFPPLPNWFLTPPLR*SY 
PPGLPKRWGFKQKPMGRG*RMFCPPKI 
SKKLWSGKKKKKGRPIAI 


1387 


15288 


A 


1395 


83 


411 


QRDSVSKTKQKKINNKSRTDLNRHFTKE 
DRWM/ANKHLKRYST*LVIREVQIKTTM 
RYYYISIKKSKINKPDirraCWQGCGRPG 
TLIYC*WE/CKMVPSLK*TFW*FLKRLN 


1388 


15289 


A 


1396 


308 


14 


NFFFFYSWSNIIYCQKGTAFTPISDKID 
FN*KIISRQK*QYMLIKGSIKKr/DITII 
NTYTSNNSSPKYIKQTLTQLKGEIAPST 
IWGVFNTTLSIIK 


1389 


15290 


A 


1397 


321 




KQ*CSIEYS/PKyTR***SKL/WPGaVA 
HAYSPSTLGGRGGRIT*GQEFKTSLA 


1390 


15291 


A 


1398 


183 


48 


KWPGAVHHACNPSTLGGQGGRI / TLRSG 
V*DQHGQHGESPSLVGRVR 


1391 


15292 


A 


1399 


18 


422 


KAGMAILIDKVDFRTiCNI ITK * /EKHFI 
I TKGSVREEN I SVLNV/WPHYRASKHMK 
Q/ I L I ELHVELGEHTI I VG/DFNI SFS V 
LNRIDKESARRR*NI *NNTNHQLNLVDI 
Y*KTKlSTTNGRTTFFPSflHKLFTIIIHI 


1392 


15293 


A 


1400 


28 


339 


yscdhnsvqpqtpglkqslcqppe*iag 
itdarhhiwpiff\lkrsfvfiaqagtq 
wrgll*kkkkvggfkkgscltlpgrgdy 
rrfreekilipgrgfcneli 


1393 


15294 




1401 


390 




fglgkhvlhmtskaqs ikektd* ldflk 
ipcnfnfskdtikgmkrqarngekifprh 

QS/DDFF *WAIDLTFPKEDKVMANKHI K 

rci.rllll\smryhyh*nkktdhtkc*r 
erg\ei,ellccw*eykmvq 


1394 


15295 


A 


1402 


149 


402 


rfwihlyrkrvfpplustpvitwkekhgf 

VFVFVF*DRVLLCHPGWSAVPMS *LTAa 
STSWA\K*SSHLSLPSY*DHQHASPCLC 

w 


1395 


15296 


A 


1403 


1 


403 


krksvpelic/apec*apcsyrpnlpss 
pgphldpvgptptpspprpgpargppaa 
rpkepaepgkeerrgl\papgg 


1396 


15297 


A 


1404 


342 


3 


KEDNLIDKGGSPKQQIFSCVW* TALY/YT 
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SEQID 
NO: of 
nucieotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predifteil 

nucleotide 

correspond 

amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Piienylalanine, 
G=Glycine, H=Histid!ne, l=Isoleucine, 
K=Lysine, L=Leucine, M=]VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutan)ine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan,Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














alyskki pprtliagkeklmpgfkaskn 
r/tlllganaag\dfklrpmlidhsel* 
/rpkstlpvl*kka*lrahlfialftkh 

FKPTVE 


1397 


15298 


A 


1405 


498 


66 


knyakstlpml * kwns kalmtahlftaw 
ftkVffmptvetycsekkrf/sfkmlll 
idnahshpkalmeiykeiniafmsakta 
stlkpmdq\kvilaykssylrni\fhkt 
taalssdssdasgqskfk*iwkgftrtr 
grtrgsar 


1398 


15299 


A 


1406 


3 


411 


VSPLCLGRSQTPE IK* SSPLGLPKCVDY 
RHEPLYLASMGVLNCPISNLGQCIPP *A 
AWWSLLLGC\PILMLNLVQTPDLPSVLQ 
PRAPGPR\HPPVSAS*VAGTLGTHHYAW 
LSMLLGSTL*DVFPFLSPSLIPSEE 


1399 


15300 


A 


1407 


361 


2 


irvpq;<k;<ikspgrghkifsf*ragppf 
ffsffflrqkeshsvtqfleeqwhdhgsl 
qsrlpgpk*fsh\pppphppvtgttplf 
iyyyyyfleqsl/dsvalagvqwhdlgs 
lhprvrprv 


1400 


15301 


A 


1408 


393 


2 


pgfni svlkkasgglfffsplgkkgffs 
qpfffqsprvfppppffnpppffffccp 
l*kiffspppglkffffechappffffff 
fffpdrvslXccpgwsamvqshfsatst 

LRAQAIL 


1401 


15302 


A 


1409 


393 


60 


LNVNPlIL*SYLDKHEGMLyHILQSKEFL 
SKTSKAQAKEVTLCKWDYNKLKMFCTAK 
KTL/IKVKR*STE*KKIFVKY/STTK/G 
*TYRIFNKianJKMNTQFKNEAETSRSKM 
AE 


1402 


15303 


A 


1410 


422 


2 


vndrkmdgwmngwkegr*tqkkkggrrk 
rekersmflegrkegrekereerekesq 
kerqkrgkarkeqrqearkqggr/erek 
ererergreregergrksss*tgskvvk 
i illavkmlvhlkslhlqkvcvfsfsrs 


1403 


15304 


A 


1411 


398 


54 


PGFFFKKIFVENFFGPHFQFFFPPPPKG 
KNPL* PPQRGPG* \RFPGVFPPLKKGSK 
KKFPPPPRGTQPRWGKKPLLPKRPGFPF 
GPRVS FFCL IE IGWFFWAQVPKKKKKK 


1404 


15305 


A 


1412 


391 


125 


EPAPFFFFPPPGKRGFFP\PP*FGYPPG 
FSPPPFLKTPPGNFFLGPKKKKFFFPPP 
GEKICFL*RMPHFFFFFFFFFWPPRQA 
ICFIIE 


1405 


15306 


A 


1413 


3 


387 


tpdlk*ltrlslpk*wdyerattpq\lf 
ffkpfaslgsism*yhspvciasklril 
fdssfspsatakhpltsafaaawlspfa 

liqphssesslyssycpvpillf/iylf 
iflrqsl/dsvaqaevqwrd 


1406 


15307 


A 


1414 


409 


I 


KRSGPFFYF/SPPFKKGSFAKPFFLGSP 
VFFPPPS SPP\GFFFFSPPPKRGFFPNP 
FFWGAPGFFPPPLF'-NPPPDFFFWGPKK 
KKNFPPPRGKKFFFFKGPPPLFFFFFFF 
FFFFFFFFFYKIIS I KGEIGRSSWR 


1407 


15308 


A 


1415 


3 


385 


hrpppvmglsprpaaaegvp3ppavlah 
qhhaqflarp*lpphgaglrtcsppcls 
lpnpppwapvwpepp**alppap/avpg 

PIN 


1408 


15309 


A 


1416 


2 


238 


PGGRGCSELLPRHCTPAKIVT\ SKTPSQK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
nucleotide 

ing to first 

peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RKKRE KPGS L * EKTLFFCQTLVFGPKKK 
NGPPLLGRVWAKRKGGFHPPLKN 


1409 


15310 


A 


1417 


3 


394 


NKHLKRCSTSSVIREMQIKTTNRCHFSE 
*KHW/ 1 1 NMCW*ECGQTELSALLI RC * \ 
WDSEMVQPLWERKEYLKEITAESSWDPA 
IPLLDI\YPKRNEASC 


1410 


15311 


A 


1418 


307 


357 


LETRKTAEKIKREAGFEK*NIiETNKNGN 
TISQNLRDVAKEVRREKFIftMNA\ *SiCK 
KEKP* IHNLMLSLKELEKDEQTKPKVRR 
KKKILNIRAEIIRD*KNSGKD*KKSWF 


1411 


15312 


A 


1419 


341 


3 


I FVFLVGRGFPPLTRMVLIFWPHDPPPL 
PP*V1EGLFH/LPAFHPQAFLRKMVPPY 
*KNKIPPPRKNTFFFLKPPPFFFFF*DR 
VLLCHPGMRAVAQSWIjATASTSQAPAIL 
PPR 


1412 


15313 


A 


1420 


405 




WGEELVPSGRVHELFSSALRSERDTDSV 
CSCGGQCHCPGAI PGAALKAACRGDPAS 
RVGVETGSTMGNNGFFSFFI FKPYFFRD 
RVLPCHA\AMAQS *LTAASTS* \VKSSF 
CLSLRSSWDYRCVSPHIiaWFKTF 


1413 


15314 


A 


1421 


1 


399 


QPTLLTELGLRPVPPHPATMKIIiY**IN 
TTLHYSYKNYFLFLFLRQGLQPIiQAGVQ 
WQNHGSLQL*TPGVSFLSSWDHPNPMrF 
FIIFFLERQGLTLLPRQVLNS*AQ/C3SIT 
PASAFQSAGI IGMSHCSWPYKN 


1414 


15315 


A 


1422 


168 


414 


EWREKAGESLEPGNGGGCSELRLCMCTP 
AWVTEQDSSYKKKKKTPGGGKGVF/C*K 
LGGGINF*KRVFCFGKGPTKKKTGGEA 


1415 


15316 


A 


1423 


369 


1 


KHFCKSQDRDSVPKKECYIYIYPMSVFMP 
INLTNWI KFLERQKLS *QIEEAI EKLNS 
IY*/PNESEFII*SLPITKaPGPYGFTE 
FYYSFKGEAMQILHKI.FQKRG*EHSFSN 
VIKTLKSKPEK 


1416 


15317 


A 


1424 


403 


1 


KNTEHSKRYSIPQRYCGFSSREFFHFLV 
HMKVMFPLFKCAIALCLKQVYTVFFETE 

SCSVTR* /evqqqdhgsqqpqpprfq\h 

PPTSASQIAETTAILENFRKICIYFLRQ 
SL/HSATQA*VQWiaiHGSPQAPPPG 


1417 


15318 


A 


1425 


401 


3 


pffffkknfpfffqvegkgrklg*qpfp 
pg\lknfsapplpgggn*r\rppfarlf 
fffi.*krgfspiikqgglgsppffspp\r 
apkwvgfqgitpppgnfffffffseaes 

rsvtqagvqwrepgs \qplppgf 


1418 


15319 


A 


1426 


386 


27 


cringvl i lc* ce ckmaqtl * i * fgnfl 
qkkldltcdstipllgiyprdmktyvhX 
kktct*mftaalliitkswekllrpseg 
ewinsrhpydrivlsnyethmvnkiqrh 
yaeakktv 


1419 


15320 


A 


1427 


398 


2 


GLQVHATTKFFKFFVETGSCYIAQAGLE 
LLVSSKPPG/SSFPKCWDYRCETRRWPF 
LFKFSLPIFVF*LNHLFCLHLLPLFILK 
LNLPCFMYYLFIPPALYSLSLLALFRND 
CDLSPIHLEVKKYFSKKDRL 


1420 


15321 


A 


1428 


401 


70 


HHAQLRCFFNSFVEVGVSLCCPGWSQTP 
GLKRSSHLNFSKCWDYKR/AAITLPSFF 
* I ALF * ENP PNCHSL PTPLS P PKLAALG 
NSCLHPWLPQPPVKGLSDSFVCMHPGS 


1421 


15322 


A 


1429 


388 


131 


rgggclqrh\rgfkqenwfnpggrgss* 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

nucle- 

location 
correspon 

last amino 
acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lV[=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PRSMYCPPAWGAQGGSLSQKKKKKKKKS 
KNYQ KI LSTQKTQKSRVFIVWSKHCTRH 


1422 


15323 


A 


1430 


184 


413 


VSFLFDFyFLFF*RW/RSHSVTQaAVQ* 
CDCGSLQPFILFYFLRQSL/KSVAQAG/ 
VGSLQPLPPGFKQFSCLSLLSIWD 


1423 


15324 


A 


1431 


76 


533 


SCKRTTGRQFPDLPTRPPTRPQGFALVA 
HAGG*WRDLRSLQPPPPGFKRPSCISSV 
LWCT]CAFNFDFQEIYSFSS*I*GFDSFE 
VNFFFLEMGVSL\FAEIAGLKLLGSGKLP 
ASASE/SGGI IGVCHWAQLL 


1424 


15325 


A 


1432 


373 


29 


RQGFPPMGRWLDSLKKKKFQIPKSKIF 
PGGQAPLVIPPSREGEflGKFF*P\GGKG 
AI KQNYC/HCPPTWGKEGNFVSKKKKKK 
EKKMHQK*RCTKDQQ*T*MLLFPSLK3 1 
YYFK 


1423 


15326 


A 


1433 


389 


73 


TDCPAIGRNHRDPVRPLLSSPHRKI/YA 
NLYYYIIVRILTLIQSTDLM*ISPWLV 
LTRVCVCVCVCVCVCIILPS/CYPVORF 
MYPTTTViCVQNI S I GQEKQKTKT 


1426 


15327 


A 


1434 


402 


39 


QAGLQLLASSDLPALASQSAGITGVSHC 
AS PS ISL*APLGLDTFSDFPCFNDLDS F 
EEHWSGML/SECP*TGI*DLSFFS*VYW 
GYGFLEDHRGKVPFFITSYQSYTPPTQL 
IIADVKFEQLV 


1427 


15328 


A 


1435 


105 


387 


efskvagykintyksivilyiisne*fek 
Xkkiipfiiaskrikylki/qfakavqd 

FYAEKYKTW*RIIKEDIJIK*KimiC*WI 
GRL/NI/LKMVLLP 


1428 


15329 


A 


1436 


242 


2 


fgggffwvgtpnkmgpfpqiiknlf/tr 
ppenf*kpp/lpfiaffslgflgvffff 

*DIVSLCCPGWSA*AQSLLAAALTSPG 


1429 


15330 


A 


1437 


411 


0 


llfvaqeclkwfsalggpgvpcs \lpan 
qqpsvwvp*qs\ehvtvlqrfclwpvlg 
paygqi wepvpstiik/ppppgfk*fs *v 
sllsawdcrctp/ptdpanfcifsrdav 


1430 


15331 


A 


1438 


41 


412 


FFCTD* GSLCCSSWTQTSGLKRSEHLSF 
PNCWNYGH/RASVLGLIFPPFLRKKT/C 
P * KKFL *NEDLLCCQLGLELLAPRDS PA 
LG PQ S VGTAKVT PRPW/LLFL * TQNYLN 
S SRVRFSPAPLKYKEV 


1431 


15332 


A 


1439 


1 


406 


HAYW I V\NPS PVSLTGALSSLLMTYGLT 
MGCHFLSITLLILGLLTNTLTIYH*WRD 
VTRESTYQGHHTPPGQ*GPRYGriLFIT 
SQVIFFTRFF*AFYHSSLSPTPQLRRHW 
PPTG I TPLNPLEVPLLNTCVLLR 


1432 


15333 


A 


1440 


3 


422 


IVIEKVSVWIDQTSHNIPLSQSLIQNSAL 
SLFSSMKTE*GEEAAEEKFEASRAWLMR 
/FEERCHHNVKVQG\EAaSGDREATGSY 
PDD3:AKIIDDSGYPKLHIFNIC*KPTDT 
DIVDETAr,C*KKPSTYFLFFFLKQSLAL 
PP 


1433 


15334 


A 


1441 


375 


2 


KVS S PRAENV TLYNCPPLFFFLCQD/CH 
SVTQAARQWEHHSSLQP*TGL\ASQVAG 
TAGAHLYPSQNLIGRGIRSIiA/SVAQTG 
VQWENIGSWQPLPVRIKGFSSLSLKSSW 
EYRCPPPCPD\LFIFL 


1434 


15335 


A 


1442 


397 


122 


GQASLELPTSGDPPASASQSVRITGMSH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
tISSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 

ing to first 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serinc, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=L"nknoH'n, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CARP ILGSCISLFAYPPCFLKCFLF/CC 
REVI LL * CSGWS *TPGLKQSSGLSLPKC 

WDYRCEPP 


1435 


15336 


A 


1443 


3 


299 


WEKMFAMyLSD*ATVSRPyR*LLQIJ)IKT 
KYKLKLSQS/MNRHFTRGDVAMANKHTK 
RKSTSLVIREI KTMHYHYLLKRMAEVQK 
/SVKCWQECG*GGGKCG 


1436 


15337 


A 


1444 


10 


410 


CTLFLSISPDSAGGICTMLLFQNTUHFK 
ISLALGVSDKCFSNFLIMTIVRNVFYIH 
GKKCILYCSPGCAHTHMHTLMHTHIHLK 
QFFLRRGLTLLPRLECSGAITAHCSLDL 
LGSDSPP* *EMyFiyMV\EMYFILFPRV 
CTHTYJUTTDAHTHTPETIFFETGSHSVA 
QAGVQWRNYCSLQPRPPGLRFSSYLSLP 
GS*NHRHVPP 


1437 


15338 


A 


1445 


431 


9 


GPAQSLDSSVSPLQTISSTVKWRHNAYL 
AGLSGGI/PCDCFLGPRVPWGSGICQWH 
WAVSPTLS\PTVRPPLFElETLyPT*V*V 
PPLETCPQ*HISHCRCVINWCEDTHTPK 
KHLVCVHACVCVCVCVCVCVCIiGRSRQS 
EH 


1438 


15339 


A 


1446 


S5 


416 


PGCLSLPKI WDYRCEP PRPPHFFI FSS I 
QGPLTDFPLTPLEQVQISPTAKGPLKKI 
PFFFF**IUIGVPILARWFKS*PHDLAA 
SASQKFGITGVSPHVHPIFLY/CSSIQG 
PLTDFFLTPLEQVQISPTAKGFLKKIPF 
FFFLRQGLT\RQWSTHLGLPKCWD*GCE 
P/PMPGLHCEFFKG 


1439 


15340 


A 


1447 


376 


3 


IKSE I lETESRKPI EKIS/RAQSCFCGM 
VYEIDKLi:ARLDRKKMKrQIJ3NIRN*RD 
EINTDFTDIKRIIRENYKQDYGTKFNNL 
DETDIFLERHQLPKLTQEEKDNLNSPIT 
IKDIEIIV*NLSTG 


1440 


15341 


A 


1448 


2 


192 


SKWIKHIJSrKELKP*DS*KKPE6NLHDIR 
FGNDFLDVTPKN/MATKGKIDN* TLPKF 
KMLMRIWRN 


1441 


15342 


A 


1449 


411 


3 


VFLPPLPPFCFIjNFP*GHaGFFFKI FF* 
KKKGVEGPQLGPPPPFGPFKKFFFF* I F 
LKAPLFVPPTLKPLF/CPPKKKNWENNP 

PPFFFCFFVKKTQFYFFFFFFLENRVLL 
CHPRWSVMV*SCLAVASTSWAQAICP 


1442 


13343 


A 


1450 


413 


1 


SA*TGEWCNPGGGA.CHEPR*PPCPPPWA 
TERGFVSKKKRKQEK/SRIMECFQDNLP 
GFPQl FSVMKNKQEOl* FWTK/VKMTKY 
NA*NVNGS*YWKKRENEEECYDWKKLP 
QSILS *YVKKAR*KPGCWLMAAIPALW 


1443 


15344 


- 


1451 


^ 


384 


DPAIPLLGTYPREMKTYVHIKTCTQMFA 
AVLFTVAETSKRPKCPTDE /VMKI * CT Y 
I/MTYYSAMKRDE 


1444 


15345 


A 


1452 


3 


390 


LPDHPGSSVSTPRGVIITGHGFWLLFPW 
VSFFFFFFFLKRISLLPPNWRGGGQNLL 
NKSPPPRGP*NFIiA*PPQGGRKKGPPHP 
PGAPGDKNP/QPFFFFGEKKKIFNPPTG 
GEKKKPPSPFKWGGGGPN 


1445 


15346 


A 


1453 


37 


381 


LILYINVCVCVCVYIYTHTQMGSHIVAE 
AGVQWCNLGMW*P*TPGLK*/FLLSSGD 
YRLTLPHIA1ILYIFFFPFEQRG\FFFLL 
KLV*NSGPKATLPRVGITGLTHTPRPYR 
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SEQTD 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=]VIethionine, 
N=Aspflragine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlcnown, *=Stop codon, i'=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GDLFF 


1446 


15347 


A 


1454 




400 


temvlhtemvlhi s vwdplldrpc3rgvt 
wlmltaelflyggteafsssfdv^gpcs 
etvecfsd/l*a*gpp*nt*spiyrfas 
fllafciylletrsp\svtqsggk/wqd 
ltslqplpprlk*ssclslpsswe 


1447 


15348 


A 


1455 


77 


398 


EAEIVPLYSSLGNRVRPPSQKMYIYYEQ 
LH2Un'PGN]:jAEMDKLVERYKVLSKFT*E 
KI\LNSPVSIKEREF\AVISLPQKETLG 
PDSFTGGFYIFKEELMPILQRLFW 


1448 


15349 


A 


1456 


397 


69 


cppglsglpwva*avpp/erpapppppp 
agidaglgddplqqthha/ppaaagsap 
agcaagpargaprgrspprrgsagaapp 
pwppaaaaassaggssapcasspaappt 


1449 


15350 


A 


1457 


400 


2 


ALFFWAPKKKKNFSPPPGKKFFFFKGPP 
PLFFFFFFFFFFFFFFFFFFFFFFFFFF 
FF*ERPIFILFYFIilTQKNIPRDFFFFFF 
FIiRDHVSFCQPGWTAMV*SQLTAA\ *TP 
GLKQSSHLSLPSS *DYRWVPPC 


1450 


15351 


A 


1458 


343 





QGVHSRKYKEISKLNS *KKNPTRKWAKD 
TNHHFTKDNIQMKNKNMKDVQARCSGS * 
F*KN*NFKSYAL*PKCSEI/KKSVTGTN 
WRNSQIYDKLSTLQira*WIFSRSLYNFE 


1451 


15352 


A 


1459 


109 


401 


QNYRQLGQWDRIESTVIDPCKYGKLIFD 
QSAKATEWRKDS*NHQTS/WWKESFTRV 
KALEE ITKRKVNSLASI SVKTDVHVIKV 
*LKKKKKKRGGHFKE 


1452 


15353 


A 


1460 


42 


424 


CPAN*NSFSRDHySLPMLPRLIMrSWPQ 
WLL/PWPPKVMGLQ 


1453 


15354 


A 


1461 


419 




IFPLKKKKRGGGVGPPLYPPPLGAKRGG 
SP*KQNLKPPRPHKENSPFFYKKKKKGG 
RGGAPFFPPPLGGKTKKFFLPPK/IKVS 
FNPNLFPPPPPPGGKKKFFFPKKKKKKK 
SRNTVWF 














1454 


15355 


A 


1462 


1 


277 


CHTNPLCMSFPPAISHHPLMPPHPVSRS 
VAQAGViWCDLGRG/CSELRSCHCTPAW 
MTERDSISQLIRKKKKLIKIKKKKNKNK 
SVIFLGGGS 


1455 


15356 


A 


1463 


382 


3 


KAGGSINQNSPPPPPPGEKKKKPPPKKK 
KKKKNPFWGGPRILPHNPPPFER*R*KI 
FLGPKFLTPLGPKIKPLB'FFFLKKKKKK 
RMTY/HSVAQGPLLNKDTLQAG\LSKA* 
RSPPKSKSMGQSFHRN 


1456 


15357 


A 


1464 


561 


86 


NDPILSLKAEKTFGKIQDSFLIVSSSL\ 
NKPGIEGNLLS* * KASTKQNP IMNLRV\ 
LWFPLRS*TRQVCLLSPLLFN1VLEIL 
AHIISQEKEIKKIQI*YKEEKLPLFTSS 
SLFI*VKWLMEFAKKLLELINEyiIKVER 
YKIWIKNILIAKNTWTLKF 


1457 


15358 


A 


1465 


3 


221 


RFHRVSQDaLDLLTS*STRLGLPKC/WD 
YRLEPPCPAKTCLILNGSC*VFLCYLSC 
FKAQERPSQNSWGAFVTL 


1458 


15359 


A 


1456 


2 


396 


WWPAWTHTPGLKQTSRFSLPECWDYRRE 
PP\PGLVKFLL/IQYRWTQHVGLGDWV 
QDHHPHIEICLHKSVI*NGIVFAyiILRT 
SSLTLFYLRRGLF/SAPQAGVQRHKLSL 
LEPLPPSFKRSS\CSSLLSSWDYR 


1459 


15350 


A 


1467 


349 


413 


RLGL*PRKDITD/HVSLHKDTG/WPGAV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 

otide 

location 

correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaniiie 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metiiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptopljan, Y=Tyrosine, 
X=tJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AHACMHS TMAGHRRQI PRSGV* GQPGQ 


1460 


15361 


A 


1468 


403 


2 


LPVEWKGGIFGSFHPPPLRLKKFFSPTP 
PKS*D/YRGGPPPPGYFFFFFLKRGFSP 
FGRVIFKFPPPGVPPPPPPQNFGFKGRS 
/HPSRALFFFFKEVFF*IYPFFFFFEME 
PRSVPQAGVQWHDLGSLQALPPRFM 


1461 


15362 


A 


1469 


423 


3 


lEFSFFAPSKDKGAILGPSTLPLPRFNN 
FFCPPFPINKDIRRGPPPRPNFFFFRKR 
GGFPHWAGMFLSFNLQEGPPPSPPQ/SV 
GF*GEN/HPPRAQFFFFKKFFF*CyPFF 
FFFEMEPRSVPQAGVQWHDLGSLQALPP 
RFM 


1462 


15363 


A 


1470 


13 


427 


RTRGLVFDKTEFKPPKI KKKKKA.\GP FL 
RGNRS ILKKKATFPKNLFAPNPGPPKFI 
KKVLSDLQKNLNPPPRMGGNFTPSLLK* 
NKSSKQQITRDFRDLTSPLDQGDL*KIY 
KTF YPKTTE YTFS SAPHGF Y / S * FDHKI 


1463 


15364 


A 


1471 


378 


1 


FVRPPFFFFSSSRPFFKWGGPLPPAPQ 
FFFKTPRGi!IPLLRG*KPPPQTPVGGAKG 
SPPW/VPPGFPKKGGGVFFFQQGL * KI S 
PPGPAPP*I,VWGGESPPPKKKKSRPGW 
AHRCNLaLWEAKAG 


1464 


15365 


A 


1472 


412 


53 


SRLSFFLSSEPHGPPSPLGPFATNKVAF 
C*/PPPPF*PSPPPKFFFF*GPKSVIYF 
YQR*PPTSRIFPPKGGAGPP/PPFWGFV 
NPKKK\LKPFSKPSPPARKFKNPEGTKL 
GFFKKKKKKL 


1465 


15366 


A 


1473 


116 


34 


DWNIKPVLISNVMTGLMSMIITLLLQLF 
LL*LFFFK/WELTFFGGKFHQRVDPSFQ 
GSLGRGRQFFLLTKTEYHI FFL IKVENS 
KTEGAGLAT*IK 


1466 15367 

1 


A 


1474 


42 


428 


EIIMESINRFDVlTF*N5'CL*CyriMSKI 
NRQASSW/ENV*NQYVGEKILIFLIYKE 
CIQINKK\KLRPKIDTQVKDKQISEEGM 
QMftNQHMIQC\QPSLVLNKMQIEIAEGH 
HLPYQINKDSKMMVEERRQ 


1467 


15368 


A 


1475 


87 


433 


PQSSPFSDHCSYQRLFLHLVKVSTYRLQ 
KIRKIHKSPGMNEYFGGFFLFFSFLFFF 
ETGFNFVPQAGVQGQDLG*LQPLPLGFK 
GFS /CPQPPRTFFFFFFLRVWGGFIQKK 
KFLFV 


1468 


15369 


A 


1476 


212 


430 


SLILMTSNGIHFIYFLFLLLLFFFF*DR 
VSACHPGWRAMS * S YFT*ALTSQVK\QS 
YLSLSSS*NYRHTAIMP 


1469 


15370 


A 


1477 


493 


2 


PGAAAHACNPSTLGSQGGWITRSKI PEH 
PG\TLWNPRSY*KTKMCGLEAP 


1470 


15371 


A 


1478 


454 


492 


HRVGEERF/CLFETESHSLTQDRVQWHD 
LGSLQPPPPRFKQFSWYHI *PHAW 


1471 


15372 


A 


1479 


2 


213 


IDQERERLMEGDRERDTETDAEKDMGRE 
/RNRYRERERLRG\RRERRKERDRMT*M 
PRERERENLHSLYRETYRDFETEWVMDR 
ERQRLKHRLL*AVIVPSHSSLGSRRKTP 
FQKKR* REGERKS EFLIQRD I 


1472 


15373 


A 


1480 


77 


453 


SFGDSLTLSPRLAVQWVYLGSL*PPPPE 
LK\YSPTSASQVIHYLLFFFFFFGKKVS 
FCPQGGGEGPPFGLLEIFAPGLMPFFCL 
NPPKGWVWR/RPPTMPKLFFVFFIKRGF 
SPGEPRGVSFPEPGT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Preilicteil 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argjnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1473 


15374 


A 


1481 


398 


96 


KRPWCFFSPPPKKGFFPPLIFVGPRFF 
SPPPVFK/SPPPFLFFFPP*KKKLFSPP 
PRKI IFF*KPPPPFFFFFFFFFFFFFFF 
FFFFFFFFFFFFVFILLLS 


1474 


15375 


A 


1482 


484 


64 


QKRQMMJKYMK ICS PLFVI REMQTNTT I 
RYDDIPTRVAK1IKT]S[K*KSNSTKYCQEC 
ESARP* lYCWWKYICMVQPLWK/S/VWQY 
LLKLNI*LPYDPVIPLLSMYPSQIKTCI 
PTKTCRQMYLVTLFI ITKNQKLAHSARV 


1475 


15376 


A 


1483 


2 


400 


YNKKGQGVAPRYDVDTAftNFPKELAKIl 
DEGGYTKQQSFNIDYTALYWKKTPCKSY 

SWR* ih/slaskdrlt\llvgnwgdfk 

LKPVLIHCSKNPRVFENYTKSTLPVLCK 
WNSKAWITAHLFTAWVAEYFEPT 


1476 


15377 


A 


1484 


459 


65 


GGPPPPHKTIDFFFFFFFRGESPPPPKK 
KKKKKK6*DNAEDEAQS*RPPPSI*EKI 
MHLICALTEEY* *LTAEIIASNIHISTG 
SAYIILTEiyiLELSKIiST*WVPKLiFyPNQ 

lrt/rael*mvilnk*dqqf 


1477 


15378 


A 


1485 


2 




PPPPPQRFGLRG6APPPPKKKB*DNAED 
EAQS *RPHPS I * EKIMHLICALTEEY* * 
LTAEIIANNIHISTGSAYIILTEMLELS 
KLST*WVPKLFyPNQLRT/HAELSMVIL 
NK*TSDP 


1478 


15379 


A 


1485 


122 


501 


PRDPPAIALQNAGITGLKQSYFLSLLSS 
WDYKHKPPCAPGYLK\VFFLEN*YIiALY 

VQLFQKK/DLQHGCLLTPSISGYSVTWD 
GVQWRDHGSL* P * PPRF 


1479 


15380 


A 


1487 


491 


425 


ipkndi*aep*rinkr/cAkRkmvhkgf 
eaestickalryeeivcsrntefvhgwl 

*ENRGQIMKGFGNSVEKLEHN*KFYEG* 
LLYRFE*IiAaGFIVFQQ*VSKEIVKaWP 
GTViUIACNPSTLGGRGGWITRSGDRDHP 
G*Ha 


1480 


15381 


A 


1488 


441 


17 








WCQDKIS/IIQKMKVDPYLISNI\NLKW 
LKDLNVTAKT IKLLKEMTGAILHOLGFC 
NAFriDGKPKAQTTKKKQVK*TSSKFKSR 
CQWLrpriSTLWETEAGGSPELRSSKPA 
WPT 


1481 


15382 


A 


1489 


413 


1 


LFWKEMVPLPWPGRVGNPGPK \ GWLRLR 
PPPF*FSFQPFWGPPVGLLFPKNFYY*P 
PGGAHGPNPP PPGGKG* G P PRVGGGKFN 
*PKFGPCPPGWATKQKPVFQKKKKPNKI 
KNRTVKVPKVTKFIYYVPYLNGVSN 


1482 


15383 


A 


1490 


359 


407 


RHRMITFFT* IVTQKFPYL*KKMDIQIQ 
ED* RTPN/ RDQHKHTPRHl 1 1 KLSKSQ\ 
MSKRILKTRKK*LWY*GTHIRLSTYFW 
PKS 


1483 


15384 


A 


1491 


397 


39 


EKFRGRSFAKTLFSGPGQG/ PKGNPGLI 
PEGPPPFFFWPAEDGGFFFPPGQLPPQG 
P I RGPGPFLGFPVP PPEGEGP P PTGP * K 
PGQKRPKKRGWS PQKKGPKKKTLFFFFF 
CQMESRSVT 


1484 


15385 


A 


1492 


378 


1 


FGVFWFLAPRRK6GFFPSHLIWVPPGFS 
PPPGV/SNPGPGIKFGGPIKKIFPCPPR 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutaniine, R=Arginine, S=Serine, 
T=Threorine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGKFGPFKGAPPFFFFFFPQDGVCSVAQ 
AGVLQ * AE I VP \ F P PT * ATEQD P VSKKK 
PTRPPTRPPTRPPTRP 


1485 


15386 


A 


1493 


3 


428 


IKSP*PDRFIAEFYQTFKELVPILLKRF 
HKI /EEGTLPS * FYE/ACVTLI PKPGKD 
TTK 


1486 


15387 


A 


1494 


3 


417 


ILli|NARIiKPFLLR*TTRQGILLLSLLFN 
KI*VLEFLAGAISQEKEIKVIQVRNEEM 
SKTLFSQTT*S*aRHPFlKIPLPKMSW 
N/NFIKVTEYKVSIH 


1487 


15388 


A 


1495 


429 


4 


PFCLGSKRFEPFFNPPRGGKF* KKKKIF 
GPPHGGPPLFPPLWAPKGGGPPRAGGSG 
PPPPKGGNPLFINPKKK/PPPPGGAIFQ 
SRFLGGVNPKIFLFPGGKFLTDPGFPPS 
LPPGGKKKKPRFQKKKKRKKRKVKEKKy 
FL 


1488 


15389 


A 


1496 


3 


433 


FSKEDTPMVNKHMKD/CCTSLVIREVQI 
RTTTRYHLMPTRMAMCIYIFNYILFLK/ 
SKNNKC**RYREIGTLIH/AQWKyKMLQ 
LLW/KTIWQLLRMLNTKLSKDQE\ IPLL 
GI*KKKKKKKKKRGGRFKGINFTDPGVE 
RINFYNSAPK 


1489 


15390 


A 


1497 


3 


326 


WPACL * AVAAVALLVPEATRLTMGNNLT 
VCTPHSIAELLSSKG/DLWLTDNR/LLK 
YQALLLE\DLQLRTFTCLNPATF/VPEE 
TGEPEHDCEWVWQTGKRNNKDHCLYSL 


1490 


15391 


A ■ 


1498 


345 


22 


KSKWLIDLHGKCKTLKR*YHKPTK*\LG 

hgndfldttsnawf i /rkidkldf i ink 
nvcsgkvtvkrmk:r*ttdrdkisskgis 

DKRLLCKIYIKILTTQQ 


1491 


15392 


A 


1499 


194 


432 


PWCVCVCVYVCMCVFETQS/HVARAGM 
Q*HNHSSLQL*TPGLKQVSCLNLSSSWD 
YRHTPN*FFFF/CNFYIiERGGVS 


1492 


35393 


A 


1500 


2 


417 


RD*FMRFK\EKSySCNIKIQHEAflSA]!IV 
ETMASy*EDLAKIIlirSGSYTK\SQIFNV 
DEVAL\KKMPTRTFIVRERKS I PGFKAS 
KDRLILITiG/ANAPGNIKWKPVLIYHST 
NPRAFKNUAKS ILLVLYKLNSKAWRIAH 
LF 


1493 


15394 


A 


1501 


414 


0 


SSSSSSSSSSSPKWTTGALQP/LLSRTP 
LKDSS EEESSQ*AEL*AVHLVVHFAWKE 
KWPDM*I.YTDSLAVASGLAGWSGT\MKK 

HDWKI CDK/DWGRGMWMNLS 


1494 


15395 


A 


1502 


3 


125 


RLGLPKC*DYR/RAATTPGLH*F*SWKE 
QRPQ * LALGRKPVNRNKNTQKY I H 1 YT Y 
TH/PIPTNVYICIYTHIHTPV*YTHIIY 
1 1 YLFK\rLVPGFLRRSLT/SVAQflGVPW 
RDHSSMQPRPPYKQLTCH* *LASQSAEI 
TGGSHHTWPALILKLERTEAPVISFRTE 


1495 


15396 


A 


1503 


406 


172 


DIILDR*RQKQRLRLRQKQRETETE/RR 
DRGRERQRYRQRQVQRQGRRQRQRLRHR 
Q * QRQ IDRLRQRQ /RAERDRGRGRDRDR 
GRDRGRDRKRQRLRQRQR*RERQRHRQR 
/RERDRQRQRQRDRGRRRDRETDRGLDR 
G*DRGIDRGRCSDRGGDRDRGLDlaSDR 
DR 


1496 


15397 


A 


1504 


3 


420 


ITGVSHRAWLPS* FLKFFWVEMESHSV 
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residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQAGLK/ PSGLKRS SHLDLP KCWDYRHE 
P\PHLAFFQFF 


1497 


15398 


A 


1505 


407 




PISIYHSKDPGSLKSYTKSTLPVF*KCN 
NKTWLTAYLFTAWFPKFFKPTVETYC/S 
RKKSPFKMLLLIANAPSHPRALMEMHKE 
I PWFMSAVTS ILQPMDQGWLTFKS YY 
LRNTFCKAIAAIDNDSSGGSGQIQ 


1498 


15399 


A 


1506 


1 


408 


PPGFKPFCLSLPS3WNYRHPPP/RPG* F 
FVFLGGTRFHHVGYTASQHLTSRETHAY 
ALQ 


1499 


15400 


A 


1507 


2 


416 


EPRSHHCTPAWQLSKTLSQKKKKKKRKR 
MVFTGRKKGLFFC3N/LKMLGTLSREYPL 
GPGIKlSlRLAQKRKPLFYKKRF*ISriNEGG 
GAHPGGPKSWERGGGRKI*TPVGKrASN 
PEYHICIPPGKRNQNPFLQKKKKRRRD 


1500 


15401 


A 


1508 


276 


14 


SPPPYFLtlR*GRKGRGQF**LRLFFII 
NLRQCL/DSVSQAGFQWHNHSSLQPRTP 
GPK+PSFLSLPSSWDYSHAPQLLAFYGA 
NCFN 


1501 


15402 


A 


1509 


1 


391 


NIFKEIMSENFPSLMSENFPSLGKEIES 
QIQEAQRTPNKMNPKRSTPKQVI IKF*V 
RE/MLKTAREK*IVICKGTTLRPAVDFT 
AETLQIRRE*DDTFKTLja3KKKKKKTLP 
AKLTFPS ILVFKKTRGGSR 


1502 


15403 


A 


1510 


2 


419 


PRVRSRATNVISKYKQHKKKlSrTSK*IKD 
LNVKPEPIKLL\EKTTGEKLLDIELGNI5 
FLDS/TPKTRAPKANLTP*NYFKLKSFL 
TAKETFlSr\KTPTKRGANHISDRGLISKI 
YKELTIQ*QKNNLISK*AKDLKRHFSKE 
DV 


1503 


15404 


A 


1511 


392 


122 


SVSLCVSLQSSGMFRC/'LPTMPLRR*RQ 
ENPLTSAGGGCSEMRSHPCTPAWVTQQD 
S VSKK* INK* I K* /Q *LRQ1WKI KPSP I 
FCEQGKRRK 


1504 


15405 


A 


1512 


2 


281 


GGCS ELRS CHCTPAWTT \ SETLSQKKKK 
RKKICIECNTLKSH/ILSFVCGGKYHSP 
VSCTCL*WSI.GRSLKRSLERHQKNLNDP 
YPIFQNETNIH 


1505 


15406 


A 


1513 


242 


382 


QGPNLG*LHPPPGGLKGFSPLTLLRSWK 
NRLPPQHPFYFCFFRKNiCV 


1506 


15407 


A 


1514 


1 


388 


rtrgterdt/rfrerrrerdrgregyrw 
ryrererhreres * v* rdrvrdgdrvid 
rererqsererlr/rerdwerdtyivre 
teteretegerdiererdivrdrym/rd 
rdrdi lre * erdretaldsek 


1507 


1540S 


A 


1515 




285 


TRXQICNGDKTALY* KEMPS RMFPARVE 
LMPGFKASKDTLTLLLRPNAVGDVNLKP 
MMIYYSENPRALKNYDKTQLCLYSTNGN 
KAWMTDYRFT 


1508 


15409 


A 


1516 


415 


2 


MGDRESLPPSRVFLFFLAPCPKGILFHP 
V*FGSVRSFSIERCYR*GQRNEFWGPVI 
RVKASSSRAG*WFSSTAPPLFF/CFFE 
TES CS VAQAGGQ *CDFSSLRPLP PKFKG 
FLCPSLPDAW 


1509 


15410 


A 


1517 


1 


324 


PTRPEMGFRHAAPAGLELLS3SDLPTSA 
3Q3GGITGV3IIRTWPILA3NNY/SMDKL 
CAI/CERFILFFLKNILLWILILFIFIY 
FCLIK*CITINFSQNSLLPHSIHLYI 



626 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 
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Predicted 
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nucleotide 
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correspond 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lso leucine, 
K=Lysine, L=Leucine, M=iMeth ionine, 
N=Asparagine, P=Proline, 
Q^GIntamine, R=Arginine, S=Serine, 
T=Tlireonine, V=ValiBe, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1510 


15411 


A 


1518 


2 


163 


ACRYPWLNFVLLVEMGL\TMLASDLERS 
G* P Q / CDL PAS AS Q SAG I TAES HHGW P 


1511 


15412 


A 


1519 


425 


52 


vlrgaqflfqearlkwdfgpggmappcn 
ppplgv*kgplpkg\ggpnppgpprghp 
vfskkpkitppgggapevpppwgggaek 
ffypggprvqgte/rktpppppgrqkgt 
plpqkqkkkkpwsl 


1512 


15413 


A 


1520 


3 


404 


THASGKSNI RGLTL P VFKTYYKAAV I R / 
TVWP/WLRANTNRQNRL/DGPEVDPYKC 
SQMIFDKGAKAIQ*RTDNNLLNKW*/ES 
* TSTCQKKKKKNQFR/DPAFLI YTKFMS 
NGRKNLRVIGKT*KVLKKN1KQNWGDLG 


1513 


15414 


A 


1521 


332 


39 


T SRE TGNFQLDLTAFUSIEVGVLFLWRKG 
KGKIDIGIGTKEMP*FPI*LLMFSYIV1 
ERERKPE/LSLSPSLECSGMILAHKKNS 
LPGSSDSHDPASEVAG 


1514 


15415 


A 


1522 


482 


0 


raetsgvllcspgws*tpdlr*Slclsp 
pkrwdyrt*atvpgllyslysrpsp/de 
lkgceksrtspa 


1515 


15416 


A 


1523 


1 


417 


NKC**RCEEKGMPVYYWYECK*GQPFWK 
TKERFFKMSTL/NIELPYNTAIAIiNMRP 
Q*I*SQ*RKVCSCMLCaT\PTPNK*IKT 
MW*VCLVEYSSPLKKN\LLFSTTWINLE 
DISSNK/LRHRKSS ISRFHLLL 


1516 


15417 


A 


1524 


1 


397 


VFPPAGPL/WSAPTRSSLaPTPQLGGHW 
PRTGITRLWRLEVPLLNTFVLIiASGVSI 
T*flHHSLIESNEHRIIQALLITILLGLY 
FTLLQASEYFESPCTISDGIY 


1517 


15418 


A 


1525 


2 


376 


LKAKTGQKLGLLHQTVSKFVNAKEKFWK 
ELL\KSATPVDI*MIRKRTSLITDMETV 
* W*K3DQTSHP I PLS *SLTQSKALNLP 
KflMKTDEGKGAVE*KSE?aiRGWPMQFKE 
RSC/EPCNIKVQG 


1518 


15419 


A 


1526 




386 


ESMLKAKTGQKLGLLHQPVSKFVNAKEK 
FWKELL\KSATPVDI*M1RKETSLITDM 
ETV* W* KEDQTSHHI PLS *SLTQSKAL 
NLPKflMKTDRGKGAVE*KSEANRGWPMQ 
FKERSC/RFCMIKVQG 


1519 


13420 




1527 


127 


388 


KRKSQINNLLLQFKELEK/QEETKPKAS 
RRWKKKKIRVDLLKIKKGKPLEGVKKKG 
GFF*RTNKRDKPLLKLPKKKGGGRI KTF 

HKTS 


1520 


15421 


A 


1528 


3 


402 


henhmki cs ts yvirelq ikttmk* yyt 
pvrttaiqntdhtkcwqg*req/6sliy 
cw*qckmvq/plwkrvwqfptklkhsln 
i*savvllgiyp\ksgkfnvhtktct*m 
flak*pqcssvdeer:<km/wymygfe 


1521 


15422 


A 


1329 


2 


365 


leklyrsgakffcrdrvslccpgwsrti 
glkqsscfgfpkcwdyr/r*aalsgrll 
ltissh/reqieqefthypeksaiclfmq 

DGPPRRK/HPPTRPQASNIQNKFSFFLR 
QSL/NSLAQAG 


1522 


15423 


A 


1530 




418 


GTDTE*ACDKIQKPFPDKRLNKLG/ IRK 
KHLQLDKYL /WKNPIAS I ILKSRRLKS V 
RLRPGDGRQGCSFPAWLFNI ILEGLAHA 
IR*EKQKQKQGQ\QIGKKEVNLSLFIED 
IMLYIENLKESTKKPIIIINEFSRKEDF 
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location 
correspon 
ding to 

residue of 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=S6rine, 
T-Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RP 


1523 


15424 


A 


1531 


381 


3 


ILQGFVIRKKSKIFQKACHTSDLWTLTN 
LSCLHMKGLPPHRLYSSCTVLLFLRKPV 
MRKTTLSCCFLT*DFFLPLLPPCFFR*F 
ILLIP*LyFCKDKVSLCCPSWSKLLP\Q 
QSS CLSLPKYQDYKA 


1524 


15425 


A 


1532 


38 


479 


deacgpqdpyltpyvktqwikd*trnkg 
iqfleengkn\fd1gpssdlldmtpktr 
atkvklVndirlrnfcaskdtinlsllc 

R\KR*PVEWEKiSAKHISDKGi,iSGiYR 
QPPLNSKTSHLI * K*AEDFNRHFSEEDI 
QSALYRWVL 


1525 


15426 


A 


1533 


105 


447 


LIFCRVFEYLHSLHLPQEICLSLALFSR 
FTFCVII CEVDVWSVIFKVPPCSKRNKV 
AVHTMLYIQIFVSLFI*PQNWKQPKCPA 
TVERINKMWYIHIV/EYYSMTKR 


1526 


15427 


A 


1534 


76 


471 


VWVCLLSLEGSQSKFGNSIEFGVLLfiSG 

gfsawrlpffvyflrqsl/nsvaqflgvq 
qwrdlsslqplppgfk*vlkqrgvclfv 
cfetkshsiaqagtqwcdlgslqplspe 
fkrfsclSslsppwpsg 


1527 


15428 


A 


1535 


45 


338 


SNHEPFLDWIVM*RKVDFIRQPAMTSSV 
VGPRRRSKALPKAKLAPKKVMVTIWWSS 
ARL IHCSFLNASET lASEECTQQIDDMH 
♦KLQRLQAALINRK 


1528 


15429 


A 


1536 


425 


1 


FFWITFHSVSFSPLSQKCILTL* ILLKS 
IVKNIIMENFQRSLVRK*AKDMNRHFTDD 
HVQMAS/KHMKRCSIjVIGEMQIKNXTVS 
YHYPPIRMIKVRNSSNTKCW*GCGQTGS 
LMRCWW/]SfK*lSrQLLWKTGI PPHGLVSTR 
SRVF 


1529 


15430 


A 


1537 




443 


ETFVDHYQCGGIRPFSDLQLHAGRTTAL 
FKAVRQGHLSLQRLLLSF\VCLCPAPRG 
GAYRGRQVSLSCGGLHPVRASWLLCLPK 
*AWTME6TSTPASLPPCSLISDCCASNQ 
RDSVGIGPSEPGAEYNLLVPRPLSPSEK 
RSIWVGV 


1530 


15431 


A 


1538 


487 


3 


TQNGGVIiSAPRS VFSPTTLR/ CTLQAQ 
C*AFWWGGTQQRASSTAflMAAMKPLGIW 
AGGAAGILPKLGFQD/LPLSAEADPAGK 
ELS IGRQRAWR3QPDQSAEPPFSQAPRP 
GYPPSPQPLSMRRGPGANPRLARPLRGP 
VRVRIiRRASSERQKRSRGGSGPLG 


1531 


15432 


A 


1539 


394 


489 


lYLPIFETESRSVAHAGMQWRDLDSLQP 
SPTG 


1532 


15433 


A 


1540 


475 


202 


PGGGWPSPPDNPSLKENSQGGPPVSHPP 
PPPGNGEGGKTPGAGHSGI *NPPPPLKI 
N/LEKGGRGGPPGPLLNPPFPKEKPRGG 


1533 


15434 


A 


1541 


14 


46S 


LSMWWNSRLKARRLVLVSQWNA\KERF 
LKEIKSVIPMNTLMVRERNSLIVAVEKV 
LVA*VKEQTSHNIPLCQSLVQSKALTLL 
HSVKAKRG* EAAEEKLEAGKHWFLRFKG 
KSRLHNIKVQGEATSTDGELQQVTYPDL 
VKIVGEGDYPKQ 


1534 


15435 


A 


1542 


479 


141 


RKTDSWDLIK\SFSTAK*AINEADGQTT 
EWEKTFAMYASDKGLISRKELKQINKKK 
aWSPIKK*AKDTKKQPTNMKKCRTSLI I 
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sequence 


Predict- 
ed end 

otide 
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ding to 
last amino 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoph an, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 














REMQ I KTTMRYHLTTESE* LLI KSQGRA 
L 


1535 


15436 


A 


1543 


17 


474 


NPKKKKKKKIDLLDTYYTLYPK/AEYTF 
SSE IHKMWLYIN*LRAWLNKFIW*KIKT 
ILSDHNRRQPElNKCNKT3a?/RVT/TQK 
FKNLILSNYWVKGELQTQIAEFLKRNTO 
low I YLNT *NT I ICVLNSGGS T /ALNT Y I 
DKtJFKKRVPS 


1536 


15437 


A 


1544 


89 


481 


ICLKVISLRQENATSVACIVSCLSEGSR 
ASENLVLHNaSPNSCGLGIAFTCLCNW 
SSGNKITDCYD*LSSS/WLGFFFFFFLK 
KSFLFVAQAGGQGGNLG*PKPLPPGLKQ 
FLPPPRANFCFLBKTGFFLF 


1537 


15438 


A 


1545 




300 


PPPPAPXXCRPY*XPWXXPVYHSSWRHG 
SGftAQGAVIiftGFGGVGRC2GPGAASVSIP 
LCPENQGCKEPGPSHAVPAPSALPSLRS 
LTGGQIGGTRaaQAVG 


1538 


15439 


A 


1546 


2 


436 


GAPEIKSIIKGYKD/RLCTTKFYNLDEM 
DKFLVRHKLPKLI * E* IDNLNRWITSQE 
TDW*I*QQSSSSSSSSSS/PSSSSSRPN 
GFTTESYQSFEDKLIPIICKLLKKIDKE 
\GHFPLQL*GITQIPKPDXyH/lENYRP 
ISLM 


1539 


15440 


A 


1547 


54 


419 


PATWfiDHLSQGflRDCSEPRLHHCTPAWV 
RE*DPTSKKKKEK/NDQ*LI.IP*S*RRL 
*RSSHQTSHLMDWNSEflGR*EATFLRSY 
SKL\KEELELLr.ILDKNDDHKNSD*YLr, 
SIYSIPGrVLS 


1540 


15441 


A 


1548 


37 


339 


KRWKCLRA*LLMRPRHftD*U!IPDGVGYS 
EL*SRHCTPAWVTEQDLVSISTI!IRKNER 
HTLEYSHQHXSG* P*TG/EESEYPSAL* 
QRE I IDYS FIQGMTDQL 


1541 


15442 


A 


1549 


477 


1 


PGSHDLGSYT*PQ/WSSPEVTSRDAPS 
HPSAPKFCSNPCHGRWLTSSRQPKLRLT 
SAVPGAPGAHGTFHALGAGAAEAGGHSA 
SRPEAALCRPLPPLPMTLXSHPLLSGPG 
RLAWGCNGRR* IKGGG 


1542 


15443 


A 


1550 


430 


8 


CWPGSSGTPDLK* STRPGLPTCWDYRHE 
PLCPASKTFLSPQIETPYPUmmflHSPR 
RPALVNYSIiSVSMDLPILXAFHTNGIT 
GYVASPAPPVRWGSHSVAQAGVPWPNLG 
SL*PPPPRLK*SSRLSLLSSWDYSHMTP 
LN 


1543 


15444 


A 


1551 


2 


419 


ETSPSLQGWLGVLFPKRGAKTSRFLIIR 
PQGGSFKDGDFFNPPGEIKTPPAKKKKK 
KDSARSP PAHLQA* GAGLWDARASFRRP 
FQAPVPLSAQPRAQPLAVGTSRDGGSSG 
PTPGQESAV\PWREKHPQQPPPQPG 


1544 


15445 


— 


1552 


2 


387 


FRHVAQAGLELLG\SSDPSVSASQSrGI 
RGMTYHAQPGL*LFWSKNS ILSGPKV/L 
QDYMW 


1545 


15446 


A 


1553 


393 


2 


KNPIFFFLKQGFPFFP*LEGRGKFLGPC 
HLCFPGSKKG\LPHPPQLIGAPGPPPSP 
GFFFLKRGFS/LFCPGGSFSLRRKGPPP 
PALPKF\GFLRVTPLAGPGNPSFFFFFF 
RDGVSLCHPGWSAVAASRLTA 


1546 


15447 


A 


1554 


1 


427 


LLLLILYAIVEAVT*TEGAGYPPLPGSY 
SHRGASGHRflIFALHLTGGCCI*GAMHC 
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nucleotide 
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SEQ ID 

NO: of 

sequence 


M 
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od 


SEQID 
NO: in 
USSN 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

nucle- 

lo cation 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Giycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=lInknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ittmidieppaitqyhtplla*sglit* 
glVlvlslpvldagitilltdrnlittl 
fwpdgggdp ilyqhll* vfghp evyiri 

LPR 


1547 


15448 


A 


1555 


3 


382 


TLyLLFAA*AGGL/GSALSLLIQAELGQ 
PGNLLGNDHIYNVIGTAHALVI ILFIVT 
P I I IGGFGN* LDTLI FAAPDMALSRLNN 
ISL*LLHTSVLLLLASAIVEAVA*TG*T 
GYLCLftGNYSHP*ASV 


1548 


15449 


A 


1556 


380 


3 


EVSPCCpeWS*TPdfKQfACLGLPRCWS 
YSREPAPPAKACFLDI*VK*WFCK*GSP 
EVWSRPGLYLFIYLIKQKKGVK* *PJ4EY 
LVLFFCFFVFVFVFVFF*HRI S \SVAQG 
GVQWCDLGSLQPPP 


1549 


15450 


A 


1557 


100 


254 


iptvpty*tpiksfhrariessgpgysw 
pvdsak*vpi:awsldsr\rdsgnlvhp 
lmrvtn*mkrhlvtltqs\cys 


1550 


15451 


A 


1558 


2 


289 


APGVSSTMEDEMGGGLEPQHRGCSKQRS 
HHCIPAWATE*DCLKENNNKKNYLLLIS 
IiRTQSYLYF/C*V!SISY*LKLS I KLAGGT 
GE!CEH*SQKRKSK 


1551 


15452 


A 


1559 


24 


354 


PLPSASPGPEGATPVPTS/ACPWKIKLY 














KTKKKNKKRKQQiaStE *ETG 


1552 


15453 


A 


1560 


376 




AARGSGVRDPLEEAVCLFSDLQLRAGRT 
TALLKM,FK\RQGHLSLQRLLLSF\VCL 
CPAPRGGAYRERQASLSOSGLHPVRASR 
LLCLPKQAWAMRGAPPPASLPPCSWISD 
CCASNQ*DSVGVG 


1553 


15454 


A 


1561 


3 


408 


AASTVFLPFLERKGIDFGFLFFFFGEKK 
FPFLAPGGAPGGLFSFPEASSPGLNPFF 
WPNPPEK*KKGGPPPPPGFFFFF*KKRG 
FPGG\PGGAPFPDPKIGPPGPPKGGEFR 
GGPPPPGPNFFFFFLKGKGGGGPP 


1554 


15455 


A 


1562 


355 


161 


FKPGDGGCSEPRWCHCTPIWVIMRDSVS 
KPKQSKTKKR1IVFT/C*D*VF *iraiIINY 
INCFS *RALyiLYEKQFM 


1555 


15456 


A 


1563 


410 


1 


tpppppknflgtplfpknkagkglfppp 
lgfpkggqwpippqrfpplfspkkrad* 
kpkpppalkirgpwpgvfpkgfcgfsff 
ppppfprggglifflpp/tk*sgg*kkp 
kkkkkekcpkkqrggidqltsnlgv 


1556 


15457 


A 


1564 


2 


374 


ADRNLNTTFFDPAG/GVGDPILYQHLF* 
FFGHPEGYILILPGFGIISHIVTYYSGK 
KEPFGYIGMV*flMISIGFLGFIV*AHHI 
FTVGIDVDTRAYFTSATIIIAIPTGVKV 
FS*LATLHGSNMK 


1557 


15458 


A 


1565 


396 


0 


IIFLIFLRHGFAWAQAGMQWCGLGSLQ 
PLFPGFRLFS/CLLSSWDYR*RQGFTML 
ARLVLNS *PQAI\SASPSQSPGITDVSH 
CA 


1558 


15459 


A 


1566 


186 


452 


KQKCNFKTLNTKYQPLPFFFLERNFCFC 
PPGGGEGADFTFLEPLPSGAKGFF\CLT 
LQRM/WE * RVS PPTPLNFGFLVKKGFSL 
CGSTGF 


1559 


15460 




1567 


453 


37 


KTALYWKKTPSSTSIAREEKSVPGFKGQ 
AHFLFRGKHKFKLVSKLIYHFENP/R/A 
LKNYAKS IL/PYKWWKAWMTAHLFS P\W 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Pred let- 
location 
correspon 

last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartlc Acid, 
E=Glutamic Acid, F=Pi)enylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoph3n, Y=Tyrosine, 
X=Unlaiown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FIKYFRPTVEM*CSYNKILVLIDTGLDH 
PRPLIETYK\FINLHVCVLENTTPIVQP 
LPR 


1560 


15461 


A 


1568 


453 


39 


talywkktpsrts lareeksvpgfkgqa 
hflfrgkhkfklvsklidhfenp/r/al 

KNYAKS/TLAYKWNKAWMTAHLFS P\WF 
IKYFRPTVEM* CSYNKILVLIDTGLDHP 
RTLIETYK\FINLHVGVLFNTTPILQPM 
PRAQ 


1561 


15462 


A 


1569 


1 


352 


FGTRRERERERERERERERERERGRGGA 
GHQIPSIEGQHKIVGAIIY\CREC\GEK 
ISGSTSYIKVWDIRDSAKCIRTLTSSGQ 
GIS6DACAATFTEAITSAQGEH*INQIA 
LRPSGT 


1362 


15463 


A 


1570 


394 


3 


TLDWPQSRGGSTGKPVYPSVCCCCCCCC 
CFNFCQEAEKNNAE6LUHNQAGRTKDGS 
PAPSHDH*A/PHGTEV/DLLESTLQTSI 
KQVESKPR\EQRRTGAGGQKEKATC3NPE 
KSVLTSMYTKSQGS EGRLPGNR 


1363 


15464 


A 


1571 


399 


2 


KHQLPVFWQYNKKA.WTTRTLFIJ3WLHCC 
FVFEVRKYLASKGLPFKWLIUJNAPGH 
PPRTP*VQY\KGIEVIYLPPNTMSLI *F 
LDQGVIRTYR/SHYTQYSMQRTISAMQE 
NSNKENI IKVWKDSTTDDAIVA 


1564 


15465 


A 


1572 


37 


400 


RGTITGEAASADQEIflDKLSDAINKIMB 
EKG\Y*LPEQVFNLDESTLFWG\KKKPQ 
RTLLSKEKKRAPGFKTGKDRLTLLFC/A 
NAVKLIIRTALTYKAENPQALKENVKHQ 
LPVFCLTTRGL 


1565 


15466 


A 


1573 


84 


485 


AGHKDSPRPHQTQEPSWLHLWDPAPGLQ 
VELPASPXGRALALLSPWWDGTGRPGA 
GGGTRRGGSGPTGAHGAGGRLMHGGLQV 
PSPAPREGS*GPARYQAQRMWARTAGGP 
STPSAGRSRVPSPHCPGP 


1566 


15467 


A 


1574 


3 


463 


TPAQGIJIDPSNMRKHA.YCX3CCV/ CITLC 
VGAQNKNHAVCGLYSTCPRLCVYEHEHI 
CVNE*VCEHVCERESVRVCESTH/LPLC 
A*TCGPI FGCMSEKHVFSYTPCVHRVCV 
CVCVHLGCCVC/VCVCVCVCVCVCVCV 


1567 


15468 


A 


1575 




383 


FLS FGFAPQaGGQGHNHG*GP P * P / PKA 
KGIFPPHPPEKREQRVHATPPGKFIiDFF 
FFWKKGGLNLGPKKNLGPGGKKNLLVS P 
PKGGGKKKETPGPGGVFFWGGIFFFFPP 
PCSPG*PLSLLKKPKGG 


1568 


15469 


A 


1576 


35 


469 


RIPRCHQPVGPLGCREGAKPQGPGPDAA 
DSHPPASPRAH*/P*SLHRDPIPGFRGP 
CRRNaGAGPRaHTAGICaPQPNS*SHHH 
PWAAAPAGDDSPTTSLGSHVPVGRKSSD 
GCRQRALCTPGSPAPSEftEVGGS PELRS 
LRPAWA 


1569 


15470 


A 


1577 


473 


62 


SLEINPYISGQLIFNQHAKTNH* SMRKE 
\SFFNKWCLDN*1STGRRMKLDPYLKPH 
IKIiNSK/LKDLNlRDTTMINUJDLGFGN 
GF*TMTTK*AIKEKIN/WDFIKI*NFCA 
SNDIIKKVKRPNVTAYTCNPSTLEGPR 


1570 1 15471 

1 


A 


1578 


59 


426 


LERRSFGAPLSPFFAPQFEMKKGFGSP* 
KFFFSPKALNFGGGWPFFPPPKKRFFS 
KWPQEGFIPPPLKKKKKTSQPP/YKFGP 
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nucleotide 
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M 
eth 
od 


SEQID 
NO: in 
USSN 


P ) 1 

besinnina 
nucleotide 


P lict- 
eleid 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 


sequence 


peptide 
sequence 




09/515,1 
26 






G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaiiiine,R=Arginine, S=Serine, 
T=Threoiiine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, Impossible 
nucleotide insertion 




correspond 

residue of 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 














PKESFKRAPPFFFFKGGPFFPLVSNQWT 
GKPRLPPPPPKETQNPSSFQIGGQKKGK 
GGPQSFSFPIKKKKKKGGHSRSRTSPRV 


1571 


15472 


A 


1579 


2 


419 


KANKTKNAYFEGINKMDKPQVI LKKKY/ 
REKT* I PSIGEEKGIMSP\NSEDTKRII 
KECFKQTYAHRFySLKE/MDILLESHKL 
PKLTQ/EETDSLNSPV 


1572 


15473 


A 


1580 


129 


403 


YMFFI PINCQDHPK*KKKKKKKKKKRGG 
/RPFKKTLRGPKLi!JRaEKKKFFF*KGSI 
KKKCLEILKKKLFFGGEKKCKNPPKKKK 
PSREKKKF I 


1573 


15474 


A 


1581 


317 


76 


prfffffpppkkgffsppffffsprffp 
pppflkppprffffgpfkknff\ppprp 
liffff*apppffffffffffffv 


1574 


15475 


A 


1582 


259 


377 


PREMKTYLPTKKLGYKISHYH*WWGCKM 

VQPWKTWOFLKGLNIKLP*DSAVPLV 
GM*PREMKTYLPTKKLGYE/MFTLSLL,I 
IMTK* KQPKCPRMNKW 


1575 


15476 


A 


1583 


1 


415 


PTRPITSSICLRQSYLKALIAYSSISHI 
AL6V\TAIIiNPTP*SFTGAGILIIA\HG 
LTCSLLFCLaNSKYEKTRRRIIILSQGH 
QTLLPLIAF* *LL2UlLaNIiAI.PPTINLL 
GELSELRTTPS * SNITIiLTGLNILGT 


1576 


15477 


A 


1584 




406 


LNLLPLVLGGSSCLPPPCX3GN*KPPPPP 
G\LPLEKKGFSPCGPAGP*PPALHGPPP 
PPLPRGLI 


1577 


15478 


A 


1585 


383 


3 


KKFGYP FYWIG* KI LK*FPG * K*SLPHR 
KSPFFFFF*GRVLLCPPGWRGTTKGHGS 
QVT\LPAAAMTFQV\K*SSPLRIiPSR*G 
YRQASPWSGNFFF/ OiVESLSMLPGLIL 
NYWAIAI KPSGPPKVLG 


1578 


15479 


A 


1586 


126 


413 


NPTLKK* KMKENRMKKNEQSLRnL*DTI 
KLTNRCILGIPKEERKKGASGIFGEIMV 
GNSS]!TLIR\E3!II]!ILNIEVaQ*TLSRI]!I* 
KRATLRHVIMKM 


1579 


15480 


A 


1587 


242 


409 


GWMIFRFNFFFFLRGSFTLVTQflGG*GG 
DFG/SLRPPPPQLKRFSCLTLPRSWDYR 

H 


1580 


15481 




1588 


2 


338 


EIEKKGKGKKRRG*RSNKKKRGGRLEGQ 
KY*SPPCPRLSFFLVDEKGPEVLQDSLG 
GWWKTP/SGCPEMTDSSQPY/YRAFYVL 
KN/QRVGFSVDVGEIEKDQDVEKNQDPS 
CPRL 


1581 


15482 


A 


1589 


360 


0 


NNFLEQLKFPPKKKITTDSYKAPRPFFF 
FPPSQKGVFPPTLFFGFPPGFPPPPFLN 
PPPGFF/CFWAPLKKFFFPYPGG*TWVS 
LKGPP/L/RFFFFFF*DGVSLCHPGWSS 
SAQS 


1582 


15483 


A 


1590 


75 


412 


VEGQHCNFCAAQETINRVKRQHTELLET 
FANWS PDKGL I PRT YKELJCHLNRKKHS Y 
*KWADDLDRHFSKETYTPPKTYSCTQYC 
*P/SITIEII*KLKTSNIKTRLQGGH£CRL 
ER 


1583 


15484 


A 


1591 


309 




FSTQGAHMQVCYMDTFHDAEIWTSIEPV 
TQIVNMLPNR*FFNPCPHFFPPPFC3Py 
LVS FLLLLRDRVS L CCPGQSRSVGFK* S 
TCLGLPKRWDYKCEPPHVA 


1584 


15485 


A 


1592 


2 


415 


LSISLSIFSFLP\FSFLLPSLSLSLFLS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, D=Leucine, M=lVIethionine, 
N^Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valin6, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














FFLFLSFFPPSFFFFFHRASLCHLGWSA 
MA*SWLTAVSTSQ\VKQSSHL 


1585 


15486 


A 


1593 


3 


396 


HSAFLFFL * DRVS LCHLSWKAVAQS QLT 
ATSTLL/VLKQSSHLNLP 


1586 


15487 


A 


1594 


300 


62 


KMTGVLKSSCGKS PKQVGWLCVCVCVF / 
SFKRQGIiAS/VAQTGMQWGDHSSLQPPA 
PGLR*SSCISLISSWDYRRVPRRAPG 


1587 


15488 


A 


1595 


1 


412 


fdrpaadqkaasalkasgvqaqmakgty 
hdwslqdwkvlwmth*vs\qqeqdptnl 
yisnlplcmdelqlenmlkpfgqvistr 
ilrdysgtfrgdgfarmestdkcdavie 
hynvllimtppgvsaptbpllckfae 


158S 


15489 


A 


1596 


428 


2 


QSESREPENFLLPTSTSSLLLVHCLSRT 
CWNSTLSCVPHFFHLTFNSHLLLTSQP 
RRVHFC*LSLSSI ISWKLLNTLPGVP IR 
\ASEI FGLRTIRNFPFLSHS VLFP FSLS 
ESHSVTQQGVQMHEOiGSLQPLPHGCQWL 
SCL 


1589 


15490 


A 


1597 


2 


442 


QGSL*PHPPMLK*SLTSAF\NYRHVSPH 
LMJYFIiFFF*KQGIjAVrjLGLAIJI*AQ/C 
HLPALASQSAGITGVSHGAWLFFFFLNQ 
CYLI*FLILIF\ERR1ISPVAHVLVNGGD 
LGLIiKPPPPGLMGFSCaaJIiLRSGKYKPP 
ATSPGYIFC 


1590 


15491 


A 


1598 


419 


121 


NLGYPRVSPPPPFUStPPPEFYKGPPKKK 
FYyPPPPPAQKIDPP*TPPPFFFFGTDG 
SHYVAQAGLELLASSDLPASALQSTGIT 
KHEPPHLAETI FLVFL 


1591 


15492 


A 


1599 


45 


397 


DRVSLCYPGWSAW\*S*LTJiASNSW\V 
K*SACFSLLSRGDYMCWTLHIANIKKNF 
LGRVRGSHL*SYHLKLCLKK*RTKEGFL 
SFFWKGKRHFLGQKNirJfPRLKVFFLPG 
PPKGLGY 


1592 


15493 


A 


1600 


3 


397 


SRRPGRFSLMLTLSWHS*VCEAALAARE 
EQWSGCFKSHF*LEVNFWVSGIDREV/C 
GY/LKTVLGEDIiNDYVSTQI*D*LMKPR 
CPEKQDESLLKEFGGGA*RLI!IVVHRPGA 
VAHACNPNALQAKGRRIPRSG 


1593 


15494 


A 


1601 


244 


2 


KTKTSFIISLIDSCGYLLSYSNFQI *KKM 

;fk/ylhlravprhvivkftnlerqekv 
lraarekg*vmhkgkpirltadlsa 


1594 


15495 


A 


1602 


435 


2 


pqeaiytptqhptygaicriarihgsrd 
q/svemkwhpqltitpsgp/lgksllpi 

PVALCFADLEVF I P KGGMLS PGVT 1 1 S L 
1IWKLRLPWVILQ*RITPLARVIPPDYQR 

E IRLLLMNERKKS YVWNTHDPLGHLWL 
PCPMVIK 


1595 


15496 


A 


1603 


288 


8 


EFPLIREMQIKPTMRYRLTPV*MD\R*W 
*GCGKRGTLVHCWW*CKLVQSVCERSSK 
nitielpcdpavlilgmhtkerklrpcg 
WTRGVGPGYC 


1596 


15497 


A 


1604 


411 


236 


LIPSEHFFFSG*F*AFDRSSLAPTPQLR 
RHRPPTGLLPLSLLQGP\LLTPSVLLS* 
GGHFKGPNFTPPGGQGKIFFMGPPKSNP 
GPGV 


1597 


15498 


A 


1605 


2 


437 


KCLRISPCAGPRRPWCPSFEPRVCVWPL 
GVPQVG/PEGP*GEG\GFEGGDVRQLWQ 
GKICKKKKNTLGKRGPWFPKEKRGFSLTP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

sequence 


Predict- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutainic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Pi-oline, 
Q=Glutamine, R=Ai-ginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=UnknoK'n, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














WFGPQGLWEPCLNPC3GPLFLIGGFQKGP 
KVLIGKGKTPGPKKKNNRPRKRGSLGPP 


1598 


15499 


A 


1606 


31 


467 


EFPGRRFRGGGCSELRLRVCTPAWTTEG 
DS I SHNI * LVNEN/HTVNSWKCSDS 


1599 


15500 


A 


1607 


418 


32 


VGQVGLELLTSSDP PASAYPK* WDYKR \ 
DPRAQPNPNILKAQSNRSPPRQR* PASS 
FSNQNSCMHRCISLP*VLFLIFFKNRP* 
NNSFGYNSSGWWSGLNIDSrSC* FWFLFG 
NSDTEIRICRSFHRGREV 


1600 


15501 


A 


1608 


379 


3 


FPILSPSYIIPSSTELP***SLQSPLFEP 
YDFFSVNMFSFLFFHLKSHTVAQAGVQW 
RNHGSLQPRSFQLK\HPPASASAGTTGM 
CHHAW/LIYLCVYLF]:iEMRSHCVSQD*V 
QWHNHSSLQPETPTK 


1601 


15502 


A 


1609 


456 


99 


PFPTPFPPPHSNWGLGLGCGL/ EPTMTP 
S GLGLKGS PS * S PPAHRLSGAQLCPVLR 
APVLGPSQMPPGRKKKPT2UIGS PWRKGV 
FVMSGQSDPLGPSS CQELGPRQSTQGTF 
PGA 


1602 


15503 


A 


1610 




420 


FRFSDGAAGQKCSS PPHPGRGRAEVLLT 
S QTGRQGRGAPHI SDNGQPGRDAP \ps * 
M*WRPGRGAP\PS*VGWRPGGDAPHF\Q 
TGQPGRGRPHIPDDGRPGRDAPHFPDGV 
GAGQRLQTRHFGRPTAAAWKVKWTSLR 


1603 


15504 


A 


1611 


426 


3 


OPFLLEAKVSFNPKWPPALPPGEQRDS 
VSQKKKKNLLIHKKftHSKH* FFICHE\C 
ESALLLHQ]!nHaGGKSyVCNK*GRGFRN 
KSHFTYQRTHSGKKftFL* KE CG * DFL* K 
AILTAYQKTHSGKKSFVCKECR*DFTQK 
TK 


1604 


15505 


A 


1612 


428 


273 


HHA*LIFKIFyCVETRVSLLCPGWS*TP 
ELK*SSLL6CPKCWDYRKEPPIRPTT 


1605 


15506 


A 


1613 


311 


4 


A5IKFKNLNEIKFPEAHNLPKFTQEGGLN 
NPVSLY*ia!I*TYSLRLFP\KKKKSGPDD 
FTGEFN*TSKEEVPSLHKLIiQKI * ERNT 
LPNLFHKARVTQVS KSEM 


1606 


15507 


A 


1614 


338 


88 


PNPPPPSKGK\GFPPPTPGRKKKTRPPP 
PPGKFLGF*KKRGPPPFERGGP\NPAPG 
GPPPPNPPKGGGTKGEPPSPEGGVFFFF 


1607 


15508 


A 


1615 


2 


162 


KHGGTCLYS LLLGRLRHESCLNLGGGGC 
RE/PE/'SHCIPAWATE*DSVSEKKKYL 


1608 


15509 


A 


1616 


3 


399 


PEVREYLTSRGLPPKVLLILDNAHGHS E 
PQRFNTEGINVFYLLPNPRSLIQTLDQG 
VTRTFKSHYTWYSTERIANAMEENPDRT 
S*KSRIMTPL/IDAIVMTEKAMEAIMPK 
T 1 1 SCWRKLCPDWHDFTRFT 


1609 


15510 


A 


1617 


390 


1 


KRNC FGPFLTPPP GQRLGGFKFLK \HFF 
YYRGERGGFFFSPPNKGFPPPPPFWGFF 
LGALKF*RGVPPSKPPPPGPGKKFKLKP 
FPPRKSYGGCFKCFLGGPFPFEGPPPQK 


1610 


15511 


A 


161S 


468 


0 


m:<lvniwllllvall*gkkhlgdrlekk 
sfekapcpgcshltlkv3fsstvaeyey 
ivafngyftakaensfisralkssevhn 
wriiprsnpssdypt*wp/valkkkkkk 
agv/lplenys I ITRV 


1611 


15512 




1619 


421 


2 


ssrlslpkcwdyrhepprpaqprilkki 
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ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 














S**EPFFLILKCI*LRTQGLGRGHKGKL 
NSTYVLHKFMD* *LGQLMNGLNMLDESS 
FCWVLKHTKQ\WRDYYFVCLFVCFVRQS 
L/NPVSQAGVQRRDHGSLRAPPPGFAPF 
S 


1612 


15513 


A 


1620 


406 


2 


GTKKAS SNGF INGP GAAKKKS QTFAFLA 
KPGAFPSILLDEKNRPQIKPPACFP/SI 
RESQTSYPVFFFWSPS CGFFFFWGKP E P 
LLLKEAGKVPLFLFLFGFL*DGVSLCQP 
GWNAVLRSQLTAESNPHASAHAS 


1613 


15514 


A 


1621 


411 


0 


PPPPKKKPSGPPPP/PSSSSSPSPPRKF 
LGGPRVFFPPPFFKPPPPKNFWGPPKKK 
KFPPPPGGKKFFF*RAPPP 


1614 


15515 


A 


1622 


2 


403 


TiiRCGLNFPCSSSLPFIAS*VAGTTGrH 
HHAQIiILLIFCGDELSLM\CPGWS /PNS 
SHLGLPKCWDYR 


1615 


15516 




T623 


"298 


"411 


LI INVCWPGLVaHA.CNPSTLGG"*GGRI f 
TMRSGVRDQP 


1616 


15517 


A 


1624 


263 


2 


DSVSKKKKNFKE*LIPVLLKLFC2NIEEE 
GILElSrSLYia«3V\LIPKPDKDT*RKEIY 
RPISLINIDAKIVSKILflKKIQ*FIKKl 
TDAW 


1617 


1551S 


A 


1625 


3 


281 


PFS CLS LPSCWDYRRPPPRPANFF/ VYF 
YKKNTRTQCFTVKHGFTVLTRLVLIS*P 
CBPPSLASQSAGITGVSHRTQPHTVFFL 
NNPALPKLQT 


1618 


15519 


A 


1626 


300 


20 


npgprgfpppgppkrij:frggrprpgf* 
yflketflgffwfqknflvfflgnktpfp 
pqfffff/rdrmslchpgwsevaqawlk 
aaltsqtpai 


1619 


15520 


A 


1627 


394 


40 


pqpaaaslfspglffffppvfspppffk 
tpprifffwpp*knfff\pppaffffff 

gwsavaqs *ltaaijsrsqt 


1620 


15521 


A 


1628 


386 


3 


I fptrcthlhlgtnklptcs ffdqakkn 

PFCS\HHSRGVGLRARLFCERLTIEGAG 
TPACPAP*FPGEPTRP/PGRVRWLrPAI 
PALWEAETGS*XVaRTGLELLVSSNPPL 
SASQSARITGVSHRTWP 


1621 


15522 


A 


1629 


401 


93 


ARGVLPLNPPPWGGRGGGS P*GKNSKPP 
RPRGENPPPFLKPQKLPPPGGGPPPPLF 
LGG *S KKTP PPPKGGAPINQKPPPGLPP 
PGKKGAPFPKKKKKKDKNIRTKKKARRS 
GSLLQSQHPGRPR/RGGPPLTKKPPLAS 
PPQEKRGPPFQKKKKKKTKI 


1622 


15523 


A 


1630 


417 


47 


pppgtisspnp/qknlkkgpgpggnprn 
ppplggqrg/rglwakksrppgpprgnp 
plfkkkkinggggpppwppprgaragk 
slypgggppq*pqmgppppppgak:kgpp 
pkkkkkkpkkprkt 


1623 


15524 


A 


1631 


416 


54 


EYWCKGR*IDQWNRM*FKMDQHLHSQFI 
FN*ATKAIQWGKESLFIIRLCLKIIWLTIR 
DK\IYIjDACIjTTY\*KINSSWHSGAFL* 
SQILERLRQEASLSPG1*VQPRQHSKTP 
SLKKFFFKFFS 


1624 


15525 


A 


1632 


2 


373 


LVFLDNMLKLLRHSALASACFPEDLAKI 
MDEGGYTKGQI FKVHGTAFCWKKMPSRT 
FWHEQS VPGFK/ATD *LLLGANAAGN\ 



635 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 
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amino acid 
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peptide 

sequence 


Predict- 

locafion 
correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A'=Alanine 
C=Cysteine, I>=Aspartic Acid, 
EXJIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KLKPMP lYHSENPRVFKNYVKP ILPALY 
TWWSKA/QMAAPLF 


1625 


15526 


A 


1633 


3 


383 


NIWN*KAWM'rVHL/FIT*FTEYFKSTVE 
NC*EKKKILFKI/LLLIDNTFGHPKALM 
EIYKEIWVIFMPANTISIL*PMDQGVIA 
TLNSYYLRNTFHKTIAFINYDSSDRCGQ 
SQLKNFWKGFSILDAIR 


1626 


15527 


A 


1634 


2 


182 


GACTQLIGRLRQENRLN/SGDRGCSEPI 
LHLCTPPWATE*DPVSKKKKKNPCPKLK 
KGPPP 


1627 


15528 


A 


1635 


331 


1 


LFPPPPVLKSGPGPN*N* PPLKGKSFGP 
LKKKFFFFFF\FRDWVLLYCLGWSQTSG 
LKRSSCWDYRCEPPHEiTPNF/SYF/CRD 
SVSL*PRLEGSGSI 1 ALCSLKLLDSSSP 
H 


1628 


15529 


A 


1636 


80 


381 


KLSKCHAEIHSRGFYKYSLFQLGASQFPQ 
VLRITHPHKGSWAAPRS*G*SQC/SHFF 
SFFFETKSHFWQAGGYGHNFT*LQPPP 
PGLKRFSHLSLPSSWDYS 


1629 


15530 


A 


1637 


381 


38 


SKRQGFHHVGQAGLELLSSSDLPTMAIJI 
PTCPLQKCWDYRC/DATaENLSSTFFMQ 
EKCLAFL\PLFSSAPHSL* * PKAVIEKD 
CPGL*IW**VAESQRMNVFNFIK 


1630 


15531 




1638 


3 


295 


PGPDGFTAEFNQTFKE*LIPILVKLF*K 
IQEVR\ FFKFS I TLTPKSNKD 


1631 


15532 


A 


1639 


48 


380 


ILGKAISFTIE*KGLEYIiGIYLTKEAKG 
LHTENYKMLLKELKKDTNIWKGILCQ/W 
TRRLHMVKISV 


1632 


15533 


A 


1640 


343 


23 


SWLTAT/FCFLGSSDSPflSAGVELLTSG 
DLPAliAS*TaGITGVSHRTRPALSINTS 
TLLCSSPYCPPHLQSLQGTTQPVPFLKA 
QRTGCLLQGVILTSQAENFCNKH 


1633 


15534 


A 


1641 


3 


397 


LELFSSAHCCPSLTVMQYYPPXRPTSHD 
CQREKSPHRTKKKKPLEGVFLG*KrKTN 
LENPPELPFFGGGPPPK1GGGPFKIV*GG 
WPWPWGLKIPTLPKGPTPRPPWGPLGTF 
GGEGKPLRAFPLQRFFRGPE 














1634 


15535 


A 


1642 


2 


308 


NKWR*GNWISVKIDNFNFYLMPYTNLS * 
IRDDLNAKATTIKLVGENIGENLGIGKW 
F* ERTLKAXLRGKKMDKLDF I TI GNFCF 
S KDR I KNKNKAERGGSRL 


1635 


15536 


A 


1643 


16 


386 


EKKKLSLFTENGIPYL*NPKESAKRLLS 
LINDFSICV*GYKMN\DEKSVAFLNTNK 


1636 


15537 


A 


1644 


533 


3 


PLSLSLSSLFLSFFL*DRACFVAQAGVQ 
WLDLGSLQPPPPGIK*FS/CGGNVAVTP 
RLS PLTLPAMTEVRLPSSKI QTNKEKNT 
VMSEIYQSLM*MGWG/M*VRMCVKFKKP 
WVDN* *VSVMGCLLPSFLPSFLSFLFLS 
PSPLSFFPYFFETESCFVAEAGAQWLDL 
G*LQPPPPAVP 


1637 


15538 


A 


1645 


341 


19 


GIGGRPP*FQLLGRLKQEK/HLERGKGF 
HEPKSRPCISAWATKGDCFQKKKKIHMT 
LQQIQPKQSWKTDISQFRTDYKTTIIKT 
VGHGIWIDIFINGIELSENLNQCS 


1638 


15539 


A 


1646 


279 


3 


TFYHNEKDNNKCWLTI \GKI *MLTGCYW 
ECKMVQLPWKTAWQFL/R/DVNI *LPYD 
LAITLLS lYTRKRKTYWTKTCIQMFLA 
VLFTTAKRWKQP 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

sequence 


Predict- 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M[=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1639 


15540 


A 


1647 


392 




QKLE IMLREEGMLKAKTGQKLGLLCQAV 
THLVNAKEKFLKIJ!rVLLQGTHKW*ESET 
AKSLILFNSMKAEKGE*AAEEKFRASRG 
WFM/RFKERNCLHHLKVQGEAASAWEA 
VANYIEDLVKKIDKGGCIN 


1640 


15541 


A 


1648 


425 


148 


SQSLI *SKALTLCNSMKT/ERAEEVAGK 
JCLEASNF^LKFKKRS CV\ENVNMQGKVA 
SLDGEAAASSPEDLVNFIDEGGYTKQQI 
FN*DKTSFFF 


1641 


15542 


A 


1649 


271 


462 


RRQKKKHKRC ILS PLLFD I VLE VPAKT I 
*QEKKIKGIQIGKKEV/KIISLFADDW 
LYL 


1642 


15543 


A 


1650 


70 


398 


RPEASLRHMCLNAGQLLSKRAKLGALSL 
SFFFWKKSLAFAPQPGGQGGNLG* * KPP 
LPGIJIGFSGLTLLRN/WE*RWPVPPPT\ 
NFGPLIKTGFPLVGQAGFDLRTLG&LR 


1643 


15544 


A 


1651 


425 


3 


FEG FPKVGFPLGPRFPSRVPPFGTLPPP 
PKRGPRCFPQPGAPPPKIWTPPGALPQG 

GVGPALPG/ALQKFGPKNPGGFFSGPPQ 

MAP \GGFPGGP * RPPRGGAPFFFFLRQG 
FPYAQAETOLPGSSDPPISAPQVAGTTD 
VC 


1644 


15545 


A 


1652 


385 


2 


KGNNPSPPEIKFFFFFF*KGLLPLPQGG 
GQWGYERSLQPPPSRLKLFSCPNLPSNW 
EYRGP * \RLL* LI T6RGTSGS KTKVPTP 
CGPFIS1.NQLGLGTQARNFS/RLVFFFFE 
TESPFVAQAGIQLRDLHS 


\645 


15546 


A 


1653 


242 


3 


KNKNFGINRGFFFTFKVPGFFFFLGKVK 
LFFFFRNLKFFLKAKPP*\VFFPIGPSF 
PFFFFFLDRVLLCCPGWNVWQSr. 


1646 


. 15547 


A 


1654 


3 


285 


HFI lYTKDLHRRFSKEDI* IIIKHV/KK 
CSPSLAVREMQIKTTVHPGTVAGTCNPII 
TLGGQGRRIVQDQLKQQSKTSSLQKKIL 
FRIi2UlHGGTCP 


1647 


15548 


A 


1635 


1 


373 


KVSLFFFFEQGLLCCPGWSAWSSLQPQ 
CPRVKQFSHVS/LPSN*EYSCTPOT'PSL 
QVCVS IHKY I * YI YI YI FKFP/CR/DRT 
LARLHRLVSNAWSQAILPPWPPKVLGLQ 


1648 


15549 


A 


1656 


189 


2 


VQPGQQERNSISKTKNKTKQKLPTKKSP 
GPD*FKEELIPrLHKL/F/HKrEDKGTL 
HWSFYWTI 


1649 


13550 


A 


1657 


385 


13 


G6PPPWGARSPPKL*NPPPPRKKPPGPP 
PPPGGAPPLGGFFLFFPPPGGPPPGGGK 
/SFSPRFFFFFFFLGGGGQIFLFSPGGG 
PRGVFSPPAPPLVPKPAFFSKKKKRGSS 
GGEKPEADGYFIK 


1650 


15551 


A 


1658 


352 


2 


HLSLLSNWDSIRAPP/RPCVIFKNVPLN 
1FFL*RGVTMLPRLVLNDJJP1SASQVAR 
IIDVSHWAKLRRSV/CYVFETGSGSLSQ 
AGVQRYNHGSVQPQPSRVS*SSHHSLWK 


1651 


15552 


A 


1659 


265 


3 


HSGQRDEGRMRCGEWLESHGWRARSCM 
TLKTSFLFTMAKI*NQLKCLFMDEWIKK 
MWHIHTMEYYSAIKR/DEIPSFVATWME 
LEVIM 


1652 


15553 


A 


1660 


1 


163 


NQQNENRKT I EKI IGTKT* LFKKI KKFD 
KPLARWT/RGKKNIQITNIRN*RRDVI 


1653 


15554 


A 


1661 


56 


320 


KFFMYSAGESTKIRCLF/SCLFLFLR/Q 
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peptide 
sequence 
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USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 

otide 

location 

correspon 

last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-^Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histi(iine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamlne, R=Arginine, S=Serine, 
T=Threonine, V=V aline, 
W=Tryptophan, Y=Tyrasine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ESRSVSQDGG*SEPRSCYCTPAWATE*D 
SVSKKQNKNKPKKMIiNKCFASSLLITLP 

PARHKA 


1654 


15555 


A 


1662 


3 


421 


giitdtfpniiekginiqvkkv/rrppsr 
fnpkkttsrdliiklpkikdkgs*kqke 
ktsksarlpqphgllglglgksass p i k 
kerkqXqitysgapiflvtdfsvetlqv 
rrewhdvfkvlkek/dfyprivylvkis 

F 


1655 


15556 


A 


1663 


362 


2 


VIFADEAQI LKKEDKLDFIKMKTSVHQK 
TLSTEYKDNHREKIFVSQISDKELISRV 
YKGLLKLNWTDKNL1LFYF*-RLGHSVTQ 
GRVQWCNHSSLHPQTPGLK\NPPTSAS* 
AAGNTGVHL 


1656 


15557 


A 


1664 


79 


355 


IHLPSLIGDFNLFTGISLLICLVLFVFI 
FETGSCSVAQ/S/GVQWHNHGLLQPRPS 
GLRQSSHLS PLSSWMHEHGPG * F I YF/ C 
VEIRAHYHP 


1657 


15558 


A 


1665 


47 


384 


KEKASGPLINFFFFPKLCKLAPFFFLPF 
FLWGGGGGKFSRNPQKHFPR* KRVFVNF 
FFFFFFLRHCIiTVSLSHCHPGWSAVA* S 
QLTAGSNFW\VKQSSHLS/LPSSWDHRH 
APP 


1658 15559 


A 


1666 


163 


601 


IFCfCfieVLPCCPGLADLHfSTSNSlSFr" 
YSSGLLRMTNKTETPMSTI PKGVGVAWR 
FGNSECIFQELPLTLHHLLSTMLASFIH 

SHEASANATjVGRSLTVGWGCRGVGVSVD 

PAAWLWRDLKGCF*DKSHSVTQSGGQWC 
NLSSLQP-'APRLKRSCILSLPRSWDHWH 
VPPFLANF* IFCKGGVLP\FAQGW 


1659 


15560 


A 


1667 


418 


3 


SVCLGLPKCWDTGRKPLCPAPSFFY/fiG 
SITLIPKSEMHLPRNENYRSGFLLN/M/ 
nAKIIJKrRILANClSN*I*I)IH*\*KVKFT 
PGKKDWFNNRKPTDI IYPH*QNREEKSL 
VSSTDMAKVFNKIQPVLRELTIEIKGMF 
LNL 


1660 


15561 


A 


1668 


411 


1 


LRLHVGRTTT\LFKAVSQGHLSLQRFLL 
PSVE ICPAPRGGVYRGRQASLSCGGLHR 
VRASQLLWFLTQASAMAGPPPPVILPPC 
SLI*DCCaNNKHGFIGVGPFEPCVGYNL 
LVSHLLRPSEKPSIRVGVT*FSRC 


1661 


15562 


A 


1669 


151 


1 


PLEKEAEITGP*PHAWLIF/CFF*TESR 
YIAQ/aJMKWHNLGSLQPTPPGFK 


1662 


15563 


A 


1670 


1 


389 


TFF*KLKMIlfcSEEeMLKjUCISQKLdtC 
\TVSQWNSKEKFLKEIKSATPEST*MI 
RK*NSSVA11TENV*ERSRTSHNILFS«S 
SlQNKDLTFEHSLKaWRGQEVAEEKSEA 
SSGWEMRFKERSHLHTIK 


1663 


15564 


A 


1671 


1 


363 


ECTGPKIAKI ILEKKMNVGGLPI/PWFK 
I*YKAPVI*FWLKVPVIPSSSAILMKTV 

*Y\YFKDRNQDEWYRLRVWKINSHIYGQ 
LI\FSKGTKTIQW*KSLFNK*CWNNWLF 
TCKRMKLD 


1664 


15565 


A 


1672 


203 


2 


ALNKRTDMPS**IRRCNII/KCLFSPKM 
N*VFNVIT/IQCPSGHFFTETDKSILKF 
IWKSK*PRLAKRTL 


1665 


15566 


A 


1673 


15 


378 


NYHHNQNNEHMYHSLPNFFFFFFGKGAP 
PG\PKGGGRG*REP*IPGGKGNPPL*PP 
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peptide 

sequence 


Predict- 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-TIireonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KEGGPPGGTPPPGKPGKIWDFWEKGGFP 
LSPGGAGTPGPKGDGPPWPPKGGGKPGG 
TKQPGPKKFF 


1666 


15567 


A 


1674 


2 


349 


lifltviarlvsns* pq/cdppalasqsa 
eitgmshraqprgir* i *cgsyttchpl 
llltVylfiyfsteshsviqagvqwhdl 
gslqsqlrrlt/ciltlpsswd*piivtl 

PG*FCLF 


1667 


15568 


A 


1675 


348 


78 


LGPWVPPVIPAPLGG*GGRSPRPGFLTN 
PGPKGKPRVFLKIQNLPGV/RGRPPLFP 
/GSPGSGGPKSP*PLGPSFPLT*NFSPP 
FWGPPGGPK 


1668 


15569 


A 


1676 


2 


385 


TSRRDYRP \GHHNQL I F* / S FCRDGASL 
CCPCWSQTPGIKKSSCLVFPRCWDYRCE 
PGL*I*MGKNPT\LFSNGL*CDCIPLIH 
SiaDIRKKPHS*LQGL*LCHQQNSQTES 
CSVTQflGVQ* CDiGSLQP 


1669 


15570 


A 


1677 


386 


1 


KSTRPVLYKWNMKA*MTEHLFTA* FTEY 
FKATIETFCSEK*IPLKILLLICNVPSH 

QGVISTFKSYYLRNTFDfCAIAT/DSDSS 
DGSGKNLLKTFWKGFTI 


1670 


15571 




1678 


2 


193 


EGGRIFFNSFFEVILTLI/PKPKKWER 
K*SyHPISIMWGDVKILAQTLLNQIQQY 
LKRIIHYDS 


1671 


15572 


A 


1679 


561 


830 


TLLLGT\NAWDFKLKPMLTY\HS \ENS 

HHLVYRHaFT3IF*aSQLKTYCSENlPF 
KlLLFirW 


1672 


15573 


A 


1680 


415 


2 


tscaWlslypvYlyrsssLprfifchfk 
hlwyknitkkaaeintnfccyinrvllc 
hlgws /atwss *l tvtskllgsrdpht 
lslpss*egrc1tprlgdllnlfsrdgg 
girgtstlprqvlnawpqaillvrit 


1673 


13574 


A 


1681 


1 


78 


RPRIRHEVGQAGLKLLTSGQTPASVP*C 
WDYRPEPPCPALHI SYK*NHAMCGLKCL 
AVSA*RHVLGFIR6\WHVECCFPFCS*A 
GLKLLTSGQTPASVP 


1674 


15575 


A 


1682 


414 


162 


GGPGGPI PGA/AGLRPPPPPLGNPPPP* 
KAKICPGGGaPPVFPGS*KGGGESP*PP 
RGRGPFIGVQPLPFGLGHKRGLFPKKKK 
KI 


1675 


13576 


A 


1683 


378 


129 


QPFGPRNFYQIFNFPG/PPV*HSPLFGP 
KFPPFSPGGQQWGPLGNPRPPGAKGGST 
LRGPRTGGSRGGPPSPGKFFFFLKQSLA 
L 


1676 


15577 


A 


1684 


3 


374 


GISVLPGIGAPGNKPELFEEVKLYNNAR 
EREKYDNMAiiLF/AWKTMQALEKAYIK 
DCVS P S E YTAACS RLL VQ YKAVFRRVQG 
SEISSIDEFCRKFRLDCPLAMERIKEDR 
PITIKDD*GNSLS 


1677 


13578 


A 


1685 


2 


373 


PFIRPETIKLLEEIPGGKLLDLGLGNDL 
LARTPNAKINTWDHIKLKSFCTIKETIN 
TMKKQPTDCEKI FSRLISDKGLM/SQMC 
KELVQLN*KK/TDDSVKGWAEDLDRHFS 
KKD IKMANRPGKVL 


1678 


15579 


A 


1686 


378 


201 


HATCLANF\CSYG*DRVSPFCPGWS*TP 
ELKRSTRLGLPKCWDYRCLGRSLLFPGA 
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SEQID 
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nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
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Predicted 
beginning 
nucleotide 
location 
correspond 
ing to iirst 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaiiiic Acid, F=Phenylalanine, 
G=Glyclne, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Uni{nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPDH 


1679 


15580 


A 


1687 


1 


404 


MCVNPGGGACSEPRSCHCTPAWVTERDS 
V* KTNKKEWKRRKKI FHaWS / KLKRVGM 
AVLISDKI YYKLKKVERDKE YI IKGSTY 
QEDISITNTYTIilSrT*APKYMKDTLTELK 
EE ID I YAI I VGDFNTLLLI I CRGT 


1680 


13581 


A 


1688 


307 


33 


DEGSCHDAHAGLKLLDSSGLPASASQSA 
GILGVSHGARPLISSYNGTSHGGLGPAL 
VTSFITLSHLFKDLLSLQI/HVTF*GMGL 
GFAGAKLSL 


1681 


15582 


A 


1689 


11 


394 


IFILEARTRISSRTFFKS *INIHaPYNS 
AI * LI GI FPREKK/ STCPYI YTQMFIAS 
LFVTAQTRKQPKCPSTGE\WSKNLWN 


1682 


15583 


A 


1690 


20 


391 


SEGKGWYSCTKWQSKKIjGGITFFFFFPP 
PKNPPPKKSGP/QKGPFF*GKGPPWPPK 
KRGHKK\RGFPKQRARPPKPVFFLIFGK 
RGFLLGPKGG*NPGEKRNPPPWP*KGGK 
NPGNPKGGPHLTL 


1683 


15584 


A 


1691 


72 


392 


IKMIGSLFFGFAFFFFFGKKTPFFPQPK 
R\GGKP*IT*TPPPGN*ENSGPSPPQKV 
GIKaPPPr,PK\WF*FFGKKGVTPFPPGG 
FEPPTPKEPSPPVSPKGGKTNPM 


1684 


15385 


A 


1692 


389 


161 


HGGACLRSQLLGRLRREDCLNQGG*GCS 
EPCTPAWVTE*D/SSQKNQKSKIKKSGL 
DNSFSIG*GILGLSTCI)RYS 


1685 


15586 


A 


1693 


286 


363 


dgisl*prppglrqpsqps\lls1siwgcr 
stplclagffvfvetgflhvaqacuiqg 
ftmvaqs/asqkihit*gafetiqvlfy 
wggvgfgqwrfffffeteshsvtqagvr 
gcslsslqvpppg 


1686 


15587 


A 


1694 


1 


356 


elleprgrgcserrpchcppvrvaeqds 
vskkkrerkylyflrfnwrslrifiyfc 
*/hplqhniqnisftlqnsfgffsrqyc 
psplei ifltpltenllglfmkgi iqnw 

FFGVGLF 


1687 


15588 


A 


1695 


3 


298 


KYFET1SIENKNIQYQ1JI.VWKLVFRENLT 
WNACVKKEERFQVNN1ALYPKN*EKSM 
LNPKGKI IKVRSEKNDIE*KNDEENQ*N 

*SWYFEKITYWQTLD 


1688 




A 


1696 


3 


405 


RLWCGWRNRHLGS*NRVENPETGLHRYA 
QLIFLTKVQKQVGEGQPFNK* CGGTWAP 
TGKT/MEQPPKflSSSSSSSSSSSSSSSC 
KM*NIVFFKMGENLWDH*AKSYEVRTKA 
♦TIKGKVDKLDFIKIKHFCYGKN 


1689 


15590 


A 


1697 


6 


392 


LQRTLLVGLFNAAGNLKLKPMIiI CHSEN 
PRALK]SIYAKSTLPVFYKWIKIAWMTV*T 
VAAWLTEYFKPIFENYCS/EKKIPFKLF 
LLTNNAPGHPTGLMEMYKTTNVGSLPAN 
TTSVLQPMGQRVISTFKS 


1690 


15591 


A 


1698 


390 


3 


AI IE SD FLTT SREVAKKLRVHP FMVLWV 

LEP igkvkkvwkwgp*klnk/nkknhci 

EVSSSLFLCNWEPFLDGIVTCDEKWILY 
HNW* *SAQWLNREVAPK\HFLQPNLHQK 
KVTVIVWWSAAGLIQNEH 


1691 


15592 


A 


1699 




245 


GGGGEYSKI lAIKTALKNTTYLGIYLIK 
\DVPDLYTKNYGTMLREIKYLEK*RAEP 
CS*TERFKIVKMSI/LPNLIYRFNTI 


1692 


15593 


A 


1700 


2 


324 


GTSGTSGTSGTSQTCRISRVRSTSSWTS 
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sequence 
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nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 

otide 

location 

correspon 

Smino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G)ycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknoiivn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GRHLQCHVSKAESSWPVLSASSCSTGSP 
ISLSGS/GSSLSNSSKNLTLPCLFSSLL 
S * P *AWPVDSAFKI YPGLSSVGS CL 


1693 


15594 


A 


1701 


183 


372 


PGVLLCWPGWSAVTVHQCDHSALQPRTP 
RLK*SP\STWDYRYTPLCPA 


1694 


15595 


A 


1702 


29 


382 


gkrfp cswe pkfpltqggfpp lhpgq kk 
kpgskkkkdkskcw*g/crea/gtlthy 
rqelemvqsfwk/tvmqvlrklnvelpy 
dpafllglhptstqkr\dtmfma/aitl 
/iserwkqprcts 


1695 


15596 


A 


1703 


1 


382 


KKVKIIGEAAVEFPDTIKKIIEEKEYLP 

l*vy\nadesglfwkkklqrtfiskeek 
\smdrltli ilckcswvyeqdgp *alke 
kgehqlpvf*l*nkkawtvrtlf]:jD*fh 
qcfvpevrkyvaskrl 


1696 


15597 


A 


1704 


2 


330 


KLNNLLLNNS*VNTEIKAEVSSSIjLEIN 
EYEDTTYQNLWDAftKAVLKGKHVAPHHF 
LQEVKK/RLKRFQINNLTLYLKEL/EKE 
HINLKaJSGRK*MTKIGDIiFGLYFVI>NG 


1697 


15598 


A 


1705 


100 


342 


APKWS IVCPELVGSWSH* /PSRMKPWTL 
TRQGFTMLARIVLNS «RRDLPAIiftSQSP 
GITGMSHRTQPLLINLMEIFTEILS 


1698 


15599 


A 


1706 


600 


211 


SCSVaRLEFSDVIKAHCHL/RTPGLKQS 
SHLSHLSS\WD*GRVPHDLANF* IFCRD 
RVLPRLLQAGLEL\IiA3SDPPS*A3EKC 
WNYRHEPTVPRQNLGLLKTTYGWVFLKK 
Y I LTVSVFSLMCLDHCLIW 


1699 


15600 


A 


1707 


409 


i 


RGPFFFSPGKAFLKLGSNSFFPIJCKTEA 
PKKPIFSPVSP/LNFPPKTGFPPVFPPQ 
MGGFFF/CFPSFLFFFLPPPPPFFPSFP 
PPFFFPFFFFFFFPI.FFF*DRVSLCHPG 
WSAVALSQLTAM.TSMDSSNSPTCT 


1700 


15601 


A 


1708 


154 


2 


IGKPLAGLTK/RKRENT*INKIRNEKGD 
MTAnNTEIQSIIRDFVS*RTaHQ 


1701 


15602 


A 


1709 


263 


37 


SAQLHPLNIQNHRQSILLHDFFLKKQD/ 
G/WPGAGAHACNPSTLGGQGGMITRSGD 
QDHPG*HISV 


1702 


15603 


A 


1710 


390 


42 


YAGGFRAIFFFPLPREGAKNPNFPVGPP 
PFGGPFFFFFAPSQKKEPGFFGKKGVF* 
GGAKGFPNARGPFFF/G* KKKKGPKNKT 
PGFFL/MGPPNPGGPPPRGEGGKIGAKK 
KKKSIRL 


1703 


15604 


A 


1711 


3 


167 


YTCVFVCLCLCDCMC / CVCACMYI CVCV 
CTRVCM*VCVCMCVRVCVQALTVLCKSV 


1704 


15605 


A 


1712 


116 


391 


KRNFFFGPQGGGEGPKFN*RGPPPPGVK 
GIFPPSPPEG/GKKKGAPPPPGIIFWFF 
KKKGVPPCGPGGV*TPDPGGPPPPGPPK 
GGAPRQGPL 


1705 


15606 


A 


1713 


401 


47 


HHYATKPFTHAHTCTCIQDTCNTCMQHT 
QVHT/HTDTHTHTRKVSSVCVLIMAEQE 
RPCPHC*GGEAGAECGVCAWLGLSS*TW 
RKRKGPAHTVRWGQEQSAGCflHGLVFP 
RSYLWT 


1706 


15607 


A 


1714 


1 


400 


CVESCEVDIEMVS CCV/ CS G* S AVCS GT 
ASAHCSLP I PGSRDS PASACQVAGTTGM 
PH/LYPGVPLKPREGLQFTELPSGQLE I 
QPTCENKR*HVPCAI.*VQLTDIRPN*RY 
QFRVAAVNVHGTRRFTAPSKHFCS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W==Tryptophan, Y=Tyrosine, 
X=Unl{nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1707 


15608 


A 


1715 


413 


3 


lntfepqhspfivnssemii.lveclfvt 
sgwiyherfusrpgreidwatcysetgpc 

pvtqdgvqwsnhgsl*pqtprlk\hptt 

SVFPVA/RQGL/lTSVAQflGVQWHDLGSL 
*CRLPSLKGSSHPRLPSSWNYRYAPPPC 
I 


1708 


15609 


A 


1716 


421 


144 


RLE CS GG I TAYCS LNLPGP S S S PASAS * 
VAETTGLLHWKKKIVETGSHFIVQTAFK 
FLDSSDLPALAC/SWDYRSELLCPACFy 
NFCLFINIPC 


1709 


15610 


A 


1717 


3 


384 


YSACVCVCVWVDVSVHRCMSISGH/ARI 
IRVGPAHKAQSSCRPVPAGCCTSAQTPP 
WCSAATADPPPR*GESLPGAYPGSHSTC 
CPGCCLDW*HSLPSTEPAACRAGAPGGG 
H*AWLGCGGRAGGRPG 


1710 


15611 


A 


1719 


3 


615 


PVGSWARSSGAGWPPGSPQSVSDGiEAGH 
SIPAPRGQCSRHRAEGRTAEVCLSHCSF 
SGPRPGLVPIR*SLGRPDAVQAIVPDSQ 
EGRKTGIHAEAVMFPLGPGKGVKGCaGG 
RLCPSSPPSHCLDLGRRGMPPSSGPAGP 
RPSGVGS0L6RPGA6AATSSSSSSSSSS 
SSSSSSSSSSRDGEGP\TGSVEAPGSLL 
GPWLPSQPS 


1711 


15612 


A 


1720 


320 


3 


GLKWUffLKDHSLYLILAKWDDCLDEK* 
YADKMLQYTKSVroVSRSRGCQGQVGEKK 
QATEAAQEWEIiRK\RI,3fMEKECDALRAA 
REEQKELRDVRKAKKVYVCVEV 


1712 


15613 


A 


1721 


44 


373 


KAMGQTLWKTVWQFLTKEE IGPM*QSCP 
T/DLALVFLGICTIDLKAYIHTETCTQM 
I IITLLI IJ^RKKASCSSVOE/WNKKL 
YYIRTMESYSSLR*]SrEI.SSYKKHWGGGS 


1713 


13614 


A 


1722 


135 


396 


AQGLFCTSVKLASEQPLRILFQLDKRWK 
\FEIYGTSG*L*S II ICQNNLQSKFQMY 
HHKIMSSLGAVRHACNPSTLGGQGGQIT 
RSGD 


1714 


15615 


A 


1723 




383 


LNLRAKAIKLLEENIGIHLPDLGLDDLF 
LDITPKSQATK/AKIGN*GFIKLKHFCa 
AKMI IKKMKRQYKEWKKIFGISHVSDKKI, 
VFRIYK*HLPLIIKNSSS 


1715 


15616 


A 


1724 


2 


405 


ijsrtslilnqniie/ iklsebcgtwkaktg 
* klgllqqkXiskianakekllkevksa 
tsmnm*mmrk*knfipemqkvlww/i* 

ni/plcqsliqnkaltlfnsikaergee 
a\kleatkrwfmrfkesclhwvkvqdeg 


1716 


15617 


A 


1725 


90 


400 


sqllrrlkqnslnpresssktttrtkvs 
hwhkisrrhvnq/ynkiensginlhiyg*l 
tlnkgdeasqyss /dilenkwcwqkkkk 
\yldpyltpctkisstwtisgfl 


1717 


15618 


A 


1726 


390 


1 


TFFPKI \EKKIL* FIWGPRRPKIANVFP 

*qnkpkiegiplpgfkiyyralvtktaw 
f*hknppigqrnkvensetnfhppseln 
ffffXykgaknihwgedslfnkwcwenw 
i si*rrmklgpni,tpytk 


1718 


15619 


A 


1727 


3 


365 


HASAKSNLRWIIKLNLRAKTIKLLEENM 

genlwdlelsrgfldrt/pklysiklkl 

RK/WNFI KI KNFCFSRDT*KS *MAGKNI 
LSQKDLYSEYLKK*CMLI1RQSKTST*M 
FIAALFIIAKH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PheDylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Ij=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


1719 


15620 


A 


1728 


368 


2 


RNCLNSEGGGCSEPRSRHCTPAWATEPD 
SVS KMK*NKKINR* IMFLY\EQQSETEY 
FKLPLTIjA/SRWMKFLGIWLSKGVQDLE 
TENYKILLQEIEEDLNK*RNKTY*WIKL 
NTVKMKSILSKR 


1720 


15621 


A 


1729 


326 


30 


NPFGGPKKGGSQGREIKPPLPPMGKPLF 
F*KTKNKWVGGAPPVIPPS/SGVLSQKK 
TFTLEGGGPNKLNSPPALP/ARGPKKNF 
FQKKKKKTKQNPSQREE 


1721 


15622 


A 


1730 


1 


374 


iFJSADKlA\FWKKF*KPQGTSVGRj!lEKQ 
APGFKAGRNRLTTI/L/GASAVGFMIEA 
ALICKAANPQVLKGRDKHQLPVFRLLYN 
KKAWTTRTLFLDCFHQCFVPEVRKYLAS 
KQLVFKVLLILDNaPC 


1722 


15623 


A 


1731 


389 


1 


F?PKIFFFSTLFFFFPRFFPPPPFL*P/ 
SPPIYFF/CaPiOCKNFFPPPPGKNFFFF 
KTPPPFFFFFFF*D*VCLCCPGWSAVAQ 
SWLTTTSVFRVPVI 


1723 


15624 


A 


1732 


118 


422 


DITTHLFKWLKFKK*EKGLlSrILFTKEDM 
QMEKHIiKECLT* FWKELQIKMRYHYPP 
IQiyHKI *K15r/STI SIAWOGYRTIGTIiF/ 
HC* *EQPFW*FIiSKIiNMILPYNPA\lML 
LSIYPNALKKHVHTKTCM*MFIA?y:,FII 
TKNWK2PHCPSICKW 


1724 


15625 




1733 


407 


1 


NIKGPFRGPLIQWGIiLIWPKDSFPILGY 
PPFPSPKISPPFLARSCSGAPNHFPLPN 
QSPCFPQPSFFLGEEECKEFIiPGYSLaFP 
FN/RFGGS*RVRKGNQGPIPMGESFLFF 
F*DRVLLCCPGWSAVA*SRLTATC 


1725 


15626 


A 


1734 


322 


362 


TAIYIYII*LFTDNIPSHPRTWIEIYKE 
INIFVPANTIPIVQHRNQGVICTFRKTI 
TVTDCDPANGSGQSKLKTWKGFTILDAV 
KN/IRDSWEKVKIG/TIN/GVRKLIPSL 
KNDFKRPKT 


1726 


15627 


A 


1735 


49 


395 


RGGPGFFFFFFFFSKKSQILPPGWKGRG 
EPRVNGTPPPRGKGNPPAQPPQEGGKTG 
PPHKPG*FLCFLKKKGVQKQG*GGPQ/A 
PGPKGAPRPGPPKGGEKREGPPGPTRPN 
LYYAH 


1727 


15628 


A 


1736 


417 


2 


flffffflffffspasgpeilftcl*ht 
hihflffyskst* ♦ppvfaggmfqdpqw 
lpetkmvpwfkkkrttlt/yip* *KLCE 
cdlsnffcff*drlllcrpgwsava*sr 
ltatstfqaqanrtrg 


1728 


15629 


A 


1737 


316 


338 


FFFLSFYFETESYLHHP*GFIVKLSKVK 
DIEN\LKKTARGNYQVTYKGAS I RLAAD 
FSAEISQAWREWDNMFKVLKEKTNWQPR 
lYKTLFVPHF 


1729 


15630 


A 


1738 


197 


379 


QKRAQIDKAIFICRDIAL/ P* FQ/MYYW 
ATVTKTAW*WYKNRHIEQWNRSW\PPEI 
KSQSYSHL 


1730 


15631 


A 


1739 


4 


401 


RG YRHAP PS LANF C I FSKD/MGFTWLN 
S *PQ/CNLPASTS *SAGITGISHCTRPQ 
MATFLIGPHKI IPWSVLWPNL 


1731 


15632 


A 


1740 


94 


117 


KDRPMVPPVGAGEDQADEPCRGHASLWS 
QLVSAPTTPI PLPGRDVPSRATP FPAAL 
AAQQPP * AS PYPLPPGLGAGHASAS PVT 
VPFSPISESTGS*ESAL/PAPRPGGSG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

correspond 
ingto first 
amino acid 
residue of 


Predict- 

correspon 
ding to 
last amino 

peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, 0=Aspartic Acid, 
E=GlutamicAcid,F=PIienylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, l^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tr>ptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1732 


15633 


A 


1741 


3 


390 


VDQFLISHDLSKLTEDEVHNLNSSTTIG 
\EVEFKVKKL\*KKKSSGPDGITGKFY* 
TVREEVTPIL/SYLF/HEIEKEETLVnS 
FYEARI IL/IPKPDKNKP*TNIDAKTTS 
KVLPNRIWQYVKIIIQHN*VGFV 


1733 


15634 


"A 


1742 


3 


442 


DLHSRVEPRVRPSVRKQ*WLLKVLICA 
SKDTLKRAKRQPIGWEKIPVNHMPDKDL 
IPEYKINMQKSGVFLFTMNSYSSTTDHD 
IJMKWAKDLSRHFSEEDIQMANKHM\KR* 
SVSLVIREIKIKTTGR*LFTPKCMQG*G 
KIITLVH 


1734 


15635 


A 


1743 


411 


2 


LPPFKHPPPK I I LGAPKKKI TLPPPRPK 
KCISLKGPPFFFFCRYRVLLCCPGWFST 
PGLKQSSHLGLPKWWDYEHEPYCTQSSF 
SLSPFLKQTGR*WFDLSNFFFY/CYCFI 
YFSRDRGLTLLPELVLNSWPQAILL 


1735 


15636 


A 


1744 


1 


393 


RPGGPPKGRSREQGREGERSHHRRPRAP 
WflESHMWGAEVFSVPRSFDQRPQEKCVQ 
SSYKQQLPAEARDGTGNLIRGAPLFFF* 
DGVLLCRPGWSAGFKQFSRLSLPSR*DY 
RRTPPHPANF *ML/ CIiRESLT 


1736 


13637 


A 


1745 


395 


0 


PSAPS FFSTRL* LGEPPGFPPPPFLKPP 
PRN/SIFGAPKKKFFLPPPRGKKFVSLK 
GPPLFFF* DGVLLCCPGWSAGFKQFSHL 
SLPSS*DyRRTPPHPDKF*MF/CLRRSL 
T 


1737 


15638 


A 


1746 


397 


1 


CGNFLKREKNFEARRFLQK*AARFRNIR 
*VTPEITAFCPL*HVASFLVFLTPNFPT 
IPQLYCLEPLGEMGGSGSKLPPFPTSKT 
PNPLISV]SILCFP\AIKWENFFF*DGVLL 
LHPGWSAVAQSRPTATSTS 


1738 


15639 


A 


1747 


392 


2 


FTKKGGRGGNLS P/ LPPQKI F* KKKTLK 
KPFFWQRVWFFS PLFFEQKBQGPFFFKK 
PFF*KRPPDTPQPPPPSFIFFFIiLFFFR 
/HLVAQAGMQWRHLGSPQPPPPGLTQSS 
QLELPHTPPHPDNFCIFGRDR 


1739 


15640 


A 


1749 


33 


403 


IKGFKKKHGKGGAPQKKGGTGAKPPPPP 
PQGF* K*QKRKNGS PPNVFFPNPGGPPP 
PPPFWGDKRGGPPRGGGAPPPRGKRENP 
FFPTLA/HGKKEKKK 


1740 


15641 


A 


1750 


3 


396 


KRQTTNWEKVFA* KN\ADQIiISLIRRKC 
LKVKK/W*R/DMNTQFTDKQILMT1IKHM 
KRSSTSL IMEMQHKAGVI FHPSDWQKC* 
STDNTQS*QGHEEMYTIKHSW*TFDyQQ 
PF/SESNL 


1741 


15642 


A 


1752 


30 


419 


NEGIGAGHEVSFAANKCESMRKNVEVLA 
UqERLDGELLSGIiTSTQALPGWAYLHLL 
SHHAVRPLFLCPKRGWVIiCHPGWSaVa 
QSQ\FELLGQVILRPHLPSS*DYRSIPP 
CLANFKNFFRERARYACR 


1742 


15643 


A 


1753 


16 


410 


VGPKKSiiQQVWAAVQATLPLES YDLAHP 
IILIWSLADRDAI *NLWQIPIVAS* YIP 
LGF/YSKAMPSSVDIYSSFEKKKTHFFL 
TGGGGPPLYSNYLGRLGGANHLTPGVKN 
QPGQFGKPPPLQKVQTLAGRG 


1743 


15644 


A 


1754 


2 


17 


WSSLIKHRRIHTGERPYQCSECGRVFNQ 
NSHLI QHQKVHTR*EMYI *SR/CGKDFT 
QKSTLI*H 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 

correspond 

amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
corrcspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1744 


15645 


A 


1755 


138 


380 


XPXXXSPXPPPCPXPLSSLFLTKCSAY* 
TPAHRPPPPGPFVPPPKPCCPPPPPLRP 
WPPLPLLTNPIPPPTLSFHPGPALS 


1745 


15646 


A 


1756 


194 


3 


WLCIPIEHTTEQQPGFPF/LIIWSQPYS 
PLFCLFVF*ETVSLCGPGWNAGVQSGLT 
AASTSQAPSL 


1746 


13647 


A 


1757 




403 


RVLKSPTLAYTYLLFYLGATILMGVNIW 
KQPKCPTK/GQQKIKLQYIYRMEYYSAL 
KKK* ILLFAIR*VisrLGDrMLSEVSQ\SR 
KKMIVLSHMW 


1747 


. 15648 


A 


1758 


398 


55 


fffffffffp*tkglgcihrcdhgspqp 
rihXgssnapslas *vggttgafhhrrf 
tliqsssvhvsthtlhpypsslp 


1748 


15649 


A 


1759 


456 


31 


FAKRITDKKLLSLIYLAT*KGEITKaLI 
KQ* XLKAQaKNSLKRHTIVLlIHMKL/CL 
ILLIIKEMQIK/STLRYHFFFITLAKXI 
* KI,GI!ifTFCWQGL*GTLIHCWWECK*HNS 
YVGGIWQ/FSNKLYVQI*YISrSAISNLDG 
RVGRPG 


1749 


15650 


A 


1760 


3 


378 


QFQYFYYNGSVIKAVWYWSKNRKMDQ* 
NRTBSPDKKIflKHMQLIFDKGTWTPQWR 
KDDLFKKWCWMN*TSTCQKKKKKKGGGP 
* KEQNLTPPGWKDI I FLFGAPKNMPGAG 
VKTRWGGKNPGFPQ 


1750 


15651 


A 


1761 


69 


384 


YTSaSWGGARYTASAAGWKTLLLLFLFI 
*DRVLLCHPRWSALTQPRI lAASTFQ \V 
KQSFWLMHIGDWDYRRCMPPC5ANFFNF 
LCKKKKTLRRQEWQTPALVEV 


1751 


15652 


A 








KFSTPGNXNLFFLKAPPFFFFCRGRVLL 
CCPDWYSTFGLKQSP/CFSLPKCWDYRR 
ESPPQAFFVFFGLPSLPFSPPPAPSLSQ 
SSSFFF*MEFHSFAQftGVKWIiN\LFGSL 
QPPPPGLK* FSCLSLPPTRP 


1752 


15653 


A 


1763 


2 


390 


PRVRGFFMRKFVDSYLVPYTKSNLKWTI 
DLHVRAKAINLIiI!I*ISIVRIHLYDRELCaiG 
FLEMIKKTKaAATTTKLDFIKIKNFCAQ 
QMSS /MKVKRQSTKWKKKCSYHISDKGL 
VSRKYVKKAYNSS IRTQSH 


1753 


15654 


A 


1764 


334 


3 


WSKRSGPPLSKNQTKKKNATKPQT * KNW 
INEIGPII/NTSPSKEKTGSNGFTD*FY 
*TLKEELLSILLKLPQKTEKSVIFPKSF 
YVKDHSSCLSGIHPKDAWMEQHMQINQC 


1754 


15655 


A 


1765 


259 


1 


KSTLFLMKICYWKGSFLKKHFFLETRSLS 
D/SPG/GVQWRDHSSLQPRTPGLKQSPH 
LSLLSSWDHR*APPCPflNFRFRLKTRIG 

RDV 


1755 


15656 


A 


1766 


402 


386 


FKKKSRLRKI / KVQDEAASTPDLAKT I / 
DYEGHYTKQQIFHVDERAFYWKKMPP/R 
TFIARKKKSMPGFRASKDKLTVLSGANA 
AGDLKLKPVLTYHS/ENPRALK/HYAKS 
TLPELYKWNTKAWMKIQKFPS*FT*IFM 


1756 


15657 - 


A 


1767 


2 


406 


PRVRPRVRKLITLLHVSQRWSSEKKKKK 
KKTKKKKKKKKRGGGRL* KKKKKKPRRG 
RVNLFFWGPKKSTPPRVFKHRGGEK/PP 
PPPPKKPRERKPSLGTOSTWHGISPIKH 
SKKTKSTK 


1757 


15658 


A 


1768 


14 


409 


lASGFLFFIYFGVIGRPPKRGGVFFPGG 
GGAPHPPQGGC*KKKRGGGGGPPPFFPP 
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SEQID 
NO: of 
nucleotide 
sequence 



SEQID 
NO: of 
peptide 
sequence 



Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 
sequence 



Predict- 

location 
correspon 
ding to 
last amino 
acid 

residue of 



Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tInknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 



SLGGLGRPRPMSKIKTPPSAGGKPP/PF 
LRNQK*TGGGGPPQ*PPLFGGPGGKMFS 
PPRGGGGGGPKTPPPPPPGGE 



TKTG P FP SNRAKKKKI VFQKKKKEKKIN 
CFVTAPFKGI KTEATD *EKI FAKHLS VK 
G/LVYL*YI*ISKIYKELLKLi!raEKTTI 
PIKK*AKDLSRHFTRYTD 



FSPVSLG*GRGNIYAGMSNV*EVPPEID 
FQHEVKRALQTS FQVKLVKI I FFKSTI * 
KSLAK/WLAWAHaCNPSTLGG*GGWIT 
RSGVKSEPGQHGE 



KKKKEKKIICFVTDTFKGIKTEATD*EK 
IFAKHLSVKG/LWL*YI*ISKIYKELL 
KLNNE KTT I P I KK* AKDHS RHFTRYTD 



KKIRRGGGQPPLFPLIGGGREKKFFFPG 
KRGFY*TKAGPFPSSWAKKKKFVFKKKK 
KEKKI ICPVTATFKGIICrEASD*EKIFA 
KHLSVKG/LVYL*YI*ISKIYKELLKLN 
NEKTTIPIKK*AXDLSREFTRYTD 



KKKKKKPTTFPGLPFFFSLSLPPRVPGA 
RGPPSSFPSKPGLGC* IKKSGVSPAPKK 
KKPRKTHPKNFPRPNPKBGGEKFFNLPT 
HPPGGTPPFF*RRRENPPGFAPQKG/EV 



AAGEWLHQGSLQSLPPGLKQSAPLGVSK 
W*NPWHDPPPPAPRFWWVGGVVLRWS 
FFLGAQaGIFFFF*IEMGSHYVAQa3I,E 
/PPRLQGSS*LNLPSSWDYRPVP 



QQMRDKENLFEHNK*GIRGI YLNI IKAR 
HEKPTVDTILSGESFSSKIKTMLISPFL 
FNTVLGVLA*ARKRKDI*VGK*EVKSYM 
FTNDMI\LDNPKDSTPKKKKTGYFMGGP 
GSKPPPQRGQAFLSLTRT 
LFTLGQKKIKGPKFFS 



SPSSSSSSLFSPPPFFWGGPRF/SPPPP 
VFKPPPPFFFLGPPKKKFFPPPPAV* FF 
FF\LGPPPPFFFFFFFI>IEAGFPFFSPG* 
GP/SGPMAGFRSLPPPGNSLS KKKKS EG 
LGEGGNSVLTEVLLISSYQIPGWPR 



lisflvsslivrlyrpllclfpdpskgh 
cipcllplfwillhi flfmhs f i ycl in 
drvllclpgwcawrprltaasatqVik 
rsgshlslpssk*\wdhrrcpgffffff 

FF 



SHLSLPKCWDY/RL*ATTPSQKILVFTH 
G*VLSLLSLSFLIPPD*TF*KMSLMRPS 
LTQKSS I IQHDLLDKVP ITI FLRQSL /D. 
S / VTQARMQ WHDLGS LQPQPPGLKLSSQ 
PQA1\SSWDYR 



LTLFWGAQYLPKKGKGPFFLLSL.PSPSV 
WGPLPQKKKSPPLCFPYFLNRVLLCHPG 
WIAWQSWHSSAHFSL/VLTRFK*SSCL 
SLLSSWNYRCTLLHPPNFLNFWYRQGNA 

VL.PKLVL 

RKNE XWSGAVAHAOJPSTLPGPGGQ IMH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 

0(1 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

nucleotide 
location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, l>=Aspartic Acid, 
E=Glutaniic Acid, F=Phenj/lalflnine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N^Asparagine, P=Proline, 
Q=Glutamine,R=ArE;inine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SGDQDEPG*HEPPRPTDSFFLYPSH 


1771 


15672 


A 


1782 


45 


387 


TQTPKLRRFFHFSLLKSWHYRCSPL\PQ 
HNG YFLTNLD lATSMFFFKDRVSL \LAQ 
AGEQWCDPSSLHPQTPGLRR\PPASASQ 
EGETTGAHHHTWRNL I YFFYYT YKF * GT 
YPMA 


1772 


15673 


A 


1783 


1 


401 


FATIiARLVSNS*PE/CDLPASASQSAGI 
TVSPAHIAISPSFPAMPSSGLGLSYPAY 
HPGLGLRC!HLPVLTSPWTSDTGPSSVLP 
DAGALHCPPEPQHICPL\LSGWLQTP 


1773 


15674 


A 


1784 


432 


1 


Fi:,FLFFFIKKKTPFFQaKPKNRGF*KPP 
EFFQIF/S/CPVFLGRFPKPQKEFFFPE 
REMGFFFLFSPQGGGPAEGFPPPPLNQG 
EAPRAGHKKKGPNLGGRTFFFFFFEMES 
\FSVTC3AGVQWHDLGSLQPLPPPFKRFS 
CLTHAS 


1774 


15675 


A 


1785 


15 


434 


DSPDPAPSPPSALPSLLPFHVYRDVCPV 

LCLRGWP* ♦MVERGRLGISPTWLLGWPF 

pgga/phikpe*yflfaytilrsvpnkl 
ggvlallls i li lai i p ilh iskqqs i i 

F 


^775 


15676 


"A 


1786 


! 


258 


CWPETPVLK*SHLDLTKRaEDRREPVWA 
ASTTIFFFETGL/NSGAQAGVRWVHLGS 
LEPLPPIPSLFI>4TP*GPCPGLGSSWPLR 
EF 


1776 


15677 


A 


1787 


399 


63 


SLHNQWKSTPTLKTSKKISHI *STWPG 
WAHRCNSNTLGHHGGRTA* /HSGVQDQ 
PGHHSETSSQRLRNPYIKRCIKYIjAHSK 
CCINDSFTVSVTSRKLIGKREVSPNNIT 
FR 


1777 


15678 






3 


474 


MSISPVEWNSKEAGHAANRQFPSFSPWK 
DDSHDASPPEPASPTIGNPIRRIiAESSW 
TWGSPCAEHPRARftGHRK2».TDCPWAAG 
SQWRGPAGQGAPRS CLFPGSRTAARAQH 
PRVAPPPPPAPLNTRASflliRSQLPRNPL 
♦VMTPRPPAAAPRSPVGP 


1778 


15679 


A 


1789 


66 


395 


LVQPLFDFIWHSRLSLWSGRDLLWSGKE 
TMNPCLNHSIGVLQEWQGSDVKKRRRLM 
ESLTGPAflDVIRILKSNNPAITTAECLK 
ALEQVFGSVDSSRDAQIKFLKTYQNP 


1779 


15680 


A 


1790 


413 




PSPRIOjI TDYS *EEGPRFWQV\EKKGQP 
LKPHPGLGSPHQESPRVGPPKR6GYNPF 
* KKRAKFPAPGENKGPPFWTGRDPTP * G 
KTKKKTNPPKKGGKKGPPPKPGQFFFFF 
LRRSLAWSPRLECSAAISAHCKLRL 


1780 


15681 


A 


1791 


314 


1 


KTKPFFLKKTPPKKKNKKKGSC* DKRRY 
QTRKG/IVNLGH/HPSFLFYFILFFETE 
SHSVGKAGVRWLRANSLQTPSPGFQQFS 
RHSKPSRKDYRHPPKRPQECVQ 


1781 


15682 


A 


1792 


104 


409 


EKQSFAFDWHFDLYFDNVK*KEGEESKfl 
GEFNASTGWFGNFRKRL/RFKNVRVTGE 
PAS VTQEAADEFPDNFKKI TEKKGYLHG 
KFLMYHEAAKYLNFWCPTK 


1782 


15683 


A 


1793 


392 


2 


GERERDRL*REREREREREREGERERAR 
ERERARQGTSTVESRF/HSYCRRDQDVA 
RPiCAKGEVAAGRGSPDGLQVGRGQ*PGP 
SLRPGPWREWGFATYLACGGPIPTGEVE 
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SEQIl) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 

correspon 

last^amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glufamic Acid, F=Plienylalanine, 
G=Glycine, H=Histi(3ine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unl{nown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














SGEEWAQQGEWGQALGPP 


1783 


15684 


A 


1794 


405 


119 


IIKMAIVPKAIYRINTISIKVPMAFFTK 
\LKTIIKFI *N*KRARIAKAILSKKNKA 
GSIPLPDFKLYYKAIVSKPTWCWYKNRP 
IVLVHSRTAKKK 


1784 


15685 


A 


1795 


455 


3 


CSVTQAGEQWCNHSSLQPQPPGLKWS/C 
PSQLPK\WNYRHVP PRPALTAHPALTAD 
F*RRKEYKLMRHRGKKWHDFTLRSKKMK 
AMRHEYHSFLTYPNGHLYLHWQFFFFDT 
ESHSVSQAGVQWHDLGSLQPPPPGFKRF 
ERVGPGGNSGAD 


1785 


15686 


A 


1796 


2 


134 


prlqhctpawatpqdsvskrkrdmttdp 
ai*kg**atttniyihkfynldeidqfl 
kkhklpqltwyei/dnlnspitrSiefv 
ilnskkkypgsdgftgef*d*fcletkk 
rydyrpcnlkriisdynkhlht 


1786 


15687 


A 


1797 


1 


404 


PTRPLTGSSASGMMVEIFPKTYLISSAC 
WVKISKIDLKLFS PFFFFGFLEGGFYFC 
PPNSRGGSPGGKFGLIEPLPPGLKGIPP 
PPPKRGGEFGPPPPTPAYFFFFLWGGGV 
/PP/AVGGGGKPPI*GNPPPWPPQG 


1787 


15688 


A 


1798 


2 


383 


sgwlwacrspdseplacpagprq*gayh 
pgqqlwmvr*vsqqcwptppalhsis\p 
rp*lpprraglgtcspp/vpespllppw 
apawpkpprralppa/ppvpgpidrpea 

EECRRMVRDMQAAPPAAP 


1788 


15689 


A 


1799 


35 


410 


ATGPSLGKVCS*AFSFri*PKIiDFWHLYP 
QASGFLL/CHNSPSQTNSCSSFGBOreGM 
VLRVNSL*PDHPWNEFyFLNFFIFyHDS 
/ SLTILPRIiVSWS *AQTILLPLP/PKVL 
GLE 


1789 


15690 


A 


1800 


65 


415 


kkgvlkggpppqplgv*gpppekggpgp 
f\gapkekppiialgepkgkpf*rgpkgf 

FPKKGLEPGAPPPPKPRGEKPPVFKGPH 
PPCKKGGGDKRBKNGGL*RKKPTBLGNG 
PPPPG 


1790 


15691 


A 


1801 


417 


3 


NLGPNIFPPRAPQNWGPAPPLFFFLKKN 
FTWGGGSTPLFP /tNLGGLGGPFPGLEV 
YAPPSPHG* PRFFFKNQKLPPPVWGALY 
S PFFGGWEN\RKAPPGQTLFFFFFFLK* 
GQDLPMLLQLVPNYWAQVILPPWPPKV 


1791 


15692 


A 


1802 


1 


431 


QPCTPGLK* SSCLSLPSIGDYRC/RTTV 
PS*FFFFPLEK6VGPIPRGGIKGLIINC* 
LGPHPPELK\NPPPQNSQEVGTTGPPPR 
PG*LFFFFETFFFF*KGGFFILAQP*MK 
WGALKKTPPAFSKRGGYRGGTPMEPTHC 
FLKEN 


1792 


15693 


A 


1803 


256 


399 


AIKNIHDS*EEVKISTIAGIWKKLITTL 
I DDFDGFKTSVEEVTVDVME 


1793 


15694 


A 


1804 


407 




FEKANLFPFLFKNSGPPNVPGRGMAGVP 
KFCCPSKRKTOGP I FFLRRLFLFYPKCR 
RPPLFWPAFGP*KFFLPQVFAPAF*KPS 
PKKKRAPVFFFFFFF/RDKVWLCHPPWS 
AMA*SQLTVTSVSWAQAILLPQR 


1794 


15695 


A 


1805 


429 


116 


LLTKKKRKKKPLACGESGLGGP*GTVTG 
V* QAEDTHVI W \ VLS SAPSLSSEEMTDS 
MPGHLPSKDSRYGMEMLTDKKWTWDGGA 
WDSSPQGANGKRGARQASGFS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQ ID 
NO; in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, I>=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histiiiinc, I=Isoleucine, 
K=Ljsine, L=Leucine, lVl=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovi'n, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1793 


15696 


A 


1806 


2 


408 


FVIFVFLVETGFVS ILARALACAISiSrS * 
PRDLPTLASQGAGITC3VSHRTWPRRSCV 
F*EAFLTMLMVLHIPPSLLWHSVFTHLL 
ALSS * CFLFFLRGPWAVTQAEGQGHDLG 
SLQPPLPGLKWFSCLSLPSS*NYR 


1796 


15697 


A 


1807 


1 


196 


FRLGASLDLSGGCSGVSWTPAPPLPPPP 
PPSAASGIS /SGSTSAAAGL* S CRTAFF 
SFLSSFFFFFLKKINPFPFLGGI*PFKG 
GFRLV*GNI * PPPQG\DFLGGTRGGKKP 
VIGGGKFGQGGNFPLPLKKPPGPQKNPPP 
\PPPPPPSAASGISMRAHLQLPACDRAA 
QLSFIjSSHLFFFFF 


1797 


15698 


A 


1808 


395 


3 


LGKKiMNNPKFWQGCEATGTL/M/HCWWV 
CKFVQSFWNTDS IY* G* AAATHDSAMLL 
LGMHSMPACTFVHQKT* TKMF lAALF IL 
PLNWKQV/RCPSVI\DGYIPTMDQSTAM 
KMNKLHAKTWMNLRNLMLNEKPR 


1798 


15699 


A 


1809 


7 


454 


IPGSTISLQPPPPGFG*FFCLSLLSSWD 
RQPAPPR/LANLRR* T * LQS A* LWR\RG 
PILDEMKSFMCNCNSLTEGVKGRVEMMS 
QNGRLLTKFCHVGQACLKTPDLK*SARL 
GLLKCWDYSCEPPCLTQMPFSFFLFF*D 
RVS/PLHPGWS 


1799 


15700 


A 

— 


1810 


20 


355 


PQCAHGCRAVAPVCVCFCFCVCVCSHCV 
SMCMCGEVSAGWTLLCLCLST\CSGAYA 
CDWGCG/ CYSACVCARVCACVCAQLLTC 
IGMWE*GQAGQGEVLLDLPLCLCSCWAP 
Q 


1800 


15701 




1811 


3 


414 


SSKWDWNSLQEFMDKIAGMKKNLGNLTE 
LNNTV^EFHRAITSINNRINHAEERISD 
LENWLSE I \NRQT * KIVTRNEQKLRE W 
DYVKRLNL*IIG\VFEREGGKAYYLQNI 
FED IVHENFPRFARDAKSQ I QEMQRT 


1801 


15702 


A 


1812 


1 


443 


AGKS PS PKKFKKGPGGGRWS * FLGGAGG 
GVPFRPGIQGSNYRFF/SPPPPPLGEKK 
KPPFKKKKKTPQKLQNGNPHPYFNPRVH 
GCREQQPWEKAPDATRQPHYADKHVEAG 
EPREPPKPHSEPLFSSPRKRPLFTRSST 
/SGASPP 


1802 


15703 


A 


1813 


411 


66 


WKNKVF]JKECLBR*IF/IIQKKKLDP\P 
LTRYIKIKSK*IKDIJIIRLEIKKTPGKE 
SVTLaKWLIRSPYLSFP*QIQSKQ*IWS 
YVLIKITKEERWSTSKEYQKPWRERBCAM 
YKIK 


1803 


15704 


A 


1814 


369 


20 


QEVRPSIYLSSSNRKYVKDSNARFTKBA 
IQIANIHMKNCPTSLIVGETHIKTSKGY 
HYVPIRMAKILKD\CNRRCGETGTLIHF 
*WEWKMVQPFQKPF*WFPKKRKIH*QFD 
LATRS 


1804 


15705 


A 


1815 


2 


675 


GLAILGRRLRGEACTRSPFSS ILIFMVS 
MGEWPGVP/GIKGTRTGREAVPTRRRKS 
S IWPQTGGAGESSG/PLRGLPRQGKPGS 
PRGPSGPPQNCARWWHPQAAPLGACCFS 
GPEPSRLPPWRQGNWCa:.PSTPSSSA*EG 
WR/PVLQPGFL*S PSSLftS ICPGAERGP 
PGSSRP/GLRGAPGG 


1805 


13706 


A 


1816 


273 


2 


MESHIMWPFTIGFFHLVQCLHGPSMLQC 
IAVLHSSYLFIYLFIY*DRYLLCHPGWS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 














AWQS*LTGASGSW\LRRSFHLSLP*C* 
DFRHEPE 


1806 


13707 


A 


1817 


40 


404 


GLHE \ PGVQGCSEL * SHDCTPAWVTS ET 
LTLKKLKIKKSKKKKRANPRGFERPTFG 
SAGPGGLLKAIISKPRVIQGTQKSGALL 
GFNSNGGENHPPQKQPYLGGLYAKAHSP 
RGGGPPSPCG 


1807 


13708 


A 


1818 


392 


3 


EKYNMSYDIKSTNHRKNCKLDFIKI/RN 
CCSLKDTINKMKMQASNLEKIFAIHMPD 
aGLIFKRKNSCNLVRR*QPPFFKJaKDL 
NTQKSQ*TNG/HGSKETSLIIREMQIKT 
T/MNYTTSlPT*MLKlKKiyiNI 


1808 


13709 


A 


1819 


311 


431 


EWGRAWWLTPVTPSLWEAEAGESRDQE 
* KLCTTVEK* KTISW/HDVPIRSSWTGM 
YAHACMPFTLGGRGG* ITRSGVGDQPD* 
HGBGL 


1809 


15710 


A 


1820 


68 


410 


AKKNQGPGPMVFepGGIKPPPKQKKVRG 
GFFAICPKEQVFFFFFFWRAWQSLNHC 
SLQPQPPGLKQFKQSSHLSLLSS*GYKH 
VLPCPANFILFSLVETGSXSIYFPGWSQ 
TP 


1810 


15711 


A 


1821 


408 


1 


TPFFFLRVLRLTPPLLGNFFGPGFPPWG 
GFSPGPLLKGPRP/CFPIFKPIFQPGKW 
GLVFFFFPPFFFYPRGSR*NLKKI FPNF 
PFFFPVFF*ILNPPFFFFFFFFFFFL*D 
RVSLCRPGWSAVARYRLTASSTSQ 


1811 


15712 


A 


1822 


362 


76 


SEIAPLHFMVGDRVSUILETKrKQSNKQT 
KKTLIFRDRVLLCRPGKNa./VEVQ**LP 
AASN/FLRLKQSSCISLSSNWIYRHAPP 
HIATVFNFLIIFE 


1812 


15713 


A 


1823 


314 


2 


VISKPCPRELTCITYGVSLTQCSMFGRM 
KGLLLIWPPVCEVRRASGRPPLMGSEEP 
LCPAATPSGRCTQQ/IiH*ERAMMTMAVL 
SNRKGGISIVGECR*RlSrQlVAVS 


1813 


15714 


A 


1824 


57 


389 


NLHLLQLPTYTDADSTGPTLSGMNVKNL 
HWSYEYKYRSITGVQWLILGSLQPLPPR 
PKCSCLSFLRRWDYRCaPPREATF\*PL 
VETAFLERLTSCDLPTSASQSADITGV 


1814 


15715 


A 


1825 


410 


70 


vp imsatqdyrhepprpagrflkklkme 
p phnpalllvgi * pknmbcslhkdvctpm 
fsgtlfaiakiqkkpncpssmdewincr 
nyXmhiydgicysalkkneilarrsgtr 

L 


1815 


15716 


A 


1826 


2 


411 


flvemgf \smlvi aglklpts ggapas a 
sesagitdv3hrawpvffflkrclvlvg 
rs * *a\wphtnlipplpsgikgdlcpns 
aggwekgapppspgkf/cef*grtgttn 
fahg/wtktpd 


1816 


15717 


A 


1827 


276 


3 


grpgpadfrvrpqllqrflfiylfteme 
s cs vtqagvqwcnlgslqplp pglq * fs 
cv\k*fsclgllsswdyrhmpphldnks 
ifsrngvs 


1817 


15718 


A 


1828 


1 


391 


LEPRRRFLQCVQDCATALQPGQQSKTLS 
QKKKKKKGGPP/S *YQKGQGCPSGKKGR 
GVAGKGAFGPGFGGENKTPPGGGPTGEG 
PFPQKGWGPSQGPTKGPNLWGPGGPKL 
GGKGGPPGPTKGGGGPSSF 


1818 


15719 


A 


1829 


2 


134 


DHLSPWWWQPGQHSEAPSL/LINIWKL 



650 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ED 
NO: in 

USSN 

09/515,1 

26 


Predicted 

location 
correspond 
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Predict- 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyJa)anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGCGGAHLSSHL/LREDHIitTLEVQGCSE 
P*LHACTPAWATE*DPVSQTKLN*NIWK 
LftGCGGAHLSSHLLGRIT 


1819 


15720 


A 


1830 


427 


107 


VQHQPGQQRESPS III IQKLAHGGARCL 
*SQLPGKLRLENRFN/SGSERSHQCTPA 
WVTDRDCLKRTGITRASSQELSVGIKQS 
CLNPRTAQLQVSAQSPSTVSTNL 


1820 


13721 


A 


1831 


3 


540 


VQFPNFiayyKATAIKTVL\QHKQR*ID 
E/MNKI*TPEINSYVYGYIiNENKDAKAI 
QWGNDSF\FKKWC*DNWISACKYSQTSV 
SASSASSSSS 


1821 


15722 


A 


1832 


385 


2 


AGRQSETPYHKSTIMKVSLQVCVDLSCL 
LGPGSLVQDSPNIPAQRKRFLRLGSFWL 
PLSRFSRVGWPLPHWGQRSSGFSLPR\P 
P*SQIPAPRSPPPAGPVPARSWVCGPRP 
QTRPLPAERPSPRPRRL 


1822 


15723 


A 


1833 


7 


399 


RISRSYLSEYGGSGKEHPTLGASYARIM 
VFG/VFIIYF*RQCLA*AQWYSHSSLLP 
QTPGLKHPP\AQAS *GftGTlGTHHHT*L 

tfaf/ifvlgcffl*nkisvtqaggqgc 
nfgslqppppglkrvscltlpr 


1823 


15724 


A 


1834 


2 


306 


LARLVSDS * PQ/ CD PPASASQSAG 1 1 CV 
SHRAQPAEEILKVFGTLCnEPQRPTPDI 
FI IPYLFC\LFEMESCBVAQAGVQWRIL 
GSLQPLAPGVKRVS CliSPP 


1824 


15725 


A 


1835 


12 


400 


KKGMVPKQLKVGKKPPFSCWGPNMKKRD 
SPVFHHQDPIFPIPFFFFGNGFLFFPP/ 
IjAGGQGGNLN*PWPLPWGLKEFPPPTPR 
GRGEKGGAPPPP.TNFVFT.KKGGFPLGGR 


1825 


T5726 


A 


1836 


T20 


401 


GGLEPPPLGDPPPLPPKRG 
KGSFVFIPQPEGEGPFLG*LKPRFPGLK 
QFSCLTLLRSOJYGPLPPPPVIF/CXSFL 
R 


1826 


15727 


A 


1837 


12 


357 


GLLGGQMNGSLGTQTSYEDLMSSS\FKP 
NSPPPTPS/VRTGHLPK*PLESSNGPPP 
PQVSHSFQGQWARGHPSPPPQWHTPFSP 
PQQYTQCSKTDP*PPPSPPYLGQEGSNA 
PSIA 


1827 


15728 


A 


1838 




380 


LMKTHAKLLNKRVAN*IQQYKK/HH/NQ 
MGFILGVQIYFNF*KINLIQLINSVKKK 
KM/HSSSSSSSSSSSSSSSPPVLIKSLC 
NLGKNKRNFLCLTKGIYKWKTEXNSiyiKI 
I LNGEQUSTAF PLRLGTK 


1828 


15729 


A 


1839 


2 




vpgdakwfsvlhlkdafffiplvpesqy 
pfafewenpntrek\twavlp*gfwdsp 
hffaqplerdlrglqledgs ilqyvdhl 
lvysptqeasdqntiktlhfpadrsykv 
skkkaqitlqqvhclgyiltpgtck/ls 

PERVQAI 


1829 


15730 


A 


1840 


1 


642 


eikgiqigkeevt/slfaddidylrids 
tkkllevicelnkvg*kihm*t* ivfly 
ignehlelei/r/elmpciktsstmkyl 

E INLKKDV* DLYTEKYKVFPRE I KIT* A 
NQ * E ILCLCSRRLNTKM/ STFPQVFCVY 
YAIPVK/ IPSENFLVLVDKLILKFI*KY 
RGPRTAKTTLKKKKKWRLTLL I PKSYY 
KTIVITIGWYGFQDRQVD*WNRIE 


1830 


15731 


A 


1841 


3 


423 


HRITSE*DLHLARELIW\SMYGSLDHKN 
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E=Glutamic Acid, F=Phenylalanine, 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


183i 












YDAELDRMAMPCLCAIAGALPPGYVDAS 
SSSKAEKKATVDAEGNFDPRPVETLSVI 
IPEKLDSFINKFAEYTHEKWAFVKIQNN 
WS YGENI DEELKTRPMKR\ P Y\ KTYSEK 
D 


15732 


A 


1842 


33 


179 


YMLGGRGCSDLRSHHCTPAWVT/TA* LS 
KKQYQRQQNRLQYHLQEYERKK 


1832 


13733 


A 


1843 


349 


10 


LAGGLNSMEGER\LR*EREECTQQQMVH 
DKYCKDLMGFGTKPRHITPFSSFQAVQP 
QQSNALVGLLGYSSHQGLMGFGASPS PA 
KSTLVESRCCRDLMEEKFDQRK\QWVLK 


1833 


15734 


A 


1844 


15 


856 


AAEQLSFIYKLPQNPSPSTPGSSLSGTH 
GMQTMLGSTHLPNLTDMLGPELGVQG IP 
SPGCACQRGRG\GGGRECCS PPGVSPQG 
\SAVGRGAEGPGGLTRSGSGAASALWP 
GEKGCWCRTASGAGPQRRQRTRGPGSWG 
LSFSQTSEEKCPSPAGSAGAGPVCQRRQ 
SSFAGGCGTGAGAPGST\GDAHPAQGGS 
GGPLRSLPAVGGPRPGPSFLKTSGSGSV 
PQGVPILWSLT*RALRAPGSQGPAGLAV 
SCTGGRGYRDPQAPGTGftG*HGNSTRLR 
GP 


1834 


15735 


A 


1845 


402 


2 


SKAHRQGRPLRQG*APG/AARIPEQKRI 
GGP/EERRRPSARGPRATRVAGEGPKPK 
GQTAMAGGGHDPLPLPPARSRSQESIGA 
RSRGSGHSQEQPAPQPSGGDPSPPQERN 
LPEGTERPPKLCSTLPGQGPPPNV 


1835 


15736 


A 


1846 


446 


32 


tkkdvlni \lawkhuntkasetfhffq 
sgqakvqqgfvkegcelinealnlfnwv 
ygamhvktctcmrlldrlqyimgdyaea 
lsnqqkavlmXtervmgtehpcirpl 


1836 


15737 


A 


1847 


440 


4 


VDGRHVEVSKKGGQVNYQAG\KTVEIWA 
DKLGCajMLGTADMVECLKSTRYKELIQQ 
AITAGGAPIAFGPVIDGNVIPGDRQILM 
ECJGEFUSTYDIMLGVITQGBGLKFVDGIVH 
NEDGVTPND*KFSVSNFVDCMRPRRGPN 
YSRFQ 


1837 


15738 


A 


1848 


526 


0 


PREDPPPKRQTPIPTHVSPVWEKGPWGP 
APLRPDHPSLSPCPAMG*K\PGLPRGCP 
QTQISPLFNRSPASPLICHHHPPSEP\K 
PGPEPPPTPSSSIPSLAHFTRPGESSPL 
PPPQTPSGPP 


1838 


15739 


A 


1849 


417 


31 


QATGQECGCHRGGPPPGPAGETEPQAPL 
RLPGGTGIPWGAGILCPS*LPGPGSLSP 
AAGRG/SGPSAGPGAAWFS S P * /PACPS 
SSRSAVPGGAGSFREAGPGLFYTLPAPP 
WCGRGASNKIIQMPGLVC 


1839 


15740 


A 


1851 


3 


285 


YTVCECVCVCLCVCLPVSL \SLCLSVSV 
CFFPSLCGFVCVCPCACVSIAECALCAT 
KRFV/ CMAACLW * AS FCVS TWVMRPAVN 
RFRRGGSALGA 


1840 


15741 


A 


1852 


128 


524 


ki pglgqrs egvgqgkddlhvtapvpth 
gwgegaaskptvlpppppXpdaptvfff 
ffffwekkyffgpppkgagpqiyllgpn 
ppgfkpffpphpqaagi*yikpppwykc 
pllkkrgvstlap \gfpkp pp 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalarine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionlne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1841 


15742 


A 


1853 


1 


1648 


MWPLHSGLGKGVKPERKTTVRRPFVSA 
GKKYACPFKKAETPQWRRLRMRPKAPAA 
SLLATFPISQRLIPLTPARKHCPSESET 
STWWSKAPATPRTRGAGPTSPPRPTRR 
R\*ACSLQKLFAVEEEFEDEDFLSAVED 
AENRFTGSLPVNAGRLRPVS SRPQETVQ 
AQSSR/PAAVTPHCSLRGFGPARLGP /P 
ASLPPARPVLTA/GPSCIGAAPLRPVST 
SSSWIGNQRRVTVTEVLREPARPQSSAL 
HPLLTFESQQQQVGGFEGPEQDEFDKVL 
ASMELEEPGMELECGVSSEAIPTLPAQQ 
REGSVLAKKARWDLSGSCQKGPVPAIH 
KAGIMSAQDESLDPVIQCRTP\DPP*DL 
VLWVTFLFQ/PALTVPTQQLHWEVCPQR 
SPVQALQPLQAAEGTIQSSPQNRFPCQP 
FQSPSSWLSGKRHLPRPRTPNSSCSTPS 
RTSSGLFPRIPLQPQAPVSSIGSPVGTP 
KGPQGALQTPIVTNHLVQLVTAASRTPQ 
QPTHPSTRAKTRRFPGPAGILPHQQSGR 
SLEDIMVSAPQTPTHGALAKFQTE 


1842 


15743 


A 


1854 


235 


223 


IHKFIHQIWLAKITCQRTKV*KERSVLL 
PTSFSPVPSQGHYTCKQ\LCSLASDLSQ 
PDLWKFMNTjAVLHAMWNSRKVSCYPWT 
MIYFLIIANRT 


1843 


15744 


A 


1855 


373 


3 


IKDGIYRYFYl'LRI.DSFCLCCEFMNQFFS 
CMDRELSQRCFLNSAY*FPSNL*CYLCY 
LFFFF*I.KKYFLTFFLFLRDRVLLCCPE 
*SAMVHS *LTVPLDFW\VKGSSCHSLLS 
SITOYEHALPHLY 


1844 


15743 


A 


1856 


378 


1 


RQRHSPAGNTGRPQVTPa3*ISWPSITK 
DRTSMSSVTSGALGHTAASPHARLLPLA 
LPSVRTQHGSPPPGQEQPT I 1 CPSNLPT 
HPSLPLGMHPSVRASPPLCK/ P / SP PS I 
PASVHASKHPSPPVY 


1845 


15746 


A 


1857 


3 


379 


YMVRKVIEVWFLLLLLLFFFFRGGFLGQ 
GWGPPAPGFSFGKKHPQGLGSSNLRGPW 
* TNP * PVPGGALFLVGP PTPAH FPFKUF 
SRGFLIiV/ALaDF*TRPRLVSPHGG/RG 
AKGTPAFLESMPPWMP 


1846 


15747 


A 


1858 


452 


1 


gthglij:.gsgpf\rqvfkpdnfvfgqsg 
agnnwakghyiegaklvdswdwqee * 

ESCDCLQGLQLTHSMGSGMGTLFISKIR 
EECPDCIMNTLSWPPPKVSDTWEPyN 
ITLSIHQSVENTDETYCIDNEALYDICS 
RTLKLTTRCI 


1847 


15748 


A 


1859 


1 


385 


NTSSDYIFPFFFLFRNSIHSVTC3AGGQM 
HNQGSLQPWPSRLKX CPTASAS I CLHMP 
PFLaNFLIFFVEIGSPYVAQAGSRDPPA 
LASQ/SAGITGMGHCTQP*VrLFFFFFF 
SFRKKCSPWPPGGGPPIF 


1848 


15749 


A 


1860 


470 


17 


lEMDSERVPRDKLACITKCSKttlFDAIK 
IT*NELASaD/DFFPTLIYIVLKG/NPP 
CLQYNIQYITRFCNPSRLMTGEDGYYFT 
NLRLGTHCSWLMMTMWTCFRAFRIFSSQ 
WMSYVLRFRNGITGVSHRAHP 


1849 


15750 


A 


1861 


3 


790 


CSRPEFPGRRFVEAVRSKPYLSLPF*SR 
*SFFNVPAE*TSAKDILASSEFIKQNND 
VSS \LQKFMPQEVK*LDHIHTA*ADGSW 
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C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=iyiethionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine,R=Arginine, S=Serine, 
T-=Tlireonine,V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














KAELERGPDGSAPSPFPQKPAASRQPEL 
GELATFLGRVDPWYQSNWTLCPAIHKL 
AEMPPSLDTSRTVDPFILDVITYYIRMG 
TQPIYFQIYTVKIFFSDLSQDPTEDIFL 
lELKVKIQDSKFPKDGFSPRRRGVAEGP 
GAELSLCYQKALLSHRPREVTVSLRATG 
LILKAIPASMY 


1850 


15751 




1862 


192 


3 


SSGSHSVTQA\GVQWHEHSSLLP / LLT * 
P PGRK* ASHLTLASS * DYRRAPPHPANF 
*IFCREGV 


1851 


15752 


A 


1863 


82 


370 


SLCQKRAFVGEKLVHGLLVSPSGGRVPS 
CPDPWGCRPRFHA.IAVYSFLKLRWIPE 
VSILPEDLEELYDLFKVRSSGKMEGSRP 
*AGLSRG/DPACP 


1852 


15753 


A 


1864 


2 


325 


IQVYSISHLSIYLFMYHLSRGSMHVSM 
SLSI*SIYPCMDV*MYLSNLCMHVYIYL 
FYGSIYRFYLSICLSVYLSIYLSIYLSI 
YLICHPSIFICrviDEHAIFATWRH 


1853 


15754 


A 


1865 


3 


377 


YS PWCKLFRELCKINVFD \LDSPLLSGK 
EFNDTTHNTFDHMWRTKEHNEAGWLLLS 
SVDKVMKENDELRDSNSWLQ!CQV\ * PLK 
SAKTALSGSLNSCREKAEIVEKQTQSLT 
M*VADLQRKMHVQP 


1854 


15755 


A 


1866 


10 


378 


dWKNGEFIDAL*KVYGHKAPNKSAVYKW 
IT/*FKKGQDDIEDEDHSGRASTLRKKI 
HLVYAIiIDKD/*RLTAEAI2UJTIDIS IS 
lAYRILTEKLKLSKLSTQWVPKQLCPDQ 
LQRRAELPMEILK 


1855 


15756 


A 


1867 


346 




DILWKLQKPQPNGKMLKAAKBRKKFAF 
KGVPVRMNADFS lAAMKA/RRRWNS I FS 
F*KE]!INCHLRLLYSAKNIFP/EIKTFSD 
REFVTIRSAVKEILECDVLWAEERLSHVK 
SRNV 


1856 


15737 


A 


1868 




377 


GTFFQRTQCKGIKQ\YWGLIIKASSDP 
TCVEKEKVYIGKLNMILAQMLKQEWTEH 
WPAFISDIVGASRTSKSLCQMJMVILKL 
VSEEVFDFSSGQITQVKSKHVKDSMCME 
FSQI*Q\LCQF 


1857 


15758 


A 


1869 


90 


384 


QWLLFTEYSSLYHPVLFFFFFFPGGRTG 
PNPPAGGEGNETG/PNGPSTPGGGGNPP 
PLPPGGLGLLSMPPPPRQILLMETKKRP 
PP*TlSrKCCSPGYSPP 


1858 


15759 


A 


1870 


2 


578 


FWKHALLLGLDFLPGKMAPWSGQGCSL 
GHTEGGTSWDFAVGGASWRLKWCVKGD 
SHKGPATPIASCKGPLGRPCPLLAQSKA 
*GS *KRG/VAPGSP*IiALGMGGG\DRLT 

LI SQVHGWQVTQI ipfstegetkaqrsp 

SLPPRDLIRGRHSWNIDSTQIiGYCPLL 
PPPLHPAGPLPVPFTNGBIQKEKSRE 


1859 


15760 


A 


1871 


1 


382 


SGQDAGSCLLYGAGSGAMVSGAYNPYIK 
IIEQPRQRGMRFIYKCEGRSAGSIPWEH 
STDNNRTYPS IQIMNYYGKGKV\RITLV 
TKNDPYKP YPHDLVGKDCRDGYYEA* FG 
QERRPLFFPNLGIRCA 


1860 


15761 


A 


1872 


490 


1 


ADSLSKDPGRPLHP PNIWGDRGRQ* PGE 
TPQ *HRRS CASQDPGRSQ *PGKILPSPK 
PGRPPPMTGEDAGPPK\HGNPS PNDQKK 
NPPPK*/PQGNRIHRDPGRFPPQ*PRKK 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SFPNNLGDSNDQRSRFPVA*KNLPQSSS 
GTRAIGRELYLPPQPGSSAFPNSTT 


1861 


15762 


A 


1873 


373 


1 


gggvpgalfshkkkslfffppppfffrk 
gl*nf\kikgcgspflpllfflekescf 
vp \rve cggvilgp ckvclpgs ppfsas 
as*vsgatgac/rprpgkffspffffyf 
lvemrfhrvsqdv 


1862 


15763 


A 


1874 


3 


374 


YMLGKEIVSKTKIGQELGLLNQ/TSQW 
l!JaKEKFL*ETKSATPMNT*IKRKQNSPI 
TETEKALLVWIEDQTSHNILLTQNVI *N 
MALTLFNS IKAERGEEDTEEKLEGSRYW 
FMRFKKKKAISIT 


1S63 


15764 


A 


1875 




364 


IHSGKGESLWDLGLDTEFIiDLPSKa*HI 
KaKWDKIiDLIKM*ISrFCSAK/ET*/lRMK 

tqaihwerifvkni*1irklvykellklr 
n\kkntirr*aidmdrhftk/ekmpmtn 
*ytkgcsislvil 


1S64 


15765 


A 


1876 


41 


461 


GFLYLCSSEIDPYPSPSKKIKSEWIRTY 
WMKLLSENATIMLKDSDLSQV\FCVHIP 
FVQaAKftKIEE*DYIKLKWFCSAKGAI * 
KAKRQPTE /W/DRI FANTPCVTGLITTT 
CMEFTQLTSLITTPTITLPWNPRQSYHH 
NPT 


1863 


15766 


A 


1877 


2 


185 


VRPTKLDPLERTQYTLPLLYKWNNKAWM 
TflHL/FTA*FTEYFKS\SVKPDE*VEEI 
DYRMLYL 


1866 


15767 


A 


1878 


1 


491 


IHRLLWPPLSAARPPSRESGLRCRAPRR 
PASAftAATAASPS\PTAPQGPPR/RRRL 
LIQPPLYPRGLFTPGVPPL/APGGSREP 
S* SLT*DARTLPLPVLGPRRARLLGACP 
AVQAEEGVDPVGHLSEV1AAPCKQP*TP 
PPACHCUJGEGRPSGVQAPLHKftKLYP 


1867 


15768 


A 


1879 


24 


449 


LQPPLTWALLLQPPQPKSRAAFFFFFFF 
FGKKSQFLFGPPGGGEGEKLG*REPPPP 
6TKGMPPPPPRERGKKRGGPTGRENLGI 
*RKGGVPPGGRGGGQTPNPGGGGAK\PP 
KGGK*GGGRLPPPQIKGPKRGRPKKRKG 
RP 


1868 


15769 


A 


1880 


190 


2 


PLYCHKVGQVGLBLLTSGDPPPLSLPKC 
WDYRY\DHHaQPSF*LFLSVQISGIKFN 
HSWQPCI 


1869 


15770 


A 


1881 


1 


458 


faieagrnlpdkefryijitvivtaavyw 
gmjsklrcltsXlltfqhragvspytsp 
fgfa*tcvfakqllepilc/rpcfhrap 
llpkleghfae/fp*qcffr/qalgfsp 

/RSTCVGLRHG 


1870 


15771 


A 


1882 


3 


392 


YMMRYHYI LIRMAKVKI \ SSTKDDTQ* L 

klihc*qecrmvqplwkmv*qfliklni 
*lp*nlaillwgiyliemstyefr/rki 
ciqmfmtdliviakywtq /pq cps vagw 

IKQ/IRSIHTVEYYSAVKRNQL 


1871 


15772 


A 


1883 




473 


IQGGIAAYRVDRQERSNGRQGKRRNLSG 
TNMRRNKQKGLRRCKPRW/RSRGMAWKV 
DKPRDAR*KSDRDRQ/RGETDG*RVSGG 
LTA*EAYRH\HRGQ\EAa.SQSGRSRQAG 
RQTP * QEQAGNEAE IRREEE KSRGQERE 
RE\KDRDADRHKG 


1872 


15773 


A 


1884 


138 


444 


CYLTLIKCRI IYINDKV3VIjT1VWY*HK 
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SEQBD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Predict- 
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ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A==Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=^possible 
nucleotide insertion 














DRHID * WNRLQS P * LNPS IWVQLI LLKN 
AKNTQWVKDTLLDKWYW/ENWIYQCMRM 
PLGSYTI PMNQFKMNYGLS 


1873 


15774 


A 


1885 


27 


470 


TICCYIFLFFSFFSLFFFFFGKGVSF*S 
PGGENTG/ANLG*WNPPPPGKGISPA* P 
PKEPGMEGAGHPPGTKS*PQKFGPGPGG 
GPSFPYRGPNKTWGKPGPKWFPGGIGYP 
ATIPQKKQPEGKEGR\PGGPMSPYNGPP 
KTPTHGKGPG 


1874 


15775 


A 


1886 


478 


1 


KNVQEUSALQAAMCiRKMLlVCQTQCVII 
SGESGAGKSVAaKYIMGYISKVSGGGEK 
VQVHEGKQKDKLGRPHLHGSARPPICTP 
ASV*LLPTPPASAP\HVKDI ILQSNPLL 
EAFGNAKTVRKNNSSRFVSLCRPAWSSC 
SLRftDLSTHTPTHTHTRV 


1875 


15776 


A 


1887 


1 


402 


HSLERPHY IG* LFKNI FSRD / RVFAMIA 
HWSRTPGLKQSTPLSLPKCWDYRCEPQH 
PAGSFFFFFFLKNGFWGCBLGGRAGGQQ 
*LKS*WRPNPLG*GNPPC*PSKEVGTTG 
AHKKIANREIETEACGGTNFSL 


1876 


15777 


A 


1888 


511 


124 


GTERQHFAGMrVPPEGP*SMyitDPiaiK\ 
DDRPAEmMWIREPGLLLPRAPAQDAGKY 
YCHRGNLTMSPHLKITARPVLWHCLLRT 
GGWKVSAVTLAYLIFCLCSLVGILHLQR 
GESCPQWVCPNPTPSSPG 


1877 


15778 


A 


1889 


667 


310 


QLKP*ATKSV*KDTAFGIDVGNDFLAMT 
PKftOftMKGKlDKWGPIYR*S ICTAKETI 

nrvkr*prk*ekifakptwqkgqis/ri 
hkefqqln/kqksnnlierqtkdlnrfl 

SKECSKDL 


1878 


15779 


A 


1890 


462 


3 


KWFFPLGPPFLPPP/PPPI*NPSFQKNK 

kltrggcaryfpplksprpripfppfek 
gegspnskhspappfwgpketfffrrtp 
ppppppplcsflkkhlfsyipkgflggk 
gqnfhsflpsfffs ikkkllglgvhvrf 
ycrvnsctgffvqmy 


1879 


15780 




1891 


1 


455 


NTCLG)?GNGFLnATPKA*SMKKI INKLD 

fnetenfcsvkdtvkgmkrqathwek/v 
rkthilykdlilkiynqllkhhnkktst 
ikq*akdlnrediqmtnkhmkrcsgwgr 
wftpiiealwackvegslelesslgniv 
rphrkrereihcy 


1880 


15781 


A 


1892 


1 


537 


RGGIQAPKEVSPEGRQEPARKSLr*rA* 
ETP P* SQ * / P I PEEP/TGVFMKKPVSVS 

letgkiiavwpkvngkelpdnptikwfk 
gkwlelgsksgarfsfkeshnsasnvyp 

velhigkwlgdrgyyrlevkakdtcds 
cgfnidveaprqdaygqslesfqrtskr 
slyaialedpgm 


1881 


15782 


A 


1893 




514 


vrcqrrche *racgsslvnakklyedal 
markvkqslfsldvetdedkfqmmslqV 
cslaygtltkilsekrsaksygmssvem 
rsagqtskahlhqprrvsqvlqvpavnl 
lpfrkkgqtkdpalntslpqkvlgttee 
i sgkkhtsdti svas slhysppas pqgs 


1882 


15783 




1894 


473 


2 


vmgesrgfsppptftggnfvffwkqgvs 

LFCPEGFKTLCSS SPPPPPPQKAGVLGG 

sfharpppffffqi*ipfffgikkp\lc 
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Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=TJn known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SPPP*PL*-KKRGIFPVPKGIFCEMESCS 
VAQAGVQ*YDHGSLQPWPPGF**SSYLN 
LV*LSS*DYKCVPPHPMY 


1883 


15784 


A 


1895 


613 


11 


FRPRS PACGLHAVS SPKXPGAQALSS PG 
LNLVFTAGSWDAG/LLRLSPEAWNPRVA 
LPRVHTG PS ST /DLS PS CPLGLACGFGT 
SQPACL* SPFLLLAPARPSGCAWPGLPP 
VCSLHLRGGWGGADPTGCLDAWAMALAS 
LRPCLCPALPLSREPPSPLLNDLVLPRM 
PRRAPVLPLGRTPHSLLLCHVQSPSEQL 


1884 


15785 


A 


1896 


20 


449 


KFGYSSAAARRQQLGWEAML * YS FPLQL 
EPSAQTWGPGTLRLPNRALLVWKFEGS 
EVSPCVACGIQAALSMGSTSSVKLLSHP 
QAPLPQWHQMVFARCLCMCGAQLNVPP \ 
ESFTFQVSTKDVPL2UL,MRCALRK/KaTV 
FRQPL 


1885 


15786 


A 


1897 


393 


3 


RPTAQSKGNI*VRVAS\KALSPKLLDFL 
PGKVLNGEKVDVRPATRQNLSQFEAQAR 
KRECVRVPRGGIPPRAHSRDSSDSADQR 
ATPSENLVPSSARVDKPPSVLPYFNHPP 
SALPVMGLPPPPIPPPPCI 


1886 


15787 


A 


1898 


395 


217 


rer/cksfr/ppa/hdqm:ikgaq*qvn 

QAAflAQAflAPTUUiMVSEDISSLLVSSQK 
SKVSNYM 


1887 


15788 


A 


1899 


1 


375 


NTVLVQ*KINKaWMTVHPF/TAWFSEYFK 
ATVEIYCS/EKIPFKILLVFARVHSHPR 
TLIEI*KEIYAVFIPflNTPSILQPMDHG 
1 1 LSSKP YYLRfCASRAQRLTPVI PALME 
AEAAGSPEVSSSGLA 


1888 


15789 


A 


1900 


47 


326 


VKSIIQFNSKHEPGKHDKTPSPEKTQIV 
WWLGSELRSPSYSGAAGGREMGRSLESM 
RSRPQRaETAPLHSSPGSGSEMLSL*RQ 
HLTPTAWAQVQWRGLCSLRPRPPGFK* / 
FCPSQHHPQPPSSWDYATRCQATKQFVF 
FLEMGFWMFARLILTVEmW 


1889 


15790 


A 


1901 


181 


837 


AGRVDRREPGMGTGICKELETGSRETRS 
ASRWGRGRWRLGQACHVPQGLPLSTFHL 
GAQAKRRGGTPIACSSHLPNSH\''GSLKA 
QRDEftMVQSGrAPAVSSTCTRWT*GSEW 
* *GLHIVAaRRQQ6REEEPPRTTAAPET 
LCFQQTSWASSCSLEHSAQPSEVQVRAL 
SVPSHSPMWV\PQLSLPRDHRKPPGE 


1890 


15791 


A 


1902 


1 


385 


YTWGFRGKKPLIHeL*fiYaVQPLWRAA 
WRFSK*LRVEL*FNAMSPLGVyPEENK 

LFYQNSTCTCWFITALFITGKT/WNQP 


1891 


15792 


A 


1903 


207 


3 


FREMEFLHLGQAGLELSTSGDPPTLASQ 
NVG/HYRREPLRPAWTLQS *PLGLKQPS 
CLSLPSSWEYRHMY 


1892 


15793 


A 


1904 


2 


391 


IQPLISQRKYKTLGQNSLS/CCAHP/PP 
HFFFVLDSYFHSLLSQLKNAPFT*G*CP 
YWYSKSYSLIHSWGSSSILPCPLTSSGF 
PSLPPSYQPLPCPSLFLL*NSLSTLCLL 
FFSLLSQPSFSNRWPSQVYL 


1893 


15794 

L 


A 


1905 


3 


424 


YRAGCLQSLLPPPLLLFLLPCDVPFPPS 
S \ LP * VKAS CGL I RSQQHVGT I P CLQNR 
KSNKPLHKLPSLRHSLRAMQNRILPILR . 
IGKFF/ IFFS *DGVSLCHPGWSA6VQLQ 
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nucleotide 
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ing to first 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, 0=Aspartic Acid, 
E=GIutaniic Acid, t'=Phcnylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FTALHLL/ VLKRFS CLSL 


1894 


15795 


A 


1906 


2 


389 


IQRGLDKSCLITDNIPSSQSLIQKKGVW 
PS FKSMKADRGKEAAEEKSEASRSWFMR 
EKERSHLHN/ IK*VQGKflACYPEDLAS 1 
IDEGGYTKQ* IFNEDY/MWKKiyi*FRSFL 
TREKLTPGFKASKDRLTPLV 


1895 


15796 


A 


1907 


458 


13 


AKEETQSVWDFPM/VPQGVYVNFPVSR 
NANLSTIKQLLWHRAQYEPLFHMLSGPE 
AYVFTCINQTAEQQELEDEQRRLCDVQP 
FLPWRMVDCEGARVNKLLSSQISLFIG 
KGVRE11,DSLSDPEVSDFSTKMCQFCEKS 
AAL*DQATS 


1896 


15797 


A 


1908 


409 


3 


EKTDGLYRASQRGKDLRRLCAQSFHASW 
KDGMALCALIHRRRQ\DLIGYAKLRKDD 
PIGNLNTAFEVAEKYLDIPKMLDABDIV 
TTPKEDEKaiMTYVSWL* I VIAGAEQAE 
TAASRI CKVIiAVWREKKKLME E RV 


1897 


15798 


A 


1909 


116 


379 


HSGPRREGALLLPKCLPHAKRCLLLFKM 
CSDGATALCCPGWSSAAPS * LTQS P/AS 
TSQAK*PSHLGLPSCWDYRCIPPHPftNC 
LDYYY 


1898 


15799 


A 


1910 


418 


3 


QD*YATANRWFICMLSQACFLPSL*PAH 
LL*L*QLMLFSFPGTPVFSYGDEIGLDA 
SALPGQHIVIEAPVMLWDESS FPDI PGAVS 
ALMIVKGQSEAPGSLLSLFRR\LSVQRS 
KERS LLHGDFLAFS AGPKLFS Y 1 RPM Y 


1899 


15800 


A 


1911 


394 


3 


ILEAYPEVKDPAVKGASSKKEMYGH* /A 
AEQALPVASEQEQQRHERSEKKQPQVKE 
GNNTNKSEKIQLSENICDSTSSAaaORL 
TQQRKIGKTYPQQFPKKLKEEHDRCTLK 
QENEEKTNVNIIVISKKNREDV 


1900 


15801 


A 


1912 


499 


141 


PGLGERDWTSKYGQGAGEGSTREWRSRC 
G/ IRPGGDAGQQQPRPE*SVCPRGAHSP 
GPQSWKASPAWHSABPGGRCSQLGVQKE 
GPPGHLLQPGCRTPGPGIRKERFSGYLQ 


1901 


15802 


A 


1913 


127 


387 


ISFVFPPTLPKMPQLKPETISMTGLNLF 
QHLCNLAHLATSAYDGCSNSEV/CDLDL 
LY*AALFLKLDYQIRFPNYFSTKYRIYY 
LCLY 


1902 


15803 


A 


1914 


504 


0 


PGPGQRKHSTAPMCLLNIYAOILRFTLA 
NQIYKCIKRIIHHDQVGFIPIMQSWFNI 
QNQ*\INLIHHINRLN*KNHMI ISFDKT 

HHLFIIKTFIKLGIEGNLLNLIKNIF/S 
KNPAANIILNSEKV 


1903 


15804 


A 


1915 


46 


415 


YTSNKQLQIQILKITYNSTKKYKILINW 
I KDViCDQ YTENQKI LLRK I KD LNK/ YRD 
TSCSW/NIVQMSMFS/KLIYRFSEPPNK 
NLSTL* 1 LTTSF * FAMKYTGTG I YKVTM 
KWRVGRLSLPNFQNY 


1904 


15805 


A 


1916 


420 


1 


EKaDCVERARKSPDSIP*\RGGQISVTM 
VSPNEQEKAGQLAIGVRAVRYKGV\LLA 
KMWRKKLHLTLSANLEKI IAIGLFF3NF 
ERKPPENTFLKLTAMATHSESNLSCFAQ 
EDIAICRPHPAIKMPEKAEQYKPLTASV 


1905 


15806 


■A 


1917 


384 


3 


TRT ITSGQYSTHVIRASRVPTRS * VPVF 
RSCTSNRRFSQAIEPRVMH*KVHIRAST 
VRYDSGGHVAVYPANDSar,VYQLGKILG 
ANLYWMSLNNLDEESNKKHPFPCHTS/ 
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Amino acid sequence (A=Alanine 
C^Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K.=l,ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RRTALT* YLD ITNP PMY 


1906 


15807 


A 


1918 


405 


105 


KAEAQRREE I C S GP HSQADS AWHP LVTW 
WH*PPPVARSSQEPGVHSPVP/HVCRKI 
LL IRPKI>«aANEGMYRELRWFTPWSRSR 
*VGCLTTPGMCVKHLRCV 


1907 


15808 


A 


1919 


409 


3 


GGQEEGVFRVLNSKEFETERGKP*IPPV 
KDQEIIH\PTKFNHVAHMGPGDGKQVLM 
DLPLSAVPPSQEESPGPAPTNLARQPPC 
RWKPYISWPSSGGSEPSVTVPLRSMSDP 
AQDFDKEPDSDSTKHSTPSNSSNP 


1908 


15809 


A 


1920 


9 


470 


APARNPLPRPCTWPTGP * CLRPARP PVA 
SACLCGGTWNLALWLCS PGT/PVPFLTP 
PCS / SCEVQQPASHS VASNQS KE PAKSA 
AVAHECPPGGTGSADPGWPPGATCPESP 
GPATPHTLGWEPGKSSPPTMEEEPWAP 
QGSPCWTVRQKMTMMM 


1909 


15810 


A 


1921 


556 


1 


I<!LWAALG<3lLRRVGtoPPSHRl5lSPS- 
AMAGQAGGNGDGEEGSS\GGLAAVPHLP 
*SQHAftHPLLIGPPGQQHlGDSKV*GFP 
SPRIiEENTLENGGWGSKQLHGSPGSQHA 
GGSWKNGETSLKG/ PH*ADGAGRHTMPQ 
SPPSPPFKPHSV*HNPPAS/PPPHGSPA 
PSGTSPLPMSAVSLLPPGSL 


1910 


15811 


A 


1922 


567 




GGWGETFSRLGNDLQaHE*SRFNAQAQE 
ETSR\VLaVSLINEALDKGSLEKTLSAL 
LLPAAGLDDVSLPVAPRYHLLLVAAKRQ 

KRQVTGDPGAVLWLEEIRQGWRATQDA 

NTAQRRSlLGVAAINQAIKEGKAAQTERV 
LRMAAVALRGWPDCANGYORDLESAMA 
KIQRPAAV 


1911 


15812 




1923 


2 


405 


IQGCGITSSSVLHGMVFKKETEGD/VTS 
VKDAKIAEYSCPFDGMITETKGTVLIKT 
DEELMNLSKGEENLMDA*VKAIADTGAN 
VVWGGKVADMALHYANKYNMMLVKLNS 
QWDVKRLCKTVGATALPKLTPPCL 


1912 


15813 


A 


1924 


510 


37 


LLGHAFHVQSSSGRPQLAEASGHSHLKK 
GECVQQRTGNVGLSPNTARWGTPLGPSI 
SSSAPPWSFSAFPGPPGSGK*AAKDGPG 
CSLKSGLKR\RSQGGLR*TRGSGPSPMP 
PSPSPS/ERPPPGDEGLLPCTPRGGLPG 
PKINTACVCAADI SPGLEPV 


1913 


15814 


A 


1925 


74 


429 


ATIPGHELLLLFFFFFFFFFLGKGAWPL 
G*GKG\GaPIRG*GNRAPRG*REFPPPT 
PGKRGNTGGGQPGQPIPGFLKKKGAPPG 
GPGVPKTRGQIEPPPWPSKRAGVTGWTL 
GPGPKV 


1914 


15815 


A 


1926 


515 


304 


ALAaSLaLAi:iNGVFTKTIK\*IVGRPRP 
DFFYRCFP/GMG*PHSDLMCTGDKD\W 
HEGPKETSPSGHSSPV 


1915 


15816 


A 


1927 


1 


433 


NTVGSNKKSKKLYFAEIEKSYLNFIMNIi 
KGPQ*VKIILKKKSSVTKCTLIHSKA*Y 
KV/LKIVWYWQRDTHM/DHWNSIKQHMV 
KQFFDGDTKTVQWAKDCLYNNWCWEK\W 
ISTSNRMKFNIYPTP*ANINCHQDPNSK 
ELYRYKN 


1916 


15817 


A 


1928 


316 


336 


GGT*PPQAATPIS*1.YLPITSS1,/TLPP 
APSLPLQIAPIPSDSPSPRYS/TPTPPI 
SLGLAPPTTLLI P VPSLPVS PRPLNSTA 
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nucle- 

location 
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last amino 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PP/TFLTGSGAAVTWSLAVSLSPAPAS 
AMDJCAQNM 


1917 


15818 


A 


1929 


490 




PTRLTCPGSHVPGPVTFPPYETHSMP\P 
PCHVHPLVQPPGSSHVPGPVTSSPSPPS 
ATHIHPLVQPPGSSQLPNTWVHPSQRMR 
TWLQQHPPLPAPPQP/PSPGSICSSMTH 
LTEYLASFPAPQ/PREQAPQGGVPTSLP 
QW*APQPTAI>CRATQSQQLLDG 


1918 


15819 




1930 




552 


RNPRKRAFPSPPIPAPPS*NQENERSRH 
PQSLL\PPVKSRKRAFPSP/LSPCSSFV 
KSRKEAFPSPPVPAPPS*NQGNERSRHP 
QSLLLUIEIKEMSVPVTPSPCSSFVKSR 
K*AFPSPPVPVLLRDHTFLPLPEPRQPS 
TVPVGCFGFSRIPRRWNHTGCAFLLICH 
SftRLLCDPRVSSCWRHS 


1919 


15820 


A 


1931 




3 


RVPGESRK*ERVLDSRHKD*EGRRRGNI 
\MGKRVDYTT*TVITDVTNLSIDQVGVH 
RSIAMIMTFAEIVTPFNIDRLQELVERG 
NSQYPGAKYIIRDNGDRIDLRFHPKHSD 
LHLQTGYKVKRHMCDGDIVIFNRQCI 


1920 


15821 




1932 


521 


103 


atepagvrlkeggnitesfvtvgnvisa 
ladfsqda/srys*kkkqvlvpyrdsvm 
twllkdslggnskaimmatispadvnyg 
etlstuiyanhakni inkptinedmjvk 
lirelhaeiarlktllaqgnqialldsp 


1921 


15822 


A 


1933 


1 


490 


NTGWWRKGEGNGTLMVKELPCVPLCS S 
AGGLTEEDEGELCAAGFPLIAEDFGQAL 
QQLQTAHSQAVGAP/KGGDQVLGGGTGA 
PPTAYPNHALS IS * IPSVSWHDEGALPA 
VKIEILGIIHLLPERHALLSLVQARSGL 
LLHGPPATGKILLNmSTTECCLT 


1922 


15823 


A 


1934 


383 


151 


EVAFLENLIKDDIERGRLPLLLVANAGT 
AAV6HS\S*LGRMKELCTQCVHRPKMEG 
YMHVSCJHPVPESHKjyiRAKAIF 


1923 


15824 


A 


1935 


379 


1 


YVPWSKDKEYFFNSQ*GLTTHQILPYI 
DGFRHVQKISAEADVKIiNLVRIAIQILL 
♦VGLQSYLGQGHQPGKS/CRGPGCEGWE 
GMVIJIS*AQLSLSGTTAL*HWCPSSRKL 
IQFGIIKNLIRRLCI 


1924 


15825 


A 


1936 


376 


2 


GCLFIYMKPTAMSSSQVRRSGEVSPFTA 
VPA*S*K/QGHGAVLGCHITSEACFQAC 
FPFLRPGRSTCFA*SGC*DVERSSSHSH 
GTAHSP\HGTfiHTPMEQHTHSHEQHTHS 
HEQHTHSHGTRHMY 


1923 


15826 


A 


1937 


426 




KHGEI IDDLLKVYRDNA3KKSAI / YSKW 
ITHWKiCRRDDVGEEI'HSSRPATSVCEET 
I HLVCALI *ED*LIA?ffiTIMIAVGI PRC 
SAYTILT/E/KLKLSKLSTRWVPK/P/L 
LPAQLQIREKRSMAIUSKWNQDHEAFLH 
IIIAGLY 


1926 


15827 


A 


1938 


2 


469 


KRRKLVS S lAAAIHPALSS *TRQRFLLL 
LFLFHIVREDLVQIBIKKKHTIQIGNVE 
VKLL*PTDNIILCGKLYRFHRSTQKVLQ 
LI/NFS IVAAYKINMHISLVFLYSNDE * 
LENKTKQTSSFTIALKRIKYVGSNKKSK 
KFYFAEIEKSYLDFIWN 


1927 


15828 


A 


1939 


468 


920 


I PLSTHS PGMGTSTHLSTRHHLS FWTYH 
EELU!JPGEETDWA/lMIiFRWGI.LMLPRL 
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peptide 
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Predict- 
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ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=V aline, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSNS*AQVI PPTSASQSAGITGMS \HAS 
GHLIFNY\CSM*WLHLRNSQ*KRCTGQE 
TGPCPVTQDGVQWSNYGSL* PQTPRLKQ 
SYHVSLPSSWDYS 


1928 


15829 


A 


1940 


448 


474 


grpgg*gsttrnalyhvmngedwiltt 
ckhgkdwqkhkdsrcdrdnteyek\ydf 
gemiihnatfciivhrgrrhgs frflermq 
aacvpvmlsngwelpfsevinwsqaavi 
gderliiqi pstirsihrdki lalrqrs 

QFLWESCIATAIEDPNLIGWR 


1929 


15830 


A 


1941 


1 


423 


NTTLIFAGGM*NVCPGPLCARLLHRSLT 
HATPDPPLLTLP* IPTPTPSSRCAPKAP 
PHLICPTSCPLCSKPHPWPP/ CSSCQ 


1930 


15831 


A 


1942 


48 


417 


rltmyqviiyrlhctalqplpprvk«fsc 
lslpstwnymhmpqcptnf\ciftrdev 
ppmlp\swwrtp 


193 J 


13832 


A 

— 


1943 


450 


2 


VHRRSNRQIIMDTSR*RGflMQSVESV\GV 
PY\EQ*TIVDGI\NSGVWEGIAYA*IEE 
RYPEEPALEDQEKYLYRYPGGE/AYKVE 
TIKIiNVEAVNTRRDKPTOIFAKNQAPVR 
MRRNSFTPLSSSNTIHEPRNYSVGSRPI 
KPLSPLRAQECI 


1932 


15833 




1944 


451 


3 


GRITRHLPRRAEDDREREREPSPLPSRH 
PMFPPSVTPKASSDWP/PASS IPCQACH 
G/PPPVSLPRKPAHRSC\PVFPP\ASGG 
DTSIHSGKTVYVKRKSQPAWPVLPPGGL 
*APSHQaPSPSPPDQHRHCPEl\R*DLL 
PAPAPSPFSIPPLY 


1933 


15834 


A 


1945 


402 


3 


VRLLVSWKVMYP*SHSKASL/HPQITAS 
LTGSCVNCIVILILNFFYEKISAWIAKM 
EIPRTYQEYESSLTLKMFLFQFVNLYS S 
WFYVAPFKGKFVGYPGKYTYLFNEWRSE 
ECDPGGCLIELTNQMTI IMAGDV 


1934 


15835 


A 


1947 


1 


405 


KKKKKPGGPLGPAQGKTPKTQKGGGAQG 
NRGKPPLFSGGNFGNGRIFGNPLPAPGP 
GGGGGPR*KTR\KRKNGNWDLLKGGGKL 
VFCPMVGKKLSGVPGV*QKNFKGGW 


1935 


15835 


— 


1948 


443 


1 


LTPGaANASIiLG\CCMEDLSVKG*RQGL 
WEALLTHNMVAGCRLEEVDNAYGHYEAF 
STLAPiCaWLSVELAEPCVPEPGLPPVFA 
NFIQLLSA/PWVTEGGTAWLEWWHVQP 

MLALMEAELRKSQVLNRVT *GAHYSD C 1 
AAALRIKIT 


1936 


15837 




1949 


396 


2 


gngrggsvppnslnedgiscai* /rhin 
wlng*tptiycl*ethlick/dh/hrlr 

VKGKKKI LHPNGNQKPAGVALLT * GQTD 
I K* KAI KSNKEGH YEIKGS VKQENI TVG 
NIYAPSTEAPRYKKQLLDLKGVW 


1937 


15838 


A 


1950 


419 


1 


RWQ/PSARPPTPSGK*GASLPARPSSGT 
*GALLPGCPVWKVRSASSWPPSRLGSEE 
PLCPA\PSHLGSEER\PSRPPSHIGSKE 
R\PARPPIA*DVGSASAPP/LPSGM*ER 
PAAPPSGR*GASLPGRPSPEMWGVPLPR 
RPVWDVHAPSRAPVWEVHSVSAWPPHLR 
REKTLRLATAPV 


1938 


15839 




1951 


422 


3 


QKHVIMSE/DAKIAFGKIQYPFKMESLN 
TL*MKVNFLNLIKITQKNPIAHTMFTGE 
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Predict- 
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correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MLE/AFLLRQGKRSC/CSLSPILFIILL 
EVLPS*VWQEK*/IQG*EREE*NCDFVD 
DMIVYVEN* KKPLYNLL* 1 1 S * VARFKV 
ILDTCI 


1939 


13840 


A 


1952 


2 


409 


CRHYSKNPPQRPLRTDAELPAV/DFWSL 
GAIIiFMVGCGQPPFQEANDSETLTMIMD 
CKYTEPSHVSKEGKDLITRTLQIDPRRR 
aSLQ*lENHPWLKGVDPSPATKNNIPLV 
*yK\NLSEEEHNSI IHRMALGDITDR 


1940 


15841 


A 


1953 


'2 


367 


IQCVRL*VHVCLCASSCVCICLYVHATL 
CVSTCL*CVAVCM/CLLCACATVSACLC 
/V/CGCVSTCVCVCPPVCTCDCVCMSAC 
VCT/ CLCVCVTVSTCVCERLCVPTCVCR 
CPCICSSLNGNEWMG 


1941 


15842 


A 


1954 


374 


2 


EAPWLLRAGRGALPPCWSKTPPSPLLFX 
PPLLPGTLVYQPWVPLPPWNRIiACaVTL 
ST*AEaGTSNPSWHLPPVSTAPQHPSTW 
QSPGAGTMGDPSLSPWTSHLIiPGCCHNP 
CHCPFPHHKPTRV 


1942 


13843 


A 


1955 


1 


411 


NTPSPELHPC*PGL\PPLPSPPQQPPTW 
APPRTSTQQKLPILCLLKPSAHTDAPCT 
QPGSTLPLHTPHTQC!ACX3TAYQIHTT*A 
APPPGAKPG*RCPPPPPSRQPQRMQTPA 
PGNPQPCPRLTTLTRVLVPAVPPLPI 


1943 


15844 


A 


1956 


33 


451 


rghntfgplqsppprfk*fsclsllrsw 
eeXrdyrcmpvhpawtmlaslvlnsXcd 
lpalasqspgitgvshptrphiis*flmp 
le*ghhhlllvllsqppnmpp*fqrfst 

GQLECSSFRTNLIVPSPSLNFHSDFPLT 
S 


1944 


15845 


A 


1957 


3 


399 


YMQVRTTMSDHSSHSISKINTDNTKSCK 
G*GSTESLEHDRWEYKFVQLLWETVWHY 
VR*TFTILYNPEI ILKR/ IFRHTYKNVC 


1945 


15846 


A 


1958 


47 


399 


AANPTLPAVFFFFFLETEPPFGPPGGRA 
GPQSRLTEPPPSGVKPIPR/ PPPPGEPE 
KWPD\QGGGGGGDPKPPGGPPP*ATGQI!r 
SAGKKKKKKI PRPGAHGaDPLLPGKAGG 
EEWLDPA 


1946 


15847 


A 


1959 


407 


238 


TQAFALI /KDGGVIGGICFRMFRTQGIR 
EIVF*AVPSNEQVKVSGSPRRQPQCTAH 
S 


1947 


15848 


A 


1960 


106 


1854 


NEAVKSKPNQTRNHKEKNNYVHLPDAGP 
SQPASAGGSSSACRRSTKATLHKRWVSS 
PAGPGVQPLSWQHPPMMA* GLQGNPS PQ 
AftAPPAEPGLVSGlSrCliT*EMAQAGA\GT 

ggslsaslgkrcphipvpce\pvl*glc 
vpgrsslgelggnpvtvqs fpgpgaesd 
pw*gdaeccsllqasfgdragwsir/rg 
svgrpagvprgkgrkptlsga/ sgpgsv 

LGGFCCPKPLSREAESGMVQE* *GRFWT 
GQERTPTGR*GCKVAGYLHSSATVGHSG 
AGAGGSPGKTSATLDVGQGLGGT/PSGP 
AWDRTNGYMEKAALPLCNGKVTGNTQCG 
AQPPASGPSPWVWQPLLLRQRLRQSTGL 

*llfpgvggglqpgeg\gqpfplspwgs 
ltgrknsnkarpggnegteg*garngas 
ltwvwtvpnggycpqagrdfwlvdsrfk 

PSIjGLRAMV*WSRKAPLSFEDGRMGTV* 
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ing to first 
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sequence 


Predict- 

location 
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ding to 
last amino 

residue of 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F^Plienylalanine, 
G=Glyc!ne, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lVl=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, V^ossible 
nucleotide insertion 














PQGQEGRASHRKGDSGLSPHSGKG*DP* 
G/PGaKERSISPPLPRGL*GP/VGERGR 

NGDWPW 


1948 


15849 


A 


1961 


76 


427 


QRQEPTPGHTTSGTRHPP6P6GRGPIQT 
HSHSHLFPWLETSAGSSPPNFLCSLPPY 
TPLLRLLPSPFLCPSQTTQRMPIPAPHP 
SPASPCKVP*DARGAPAGGAPSTL/RGH 
HPPHP 


1949 


15850 


A 


1962 


375 


131 


FLVGR IRLLYCLRE LFVYLLREQKAKRN 
SQWVHTLPISSRHVDAVPCSPSINRNRM 
GRDKKRPFPL\CVDA*LHSPSWIQP 


1950 


15851 


A 


1963 


89 


538 


GLILLQKWHPGAVSSMVHRG*LLHSGLF 
FSFPDKVSSWQAGVQWHDLSSLQAPAS 
GSKRFSS LGDHRHAPPHIiAHFCI F/M* T 
QDFAMLARLGTSFIYLFGCCC\FETGSH 
SVAQAEVQWHNHSSLQPHSPRLRRSFHL 
SIPGSWDHRNAP 


1951 


15852 


A 


1964 


402 


42 


CCHYPGTPGLQGRLSASHQAS/SGPQLP 
APLPLQVLPFIiALGIGVDDVFLLAHRFT 
EALPGTPLQVGPCP \QGSSEAAQLTG * E 
PLQSSDLGLLMNLGASCPHL+TGEIIVL 
CPK6YCLDQ 


1952 


15853 


A 


1965 


8 


14 


SSRAAACPPRPALSLPRPPVFTRRMGPQ 
GSGLPKLPVSAPS SRMGSFPAPS PLLLA 
SGDRHLCPDAAALPLIiAIETGCDS \ PPG 
H*PPPSPGGLHPCSPPS*PLIL 


1953 


15854 


A 


1966 


52) 


2 


YELYGriKRHEREVSNNQQVRYKEHSIP 
PDYVSSVPTDPTWGPERREEESSGHFMV 
DHTGTAAGGGGGMILaSPKLGATPLPPE 

*apa/pppppppppppgvgsghlnipli 
leelrvlqqrqihqmqmteqicrqvlll 
gslgqtvgapaspseihgtgtasstkpl 

LPLTV 


1954 


15855 


A 


1968 


111 


395 


ylmrgfllhhnmvkgkederktkgarli 
lffffffkkespspprveqrdp\nlqt* 
nplppkvkklshptppekwepgavpiicp 
nwflffrknggs 


1955 


15856 


A 


1969 


413 


101 


DVNRHFSKKNKHCQ*LVIKEMQIKTKRH 

yhfsptrmgkiknXkkenkfwqgygetg 
1 fthcwweckmvqp i kktvwqfflkvk* 
llvfgpackelgsyyslvtks 


1956 


15857 


A 


1970 


411 


' 


NLTPALCTKUHFKWVMDIKMKGKTIRLL 
ENNTRKYLHNIKIWKGLiqRTE\KP*TIK 
KKTDKFDYDTIKIfLSSSECNKTRQTIDWE 
NrL\YLQYITDK* I ISGIHKAFLKV/YK 
/KKTNNPVGKWARDLNGHLIKNDIHMAC 
1 


1957 


15858 


A 


1971 


3 


828 


gqachf i frsaqaggsrlriwrakqlgl 
wrvcalimpalehsfptrllqgnsvpgp 
sisldtscl/canlrcvweggrg*tgpglp 
slgkvlvegipsespgptashpccsprp 
dpdq/lscisapsatptvh*shltlpmg 
wpgpq*glhlsdqpgprkp*pgcsglgg 
gdaaprgmekpXpppqlp 


1958 


15859 


A 


1972 


398 


3 


gpgc yfs vrlqcp /rki pawkravcsat 
lixslqgpslsaphvlglaalavhlges 
rsalpevdvgp papgaglpvpalfdsll 
tcrtrdslffclk*allpqsrllsslsk 
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ing to first 

peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJnknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














SDTCSFDKIRRDRHADIPPLY 


1959 


15860 


A 


1973 


2 


433 


QDPTKMTKEELNALKSTGDGTLG/RASE 
VEVKKEIVANVGKREILHNTEKEQHTED 
TVKDCVDIEVFTAGENTEDQKSSEDTAP 
FLGTLAGATYEEQVQSQILESflSLPENT 
AQVESNEVMGAPDDRTRTPLEPSNCWSD 
LNGGS 


1960 


15861 


A 


1974 


373 


580 


T I FSRQVLR IQNALSDKPNVSTVYSNNG 
SELHGTSEASV/YHWKILISTEYHKTL 


1961 


15862 


A 


1975 


47 


413 


KWKHLRGSEHWRPOENQVHQRIAELRJCA 
GLWSQRR/PAEAAGGPRPKSHWDYLLEE 
MQWMATDFAQERWKVflSVKKMVRAVARQ 
LQDRTRHEAQARREEPSRLRQTSPVLPE 
KSSVPGLVLRR 


1962 


15863 


A 


1976 




3 


GYERSRGTTSGTHSSRDYEG I FGQATVK 
WAMLDQFRMLSPCSKEVMHQPFYLKRVE 
IMAQCEEWIJffilQQYSSDKRVGRIMSRH 
SAAIKHRTGQLREELLKLPCPEGLDPDS 
GDAPEVCHAATGAEETIMHDQV\QPSSS 
KVLPSDFQ 


1963 


13864 


A 


1977 


281 


567 


prsslqgqelsrpwgrkbnfpgppvppp 
qrmfydselfsgisdpsccf/rsaptad 
qvyqdqdmhewrkhcmdylvrkwrplg 
kgqggkissphhp 


1964 


15865 


A 


1978 


357 


2 


kqilgpphpqaqpgrwppphgpkdapl 
wsshaappghglgragpaagveaqatlr 
dsspstwtreglhvqaqrkhpshvhkg/ 
sgpgcledgefptslrlqaqlaeigrgn 
glsvrrq 


1965 


15866 


A 


1979 


29 


434 


vqaeaeglpgdtehpqpqlmsrslegqs 
dvtikhvacgdfftacltdhgiimtfgs 
gsngclghgsltdisqptiveallgyek 
aqvacgasnvlalaterelfalgrgdsg 
rtqartkesh/ylpqqvpmp 


1966 


15867 


A 


1980 


3 


2082 


ssegyiirgnmseneeeeisqqegsgdye 
veeipfqlepqspqfepqspefepqspr 
fepej9pgfesrspglvppspefaprspe 
sdsqspefesqspryepqspgyeprspg 
yephspgyesessryesqntelktqspe 
feaqsskfqegaemluirpeeksplnisv 
gvhpldsftqgfgeqptgdlpigppfem 

PTGftLLSTPQFEMLQNPLGLTGALRGPG 

RRGGRARGGQGPRPNICGICGKSFGRGS 
TLIQHQRIHTGEKPYKCEVCSKAFSQSS 
DLIKHQRTHTGERPYKCPRCGKAFADSS 
YLLRHQRTHSGQKPYKCPHCGKAFGDSS 
YLLRHQRTHSHERPYSCTECGKCYSQNS 
S LRSHQRVHTGQRPFSCGI CGKS FS QRS 
ALIPHftRSHREEKPFKCPECGKRFGQSS 
VLAIEARTHLPGRTYSCPDCGKTFNRSS 
TLIQHQRSHTGERPYRCAVCGKGFCRSS 
TLLQHHRVH\SGERPYKCDDCGKAFS\R 
ASDLIRHQRTH 


1967 


15868 


A 


1981 


2 


188 


LPETNFAELFLPYISQHHLIRKYKK/WP 
GAVAYACUPSTLGGQGGWITRSGDQDHP 
GLHEEWP 


1968 


15869 


A 


1982 


3 


424 


EGQAIVERMNLCLKQQLQKQKGENRYYR 
TPHRQUSTXALLTLNFLSLPKaRILSAAE 
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nucleotide 
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correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspon 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutaniinc, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QHLQKPAAKTEAEQLVWWRDLITESWEI 
GKI ITWGHGYAYV3PGL/NPSRHLKPYH 
ERMLP.KRFREDPEDPPSCSHVKTDAEED 
PN 


1969 


15870 


A 


1983 


3 


399 


YSKLSFKGTLTKFRRIYSSSFYKEFQGC 

I sdlchp/r/tltnclrgellkenlvwg 

AFGCHPHFPCYYHKSQERNLLQALRHPT 
TVAFGETGFNYSYKCTMPVPEHHKVFKR 
QLELAASLEK/PLVILCQQADE 


1970 


15871 


A 


1984 


1 


405 


RRHIGGGVRLYYIGGEVFAKSLSDSAIF 

AQTPNCNQRYGWHPGTVCKI PPGCNLKI 

FITOQE YADLLDQSWQGLEADY/QLTRM 
CTILMSLLKGWGAEYHRQTATRTPCWIE 
r.HLNGPLQRVDKVLTQMGYPSILM 


1971 


15872 


A 


1985 


27 


452 


QGREHAQGGQSPGAGHLGPTPEPQPEPQ 
PRPSSQAVPAGRWEPAQE/PTRHPHPRL 
SPASRPLGSPPAPLVRSSPGRCRDHEHT 
IWSSTVGTSVEPAPSLGRPQAPLEPGTV 
TSSVRLQQPHMHTPGKIMPDPSKRNGKF 
TFT 


1972 


15873 


-j_ 


1986 


414 


220 


GAKQEELLSP/GSGGCSELHSCHCTPAW 
ATRAKLRLTKQNKTKRESYRQGNQWWGR 
ILGELLGRL 


1973 


15874 . 


A 


1987 


52 


412 


TRERKLFFACDNVWKHIifCRYLRKNSFGE 














PKNFFFFKACQCMGKKNTKHDFHEQNF I 
EHaLLQQLNINFSTNYIEGHVF\HPGAP 
lESLLLMCH 


1974 


15875 


A 


1988 




143 


EKRRGLDKRTP/AQAAFEKMQEKRQMER 
ILKKASKIPPPFVCMWSVDS 


1975 


15876 


A 


1989 


3 


163 


TEFQPSEK/WGEDLGDNTWEYIFAIDLL 
CCHQKWICHPLFLVGWRAGAEVSGVF 


1976 


15877 


A 


1990 


1 


439 


DKTliaEDAIRNIJEfflYKiaaVNINEEAST 
NKSKTSTKLHVGNISPTCTNKELRAKFE 
EYGPG\IECDILKDYAFVHMEHAEDAEE 
AIKGLDNTEFQGKHMHEQLSTSRLRTAP 
GMGDQNGCYTGGKDGHWSKECSIDRSGR 
VADLTEQ 


1977 


15878 


A 


1991 


1 


145 


WAASKAMK/MGDWKTCHSFI INEKMNG 
KVWDLFPEADPVLLKRLRESR 


1978 


15879 


A 


1992 


2 


425 


NISTLKKTLESDCT \KLFSQGIGGEQAQ 
AKVDRCLSDLAAETNKFRDLLQEGLTEL 
NSTAIKPQVQPWINSFFSVSHNIVEEEF 
NDYEANDPWVQQLILNLEQQIMAEFKBSL 
SPVIYDSLTGLMTSLDAVELEKWLKST 
FN 


1979 


15880 


A 


1993 


3 


449 


VAGPAPGAGARPGLDLQFLQRFLQILKV 
LFPSWSSQNALMFLTLLCLTLL\LKSFD 
QFTCNLLYVSWRKDLTEHLHRLYFRGRA 
YYTLNVLRDDIDNPDQRISQDLERFCRQ 
LSSMASKLI rSPFTLVYYTYQCFLSTGL 
LGPVSIFGY 


1980 


15881 


A 


1995 


1 


410 


SSRRPFIALKTKSMEDLNPEDIDQLITI 
SGMVIRTYQLIPEMQEAFFQCQVCAHTT 
RVEMDRGRIAEPRVCGRCHTTHSMVLIH 
NRYLFSDKQMIQLPESPEDMPAGQTPHT 
VI LVAHNDLDDR/ VQPVDRVKDSGFF 


1981 


15882 


A 


1996 




154 


LFFFRLLVRYTKKVAQVSTPTLGKVSRN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 

amino acid 
residue of 

sequence 


Predict- 

otide 

location 

correspon 

last amino 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=po$$ible 
nucleotide insertion 














LGKVGKKSWKHPEAK\KRPCGKN 


1982 


15883 


A 


1997 


2 


475 


KKLRRQTRREAQKE \LPDKV\RWGLVTP 
S ET PVG ISNLSPFLGPRCL / QNPAEL KP 
PWRFPWAQRPKAHXERaNAGPKTPAEQR 
KVKKIKKLKEDISQGVHISVYRVRNLSN 
PAKKFKIEANAGQLYLTGVWLHKDWV 
VWEGGPNAPKTITRLMPLRI 


1983 


13884 


A 


1998 


3 


431 


QLRTRDRGWPSRRPEREKRTSQSARRPT 
CTESRWKSEEEVESDDEYLALPARLTQV 
SSLVSn.GSISTLVTLPTGDIK6Q/SPL 
EVSDSDGPASFPSSSSQSQLPPG2iALQG 
SGDPEGQNPCFLRSKraaHDSAGESSLG 
SSQA 


1984 


13885 


A 


1999 


1 


400 


aldlrglqilvgfpkrrvttcsyptalq 
s p i e yqrkkrstavmrtepdsayqas pr 
pysaqpadskkptkgycynptlpXrlei 
mtlegttq 


1985 


15886 


A 


2000 


1 


372 


QNIDLVISFPSSRLLQAGAELSVERVLE 
I IKQGWaLPKDRL/RGSCAPTLSAaGR 
SSGGQS PCMPGLCVCSFWVLTVSWLVCQ 
KFPELKFKYVEEEQPEEFFIPYVWSLVY 
NSAVGLYWNPQDI 


1986 


15887 


A 


2001 






GGIGRGGGAGGGVGAflGSASGGVGRRGA 
GGVIADSQAPGQQVEQGVGASQGWRE/G 
RGTSGGVGGSGGACGSV/GGSGGAGGGV 
GACGSTSDGVGRSRGTIGGLGGSGSAGG 
GVGACGGASGYVGIRGAGGG 


1987 


15888 


A 


2002 


2 


362 


WVTFISLLFLFSSAYSPGVFRRDAHNSE 
VAHRLKDLGEENDKALLLIAFAQYLQQC 
PFEDHVKLLNEVTEFAKTCVADESAEDC 
DK\SL\HTLFGDKLCTVATLRETYGEMA 
DCCAQQEPE 


1988 


15889 


A 


2003 


2 


358 


EANHGWFIRLKEGSQLYNIKVQGEaaSA 
DVEAAaSYPEDIAKITDEGGCAKQQIFN 
VDKXQTAFSWMKRPCRTLIAHEEKSVPG 
FKL/SKNRLALLLGANMGGFKIiKSVLI 
CHSENSRTF 


1989 


15890 


A 


2004 


190 


1 


DQTCFLS FTVKAVTFNGWVWWLTPVI PA 
LWDYRH\RPMIFFVFLVETGFHHVAQAG 

LKLLGS 


1990 


15891 


A 


2005 


1 


132 


GMCHHAQLIFVB7CSRDRVLHGCSQTPC 
LKQFSCLGLPKCWDYR 


1991 


15892 


A 


2006 


2 


134 


PMTFFTELEKTTLKFIWNQRRIQIAKAI 
LTNQK\NKARGITNI C 


1992 


15893 


A 


2007 


315 


127 


SEIAFFFCSFLKIILDTR/FSFFAEAGL 
KLLASNDLPSSSSQGAGITGVSYGTQPV 
CFEYHVG 


1993 


15894 


A 


2008 


3 


325 


RCSMLAVREM/QKATECRCHFLPTRIALI 
K/ND/GSNKCWK\HCWWECKMVQLLWKI 
VWQFLKKLNIELPFDPEIPLRDIYPKQL 
KTYVHTKTCGQMFIAVLFILNVPHLMNV 


1994 


15895 


A 


2009 


369 


3 


VGQAGLE FLTS GD P PASASQ SAVI TGMS 
HHTQP I FCI FGFA\GCPDWS QTPELKQS 
AHLSLPSSWDYRCMPPHLANFYFCRH/R 
VCCPGWSQTPGLK\YPPALVSQSVATTG 
TSPRAWQDTILPV 


1995 


15896 


A 


2010 


114 


287 


APLCLCLRHLRLLIKRLWLGTEAHTLX 
NPITTRGRGGQITWGREFETSLAHiyiVKP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 


Predict- 
otide 

corrcspon 

last amino 

relleof 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I^Leucine, M=Methionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valinc, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^=possible 
nucleotide insertion 


nucleotide 
location 
correspond 
ing to first 

sequence 
















1996 


15897 


A 


2011 


218 


346 


IMKFNLLKFYFIYLFSRQSL/SSVAQAG 
VHWHHLGSLQPLPPKF 


1997 


15898 


A 


2012 


310 




SSSSSNTHFGIPKYLINPDTCFLa/KVN 
NSSLTGLEYTETLKPGIK 


1998 


15899 


A 


2013 


3 


301 


SDCCASNQRDSGGVGPSEPASKHTLCV/ 
CNSLDLIE ■ 


1999 


13900 


A 


2014 


332 


3 


LRDPLETSAVCPFSDLQLHAGRTTALFKA 
VRQGHLSLQRLLLSF\VCLCPAPRC3C3AY 
RGRQAG\SLSa3GLHPVRASRLLCLPKQ 
AWAMAGAPPPAWPRPCSLISDCCASNQR 


2000 


15901 


A 


20,15 


2 


325 


LLLHffiVPPVNLFLRDRVSLCGPLCGPGC 
SQTPGLKQSSCLSLPKC/WDYR\RATAP 
GL 


2001 


15902 


A 


2016 


373 


3 


wj4hglqpgkpkudslskkrkkkkgkkln 
tfplrsrtrqacllspiilsnivlevlpn 
eirq/qkkeikgthtgreeltlslftdn 

MIIYVDIPKQSTKKNQGSYSVARPGAVA 
HACNPSTLGGLGG 


2002 


15903 


A 


2017 


343 


1 


efpfvsgsratgkssdiratkyiwrvle 
ylrawprgqrhlkss\insllgsyhpga 
frgdkwscchqkdetggrgedevllccp 
gvlgcsalcrsgvrtkfgirgrpwkere 

R 


2003 


15904 


A 


2018 


139 


2 


NSISTKNTKNWSGMVAHAFNPSTLRGRG 
GQI \RQQDFKTSLANMVKP 


2004 


15905 


A 


2019 


3 


322 


AHELVFFFGAYRKQPFHRDMKPKNLLCM 
GPKLGKIADFGLARELRSHPPYTDYGST 
KWR/YRGNPALLRPTPMAKFPyEGWLHI 
SGSSIWNIKTSDSNFIFFKESKG 


2005 


15906 


A 


2020 


205 


377 


NIVENIVFCWPGVCFLQTCTVCINPETS 
DE / WPGAVAHACNP S TLGGQDGQ I TRS G 
DRE 


2006 


15907 


A 


2021 


3 


324 


KKWGKR\LNRAPNGRRyPETRMALEEDQ 
CHICKELHIKTVRFHCTPIRMAKIHTTD 

npqswprcgtgtlihchhgcktv/r/pl 
wktvrq/fl/rklniplpcdpavlslci 

Y 


2007 


15908 


A 


2022 


2 


382 


RVSQDGLNLLTSRSTRLGLPK/CWDYRC 
EPPSP 


2008 


15909 


A 


2023 


1 


421 


RWNPGGRGCSELRLHYCTPTWVTERDSI 
S KIAIOIK/NNKRPRWNCR 


2009 


15910 


A 


2024 


339 


3 


SWDHRRVLI FVFLVQTGFCHVGHAGLEL 

ltsgsqsagiagvshra\gqkhqfrpeh 
rflkfgfvfrdrvslccpgwpqtprlkq 
ssrlslpkswgpataldpdssyleemvl 

S 


2010 


15911 


A 


2025 


2 


146 


NTFGRSRQEDHLGPGGQACSELRSHHCT 
PAWXVIEQDPVSKKKKPPKP 


2011 


15912 


A 


2026 


3 


380 


rlecsggisihcnlr/lpgfkefsclsl 
psrwdytrlp/pfvflvetgfhhlgqag 

LELLTSGDPKCWDYGC\DHCTWP 


2012 


15913 


A 


2027 


32 


296 


DYMNSLMYFHSLVLASTDEGFLPKTVST 
QSAGITGISHCARPWIPF/CFFFFKNRK 
TRF VAQAEGQGGNFGSLHPLP PG FRGF P 
CLSLT 


2013 


15914 


A 


2028 


2 


187 


FTLLPRLECSG\MILVHCSIMLPGLRWS 
SCLSLLSSWNYTCVAPFSIFYYFIFLTW 
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SEQID 
NO: of 
nucleotide 
. sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 

location 

correspon 

last amino 

residue of 
peptide 
jequence_ 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GI}'cine, H=Histidine, l=Isoleucine, 
K=L}'sine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutanilne, R=Arginine, S=Serine, 
T=Threonine,V=V aline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















2014 


15915 


A 


2029 


197 


3 


aaglsqagmqwgdfgsrehlpsrfqrff 
clkvpnnwdyrhgpprpvgifffsfXlv 
etgfhhpgq 


2015 


15916 


A 


2030 


-M 1 


362 


KFGINLKKKEKKGPQKNQNFFPTFLFFF 
FFETESHSITQAGVQWRDLSPLQPLPPG 
FKSLPSSWDYRCLPPCPANFCIF\IPDS 
VSLCWQGW 


2016 


15917 


A 


2031 


7 


131 


GTIMAWAPGEGGCSEPRSCHCTPAWVTX 
SETPSQKKKKGVLL 


2017 


15918 


A 


2032 


1 


343 


LECSEPRLCHCTPAWV/TGDSVSKKKKK 
KKKK 


2018 


15919 


A 


2033 


243 


14 


QKLAPWPPQSAGNMERVPPHPASMAFLT 
KIBKTALKFIQNHKRPQIAK\T1LSKNR 
VGS ITLFNSKIHYMTTVIKTI 


2019 


15920 


A 


2034 


238 


3 


FLALPKVLGI IGCVPPHPASMAFLTT I E 
DTALKFIQNHKRPQIA\KPILSKNRAES 
ITLFNSKIHYMPTVIKPLRVTST 


2020 


15921 


A 


2035 


345 


2 


LVKIQKEMNVFMTANPISIPQFMDQGVI 
S SCKS YYFRNKFLYVX lAflMDTD /S SNG 
SWQSEWKTFWK6FI I LDGI KNTDDSWEE 
VKISTLTGI KKKLIPTSTDDFKKFKTLV 
EDVT 


2021 


15922 


A 


2036 


1 


140 


GRCC\HEIiRSRHCTPAWATRAKLKKKKK 
KKREKKTQKGKNGLGFWAF 


2022 


15923 


A 


2037 


199 


348 


RSSNEGGRDCVFCFGDRVSLCLPGWSAV 
AQLWLTATSTSQ\VKSLSHLGL 


2023 


15924 


A 


2038 


3 


193 


NGUSTAPIKRLRLaUWIKSQDPSVCCIQB 
AHLTCRDTO/RCYLKG/WYKAF 


2024 


15925 


A 


2039 


116 


337 


SKLLKILPRLCWGWWQAPVIPATQEVEA 
EDHLNPG/RSRPAWATQRTPVS IKKKl I 
LRPMQK/CHQPSW 


2025 


15926 


A 


2040 


368 


252 


VFFFFALFYLFFFFFFFFFFFFFFFFFF 
QFLGFYFRFKF 


2026 


15927 


A 


2041 


204 


399 


VSSHKINGLTVCSTSPFFLSLLPPSEES 
ACFPFAFCHDCKFPEAS/SVNLPVKPVE 
L 


2027 


15928 


A 


2042 


103 


356 


WHFSPPQPLPPPyPLPNPPPPPTPPPP\ 

PPPPPPLSPPPPPSTPP/PPPPPPPPPP 
PLPP 


2028 


15929 


A 


2043 


3 


344 


LYKWNNKASLAAHLFAAWFTVYFKPTVE 
TYC/SGKKKIPFKILIiLrDNTPCyPRAL 
LE MC E E I S I VFAPATTTS S /LKPMDQGV 
IVTFKSNYLRNTFQAGGGEKKEKHERKK 

NIIS 


2029 


15930 


A 


2044 


2 


349 


PRVRKSPGPNGFTANFYQTFKELISILL 
KLFQKKKKIKKGENPPNS F YGAI I PRI P 
NPNMDLSKK/ETYGPVSSRNMEAKI FTK 
FIoAGHFKQSFGREIHHDQREFIPGIQGG 
FNIGN 


2030 


15931 


A 


2045 


280 


462 


CXFFLWVLVWdWLLFXVWLWLdFFG 
FWCCWFCXCGVWFFVFFWCVLCGVL 

LGCWC 


2031 


15932 


A 


2046 


3 


284 


PSPSFSLLLPPSFSLLFPPSF/SPPPPS 
FSLLLPPSASLLLPPCTSLIilHPPTSLQ 
LPPLPSFYLLLPTSISHHLPTHNLPTTS 
IQDPSTPCSIK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

nucleotide 
location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
F.=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2032 


15933 


A 


2047 


95 




GRVDAI LAHCKLR/ LPGFTLFS CLS LPS 
TWDC 


2033 


15934 


A 


2048 


211 


! 


EPTTMTGAKWGSSQTTTOYHMLQSLRIN 
VRVDFFFFFCTDR/SLSMLPRLVLNSWI 
PAIILPHPPQVIGLQ 


2034 


15935 


A 


2049 


35 


266 


EVRRPSTWQPPRIJISEEPLRL \ QP PRLG 
NIGFCHVGHAGLKLLTSSDLPTLASQSA 
GIT\GHSYRRASEEDKKESSMSS 


2035 


15936 


A 


2050 


281 


3 


QSNFKEJVLLSILLKRIYRSNIIPIK/VP 
MTFFTEI/EKVIPKFIRNLKRLRIAKVI 
LSKKNKTRRITLSDFKLYYY\IAVFVAA 
WYWKKNRLSDQWN 


2036 


15937 


A 


2051 


251 


12 


ILRFFLPKFLGFQKPPSLEKTHFFLPAF 
GlSrSLFWPPEKLGQKKVFFFFFFFFLRQG 
L/NSVSQAGVQWHNLGLLQSPLPRLR 


2037 


15938 


A 


2052 


2 


325 


ADHLRPGVQDQPQPGQNGKTPSPLKIEK 
\LAGCGGGHPRLREENa,NPGGHGCSEP 
RSRRCTPAW/VNDSKTIARKKKKKKGVE 
KNECGRKVMRV/LQGPKAKVKPWGENL 


2038 


15939 


A 


2053 


3 


166 


SLLLPRVECaiGaiSAHCSSfLHL\HNLHLP 
GSSDSPASASQVAEVRGSLEPRSSSLA 


2039 


15940 


A 


2054 


227 


3 


LNENIGRITGMSHHARLILLFCE/YRVS 
ITQAGV/QHDPGSLQPLPRVFKQFSHFS 
LQSSWDHRCRPIRIiAIFWVFC 


2040 


15941 


A 


2055 


301 


0 


QRKSHMFLTINQKLEMIKLSEEGMSRVE 
TGQKLDLMC/QVSQAVNAKEKFLEKIKG 
DTSVHTQMIRKQSS INVDMEKVGIVWIE 
DQAP/HNHIPLSHLLMRAR 


2041 


15942 


A 


2056 


1 


109 


RPLRRLRQENHLNRGSRGYSEPKLC/HL 
CTPAWAT 


2042 


15943 


A 


2057 


313 


250 


PQPPPPSPPPQPSPPPPSPPPPSPSSP/ 
PPP 


2043 


15944 


A 


2058 


134 


2 


EKESRSVA\RLKGSGAISA\HCNL\CLP 
GSSNSPPSASRVAGSSGA 


2044 


15945 


A 


2059 


120 


326 


NSLVADME/KVLWWIEDQTSHNIPLSQ 
S/LIQSKALTLFNSIKAKRSEEAAEKIF 
EASRDWFMRFKIKQK 


2045 


15946 


A 


2060 


102 


413 


ERTGFRHVGKS GLE FMTS GD PPTLAS QS 
VGIT/VHEPRTRPG 


2046 


15947 


A 


2061 


311 


1 


FFKKQRFCKSGCKTPGKPRGPKKLGANQ 
FSGPTPLKNGVGFS PGPKGGFFPPPPGG 
FPAGRPKWLTRILGKGGSLLRG/RYPKK 
GFLKPI LGDNS PQRAPKR6P 


2047 


15948 


A 


2062 


229 


380 


WYDLGSLQPMPLRFKQASRVWLPRSWDY 
RHPPLSRLN\IVFIVDTGILHVG 


2048 


15949 


A 


2063 


142 


383 


PQS CFSTHWQLLQKQEETAGAVS VCVCT 
S\VCVCVCVCVCVCAGAMCVCAGA/CFC 
vcvcaga/ CLCVCVGA/CLCVC 


2049 


15950 


A 


2064 


408 


200 


HLIQIKALTHFSSI3CAERGDKSTE/EKF 
EGSRGWFVRFKERGHLCNIJCVKHEAANA 
YAEAACSQLSRRSS 


2050 


15951 


A 


2065 


348 


3 


WVSPYSPCVCVCVCVCVCVCVCRDWVSP 
CCPGVCMCVCVCVCRDW/CFTM/CAQVC 
VCVCRDWVS PCCPGVCVCKDWVS PCCTS 
WSILKLLSSGNPPTTVSQSAWITGRSHS 
AWPARA 


2051 


15952 


A 


2066 


73 


286 


NLIRGLLESHILISMRYGYCKSYTLSMR 
DIPEALNKWKS /IPMFCSWSRRLIVSMA 
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SEQID 
NO: of 
Ducleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unlcnown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VLSKLINRFNAKDVQH 


2052 


15953 


A 


2067 


329 


0 


PSSTFLLPFPRPPPSPPPPPPPPPPSPP 
/PSESSPAPSSPHP 


2053 


15954 




2068 


221 


3 


ELKVILHCLRDFSLQSSIMKVLILKILI 
LSyCCVC/VCVCVCAVCSCVC/VCSCVC 
/VGSCVCLCVLCVCVCVLSC 


2054 


15955 


A 


2069 


1 


167 


GTRENPLNPGGGGCSEPR\S CHCTPAWA 
TKSETLSQKKKKKKKISNYKTPFKSYRI 


2055 


15956 


A 


2070 




354 


ARRCLGLPSSWDF/SVENHFHRVGQAGL 
ELATSGDPPTSASPECWDCRH\DHHTWP 
LL 


2056 


15957 


A 


2071 


20 


341 


CIVTVNTRGENINICWSPDGQTIAVGNK 
DDWTFIDAKTHRF /LKQNSSSS /SEVN 
EISWNNDNNMLYPDTGNGCINILS YPI T 
ESRAIYQRPIFHVHVNQDLPHGDVLS 


2057 


15958 


A 


2072 


353 


58 


LQLLTTSDPPASASHGAGIADGVWFTQ/ 
SLNGAQAGVQWRDLGSLQPHPPSRL/LL 
ASQSAEIAASJVRPPPRLGSEERLCLAAH 
RLGCEEPLCLAAQSGK 


2058 


15959 


A 


2073 


1 


338 


GSRLQRVCINYRLSFFPFLSQGWINFTW 
LFCLCVCFLRDRVSLCCPGWPSTSGFKR 
SSCPSLLSRWDYRHMPQHLASHTLFKKL 
/TILPR 


2059 


15960 


A 


2074 


1 


322 


GGGREAGEARGGGEGGC2GSGRRRRGRGG 
PRTGGAEGGRGAGETPGGGARPEREQGR 
GRHSERQGPT/RQTKRPKTKTKQLSQNK 
NT 


2060 


15961 


A 


2075 


450 


225 


TPVGRGCSELRS CNCTPAWVTD \ETLS Q 
KEKRGGVKIGWKKRRTHI ISLHLPGSHE 
KFNWLETVKNCVNHFCHLNT 


2061 


15962 


A 


2076 


2 


470 


tpqnkphpttkntqpqptqkk/ptqtqp 
/ttpkpktkknttpnpapputqknhttk 
pttqnphktkqknktkpnttqqqnkptn 
qnpktqtqqktttqtkppkkkkkktadt 
tspnpistkkikklaep 


2062 


15963 


A 


2077 


304 


1 


NSATPPCSPTA/KPHPTPPTPPPTSFIP 
TCQHSPPTKICPQARPRTPPYAHPPRCP 
RKIIPKGRRHPLAPPQAARRDLNyYHPI 
IWRGRVCVCVCVCVCARA 


2063 


15964 


A 


2078 


167 


^ 


TI LQTNSTWSNVLLWQGAVAQAC \NS ST 
LGGQGGRITRSGDRDHPGQHGETLSRA 


2064 


15965 


A 


2079 


3 


364 


HETRSRHSCAWCSAALWRAAVASRCPSS 
/IPVTPPQCLYWP/WKVPLQCPPDL 


2065 


15966 


A 


2080 


303 


2 


SKRGRPSGHECPFLGTSSSCRHVASCTI 
RTPRRLCKSQLDRCSPLKERHKFGLSRS 
VLSAMTQSGIY/WQPPPPEFKRCFSCLN 
LLSSWDYRHAPPRRARA 


2066 


15967 


A 


2081 




326 


AG I TGMS HRAWL FLYFUSTKFAFT \ YGLV 
IiNFFLHKIQEPSLG\SGSGP/LSCNS 


2067 


15968 


A 


2082 




343 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPSPPPPPPPPPPP/PP 
PSSPSSPPPPPHTPHP 


2068 


15969 


A 


2083 


330 




ACDRS IIS RIHKEL \EQLCKQK\SNNP I 
KKWPKDMIRYFSKQDVQTVKTMNKCSTS 
LIIIEMQIRS TMRYYLTTVRTPHPS 


2069 


15970 


A 


2084 


1 


340 


RVESHGTTHLAQLIFVLLVQTGFHHDGQ 
DGPDLL/NLVIRPPQPPKVL 
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SEQID 
NO: of 
nucleotide 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
niicleotitie 

correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 

dinl'tT"" 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PI]enylaIanine, 
G=Glycine, H=Histidine, T=lsoleucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serlne, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2070 


15971 


A 


2085 


2 


348 


ARALSLGANAAGNFKLRPLLRYHSQNPR 
ALKNYAKYTLPVLHRWHSESLETAFTEC 
FTEYFKPTVETYCSEKD/I/SYKALLLI 
EKALGHPRALMEMYKDIS I VFMPVMTTS 
ILQPMD 


2071 


15972 


A 


2086 


369 


2 


KGLP PS PANLFFLS Vy/MGFPMFTRL I L 
ISCPCYPPPSASPSAGIKGLNPPGWPPF 
SFFYQKFVRFFVLIVRNRTFGFVDYFS IL 
YLIYGCSNLYRFLLSALYFFVSVFVFVL 
TQGIEAMVPRA 


2072 


15973 


A 


2087 


314 


3 


LLESVGPTRNSRPFKGLIGRILLDPEFY 
PSLVSDDLPASASPNAGRILPFFFFETG 
SHS \TQAGVQWRHHGSLQ\LKRSS YLSL 
TSSWVYRHTPPHPANI CIFSRA 


2073 


15974 


A 


2088 


330 


16 


CPCFFLSALSVLVGWCFAFWCGCV/WW 
CVCFVWFVCVCFLGCWLCLCFV/VCL 
VGVCCWFFGGLCCVCC 


2074 


15975 


A 


2089 


1 


337 


GTRTFLPPSYKDPCEY/IWAHPDNPGSS 
SNCNMLNFSTSD/PPVHSSGNWKVLSSP 
NRPYYYSYTATPHTDPTPHLPSPNPSSP 
SPSYPLLSDSTICQTTPTTTPITSSHTL 
LTS 


2075 


15976 


A 


2090 


350 


84 


QRKEKKRTRKRATERRDESREEKaGRKE 
GENQEKENKEQQGRQRRSRDRT/EEEKE 
EAKRHEHKNPKKKKTKPPPQKKKKTEKK 
KTDNLSN 


2075 


15977 




2091 


1 


355 


SDPPTSASQSAGITYVSHRTWPLLEFSG 
TSIRLAGKPAGVLVEVTGK/SVCGGGVT 
KTHWNKCHTGYPKCCWSSQAGESSLQPP 
PPGFKRFSCIiSLPSSWDYRLLPP\QNFC 
IFSRDGL 


2077 


15978 


A 


2092 


27 


345 


ASI PCLKKRKKKKKKTGGKPPP \GGPKP 
KRGGGGGPPPTKRTLFGPPQGNQTPGGV 
S /GPPKGDPPFSPI PPCQRKSPPFEKGG 
KKGPPPWRGVKREKGHFLKNFQQK 


2078 


15979 


A 


2093 


3 


301 


HEHVAQAGLKLLGSSDLPTIASPKCWDC 
KR\DYCAWPHIFPISGPLYVFFPLPRMS 














RAQFQCHFFKGWPDP 


2079 


15980 


A 


2094 


76 


342 


FCFFCCVWFVPGFCLFLCFLFWFCFFVF 
FGFFGFWFFVFFLVCFLVFVCFFGCFFF 
LFCFF 


2080 


15981 


A 


2095 


115 


325 


MDERKKIRGGGRQGKECKIHCKKKLSPG 
I RS YP VEN/ F / VDTM YDYLQ PAYYKLND 
LTNADPCAVRYLLFDQN 


2081 


15982 


A 


2096 


80 


227 


S CLGN/ C IHLYSHS PTLSFTHTHTHTHT 
HRERERERERERE I CI CMSVYA 


2082 


15983 


A 


2097 


308 


1 


NSTVTMENSVNIHYRTRVFTEAQFTIAK 
SWNQPKCPSILEWIKKL\WIYIYVCVYI 
CVCVC I CVCVCVCVCVCVC lYIYMMEYY 
S AI KRNE LAAFAVTWTRA 


2083 


15984 


A 


2098 


2 


361 


A35ACLGLPSSWDF/SVETRFHRVGQAGL 
EIjATSRDPPTSASPECRDCRH\DHHTWP 
LL 


2084 


15985 


A 


2099 


1 


221 


LLWRLRHENHLNLGGRGCSEPRLHHCTP 
SWMTR/GKTPSQKKKTKQPMEWQNLCLK 
KVFGNTGVRE I FNG I KLS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

sequence 


Predict- 

otide 

location 

correspon 

last fiiiiino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Tlireonine, V=Val)ne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \>=possible 
nucleotide insertion 


2085 


15986 


A 


2100 


186 


2 


ERVYVCVCVCVCVCVYQCYLPQIHSL/V 
ASSLKANARVCVCVCVCVCVCVCVCVCV 

SLWHPLTA 


2086 


15987 


A 


2101 


305 


2 


HMILLSQSLIQSKALTLFNSMKA/E/RG 
EEAAEEKLEASRGWLMRFKE\HSHLCN\ 
TKVQQEVASADTEAAPSYTEDHSKITDE 
GGYTKQQIFNVDKiCASYNNMS 


2087 15988 


A 


2102 


6 


166 


EQTALEILARAIRQEKEIKGIQIRKEEV 
KLCFFVND\MVLYLENSKDFLKVDAA 


2088 15989 


A 


2103 


53 


176 


EIKNNRPGMVAHAY/NPSTLGdRGRQIS 
WGQEFETSLVMMVK 


2089 


15990 


A 


2104 


3 


267 


FRHVGQGGLKVLTSGDPPALVSQSAGIT 
GVSHCAQPIVGDFNTPLSIL/D/HSTRQ 
KINKDIQDLNSALDEADUiDIYRTLHPK 
/STEYTFF 


2090 


15991 


A 


2105 


268 


3 


PPKEHGSSPATEQSWMENDFDELREEGF 
RRSNYSE/LREDIQTKGKEVENFEKNLE 
ECITRITNTEKCLKELMELKIKAHELRP 
ECRSLR 


2091 


15992 


A 


2106 


3 


170 


GFHHVDQAGIiELLTPQVIHP\LGLPKCW 
DYRREPPCLASPHEHQIAISQKEHRSAK 


2092 


15993 


A 


2107 




398 


SARGPDGFTAEFYQTFKEELVQILLKQF 
QRIKGEILL/KNHYVKPSITL1PK\PG\ 
RDITKKLLTRSFCVSLCPVLSPLQSLQS 
RPSSLSMISLHSVCFSVLSASAHPHVHL 
CPTCPTLVLSGSHCVCCVSLFFF 


2093 


15994 


A 


2108 


3 


370 


HENWNNKGWGDSTIYSMKYLYFKPMLRP 
Y/C/SQKKIPFKILLLFDNAPGHPRVLM 
E I MYKDME VF\MPWTTF/ ILQPMDQRV 
ILTFKSYYLRNTFHKTIAAINSDYSDGS 
GQSQLKTFWKGFIVL 


2094 


15995 


A 


2109 


1 


213 


HFPVENESAPG/PKAAGDLLTLLLGGNA 
AGDFKLKAIiLVYPSENPCFLKGSFKPNL 
PLVWCSHKKAWVQLG 


2095 


15996 


A 


2110 


391 


3 


KKKKKNPHRKKIKKRWKKI FNAERNKKR 
AGVIIHII.DRLDFKQQTIRRDKR\GYYT 
MINGTIQQEVITIIiNIHAaNTKAVRYIK 
QVIiLKLKTELGPNTI ITQDTNTLLLSTL 
NRSSRQKNQQTLDLICNI 


2096 


15997 


A 


2111 


188 


365 


FQNTIHICVCVCVCVCVCVCL/CVCFCV 
FFFCLLCGGFMCGCWCEFCILFCFYGVG 
FFFL 


2097 


15998 


A 


2112 


3 


340 


RMESALDRLKQFTTWGDTGDFHAVDEY 
KPHDATTNPSLILAVAQMPAYQELEEEA 
lAYGRKLGGS QEDQ I INAI DKLSVLLGA 
E I LKMI TGRVSTE VDARLS FD / SDAMVA 
TA 


2098 


15999 


A 


2113 


11 


305 


FLFTDFCLFMTHILGHKINYITN/CKRN 
VI IT/S YFS PHNRIKLKIS IRKISRKSS 
NTWKLHNELLHYPQIKDEVSREIRKYLE 
LN INENTNF/QNLWD IHK 


2099 


15000 


A 


2114 


3 


387 


qtnh/niplsqnliqskaita/ncmkae 
rseeaagkfeasrgwfmrfkkssyMkv 
qgeaasagveavaiypdlaelide/gcy 
tt/qifsvfqtaffwkkkpsrtfm/ree 

KL I PGLKAS KDS S S LLLRVHAAGD 


2100 


16001 


A 


2115 


152 


393 


vycpicwlvfffcfcsvlilfvmfv/ cl 
cfsffcflgfwvfssfylficvfffiv 



672 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=ValinE, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














LGCLFLCWFDLFLFVLFVLCLFFVS 


2101 


16002 


A 


2116 


2 


373 


araclglpsswdf/svenrfhrvgqagg 
elatsgdpptsaspecwdcrh\dhhtwp 

LL 


2102 


16003 


A 


2117 


3 


449 


HEFDHAMLQAHRAHHLAIDAYHEFEETY 
IPKDQKYSFLHDSQTSFCFSDSIPTPFN 
MEETQQKSNLELLRISLLLIESWLEPVR 
VLRSMFANNLVYDTSDSDDYHFLKDLEE 
GIQTLMGRAEKRHCRTVQNLKQTYRR/F 
DTNS/HHHDALL 


2103 


16004 


A 


2118 


146 


15 


FFFFYFFFLFFFFFIPFLFFFFFLFFFF 
FFYFIFILFIYSLLVF 


2104 


16005 


A 


2119 


406 


3 


LFSVNETGFYSKMLSRTFTATEETS IPG 
FKVSKDKLTL\LGftNAi«3DFia:.KPML r Y 
HS/ENPRAIiKNYAKFTPPVLYKWKNKRL 
MTTHLPTARYTEYFKPTVETY/ 1 LLLID 
NAPTHQEALMEiyiYKEHWFiyiPAlITTS I 


2105 


16006 


A 


2120 


109 


396 


YYFYFYLVLFYFFIFLFFYFFFFFILLL 
FYFFFSFIFFFFLFVYFIFLCYFFLFFF 














SYYLIFFLSYII 


2106 


16007 


A 


2121 


15 


413 


IVLAENTNFWLSFLFPVALGILIVLKGV 

KYI FWPLEYCQ:? LKMFVS YS FHY\ FFLG 

SLLFLKYGFHMYLILLL/CIFIIIMCFF 
IKYSF?FCCLYHFFFSFYLFFLYFLIF\ 
CYLVILFFSFLLFLFLSSYCFFF 


2107 


16008 




2122 


1306 


429 


SSSSSSSSHVLRIIKDEDFKILEQRQW 
LSEKEAQMiCKEYENEDYFNKLIENMTS 
QPSLALVLLRDNGLQYWKQLLQPRTVEE 
AIEYFPESLCaQFAMDSLPVHQLYGSDS 
LETAEREIQHFFPLQSTLGLIKPHATSE 
/HKRGPSMVMILTKWNAUAEWRRIMGPT 
DPEEAKLLS PDS IHAQFGISKLKNI VHG 
ASNAYEAKEWNRLFEDPEEH 


2108 


16009 


A 


2123 


3 


206 


LRRLRQEKHLCTWRGGQCSEPRSQHCAaA 
W/VSNSETPQKKKKKECEKKKKKKNLPNS 
ALKKTYSQRGKLF 


2109 


16010 


A 


2124 


23 


401 


lASGRPFPPFFFFFFFWPPPPGGYPHFS 
FLKKKKKRGGGGGKSLLPPGKGPNPPK\ 
WGFPLFPPPLFPQKTPPPPFFFYKTPP I 
PPQGPRGGPLKFTPPGGGGGNPPTILLD 
KKRGVLQPPPFFWTII 


2110 


16011 


A 


2125 


3 


439 


MFDVSLLTFTHSLFHFSPQFHRKCELST 
LCDGGELRDHILLPTSICPITR/DKCSC 
PGEGC 


2111 


16012 


A 


2126 


1 


213 


HFPVENESAPG/FKAAGDLLTLLLGGNA 
AGDFKLKALLVYPSENPCFLKGS FKPNL 
PLVWCSHKKAWVQLG 


2112 


16013 


A 


2127 


104 


419 


NS F FFEEELYNPFGGLGKKTFFGGGEE F 
GHTPPENEALGGKHKFFTGEGGQTFSNN 
GEEKSVSFWISIEKILHRALL/AHALCK 
NCWELNFGQKEE PFFPPPEE F 


2113 


16014 


A 


2128 


399 


140 


PPPPKNFFFPPKGKFFG/RG/VGPKFPP 
PKKRVFSQKPPRGFFYPPLKKKNWPFPP 
PGNFGPPRGFFKRPPPFFFFFFFFFFFF 
FFFFF 


2114 


16015 


A 


2129 


10 


457 


KTSWTWCPVPWPATQKAE/AGGSPEPG 
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SEQTD 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 

location 

correspond 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PlienyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=TIireonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, V^possible 
nucleotide insertion 


2115 


16016 


A 


213U 


409 


0 


PPPPSPPSPPPPPPS/PSPSPPPPPPSP 
PSSPPPSPPPPSPPPP/PPPPPPSPPPS 
PPPSPSPPPPPSSPPPSPPPSPPPSSSS 
SSPPPL 


2116 


16017 


A 


2131 


317 


177 


FFFFFFFFFFFFFFFFFFFFLRLQIFFF 
FliPILlFLFLFFLFFFLFF 


2117 


16018 


A 


2132 


3 


542 


EPWSVAQA6VQWRDLSPLQPLPPRFERF 
SRLSLPSSWDYRRLPAHPANF\QFLVET 
VFRHVGQAGLELLTSGHLPALTSQSAG I 

TGMSRTS\RPGFLFKV 


2118 


16019 


A 


2133 


146 




LKSVTS lAKTWIQPK/ CPTNSEVDKCPS 

T VKW r KKMWY r / YFTMEYYAAI K 


2119 


16020 




2134 






GRTOAI LAHCKLR/ LPGFTLFS CLSLP S 
SWDC 


2120 


16021 


A 


2135 


2 


3H0 


THTHHTH/THTHTHTHTHTSLS 


2121 


16022 


A 


2136 




337 


KRNNKAWMTVHLFTAWFPEYFRPTVETY 
CSEKKIPFKIL/LLVDNAPGQPRVLVEM 
HKEMNWFRPANTASlLQPMNQQarSTF 
NSYYLRNTPHKAIVAIDSNSSDGFGQNK 


2122 


16023 


A 


2137 


2 


356 


PVSSSQVRASVYLKKKKKKGPPPEGITN 
TAGEKPPQSFRGKG\PPFPLISPKKEPV 
I \ SFLKNFGPCT IGKKKPPHPPAKNGG P 
LQ/RPPTQGGGRGKKKGKNNQRPLSGLG 
GNRSPKPPF 


2123 


16024 


A 


213S 


2 


357 


PLGSSDPPPSASPVGEATG/RVFFFFFL 
VEWSHYVAQNGLELLDTSNPPAVASQS 
VRITCVSHHTWILSPLYKIIQVCVSK/S 
PELEQSEDKSLKK 


2124 


16025 


A 


2139 


379 


16 


LLQVRCFVSTVMRGSSCQKTIQVYYVQE 
AIPPSFLLSPFLM/PYTKINSRWIKDSN 
VKPKT1K\TLEENLGMPLNI FFTYQLLW 
LYIiHPESQLErcaiSFRALQEGNLIIFFI ' 
GRVGRPGTTGL 


2125 


16026 


A 


2140 


73 


411 


NYLLMNLFFFFFLERGLTFAPRAGGWGG 
NLTSWNIiGPPGPNKPPPFPPKRP\GTPK 
PTLKEGLFGFFKTTGFPPSAQKGPELPG 
LRSPPGLAPPRGGNiaSGNPPPGPLKTFH- 
G 


2126 


16027 


A 


2141 


46 


421 


AGVSWRDHSSLQPCLTSEAEAVXSHLSF . 
LS SRDYGPMPP PP KKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKQ ' 


2127 


16028 


A 


2142 


26 


479 


LAYLLPSPVMYKPQNWRTILQSISWKGL 
I CRI YKETLQLNHKKTNNPI FFKWVKKS 
KHFTRGAI QVANK\HI KMKLNI ISVYGW 
AQKNTMRYHHTPIKMKKMGCALWLTPVM 
CQEAGYILKMYP6WTPYPGRWGRPTAP 
ALPPRGTFPRD 


2128 


16029 


A 


2143 


9 


166 


QNRLLARLTKKKRGKNQAHP / IKNDKGD 
ITADPTEIQASIREYHKH . 


2129 


16030 


A 


2144 


1 


328 


LEKESRPPPPPPPPPPPPPSPP/PHPGS 
LPKPLGLQPLPRPEVSFLTEYLPKLKAC 
EGGGVE I AARSFPRI YLMGMCSRQDR IQ 
KDlDWIQKSEAEDCLFAGVjKAILKR 


2130 


16031 


A 


2145 




440 


KTFLRSLWQLVE/CYISSGLIDDHRRPM 
ALTPRHPRSGITAPRPRPQPPGRVGIPE 
PTALSPSPGPPPPPCSTPGRCQVPSLER 
RRKEGREPPSVGRGCGGHGISPSSDIFF 
HLNFCLAHPPDLF VFVCS ISWQLYIFHE 
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SEQED 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 


Predict- 

otide 

location 

correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=]VIethionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlcnown, *=Stop codon,/=possible 
nucleotide' deletion, V^possible 
nucleotide insertion 














KKKKKGGR 


2131 


16032 


A 


2146 


23 


422 


lASGRPFFFFWGAPGGEEGKKNPFWGPR 
GGKKKIRGAPTPRGKGNPPPKPWGKKGG 
APPKWLGLFGKKGAQRGPPGWFKTPGKK 
KSPPPGPKKGKK3SFRGRP\GPPPQIFFKP 
FFSTGEKNPLGKKTPNWRLLPW 


2132 
2133 


16033 
16034 


A 
A 


2147 
2148 


3 

^3 


372 
\ 


KKNTPFFLTPRGPPPPPRGPLSPPPKRA 

TPPPPPFKKKKPGFPPKKKNFFPPPGGG 

PPPP/LPGEKKGGYF/WW 

FNCRWFETKSHSVPQAGEQWHSFiSPLQP 

LPPGFK\EFSCLSL 


2134 


16035 


A 


2149 


2 


361 


FFFLVET6FHQVGQGGLEHLTSDDPPTF 
GLP\GGWDYR 














KKSFFLVSPAHVQWGDPFNPPIPGSNKF 
PFSTPPKTGVIRGPPPARKIFVFFIKTG 
\FPQLGRGVLKSLPQ/CDSPPPAPPKGG 
VSGGNPPAPPFFFFFFEMESHSVSQVGV 
QWRNLGSLQPPSPGFKQF 


2136 


16037 


A 


2151 


23 


421 


lASGRPFFFFFFFFFFFFLGGRGVFFYP 
PGGGGGPNFFKKTPFPPGKKKPFPPPSP 
KGGFLTPPPPPPFFFFFLKKKGVINGGG 
GGVKISPPGGTPPFLPQKGG/NKK(3GPP 
PPRKKKIFLFFFPGMKQRPPP 


2137 


16038 


A 


2152 


3 


366 


VIWAHCIFHLLGYTDRPCLKKKKKGFPG 
GEKMGLKNFFPPLGKKPPPPQI PNF1SSV 
IEKPPRGFYLGGAAPPKKFFLLKP\GPP 
FFKEPPPPKKKGGASPPLKKKPFKGKKS 
PPFFFSDPTS 


2138 


16039 


A 


2153 


423 


253 


FFWGGGAPIFPPPKKGFFPKNPPGVFFS 
PPKKKKIFFFP/HPVNFGPPKDFFKRPP 
P 


2139 


16040 


A 


2154 


287 




KEMNKKKNPKKKKVSKKISTAFYNKTLN 
NINFF\FAFFFFFFFFFFIFFFFFFFFF 

QLMQHSAPTRP 


2140 


16041 




2155 


433 


2 


RRVLFIFPPL/HPFSPPPLPFSPFPPPP 
PMKFFSPPPTFFYFYKSPPHPP/HPPQV 
VFSPPTPPMPFITPPPPPHTSPSFPSFF 
FFFPFFFFFFFPSFFFFFFFFFFFFFFF 
FFFFFFFFFFFFHIISIKGEIGRSSWR 
VMSRTRG 


2141 


16042 


A 


2156 


69 


606 


LWWPPLSRHAAYRQWPGPTAPRGLGHKV 
KGPGAS PA\ PCGAAAGSTAQG \ GGGAAC 
LPGPAAGAVTWPAGPGGGRATGPVLQR 
PAGAGQPTQQGQHDHAGRVLWQHGIGRP 
GAHRPGPCAP/DSAPRAPGGRCG 


2142 


16043 


A 


2157 


449 


0 


SSPGSSSSCSPPPPGGGVGFFL/YKPPQ 
KKPPPPPVGGPGFFPQKKLKSFFPLPPP 
FFLGGGGPPPPPPPKIKSFYPPPKKVSF 
PPPPKKAFLPPPPPPLPPPPPSPPQPPP 
LF 


2143 


16044 


A 


2158 


241 


5 


KKFSFFPPGGSQGGFFGSLQNPPPGFTP 
FFCLNLGKKWGQRGPPPRPKWF/SFFFF 
FFFFLVETGFHH ISQDGLDLLTS 


2144 


16045 


A 


2159 


434 


2 


lgsasql6csgtodpleeavcpfsdlql 
cmrttalfkavrqghlslqrlllsfXv 
clcpvprsgayrgrqaslscgglhpvqa 
srllclpkqawamagap/ppaslqpcs l 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
e(h 
od 


SEQID 
NO: in 
USSN 
03/515,1 
25 


Predicted 
beginning 
nucleotide 

correspond 
Ing to first 

peptide 
sequence 


Predict- 
ed end 
nude- 
location 
correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iVI=]VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=lfnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ISDCCASNQRDSVGVGPSEPRAGYNLLV 
RCFLSP 


2145 


16045 


A 


2160 


448 


84 


FLGGF?FFFFWFPPPSPPQKEA1,\QKKI 
FFVFFSPESNQKNFFFSFSERGGPPNFL 
FKGGGR\FSPWFLFFFFFFWFFFLWFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLGKCPLKYT 


2146 


16047 


A 


2161 




156 


RGCSEPRLRHCTPAWVT\KETVSQKKKK 
KFLTLC3GKTFKNFFFFWRAPGTFP 


2147 


16048 


A 


2162 


412 


37 


FFHSPPPPPPPAC3GAVFPPKKKKKSPPP 
PTPLLL/RGGGGPPPPPPPKRGPPPQKP 
KRGFFSPPKKKKKFFPGPGGPPGPPQKT 
PPPPPPPFFFFFFFFFFFFFFFFSLFFF 
FLRTLWRKGRGRGI 


2148 


16049 


A 


2163 


5 


370 


QSSCVQWLVPWPTIQEfiEAGGFLEPRS 
ERLQ/CBHCTPAWA 


2149 


16050 


A 


2164 


437 


36 


FFFFFSEAESCSVAQAGVQWHaESLPQP 
LPPGXSSDSdSLLSSYMCLSPRTRGS 


2150 


16051 


A 


2165 


3 


623 


RQGFTLVAWAGVQWYDLGSLQPPPTGFK 
RFSCLSLPSSWNYRHaPPCPAWFVFLEE 
TGFUIVGQAGLELPTSGDLPASASQSAG 
I TGMSHCA/RPKVCSYHLFF 


2151 


16052 


A 


2166 


337 


0 


PPPPPPSPPPPSy PSPPPPPPSPPS \PS 
PPP 


2152 


16053 


A 


2167 


350 


156 


ABATSKIRCQKCYIMIAGHSGTRLQFQL 
LQRLRQENHUSPRA/RGCKEPRSHHRTW 
PTWYISKSFLA 


2153 


16054 


A 


2168 


1 


337 


DERSLSQRSRSWSOTGYYSDLSTAHHSG 
BOQCKRTKKICK/IKKKiaCKKKRGPFKKKG 
PLKTRKKPRGGIiLKAHPFWGGPPPGFFL 
T6EGAPPPVFFKKKKKKPPLGGKGGFLW 

G 


2154 


16055 


A 


2169 


2 


606 


RVLRAVAAHEEPDKEGKEKPHAGVS PRG 
VKEQRRSSSGGSQEKRGRPSQEPPIiAPP 
HRRRSQPP\HPGPLPPTNAAPTVPGPVE 
PLLLPPPPPPSLAPAGPAVAAPLPAPST 
SALFTFS PLTVSAAGPKHKGHKERHKHH 
HH\PAPMVIPAPAEPI 


2155 


16056 


A 


2170 


311 


2 


GLQPLGLGSVKQCMDLACVPETVCVCVC 
VCVCARflHTPVCTQGCVPESTQCVCMHV 
C/VCVWACTCVPVCMHTCVGVGASVCMQ 
RNELGRWAWENDAIRQQRC 


2156 


16057 


A 


2171 


78 


337 


NTPNSIYKAMSLKGPITGTFLPNYPGHK 
VCVCDTYLCVYQHTHTHTHTHTHTQAFP 
HI/HI YTHT 


2157 


16058 


A 


2172 


2 


278 


KNRLKPGGGGCSKLESCHCTIAWAT\SE 
TPSQKKKEKRKRKGEKRKRKKKIVYTKI 
E KGKAKMTGES F I HF CKKS SS VPQVALS 
AEYRCSST 


2158 


16059 


A 


2173 


2 


365 


FYHVGQAGLKLLTSSDPPALGLPKCWDY 
RRE\DCAQPNVKS 


2159 


16060 


A 


2174 


195 


389 


FRVFSKLKYYYDFFRGRISLSCPGWSTT 
PGFKRLS CLSLPSSWDYRRPPPCPAN\ F 
CIFSRDGVSP 


2160 


16061 


A 


2175 


23 


374 


lASGRPLIFFFFFFFFFFLFCRGGGPPF 
NPRGKERGGFLIKGPPPPGGKKNFWAPP 
PGGGEIKRTPPPPGVFFFFLKKKGFSFG 
GGGGKK/PPPPGEPPPPPPQKGGKKKKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, 0=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TRGPLF 


2161 


16062 


A 


2176 


2 


158 


FFFLKQSLSVTQAGVQM\PVDSLQPLPP 
RFKRFSCLSLPSSWDYRCVPQCPAM 


2162 


16063 


A 


2177 


340 


120 


PRFHFLAS PS EMS QMT VKAKTTVPAS E C 
AYPKIEPFFPF\DPRGSES\FDKLGVHH 
HPLFPLSGEPLMILHKNTH 


2163 


16064 


A 


2178 


3 


256 


YLKKDLNVNQKTLTLLEyEKQGK/HLHD 
MRLCKEFLNKTSKAQYILKKISQY/LIK 
VQKFNAVKDPVKEMKRQASD/WENIFT 


2164 


16065 


A 


2179 


95 


1 


GRVDAlLAHCKLR/LPSFTLFSCLSLPS 
SWDC 


2165 


16066 


A 


2180 


1 


143 


ARGERERERERERERERERERERERERE 
RERVFV/RERAALSRERGDKCERSGLF 


2166 


16067 


A 


2181 


1 


304 


ARGQGHLSLQRIiLLSFXVCLCPGPTGGA 
YRGRQASLSCGGLHPVHASWLLCLPNQA 
WAIAGAPPAftLLPPCSLISDCCASNQRD 
SVGVGPSKPCVOraLLVW 


2167 


16068 


A 


2182 


3 


163 


FQRRSTESCGWDKDARSQSRSPPRQQAH 
GHHS /HTHTHTHAHTHTHAHTHAWTRP 


2168 


16069 


A 


2183 


3 


205 


NFKLFCKBFFFLRQSL/DSVSQAGVQRH 
KliGSLQPLPAGFQLFLQPPPPGSKDTHP 
GVQRHDHNSLQP 


2169 


16070 


A 


2184 


228 





WFFRIYDSFFLVFLFVFFFFFFFFFFFF 
FFCVFTPFPVYFIQYWYI 


2170 


16071 


A 


2185 


287 


3 


SLFLAI PQWEFQRKI YIiRKQKI VIiLACQ 
PSKGLS IVQWRGQLNSPVLKSQDPTPX 
PRFKRFLCINFLSRWSYEHVPPRPDS FV 
FLVETGFLHLV 


2171 


16072 


A 


2186 


252 


105 


VSISCHPQ\PCLIYTCY/HIDGYSNIHL 


2172 


16073 


A 


2187 


267 


459 


KHVPGSSYTWL/ IFFFRGNFLKKGAHFV 
PQGVLKGGNLSSLYPPPPRLKQYSCLTL 
LRIGNYRHA 


2173 


16074 


A 


2188 


170 


15 


RSVMKDLNSHFSKENIQMANKRTKECSA 
LWFREIK/TTMRCHLTPIRMATIKK 


2174 


16075 


A 


2189 




504 


DVTISTCHASAKVGTRLVFDHYGKI IQK 
TPYPHPRGMTVSVKQLFSTLPVRHKEFQ 
RNIKKGLG\RRSCFCFDF 


2175 


16076 


A 


2190 


1 


565 


FFETESLSPRLECSGAISAHCNLHLPGS 
SDS PASASRVAGTTGACHHTWV/IFFVF 
PVETRFHHVSQDGLDFL/NLVIRPPRPP 

KVIG 


2176 


16077 


A 


2191 


142 


382 


NTPPLLFFFVIRDRVSLCCPRWSGVAQF 
WLSAT YASRV\KRFS CLSLP / SNWDYRC 
VP 


2177 


16078 


A 


2192 


138 


365 


KHQYHHCC/LLKKKKKKKKKKKKKKKKK 
KKKK I KWKPCHGGP FKKNFW / EGQPRNWA 
GGV 


2178 


16079 


A 


2193 


385 


21 


rgeifffktrrkkfspqggrggvfppsp 
pknfffpqggkffgg/eggpkvpppkkg 
gfpkkpqggfkvppkkkkki isppgvig 
gppgnflkgappffffffffffffffff 
ffflvravkls 


2179 


16080 


A 


2194 


415 


56 


PPP PTTAPVFS PPPPPRPFFS PPPP VFF 
FRSFPPAPPPPPLFFPPPSPP\PSFPPP 
PPPTRPCPPPPPTIPPPPPPFSPPPPPF 
FFFFFFFFFSLLPFFFFFFFFFFFFSFL 
FLFFPSRIiW 
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SEQID 
NO: of 
nucleotide 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nuCeotide 

correspond 

residue of 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G,ycine, H=Histidine, I=Iso,eucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possib,e 
nucleotide insertion 


2180 


16081 


A 


2195 


35 


369 


MKRPSPPPPTPPPPPPPPHPLLPPFPEK 
KTGPP/RGFFKGGERGPPPPKKKKDPPP 
QKQKKKKKWGGGGGGGGQRKKKHPQKNP 
PPPPGRGGGKNFLWGRPRGPPPQGGGGK 
K 


2181 


16082 


A 


2196 


2 


266 


SKPRSCHCTPAWATQ\ETPSYKKKKKKK 
KGLPLGGPGEKTrFFSLKPGQKAQPKNK 
IPFKKKIPFFSTPKGKKKKNFQGGPKKK 
KKEI 


2182 


16083 


A 


2197 


1 


277 


ARGERERERERERERERERERERERERE 
RERERRHP/RPRERE 


2183 


16084 


A 


2198 


1 


285 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERGGAPP 
PRLYRERVLKKQRGRFLPPVRERKK/RS 
PVQTLPGY 


2184 


16085 


A 


2199 


346 


1 


VPRHGLPCFCFPHQLQPPKRKKFNNRKR 
ALTSHTQFFVFBTESCSVA/RECSGAIS 
AHCHL\CIMRSSNS PASGGAGS SELR FR 
HCTPVGQQTEIWSPKNKPIKKEEGCPYC 
QFLSC 


2185 


16086 


A 


2200 


3 


389 


HEGMILAHCSLNLPGSGD/ASHVARTTC 
VHHHVQLILFIFVETK/FSHYAVQAG/L 

ELLSSSDP/PTSQSSRITGMSHH 


2,86 


16087 


A 


220, 




HECHCTPAWAT \ SETLSQKKKKKKKKKK 
I FPGEKTWGGWNKKKFT ,FI,NRKFFI,GKG 
LFKLKRGPLKLF 


2187 


16088 


A 


2202 


69 


379 


KKKRVFFWGPGGSKPPT/ SGNPPPWPPK 
GGGLRGLPPPPGPRGVIFKNFGKRGPPP 
PPGLKSWGPRDFFGLRLQRGGI SGLNHG 
APPFFFGVLKKLPPLFFVSKG 


2188 


16089 


A 


2203 


81 


379 


KKKKKPPPPKKKNPGPKKQEGGPLRGPP 
TFSGGGGGKKTCPQKLKAAGG/SKKAPG 
PPPGAEKKNPALGGEKKTLGPNGRGPKR 
LRGLAKKGNPPSSFGGK 


2189 


16090 




2204 


1 


341 


GQSLPVSPRLQLSNGNRTLTLLSVTRND 

vgpyeceicjnpasanfsdpvtlnvicey 
llfXpvaqatsphprsqrpglsvplssk 
nadspptprnpahhdflpoanmgrpsld 

QE 


2190 


16091 


A 


2205 


2 


370 


DAPPRPANFVFLVKTGFPPVGQAGFKLP 
PPGDPPPIASQITGKG/HCAQPPF 


2191 


16092 


A 


2206 


362 


24 


PRGSSSASSSSSSSSSPPSSSSSSSSSS 
SSSPF\PPPPVLKPPPPPPKKSPPPKKK 














GKRAKPPFQKKKKKRKRESSFYQQVIIH 


2192 


16093 


A 


2207 


357 


3 


AGQALWLARVIPVLWEARS YYVRI SWYR 
NIVAMDSDSLDGSGHSKLKSFHKGFPVL 
YAIKKTHDSWEGVS\MSALIVWKSLIPS 
CVDAFEELTSSAEEVAAQWGIATDLEL 
LVGCARA 


2193 


16094 


A 


2208 


280 


411 


GNDVYFLVFLF/ CLFEKESCSVAQAGVQ 
WHDLSSLQALPPRRMA 


2194 


16095 


A 


2209 


3 


392 


P I I YEKYTQQ INEMPRKLQHQQLALVSR 

ngpillhdnaqlhitqc/lqklnelgyr 
vlphlpysldlsptdyyffkhldnf\lq 
gkhfhnqedaenvsqkfvksqsmdfyat 
/gnklishwpncvdcn 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=G!utaniine, R=Arginine, S=Serine, 
T=Threorine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


2195 


16096 


A 


2210 


248 


417 


GILANCASIKDLISRIYKKLKQIN/KQK 
TNDPLKKWAKDMNRYFSQEEIQGVWKHM 
KK 


2196 


16097 


A 


2211 


213 


493 


GKE FYFACPGKIGALQGFFKGGGPFFFF 
FFFLTPPPNTNYLTNKIiKIKSPLCTFYF 
SNILGFYPSITHRTIPDAWVTA/SQSGP 
VLMRSIS 


2197 


16098 


A 


2212 


159 


413 


IiAPRVI FGPPKKPPERPPLFFFFFETES 
SSVTQAGVQWHNLSS\PPRFKRFY/CLS 
LPSNRDYiCRPPPHPAtlFCTFSRDGVS 


2198 


16099 


A 


2213 


455 


60 


NPRREVGPICPPPKIRVPPQNPQVGFYS 
PPLKEKTFTSPAPVNLGPPRDPFKRPPP 


Tl99 


16100 


"A 


. 


_ 




AQPRV2«iRPLVPSS\PPLPSIiCIAPAPR 

GPGS LCPRGSLEGDNGSS PG 

LTLSPRLKCKGYVSAHCKFCLLSSSDS/ 
RLR/QENCLKLGGGGCSELP.LP.HCTPAW 
TTERDSVSKTNKKNSLKIFTCFFVDAA 


2200 


16101 


A 


2215 




286 


FSQLLRLPIC/SQGGRLLCCESCPASFH 
PECLSIEMPEGCWNOttlCKaGKKLHYKQ 
IVWVKLGNYRQVFPRTKRKYSI IVQTSF 
ILWIQSDLEIDR 


2201 


16102 


A 


2216 


173 


2 


IT I FFFVRQGLTQAGVQWR\DLSGETLV 
ILPPLEAPCSLQSSWDYRRVLPHLPNFC 
IV 


2202 


16103 


A 


2217 


216 




vyppsfmvfsqvflssthislsfsffwn 
ylfiylfiylfiylrwsl/dsvtqagvk 
whislgs lqpllp6fk 


2203 


16104 


A 


2218 


14 


228 


KRSSHLFTDDIILYMENTKHSTK/NLLE 
LIKEISKVTG/YQKSVAFLYVNHKC2AIK 
KTIPLTIASKRIKNSGQA 


2204 


16105 


A 


2219 


244 


3 


EVLNQNSGLPRMPNILRAKi\SLRVPRQC 
SRGWFSANGAGTTRYLYAKEWGGLGGG 
GCSKLRSCHCTPVWAT\SETLSQKK 


2205 


16106 


A 


2220 


162 


2 


inmvgnlfffgdrvslchagwsavapsw 
ptvastslXvkqssflslpssmnhrh 


2206 


16107 


A 


2221 


146 


3 


GRVDGVPWRNPGSLQPPSP\GSSDPPTS 
ASQESGTTGflHHHTRLIFVF 


2207 


16108 




2222 


239 


2 


SYISKPDKFPHDSSPEIKPVTVUWRNSF 
SFI FPFFFLFPEKESHSVTQAGVQWRKL 
GVSSYWP\AGLKLLTSGDPPALAS 


2208 


16109 


A 


2223 


2 


159 


LNEDLGGGGCSELRSCHCTPAWAT\SES 
PSQKKKRKKKKKKKKGGRGRNSKI 


2209 


16110 


A 


2224 


3 


345 


RFKLFSCLSLPSSWDYRRVPPRPA/NFF 
VFL1ET\GFAILTSSKTERQSRLECIFG 
FYGLPCREKHASERRSVEG/HERKILFS 
FDFFFLGGTESCFVTQAGVQGCYLGSPQ 
PPPPG 


2210 


16111 


A 


2225 


103 


319 


FSEEYRNVTNFLMLTMSCSCLTLVE/C/ 
WSEGYMATPCTILLLLFFQRHCLTLSPG 
GVQWCSHSSLQPQTPGIK 


2211 


16112 


A 


2226 


2 


110 


FHHVGQAGLELLTQWS IHLSLPKCWDF \ 
RHEPPHPA 


2212 


16113 


A 


2227 


2 


178 


IFLIFTFLEMSSHYVAQAGLEFPG/FIOj 
TSRLSLLSSWDYRRPPPRLAWFFAFLAK 
GDAA 


2213 


16114 


A 


2228 


273 


3 


fififislffifflrqsivlspsavqsr 
lqpppprfkqfsXclsllsswdykrvpp 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 

location 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ]VI=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIiitamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 
















2214 


16115 


A 


2229 


333 


1 


HTPPPPPTFHSLCTEGftPPPPQNT/PNP 
PKNPFI I PLSRSHTRSNEPSLSTPRTHP 
HSPRPPLLTHPHPNPRaSAPPGSRYPPR 
ARHRERPRERPTQREREEERESVCVCV 


2215 


16116 


A 


2230 


350 


3 


KKKDIPPVNIYAPNTGAHKYIKQILLDL 
NRDGP\HTITAGDFTTPLSVLNSSaESR 
CSRPFIISPSLWGLRE 


2216 


16117 


A 


2231 


277 




QMHPARGHLPQALI PVQKPALISQGMSA 
SGSTQVSPFLSPCFWVEVNCSNTKVLT\ 
PFCGAGTGSHSFAQAGAHWCNRGSLQPH 
PPGFKRL 


2217 


16118 


A 


2232 




335 


ETRFHHVGQAGLEPIjTSDDPPASASQSV 

/gitgvslrrrpges/gswktvccnnms 

EPT 


2218 


16119 


A 


2233 


29 


448 


CPSLRQAWHEAAIDEVRTGTYRQLFHPE 
QLITGREnAAHNYARGHYTIGKEI IDLV 
LDRIRKLADQCTGLQXGFLVFHSFOaGT 
GSGFTSLLMERLSVDYGKKSKLEFSIYP 
APQVFTAWEPYITFILTTHTTLEHSDCA 


2219 


16120 


A 


2234 


1 


365 


GARLILVFLEETGFHYVGQAGLELLTSS 
DPPASA/FPKCW 


2220 


16121 


A 


2235 


28 


460 


DRLIDHI SKLGTRGLQGFLVFHSCGNLG 
\TGFGFTSLLMERLSIiDYGKKAKLEFSI 
YPAPQVSTAGVEPYNSILTTHTTLEHSD 
CAFMVDNEAIYDICRKNLDIERPTYTNL 
NRVISQIVSSITASLRFDGALNVDLTEF 
QTNLV 


2221 


16122 


A 


2236 


614 


84 


LAASLAACAQLSALaASHRMWALQRLRK 
LLTTEFGQS ININRLLGENDGETHALSF 
TGSAIiAALVKGLPEALQRQFEYEDP\ IV 
ROGKQLLH/SPHFFKVLVASRLVTLEAG 
HFCPCCAETHKW\AWFRRYCMaSRVava 
LDKRTPLPRVFLDEVAAVRVCGHI lqlg 
DTELQQHICHL 


2222 


16123 


A 


2237 


1 


393 


GPMLAQLSVFRCGSTSAPKDLWYHFIEL 
PYHGESITMLIALPTESSTPLSAI IPHM 
STKTIDRWMSIMVPKKVQVI LPKFTAVA 
QTDLKEPLKDLGITDIVDSSQGHFCQIT 
KAENLLV/SHILQKQK 


2223 


16124 


A 


2238 




402 


HVGQAGVQIGKACWELYWLEHGIQPEGQ 
MPSDKTIGGGNDSFNTFFSETGAGKHVP 
RAVFVDLEPTVIDEGRTGTYRQLFHPEQ 
LITGKEDAANKYARGHYTIGKEI IDLVL 
\DRIRKLA\DQCTGSQGFLGFP 


2224 


16125 


A 


2239 


2 


478 


grgglhripwtpltpsfarglvpslar 
gvearngagpi ks yprpgsrlkmqngs k 
gsglqnktfhweicdahvnskiqlkqVh 
i ssrrhkdrvagkplkpkyrpywklqrs 
psilaaklafqkdmmnptiapaflsspla 
aaeavssaltlpprpfcfv 


2225 


16126 


A 


2240 


255 


2 


FLFVKPHQISCPTKKGIKSFLVLCPLWF 
FLFWRQDLAHPGWSAVTQSWLTAASN\Y 
GLKQSSYFSLLSSWDYRCIPPHLGGKRP 
L 


2226 


16127 


A 


2241 


365 


11 


eppppggekkkrgkpppkktpkkdgpqk 
ksaaffggXgkikkkkgafrkkkkkggg 
pppkkkppprkkkkkknfsplpppkhtp 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SLFQKPPTKIFLFSPPLLFLNILFFFPL 
SPFIF 


2227 


16128 


A 


2242 


3 


143 


PNFLSPHLDCRGTIRGHCGLYLPGSCDP 
S/CLSPPSSWNYRSTPP 


2228 


16129 


A 


2243 


132 


2 


msahcnlhlpgssdspasvsqvagimga 
chif\vflvqtkphhi 


2229 


16130 


A 


2.,4 


334 


13 


TS KHMKRCS I SLATREMQNKTT IGGHFV 

PTRT AK TQKPENAKYRQGCT.,Mr/WCWVC 
NLA3 P LWKT I WEYLVKCLP YS S AI FTLG 
\IYPEEVLASSVPGT 


2230 


16131 


A 


2245 


312 


2 


PPSKLPEKNFFFFFFFLRQSL/SSVAHS 
WQWRDLGPLKPPPPBSKQPSCPTLLS I 
WETQTALSQDCaXAIflPGGQRETLSHKI 
CVCVCVCVCVCVCAVCIYIN 


2231 


16132 


A 


2246 


3 


230 


RAQflMVETSRERCLLRPPQIETR\HVAQ 
AGLKLLASSDPPTSASQSAGITGVSHHT 
WPQPLTFCPHftKSLPFINQI 


2232 


16133 


A 


2247 


2 


214 


GRVDLVIQAGVQWHDLGSLHPPLP\GSS 
DLP/ASDSQVAGTAGREKYAWLIFF\VF 
FVETGS /HTQVSNS YDVL 


2233 


16134 


A 


2248 


3 


139 


EfiVQGflHHNSLOPP'tPGNSSDPPTSASI 
EAGTAGSHYHVWLI FLLF 


2234 


16135 


A 


2249 


3 


160 


EGVQGCHHSSLQPPTPG\SSDPPTSASI 
VAGTAGSHYHWILIFLLFCFEGDAA 


2235 


16136 


A 


2250 


319 


3 


GQKRYKI FI.CFFT.FGWLVFLRWSLTLSP 
ELERESVSKKPKSQKPPIKKPKFTPREF 
KEVLAKYGTKFFFVFFGLFVWFFLRWSL 
/NSVAQAGVQWRDPGSLQASPRP 


2236 


16137 


A 


2251 


15 


394 


FVSFSFFPSFPSHLFFSSSSFPPFLPSF 
HFSFLPSDRPSWPS/FIiPSFLP 


2237 


16138 


A 


2252 


60 


306 


GRE R I LEE I MAEDFLNLMKDLN I S I Q VA 
QQIPSKMNSKRPHRYQHFRSQSQRILKA 
TREKQLATYKGSSVT/VSPSPGPQTVNS 


2238 


16139 


A 


2253 


3 


351 


GFHHVDQANLKLLTSSDLPASTSQSTGI 
TGI\DHCTOPNFPI 


2239 


16140 


A 


2254 


150 


2 


RPRRPDHSRLGAGDOPGQHSKTPS\HQK 
KTKTSQAWRHAPAIAGTRQAEA 


2240 


16141 


A 


2255 


357 


1 


LNLNLSLTLYAKINLKWITGTWVKHKTT 
KFLGGKWGANLiyiDTRLDISAPLDLTSKAQ 
LTKEKIDKLNFIKIQTF/CSIKDLLGNL 
KRQAPEEKKILRNHISNKELVSRIHKEV 

PKLNNKK 


2241 


16142 


A 


2256 


397 


1 


fslfppvggqggffsscksppprfraff 
cpnpfrkggnrgppphpgkgtlgfffff 
frqsfalsprveynlclpgssdppasas 
qvggapglpppawvnfgifff/cfflrq 
srsvaqaemqwrhlgslqamp 


2242 


16143 


A 


2257 


2 


132 


TLTLTPRLECSGTIS /AftHCKLHLPGSR 
HSPASAPRVAGCGGGHL 


2243 


16144 


A 


2258 


214 


347 


KISL ILGVHKI CCEF / CFFEMKSRS VTQ 
AGVQGHDLSSLQPPPPGL 


2244 


16145 


A 


2259 


347 


2 


ffsfffffseaesrsva\rlecsdtvsa 
hctlhlpgs 


2245 


16146 


A 


2260 


333 




sdqrwtenafvelrdegfacps fsel/ r 
stpsts6eevenfekkldecitritnte 
kct-kelmelkakarepreecrslrsrhn 
qleervsvmedqmnemkregkfrekrt 


2246 


16147 


A 


2261 


2 


357 


SPRSCSVYGIAILLFLYFLYKIAFALLY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predic-ted 

nucleotide 
location 

in'?ofir? 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
hist amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phen>'lalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GLALKS FLHE IQEP /SLASGSGPLSRNS 


2247 


16148 


A 


2262 


355 


0 


PPLPSPLSPPLSLSFFSLPPFPFFFPFF 
PLF/CPSFSFFFPSPFFP 


2248 


16149 


A 


2263 


348 


35 


YFLKKGFS FFPRGE6RGKDSRSLKQLTF 
GFKKP/SCPSFLRKW\ETRLCPPAQKIF 
FFFFFLLEMGFCYIAQGDLKGSSQSSGI 
TGVSYHIWPTFIGHLTMCLAKC 


2249 


16150 


A 


2264 


270 


92 


DRPARRKMFFTYSSRLFFFFFFFFFFFF 
FFFFR\FFFFFFFFFKFFFFLFSRWSLM 
IFIS 


2250 


16151 


A 


2265 




417 


FRPPAGVQWHHLISLQSLPPSFKRFLYL 
SLPSS /WDYRCMPPQLA/NFFVFWFDV 
VWWVVLVEMRIJiHVL?iRLDIVVLICI 
SLMftMGMEVIiFLCSFAIHLPSLLKCRFT 
YFflHCLFLYYFLETGSHSVTETGWQWCI 
I 


2251 


16152 


A 


2266 


344 


427 


LIGLCEDTWLCSAIHAKRVTLIAQDIjQLA. 


2252 


16153 


A 


2267 


310 


417 


RVPYGTLGSGPLATMAGEDDKLKPHMEG 
AAAPILVR 


2253 


16154 


A 


2268 


186 


464 


IsrSCLSHNQRQLLFRI.EKGMGPISAQEKL 
VIiHTLSGFSGLV\VGWLVFEMESCSWQ 
AQVQWRVHPPPRFKQFSWLSFPSSTOYG 
HVPPCPANF 


2254 


16155 


A 


2269 




427 


PEPPSLPPDGAKKQKTQKSKKWRICFUl 
KKKKKKKRKKQKKKKKKRGGGSQKKSRR 
GPKLTTETNI IILIKGGLKKMIIYREIEK 
KLLFGGGGVIGPTPTQDIKBREEINYLE 
AVGREKQRFISLVK/TNVA/HEATRDTI 
FRGYL 


2255 


16156 


A 


2270 


399 


44 


ISFQLLLPITVLPGHVRVLMEMYKEVHV 
VFVP\STTTFMLQSMGQGVTLTFKSY\Y 
LRNTFNKAVAALTSDSSEESGQSQIiKPV 
WAAFS SLNASKNI /RDSWEAVKI PALKG 
VWKKWSLKM 


2256 


16157 


A 


2271 


19 


420 


AAGIRHEERERERERERE/QRERERERE 
REREREEARRNIYHTYRPRPPRVFFFFF 
FFFFKKKRRGFLVPTPPGGG6AQKKKTS 
LEGGKGVFFKGGGKKTPLKNPGGWGEPP 
QKKNGGGGPPPRDPPPPPRPLF 


2257 


16158 


A 


2272 


463 


20 


SYUIPLSQSLIQ/SRALTLFDSTKAERN 
RRGK\RSGKLQWEGSRGWLMRFKERSHL 
HNIKVQDEAVSYPEDLDKMDALNTKQQI 
FSVHKIALYSKKMPSRIFIAVERESMPG 
FQAS KDRL /LLLG /ATAAGDLKLKPML I 
YNSKNPRVPRAEF 


2258 


16159 


A 


2273 


474 


82 


VGVWADFLKNTSQAQATKAKMDKWDPIK 
LKS FCTAKE T I S KRKRQPTEGE KI FftWY 
TSDKGLI/SRIDKELKQPYRKNPNITPVL 
KWAKGWAQWLTPVI LALWEAKAGRTQGQ 
EIEATLALFSGLFCQVFLC 


2259 


16160 


A 


2274 


152 


3 


AEGHNGDG I IQKS S I XRTLLSNDKNPQN 
IHRRPTRFLTMLYQQNLCHLGL 


2260 


16161 


A 


2275 


32 


361 


LGASAHYEKPTVITLILNGEHMIIFPIR/ 
S KTRLGYLLSLLL FNI GLAI LAS AINQ K 
KEIQI IQIVKKKIKTQRKKKKNKTKKKK 
ALFKFKGGPEKEKGPPKNPFKTPPWF 


2261 


16162 


A 


2276 


329 


487 


EFVNITIKIATSLHYKAIVIKA/AMVIW 
YWHKNRHIDQWWRIESLEINPHICIIK 
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SEQD) 
NO: of 
nucleotide 
sequence 


SEQIB 
NO; of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last^amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glj'cine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2262 


16163 


A 


2277 


2 


466 


ahhscshllttaahyscspqlltttaah 
tys ppqlltpahhhs cshlltttaahhs 
csppqvltaahh\gahtcspqlltprhh 
sspqlltpahhscphlltttaahhscsp 
pqlansyrg\spycsswsqtpgfrkssh 
vglpepwifhglqw 


2263 


16164 


A 


2278 


^ 


467 


KNVTQQGKIHIRDKLDEMWGNTSVFCTN 
HMKHQTNFNAKKCNVFKECGK\TACNFQ 
LTQYQISHANQKPYECQIC6KPFRKRAH 
LTQHNRIHTGGKPYECKECSKVFICCST 
LIQHKRTHTSEKPYECLECRKTFRRSAH 
LIRHQRIHTGEKPYK 


2264 


16165 


A 


2279 


383 


3 


FLCVCACVCVYVCMCMCACAC\CMCMCG 
CVCVCVCVCVGLRGLGWGAWCESWGPP 
LCFLLLGILPLKSRLLWLPRRTISICTL 
PSAQGPLPAPGFGKYASNTTGVKGSSSV 
FSLLSRITALSHLHW 


2265 


16166 


A 


2280 


47 


219 


VCSELKSCHCTPAWAT\SVTLSQKQNKT 
KQRRTLGSIFFQHTFMHLKKEKSLILQK 
W 


2266 


16167 


A 


2281 


294 


160 


NKTFFFFFFFFFFFFFFPPFFFFFPFFF 


2267 


16168 


A 


2282 


68 


490 


RTFAYPKPPVFGSHSGKTTIPKNARGSF 
PS/PPLPFPCPSPSPGPSPVSCLPPHLP 
SPSPSPSLSPISRLPSP/SRSPSPSLS 


2268 


16169 


A 


2283 


3 


454 


CQSAPLGSASQLGYSGVRDTLEEAVCPF 
SDIWLCAGRTTTLFKAVRQGHLSLQRFR 

LPFVWLCPAPRGGVYRGRQASr,S CGGLH 
QVRASRPLCLPTQASAMAGAPPPASLPP 
CSLI SDCCASNERGS TGMGP/ SE PGTO 


2269 


16170 


A 


2285 


1 


452 


LKDS6RDWSQFEGCALGKQLNLKLLDN 
WDSETSTFSKLREQLGPVTQEFWDNLEK 
DTEGLRQEMSKDLEEEKAKVQPYLDDFQ 
KKMQEEMELYRQKEEPLRAELQEGARQK 
tHEL/QEELSP/LGQEMLDRflRAHVDAL 
RTHMAPYSDELRQ 


2270 


16171 


A 


2286 


3 


266 


NSSPPSSGHSTPRLAPPSPAHEGTDKAV 
SALKSPQPNRGMGRGQR/PGLPS/DTAT 
ITPHTSGFPKQPQLSUCVQTQRAKGRLS 
HWDLEP 


2271 


16172 


A 


2287 


274 


453 


lYTFVKSSSKTliRPRHIDQWNRIENPEI 
KP/EYSQLIFDKANKISriKWEKDTLtNKW 
CWDN 


2272 


16173 


A 


2288 


81 


487 


TVYFKPTVETY \CWDKKKI PFKGLLRRD 
NTPCYPKSLLEMCEKINIVFAPATTTS S 
/LKPMDQGVIVTFKSNYLKNTF\RLGEG 
RKKKKKKGRKKS FHKALAAI NS DS SDGA 
GQSKQKT/FWKGFSIPNAIKNIQDPWE 


2273 


16174 


A 


2289 


75 


469 


SRGVAGAPPKPSPSTPSPGPLDVTPGPH 
SSHHAASPGP\PPPPEPTASSMASAP\P 
PAPQPTPL\ PPATLGPPSAGPE / PS PGS 
CTSTGWGYSFCCPRCRRWMRWQPPQQGP 
AWHWWPREFP/PPPRVSGS 


2274 


16175 


A 


2290 


256 


55 


PTQPLRVELTTVTLRCDINKWDYIKLKS 
FCTA\KKRQPTEREKIFPMHVSNKELIS 
KIYKELIHRI 


2275 


16176 


A 


2291 


497 


29 


SLTHRVAGGAAVTPLAHAGAHQ I FFLGD 
PHPTSSLLGWGPAWDPCAFQVSDHPASS 
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SEQID 
NO; of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
,09/515,1 
26 


Pred icted 
beginning 

ing to first 
amino acid 
residue of 

sequence 


Predict- 

°o2ian 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=]soleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutaniine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=llnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RVGKTLLSCPGC3GRLPWVTCPGGGQLSC 
LTQMHLTPAGMCSPCVCVCVCVCVCVPW 
GMCPRCMCVCVCPCRDVY/ I CVCVCVCV 
CVCMCLCPCWDVFTEL 


2276 


16177 


A 


2292 


141 


392 


S CS TE VKQ P K IGVRE VDPVAAPGI AP S S 
PQKKQTT3TSSLKSLLRLSKVRFLFNIV 
LDVLAEEIRQEKEI KGIQ/LGKEE 


2277 


16178 


A 


2293 


14 


291 


YVGTEGISFISFMRVTNYMTRHLATLRE 
S\CYSR\VYPRFIEFLRFDIQSTGQ/RI 
TSRQHPPR/DLRDALLYLNRRITLVRTR 
CKSVAKRPPGSP 


2278 


16179 


A 


2294 


no 


293 


IiLSNRSIAIASLCGGCISIELRSCHWTPAX 
WRQRETVBOCKKKKKKKPGWGFLNPPPQR 
GKLCF 


2279 


16180 


A 


2295 


96 


313 


WNGCYLLSNRSLAIASLCGGCNELRSCH 
WTPA\WRQRETVKKKKKKRKKPGWGPUI 
PTPQREKLGFLKEGPGF 


2280 


16181 


A 


2296 


234 


2 


CCLETTRSLFDKGTKENTQWGKDSPSNK 
RCWKNWISTCKRMKLDPYLIP/YTNINV 
KY/IKDLNLRPEITKLLEENLREK 


2281 


16182 


A 


2297 


334 


6 


KLFSPPGGGGGPPPPPPKKGWVPPETP \ 
KHGGGAPNPPPPKGGGAPKPPPQKKNSP 
PKKKKKI FCPPPKKKGPKGVFLRGPRGP 
YRVFLKGPPLFFFFLKKSWRPIiAMYA 


2282 


16183 


A 


2298 


467 


8 


lpgfkastdrdn\vagdfklkqmliyhs 
esssalkndtkstlpvlyrnkeaw/vta 
hllrpwcteyfklraetccserkisfki 
llliXdnapsypealmkmfkikvfmsdn 
ttsivystdqgviltcksyylrktfyka 

ITAIDSDSCRMPQEGN 


2283 


16184 


A 


2299 


1 


449 


SIYLSIP/FNLSINLSIYLSIPIYLSiy 
/HISVyiYL/SIYLSSIHLSAIICLSIF 
QSISLSLSINLSTYLSTIHLSLSSIIYS 
S 1 1 YHLS IH/LSYYLS 


2284 


16185 


A 


2300 


1 


445 


QAGLQLLTSGDPRTSGLPQCWDYRC\DH 
RSWQT 


2285 


16186 


A 


2301 


241 


22 


KWVLGTCACVHVF I CVYMCLYVCAWV/ C 
VSMCLYVCACVCMNQC/VTVCMCVCVCV 
CRFVCVGIPPHSKWISIG 


2286 


16187 


A 


2302 


3 


478 


GGQTETLLTSQRKGGWPEALLTSQMGRP 
GRGAPHI PDDEQPGRDAPHLPDGAAGQ\ 
SAPHLPDGE\PGRGAPHIPDGAAGQRRS 
PLPRWGG/ELGRGAPHIPDGAAGQRRSS 
HPR / PGR PGRGAPHFPDGATG/ Q /DGAP 
HFPGIPPDGT 


2287 


16188 


A 


2303 


440 


41 


KSHHLSFLSFLLFFFP/TKSHSVAQ/CW 
SAISAHCNLCLPGSSHSSALASRVAVTT 
GVYRI 


2288 


16189 


A 


2304 


2 


395 


FFLVKTRFLHVGQAGLKLPTSGGPPALA 
SQ/SL/RITGMSHRTQPE 


2289 


16190 


A 


2305 


184 


2 


SIKKPKPLVffljRWH/NT/WQGAVTPACN 
SSTLGGRGGRITRSGVQDQSGQRRESEF 
LRLGYGLD 


2290 


16191 


A 


2306 


86 


472 


I IKLCKWQNK/RFEAGSLVPEMGFY YVT 
QAGLELLFSRDPHTSASQSAGITGAAFH 
QRWSVGTVLLQVDRGTPPVGDCGSRTPQ 
WPGQAFLRTALKSEAHPPHSSTDVTPVL 
WSEGSPCIiSPPSLSFTG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO; of 

seq'uence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 
peptide 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cy steine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tlireonine.V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=tJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2291 


16192 


A 


2307 


133 


2 


RVIHVWRSHPAI P \TTREAEAGES PE P 
GGGGCNEPRSCHCTPAW 


2292 


16193 


A 


2308 


3 


369 


LTTAFPDFPGSTHPLTSASQVaWATCSAH 
HHSWLIFVFFVETGFHL/SELLSSSSPI. 
ASASQSAGITG/REPPCLTSPFFF 


2293 


16194 


A 


2309 


253 


365 


TDYFYLFI YLRQSIi/ SSVAQAGVQWQDL 
GSLQPPPSGFK 


2294 


16195 


A 


2310 


362 


0 


TKAQQCVHENHFKLKDTIWTLNIKVWRNI 
CHSSPNQKKYGLAILNLDKSGFRSRKDT 
GDEE/HFIKIKKSVIQEAIIIINIYA 


2295 


16196 


A 


2311 


167 


2 


LCYCVI FVPFIFWRQSLTLSQGQMQWCD 
HGSLQPLPP\GLKRSSHLSLPNSWDYRR 


2296 


16197 


A 


2312 


67 


482 


DHLIPGGGGCSELQSHPCTPAWVTA/ET 


2297 


16198 


A 


2313 


188 


437 


AGFFPPENQLTHMKVRKANESDPWGVKP 
ESDESETHGSLSLSLSLFFIiFIFIiRQSL 
/NSVIQAGVQWRNLSSLQRISP 


2298 


16199 


A 


2314 


41 


325 


TTTLFXRVRQGYSSLLMFLLPFVCLCPA 
GRGGVYRGRQSSLSCSRLHPVEASRPLC 
LPTQASAIAGAPPPALLPPCSLISVCCA 
SNEQGSVG*DP 


2299 


16200 


A 


2315 


417 


0 


SPPRNWDYRCVPPRPANFVFLVEMGF/Y 

HVARPEFGLELPTSGEPPTliA/FSKCWD 
YR 


2300 . 


16201 


A 


2316 


98 


288 


lmawpatweaeaqeslepggwgggeeg 
cselrschctpawvtXsetlshthtkkk 

KKNGAAL 


2301 


16202 


A 


2317 


. 


410 


LNHIPNLSLTKRKPSPHSLNLKKKKKKK 
KKKKKKKKKKKKRGGGVKKKPRGGQKKK 
GGEKKNFP SKKGGKKKKRGEFGKKNFFG 
GGKKREKTPQKKKS PKGKKKNLR/EERG 
EKNPKRGGEKKRSSSPRMKNLRGEKK 


2302 


16203 


A 


2318 


249 


3 


plkas s ppkafnfcrevgp i cpppkkkv 
pVpkipklvfipppirkkllpcpppltl 
apprvplkrpp 


2303 


16204 


A 


2319 


2 


393 


AHLGLPKCWDYEHEPPRPAPLPLLNSYF 
GLDLLT/S/GDPPALASQaaEITGVSHC 

AQ/PEYVif 


2304 


16205 


A 


2320 


389 


2 


rgksniggpgplrgkkfspppplknwgt 
kxgppppppffffff\gkpgfsiffkpi 
lkfkgpkkpfprapknggs pgpgwffff 

FLSMFTQLPGIKLSKGGKKHFASPPPPF 
FFFVHYKTGYPSATPDAW 


2305 


16206 


A 


2321 


389 


206 


FQWRWGF/NHVGQANLELLTSNDPPASA 
SRSAGTTGMSHHAQLKNYFLMVRMWENW 

IAVGM 


2306 


16207 


A 


2322 


371 


42 


SFPPQSGFFSPPPPHEFFFPPPPSFFSW 
VGVRQIPPPPKIFSSPFPPGGFFSPPPE 
R/VDFFFS PPPFFFPPPS FFLS PP PPFF 
FFFFFFFFFFFFFFFFFFFSFFGVKKRK 


2307 


16208 . 


A 


2323 


82 


386 


PFLTQKYFFTPPEEGFLKKPNRREGPPS 
PITDPTLWPNMMKGIVPKAPPI IFMGGG 
INMTFSGFVTTKAPFPRPLRFNPMLRQG 
FDLLTLKAS \WGS SASWY 


2308 


16209 


A 


2324 


1 


413 


RGSGDNRHGPPCRVNFVFLVETGFLHVG 
RSGLELPTSPALASQSVGITGVSC/RPP 
PQASY 


2309 


16210 ■ 


A 


2325 


58 


400 


SETLVSKKKKKKKKTPPPPKNPKKKiasrP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 
nucte- 

locittion 
correspon 

last amino 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Meth)onine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPPPPGAKKGPGGKF/PPPPIFQKKRGG 
FKK/SPPKKPPKKKKTPPPLGKKGPPPP 
RGGPLGGFPPPPPQKKNGPPPPKKKPPF 
FFPPK 


2310 


16211 


A 


2326 


462 


306 


AHHNIiEa.PGSSDSPASASGVAGTT/GMC 
HHARPI LYLSGEftS ALLHCFS SAQLF 


2311 


16212 


A 


2327 




^1 


393 


SRPPSRTEKIRNFFFVETGSCYVAQAAL 
KLLDSSDPPTSASQSAG/ITGMSHRAQP 

TS 

^fketkmwTiprswtogswvsqpysaT 
ltsspesgfhsvtngdsmtcledgkiisl 
vaphgipqrgsslqdgvsfcpaptmkpy 

LASTKM\RLHEAREIX3FFSFLSLRQGIjA 
LS PQLECSGAILAHCSLKL 


2312 


16213 


A 


2328 


2313 


16214 




2329 


2 


115 


GCSEIiRSRFCTPAWAT NSKTASQKKKRG 
KKKKGGVFYF 


2314 


16213 


A 


2330 


2 


406 


AAAPSALALRDGWAVRPELDLLPPCGEE 
VAPGAHCLGCGPSPCLFLSPSHTRQSPP 
APTSSPGLSTSPPLVPTHVSAPHSSKGP 
PSIPGAQALRGCGLGGWDRXPSSPSLP/ 
PDVSPKPLNFAP 


2315 


16216 


A 


2332 


226 


377 


KRKSKHITFLFKTLSWPDTVAHACNPST 
LGGQGGRI/TPRSGVRDQPDQHG 


2316 


16217 


A 


2333 


3 


191 


clspggggcgglrlcycmpawvtxwetv 

SQERKKERRKEREKEIKKERKKRKKERK 
KEKKIOCG 


2317 


16218 


A 


2334 


295 


81 


FFKPFFPPKKTCPEFS \ffwekrgfppp 
PLRS FFQNPPPKGGPPILLKGPPPPPSL 
GWPPPPPIiFFFFFFLR 


2318 


16219 


A 


2335 


375 


8 


TQIVPLPSNLGNKTRLRLKKKKRNEQGN 

iptdtidnrikqiiq/t¥yeqlcank\n 
ldkmdkaleshnfpklkqreslni/hsa 
keihfi/iiinistkktpdptgftgkflq 

IFKEKKMAGHSGSHL 


2319 


16220 


A 


2336 


399 


97 


FFFFFFFKQNLA/ S VTQVGVQGQYFRS L 
QPLPPRVKPFSFPKPLSNRGYRGPPLGR 
VRQENCPKLKSKRFQLKKI PRLGKKKKL 
RFPKKKKKEKEKKIVKT 


2320 


16221 


A 


2337 


411 


57 


PPPQIFWGPPFFPPPPFFKPPPPPFFFS 
PQKKKKI\SPPPPKKKFFFKTPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFF 


2321 


16222 


A 


2338 


97 


354 


akapslslvlfsfstfflgiqgiiallpm 
lecrgai taycslnlpdssgpptsaps p 
tpy/riagatgthhhalllfkffsrdgl 

PL 


2322 


16223 


A 


2339 


391 


45 


LMFFHLSHKHRSGEAPSIHWS I YLS IHP 
S IHPSLYHSS IHLS IHSTIYPSTCLS IH 
/CISIHPSIYPSICPSVHLLAHSFIHST 
I CPSMHLP IHVS IQHFLSAQILPVS VFG 
EVSDI 


2323 


16224 


A 


2340 


506 


0 


RDHEQLGIVHADKKKKKKKKKKIKKKKK 
KKKSSSSSSSPGTFFRG/VPLKDPVG 


2324 


16225 


A 


2341 


203 


1 


VELRVRATEPGFNFLKGIYCTSMVDWIK 
KMW Y I YTME YHAAI KG /DR IMS FVATWM 
ELEAIFLSKLMQ 


2325 


16226 


A 


2342 


402 


40 


PYPPKKKASPTDA\FSSSSSSPPPFFFF 
S PPPKKBFFSKPFFFFSPRFFSFPPFIiK 
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SEQID 
NO: of 
nucleatide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib]e 
nucleotide insertion 














PPPQFFFFGPLKKNFFFPPPALKFFFFK 
SPPPLFFFFFFFFFVEAGVLLCYSG 


2326 


16227 


A 


2343 


-TBI ' 


1 


qlkdqas dllgkwgdevket i p s flp s s 
lppfXlplflhqflpsflpsffppsllp 

FLS 


2327 


16228 


A 


2344 


2 


407 


FVASQLGCSGV\RVRDPLEEAVCPFSDL 
QLRAGRTTALFKAVRQGHLSLQRLLLSF 

Wflcpaprggayrgrqafls CGGLHPV 
gasrllclpkqawamagapppaslppcs 

LISDCCASNQRDSVGARPSEPGAGH 


2328 


16229 


A 


2345 


405 


178 


I IKFI YEKATA/LLNG/EKLKAFPLKLG 
TRQGCLHSLLVCNTVLEVLTGTLRQEKE 
IKGIIIEKKEVELSLFADRIL 


2329 


16230 


A 


2346 




412 


fkaskaslspllgantagdfklkpgliy 

hslkVlknyadsillvlcqwhnkawmia 

iilftawfteyfspplrpaqkkisficmll 

FIDSAPSQPCVLMEMYKEIN\n/F,ViPflNT 
T S I LQ PMDQG V I L TiKS y WLRDT t'H 


2330 


16231 


A 


2347 


2 

_ 


397 


KSLEPGRRRLQGARIMPLHSSLDNRVRL 
CLKKERKKEIGVLIRCWQECKIVQPLWE 
IVWYFLKKIitl\ESPYEPAVPLIJSriYPRE 
MKIHVYTNTCTQIFIVMiFTIAKSGKWW 
GTVACaCKPSSSE/WLRWVDYL 


2331 


16232 


A 


2348 






PKKK 


2332 


16233 


A 


2349 


49 


262 


qmckgsnrrrgkrvgsrqiskkktnapi 
kkwakdmnref/diqmankhmekcstsl 
i iremqikstmryhly 


2333 


16234 


A 


2350 


356 


2 


fvtaplhsslgnrarsyckekkkvqvaa 

KAVLRGKFI lAYTVFKKRKISNINLS IS 

lktlekeehtetkadgaqyvrki \raki 
nkietanetksrslektsktvegkclrd 

ILLSAQI 


2334 


16235 


A 


2351 


360 


19 


LDAQFLEVGLGELLFRSTVPTLQPPGCG 
ASFPVIiNTLPF\SLSPSQSSSSPASLVR 
PWVAPPFLCPHGEPDGGPDSTTSLPFPG 
PAATGPAflRLIQHPASRQPRPASHTHCG 

V 


2335 


16236 


A 


2352 


360 


0 


NTFLAAFRLVFCQMTSYSLALLSHKLAL 
MPLNLSDLLTRWTHCMGELFFLDILAIQ 
NPFHTVFFLGHPEWGMESRFWQAWQW 
PDLRQLQPS\PPGSMRFSCLSLPNSWNS 
PSYGRL 


2336 


16237 


A 


2353 


34 


474 


EGWRPCKELAARQV6CPHSCFSPHWQLL 
QKQEKTAGAVSVCVCTSTLCVCVCVCVC 
VCAQAMCVCAGA/CFCVCVCAGA/CLCV 
CVGA/ CLCVCAEAVSVCVQ/VAVSMCVC 
RS/VSLCVCMQGQSLCVCaGA/CLCVCV 
CGIPPPVLCLW 


2337 


16238 


A 


2354 


297 


16 


KFFFLKSFFFFSFFFLTPPRFFFFFPKK 
KKIFFFPPRKIFFFFLIPPPPXFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFXRHGG 
WFEEITILTV 


2338 


16239 


A 


2355 


3 


315 


PVTPATRETEAGETL \HDLGE PGGRGCG 
ELRSCHCTPAWVTEQDSVSKKKKKKRGA 
RFKESNFTTPGLQHNIFFLGALKLISGA 
GVLKRRDGKTLGF PQ FNRPWG 


2339 


16240 


A 


2356 


399 


154 


PGQRGEIPSLPKIQ/EIiAGCG/GHLNPG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
Od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

i*ocation 
correspon 

last amino 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucinc, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Aspflragine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GRIFGEPRSRHCTPAWATEQDSICKSHS 
RSGLWCYKSSSVYLSTRGVWVRGIASV 


2340 


16241 


A 


2357 


416 


2 


FFFFFSETESRSVAQAGLRT\QWRNLGS 
LQAPPPGFTPF 


2341 


16242 


A 


2358 


209 


2 


KKNLVPWPRGGYFKSLQPPPPGV/SCPN 
PPKKLEYRVLFPQPSNPFFFFCIFSRDG 
VSPCCPGWSRTPD 


2342 


16243 


A 


2359 


279 


380 


RGYNP / VIPGAVAHT CNPSTLGGRGRQ IT 
RSGDQDH 


2343 


16244 


A 


2360 


415 


224 


FFFVFFFFLLITFILMLNLLPCHKFLFL 
QFLLGYLFLLYL\CAFFLLVALNIFITS 
FQQLDYYTF 


2344 
"2345 


16245 


A 


2361 


278 


2 


ISDPFRFWNIYRIHTGLISLIQISENCV 
SCQKFQILEHFRFQXRDAQSVLRK\RKa, 
WTGAVAHACNPSPLGGRGGWI SRSGDLV 
HPGQHSETP 


16246 


A 


2362 


139 


282 


KKKKGGRGSGGGRGGGSXXWSGTKKKKK 
GGEKKNLWC3GGKSGGKGGGS 


2346 


16247 


A 


2363 


19 


409 


PKPPSVLGSGGPARYPSPLGSPNRPVPL 
GPGVGAPPGPPGKTPFFLKIKKIYPARG 
GPPVIPASPGGEGKKSPLPPRPRVPLTQ 
IFPP/PPPPGGPNQG 


2347 


16248 


A 


2364 


159 


383 


HSHFKNLSS I IKKLHRNNTFTEHFSLSS 
SLNQCFLNLTVFYSHLGNFKNSN/SWPG 
AVSHTOTPNTLGGRGGQITR 


2348 


16249 


A 


2365 


400 


221 


GRLRQENCLNSGGRECSEPRS /HLHCAP 
AWAT\EQDSVSKNKKQNKQKQTHIYTVL 


2349 


16250 


A 


2366 


383 


14 


GGRGCNAPRSCHCTPAWVTERASPQKTK 

kqkkthtkkhissccykmedplrqaStf 
lqcpeaegpsqkaarmelmekqeknqgp 
arhrrqeqpltsrpcpdhlcwlsqvss 
tpaqglsllick 


2350 


16251 


A 


2367 


274 


1 


PRKILKARGKEHLASRGTMIRMTSDFYL 

qtmqarrewskil/wvleekihqhril/ 
pvkssfkseeeiktfsdkqklrglvtsr 
sdlgkdvk 


2351 


16252 


A 


2368 


161 


2 


pffffsetesccwtqarvqwrhlgslp 
gssdspasasqvawitgtrhyawlif 


2352 


16253 


A 


2369 


361 


198 


rgrlifvflvemgfvtmlarlascdppa 
sasqstgirgmshnsqlkcftefdsfc 


2353 


16254 


A 


2370 


116 


300 


hlngdaveerdfmkctvsgi ivahcnle 
flg/ssdpsasaprvagttgmchhiwli 


2354 


16255 


A 


2371 


2 


192 


mkldphlspy'1'kvnsrwikdlnlrpkti 
ki lepnir/ itllg iglgkdsmtrnpka 
iaiktklar 


2355 


16256 


A 


2372 




133 


agellepg\grgcskprschcttawate 
qdsspekkkkkkkggf 


2356 


16257 


A 


2373 


43 


403 


lhdspaiiasqgagttgvshharpaagjn 
srigqaedriseledwlseir\rqsgrn 
mdkrmkmnkqnlqeirdyvetksm/ntr 
ligvpervgengsnlenifqdlihenyk 
gkpirlmvdl 


2357 


15258 


A 


2374 


404 


215 


GQGGRITRAQKFETSLGNWRPGSEAQE 
LLE\PDDRGCSEPRSCHSTPAWTTEQDS 
VSKKRRKC 


2358 


16259 


A 


2375 


3 


397 


skqleftqlytklnqlnktkisdlkkek 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

correspond 

amino acid 
residue of 

sequence 


Predict- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=tln known, *=Stop codon7/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YPIRLFSKENTQMASRYRTKYSTWLIIR 
EMQIKTTFRHKIiAALASSHCLLGLG\AT 
SAHCCTVGAPLW 


2359 


16260 


A 


2376 


413 




PKKGLFPKPIIWVTPGFFPPPRFKKPPP 
KKIFGAPKKKKKSPPPPAKKFFFFKGAP 
PPFFFFFFFFFFFFFFSGDSQERVREAM 
PVAGGP\PRPHSLSAPra.PGGTAWTPMH 
PVQTHKAQSPKLPASECPPPTTPLS 


2360 


16261 


A 


2377 


398 


247 


RRFHHAGQAGLELLTSSDLPALASQ\SA 
GITGMSHHAQPSATHFQKHLVS 


2361 


16262 


A 


2378 




143 


QENRLNPGGGGCSELRSCHCTS \AWVTT 
AKLCLKKKKKKKFFGKGGG 


2362 


16263 


A 


2379 


417 


90 


FFFFFFFPGGGRLQVCPPPGILLFCFLY 
KGGSPPGGSTIFPPPPPGEVGPPGPPPP 
GGFFFFFLE/QGGVSPDGPGFFVLPTPK 
KPPPPAPQKGGEPKFKPQVWGPPWPTF 


2363 


16264 


A 


2380 


36 


427 


VhplnhhdqKGoasStqkkKKkkkkkkk 
kkkkkkkkg/wkgggaf 


2364 


16265 


A 


2381 


402 


2 


NFFLKGRGWGPLPPFLPRGPPPRGGPQK 
/RGGLGGPPPFFFGSKTPPAGGSRTPMG 
QKKKGPPLPEGPLFGGAGQAPFLPPPVP 
RGGVPSPKKKKG/APPPFFPPPFFPPPP 
PPFFFFFFLRYHLALLPHLGCSGT 


2365 


16266 


A 


2382 


166 




THGHVIYVDQEMQMMIENM/WPGTVaHA 
CNRSTLGSHGGWITRSGDLMVKPRLY 


2366 


16267 


A 


2383 


2 


457 


TSQPSLLSSWDYRSTSPRLANF\ILFYF 
IYFFFAFSVETGFHRVSQDGIJ!III.TS/S 
/ 1 PS I PRI PKHWDYRHDP\RTWP 


2367 


16268 


A 


2384 




417 


GRVGFSQSNGNPSSLSFTLLKVDFEVTI 
PGEGKDRI FKVS IKWLAIVSWRMLHEAL 
VSGQIPVPLESV/QALDVAMRHLASMRY 


2368 


16269 


A 


2385 


318 


32 


TMEIMLDKKQIQVI F/FEFKMGREAVET 
THNIKYTSGPETVQWWFKKC CKGDESLE 
DEECSGRPEVGNDQLRailAHASADAWV 
DRDSGRCCCACP 


2369 


16270 


A 


2386 


409 


3 


ISQAPSTPPRKRGFR/PPQKTSFLGPPF 
YAAFYQEKKFLFFSSDPPRETGDKGKQK 
GFPPPKVAPKKKGFFKKGPPPGKKKDPP 
SFFKHKSQPTRPPRAPALEGPRSR/SAA 
LQPGDRRR/PPSQKKPTRPPTRPPTRP 


2370 


16271 


A 


2387 


415 


10 


KKGFFPFGPFIFLFPPPGFLPPPPPPIF 
WFSGFCPPPPPLFYFFFRGGPK/HILVF 
PLFFLPPPFFFFFPIFAKESPPKKGPGP 
LNFFFFGPKKN/SPPPFFFFFFCEMEFH 
S CR P GKSAKWHDLGSLQPPP PVFK 


2371 


16272 


A 


2388 


410 


95 


K.a.TMDKSDHIKLKSFYPAKETPTKVKTQ 
PPEWEKVFANYPSDKGLIPIIYKELKQP 
YG\KKSNNSIKKWAKDLNRYFSKDIQMA 
NRCMKRCSRPGAVAHTYNPST 


2372 


16273 _ 


A 


2389 


362 


92 


RFLFFFSPPPKKGFFPKPFFFFSPRVFP 
PPFFLNPPPKLIF\GPPKKKIFFPAPGG 
KKIFFLKGPPPFFFFFFFFFFFFFFFFF 
PVENTFY 


2373 


16274 


A 


2390 


131 


487 


atehekteksslsffsisjcrkkkmeklh 
digpssnflg\rpkaoatkgktdwtsak 

LKICSSRDTISRMKRQPKEWAKTFANKS 

cnnkkpekidknkkkkkkkflggallkk 
tnlkprg 
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SEQID 
NO: of 
nucleotide 


SEQID 
NO: of 

sequence 


M 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophiin, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2374 


16275 


A 


2391 


408 


145 


rwslalspelecsgvssapckvpppgvt 
pfsclslpsnwedrcespppaq/ivfig 
egfyilhgff/hrgpkircfisgcpppv 

LSFPT 


2375 


16276 


A 


2392 


1 


203 


lfffafsvetgfiihvgqaglepptsgdp 
pvsa/fpmcwdyrhd 


2376 


16277 


A 


2393 


386 


16 


TPSAGGRHIELSLSTCPSCAQHQGKKHL 
EGGEGGGAQSLTTAPSSATSSQDPISAH 
AVEDKLSIRLETDILKTKS/WPGAVGHT 
CNPSTLGRRRQaUHUlPGVRDQPGLVIC 
RPRPPKVMGLQA 


2377 


16278 


A 


2394 


3 


396 


QLIiERLKQEDRLSTQSQGCS ELRSHHCT 
PAWATEP\DSVSK 


2378 


16279 


A 


2395 


273 


416 


FFYSFLIKIRWKKQPGMVAQACNPSTLG 
GQGGRI /TLRTGVRDQPGQHG 


2379 


16280 


A 


2396 


329 


76 


FIPIESACSQECLKENLRQEW/YIFGTL 
KLIFFETESHPVTQ/DWSAGEIiL\DPRG 
RGCSELRSCHCTPAWATRAKLCIiKHTHK 
IKK 


2380 


16281 


A 


2397 


210 


7 


GGKKMYCRKPGGGGFFAAVWSPKGLFS/ 
RIWKEAPILSPQKKKKKTNWPIKKWAND 
ANRHFSKEGTQQ 


2381 


16282 


A 


2398 


48 


393 


SILKTRKPPLKKGRGRRKKEKESVHTHV 
FFSYQSNaFPSKPLNNSMTLGRLLSFSF 
FLVHFFFYTDGIIiLCCPGWSRTPGLKGS 
SHLSFPKCWDYIHEPPHPAYHS/LFCR 


2382 


16283 


A 


2399 


128 


383 


EBAPKHFPKPNLH/QKKVLVTAWWAAAG 
LIHCSFLNPWETITPEKYAQQTNEMHQK 
LQCLQLALVNRKGPILLHHNVRIjHITTH 
AS 


2383 


16284 


A 


2400 


54 


384 


LFTFILNSVFHTYMCLYFWTLFFSVKPF 
VSMPIPQCLDDSSFIISLDSEINPNIYS 
QLI FDKKTQRGKNSLPDRWCWENCIFTY 
KRMKWDPYL/ S P YTKI ISKWMKDLNI K 


2384 


16285 


A 


2401 


393 




HRGENTHQQGGGLSRWERHSRQRGTSRW 
IRHTRQWGPSRWITPSRQRGSSRWRiCRS 
RAGDIEVEKTQQSEGALEVRIRSRQRGT 
SRWRSFSRQRVSFRWRICSRQRCSS/KV 
RRRSRQRGTSRWRRHSRQR 


2385 


16286 


A 


2402 


1 


330 


RP PP P PHCWD YR /HEPPR\QPTLWVIPK 
LSVETRLCYVAQIGLELLGSSKESSRLD 
LPKCWDYRHELLCWMVIFQEKLV\SGFL 
FKIPRFFKAGMEIFKQIEGFWSSHPLAT 


2386 


16287 


A 


2403 


3 


407 


ADAWGLRGTHGPWEQAGISGISPSWSFL 
FVCFRQSVALVSQAGEQWRRLGSLQPPP 
PGCGRESC\PSFQGGITPSCFFVFLEKM 
GLHRVGQAGLQLLMSGEPPALCSQRRGI 
TGVSHHARPPSKGFIWHTGAPAM 


2387 


16288 


A 


2404 


239 


573 


VCFGVFSLSHFFESEFSSITQAGVQWLN 
LGSLQPPSRGFKQFLCLSLPSS\YWFTG 
TPPLVGGGSAGLRGG 


2388 


16289 


A 


2405 




122 


PTRPGRGCGELRS CHSTLAW/VNSETTS 
QKKQRKEKKXPLL 


2389 


16290 


A 


2405 


417 


130 


QALHVKHVQLVPSSDFLAKTEAQSVNHL 
LRHSSSSSSSSSSS/SSSSSSSSSSSSS 

sssssssssssssssss\ssssssssss 

SSSSSSSSSSSSSSSSSSSFLFLFFFFG 
LHGAFLGRFCPIRR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Meth ionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2390 


16291 


A 


2407 


327 


3 


RQSQSLRYMAHQPVCCFTCSILIRAELN 
EI/ETKNKKIEKINET/RKSWFFEKINK 
MGAITPDATERQNIIQGYYEQLYMQNLE 
NLE YMDKS-LGRFKP P S LNQEE LD TLNR 


2391 


16292 


A 


2408 


178 


404 


AAYRINSLQEVQHPTKHHSQPRKQSKLR 
KGNAQRMNIRRGPGMVAHACNPSTLGGQ 
GGQV/TUJSGVRDQPGQHG 


2392 


16293 


A 


2409 


94 


385 


TSFALVMiGGRSCSEPRSCHCFPAWVS \ 
SETLSQWKRKTPTLHNAKYWGGRFLRFF 
FLTRGGGGRFPFPGGTKTKSLGAGFKKG 
GGGKPGGPPNKGLG 


2393 


16294 


A 


2410 


1 


384 


PTRQVRIKLFPRIYVHTRKRLKNIYGNT 

etfitqpfknhsvwqlirnnemakkqen 
vahkeeknkkqsietnXqkythnkkldt 
tkkkkkkkkkkgggplkktifkahggen 

NFFFLGPQKUJSQAGF 




16295 


A 


2411 


423 


2 


FFFFPPPQKKSLFPPFFFCGPKIFSSPP 
VFLTPPQKIFFCPPKKKKYFPPPRQKIF 

F IFFFFFFLGFLLSS LLIGVP FLLLNAN 
IMILLQKKIIiKDEVQIISSFPSTKNRKV 

L 


2395 


16296 


A 


2412 


218 


112 


LKKENNTKC YYGWGTTGTL IHC \WW\ KL 
IQVLWKTD 


2396 


16297 


A 


2413 


64 


403 


LKNFFFFFFFKKKGPPGAPQGSKPRGLG 
EPPPPPPQRGGNTGGGPGAHPKKNGGGF 
FFFWRGEPP/PPPPAKGGGQDLGPGPFG 
REETNHFPflPAPPGPKlSnaiLGPHAQKI L 
CF 


2397 


16298 


A 


2414 


217 


409 


VKYLSYSVLLTIICTVQACYQELRPGAM 
AHTCNPSTLGGHQGRI /TLR3GVRDQPD 
QHG 


2398 


16299 


A 


2415 


416 


0 


ERPPPSSFFPPLAPGQVGQFFYGKGPWP 
EKSKKKNRFWGTKPFPPEGGGPPPPPNS 
PPGGGEKSPLFFFGGKGNPTWSPGPKFL 
KKRVIKGPLHPGGKGYKKTREFGLKPGP 
PPPKFFLGGG/PGFQNWEKPPQP 


2399 


16300 


A 


2416 


277 


0 


PFVLLSSSSSSSSSSSSSSSSSSSSSCS 
LLVRYLCKVKHKSSLCAHPGTVPFLELI 
HDTDS FAH/VFPADLL 


2400 


16301 


A 


2417 


205 


486 


RHGYGFGFWKNTCMQHRHSYERVYIHRH 
IHLTFLAYVHRESPEAMSALLTQILLFF 
KLLSFLRRCLA/ S VTQAWQWCS LGSL R 
PRPPGFKRFWC 


2401 


16302 


A 


2418 


364 


1 


KE S LGD P VKDTVI LFATRNPMQMGSNYQ 
FFI YLCLTHLLSRSYCWFGCKMLQ/ PLW 
KTVWQFLKKLNAEFPYDPAIPL\DICIP 
VFTASLFI lAQRWEQPKCASTDEWIHQM 
WHMHTIEHYPR 


2402 


16303 


A 


2419 


281 


492 


LFPPSLPASHPKLSTQQPERSHSVTHCG 
/VDRRGSDLGLLQPRPPRLKPSSRLSLW 
S SWDYRPAP PRPANSC 


2403 


16304 


A 


2420 


63 


465 


HLQRFLLSF \VCLCPAP IGGAYRGRQAS 
LSCGGLRPVEASLLLCLPKQAWAiyiAGAP 
PPASLPPCSLISDCCASKTQRDSVGVGPS 
NPGAGYYLVARRFLSPLEKRSIQVGVTL 
FSRCRLSPLSLTRKGNSLTPCAS 


2404 


16305 


A 


2421 


61 


482 


QRARITGVSHHAQ/LRFCLFDMRSHSVA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
IfSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 
sequence 


Predict- 

location 
correspon 

peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \'=pos$ible 
nucleotide insertion 














QAGVQWCDNSSLQPPPSGLKML/STSAF 
QSL/WDYRCEPQHP IHY 


2405 


16306 


A 


2422 


90 


444 


YCFSSECSEKCRCPGHDLQLSPGLCHHG 
VSGLAKPLLCFWPWIWKPQPLCSPTTSL 
PFLPLPPFSQCPQGPAWLELEGRPSLWK 
QGGLQSLAIKR/REGSRAQWLTPVIPAL 
WEAEAGG 


2406 


16307 


A 


2423 




444 


PGPQPQCGPVATLQSPSRPARTDSPSLP 
PHSQLREAEflENED lEflHVRQLQERMEL 
LQAEGATGESLMCPLPRRT/WEEVGRLL 


2407 


16308 


A 


2424 


189 


418 


SREAEP6SFRGCLRVGVPSCTCV\SLWV 
CVCVCWG/VCVCVCVCVCaiEA/CLCLCA 
NFSL/CSHVSLCLSLSLSL 


2408 


16309 


A 


2425 


30 


895 


LDEQCTSEIHRR6EATARPRAPEHPAPP 
ATAVEGRDAASQNL?CRRPGSGTDGIJILQ 
GAEPSRLLRTVaSGAVIPRGTPERAQPP 
PPQDPLGRRRWLSRNTWGPWPGTTQPPS 
PQLLRNDWGSCGFMVPEAARGKVFQDSQ 
EQAHIREETVSKSVCSffiPWRHQHARDPA 
PTNFPLKCQKQHGASTSSGQHGDRVNLV 
FFIDDDYSPPSKR/PKTNEPPQPPVPEP 
ANAGERKMREENSGPHNFVEETKLICLC 
PSGHASCQVHLWTGAMIiGFQSWRKLPG 
SGLKARILQ 


2409 


16310 


A 


2426 


494 


154 


SSRVRCQCANLLGGASQLGCSGVRDPLE 
EAVCPFSDLQPRAGRTTRLFKAQMEMQK 
SPVFCVAHAfiSCEIiEtiFLFGHLGSSLHG 
LWN 


2410 


16311 


A 


2427 


416 


1 


PQRGPLLALEPQRQGftAPVEDLQPQGPD 
KPPPPPLPQPFHARTWTAAVPRHPPPV 
ACHPPQPLAASKPWP/SVAGGDLLPLPG 
PERPVH2VFFIGFIFVHLGLGGVSGRGAV 
APAHSGPVPPRPPSSSTSRFSIiFFLHE 


2411 


16312 


A 


242S 


S4 


409 


DYKHAPTMPRIQWFVYHSPQQPRCENRI 
NFTPKKKKKKKKKKNKKKKKKKKK\KrF 
WGG 


2412 


16313 


A 


2429 


1 


389 


LRDLSSDRSNPGRFLSTSNSSIiY/'EKDK 
RNEAYFTK/RPSPVKDIIST 


2413 


16314 


A 


2430 


456 


0 


PGWPGRGAP\PSQTQWPQRGAPHIPDDG 
QPGRGAP\PSQTGWPGRGAP\PSQTGWP 
GRGAPHLPDNGWPGRGAPYI PDDGQPGR 
GTP\PSQTGRPGRGAPHIPDDGRPGRD/ 

GSSLPRRGGSEAEALLTSQTGWPGRGAP 
P 


2414 


16315 


A 


2431 


3 


344 


CRERRSCHCTPAWAT\SETPSQKKKKKI 
FFGNGPPGGPQAGLKLRAWGFFQKRGTM 
GPGTKNHPGQRGE P PLLQKKQKI TRPGG 
GGPGAPPPQGGGAGKSPNPGGGMFQGGE 
IP 


2415 


16316 


A 


2432 


1 


109 


RPLRRLRQENRLNHGSRGCSEPKLC/HL 
CTPAWAT 


2416 


16317 


A 


2433 


1 


239 


QSFAVLPRLVSNSWAKVICL/ PS VSQNA 
BIT 


2417 


16318 


A 


2434 


3 


464 


DWLQLKMQGEIVALVHSHHGGLPWLSES 
ARRLQVQSDLPWWLVCRGT IHKFRCVPH 
LTGRRFEHGVTDCYTLFRDAYHLAGIEM 
PDFHRENDWWRNGQNLYLDILQAPGLYP 
VPLSAAQPGDVIMCCFGSS VPNHAAI / Y 



692 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 

sequence 


Predict- 
ed end 

btide 
location 
correspon 
ding to 
last amino 
acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CGDGELLPHIPK 


2418 


16319 


A 


2435 


3 


427 


EGARTSSERHPCNKYLLLHSLAHLFINA 
LNLVLKGLSPSPFPALPISFPIFFFSPH 
FLGTPTLEGGRADLPFLQPPGAPG/QPA 
PLNYGPGPYENPCP/RLPQLKPAGPGHG 
LLKS PPPNPGRNWPLLGSLFDFFKKRTS 
IPLP 


2419 


16320 


A 


2436 


238 


400 


qfrwkrgkaiffffflrqspsvthagmq 
wsnlssvqppppgfkqflcVpsvpss 


2420 


16321 


A 


2437 




442 


LGTRPRATDWGVRLELSEACPVLGHPAK 
HPRPQRWCKVNFSYSPEQADELKLQAGE 
IVEMIKEA/CGMPDMPSVSPGPQRPPKT 
TSDKGWWEGECQGRRGVFPDNFVliPPPP 
IKDAQPLLLFGIDQEAGPTESGISGIRF 
RRLSC 


2421 


16322 


A 


2438 


411 


80 


PQAEEGAPTPGSENFNPPPPRGGGAPPP 
PQKNFFPPRGVNPGGGGGQKHPPPKKGG 
/SPKKNPGGDKNPPPKKKKNIGEGGGFI 
GAPR6TPKKTPPPRDGYFQVFFFIVSLK 


2422 


16323 


A 


2439 


290 


1 


QLNKIKKTPLLFLPWANEKVPKIPPNYP 
PGPKKKGYPPS I FFFFFLRQGL/NSVTR 
AGVQWRDLSSLQPPPPGFKRFSCQKPNS 
AFPHASADAWVDP 


2423 


16324 


A 


2440 


440 


213 


PFSRPLFFFSPPPKKRASPPPFFFCFPR 

vffpppff/skpppkffffpppkkkkis 
ppppkkiffffsppp 


2424 


16325 


A 


2441 


131 


408 


GCVPPEPAFLCFVLEIVSSVAQAGVQtOR 
NLSSLQPSPPGLKRFYRPRLPSS/QDYR 
RAPPLA 


2425 


16326 


A 


2442 


102 


351 


QASSSVLKLCVCVRARLCV/ CACVCACV 
CVC/VCVCECVCVC 


2426 


16327 


A 


2443 


190 


3 


PQGAREKSHRPGPI GRRLKL/DPPFLSP 
HVKINPRWIKDIiNVICPTTIKTLEGtJLGN 
TLLDTCPG 


2427 


16328 


A 


2444 


410 


30 


VCVAPPLCVCIGTLCVRTHTQFCVCVHT 
HSLCVCR/CQFLCVCVQ/VAVSVCVCRA 
SLCVCR\ALCVCVCR\TVCVCVHGTVCV 
CVCRDS FCMCVCAGAVCVCVCVCVCV\C 
LCVCVASAQLLGRGFCSSVNI KGAGP 


2428 


16329 


A 


2445 


243 


1 


KVMVQNKGPFSNLFFFGPTINFFTPQFK 
QGGGQNPNPFLb'FFFFMRQGL/DSVTQA 
GMQWCHAGSLQHLPPGLKQFSCLLP 


2429 


16330 


A 


2446 


276 


408 


MLKNCAFWPGTVAHACNPSTLGGQGGRI 
/TLRSGVRDQPDQHG 


2430 


16331 


A 


2447 


24 


405 


LGDVCAFFFFFYLKNQLNPGGKLILPVG 
PAGENQRWEQFDKLQKGGIKMNPLRGGX 
SVPLTKKKNQ/WARGE 


2431 


16332 


A 


2448 


239 


3 


S PL CGHNVYKP S TVEKTNQ VEKMPPS KQ 
IWELCVELLWQ/SNRGIAGSGAHACNPG 
TLGGQGWRI MRSGDRDHPGQHGETP 


2432 


16333 


A 


2449 


406 


144 


GCSEPRSRPCTPAWVTSETLS/RKKRKR 
RKKKSCLLRAILSTSPELTHFTLTTPLF 
SRYNDYPHFRDEKTEARRVYATCSGSHS 

WKRLG 


2433 


16334 


A 


2450 


349 


1 


GGAIFLTPRPKVPLTPITPLAFNPGHQE 
E I PLPKKKKKI VEAI YDKPTAWr ILHGQ 
KVE VFPLKTSTRQG CPLS S LLYQHKTRM 
PSL\PLLFNIVLDVLAEA1KQEKEIKGI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Meth ionine, 
N=Asparagine, P=ProIine, 
Q=Glntaniine, R=Arginine, S=Serine, 
T-^Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














QIR 


2434 


16335 


A 


2451 


382 


19 


DLGRMTAGSGDQRCAVGVKLLS FS LGAA 
GKGPKHSEAEASLPRNPGSYNRQAQWPQ 
SSLHLKGTIPDTTSLNTPWKMLSSLKVP 
S WQGAVAHDCNHS TLGG \ RGGW XTRS RV 
RDQPGQHG 


2435 


16336 


A 


2452 


3 


396 


FKLKLMLIYHSENP RALKNYTKS TLSML 
YKWNNKAWMMAHVFTTWFTEYFKP lAET 
YFWGKKKITLKILPLVDNAPGHPRALME 
M YKE INV/ VQI C ILQPMDQGVI STIiRS Y 
R\KNTFCKDIAAIDGDSSDRT 


2436 


16337 


A 


2453 


115 


411 


KGGNFPPPGGGGEKKIW/ PNGGPPPRGK 
RKPPPPPPGGGGKGGHHPPPGPIFFFEK 
KKKGLFGG/PGGAPNSHPKRHPPPWFPE 
GGELTNPPTFFFEGGAF 


2437 


1633S 


A 


2454 


3 


114 


HHV/GQAGLELLTSDDPPASVFQSAGIT 
GVSHRIRSVS 


2438 


16339 


A 


2455 


397 




WSWQK\NRRIDQRSNIESPEMNPRMyGQ 
VIFDKIAKNTEWEKDSLENKWCWKNWI/ 
LKRMKLDHPHLTP \KQKSKWMKDLIRPE 
TEKLPEETGGNPHD IGLSNDF\LDLTPK 
AQGAKVNTDKWDWIKLKKFPTRP 


2439 


16340 


A 


2456 


424 


3 


PKKKKIFFPPPSFfelFFFlEAPPFFE'FL 
CLSHFLLNRSRSLSCTTSCCVSTIPTSL 
CNKSSGVXCGLHCSLLAI/CSLIHLTLC 
PFCVLLVCMCDWCVCVCVCVCDTVCVC 
PCPYGTLDIAFKHFFSRWSLTLVaQAGV 
QR 


2440 


16341 


A 


2457 


266 


379 


HWPGAMAHTCNPSTLRGQSQRI /TLRSG 
VRDQPDQHG 


2441 


16342 


A 


2458 


10 


409 


SRTGPNPRAQTDRPWCVAFACFELPLW 
RSVDSATHESBAGGLL\DPG6R6CSELQ 
LCHCTPAWV/TSETIi 


2442 


16343 


A 


2459 


184 


387 


IVHFQrOHKMIKVAYIIPQFYSFIDEIQS 
HSVTQAAVQWHSHSSLLPLPPGLKQSSH 

/LSLPSSWDY 


2443 


16344 


A 


2460 


110 




KWRVS FFF/ CSFETESRSVTQAGVQWCN 

LGSLQPPTP 


2444 


16345 


A 


2461 


3 SO 


2 


FFFFFSETESRSVAQAGLRT\QWCNLSS 
LQAPLPG 


2445 


16346 


A 


2462 


382 


29 


NGPGHGGPPVIPGTWGGQGGGFPRF\GS 
KPGFTWGNPPFLKNHKNYPGGGPPPVI P 
NFLGGKPGNFFYPGGGGFQ/SGPGAVFP 
PPPGQRRSIFLPKKKKERKLILFFSLEQ 
RLQNCDA 


2446 


16347 


A 


2463 


228 


1 


KKGTLFKPPPPGGRFFFFFFFETQSRSA 
VTQATVQ/WMPSRLTANPTFRTQGILLP 
PGLKGSSCLSLPSSWDYRHV 


2447 


16348 


A 


2464 


395 


1 


RLRRENHLNPGGGGCSEPRS / HPAWQQS 
ETTSQKTKTKTKQKALASDTVLSPRQSS 
EERFHLSLFHSSFVTPFGIFSFLTTSPH 
PILPVWLAPQLPAIYCYGDSVIKSNFQL 
WLSKELQESLSRLLWTQAFS 


2448 


16349 


A 


2465 


29 


299 


ETPNEASPKTSWDYRHVLPCIlMTFFIFF 
VKTGL /HRAGLELLTSSDLLCFPKCWDY 
RH\DRSTWPLSSVFGSIFLVYYWPLYLI 
ETNMLTTL 


2449 


16350 


A 


2466 


1266 


1473 


YFVPQSTQNHAAVFRVGSLLQEGCGKIS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

I'miloadd 
residue of 
peptide 
sequence 


Prcdict- 

location 

ding to 
last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=l.ysine, L=Leucine, IVl=]VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=lInknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KLYGDLKHLKT \ FDRGMVWNTDLVETLE 
LQNLMLCALQTVNG 


2450 


16351 


A 


2467 


49 


356 


VQVILLPQLLRRLRQENCLNPGGGGCSE 
PRWSHCCP/PAWVTEQDSISKTKK 


2451 


16352 


A 


2468 


115 


3 


LSHTKWSAWPGAVA/ STLGGRGRQITRS 
GVQDQPDQHGE 


2452 


16353 


A 


2469 


3 


404 


FVAPGGGQGTFPGSLQPWPPRGPGASGS 
PSRAPLIFGFFGGKGVSEVGPGGFLLPG 
SRD/SGPPGPPKGWGSG/GLGPAPGP 


■ 2453 


16354 


A 


2470 


404 


218 


FAQWIKWGALSSPKPPFPGFKLFSRPS 
PPSYWDYR/RRPRPPRLFFFFFLRQGIA 
LLPRLE 


2454 


16355 


A 


2471 


386 


258 


AGGIiLSPGVQGYlIKLLSCPCKPAWTTXS 
ETLSQKKKKKKLPFY 


2455 


16356 




2472 


2 


364 


hhtqi ifvflvetgfhhvgqagvellts 
gdppaswdygrXghrtwpyshi FNNL 


2456 


16357 




2473 


1 


292 


KKKKKTPLLRGPEKKNGNPPFRGFFRKN 
PGFKRKKGGAPPPGLFKIGRKKSPVGGH 
RHTFILGGPLPRF 


2457 


16358 


A 


2474 


1 


307 


SLSCSSIVRRACFPFTFYHDCKFPEA/S 
PVMLPVKPVEL 


2458 


16359 


A 


2475 


2 


596 


MKNftEDILTMEVLKSTMKQELEAA\QKK 
HSLWEIiLRIPNICKRICELSFVSSSSSS 
SSSSSSSSSSSSSSSSSSSTKITAW/LF 
PLEAS FYRSTCLMPARALLFASTIPFWG 
LTLHLQHLGNNVFLLQTLFQAVTLLZUSrC 
VTiPWALNHMSRRLSQMLLMFLLATCLLA 
I IFVPQQEKSSQVEERKCLSLFSQQLPW 
SHLS 


2459 


16360 


A 


2476 


367 


44 


YQEDITIMN/TYALNIGVPTYLANI/DL 
NREIESNIIIVEYFNTSLSKMDRYRSSR 
QNIDKETVDLKYSI/HINQMDLTDRYRT 
FHPTATERDSISKKRIKIKIKINKSLT 


2460 
~246i 


16361 

16362 


A 

"A ^ 


2477 
2478 


12 
^1 


362 

i 


HHEPG\GGGCSELRLC3ICTPAMVTELDS 
VSKKKKKKKSQKKK 
TFWFSFTRffliPTO\M;ECSaA^^ 
QCI 


2462 


16363 


A 


2479 


3 


353 


YMCVCVCACVCICVLPWMCLCVC/CYCV 
CVCWYLCVCLCVSLCGHQHLAVSGKRSQ 
PPSHESFKTSLLLWVFKNLPPLLCAWVP 
QVSQNMPLDGTRTLMNVSLPDEKVHDLL 
LPLTAA 


2463 


16364 


A 


2480 


57 


361 


MNESLLS CKRWSLLAFSHI /CVFCLI CS 
TDI SALCVTVCRCMHVCGCRCVCMCLCV 
CLCLCVCRCMYGRVCAPMCLCVCVHCVL 
ARMCACGCGCVYKAECPI 


2464 


16365 


A 


2481 


3 


381 


ymcacgyicmha; cvpvcmcvctclcah 
vcl/ccmcvctfvpvcmcvc i cvclcvc 
tpiitlpflsqerisfctdlrsfratak 
rsheevkncvylqtiwnqyliifylqnp 
nfgwaqwlmlvi pawe 


2465 


16366 


A 


2482 


2 


380 


IHVIjGNCLYFSEPQFPHWQQRITRSFLE 

gygenlmrlhmsyihmciihicayici/c 
/ lc vy i cvllcay i cvcvylc vc i cmc i 
fvylcthicvfvcvlvciylcv/c/vcv 

CICVCI 


2466 


16367 


A 


2483 


296 




NGTTIRRMLPI FSRYRPGMPGLVQCTI Q 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid i 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=tJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SCTAMSVPYSHGFIYFFIYLLRDRVWLC 
YSGWSAVAPSWLTITSN\YRLKPSSCLD 
LLGSWNYRQMPPCI 


2467 


16368 


A 


24S4 . 




319 


NTVGLCVCV/ CLFVCVLSLCV/ CLSLCV 
/CLCVCWSLCVC/VSLCM/CVSHCVCL 
CVYLWLCVSVCVSLCVSLCVCVYECVIS 
STKRSLGAPSRAEAAKLPRWWAPAKQSH 


2468 


16369 


A 


2485 


3 


334 


NHFIISIDI/DKHFRKIHQPFVPKfllTK 
LGI KGNFLNLIJiLGTSVKSTGNINM\NGE 
KPHNLPLRLVI QGDVLFHLLLE VIASAV 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KK 


2469 


16370 


A 


2486 


17 


409 


CLDNKKPRKEYCQKTHLIKKNPLLSRIC 
KELLKLYDKRMINPIKKWAEALYRCLSK 
GDV/HEMASKHMKRCS IS /L/S IRKMQM 
KMTL 


2470 


16371 


A 


2487 


410 


165 


LECNVTIMAHCNLRNLGSSDPPTSAS \Q 
TAGIHHSLQHFFFT.FFVKTGFCYTQAGL 
ELGSSDLPTIiASQSVGVTGTGHGTWP 


2471 


16372 


A 


2488 


142 


417 


VEQLLEGYRTKSLYmSFFLFVCMFLRW 
SL/DSVSPGLTNCHHLGSLYPPPPGFKQ 
FSCIS\LWAWRRTP 


2472 


16373 


A 


2489 




404 


KAGAQMGIKHPQHSSRLRLRRPSPRLPS 
SQKPIiNTHYFPSFSFCMffiHTHTHTHTH 
THTHTHTGV/HLHPWr 


2473 


16374 


A 


2490 


373 


146 


YGLERKISFKIL/LFVGKSPSYPRALME 
MCKINWFMHftUTTPTLQPIDQEVILGP 
GTVAHACHPGNLRGQSGENA 


2474 


16375 


A 


2491 


427 


85 


PPTPPLGECKKNP\PPPEKKISPPPGFPP 
PPFKKGPPKTLFKTPLKKKKGPGAPQKK 
PFFFKTPPPFFFFFF 


2475 


16376 


A 


2492 


443 


240 


FLFFFFFXXFXXFFFFFXFFXFFFFXFF 














FFXGCLFIYLFl 


2476 


16377 


A 


2493 


3 


135 


ISAHCNRNLLLPGSSNSTASASPVAEIT 
GACNHNRQDFSFI ILQI 


2477 


16378 


A 


2494 


37 


420 


AHVCVCTCVCVCICLSRRVWCAHSHVC 
AGIFLHLINQTFTTHL/CVCKVMALRVT 
PSPCLQGAXQTQLaGQCMCAKVCMNFMY 
I F lEGHI CPQTCSCLGEPIRGGLSPLVC 
IC/MCftGFFVSFC 


2478 


16379 


A 


2495 


402 


178 


RQNKTPFGVKKQITGGGGRGRVSYFLRG 

vsqenplnpggggcsep\rppgwgtknp 

LFQKKKKS SKQQEGRGED 


2479 


16380 


A 


2496 


138 


439 


RTF ILGDHHHHPSAVHFHLPRRKLCTHE 
TGAAPPPPALGATVPPSVSVCLTGGGPQ 
ITPPPPALGRRDLPSVSVRLTMGGPQIT 
PPHPDQQPL/SPPSVSVC 


2480 


16381 


A 


2497 


118 


382 


VKS FCASMRACVCECVYMCVSMCVCVCV 
C VSACLI CVCLCQ/ CDLINKCV 


2481 


16382 


A 


2498 


164 


439 


vksfcasmracvcecvymcvsmcvcvcv 
cvsaclicvclcq/cdlinkcv 


2482 


16383 


A 


2499 


1 


411 


ISTSFYTDT INRTALHFAVGRNHLSAVD 
FLLKHKARVDVADKDGMNALHFATQSira 

vriveyliqdlhlkdlnqpdepkesplh 
lwinnhitwmsllsaqhdidilMrs 
skppiieivaadrgwvklve/lllkagcd 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamie Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginiiie, S=Seriiie, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2483 


16384 


A 


2500 


80 


404 


ETAAAVSTVSFPQDFEGQSPKCTQGVRE 
ALRQ I K/RLVPTGSLRH/WPAGSLA/LC 
QPLSDEKDLTQLFMFARNAFTALflMMDY 
PYPTDFLGPLPANPVKGRRRLPRRRERP 


2484 


16385 


A 


2501 


215 




AAIQQGSLACSHSVPPATTPRAYTPVPP 
QLLVRNF\YPKTLELRSQLRCARRFPRE 
TGADCRHAGAGRQTK 


2483 


16386 


A 


2502 


423 


214 


wqaqvggllep\ggrgccelrs chctpa 
wvt\geilsqnnpkrqqnktkqkmgpgv 
pgswgspraggltv 


2486 


16387 


A 


2503 


294 




lhwnqrqrcppppppplppppplppppl 
plr\hriagagqtgakirwgrwrvga 
glgvaaa/dsre scpdgaaagggegggd 
saatagpgggggggtk 


2487 


16388 


A 


2504 


331 


411 


PFLKTGNKGAPPPTKIFFLFFKKKGG/W 


2488 


16389 


A 


2505 


407 


195 


GGRGCSELRSCPCTPAWVT\GEVLSQKK 
RKSSNKKSRCLEDQAEVGGERQLVQVSV 
PSTCQRRVGAWFIYL 


2489 


16390 


A 


2506 


1 


468 


RQGSMNKLETERQIKKAPARNFERERER 
ERESKRGEDRHTDICRP/RERERQNYRD 
RQ 


2490 


16391 


A 


2507 


27 


380 


FVCLCPAPRGGAYRGRQASLSCGGLHPV 
RASWLLCLPNQAWAMAQAPP PASLLPCS 
LISDCCASNQRDSVGVGPSEPGVGYNLM 
VRRFLSRSEKRNIRVGVTRFSRCVCHPF 
L*LGKG 


2491 


16392 


A 


2508 


396 


234 


RQENCLNPGGGASSEPRSRLCTPAWATE 
RDS ISN/QNKQTNKSHILRAAPSRPSCY 


2492 


16393 


A 


2509 


1 


421 


AKKIKPPFPLKKKKKPGRGGPPPPPPP/ 
AKFGRETPLNPGGKGSIKPKLGPPPPPP 
GGPPNLPPKKKKKKAIPK/PLVPSHVKS 
TGFPPSCS/RSALLRAFTQ 


2493 


16394 


A 


2510 


197 


3 


TGPPDENFFFFLEMKSHSVAQAGVQWRD 
LGSLQPPSPQFKQFYYWGLQGA/LPPCL 
NKICIFSRGGV 


2494 


16395 


A 


2511 


114 


454 


PPWWFKKGFSRTPFEIKKKNPPGILGGS 
PKKFFFFFFFEMVLLHHPGWSALRQSLL 
TTASDF/LRLKQSSHLSLSS 


2495 


16396 


A 


2512 


3 


495 


FFLSRGLFIHLESAPAIFQCLLFWFLV 
FVATGSHYVPQAGLKLIiflSNEPPASNSP 

KCWVYRHEYPALKCLCFGLCLF/AFLP 


2496 


16397 


A 


2513 


25 


437 


PLLFSPPAGHAMEEDILPPAPSFLFYFF 
FYWLVWVKI lYLYVLAQAATTNYHRLSG 
LNNKHLFLTILEPRMP / S IKVPA 


2497 


16398 


A 


2514 


2 


256 


QAPSAEGEMTSYVLLAYLTAQPAPTSED 
LTSATNIAKWITKHSXWAQGGFYYNQDN 
LPYSLCSGRNESTAFGTNGQDIIHPVHS 

S 


2498 


16399 


A 


2515 


2 


284 


KCQCDELCSYYQSCCTDYTAECKPQVTR 
GDVFTMPEDEYTDYDDGEENNNAIRHEQ 
A\GFTSVISDLQAQI IWISEQKHAMSFV 
EEMTVLEVIL 


2499 


16400 


A 


2516 


2 


432 


RDCERESSRAAEYTASLKASCWMGDLAE 
SPSLDPQVTiMGTPPAGPCGWSVGTRSQP 
FPRGPSRGPCCVSPPSWTPEPGTGQ/CG 
EGGGRWSPQVPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Prediet- 

otide 
location 
correspon 
ding to 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, Tr=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlcnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2500 


16401 


A 


2517 


401 


2 


GGEAAIRGQTKPQLEGKAPECSECEKKI 
VRS \ P RL I RRQRT PT GE KP YE CE E CGKS 
FSRRYRLAQHQR\TRAGEKT YE CNE CGR 
GFSERSDLINHYRVHTGERPYKCDECGK 
NFSQNSDLVRHHRAHTGEKPC 


2501 


16402 


A 


2518 


398 


1 


ETFGK\SGGRSIVPGQFLAVGPKGRAVM 
ISAIEKQKLVYILNRDAAARLTISSPVE 
AHKASALVYHWGVDVGFENPMFACLEM 
DYEEADNGSTGEAAGNTQQTLTLYELDL 
GLNHVTOKYKEPLEEHGNCI 


2502 


16403 


A 


2519 


384 


3 


GGGID/SDASLVIAGVRLEDEGRYRCEL 
INGIEDESVALTLSLEGEALPLP PHS CV 
AAGPPRLGLPGLLPS I S SAPLGT PAP S P 
RPRRPSSPSAP IRWPSPGSPPPPRCGVS 
VPTQPGPVPVQLLRGVY 


2503 


16404 


A 


2520 




426 


GDRQMITAEiLRKLKQQSRESVEENRPRL 
LKALKELGDFTLELHWDFQSWVPLLSRI 
LPSDACKIYKQGINIRLDTTLIDFTDMK 
CQRGDLSFIFNGDAAPSESVWLDNEPK 
VYQRLRHEES/QEKNHTQVDCLTP 


2304 


16405 


A 


2521 




425 


ALPIGRMPIMVRSSR\CVLTGKTPAEFA 
KLNECPLDPGGYFIVKGVEKVILIQEQL 
S KNRI I VEflDRKGAVSASVTSSTHEKKS 
RTNMRVKQGRFYLRHNTLSEDIP IVI IS 
KPMGVESAQEKIQTVSPQEPEWR 


2505 


16406 


A 


2522 


2 


376 


IQSFSWYGYHLCSQGILSERVSASKFP 
LLFFFFFKKGGPPFMP/QGGKTKGPLKP 
GDPQSPGIKGFPCLAPPNKGKKGGPPPP 
RGNFLGFKKKKRFPQVGGGGSKPPREGK 
PPPLAPQGGGNRGG 


2506 


16407 


A 


2523 


403 


3 


gaprslsekerqlmgminqlssfreqll 
hahyeqkkiaasqiekqrqhmklgkqXq 
qeqiarqqqrlroqqhkinllqqqiqvq 
ghlpplmipvfprdqrqldalaqqgfll 

PPGFSYKftGCSDPYPVQVIPLY 


2507 


16408 


A 


2524 


2 


369 


NARCLILRAAEYMAACVYVCISVFACMC 
FCVRVCVPVCVSGCLCVCVSMNVHLSLC 
/VEQCaCLWMCVSLHVCLYICaCLCVSV 
CEAVCLHLESHCRGGSRPFPVTOPAFSL 
SSCLILAPPSVT 


2508 


16409 


A 


2525 


59 


380 


RiraNFNKErSSFFFFFKKGPWGGGPGGTK 

gpkqrqgep\nhwgqkitfpgqtrkprkp 
tpgppppkki fffkplekkgpgqgpqgg 

QKPGGQKKPPGGAPQKRGKTGGET 


2509 


16410 


A 


2526 


1 


371 


VMPLSRKHPKGFFSNHTLVLIIVAESEMD 
KERVFQATRKSSP/CFVPLLPWPT 


2510 


16411 


a 


2527 


445 


3 


RWLGLALIALKRELKKGDL\PEMRWWDS 

SI ipngXfdlteetpkredyfgiawlve 

HPAQLWPPVDNDTPVTLGVYLTKKEQKK 
LRRQTHREAQKELQEKVRLGLMPPPEPK 
VErSNLMRVLGTEAVQDPTKVEAHVRAQ 
MAKRQKAHV 


2511 ! 16412 

i 
I 
1 


A 


2528 


1 


478 


rptrpkrklkshrlqsrqeskkvrvl/t 
naednemeeetddgpllvprvkvaedgs 
I I ldee sltve vlrtkgp c WE end P I F 

ERGSTTTYSSFRJOJYYSKPWSNKETDMF 
FLAI SMVGTDFSMI GQLFPHRARI E I KN 
KFKREVYASAIEDQATSVHT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

peptide 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2512 


16413 


A 


2529 


385 




QKKCEDLK\DQDNPrVHPPPTPGSCGHG 
GWNI SKP YWQLNWRGFYYLYGAWGRD 
YS PQHPNKGL YWVAPLNTDGRLLE YYRL 
CNTLDDFLLYINAREIRITYVQGSGIAV 
NYWMKYVSMYNTVNV 


2513 


16414 


A 


2530 


1 


1228 


fratlrpetmfgqtncwvrpdmkyigfe 
tvngdi fi ctqkaarnmsyqgftkdngv 
vp wkelmgee i lgas l sap lts ykvi y 
vlpmltikedkgtgwtsvpsdspddia 
alrdlkickqalrakygirddmvlpfepv 
pvieipgfgnlsavticdelkiqsqndr 
eklaeakekiylkgfyegimlvdgfkgq 
kvqdvkktiqkkmidagdaliym\epek 
qumsrssdecwai.cdqwyldygeenwk 
kqtsqclfcnletfceetremfeatlgwl 

QEHACSRTYGLGTHLPWDEQWLIESLSD 
STIYMAFYTVAHLLQGGNLHGQAESPLG 
IHPQQMTKEVWG\YVFFKEAPFPKTQIA 
KEKLDQLKQEFEFWYPVDLRVSGKDLVP 
NHLSYYLYiraVAMWPEQR 


2514 


16415 


A 


2531 


335 


2 


KKKALFFIHPPPFVNGRPHKWPKIKSLG 
SPTPFSLIFLKKKFCFVNQDGVRWLYFG 
SLQSLPSRFPPFFCLNLLSSWEYRGLPP 
RPGKYFFPFIFFYFLVEM\GFTVLAEMY 


2515 


16416 


A 


2532 


3 


380 


AINSYIRGDDPSSYPEWQSASHSSK\W 
SPLPRALHLTDAK 


2516 


16417 


A 


2533 


3 


2083 


S SEG YLRGNMSENEEEEI SQQEGSGD YE 
VEEIPFGLEPQSPGFEPQSPEFEPQSPR 
FEPESPGFESRSPGLVPPSPEFAPRSPE 
SDSQSPEFESQSPRYEPQSPGYEPRSPG 
YEPRSPGYESESSRYESQNTELKTQSPE 
FEAQS SKFQEGAKMLUSIPEEKS PLN I SV 
GVHPLDSFTQGFGEQPTGDLPIGPPFEM 
PTGALLSTPQFEMLQNPLGLTGALRGPG 
RRGGEARGGQGPRPNICX3ICGKSFGRGS 
TLIQHQRIHTGEKPYKCEVCSKAFSQSS 
DLIKHQRTHTGERPYKCPRCGKAFADSS 
YLLRHQRTHS GQKP YKCPHCGKAFGD SS 
YLLRHQRTHSHERPYSCTECGKCYSQNS 
SLRSHQRVHTGQRPFSCGICGKSFSQRS 
ALI PHARSHAREKP FKCPECGKRFGQSS 
VLAIHARTHLPGRTYSCPDCGKTFNRSS 
TLIQHQRSHTGERPYRCAVCGKGFCRSS 
TLLQHHRVH\SGERPYKCDDCGKAFS \R 
ASDLIRHQRTH 


2517 


1641S 


A 


2534 


434 


3 


APLHSGKRSPTKCN\ECG\GAWNRSSLL 
DRHKI IHSEENPNKCEECGKAFKQASRL 
TIHKIIHAGEKPYKYEECGKVFSQSSHL 
TTQKILHSGENLYKCKECGKACNLFSiSL 
TNHKRIHA.6EKPYKCKBC6RAFNISSNL 
HKQECI 


2518 


16419 


A 


2535 


46 


454 


PSTFSSKVMDKQTLCSSQATSNTSRYAA 
ALYRQGS lYPKEMKTCVHTKPYTWHSS 
IFMIAKKWKQPRCPSA\DSWRNRMWSIH 
AMGCYLTMEKNEAVMLPRR/WSLENIVi 
SERSH 


2519 


16420 




2536 


13 


476 


ALKTYKYSHKKAFNPKKVYGKCSQKDPF 
PrNHCLPTEKLHLCDKIGEGVPGKVFQT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to firs^ 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














lADHTPVAIKIIAIEGPDLVNGSHQKTF 
EEILPEIIIS KELSLLSGEVCNRTEGF I 
GLNSTOCVQGSYPPLLLKAVIDHYNSTKG 
SANDRP/DFFKDDQLY 


2520 


16421 


A 


2537 


162 


296 


YVCQRYKL\NPFLPSYTKINCKWVTDIN 
VKPTSVKLLQEKRRKSL 


2521 


16422 


A 


2538 


495 


348 


FGW\HAFIVKEPRVEKLCKASAHAKPQP 
PAIIAKTFKAGGITGQYAQAAL 


2522 


16423 


A 


2339 


480 


63 


ARS EAW I S I ADAYMAS PGDRI AQLLLL P 
YIKVGNSEIKRTGGFGSTDPTGKAAYWA 
SRVSENRPACKAI IQGKQFEGLVDTGAD 
VS I lALNQW/ PKHWPKLKAVTGLVSVGT 
ASEVYQSTVILHCLGPDNRDSTSHYSKT 
R 


2523 


16424 


A 


2540 


2 


509 


NVDADDVRLAIQCRADQSFTSPPPRDFL 
LDIARQKNQTPLPLIKPYAGPRLPPDRY 
CLTAPNYRLKSLIKKGPNQGRLVPRLSV 
GAVSSKPTTPTIATPQTVSVPNKVATPM 
S VTS QRFTVQ I PPS QSTPVKPVPATTAV 
QNVL INPS MIGHKN IL I / TTNMVS SQNT 
A 


2524 


16425 


A 


2541 


501 


1 


QWQWEDPGPGPRPREEASSYKCELREQ 
\LPQYVRDFFRKKAE\SGMDSSRNLEKL 
AERFLAKTCSTKDQQFKKDQHVLSPVNC 
CHLLLTQVKRESRAHTTLSDIYLNNI IP 
RFVQVSEDSGSLFKKVQRYFFTEVRGMS 
WDT I FKILLDIMLI TWVTQLSVHQTPV 


2525 


16426 


A 


2542 


473 


287 


EKDFNLPPKDLRL/KTSDV/TSTKENEF 
EDYCLKRELLMGICEMGWEKPSPIQVC 


2526 


16427 


A 


2543 


268 


482 


KKKKAWGLQRGGKNFPAKRELPTHGIQI 

aSCSVRL/VDIKKEKPFSILKVEGQAQA 

RTHr.NP AFDI IVLTRGG 


2527 


16428 




2544 


407 


1 


CKKICYLIHYWWEYKMIQPLWKTVWQFL 
\KKLNILYDPAVILYDIYPEELY 


2528 


16429 


A 


2545 


28 


399 


FRHSSFQRSGRGSQLMVHFLSL/SVNPK 
IGSVAGINYGLVAPPATTGETLDVQM/K 
GEADTEHH 


2529 


16430 




2546 


2 


365 


FWWVDEVGGEALGRLLWYPWT'QRS'^E 
S PGDLSTPDRVMGNHKVKaHGKKVLGAF 
SDGLALLDNLKGS FATLSELHCDKLHVD 
PENFRLLGNVIVCVLRHH/FDKEFAPP 


2530 


16431 


A 


2547 


375 


1 


GFACRRMQKRVREVSHAEAESAASKKVR 
ANGSGKQNEGMMVTWTATLALSQAVISS 
ATYTQMQPHSLIQQQQQIHLQQKQWIQ 
QQIAIHHQQQFQHRQSQLLHTATHLQGA 
\QKQKQQQHEWR 


2531 


16432 


A 


2548 


3 


376 


ELGSDVA\GAEALVDRRQERKGEIDAHE 
DSFKSADESGQALLAAGQYASDEVREKL 
AVLSEERAALLELWELHRQQYEQCMDLQ 
LFFRDTEQVDNWMSKQEAFLLNEDLGDA 
VDS/IKEILKKHE 


2532 


16433 


A 


2549 


360 


1 


RGEMLWTV\NNRFLKNFVPGKIEPFKSH 
SLYPPCYVHDVSFWIDQKKGFDELEFHT 
VGRAVSQDT I I S I Q FLS RFQHP KTQQVS 
LCYRLTCQTCDKALTQQQVASMRSQIRK 
EIQQHQY 


2533 


16434 


A 


2550 


2 


403 


WAEEDTELRDLLVQTLENSGVLNRIKA 
ELEAGVFLALEEQEKVENKTPLVNESLK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartic Acid, 
E=GIutamic Acid, F=PlienylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, X^'possible 
nucleotide insertion 














KFLNTKD\LQGLEGHENLflRDLGlIEAE 
GTVGC3PLLLEVIRRCQQKEKGPTTGEGA 

LDLSDVHSPPKSPEGKTSAQTT 


2534 


16435 


A 


2551 


1 


409 


VPENPTPLGGPGGPILRSREGGPPGLPR 
EPPSVLKKREPPTRGGGRPPGFPLPRRV 
RPEKWVNPGGQTFPEPKLRPRG/HHPGG 
QTKNPFPKKK 


2535 


16436 


A 


2552 


256 




HEMDGTLLGPFPPPGSSKBPSLTGPPIL 
FQSPGP/APHTPSSSPANLKT/CTPVCP 
SHLPW/CCPLCLPMRLPWSPVPPVSKSS 
PPALY 


2536 


16437 


A 


2553 


39S 


3 


ARQQQQLLQQQHKI /NWLQQQIQVQGQL 
PPLMIPVFPPDQRPLAAaAQQGFLLPPG 
FSYKAGCSDPYPVQMIPTTMftGARAATP 
GLGPLQLCJQLYAAQIAaMQVSPGGKLPG 
IPQGNLQAAVSPTSIHTDKRCI 


2537 


1643S 


A 


2554 


20 


^17 


"DRPPSTKRRDTPQiiMMTSlSPRGPA^ 
PECPEH/SPRKKTSACRPPLPLRPSHSS 
PLP/SPQPSHSTPQASCPLPEALSPPAP 
FRSPQSYLRAPSWPWPEEEHSSFAPDS 
SAFPPATPRTEFPEAWGTDTPPATHRSS 
WPMPRPSPD 


2538 


16439 


A 


2555 


361 


3 


kssqealeealrqrleelkklclregel 
pgkxpveyrldpgedppivrrrigpXaf 
kldeqkilpkgeeaslerlerefaiqsq 

ITEADRRLASDPHVSKKLKKQRKTKYIN 
AVKKLQVY 


2539 


16440 


A 


2556 


470 


2 


lkaavtaglevpsdvsdrafe\wlsafp 
lNdspysihhperiqvssekeaapdaga 
eritadsdlaysskvlllsspgleelyr 

CCMLFVDDTAEPRETPEHPVKQIKFLLG 
RKEEEPVLVGGEWSPSLDGLGPQADEQV 
KVSNAIRCAQAQTGTV 


2540 


16441 


A 


2557 


2 


315 


waeedtelrdllvqtlensgvlnrika 
elraavflftleeqekvenktplvneslk 
kflntkdgkmfslflfiylNeflnldti 
weksgisglffvfncyessf 


2541 


16442 


A 


2558 


372 


3 


CNISSHFANKKCQDVIVaARNVMTSQIH 
HAV\KIIEGFHINVPGLPPPDEDTELEV 
QKVSNPQYHEVMNLELENTLDQHSYSLP 
TCRISEYVKKLMELAYHSLLEAASSSDQ 
CADQLFYSVRCI 


2542 


16443 


A 


2559 


64 


435 


WGDAS CTGRAQLGI AHKS VL P TLTDKFW 
TIPAKTPMSFFKEPDKLTPKFT/WKNRT 
PRTVK 


2543 


16444 


A 


2560 


113 


375 


VPGPARDSTQRRAVKNDKKELMSIPGIP 
GPVQVPGGGLST\GMRGFPGSSLGFLTI 
PGGALIPFSPAFFSRVGGDLSPENTGPG 
QKPG 


2544 


16445 




2561 


431 


51 


RKIYRVYERENFRVEIMFRFSHTSKKVC 
KGNV /D VAKF I KLHRDGMHVLNVQ CACH 
QKGGIYWFRSIHVELIGYPPPRSSSHIK 
IGDKVRVKASVTTPKYKMGSVTHQSVGV 
VKGNI IWVIKFLMLTFH 


2545 


16446 


A 


2562 


1 


455 


PEGIALEWTESTGKEREHTFQPGDNVE 
VCEGEL INLQGK I LS VDGNKI T I MHKHE 
DLKDMLE FPAQELRKYFKMGDHVKVIAG 
RFEGDTGLIVRVEEWFVILFSGiAMHGL 
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Predict- 
ed end 

otide 
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correspon 
ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, P=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KVLPPPLQLCSETAS/SIN/VGGQHEW 


2545 


16447 


A 


2563 


407 


1 


KWDHPLSKCEVPCGGSIISSNGPVSYPG 
FPSPY\TSSQDCVWMITVPSGHGVRLWL 
SLLQTEPSC3DFIAIWDGPQQTAPRLGVF 
LRSMVKKSVQSSSIQVLLKFHRDAATGG 
IFAIALSAYPNTKCPPPTIHPCI 


2547 


16448 


A 


2564 


3S2 




tqevevavsldrapafqsgcqsetlsqn 
sns snktwi ldhfglsdfldqrfsactv 
ftpreqvsshtrmf laalft laktwsqp 
/kyppvigwikkVmwhiytmeyyaaikk 

DEFMS FAGTWMKLEAMY 


2548 


16449 


A 


2565 


429 


0 


PLQSSCQTSCHQNCSRKTSLHFPGDVLA 
T PEQVGG S PAQVP I P YLDDD I PLLE VEQ 
EPVSLELGDVSLVSVSREGLQPASITGS 
RGHLIVQLQELLHHWVLWSAVKSRWVIV 
GLFVS ILILSLVFAIRLR\SASRAPVLL 
RP 


2549 


16450 


A 


2566 


464 


1 


VIHGVI \NPFVHGDQYKKKFPLK\FYQE 
lYESPFVTETGEYYKQEASNLLQESNCS 
QYMEKVLGRLKDEEIRCQKYLHPSSYTK 
VIHECHQRMVADHLQFLEBiECHNl IRQE 
KKNDMAHMPVLLRAVSKGLLHMIQEHVL 
RALGRIPTSYVRMOS 


2550 


16451 


A 


2567 




178 


YNPGGGVCSDLISSHCTPVWVT\SETLS 
QKKKKKKKKKKiX'FLPETREHALPGKKKG 
GGG 


2551 


16452 


A 


2568 


215 


411 


IHLIFIHLVFIDYLSVRHSSKILGYISE 
QIRKRNS /WLYWAHTCNPSTLGGGGGQ 
ITRSGVQDQPG 


2552 


16453 


A 


2569 


170 


430 


TSSQIAPQCLAV/VSGPLAACPBLTSAT 
SPWLQVRTNAMASPLLKFSAEDLLFKAA 
LSQFCVIMIJ!IAKLSVQKYEK\LI SAFSD 
SR 


2553 


16454 


A 


2570 




463 


CRFFGYSTAAAPMTPSSGGSTLSGITAP 
AVPNIPS PIGVNGFTGLPPQMTGQPAAE 
AVFANGMHPYPAQSPTVADPLQQAYAGV 
QQYAGPAAYPAAYGQISQAFPQPPAMIP 
QQQREWPEGCNLS IYHLPQEFGDS\ELM 
QMVLPFGHVISSKV 


2554 


16455 


A 


2571 




424 


LKTMKGGTGNGLE IMLDIQQDEYLPVWG 
ETGTSPTSGAPIBGSRPQP/PARGFLGF 
TVRPG 


2555 


16456 


A 


2572 


1667 


2046 


YIFFFTAFLWftfiLTFQVTTTLAPLALLV 
RSAro4MRASHDKPTANITtK\6KTGRAS 
KQRREERHRQEVKLSLFTDDMFLYLES P 
IVLAPKLLLLINFSKVLAYKINVQKLLA 
VLYIKSSRESNEEHN 


2556 


16457 


A 


2573 




^99 


FAISQDHPALPSRPPSLHHPKPGTLTFH 
PDLPHQATCSRPIRHQRTWPEDAPLAKA 
DTVSPAEHPPAAATKAP/TR/PAPDKPG 
GTSDPQTGPAP\PASPPCSGP\APQPVP 
RKPSRAAPSKVSVTVPRRVPRTFPP 


2557 


1645S 


A 


2574 


2 


452 


AKVWEMKSPMRKGHTLLKNKEEKLNQLE 
SSLWEEASDEGSLGGSPTKKAVTFDLSD 
MDSLSSESSESFSPPHLDSTBSLTSRKI 
HGLSHSLRQISSQMSSVLSILDSUSTPQS 
PTPLLASMPAQLPP\RDPKSTPTP/SYC 
GRCRGFSLTS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

peptide 


Predict- 

nuclc- 

location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=TIireonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


2558 


16459 


A 


2575 


2 


369 


TLVYPAXTFILLS I CICYWIVTAVPLTT 
SGVPVYKVIAPGGHCIHKNQTCDPEIFN 
TPEIAKACPGALCNFAFYGEKSLYHQYI 
PTFHVYNLFVFLWLINFVIALGQCALA\ 
GAFATYYWA 


2559 


16460 


A 


2576 


406 


1 


RQEGTGPSYLLLVGITPRSFWGFSPPLG 
PSGKNPIKTFGGPLQGFFFRVQMGVLLP 
LPGPSGN/CSIKVSAIjNSSFSPSGWPG 
EASLPWF/CFFEMESRSVAQAGVQWQPP 
P?GFKRFSCLSLLGSWGYRRPPLV 


2560 


16461 


A 


2577 




410 


YISPFYITHMRAHTNLPGPF/KLNQRAD 
ALVSAAFADAQTFHSLTHLNAAGLRKRY 
GLSWKQVKE IVKHFSAGEVLHLPHQGAG 
VNPRGLSPNSIWQMDITHIPTFGKLSFV 
HVSVDIYSHFIWATYQTGEATAHVK 


2561 


16462 


A 


2578 


386 


1 


TERIRQRQYQRETEKDKRYTERQRKTER 
IRESDRDRERQNQIERDRRAT\RERDRK 
KQRESDRETYRERENQIETBRDRKRQSD 
RDRKTQRETGRIRYRERERQHQTESDRD 
RERESDRETQRHRE CI 


2562 


16463 


A 


2579 


2 


432 


LLYFAKRYGftaFGECCQftaDKAACLLPK 
LDELRDEGKRSSAKQRLKCASLQKFGER 
AFKnWAAARLSQRFPKAiFAEVSiOjVTD 
LT*VHTECCHGDLLECADDRSDLAMYIC 
ENQDSNFSKUIECCEKPLLEKFHCrAEV 
END 


2563 


16464 


A 


2380 


3 


426 


NliUmALAIREKTLGKDHPAGAATLNNL 
AVLYGKRGKYKEAKPMSKRALE IRKKVL 
GICDHPDVDKQLNNLALLCQNQGKTEQGE 
YYYPRALHIYHTKLGPDDPNAftNTKNDL 
VACYLKHGTVKQA* SMIKENIiTSAHEWD 
V 


2564 


16465 


A 


2581 


10 


389 


KLflG*GLWGHTLITNPLTEPLTYPFLGL 
YL*SIIITSSICLLQTDL*ALIAYSSII 
HIALVITAILIQTP*SFTGAVILIIflHG 
LTSSLLFCTiTNSNYERTHSRIIILFQGL 
QTLRPLIAL**LLA 


2565 


16466 


A 


2582 


1 


192 


LIPTLAIITR*GGQPERXJ!IAGTYFLFYT 
LVGSLPLLIALIYTHNTLGSLNILLLTL 
TAQELSNS *ANNLI *LAYTrAFIVKI PL 
YGLHL*LPKAHVEAPrAGSIVLAAVLLK 
LGGYGIIHLTLILNPLTKHIAYPFLGLS 
L*GGQPERIJ!qAGTYFLFYTLVGSLPLLI 
AlilYTHNTLGSIiMILLLTLTAQELSNS 


2566 


16467 


A 


2583 


2 


438 


QAHGPLAGCHLRPRSPSPVLWRRRRRWW 
RQRRKWKTKTRTAaAGMYASWTKaCHGL 
EELINLTRLKVSYNHIDDLSGLIPLHGI 
KHKLRYIDLHSNRIDSIHHLL*CMVG1,H 
FLTNLILEKDGDDNPVCRLPGYEASIFQ 
TFAQL 


2567 


16468 


A 


2584 


3 


415 


GRAILLLEEAIQYLSGIEIDL IDTDRGE 
WDSLTPQARR*KEAGLQMFVQLARFHNI 
VCECEAFGTLAFLTS E I KSL FGHP FLAER 
IISMLNYFLQEHilGPKMGRLEVKDFSEF 
DF*PQQHVSDSGAIYLDliGDEENFC 


2568 


16469 


A 


2585 


2 


453 


DAKMYLSYKYATVIRNLREGTCPSAPSD 
ECKPEKRCALSHHQRLKCDEWSDISVGK 
IKCVSAETTEYCIA* IMNG*ADAMSLDQ 



703 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO; of 

sequence 


SEQID 
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nucleotide 
location 
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ing to first 

residue of 

sequence 


Predict- 
ed end 

correspon 
last amino 
residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lVI=Methionine, 
N=Asparagine, P=ProIine, 
Q=GJutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptop)ian, Y=Tyrosine, 
X=Unl4nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GFVYISGKCGLMPVLAENYNKSDNCEDT 
PEAGYFAVAWKKSASDLTWDHLKGKKS 
CHTAVGRNAC 


2569 


16470 


A 


2586 


3 


413 


MA.VESRVTQEEIGKEP*KPIDR*KTCPL 

Q I YTWMDATLKE LT SLVRE VYP * ARKKG 
THFNFAIDFTDVKRPGYRVKEIGSTMSG 
RKGTDDSMTLQSQKFQIGDYLDIAI 


2570 


16471 


A 


2587 


1 


798 


LEVMLMLVKAGADQRAKNQDGMNALHFA 
TQSKHVRIVEYLIQDLHLKDLNQPDEKG 
RKP FLLAflERGHVE MIEKLTFLNLHTSE 
KDKGGNTALHLAAKHGHS PAVQVLLAQW 
QDINEMNELNISSLQIATRNGHASLVNF 
LLSEKVDLHQKAEPKESPLHLVDINNHI 
TWNSLLSAQHDIDILNQKQQTPLHVAA 
DRGNVELVETLLKAGCDLKa.VDKQG*TA 
LAVASRSNHSLWDMLIKAERYYAWREE 
HHESIRDPSTGFTL 


2571 


16472 


A 


2588 


2 


285 


AWSIAPSHPHSKVPPGPRRGKAEGRPGA 
iiAQAAEQAEVHPPSSGPSPLPARQPPVW 
QIPPTPSLKTTTRRGAQPQHSRKRLA* S 
RSVSVLFRKM 


2572 


16473 


A 


2589 


11 


438 


AYYGLNWHLGATLSQKKKKKKKKKKNFP 
RGGGPPENPHFWGIGGHPGTGPPRGEKN 
GLKNQKKEKFIiAPR*KKSTGKPLKPPQG 
KGWGGGPKFPKKKRPPRAGGGSLKPKGK 
NREPFLALKSLPGVFLGGFGGNPGMGLK 
NQ 


2573 


16474 


A 


2590 


313 


391 


VHLVRVKLGL*SRIYKELLQLNNTT*DN 
PI *KWATDLDHHFSTKYTQMVDKLMEKC 
STS 


2574 


16475 


A 


2591 


62 


616 


EVHQGTEVRDSEVRRRPQRRGPLMPAER 
AGRQRWLVPALQPRHGGLHR*RGAVRQH 
GftHPHGLLLQDQKI PALPGRKQAGSLHA 
PGTEGEPDHGGDPVLDAGIQHHRQQRHP 
TADHLNPGEHRRGEAHVEARV* PAAGAE 
GAAKERRAHQANTALQVHHR*LGSFAEL 
RLLRKPGRTSVWPSPM 


2575 


16476 


A 


2592 


345 


438 


HKRWLPVPILYQHLF*VFGHPEVYILIL 


2576 


16477 


A 


2593 


479 


313 


QDGLDLLTS*STRFSLPKCWDYRREPPL 
KLINLQSSGVGLHVQLFPPSFCFDQLL 


2577 


16478 


A 


2594 


3 


419 


LTLHSNTLPPPEMSGLT* PPATNAYWTF 
LPSAIRLFPEIFFIAVLLSVSLFDETET 
LSDAHSWRITFKYERDANYHLLMSAQER 
LERLFGLHGGTIPIVPTADFHDSISGAS 

DTD lAHSGLAYTME RS ARQI MRTAMKY 


2578 


16479 


A 


2595 


1 


419 


HKCEGHYRGKTYGGYWSLCATVNKALDE 
RIP ITSASYYATVTLDHVRNILGSDTDV 
SMPLL*ERHRILWETGKFLVDKFGGSFI 
NCVR * RDN I AHKLMHL WES FP S YRD VT 
LFEGN*VAFYTRAQ ILVTDTGTVLEGK 


2579 


16480 


A 


2596 


199 


397 


SPTALNTDDVNPFMGSSSRGHHRQFQTG 
I VS DHPAEAGP I S * DEQE LP YAVLHFHK 
VQPLEEKVTD 


2580 


16481 


A 


2597 


154 


3 


MLSLRGFFCLFVCFETGS*YAAQAGLKL 
LGSGSPPISSFQSARI TGVSHL 


2581 


16482 


A 


2598 


425 


223 


QITGHGGKCL*SQLLGKFGQENFLNLGS 
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sequence 


M 
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beginning 

location 
correspond 
ing to first 

residue of 
peptide 


Predict- 

nucle- 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutan)ic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l,=l,eucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RSFN* PKLLPGTSGWVTMLNSLGKKKKV 
TAYLNKWMPTP 


2582 


16483 


A 


2599 


3 


441 


QGPSHDLVSELYQINAFDTPESLLMIGK 
DHSDPIHHTFDHMWRTKEHNEAGWLLLR 
S VDKVMKENDELGDS I SQLQKQ ILSLKY 
AKIALTERLISCPV*TEIVLNYT*SLIM 
LLTDLL*KVHAQSHHAHSGQASNC2<IVMT 
LIGLQH 


2583 


16484 


A 


2600 


131 


423 


GPAPMVILFKLTGFLLAFLPLSHLLTSY 
LVPWILSGTDGHTFRSACLPRWLEAEWI 
FGGVKYQYGGNQEGK* CFFTG* S YVYNG 
SSGKVPWETFSRT 


2584 


16485 


A 


2601 


222 


447 


SGIPSFGL*WEQNRPGRLNAQILKDLG 
VSPGSDYGKLKKGI SSGIiDNGVTTSDHD 
VLKMAIVGRKIWILGDWSG 


2585 


16486 


A 


2602 


2 


457 


F ICD PLI KAIGTEGDTDVLSE I MNSFAK 
SIEVMGDGCLNDEHLEELGGILKAKLEG 
HFKNQELRQVKRQEDNYDQQVEMSLQDE 
DECDVYILTKVSDILHSLFSTYKEQILP 
WFEQLLPLIV]!JI.ICSSRPWPY*QWGLCI 
FDDIVKHCSPSL 


2586 


16487 


A 


2603 


1 


297 


DHRQKLYAHTECGKALLWKSVICVHQKI 
LEEEKPCEGTKYDWIFSNRGCPPVPHMV 
HAVE I PCK*TECEKATGVHGPRGASEFL 
PERPTGMNMaKCESR 


2587 


16488 


A 


2604 


1 


405 








GLRGLYKGVTPIIWSAGLSWGLYFVFYN 
AIKSYITEGSSERLKASEYLASTAEPGP 
MTLCITNPLWVTKTGLMLQYDAWNSPH 
*QYKGMSDTLVKIYKHGGVQGLY 


2588 


16489 


A 


2605 


165 


3 


PEELLLVFFPVLKNTGIRPGAVAHTCNP 
STLGGRITRTGD*DHPGQHGETLSL 


2589 


16490 


A 


2606 


3 


455 


KRYGCFSKRMNKRSATNVFPCARKGEVL 
GLLGHNGAGKSSS I KVITGDT* PTAGQV 
LLKGSGGWDALEFLGYCPHENALWPNLT 
VHQHL*VYAAVKGLRKGnAEVAITHLVD 
ALKLHDQLNSPVKTLSQGLKIKLGFCLS 
ILGKPSCGILA 


2590 


16491 


A 


2607 


1 


429 


VDYTVRKFCIQQEGDMTNRKPQRLITQF 
HFTSWPDLGVPITPIGMLKILQKEKaCH 
PQYAGAI\/aHCKA*VGRTGTFGDIEflML 
DMMHT*RKVDV1!IGFESRNRAQSCQKUQT 
DMQ YVF I YQALLEHYLYGDTELEVTCLE 
TPW 


2591 


16492 


A 


2608 


166 


435 


KFLSNNYA/HFKQNFKKVLKF I KHLVLNY 
FKNIVLGQV*EATPGIPALWEAETGESL 
EPRSSRPTWAMWENS I STKNAQ I KNIVL 
RILDPG 


2592 


16493 


A 


2609 


228 


431 


IFSKICIFNGFNYFPIVGHIGRIYLFIE 
TGSHCVTQAGVQWCDYGSLQP*TPGLK* 
SSHLRLPSSWDH 


2593 


16494 


A 


2610 


439 


175 


RNGGLHLWSQLLRRLRWDYHFSPGSQGC 
SKS*SYHCTPAWVTQGDSSLKKQKRKIS 
TYTSILSVLNETVYIIVYYTKKLDTSFK 
RKKL 


2594 


16495 


A 


2611 




439 


LLaSILSSMQKPRGLVDQETLRKAR*QA 
ARLNKLQEHEKQQKVEFRIRMEKEVSHV 
V*DSGHITT*VQSMNSIERSILHEWEV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
locntion 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lVl=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoph»n, Y=Tyrosine, 
X=tInknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DGLTSFYFGEDDDCRYGMIFTKDAAPSD 
EELDSYRRAEEWDPHMADDKRKLTELAQ 

RPDDDA 


2595 


16496 


A 


2612 


1 


439 


VIRKVTGTEGSSSTVLDYTIPSSTGGMP 
VRKSEDQTDTKRTVI KTMEDYOTJDNTAP 
AEDVIIMIQVPQSIWDQDDFESEDQDDK 
STQPISSVGKPATVIKWVRTKPSAIVKY 
PEKESEPFAKILKFTMDVSHEVIPHEVK 
SS*YSA 


2596 


16497 


A 


2613 


2 


453 


EKPEKEEClSlLWTEMWfiElWPegFGGIRL 
YLQELMTITQKALHSQPWKMKAQGAi™ 
ASMALQTNSLVPPYLGMILTALVEGLAG 
RTWALKEELLKRIACVETARSAEL*KSV 
PNQPSTHEI IQADLKECSKENGPYKI * P 
VICRftDILHA 


2597 


16498 


A 


2614 


2 


441 


LBPALPGRWQQRSAESPPSQSVRKTRQN 
KQKTPGNGDGGSTS*APQPPRKKRARAD 
PTVESEEAFKNIMEVKVKIPEELKPWLD 
EDWDLVTRQKQLFQLPAKK]!IVDAIL*EY 
ANCKKSHGNVDNTEYAVNEDVAGIKEYF 
NEMLGT 


2598 


16499 


A 


2615 


1 


430 


RGDRDLHCTESQSEASTEEGHDSLSVGI 
FEEDSQLEFIUJPPKSKPPAWLNGIMTC 
♦DFELLNPRRVRFLLEVKDLALTRRQIL 
LHKGLSDYEKSTTLQELVLKKSSRSGPP 
LSIEDLGLNFQL*PSSRVYGFTAEELKP 
SGE 


2599 


16500 


A 


2616 


373 


552 


ickikyklfnsallffrqglallprmec 
sgaikvhcsldplgssiipl*r:<ikestl 
nlekslctrgiflckyeevpkqflkicf 
qifletgsrsvsqaevqwhdqnslqprt 
pglksfsclnlpkcwdyrckpprralcs 


2600 


16501 


A 


2617 


49S 


157 


QLIGSSTHQJUMiRSRSLPPPAGPGTFHF 
HYQ6KRWWKGGTSPDSLPSLLGRGVSV 
QLHPRGKEQRGASDT*HKCPVKLWTIGG 
KYRVSETSRIFSLPPTTLQEAGLDSGSG 
SL 


2601 


16502 


A 


2618 


255 


389 


r.SEFYTYBGPSIRPPIGS*GTNLPLPLS 
YIPRSPSAVDDENLIJ3E 


2602 


16503 


A 


2619 


3 


207 


QHSSLIIIRAIAIKLGIAPFHF*VP*VA 
QGTPLTSGLIiLT*HKLAPIS 1 1 YQISP 
SLNVSLLLSGT 


2603 


16504 


A 


2620 


3 


207 


QYSSLII IMGVGIKLGIAPFHF* VPEVA 
QGTPLTS GLIjL YT *QTLTPISMIYQISA 
SLNVSLLLSGT 


2604 


16505 


A 


2621 


45 


447 


WRRIDCRLNLCVTTASLKHFFSIAHKFH 
LHNGSHLQSQHSCEAQLRRTARFIICCL 
PYGESGQSWPTLTLERANLNWLTVYFNN 
WKDWEYDSDQYMSNFDRSSEMMNIIVW* 
RSIDNLPHIICSTLINTKNSFL 


2605 


16506 


A 


2622 


3 


142 


GWQATPKTAPATMSTPTILVATAVHAYR 
* * VAEKEHPLKFGGEACS 


2606 


16507 


A 


2623 


2 


453 


HGPGGLLDYIDKERIRDFLNGECMCEVP 
DGGLVPKSLYRTA*DLENEDLKLWTDTI 

YQSASVFKGAPHE iliqivdastvitwd 

YIWCKGDIVFNIYHSIOISPLPPNKDYLI 
AHSITFPDGNNVHLNHKVWMLGPDYIMW 
ITTLITNMEN 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue uf 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaiiiic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Ly5ine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=l]n known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2607 


16508 


A 


2624 


1 


315 


EEKQAPGFTTGRGKLTPLFCAHAVRF 1 1 
STSLIYKGAYP*ALKGKEKY*LAVFWVY 
NKKGLNNNNLFLDGVHLCFVPEVRKYLA 
GKGLPFS IGLYHLIIjLLFFFF 


2608 


16509 


A 


2625 


232 


478 


LDCGNYFTV*MYIKTSHCIP*IYIILNF 
HLYNFRYNLGDYNGEIVSEVMAQRQPMK 
PTYAIPIIPITNSSQFiaiQEAMDVKE 


2609 


16510 


A 


2626 


3 


430 


TSTPNVHMGSTSLPGDSTTIEDAIQSHS 
ESASPSALSSHPNNLSPTGWSQPKTPVP 
AQRERAPVSGTPDRDKLRPCGQRD *G YY 
WEIEASEVMLATRIVSGSFGTVYQGKWH 
GDVALAILQWDSTPEQFQGFRNDVAGL 


2610 


16511 


A 


2627 


2 


341 


ALQKHEDTDCPCVWSCPHKCSVQTLLR 
SE*SAHLSERVIAPSTCSFKRyGCVFKG 
TYQQIKAHEASFWQHVNLLKEWSNSLK 
KKGSSFDKECVDKNKSIHCWHIQICSSV 
I 


2611 


16512 


A 


2628 


2 


179 


RHTGLWVTSL PAVFPGQVRRTLFI TGLP 

RDAKKETVE3HFR*AHWLVGDFSPCCIS 
RTGEADPVHHRTPQRCQEGDCGEPLPVS 
RWVGS FL IIIYIQEHCL 


2612 


165n 


A 


2629 


1 


418 


GFSFCHPVPVKWRHRDSPQP*TPQLK*S 
CLGLPKCWDYEHEPLRPMJI SY 


2613 


16514 


A 


2630 


3 


442 


FTCGTIIIAIPNGVKEFT*LATLHGSNM 
K*SAAVL*ALGYIFLFTVGGLTGIVLSN 
SSLDIVLHDTYYMVAHFHYALSIRAVFA 
IIEGFIH*FPLFSGYTLHQAYA*IHFTI 
IFIGVMLTLFPQHFVGLFGMPRQYSDYP 
DAYTTH 


2614 


16515 


A 


2631 


2 


454 


AAPMELI CWSGGWGLPSVDLDSIAVLTY 
ARFTGAPLKVHKIINPW*SPSGTLPALR 
TSH6EVI SVPHKI ITHLLKEKYHADYDL 
SAEQGADTLAFMSLLEEKLLPGLVHTFW 
IDTKNYIiEVTRKWYAEAMPFSLKFFLPG 
RMQRQYM*ELP 


2615 


16516 


A 


2632 


131 


410 


WMWSSKa.PHCPHI.PS IGDADTVHQCSiMS 
FQKGHSALEGVLHLVFKPDLVYQTLLQM 
PPRKCCLWPGAVTHA(3rSMTLGGHSR*F 
TRSGVQDQP 


2616 


16517 




2633 


2 


230 


FFSETSSLLEIQGIARHGGTWRLRQEKN 
LNPGGGGCSELRSCSCAPAWVTVRLDLR 

KKRNRNP*KILKKYLKXFVI 


2617 


16518 


A 


2634 


2 


370 


GTSSSDPAQPGDDKEFIDASRLVYDGIR 
DIRKAVLMIRTPEELDDSDFETEDFDYR 
SRTSVQTEDDQLIAGQSARAIMAQLPQE 
QKAKIAEQVAS FQEE * SKLDAEVSKWDD 
SGNDIIVLAKQ 


2618 


16519 


A 


2635 


2 


376 


MTDTDALY*RELFDPADKDKMDHSRRGI 
ALVFNHERFFVIHLSLPERRCTCSDRDWL 
TRRFSDLGFEGKCFWDLKAEELLLKIHE 
VSTVIHSDADCFVCVFLSHGERMHIYAY 
DAKIEIQTLTALF 


2619 


16520 


A 


2636 




461 


DMAFLILTERKILGYGQGRIGPNVACPY 
GLLQPFGDAI*LFTKEPLKPVTSTITLY 
ITAPTIALTIALLL*TPLPIPNPLGNLN 
LGLLFILATSSLAVYSIL*SR*ASNSNY 
ALIGALRAVAQTISYEVTLTIILLSTLL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last^amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Tlireonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IKGSFNLRSLITT 


2620 


16521 


A 


2637 


2 


384 


ENFKALAMIAFGQYRQKRPFEDHVKLAW 
D*LNFAKTCVADESAE1SICDKSLHTLFGD 
KLFPIATLRETYLELADCCAKQ*PDIHE 
CFLQLKYDNPYLPRLVKPNVDVMCTAFH 
DTEETFLK*YLYEIA 


2621 


16522 


A 


2638 


1 


373 


TFIYLLFLFSSAYSRGVFGRDAHKSEVA 
HHFKDLGDENFKALVLIAFAQYLQQCPF 
EDHVKLANEVTEFAKTCVADESPEN*DK 

KQEPDENECFLQ 


2622 


16523 


A 


2639 


2 


375 


KGPCYRLVSTGTQRRHPGAVYLNKHLC* 
CNVGKa*GPHCEKCTLPCTFNEEPLEAL 
TFFREHGP*VSDPEVATAPTEKEIPSLD 
QETTKLEPGQPQLSPGISTIHLHPQFPV 
VIEKTSPPVPVEI 


2623 


16524 


A 


2640 


215 


478 


KYFtASHTSLFIICTrtAHLTCTiaEPKQ 
IESHFGKRLDADLVF*KSDDSTWDVIP 
SKPV* TSGLPSGKCL*HIVEGI IRAVDP 

RKI,Y 


2624 


16525 


A 


2641 


208 


376 


I I,RST,I KNHFWPGWAHACDPNTLEAM * 
GQIXRSRDWDHPDRHGEAPSVLRIHRLA 


2625 


16526 


A 


2642 


1^4 


2 


KGEPLPPL(3GWKT*GPRaFPQAQIPHAG 
FKTRGCPPPLPQGRNKAQVPNPFCPRER 
FLPGKGSGRVAPLKNGAQLGGPPFYPHP 
FGGPSPRVPLGFGVQTHLGNKPKPPPFP 
KKKKGGTGCSVGNWGPKSSQTLPSPQAS 


2626 


16527 


A 


2643 


190 


3 


ELSTRGFQGPHRRIAGLKQCLVMFPELE 
RSGaLSEH*HLHLPGSSDSRASAS*lAG 
ITSVCH 




16528 




2644 


2 


339 


CCEKPLLEKSHCIAEVENDEMPADLPSL 
AftDFVESKDVRENYAEAKDVFLGMFLYE 
YARRHPDYSVVLLLRIiAKTY*TTLEKCC 
AAADPHECYAKEFHEFKPLVEEPQNLIK 


2628 


16529 


A 


2645 


81 


369 


VEVTGQPQNASFVKRNRMWLLPLIAALA 
SGSFWVFRTPDG*TSWR*IHNHMTQLI 
NNHLRA* YSVYRDI YFLW lAI * * SS * PA 
SVLLFTADYCPE 


2629 


16530 


A 


2646 


1 


348 


DMDMNPLRPQlSrYRPG*ELKaDIDYHFKE 
DiqDEDDHQLSLRTVSLWaGAKDD*HIVE 
SEAMHYEGCPIKVTLATLKMSVQaTVTL 
GGFEITPPGDLRMKCSSGPVHIMGLHLV 


2630 


16531 


A 


2647 


1 


178 


GYTDTILDVHSQRVRSLLGLSNSEPNGS 
VETKNL*QILNaGESPKQKGQEIKRYDL 
YLT 


2631 


16532 


A 


2648 


181 


244 


TIKRYKNVMIFYYFNFF*R*GLNSIiflQA 
AAQWCNHGSLQNQPPGLKGS SCLSIiPCS 
*GYTI FYLS IHQL I DI WIVS TFSKLFLH 
VSAYSSIKMS 


2632 


16533 


A 


2649 


2 


369 


KWITFISLLFLFRSVYGKFEFLLDAPKN 
DVAHRFIDFGEQHFKaLAE lAFDHYFLH 
CPFEDLVTLVWE*TEFA*TCLADESA*N 
CDKSLHTLFGDKLCTVATLR*TYGEMAD 
CCAKLEPERI 


2633 


16534 


A 


2650 




349 


VTFISLLFLFSSGYSRGVFRLDAHKSEV 
AHRFKDLGEENFKALVLIAFAHYLQQCP 
FEDHVKLVNEVTEFA*TCVADESAENCD 
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nucleotide 

sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
dingto 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RSLHTLFGDTLCTVATLRETYGETADCC 
AKQD 


2634 


16535 


A 


2651 


21 


4U3 


CVCCVCVP.ESRTESLPGDNEDFNVKNAS 
VKDVRCVHFDCDSQNDPPMEATGFTAQV 
TIYSLIWLLHRLKYFETLDLH*IAIFFS 
KVS ILNHPGQI lADYAPALDCHTCHIAC 
KYAELKEMIDRRSVK 


2635 


16536 


A 


2652 




355 


ARMS ITDTYGQHLIAGGLMTQEDVSE I K 
SSYYAKSITOHLNIIMAHYEPTALNLQAHW 
HGLAQAEAQITTWSSGVPFDLLLMVGMM 
TVQVP* ELQMHSHLLKTHDQSRMENMMY 


2636 


16537 


A 


2653 


2 


360 


LFEHLG*YKFQNALLLRYTMKVPQVLTP 
TLVKVSRNL*KVGIKCCKHPEABaiMPCA 
QDYLSWUJQLCVLHEKTPVSDRVRKCC 
TBSLVNRRPCFSALEVDETYVPKEEBrAE 
TFTPHAD 


2637 


16538 


A 


2654 


360 


314 


YTGAIVPCKLFWWCCSIDRVSLCCIYWSQ 
TPELK*SSCLSFPKC*NYGHEHTPfPARI 
IE'IJSILY1SIKSG*TATRKCPPTKVTPSSHQ 
Q*HCTWPYFYFK 


2638 


16539 




2653 




353 


NWTL*TLKKIFSLKnAIKRIKIQSVRMK 
KIFAKHTSDKGVESKYI*RTLINQ*EOTr 
SIKNWSKNLNRRFTKDIQKANKHMLSAT 
SLVH*IiSKLK*Q*YHYTSIRVABCMKKTD 

rvTsw 


2639 


16540 


A 


2656 


340 


10 


GREDKS EKCYLKPGRSQPDRGAPKSS PG 
VP * PPAPPALPGPGRSSPFPQGSLAGKD 
LRPRQPSQPGGPGELIFPVKTKKKKIKE 
RKTLLAWTRSNERPEIAQREGGCLRQ 














2640 


16541 


A 


2657 


509 


261 


IPYFKIHOIJHSALGVAETEKETAEHLDL 
AGASSRPKDSQRNSPFQIPPPSPDSKKK 
SRGIMKIiFGK*VRQ*TSGMGPVIiHVA 


2641 


16542 


A 


2658 


345 


3 


SAHLSHPKCWDYRREPLHLARISFLFKA 
SSPLYGRATFYLSTPPLMDGRWVaSIFW 
LL* IVLP*AWVYKYLSETPLSIF*KTGS 
LSVAQAAVQWHNHGSLQPQTPGLKRSSH 

L 


2642 


16543 


A 


2659 


271 


56 


VILYIVMPIPRPGPHPHPYIHTHTHTHT 
HTHTHTRLGKREF * LYSW * L PGYSTNGN 
KIFLEIYLLPILLKM 


2643 


16544 


A 


2660 


366 


68 


PGQQSKIMPQTEK.KKGRKEEREEREEKE 
EREGRKEGRKNEGTETVIEGGSSKTQEQ 
TR I KAS * L WSL YT TVKPQRRLKGGKKHP 
KDSNFKD*RNISPHR 


2644 


16545 


A 


2661 


351 


199 


LLRRLRQENLLNPEGRGFSEPRSHYYSP 
DWATE*DFWSNKIK*QKVQYSKY 


2645 


16546 


A 


2662 


1 


160 


ERAWLHLSPGDGVCSEPRSHHCTPTWVT 
E*DSVANQPGWDRVRLCLKPTWVTE 


2646 


16547 


A 


2663 


233 


3 


GRDNWLKTVAKPESLSGKEGPSLLRKVF 
NPFPVYILFLPQAPKGKWLSPFPPFFFF 
FFFLRGSLAM*PGCSAmQS 


2647 


16548 


A 


2664 


96 


349 


QMPSSVLTRVANAYYS6FFFWFLKQKFC 
FVTQAEVQGRSIS*PGPLVPGFKEFLGL 
TLLSTWHCRRAP PGLVIRCGFKMSLLTL 


2648 


16549 


A 


2665 


322 


74 


DTYTLPRLNQVESLNRPITGSEIEAI IN 
RPPTKKSPGPDRFKAKFYQRYKEELVSF 
ILKLFQ* lEKGGIGWAEWLMPVr P VL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 

0(1 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

nucle- 

location 
correspon 

last amino 
sequence 


Amino acid sequence (A=Alanine 
C:=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Gl3'cine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


2649 


16550 


A 


2666 


2 


228 


NLALSPRLECKSQGFTMLIRLVSVS*PC 
DLPASASQSAGIMGVSHHAQPFLHSYLR 
PblblKGDQFKYAFNLFSC 


2650 


16551 


A 


2667 


129 


301 


VRPLftGLGSPLIFFFFFEKSLAFVAQAG 
GQRHNLG+LLPLPPGLKGFSCLNLPSTW 

D 


2651 


16352 


A 


2668 


318 


21 


FSLLVKDFQNDLAVNSSSIFPSTFLFLV 
RS*LD?VNLNFFNFYrjFRDEFSLCCPGW 
THIPRLKQSSRI.RLLSSWGYKCVPLHPA 
INRFL*CDFSISGY 


2652 


16553 


A 


2669 




322 


QDGLDLLTS*STRLGLPKCWDYRREPPR 
PAPINKCLLSIYHGPGPSLGTBGEKGQD 
YIPAIEQLTDQWRRLTWIGCYCAAVAED 
SMGWWPLLPGYEAEKERGRFH 


2653 


16554 




2670 


29 


261 


EFNTLSKCVWIY*LFSLSLSLSLCVCVC 
VCVCVCVCLCVSVCLFYGSPICLHFMHP 
DTLSEESFHFLQIISKELLKC 


2654 


16355 


A 


2671 


5 


224 


GSAWWLTPVIPALWECKAGRSPEVGSLR 
PA*PRDPPALASESAGITOVKKNLrSEL 
WEVKSCILLIFISLDKV 


2655 


16556 


A 


2672 


126 


321 


l«GRRGASARRDSHSALRVSPELALLELR 
PSTMaHICNPSM.GGRGGWIT»DQEFET 
NLSNLEKPC 


2656 


16557 


A 


2673 


360 


74 


SRLLRKUIQEVGLNLGGRGCSGPRLHHC 
TliAWATERDPDSKlirKTlSrQKNKQER**TL 
PETNSLLAWEPDCVCKTNKLATRLEIMR 


2657 


16558 


A 


2674 


192 


357 


RIRKMLYYYYYYLIiRWSLALSPRLECRG 
TIL2^CNFHL*SSSRSPASASPVAGIT 


2658 


16559 


A 


2675 


236 


375 


EEVLYSRKDRHVASYSRK*W*RSGAEAH 
ACNPSTLGGRGGWITRSG 


2659 


16560 


A 


2676 


236 


2 


ATKVS INSIGTliGVKVGGLLEGRRLRPG 
PKQSSGLGLQSSWDHR*VQPHPGNNFTL 
FVETRSMLARLVSNSWPQAIFQ 


2660 


16561 


A 


2677 


164 


389 


LTVQHGLRGLRKLTVMVESEGQASHLLH 
KAAGRRSECQQGNCQMLIKSSYQPCAEA 
HACNPSTVGG*GERITRSG 


2661 


16562 


A 


2678 


208 


1 


C SV YEGS CFC I LRVRRVCVC VCVC VC VC 
VCBRSCIYVRSENH*KQLPSSSILT*-RF 
*IQSTFRSNXRLR 


2662 


16363 


A 


2679 


159 


382 


RWLIKNHPTQARIQVRGLLDRDCQTQTW 
LWESGVQPLATTPRRHSEMGGWPGAVAH 
ACNPSTLGGRGG* ITPPRPESKLEDCLT 
GTARHRLGCGSQGCSLWLP PPGG IQKWG 
VGRAQWLTPVI PALWVAEAGRSLEVH 


2663 


16564 


A 


2680 


29 


367 


DCQSEQLRKLRWEHHWSQGV*GCSQP*S 
HHWTPAWVTEQDPVQNKQTKNYNTHQRA 
GEVWREINCLSTEQLTELYSEITQILAL 
SVRN*KLL* YVKSSVISAETQ ISGGKIF 


2664 


16565 


A 


2681 


256 


1 


ENIVPVRFT.CIHHLFalLIMVQ*LERKR 
DHIVIHLNLTLETVYLKKWQTRPNAVAH 
ACNPSTLGGQGRRITRSRARDYP6QHGE 
T 


2665 


16566 


A 


2682 


1 


395 


LLIEHILIAMALLILTERKILGCIQLRK 
G PNWGP YGLIiRP FTDAI KLFARE PLQP 
VTSTITLYMTAPTLAVTMAVLL* TPLRI 
PNPLVNHDLGIiLC I LATS SI,ADY*IL*S 
R*ASHSNYALIGALHACPH 
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SEQI0 
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SEQID 
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sequence 
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26 


Predicted 
beginning 
nucleotide 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamie Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Met{iionine, 
N=Asparagine, P=FroIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2666 


16567 


A 


2683 


3 


349 


VR I DNAAVCL L Y AW AE F I WAIiANL 1 1 R 
PDLDHPGNLLGNDHIYNVIVTAHAFVII 
FFIGIPIIYGGFGN*LVPLI IGAPDMAF 
ARITNIRF*LLPPSLLLLIjASAIVEftGA 
AGT 


2667 


1656S 


A 


2684 


99 


2 


LTVAHACNPSTLGGQSG* ITRSGVREQP 
GQHG 


2668 


16569 


A 


2685 


325 


4 


SSFPMSFlINSFIirFLYTTIFLlFYDV 
VYSPEFKP*VCFCLTECWDYRREFC!CSF 
LPCKRV*VLAPFIKNFFFFFFFLRWNF1 

FVAQAGVQWRHLGSLQPPPPNFK 


2669 


16570 


A 


2686 


382 


269 


MC-FHHVAQADLELLGTI'mpPAliASQSAG 
ITGVSHHAQPKLTF*TFSY* AC VGGS S P 
QFI LWQHGLAGWS FCCF 


2670 


16571 


A 


2687 


3 


381 


GHLYALTFFFNSQICLFPRIYKTCKKSF 
WLPFL1ILVSLCAIEC3YKTKKVPSNV*KG 
IFIHHQTQFILRMQQWLNI*TSINGIYH 
ISRIEDKNHKTMFIDaEKWFHNI*HPFI 
lENIRPGWAHICaST 


2671 


16572 


A 


2688 


289 


91 


IKYKELGVACSKTSRNVSYYCYHHHHHH 
RCHHYHHHS*RLa2VLCEESGWRKALGST 
SMGRVAFHRH 


2672 


16573 


A 


2689 


3 


211 


LHHVGQDGLDLLTL*SAPLGLPKCWDYK 
TEPPFPAPNEFLLLHSLLSKHYALGRPH 
TPRERKKaSGILR 


2673 


16574 


A 


2690 


2 


197 


DLLTS *SAHFGLPECWDYRREPPRPftSY 
AFIKYFCPALSCFWCIETGPYLIFAYUl 
EESFVRSRT 


2674 


16575 


A 


2691 


253 


356 


AVPVKMaj:VKKTRNNRCW*GC33ElGTFL 
HCWWEC 


2675 


16576 


A 


2692 


211 


339 


PGHPLSI ILQWIWGEDTRRGPMHEALH* 
PCCSELR3CHCTPAWPTE*NS IF* KKKK 
KWLGELKEKQKNMAC2USIDP+PGHPLS II 
LQWIWGEDTHRGPMHEALHGKSGDGVYA 
VRLLTAY 


2676 


16577 


A 


2693 


1 


175 


RHEGXJULGGRGCGEPSSCHCTPAWVTE* 
DSVSKKKRKGKKKSIVLILIFSNGVLFCH 

LG 


2677 


16578 


A 


2694 


234 


13 


KTSREPWSEFLSIKGEQQCSFGLLYVFC 
FQMESHYAAQaaV*WCNLG*LQPPPTaF 
KQFSCLS PLSPSS PASAS 


2678 


16579 


A 


2695 


149 


2 


S QGEDFTKTRALQWRSRRS WAHACNPS 
TLGGRGWWIT*GQEFKTSIiaN 


2679 


16580 


A 


2696 


2 


194 


CIGLGWAGACSSRLRQENDMNLGGGAC 
SETRSHHCTPAWVTE*DSISKKKRKKWR 
WGGTRFSN 


2680 


16581 


A 


2697 


338 


357 


YLIRRKKISNS KS * FSWPGWAYACNPS 
TLGGRGRRTTRSAVENQPGQHGDPKFFL 
VSS 


2681 


16582 


A 


2698 


53 


293 


TI SI KS KKWKNKQGE YRTKWCMERrWLR I C 
RRLKLCPYVLVHSCIAIKNYLRLGNL*R 
KEV*LAHSSTGCTGNMAEKPHGRR 


2682 


16583 


A 


2699 


2 


335 


LYAAKVPLQKACIRTFSYISFFIFFFFF 
FFGKQSSFGPPGLKARGGTPLIGTPPLG 
VKRNFLPQFSGKPGITGCPPLPQKFWFF 
KKKRGLIF*PGGVPNSDPKGIGPPNPP 


2683 


16584 


A 


2700 


166 


3 


DYERVPPRLVKR*GFTW*AGV*WRNHG 
SLQTQTIGLKQSSHLSLATCWDYRYE 
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SEQ ID 
NO: of 
peptide 
sequence 
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od 
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Predicted 
beginning 

location 
correspond 
ing to first 

residue ol" 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
hist amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, I>=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2684 


16585 


A 


2701 


209 


340 


YRMTVFPARHGGSHTCNPSTLGGRGGWM 
T* CHEIETTLGNMVKP 


2685 


16586 


A 


2702 


78 


341 


EVACNCLLPAIGCFAS VTYV*S SVLL * F 
EL*VLNENSSF*NFIH*FF*NRVLICCP 
DWSVMA*S*FTEALTSQTIRSSHVSLPS 
SWDY 


2686 


16587 


A 


2703 


258 


185 


TMIAPVHSSLaDRARPYLFKQKR*GIiAIj 
SPRLEWTGAI IVHRNLEFLCSSNPPTSA 
SGVTGITEMEVERHGWIQETFWK 


2687 


16588 


A 


2704 


361 


41 


NFSPVYKTNPGPSRPGWdPLGKPPGPQK 
FFPFGVPPSPPFPKKGSVFSPPGGQKGP 
FKGVFWGFP* PRGKBCTPTQQPGPSPKVP 
NWDFPKGGFPKKKKKKRKILRN 


2688 


16589 


A 


2705 


12 


329 


SCTLQLTTPRLK*SSRPSLPNN*DYRHV 
PPHPASFP*FSEMF*DSTQ1FLNSLKFS 
S*YYLVIFFFKMEPCSITQDGVQWCDLS 
SLQPLPPRFKRFSCLSLPSTRP 


2689 


16590 


A 


2706 


232 


1 


KRKRKSFKTYYKKFPLNSSFKNQCPYAI 
GHIFRSIIPFFFLRQSFGLVA*AGVQWQ 
DLGSPQPPPPGFKPFSHLRLR 


2690 


16591 




2707 


135 


1 


CFFLIFFFFSSDTQAGV*WPQPPGIKRS 
SQLSSPRSWDYRRISPR 


2691 


16592 


A 


2708 


1 


137 


LDLLTS * SaEUEfflPKCWDYRREPPRPAD 
YKYFLKEVAS LI VKLYLFCKLNFE 


2692 


16593 


A 


2709 


88 


344 


MQEHYYFIKRQNKMIIRKYVQLHANKLD 
SLDKIDKFQVKNHQN*L*KK*KSWPGAV 
AHACNPSTLGGRGGWITKS *DRDHPWLT 
C 


2693 


16594 


A 


2710 


1 


383 


LHKVTTVLRFLS KFHILVLYFYYTCVHV 
SVSDMWCWFLY* IFTDMY]SIAVFQ*LDFF 
KLSIKFLRCIC* CVY* IFIPVFIYCFFL 
YLI CLFLS YFLFNI YVMIiFYILCI I YF 
YVIFSSSSIF 


2694 


16595 


A 


2711 


227 

L , 


1 


IFFFSNSFLKISLGFFIjNFAFF*KPKFF 
PFLSFKINQNFFFGFPFNFLKIFFKFFP 

NKFFKF* I YLTFFKKTKPK 


2695 


16596 


A 


2712 




136 


KNTKISQGWWPTPVIPATQEPEAGGSLE 
RGRQRLQ*RNLGSLQPLPPTFKRSSCLR 
LLSSWDYRRRPPPLANFCIF 


2696 


16597 


A 


2713 


3 


349 


L I PRGQGS TWL P YNPATS I FGNDLNE I 
KMYGHAKTCIWMFMTSSFIIDRTRKRQ* 
CSSVREWINKQ* CIQTMELVFGRNFFFF 
FFFFFFFFFFFFFFGGPPPPKIQIFKEI 
NPQ 


2697 


16598 


A 


2714 


181 


342 


TSQLLNLPLPIIKR*AIRQEKEIKGIQL 
GKEEVKI,FLFAHDTIVYP 


2698 


16599 


A 


2715 


3 


205 


FRHVAQAGLE LLS AS S LPTLAS RS AG I T 
GVSHCVQP * TVS SLRLGFYDTS FKIAD I 
QSCFSWVLYYCLQSAEL 


2699 


16600 


A 


2716 


388 


23 


ASNQS KNS FEKGEKNQQSVMVKTSQQAL 
NKEAFFDMIRSVYQKTYK*ENA**ERQS 
FSLKVENKKEYPSSLLLENIVLKVLVNT 
GHKEK* KAHRFDRNIIQFANDMIVYVEN 

PKDSTKRLS 


2700 


16601 


A 


2717 




194 


FSCLSLLSSWDYRSMPPHPAWFLYF* *R 
RGFTMLARLVLWSWPHSAGITGVSHHAQ 
PFNYFFI 


2701 


16602 


A 


2718 


1 


212 


LCCPGWWETAELKRSSRLSLPKCWDYRH 
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nucleotide 
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sequence 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
lastHmlno 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ESLHLAFGFFLL*YFGPYLMVYVFSFNL 
YDFSLVYSLYGTSI 


2702 


16603 


A 


2719 


53 


299 


KAVHWIFFITKVHLTMETQIVSLSLFP 
FPSFLHPSLSLSS FL*RDRVLLCYPG* S 
AWQS*FTWPQIPGLR*ASCISLVSS 


2703 


16604 


A 


2720 


139 


320 


QPTMRHSFSF*KQEISIFLKVSSSLHHT 
LRFLFFFFLLDRVS LCHPGWS TVALS * L 
TVAW 


2704 


16605 


A 


2721 


1 


99 


I ILAYSS ITHIG* I lAVLPYNPNITILN 
LTIYM 


2705 


16606 


A 


2722 


215 


361 


LTFFFFFFERESNFVTQflEMQGGNLV*]:, 
KPRPPGLKHFYYLSLQSTGNY 


2706 

^2707 


16607 

1660S 


A 

■ A 


2723 

"2724 


335 

'27 


24 

Tr4 


AIPIfiPEF* THPGQNGKTRFPLKPQNLT 
GVGGNARYSQFPKRLRLKIPLNPEVGPS 

INPNFHPGIPFWAQKENFFPKKKKKKEI 

imlnfslicynlfflyicy 

wkqlkyppsde'SiTkvwc^^ 

rkikvqlcvt*rwrwlmdtkkk 


2708 


16609 




2726 


1 


112 


GFHHVSQDGIiNLLTL*SIHLSLPKCWDY 
RHEPLCQAR 


2709 


16610 


A 


2727 


271 


3 


RSPMLKNQAIKSVIRrVYIGTIWRSAGPF 
AIFFIAEYTNIIIIITLITTIFLGTTYD 
ALSHELYTTYFVTKAVLLTSLFL* IRTA 
YPRFR 


2710 


16611 


A 


2728 


121 




ENKYRPGAVAQAYIPNTLGG*GGWIT*G 
REFETSLTNVEK 


2711 


16612 


A 


2729 


2 


290 


NRHFTKENIQMPTRHKKRDPQSSLVIRE 
MQTKTKVRYYFILTRMAKVKKIDNTTC* 
♦GYEKUSrSPKLQTTKCPIFEWVNKLWHS 
QIVEYFSGIQHA 


2712 


16613 


A 


2730 


98 


264 


LVSSE***IYQrLFFFPEMECHSVSQW 
VRGSNLC*LHPLPPDFKRFSCFSYWEG 


2713 


16614 




2731 


2 


286 


PFYKATVIRTVWYW*KN*QIDQWNRKES 
PEIDLHK*SLLIFAKGAKAIQWRKDSLF 
NKWC*NRWISTCKNESMLGWAHTCNPS 

TSGGQGGRIA 


2714 


16615 




2732 


323 


181 


RDFVLLVE I TMLARLVSMS *P CDLPASA 
AQSAGITGVSHRARPFLDL 


2715 


16616 


A 


2733 


272 


131 


GRVDRLNPGGGGCS * LGSCQCTPAWVKE 
TPSQKKKNKIICFYAFLFI 


2716 


16617 




2734 


227 


87 


AAS TLTLTS LI PP ILTTLVNPNKKNS YP 
HYVKSIVASTFIISLFPTTIFMCLDQEV 
IISN*H*ATMDFT 


2717 


16618 


A 


2736 


25 


415 


FSKYLLDHQLSAGLLVEQHRGSLCPLSL 
LHSDSSLSFAVSGKVSLAASIRNKLELP 
ET*RALMM*NHGHPLEFGGM**PWT*EP 
GS * RC * ELGQPGQARRLACNPS TLGGRG 
GWIMRSGVRDQPGQDVET 


2718 


16619 




2737 


390 


1 


KGFFLETKRKFFFTNGFFPLLGGKGVP F 
PGKRLGP P I LGFYP PQRVFPFPKPPFFP 
PGPFKRGQKPPGGGKNF*RGPPGPPKKG 
GAPTGKKKKKEEERKEKEKKTPRNISKP 
LSHPFPNFPFCATVRGFS 


2719 


16620 




2738 


96 


331 


QFTYTHGLFPFNFNRLCVFFFS * KLVGG 
V*LCCPGWL*TPGLKQSSCLSLLSTWDY 
KGVPPCLAEHFFFFEKKSPLSL 


2720 


16621 


A 


2739 


379 


14 


LRE VFPFFS P PKI KCFS KNS P QKYFPP P 
WFKTPPTPPFFFPPLFGKNFFFPSLFN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

sequence 


Predict- 
ed end 

correspon 
ding to 
lastiimino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, l^Leucine, iyi=IVlethionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyroslne, 
X=Unlinown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFKVFYSLLLLYFLFILK*MLTF 

KTSISTQCY 


2721 


16622 


A 


2740 


198 


38 


ANPLFFFFFFLRDGVLVYRPGWSAWQS 
*LTADSTPWAQVILQPQRNPIEHDF 


2722 


16623 


A 


2741 


290 


39 


LLANPLFFLFFFIjRDGVLVYRPGWSAW 
QS*LTADSTPWAQVILQPQRNPIEHDF 


2723 


16624 




2742 


1 


399 


ILIRKHSDIGTLYLLFDA*TGTIGTDLS 
VLNRTELEPDQATLLGNDPVHEVIDTAH 
ALGIIFFIVIPIIIGGFGN*LVPLIIGA 
PDMAFMlINNISF*IJ:.PTSLIj:,LnaSAI 
EEAVPGTG*TVyPSriA*NYCR 


2724 


16625 




2743 


398 


2 


S PPPLIFFLWGFSYIFPPPKKFFFLINP 

PPKFSPPPFFFKPPPPFFFFFPP*EEKK 

NFC3PF?FSPP?PFFLLPPPLFFFFFFS 
LKKKKKKKKKKKKKKICKKKKKKKKKKKK 
KKKKKKKSARSRSKTSPTRP 


2725 


16626 




2744 


295 


185 


FFFFFFFFFFFFFFFFFFFFFFFFFFLF 
FFLFXFFXF 


2726 


16627 


A 


2745 


374 


4 


LKKHKSTRVP *ISrVKSGK*NFSPPFKIRP 
WAQKEAKKKGAHKKKAD*KKGBFGKLKK 
KKS FRPGKKV*G* PKGPQRNFEKKPVKS 
QGPVHMGVIPAFEKKPRIPPVAKIPI *G 
AQKKRGRPLAI 


2727 


16628 




2746 


2 


382 


QnATAPIIDELISFHDHaLIIICI.ICFL 
ILDALFIiTLTAKLTNTNISDAQEIETV* 
TILPAIILDLIALPSURILYITDEVNDA 
SLTIKSIGHQWY*TYEYTDyGGLVECfSY 
I LP PLFLEPRDLGLL 


2728 


16629 


A 


2747 


2 


374 


DWPPTVKRKTNPRAQSTAADRFILLFTV 
RGLTGIGLANSSLDIVLHDTYYWMFH 
YVLSLGAVFAIIGGFIH*FPnFSGYTLD 
QTYAKIHFTIIFIGVHLTFFPQHFLGLS 
GMPRRYSDYPDA 


2729 


16630 


A 


2748 


55 


230 


QIPAK*INKYLQN*KKKKKKKKKKKKKK 
KKKGACLKKPHGGAKEKSWGRQKLFS FK 
GG 


2730 


16631 


A 


2749 


161 


3 


NGNYTLVKKCENTKEKrKWVKW* IPFPF 
MRWSFTLVAQAGVQWCKLGSLQPL 


2731 


16632 


A 


2750 


362 


1 


PEKKSFAIKGVKKAGKQCKFCDAPKKGN 
PPKKKISrpTGTPKRVLKK*PPREEKKNSP 

QRKKKNTAKNKEKEAPQEEI*KGPPRYI 
FFFISFFFFFFFYFFFFFFSIVINLFGT 
FIATTLEA 


2732 


16633 


A 


2751 


311 


110 


ITLPRHGGPHVQSQLFKRLRWKDHMSLG 
DRGCSELSSCHCTPVWRQSKTFSKIKYG 
RNGTDKAATCNPGRFRSWGRKVA*GQES 
ETSLSNSRTRL* PCNKIWSKRDRQGGYL 


2733 


16634 


A 


2752 


223 


16 


KKTPQKKKNTEGKGPVKKSPPQKFLKST 
PP*IFFYVFCFFFFFLYFIYLLFRIFSF 
FFILFVYYLFILP 


2734 


16635 


A 


2753 


367 


10 


PAPRGGVYRGRQASLSCSGLHPLRASWP 
LCSPTQA*AMAGAPPPASLPPCGLISDC 
CASNERGSVGVALS * SGAGDNLLVCRLL 
SGKCENHPSSASLTLAAVDWSCSYSAIL 
APPLKNS 


2735 


16636 


A 


2754 


14 


341 


PGPNF*FGGQNSPGKNQNLSLPT*KGNG 
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nucleotide 
sequence 
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sequence 


M 
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od 
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USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
corrcspon 
ding to 
last amino 
acid 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, l=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














PGNPKNGGPTVPGTPPETRGSLLRTLQG 
PLRPSGTPPTRRAPGTPVGNPGF*PRDF 
FAQKGPNVREGLKQGPNPKGLNPPP 


2736 


16637 


A 


2755 


15 


172 


HSWWE CKLVQPL * KTVRRFL KKLK IDP P 
YYPAI PIiLGI YS BERKS VKQSRRG 


2737 


16638 


A 


2756 


263 


2 


QKPLPPRNDPVFPKGPERTPRPTWGPIP 
GGVPLGKFFTLLVSPFFFFFSPSPNLFF 
F*NRVSLCRPGWSAW*SWLTAASTSQA 
QAI 


2738 


16639 


A 


2757 


311 


106 


FVCFlRQRLALLi'LECSCVILaHCULRL 
LCSSDSPG*QSETVSKKKKKKRKKEKET 
GNL*RSLDMKGR 


2739 


16640 


A 


2758 


23 


328 


NITYYT* IiFVLYFYVPGHTWFFFXFXF 














FFFXFFFFFFFFFFFFFFXXFFFFFFFF 
FFFFXFFFPFFFFFFFFF 


2740 


16641 


A 


2759 


1 


338 


DR*LFSTNHGYIGTLYLLFGA*AGALGT 
ALSLLMRAELGQPGNLLGLDHTYNGIDT 
AHAFAIILFMSLAI IIRSFGN*LVTLII 
GAPVLAAPRITDISW*L*PHSVLLIJ:iA 


2741 


16642 




2760 


120 


2 


INBCYAWLGTVAHAC]IIPSTIiGGRWTA*GQ 
EFKTSLANMVK 


2742 


16643 


A 


2761 


251 


3 


C31GQIGTPTLHYSSSSSSKWIKDLNVRV 
KTIKLSENTGVNLRELELGDCPIiDDTKS 
KNSTRKIRK*SLK]:iETFFLGGTR 


2743 


16644 


A 


2762 


" 


303 


ALVLKRSSCLDLLKCWDYRHEPLHLALS 
EILMYHFCL* IFDI PFYWYSRE IDLI FV 
Y*FFFFFLETGFWFLLPRGDLG*WEPPP 
PGYKGSSCLGLPR 


2744 


16645 


A 


2763 


236 


356 


DWLIFVFSVERGFCHVAQVGL*LLGSGD 
PPASASQGAGIT 


2745 


16646 


- 


2764 


2 


332 


LTQTPGLK*SACLCLPECWNYRCEPPHP 
AGFLF*RHFDKGVS*LLIYPGTGWS 


2746 


16647 


A 


2765 


297 


19 


KKKSWFFFKKKKCWGaATKFKIT*RGFFL 
KFVFYFFFIG*GVFFFFFFLISIIIFFF 
FFFFFFFFFFFVFFYFFFFYFFFFFFLF 


2747 


16648 


A 


2766 


1 


147 


QILRRLRWENCVNLGGGGCSEPRLHHCT 
PAWVTV*NSVSKKKKKrPLLFF 


2748 


16649 




2767 


357 


183 


NWDYRPLSPRPGKFLDF*VNRGFPLLTK 
LVSNF*PCNLPPSTSPKaGlSGFNPR2^ 

P 


2749 


16650 


A 


2768 


2 


193 


RWESCLNPGGGGYSEPRSCHCTPSWVTE 
*DSVSTTTTKEKERKQNRTWSESNSLQK 
YKDTNHF 


2750 


16651 




2769 


333 


173 


VSQDGLDLLTS *S THLGLPKCWNYRHE P 
PRPAQKLLMVIWLGFMSSPKSPLEL 


2751 


16652 


A 


2770 


3 


331 


CMENCMVI P PKKVKHKNYHM I Q KF YFWK 
AVQQLSTELNIL*PYDPAITLLGIYPKE 
LKTSFRTKTCT*MLTASLFVIAKTWKQP 
RCPSVGE *LSNL*YVQTME CYSVLK 


2752 


16653 


A 


2771 


209 


55 


RPGRPQVLRRVRPQNRLNPGGGGFSEPK 
*GPCPPAWGAQRDFISKKKKTIT 


2753 


16654 


A 


2772 


213 


57 


RPGRPQLLRRLRLQNRLNPGGGGCSEPR 
*GPCPPAWVTQRDFISKKKKTIT 


2754 


16655 


A 


2773 


148 


1 


KATGRKTWVKSRCWVGTVAHTCNPSTLG 
GRGR*TARAQEFKNSLRTLAK 


2755 


16656 


A 


2774 


2 


364 


WSAVRRDWITALQPERQRETPSQKKKKK 
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nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=LeLcine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














kkkkkkkkkkkggvrhgvks rkp * tsgr 
pgrknhwrq i qgqp kt prerae alknrv 
glkklfktpghvugagnpkfrngksgkp 
pevhustgaw 


2756 


16657 


A 


2775 


121 


2 


HLRSGVSDQPGRHGKAPSLLKIEKLAGS 
GGTCL*SQLI,G 


2157 


16658 


A 


2776 


195 


3 


GRVDINTLLALLLI IITF*LPQLIGYIE 
KSTP YECGFDPI SPARVPFS IKFFLftAI 
PFLLPDIi 


2758 


16659 


A 


2777 


3 


285 


FLNIENKVSLCCPS *A*TPGLKQSCCLG 
LPKC*DyR 


2759 


16660 


A 


2778 


135 


379 


HWSATVKLYAILLGLLKWESGTLLCSKD 
VLFFL*G* SSAYSI ESV1MQIKAT*VKG 
KAR.VQLGAKKLKAYWQRKSP6I PAG 


2760 


16661 


A 


2779 


364 




GSFTGAVILI lAHGLTSSLLFCLAHSNY 
ERTHSRII ILSQGLPTriLPLIAF**LriA 
SIiftNLPLPPTINLLGELSVLVTTFS*SN 
ITLLLTGLNILVTALYSLYIFTTTQWGS 
LTHHINHIK 


2761 


16662 


A 


2780 


356 


1 


GCLEAHIWPQKGNHEGQVHLFIDKVCRQ 
PMTBDCINEITTQVAQIFLVHPLLRQSL 
TLSEKRKKRKKKRKKRNRIESPEINTHI 
YCQLIEI!IKGAKK1K*G*NSLFNKTYLDP 
WISTCK 


2762 


16663 


A 

— 


2781 


1 


204 


AQVGLQDASSPRIEELITFHDHALIIIF 
L ICFLVLYALFLTLTT*LTNTNISDAQD 
SETDCYSQSYIL 


2763 


16664 




2782 


1 


522 


YKCNEaSKVFNQQflHLAQHQRVHTGEKP 
YKCNECGKTFSQMSNLVYHHRIiHSGEKP 
♦KOSEaSKTFSQMSKLVYHHRIiHSGEKP 
*KCNECGNTFHHNSTLVSHKMHTGEKL 
YKCNECGKVTWQKTTrARHHRlHTAEKL 

yk*eecd:<vfgcksnlethkkmqfsktd 

SAFSLQ 


2764 


16665 


A 


2783 


1 


327 


ENRLNSGGGGCSEPRSHHCTPAWATERD 
SVSEKEKEQKQNFHLNAQSNCQKLKSP* 
KYLKHPEKTDILSKASQYNNQLTADQSI 
LKFKTKTFDQKKGRPKGHGMTYSNS 


2765 


16666 


A 


2784 


335 


128 


fslifcragilpccpgws*tpvlkqssc 
sslpkcsdykqeppylacatlkcyqipn 
fycwphifkrmfy 


2766 


16667 




2785 


208 


3 


RPICPLSLWGVSFLFFFFFLRDRVLICH 
TGYSAVAQYCNTAHCYSPRLK* SSCLSL 
PSSWNHNLIPPR 


2767 


16668 


A 


2786 


49 


332 


VEMGNSPINRKYVYPKSYWRCLKCNTEE 
GVLNDLGIAEFNTCSKSLLLLIFFKGRI 
LLCCPGWIEWQSQLTASSTLGLK*TFH 
LSLFGSWDHR 


2768 


16669 


A 


2787 


331 


187 


GCSEPRSPPCNAAWATKGDSVSKEKKKN 
KKREEKKH*HNTSGEKINLI 


2769 


16670 


A 


2788 


325 


34 


RSQLRGLPSMSIS * I *NSRLR*LRPRRL 
FIFCRDRVSLCCSGWSQTPGFKQSTHRS 
LPKCWDYRREPSPLAQIRYLGLFFSQGL 
SSAFCYVAMVSGF 


2770 


16671 


A 


2789 


3 


150 


AAS TRQLI FHFTS KHHFGFEAAA* YWHF 
VDW*LFLYVSIYWLG 


2771 


16672 


A 


2790 


86 


311 


NIHPLNDIFTRLKNGFKKKEISLVKNEP 
NKGT*SLISIKVI.FHT*K*FIRPGVAAH 
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E=Glutamic Acid, F=Phenylalanine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














T*NPSTLGGQGGWIT*SQE 


2772 


16673 


A 


2791 


2 


123 


GGGACSE PR SHLCT PAWETE * D SI S E KK 
EEKK*LQIYTNI 


2773 


16674 


A 


2792 


302 


162 


PLLRRLRPENCFHLLGGGFN* PRSRNCP 
PAWATKEDSLLKKNLKNF 


2774 


16675 


A 


2793 


1 


258 


GGCSC3LRLCHCTPAWKTD*DPVSKKKKK 
KNFTEAKLLIFGTLVLGKVKRGGPFKPR 
ILKLPWETWQNLVFTKKKKIFPLKNRGG 
GE 


2775 


16676 


A 


2794 


199 


3 


VGPIFRHDKPPHSKQLCTYSPAPTQLFS 
SGKILKISriHLWPGTVAHaClSrpSTLGG*A 
RWITRSGDR 


2776 


16677 


A 


2795 


1 


302 


GGCSEPRSCHCIPAWATEPDSI *KINK* 
INK*K**KINENKKGKNHFLSSST* *PQ 
VASSFLTGQHRYTKL* SSQKVLLDSATL 
QSKSNAEVKHVNRTTD 


rm 


16678 


A 


2796 


228 


326 


NDNGQSGWAHACajPS ILGGQGGWI I * G 
REFKT 


7ms 


16679 


A 


2797 


270 


1 


KFGISAPFAPSPKV*KRGFP*lSrPLGNGP 
PVKIPPF*TPL*NFSKPGKNFPPFF*KT 
RFSKIFPNGFFFFFFFFEGVLLCCPGWS 
ALTRSW 


2779 


16680 


A 


2798 


307 


36 


FKNFC(33*EVSLCCPGWSQAPGVK*PSC 
LGFPKCWDYRCELPCLASCSL*CDRNQT 
RISAWLWIEFHLSPILPVLPLFLKKPQ 
AGLLCF 


2780 


16681 


A 


2799 


132 


5 


IFEVTV*CTK*HKR1IMQPGWGHACNPS 
TLGGQGGWIMRSGV 


2781 


16682 


A 


2800 


32 


314 


KQHPGNGPNPSGKGPGHASRFLKKKNFF 
F*KTPPKKPHTPCKPSPKGKGPERLPWH 
FFNQKFGPLPMGPNPDQHAGFCIiRDP*G 
GGKNRSPPTL 


2782 


16683 


A 


2801 


139 




AASTFYIFFETTLIPTLAIITR*aNQPE 
RLNAGTYFLFYTLVGEG 


2783 


16684 


A 


2802 


238 


3 


AASTSHVISSMYNIYIIIQFKTFPVFFF 
RDKVSLCHTGVJSAVAQS*LTAASAALTS 
CVQGILPTQPPEYHTRPLRLFL 


2784 


16685 


A 


2803 


110 


291 


KKLGVFGFGFGFKTKSWFIMQAKGQLPN 
LG*LKPLPRGFPQFSGLSLSGTWDYKHT 
PATR 


2785 


16686 


2804 


97 


2 


SYNNQDSWLEKEKTNRSVKQN*APIPE 
KKKKKNKVGGLTLPNCKTYYKATI I KTV 
WYWRKKRQ1GQ*NRIESPEIDPHKY 


2786 


16687 


A 


2805 


268 


1 


ATFLLFyYVFRPQIPYYL*IFILSLF*Y 
RHSGPPYVGPPQQYPVQPPGPGPFYPGP 
GPGDFPCRLPIRKWWLVWLMPQLHPTVW 
PTIIT 


2787 


16688 


A 


2806 


153 


3 


IPLPKGLLVPLFGVFLKWFFFFFFF*D 
RVSPCRPDWSAVAQSRLTASS 


2788 


16689 


A 


2807 


3 


216 


NAARDHIVKPSP* PLTGALSALVMTSGL 
AM* CHSHSl SLLI LSLLTSTLT I YQ *RR 
NVTR*TSYQGHHTPTDQKSLRYGIILFI 
TSEAFFLS*FF*AFyHSSLSPTPQLRGH 
WPPTGLSPLNPLEVPLLNTSELLAS *VM 
PLSLHIASNTKPTNQHTNHIPMKAQCNT 
INLIPRPPHTN 


2789 


16690 


A 


2808 


168 


354 


FLERESGFVAQAEGQGQNLS*WKPFFPI 
LKHFSCLSLLRSWNYGPTSPVPAKVGGF 
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N=Asparagine, P=Proline, 
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WRNKIY 


2790 


16691 


A 


2809 


61 


360 


YVSNSKCSNHRK*SLFFFFFFERESSFV 
PQVELHGRDLG*LQLWLPGFGKFPGLTP 
LRNGDDGPRPQPPANLGLLVKTGFSPVA 
HLGVNLGTLGDCPALP 


2791 


16692 


A 


2810 


^36 


3^" 


KMFFQMRYTMLKK* QATFGF I LALKNVL 
KTFLAP I FFFLRENSALVAQAGVQWRDL 
GPLFPLPSGLKRFSCLSLPSC 


2792 


16693 


A 


2811 


140 


210 


NAKITKC*KGYGKRGTLMHCCWQFKMGQ 
LLWKTGYQFLKKLKLELLSNVPIPYL*! 
WKKGWPYALLLAI 


2793 


16694 


A 


2812 


230 




VKLCPVSLKTGVRPLLTSSEVIFILNCK 
FNIGV*LLPGSLLLLLASAIVEAGAGSG 
♦TVYPPIAGNYSHfl^lPYALV 


2794 


16695 


A 


2813 


3 


345 


HEVRIDVHTRTCFTFGTI I lAI PTGVKV 
FS*LATLHGS1SIMK* SAAVL*AlHi'IFLF 
TVGGLTGIVLSNSSLDIVLHDTYYWAH 
FHYVLSIGTVFAIIGGFIH*FPLFSGYT 
LD 


2795 


16696 


A 


2814 


2 


184 


ARVGFHHIDQAGLKLLTL* SAYFGFPKC 
VroYRHEPPRIiALKFNNIKKKKKKKKKTP 
GEKI 


2796 


16697 


A 


2813 


276 


3 


ARVHEIDHGLM*HQPLGLK* SLCLSLPS 
SWD* * SMQPCPDDYGTRSDQKKPTPKSP 
YVSQTGLEHLGSSDLPASASQRAKITGM 
SHHALV 


2797 


16698 


A 


2816 


3 


94 


HENQYSSLIIIMAIAIKIjGIAPFHF*VP 
EV 


2798 


16699 


A 


2817 


93 


1 


GGGGCSEPRIiHHCTPVWVTE *lirPVSKKK 
TRA 


2799 


16700 


A 


2818 


1 


197 


GTRAQSLLLGRLRQKNQLNLGGRGCSEP 

KSGHLTPAWAT * PDLVSTKS CTLI YLVT 
QALHITLLP 


2800 


16701 


A 


2819 


109 


2 


GVF*GVFLAQGLTLVAQAEVQWHDLGSL 
QPCPPRLKRLV 


2801 


16702 


A 


2820 


2 


347 


ARAPGFAEWEVFVFSSCFVWMVPGGKGE 
ISI7sJELMQPSSY*ESQHFFPAPPRCSSCV* 
LCSLGPSLLGTLIFCQFTLSELPRFRKG 
* FSSSLKKVFCFGDRVLLCHPGWSAWQ 
TQL 


2802 


16703 


A 


2821 


1 


353 


CTRRDVTRESTYGGHHRPPVQKGLPYGI 
ILFITSQVFFFT*FF*TFYHSTLTPTPQ 
L*GHWPPTGITPLNPLKVPLLNTSVLLP 
SRVSIT*AHHILI*KNRNQIISSLLITI 
LLGLY 


2803 


16704 


A 


2822 


261 


400 


vehsnskfkenfiigqgtgchacnlntlgg 
RGGRI twrsgv*dqldqh 


2804 


16705 


A 


2823 


273 


97 


lwtprmqrltqlatvilrfhkdqgfaml 
prqvsns * aqai CHLGLPKC* dyrrepp 

GQK 


2805 


16706 


A 


2824 


159 


355 


KKFFSIFFFFFFFF*DRISLCHLGWSAV 
VQSWLTAAST 


2806 


16707 


A 


2825 


67 


338 


LSPEL*LY*IF*KPFS*QLSLTSHSPSL 
FSALLLLS I SPPTWLHSQSDKNLHLFS F 
LFRDSVLLCCLGYSAVT*SWFTVASNSW 
AQWLS 


2807 


16708 


A 


2826 


3 


359 


HEQKYSFLHDSQTLFCF^DSIPTPSISMD 
ETQQKSRLELVRISLLLIEPWLEPERLL 
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nucleotide deletion, ^possible 
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RSMVANHLVYDTSDSDDYHLLKDLQEGI 
QTLMGRLEYGSRRTGQMLKQTYSKFDTN 


2808 


16709 


A 


2827 


3 


338 


LERKLDTTFLDPAGGGDPILYQHLF* FF 
GEPEAYILILPRFGIVSHIVSYYFGRLE 
PSGYLRMG*PIISVGLLGFIV*SHHIFS 
VGVDIIIARAYSYFGSLVMAIPCTLEVFT 


2809 


16710 


A 


2828 


1 


406 


RHEGEKLRRPTFGPRHRGAGTAKMSASL 
VRAT VRAVS jKRKLQ PTRAALTLTP SAVH 
KI KQLLKDKPEHVGVKVGVRTRGCNGLS 
YTLEYTKTKGDSDEEVIQDGVRVFIEKK 
AQIjTnLGTEMDYVED*LSSEEVF 


2S10 


16711 


A 


2829 


332 


3 


GI IVMQATIATALIGYVLPLGQISL*GA 
TVIAYIIiRAIPFIGTDLVQ*I*VGCSVD 

SPTVTGFLAFPFVLAFIIAAVAALLLRF 

LQGTGSPNPLGGAS QS DKIAfQKS C 


2811 


16712 


A- 


2830 


186 




YSEREVAFFFFGNGFCF*TQAEWWGGNL 
G*LNLLPPR*KEFSCKIFPRTWNYSP 


2812 


16713 


A 


2831 


303 


590 


NI LTTLLNKQSKSNQLQNKI IHTVYI KI 
KI FFFFLRQSLSVTQADVQWHCLCSLQP 
PPPGFK*FSCLCLLSSWDYKHflPLHPJUSr 
FCIFSRDGISPS 


2813 


16714 


A 


2832 


1 


370 


EELITFRDHAVIISFLICFLVLYALFLT 
LTTKLTNTNILDAQDIQTV*TILPAIIL 
VL IGLPFLRILYI TDEVDDPSLTI K*NG 
HQWD*TY*YTDYGGLIFNSYrLRPLLLE 
PGELRLRDVDN 


2814 


16715 


A 


2833 


207 




QFFI FLRHSFTLFAQAGMQWRDLGSLQP 
SPPGFK*FSYLSTLRGLaEKITLAQEFQ 
TSLGNMGGPHPRA 


2815 


16716 


A 


2834 


223 


1 


DNLAHKGKTRVYLKSRNKLGKGGGAGNL 
VSLDSIGSRRDHRRAPPCPANFVFLV*M 
GFPHVGQDGPELLTSCSC 


2816 


16717 


A 


2835 


1 


321 


GTRKPSP* PVTGALSaLLMTSGLTM*LH 
FHS ITLLILGLLTNTLTIYQ*WRDVARE 
STYQGHHTPPVQKGLRYGI ILFITSEGG 
LFAGFF*AFYHSSLDRTPQLGGQ 


2817 


16718 


A 


2836 


57 


370 


IWMGRVLWKDRIiYGVFCRAFNRAVTSRV 
WAEDPWRVPKTLSVDPRKLPPFS* ISVR 
GQ I YFT* FFFFFETESHTVAQAGVQWWG 
DLG*LQPSPPGFKRFCLSLP 


2818 


16719 




2837 


2 


345 


ARAHRQLDEP * L* RRPGERHPS W* SEET 
VERQRTKT*SESSQTGTS ITSSRWAERR 
ESEKSLHLETLNKEEDC3ISPTFKPSTPD 
HPLKVMPAPSPKENAWVKRSSNPPAESQ 


2819 


16720 


A 


2838 


15 


376 


AKIEPLYSSLGNKSETPSQKGKKKKKKK 
KGKGGGEKKKKKKTEGGTRGLTQETQFF 
GNPKGPKHLGAHN* KHRAKQAKRGNQPR 
EYRFTEEKKGEKLFNPGGGRSQKPKTGQ 
WNSAGGKK 


2820 


16721 


A 


2839 


3 


336 


HELLAS I LL I Y* * CRDVTRERI I PLAHH 
TPPVQKGLRYGIILFITS*VCFLS.GLF* 
YLYHS SLPSTAQLRGHWPPTGI TPLNPL 
EVPLLITSVLLASLCSFI*AHHSLIEH 


2821 


16722 


A 


2840 


338 


3 


KI I FYLFFLRS lAFLAQI EGHWAVFS SF 
KLWPPGFKHFSASTFLVNRVFKGGPRAR 
VNFGFFIKSGFFPIF*VGF*LFFFFFFE 
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nucleotide deletion, \^possible 
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TE SRSVAQAGVQWHDLCSLQAPPPS S C 


2822 


16723 


A 


2841 





149 


ARGCSEPTLHPCTPAWVTE*DSISKKKK 
KKVECSNGNVCRTMC ILCCFW 


2823 


16724 




2842 


348 


175 


SACKWS ITGMHCHA*VIVSVLLV*TGS 
LHVGQVGLELRTSDDMHTLASVSGGITG 


2824 


16725 




2843 


338 


118 


DPGGGACSEPRLRHCTPAWVTKRDFVSK 
KKKK*NIKC*HAKSGTVI.YSFLYLFYS * 
YFLMRWSVHI PGPSTK 


2825 


16726 


A 


2844 




47 


KCVaiiLIiSIiFLS CSVM*RNACFRFAFCH 
DCKFLEASPAMLP IQSVEL 


2826 


16727 


A 


2845 


335 


139 


EVSPSWPG*SQTPDLKRSACLSLSKCWD 
YRPGVSHRftEUSrGSPLTQLYEMLTYFPI 
IWVTLQVFH 


•2827 


16728 


A 


2846 


3 


352 








KHHSLGGQYGVQGFPTIKIFGSNKNRP* 
DYQGGITGEAIVDAALSALRQLVKDRLG 
GRSGGYRSGLQGTSYRSSKKDVIELTND 
SFDN 


2828 


16729 


A 


2847 


2 


269 


ARGLFSTOHKDIGTLYGLFGA*AGVLGT 
ALSLLIRAELGQPGHLLGHDDISRIHVK 
Rl IPGCGTLDGISVGRECACLWTPACSR 
KCARG 


2829 


16730 


A 


2848 


317 


28 


SFLPN*KLAGHGGRPL*FQVLRRVRLEG 
CFSLEG*GCSEPWSCPCPLTWATETDPF 
SREKKKRRRREEKAEKRREWRRVPSLSV 
FLEEYVELTLIL 


2830 


16731 


A 


2849 


326 


107 


FHHVGQDGLDLLPS *SAHLGFPKCWD*R 
CKPPCPASRNFKTLKLYHIFSNNLIE*N 
*KSITRGTLETVQLPGN 


2831 


16732 


A 


2850 


224 


3 


S I LQAWLKVP SP PGNPLGLLQSQVDFFF 
*RQGLALSPGLDCSGMIMAHCSLKLLGS 
TDPPTLAS*VAGTPSSC 


2832 


16733 


A 


2851 


27 


329 


VRTAAINRPGFLLPCFIGQNFFFY*RFE 
TYRLGV*MILTPH*YLLSSLSGLDCVIF 
FSFILFFYWFIVYCHVFMVYLFFFLFVI 
LFVLCVFWCFYFLLFV 


2833 


16734 


A 


2852 


351 


60 


NLLSSLQGGRLPRPPGLSPHPQIFTPHG 
GVPL*LPGPWKSRPEASLNPRS*PPPGQ 
PRGNPFPPKKNFFFFETVLLCRPGWSAV 
AQFQKKKNLLIC 


2834 


16735 


A 


2853 


186 


365 


LTQLLNLI *YQNQIEYPQPYAQCGPRSG 
NAQQIGHSFPNG*QVPASGMHGLPWTHQ 
GLKK 


2835 


16736 


A 


2854 


17 


239 


HFTTLQISMSLLTETLARHDSTHLWSQL 
L*RLRRNDLLKSGAGGCSELGLHHCTPT 
WTTEQDPVSRPGAVASEH 


2836 


16737 


A 


2855 


325 


140 


GFHYVSQDGLDLLTS*SACLAFPKCWDY 
RHKPPCLAAYIFIQSRVFIiAVISISTHS 

SFQLL 


2837 


16738 


A 


2856 


1 


157 


GTRGFHHVGQAGLEVLTSDNLPVSASQS 
AGITGESH*AHERSDGYANTNSPG 


2838 


16739 


A 


2857 


157 


361 


LLLRIVSLFWRKSSPLGVQAEMYRPDLP 
VTKPPPPD*DKDSCLSLLITWDYRLAPP 
LPANLEF*I*TG 


2839 


16740 


A 


2858 


3 


346 


HEGNHKFKIINVHLR*LNQNL*SGLEGD 
KVIWENI *TSVTFLPC *MNGXMHPDLFE 
QFTYEVGRI FFFSLKD* VWLCCLGSS TV 



720 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last amino 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














DLGSLQPPPLGFKQFSCLSLPSSWEPPC 
PA 


2840 


16741 


A 


2859 


18 


233 


KNPTILCCKGGGAQA*KPRIiLGRLRQEK 
GLTPGGEGCRDIiKSPLQIPPWATRPKLIi 
LKKKKKKGGKKKTLF 


2841 


16742 


A 


2860 


356 


3 


RVNFKIFGKKGFCPGGPGGFEVLTPGDL 

GPLFFQKGGDSPSKPLKPFFFCFALKST 
TLNYFQFAFCVLRPTPN*SVTQAGMQ*H 
DHSSLOPOPPSLK*SSQLGFPSSWVYRH 
APPSC 


2842 


16743 


A 


2861 


115 


374 


IVCGKILNV*YTQ*HNKLQNKNKVD*PT 
IKLMLISQSLGILIFIQKRDFEVNYGMI 
LKKRGWPGPVAYVCNSKTLGGQGGQIT 


2843 




A 


2862 


163 


360 


LKRIWLLILHSHPHTSPHDRGVTQIiEAI 
GPRWY*TYEYTDYGG]:.IEI!TSYILPP]:.FL 
EPGDLRLLDA 


2844 


16745 


A 


2863 




236 


GTS PFFS FFFLS FFPSFLPFLS PFNPTT 
Ij:iLSmSNIS*KKVLFSFEMESHSVNR]:. 
ECSGTISAHCSPAWATGTDSIS 


2845 


16746 


A 


2864 


67 


364 


VCVCVCVCVSVSVCVCVCVYICTYICMY 
VCA*YRIRRYIMLPTLVY]SrSCPEIHDSK 
SCALHTGA.ITEATR*PCLCIHIPVCLTV 
CLTWIiHICLSVCMD 


2846 


16747 


A 


2865 


2 


155 


ARGLTQENRUSriiGGGGCSEriRSHHCI PA 
WATE*nALSQKRKKKKGTYRGII 


2847 


16748 


A 


2866 


344 


81 


IQPNSFIFLKIFISRDGftLIiCCPGWSPN 
SWAQQSSRLNLPKCWDYEHEPPHLWQF 
L*KHYMYKWHKUHCVFSHYNKIFTGQTK 
TEN 


2848 


16749 


A 


2867 


217 


340 


RVFSDFYMNLKWSLRKRRTWPGLVAHAC 
NPSTLAG*RGQIT 


2849 


16750 




286S 


3 


349 


HEATSPIIQELITFHDHALIIIILICYL 
LLYALYLTLTTKLTITIMSDAHEIHTA* 
TILPAIILDLIDLPCLRILYVTDEVNDP 
YLTITSIGHQWY*TYEDTDYGGLIFNSY 
ILT 


2850 


16751 




2869 




352 


GTRIDVYTRAYFTSATIGIAIPTGVKVF 
S*IiATLHGSNMK*SA!^*ALGFIFLFT 
VSGLTGIAnjANSSLDIVLHDTYYWAHF 
HYVLS IGAVFAI IRGF IH* FPLFSGYTL 
DQTYA 


2851 


16752 


A 


2870 


2 


342 


ARARFRTSLLLAFALL*LPWLQEAGAAQ 
TVPLTTLFDHTMLQDHRAHQLAIDTYQE 
FEETYIPKDQKDSFLHDSQTSFCYSDSI 
PTPSNMEETQQKSNLELLRIFLLIilESR 

L 


2852 


16753 


A 


2871 


254 


2 


YPGYQAI SQSDMQSYAYRHL* LQPPGVK 
*SSIVSLLSS*DHRPVPPCLflllEKNFFF 
*RQGLAMLQSRLVLNSWLLLYSHAEPRA 


2853 


16754 


A 


2872 


282 


3 


H'raiLHYIYIHHGHIST*HLSPPTNIYS 
LCNIKILIYTPYDH*IIRIPLHQEHAVH 
RRELIHIHIHTSVCVCVCVSVCVACVCV 
CVCVWVIV 


2854 


16755 


A 


2873 


278 


3 


LWPLKNSGPQPWPQMDLGKPQVKTPWY 
YKKGP*KKFFRKF*KWPGFFFFFFLRQS 
HSIAEAGVQWRDLGSLQPLPPGFKRFSC 
LWQISSC 


2855 


16756 


A 


2874 


2 


348 


AHA*SLILVSLIIPNATSNLLGIiLPYSF 
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M 
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od 
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nucleotide 
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ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 

acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Vallne, 
W=Tryptoplian, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TPTTQLSIN]LAMDIPL*AGTMVIGYSSK 
IRHALAHFLPRGTPTHLIPILAMIESMS 
LLIQPIAMAVRVTDNITAGHLTMPLTGS 
PTV 


2856 


16757 


A 


2875 


1 


260 


GTRENHLDPAGRGCSKSRSHHCIPAWVT 
D*DSLKKKKKKKPQIILGNGGTEGNQK* 
RGNPFRGPWGPSRQKIKELGPGGKHGKS 

KF 


2857 


16758 


A 


2876 


176 




EHLFCKKLLGEEKGSLSKEFGKCWVNPR 
♦KWNSRPGWAHTCNPSTLGGRGGQITR 


2858 


16759 


A 


2877 


168 


1 


GVRVFGFSSVPEFSFDTCAGAQWKNLGS 
LQPLPPGFK*FSCLSLLSSMDYKRTPEA 


2859 


16760 


A 


2878 


67 


333 


WQGLGRAVARVPRSYEITWAGNLRGAA* 
FQQLFFFFFFHLLFFFFFFFFFFFFFFF 
FFFFFFFFFFLFS FFFLFFFFFFFFFFF 
FFFFL 


2860 


16761 


A 


2879 


335 


1 


GRKEGRREKEKERKKKKERKKBRKKTIF 
KKHIPSPLAFPRKAWKLKEIQIKLV2UI* 
KINRQAIRGCPPNKRPHPFQNMIKPGRE 
EGEGGLETSRDGERMLWIFGPSVGPLR 


2861 


16762 


A 


2880 


3 


345 


HERHETASIILLIAILFM]!TILSGQ*TIT 
NTTNQYSSLIIIMAIAIKLGIAPFHF*V 
PEVAQGTPLTSGLLLLT*QKIiAPlSIIY 
QISPSriNVSIiLIiTLSILSIIAGS*GGLN 
QT 


2862 


16763 


A 


2881 


347 


156 


WLIFVLLVEMGFFHSGQAVLKLIiASSDP 
ASMTSQSSGITGLRHCAQPR*AFLIALC 
LPPSAKI 


2863 


16764 


A 


2882 


338 


2 


I KKaL*HE *LREKKKSPIRTSQSSKaP* 
NLPRSKKKKESSI!I1IYQLL*AHKLQKVKE 
MGKLLETPNLPLLSQKVAEPLNQPITSS 
KIETLIKKPYHPEKSYGHDEVTAKFSRA 


2864 


16765 


A 


2883 


206 


1 


IRTRPFTPMFIFYMFVYrATRSCPFTLA 

GVQWHNHGSLQL* PPGLKESSCLSLLSS 
WDYYACLHAELV 


2865 


16766 


A 


2884 


341 


140 


DGGCREPRSRHCSPAWVTE*DSKQNKTK 
QKQKRYTNISSPQTSPLCYSFSAYTPLH 
NMPGNIMFSSL 


2866 


16767 


A 


2885 


360 


247 


FHRVSQDGLDLLTS*SAHLSLPKCWDYR 
REPPRRPKAL 


2867 


16768 


A 


2886 


229 


351 


KLNNRPGWAHAFNPSTL*KLNNRPGW 
AHAFNPSTLGG*GGWIiyiRSGVRDQPNQH 
GETP 


2868 


16769 


A 


2887 


359 


2 


FYSSSSSSSELVP CRQDVQVPHYLE GL F 
LRSCFMEIKYDT*KRKIICYTHVHTHTHT 
HTHTHTQREKERDFPRFTAKWKAHLEAG 
SGYATASSIRAVLWTTITLV 


2869 


16770 


A 


2888 




305 


ADSHMWKYKRPGITKILLKKSKI *GYHQ 
DWISSRYQDLFHKDMIFKIMSYWYKKRQ 
VDHWDRIESPETSSQMYEHLIYNKDNIA 
EW* EKDS FLSKWCWDW 


2870 


16771 


A 


2889 


1 


436 


I AI LT * YDYTLL * RVGPRGQE FGTEATV 
ITNLLSAIPYIGM)WK*G*GGySLDS P 
TLTRFFTFHFILPFIIASLATLHLLFLH 
* TLSNIPLRITSHSHQITFHPYYTDQYS 
LRLLPFLLSFTTLLLILPNFLFYPP*TT 
LTFPF 
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nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
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od 


SEQID 
NO; in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last^amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l,=Leucin6, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl{nown, *=Stop codon,/=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 


2871 


16772 


A 


2890 


1 


218 


EKISPLIKLINHSFIDLPTPSNISA* *]5T 
FGSLLGACLILQITTGLFLAMHYSPDAS 
TAP S S I AHI TRDVNYG* I IRYLHANGAS 
I FF I CIiFLHIGRGLYYGSFLYSET*NIG 
I ILLLATIATAFIGYVLP*GSLLGACLI 
LQITTGLFLAMHYSPDASTAFSSIAHIT 
RDVNYG 


2872 


16773 


A 


2891 


3 


368 


LSMS *AHNLI *LAYTIAFIVKIPLYGLH 
L*LPKAHVEAPIAGSIVLAavtLKLGGy 
GIIRIjTLIIjNPI.TKHIAYPFLVLSL*GI 
I ITSS ICLRQTDLKSLIAYSSI SHIALV 
VTAILIQTP 


2873 


16774 


A 


2892 


3 


342 


HENLI*LAYTIAFIVKIPLYGLHL*LPK 
AHVEAPIAGS IVLAAVLLRLRGYGI IRL 
TLILNPLTKHIAYPFLELSL*GIIITSS 
ICLRQTDLKSLIAYSSISHIALWTAIL 
I 


2874 


16775 


A 


2893 


2 


361 


ARVCLRQTDLKSLI AYSS I SHIALWTA 
ILIQTP* SFTGAVILIIMGLTSSLLFC 
LANSNYERTHSRII ILSQGLQTLLPLIA 
F**LLASLANLALPPTINLLGELSVLVT 
TFS*SNIT 


2875 


16776 


A 


2894 


229 


3 


YTFVQSFIFSWLFICaCVCLL*PAHSMV 
CACECMCVCVCLCVCLSSLKIIPLLECL 
lYLQLILVLSSAIYHSRFE 


2876 


16777 


A 


2895 


3 


391 


gfl itnnis pas pfqtti plylkltala 
dtflglltaldlnyltnklkiksplctf 

yfsn:lgfypsithrtipylglltsqnl 
plllldlt-lekllpktisqhqistsii 
tstqkgiiklyt'lsfff 


2877 


16778 


A 


2896 


381 


2 


GRCRVSGSIIDHRMAPQKWRYTKGGPHQ 
WHMFCFF*KKGVFPYGPKVFLFRAPVFS 
PPGPPKRWEIKV* TPPPALFFFFFFFFV 
EMGSHCVAQAGLKLLGSSTPPTSASQGA 
EIAGVSHRAQPHAS 


2878 


16779 


A 


2897 


324 


1 


KKQQTKNNIKRHITESARKMS* *HRNIQ 
KKKKIENSHRaSD*ITSSGGRSRA 


2879 


16780 


A 


2898 


120 


345 


PPAPSS*YTTSLIQDHLFLiyiMa.VLSSAS 
LMRGNVGSNIMNALSHFLPQGTATLFIP 
VLDIMEAISLLIQPIAIiAV 


2880 


16781 


A 


2899 


381 


219 


CVSQDGLDLLTS* STRLSLPKSWDYRCE 
PPRPAGSGHLYSSFKKQTGDTHHNF 


2881 


16782 


A 


2900 




224 


LSWCLTLYFAYLLALFYFYFLKILDLAI 
LPRLVPYSWP*ASSPASASQSSGITGMS 
HYTWPLPQTmPFLFFVP 


2882 


16783 


A 


2901 


1 


228 


KYL INMRLI TTQQ *L IKLTSKQMITIHN 
T*GY*YNRSGSSFNGVYDYHLLRSESEH 
PWMIVDNTEYDEIYTRGGIE 


2883 


16784 


A 


2902 


2 


332 


LPPSFSLLLAPSPSSSLPPSPSSSLPPS 
PSSSLPPSPSSSLHTSHSSTLTPSASCT 
LTITESFTQRLAQSYITILP*ASRTTMQ 
LLKYRTLQTTRNNTHSMTRPKALTKI 


2884 


16785 


A 


2903 


3 


243 


DIGFGTDFSDMTPKAQTKQVDQLDFIKI 
*NFWPGTVAHTCNPSTLGGRGM* ITKLG 
V* DQPGQRGETTQKLAGHGGVDAV 


2885 


16786 


A 


2904 


179 


2 


CGYFNAYICYSLLCFLYLSLCNQPFWKK 
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Predicted 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
<3=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methlonine, 
N=Asparagine, P=Proline, 
Q=rGlutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IFLPLFGLVFFFFFF*WRV*FCCPGWSA 
VSQ 


2886 


16787 


A 


2905 


108 


2 


GRVD IKLTSKQMIT IHNTKGRT * S L I LV 
SLIIFIA 


2887 


16788 


A 


2906 


318 


2 


YKESNSKNNEKKKITRQRKRQKKNRPSM 
MVHTCNPSPMAGRDGQIPSILFNQPHSP 
RIIRLTLILNPLTKHIAYPFLVLSL*GI 
IITSSIGLRQTDLQSLIAYSS 


2888 


16789 


A 


2907 


17 


308 


KLAGYGGMCLWSKLLERPTRMNHLSPRS 
EGLMEP*SH1'JCIPDWTP* QNP VSVQ I TG 
FCFLTPNLAMPPRLNLSGPICCQLSLRL 
LGPRLFAASVSGV 


2889 


16790 


A 


2908 


378 


1 


RSLHSTETAAKTHGHQTHPGLSTFSK*T 
PTPIASIPIPNPHNCLPAFNSTTSASYF 
SECVLTSSFKEIIADFR*KYSSTN*HPY 
HLFLLKFVFFCSRDGSLTMLPIRPGLNS 
RSQAILPPWPPK-i/L 


2890 


16791 


A 


2909 


1 


129 


HLFGTNHRDIGTLCLLFGA*AGVLGTAL 
SLLIRAELGQPSTRP 


2891 


16792 


A 


2910 


1 


323 


CVCWGYRIPECNIGI-IHLKFFFFSLFFFF 
FGNKVWFCSEG*RAGDQLLIMEPLASGF 
KGIFFLNLPQNWE*RVSPKLPGKFWIF* 
LKTGFPLVAQVWELRTSGDQMa 


2892 


16793 


A 


2911 


367 


136 


ETPGNHLSPGV*GCSEL*SCLCTPAWAT 
EQDPISKQKRTKKSKTLLKMTPCADLTRW 
KHIHKSACVCICLCMYMCLRE 


2893 


16794 


A 


2912 


258 


140 


fliffcldcflnikvlgvpgfffslkkk 
qt*rqglalsprlecsgliiahc:tlepl 

GSRNPIiTIiAPQIAGATGMSH*VQGYHEL 


2894 


16795 


A 


2913 


48 


242 


YSMI FCVL ILPVSFTSSDGI FLLINLFL 
FY*STYPIAFFKTESHSVTQAGVRWHHL 
GSLQPPPPR 


2895 


16796 


A 


2914 


309 


76 


GFIICVCVCVCVCVCVCWFIFTEYSFH 
SLFC*ISCCVFyFIALSI.LIGFYriFWNR 
YLEVYNISFEVWASIiENEFLLIi 


2896 


16797 


A 


2915 




122 


FSVETGFHHVGQPGL*PPTSGDPPSLAS 
QSAGITGMSHHA 


2897 


16798 


A 


2916 


167 




GNMCSKESVSGTNINRKPD* * YNPRLGT 
VAHAYNPSTLGGQGRWITGGQEFKTSL 


2898 


16799 


A 


2917 


2 


356 


ARGILLLRIMLTTLTRYP* *RDVTREST 
YQGHHTPPVQKGLRYGIILFITSEflFFL 
PRFLPPFLSSSFPPTPSLLKPFPPSSFP 
SLPSFFPPFIiPSSLFL 


2899 


16800 


A 


2918 


2 


351 


AREYTSLQLILQMTFIMRFTCTDRTI.YE 
lAFEC* L^4RTLDI ITR*GNQP* RPNAGS 
YWLFYTLLGTLPLLIALIYTHHTLGSLH 

ILLLTLTAQELSNS * ANNLI * LAYTIAF 
IVKI 


2900 


16801 




2919 


32 


321 


ALMGITFFFFLGKGVLAPRGGGRGGNPG 
LWGGPPPGLGPFSG*SLQGGGA*GPPPQ 
AGANFGFLILRKTRGSPGCPGGFWIPGP 
GDRPAGAPQGGG 


2901 


16802 


A 


2920 


1 


361 


GTSTRLGVLLLSLHHAGSINPLGITLHS 
DKVTLHPYYTIKDALGLLLFLLSLMTLT 
LFSPDLLGDPDNYTLANPLNTPPHIKPE 
* YFLFAYTILRSVPNKLGGVLALLLS IL 
ILAIIPIL 


2902 


16803 


A 


2921 


3 


152 


HERLYSVPLRILRRLPDPLSIPQGWRHV 
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ing to first 
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peptide 
sequence 


Predict- 
ed end 
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location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Piienyialanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-^lMetliionine, 
N=Asparagine, P=ProIine, 
Q=G]utamine, R=Arginine, S=Serine, 
'l='lhreonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IRESAYKSHLTLPARRGLQDGI ILFITS 
EWFFAGFF*AFYHSSLCPTPQL*GHWP 
PTGITPLNPPESPLLNTSVLLASGWIT 
♦AHTKAISHYLREEAFKTG 


2903 


16804 


A 


2922 


410 


-1 


RaGYRQKWGSLATVIGQLGLPVERGWYG 
PSGEGGTEQRHCSELVLDDIiHVITQNRA 
NHRPRCGGSLLSELKFAPLRLQPGRQSK 
TLSQKKKKQTTTKBaSTTKKQPGSVflHACN 
PSTLGG*GRQITLPSGVRDQPGQH 


2904 


16805 


A 


2923 


2 


145 


ARAftRGDVI*SFHVFAQRDLNLLSSRDL 
PALASQSTGITGMSHQCPG 


2905 


16806 


A 


2924 


29 


340 


EMNEVEERGRK*LFHDSIQSR*EYRCAP 
QCPAlISCVLM*RWGFAMLPBajVSSDLPA 
IiASQSAGITGVSHCAWLVFLPLLSTSFI 
SQRHLYLPRA 


2906 


16807 


A 


2925 


2 


337 


ARVLILPGFGIISHIVTYYSGKKEPFGY 
IGMD*AMIS IGFLGFI V*AHHI FTVGID 
VDTRAYFTCAT 1 1 lAI PTGVKVFN*LAR 
LHGCNMI*SAAVL*SLGFIFVFIVSGQA 


2907 


16808 


A 


2926 


76 


388 


RYCTPAWATETPSQKKEKKKKCQTPEPE 
SSFPVCLENGGGGNFISLFFLPLIILSL 
FFYFY*PFFIIFVYSYyYFYFCLFSYFF 
FYFFINLIIFLLLNYFFFFI 


2908 


16809 


A 


2927 


48 


374 


KDQEPTDMGSAHFQVFKGWRQVGGftNED 
RINSHGGPVMFSGQYYGQLVRFLVY* II 
IFFMLLFFSLFYSYFLFISIFFVLVFYC 
LFIFLFYFSFFFIFVFFFYYFIVIF 


2909 


16810 


A 


2928 


3 


294 


HEGEHYISQDGLDLLTS*SARLGLPKCW 
DYRREPPRPAHFPTLRIWLYILKALYTV 
LGCYREYEAYRPWCWTHRMDGGQEVEGR 
REGTGGGHILGF 


2910 


16811 


A 


2929 


7 


420 


IRERDVLLLTLTS L S PTAAR I C YNGGRR 
GNRLNSLAYLSKELLAAWSLRKPSHGLL 
TPIRCVLYIRGHHF*LLPPSHLLLIjASA 
IMGAAGGTG*TV*PTLAHNYSQPGVCVN 
LAMVSLHLSGVSS ILGAITFITAIMNME 
APAITQYQTPLFV*YGLITADLLVLSLP 
DLTAGITILLTDRKLSTTFFDPAGGGDP 


2911 


16812 


A 


2930 


11 


253 


GLLHKAPSP*KFFFSPKPFNFFWKFSPI 
FSPPKKKFLSKNPHIVFKFPPFKGKIFT 
FPPPLKFGPPRVFFKAPPPFFFFFPFFF 
FFFFF 


2912 


16813 




2931 


70 


341 


DSS FFS * LATLHGSNMK*SAAVL*ALGF 
I FLFTVCGLTD lAIiPNSSLD I ALLDTYY 
WAHFHYVLSLGAVFAI IRGLIH*FPLF 
SGYTLDQS*AKIHCAI IFIGVYITFFPH 
HSLGL 


2913 


16814 


A 


2932 


3 


340 


NYSHPGASVNLIIFTVHLAGVSCILRAI 
NFITTIINIKPPAITQYQTPLFV*SVLI 
TGVLLVLS I PVLSAG I TILLTDRNLNTT 
FFDPAGGGDHILYQLLF*LFGHPEAYIL 


2914 


16815 


A 


2933 


2 


364 


RVQKGLRYWIILS ITSEVLLFAGYF*AL 
YHSSLAPTPQLGGHWPPSGITPLNPLEV 
PLLYTYVLLASRVSFT*AHHRLIQ]SnsIEN 
HIlQALLITILLGLYCTLLQASiNFEPP 
FTiSDGIYG 


2915 


16816 


A 


2934 


2 


367 


prveprvryij:,fga*agvlgtai.sllir 
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nucleotide 
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correspond 
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Predict- 
ed end 

location 
corrcspon 

last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














AELGQPGNLLGNDHI YNVIGTAHAFVI I 
FFIVIPIIIGGFGN*LGPLIIGAPDMAF 
PRINNISF*LLPPSLLLLLASAIVEAGA 
RTG*TVYPPL 


2916 


16817 


A 


2935 


3 


365 


YHIV* PNP* PLTWAISALLMTSGLTM* F 
HFHSITLVILGLLTNTLTIYQ*WRDVSR 
*STYQGHHTPPVQKGLRYGIMLFIT*DR 
FFFAGVF*ALYHSSLAPTPHLGGHWPPT 
GITPLNPLE 


2917 


16818 


A 


2936 




144 


DSHGRHViUG*TCVTICFIRQLI(3HFTS 
KHHFGFEAAA*YWHKKKKK 


2918 


16819 


A 


2937 


3 


422 


QRLIiAraHHDIGTi:.YLSFGA.*PGALGTD 
LSLLIRAELGQPRDLLGNDHIYNEI«TG 
HALGILFFIGLPIIIEGFGN*LHRLIIG 
APDMALPRINNISF*LLPPYIiTiTiTiTiRSA 
IVEAGAGTG*TVYPPLfiGNYSHPGASVE 


2919 


16820 


A 


2938 


3 


382 


RTRGLFSTNHIDILTLYLLFGA*flGVLG 
ARLSLLIHAELGQPCNLLCNDHIYNVIV 
TAHRFVI IFF IVIP 1 1 1 * CFGN*LLPLI 
IGAPDMAFPRI1SI]SISL*LLPTSLLLLIA 
SAILEA* SGTG*TV 


2920 


16821 


A 


2939 


225 


3 


NISWRMKQIVPKGEPYNGVPVFVSLWFH 
KMFIFETGSHSVYQAGVQ*RHLGSLHPH 
PPGIKQFYLSLPSSWDY 


2921 


16822 


A 


2940 




247 


PTRPPVIYSTIFAGTi:iITALSSH*FFT* 
VGLEINMLAFIPVLTKKINPRSTEAAIK 
YFLTQATAS 1 1 LLI AI LEKNI KKKKK 


2922 




A 


2941 


3 


369 


TRDSTYLGHHTPPVHTGLRYWRILFITS 
DAFFFAGFF*AFYHSSIiAPTPRIjGGHWP 
PTGITPLNPLEVPLLYTSVLLASGVSVT 
*flHHSLVEDDRYQIIQALLITILLGLYF 
TLLQASEDFE 


2923 


16824 


A 


2942 


3 


401 


LTVCCVIVLRLKTLFFFFNKPFLTQKGY 
FNTPEEGFFKKPNRRVGPPSPMTDPTML 
TNLI KGKVPKAP PRI LMGGG INMTF*GF 
VTPRAPFPLTLRLNPMLCXJGIDLLTLKA 
SGGSPASGNFLKELGLRSINF 


2924 


16825 


A 


2943 


189 


348 


RKftRGWVTDKEKRLRLGMVAHACNSSTL 
GGDGR*IA*GQEFETSLaNMVKPCL 


2925 


16826 


A 


2944 


2 


349 


ANSNYERTHSRI 1 1 LSQGLQTLLPLIAF 
**LVASLftNLALPPTINLijGELSVLVTT 
FS*SNITLLLTGLI ILVTALYSLYIFTT 
SQWGSLTHHINNIKPSFTRENTLMFILL 
SPIL 


2926 


16827 


A 


2945 


621 


929 


GCS SGTGCCP I LCDLPRPWS CRGVGGSP 
SSTAHLCPRGWRSGRCFLPPLSAS*VDS 
AMSLIQAAKNLMNAWQTVKASYVASTK 
YQKSQGMASLNLPAVSWKM 


2927 


16828 


A 


2946 


379 


395 


SQHFGRLRREDHEVRSS*PRDPPALTSQ 
SAGITGMSHCARPLVATSIHKI DNRYI D 
DRS*NINIGTIFINLLIFyLSIY]:iSIYL 
SIYLSIMYSFTVAQAGVQWRDYGSLQPE 
SPGPK*SSCISLSSSHlIHSHTPPHGWVn 
PKIP 


2928 


16829 


A 


2947 


2 


357 


HTYDIRKPRR*TLACALSALLTTSGLAM 
TDCFHYITLLILCLLTNTLTIYQ*WRDE 
TREST YQGHHT PP VQKGLRYG 1 1 LF I T S 
EAY FFAGFF *AF YHS SLRPTP QLGGHWP 
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residue of 
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ding to 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
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W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PTGITP 


2929 


16830 


A 


2948 


2 


357 


PRVRYSTDHSDIGTLYLLFGA*AGVLGT 
AISLLIRAELGHPGNLLGNDHIYWVIGT 
AHAFVIIFFIVIPIIIGGCGN*LGPLII 
GAPDt1AFPRINNISF*LLPASVLLLLAS 
AIVEAG 


2930 


16831 


A 


2949 


2 


217 


KNLKIVQYGDMCLWSQLLRELRWEDRLS 
SGG*GCSELCSCHCTPAWTTQQDCLKKK 
KNFWPVGEVK6RHML 


2931 


16832 


A 


2950 


2 


354 


AIPMTTAELTIHEAYLIILERTTTTTKD 
VKKPRRIAAAITASCLGGGLEDAISCQY 
GIATKDRPTGLGTPEVIiGALPGAGGTQ 
RLPKMWVPAALDMRLTGRSrRAHRALK 
M*LVD 


2932 


16833 




2951 


3 


150 


LFSCSPTFSSDPLTTPLLILTT*LLPLT 

lyASQRIILSSEPLSRKKKKKK 


2933 


16834 


A 


2952 


283 


344 


EKKKKKKKKKKKKKKKKKKK*KLRKKKK 


2934 


16835 


A 


2953 


3 


378 


DflWADAWNQTPLFN*SVLITAVLLLLSL 
RVLAlMSITILLTDRNLlilTTLFDPAGGGH 
PILYQHLF*CFGHPEVYIL1LPGLGIIY 
RIVTDYYGTKEPPGYIGMV»AMRSIGFL 
RFIVEAHHIVTVG 


2935 


16836 




2954 




387 


MVTYYSGKEEPFGYIGMG*AMISIGFLG 
FIGRAHHIFTVGIDVDTEAYFTYATIII 
AIPTGVKVFS *IiATLHGSNMK* SAAVL* 
AIiGFIFLFTVCGLTGI 


2936 


16837 


A 


2955 


2 


432 


PRVRPRVEKHIAYPFLVLSL*GII ITSS 
ICLRQTDLKSLIAYSS ISHIALWTAIL 
IQTP*SFTC3AVILIIAHGLTSSLLFCLA 
NSNYERTHSRII ILSQGLQTLLPLKKKK 
KKKKKKKKKKKKGGGLLKESLGGPNLTG 














EGK 


2937 


16838 


A 


2956 


12 


391 


SYFISSSKPHLSPPWLSSPDEATSKKKK 














KGGGFK*KRIGGSFKKGGGEKKIPPFKG 
GKKKKTGGFFEKKPPLGGEKY*DNLPKK 
NKTVGEKKNFLGSG 


2938 


16839 


A 


2957 


1 


352 


PTRPYFPVUAGEAQHHPRTCRRPLRALW 
SSHHERWKVTLCTHCSLGVFFLYCCTYY 
IFVLFIP*SSCGLTLIFTTCIILFGSIS 
FFLFPTIVFS I I IVTTFKFRLLYS I I FL 
SYLLG 


2939 


16840 


A 


2958 


1 


243 


NLPRLNPKEIEILNRS IIGNKI KAI IKS 
LLSKKSSGHNDVTASFY*TFKEELITIL 
LKFYLTPKKKKKKKKKKKKKKGGPF 


2940 


16841 


A 


2959 


373 


3 


FSSLKKRVTEPPPPKTGFSLEGLHLLKK 
NFPQKPPPPKKSFSQKNPPPPKKKPPF* 
KKK?PPPPPI*HPPPKILQPPPPPFFFF 
FFFPFFFFFFFFFFFFFFFFFFFFTDMR 
GFITOKFRTRG 


2941 


16842 


A 


2960 


2 


136 


PRVRSTLPISYKWNNRALMKAHLIMK*F 
TEYFKTTDELYLHDNT 


2942 


16843 


A 


2962 


1 


^58 


HTPPVRKGL * YG I I LF I TS KVFFVPRF I 
*AVYHSSLCPTLQLGGLWPATGMTWLNP 
L*VPLCTTFVLLA* * VSIT*AHHNLIDL 
NRCHVIYALVITILLRLYFTLLPASEYF 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EftPLTIS 


2943 


16844 


A 


2963 


2 


373 


RAYDIVRPRP* PLLGALSALLMTYGLAM 
*VHWDCITLLILGLLTNT1AIYQ*WRDV 
SRESTYQGHHTPPVQKGLRYGIILFITS 
EDVFFAGIV*AFYHYSLAPTPQLGGHWP 
PTGITPLNPLEV 


2944 


16845 


A 


2964 


403 


61 


LFFPLEKNFLPPGFSAFFSPFSP*KFFF 
SPKALI FWGNFS PFFPPPKI RFLPKI PP 
WFFSPPFWEKLFSSPPPLNFGPPRVLF 
KGPPLFFFFFFFFFFFFFFFFRFEGECW 

RL 


2945 


16846 


A 


2965 


2 


360 


L I PNLAMVTR*GNHPQRLNAGTYLLFDT 
LDGSLALLIGLTYTRNTLGSLWILLTI 
TAQELSNS *AHNLV*LAYTLAFIEKIPL 
YGLHL* LPKAHREAPIAGS I VLAAVLSK 
LGGSGKN 


2946 


16847 


A 


2966 


321 


2 


STGMHFPHINMAINPPPJJPRPPPPLFPP 
NPKTKPNPTQ*KGGFPPLGPF*KKLSPF 
LTLFFFFFFFFFGHHPGPWQKKKKKKVF 
FFCVCFSFQRVHNIYKNTHQQQ 


2947 


16848 


A 


2967 


200 


1 


RRTYTSHLLACIJIQGLAFSPRLECGGKI 
RAHCSLQLYGSSDPPT*APQTAGTKQHN 
QRIAQCNADN 


2948 


16849 




2968 


2 


354 


lltassseiaplqsslGdrarlci.ik*k 
egvlnsi*sgnqgksyanvyrllyldpi 
pkiyaeaytp*nvnstnletkspktiqk 














LLFLV 


2949 


16850 


A 


2969 


99 


400 


ALGIHFIFrVSGLTGIGLaNSSLHIVUI 
DTYYWAHFHYVLSLGJVAFAIIGGFIH* 
IPLYSCYTLDQTYAKIHFTIIFIGVNLT 
FFPQHFLGLSGMPRRY 


2950 


16851 


A 


2970 


2 


377 


NILLLTLTAQELSNS*ftNNLI*LAYTIA 
FIVKIPLYGLHL* LPKAHVEAPIAGS IV 
LAAVLLKLGGYGIIRLTLIUJPLTKHIA 
YPFLVLSL*GIIITSSICLKKKKKKKKK 
KKKKKKKKHGGPF 


2951 


16852 


A 


2971 


1 


407 


GTRS Y tHL !i kVFLEL IKMS VYDLNHTVI 
MVISGHVRLAFYGIVHLTLIIilSrLLTDHI 
LYPFLVLSLSGVI I ISSIFLRRTDLESL 
IAYSSISHIALWTAILIQTP*SFTGAD 
I LI IVHGLTSSLLCCIi2iNSNYER 


2952 


16853 


A 


2972 


10 


259 


SRSVAIYFKGMASA* *RMFSSKKKKKKK 
KKKKKKKKKKKKKKTAITKKTTTAKWKN 
*RTEKIRHTRLPLISSRDAKCVDFLYT 


2953 


16854 


A 


2973 


12 


400 


LNCRTPSLYLQRAGELLSVENPHIWCQK 
CVRKHNFFLFFFFFFKTDLYCPHICIAL 
SITCFIAISISGLFAEFLLNSFKYHFII 
VP*NFSLSIYLCLHSLFYVGFFFSFDMM 
I FS ILT* TLPPCLLFHL 


2954 


16855 


A 


2974 


125 


3 


RPRRPAAQVGVQ*KNLSSLQPVPPGFKL 
FFPLSLLSSWAS 


2955 


16856 


A 


2975 


2 


332 


RECTNYPAQPPPPPLEAEEGFQPLPPTD 
TG*I^CLPPLGGGQHRALGLGATWHLEN 
ACALVLADLGTGPKPSGPFSGQACGASG 
RLPLLSGPILPPLGQGHTLSAWGHHA 


2956 


16857 


A 


2976 


7 


329 


SRDILVMTEDGEFF*GHV*P*DRSRHAD 
rPRLGGHWPPTGITPLNPL*VPLLWTSV 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LLASGGSIT*THHRLIEYIRHQIIHALL 
ITILLGLYFTLLQASEYFESSFT 


2957 


16858 


A 


2977 


2 


339 


AHHI FTASI DVDTRAYFTSATI I lAI PT 
GDKAFS*LATLHGSNMK*SAAVL*ALGF 
IFLFTVRGLTGIVLADSSLDIVLHDTYY 
VGAHFHYGLSIGAVFAI IGGFIH* CPLF 


2958 


16859 


A 


2978 


394 


0 


QGCSEL* S CHCTPAWVTE * PPVS KKKKQ 
RK 


2959 


16860 


A 


2979 


149 


383 


PLCFPSQHWGKSSNCLSSLKYFPSSGLP 
LRCKAELDDVKQKADKELDWAHACNPS 
TLVGQGGRITLRSGV*DQPGQH 


2960 


16861 


A 


2980 


2 


378 


ARVS IGFLGLIVGAHDMFTGEMDLXTRT 
YCTFATILIAIHTGVPVFI*LATLHGCN 
MK*SAAVL*MiRFIPISTVCGLTGIVLP 
N*SLDIVLHYMYYAV7«Ii:iHYVIiSIGAVF 
AILGRFIHELPVF 


2961 


16862 


A 


2982 


2 


409 


paVaeaylkpVvddskgsflwgkpdldg 
ireccqhnfgwnrtrtdeslfpvlkqld 
aqqtqlridsfftlaqqekddakriksq 

RLNRAVTCMLRIE I EftASSE lEAVSVD* 
QKELELlDKRVACHIiEXMYETDP 


2962 


16863 


A 


2983 


3 


453 


hasahasahasgqrkqaapaekkcsaea 
qheglelrvenlc3avqtdfssdplqkw 
cfnhdntliatggtdgyvevwkvphlek 
vleskahdgeiedlvlvpegml* ivcpd 
hyspvgindfmwli.*ijsiwgnrlyfphis 
vylmfhfrtf 


2963 


16864 


A 


2984 


3 


417 


LILPGFGriSHIVTYYSGKKEGFGYICM 
V*AMISIGFLGLIVEAHHIFTVGIDVYS 
RAYFTSATIIIAIPTGVKVPS*LSTLHG 
SNMK*SAAV1*ALRFIFLFTVSGLTGIV 
lANSSUDIAIflDT YYV*THFHYVLS I 


2964 


16865 


A 


2985 


273 


408 


MKIFIF*VSPPLFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFLAVFTFI 


2965 


16866 




2986 


176 


335 


TFQPSELWRTSLLPRLECSGLIIAHCSL 
ELLGS S SPLfiSAS * lAGTTGS ILRY 


2966 


16867 


A 


2987 


65 


398 


KfCEFKIGRKAAEKMTHlIIlilNAFGPGTAN 
ECTVQWWFKKFCK6NKSLEDEECSGRPL 
EDDNDQWRAIIEftDPLTTTREVAEELSV 
^raS^aVQHLK*VGKVKIaU)IKWVPHELS 


2967 


16868 


A 


2988 


2 


158 


PGWSLTPDLR*STCLSLPKCWDYHHKHP 
SPTSRQTFDHAPNKNSSHSPPIYM 


2968 


16869 


A 


2989 


435 


23 


GWPPNPQKSFYFPQRLKIWGGGGRKRP 
PPKKKGFSQKTPEGF*KPPPKRRKKIFH 
DPGKKGPPKGl FKRGPPLFFFFYFFFFF 
FFFFFFFIITIECLHSRFPHRHHNKKFP 
PNPPS PRFWPQHLNTSLPNPKNKEP 


2969 


16870 


A 


2990 


410 


3 


GGRGHFFSGAFFIKFPWNKKGISQPLCF 
PRGGGS PPPLGPVEGGGPPCWGPLCHKG 
PVKKTGAPRGKNGISPFFCFPPLGSFNR 
SGFFWALLGSPPLFFFFFF*DRVSLHHP 
G WNAVAQ S * LTKTS TS WVQVI LL 


2970 


16871 


A 


2991 


371 


132 


KQSLPFKVKGFFFPPEVENAIYPDSHTA 
FYSW* KRSTFSKKKKKKKKKERKRKKEE 
RKEGRKEGRKERKKEGKKEIENA 


2971 


16872 


A 


2992 


336 


1 


CPRWSRTAGLK*STCFGLPKCWDNRHES 
PCPASSVLND*QLSTVQRAT*WQTKSLV 
LKAPPARGALPVSRAVSPPPSSHHFAYF 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














LHMI *QVSCLFLR* SLALVPQAGVQ*RD 


2972 


16873 


A 


2993 


317 


440 


HLGRLRQEHHLSPC3*LRKFFFFFGRDGV 
LMLSMLVFNSWTQVMLLPQPPQVLGLR 


2973 


16874 


A 


2994 


2 


181 


raHVGLDLI.TL*SACLGLPKCWDYRREP 
LCPALVILLTAKFTNFRYRVKNVCTFHV 

ESN 


2974 


16875 


A 


2995 


1 


155 


GFHCISRDGLDLLTS*YflRI.GLSNCWDY 
RHEPPSPAPLFISYSICLPLSKLIQFL 


2975 


16876 


A 


2996 


405 


2 


KKGFPIFSRMFFLYKHIEKPPLASQKFG 
DPKWS PHPRPIFFFLLKKGVLYVWREQF 
KBHPP*FPPPGPPKRWE*RVKPPHPPPF 
FFFFFLYVEMGSHCVAQAGLKLLGSSTP 
PTSASQGAEIAGVSHRAQPHAS 


2976 


16877 


A 


2997 


2 


415 


CLFTGGGLTGIGSAGSSLDIVLHDTYYV 
VAHFHYDLSIGAGFAIIGGLIH*FPLFS 
GYALDQTYAHIHFTIIFIGVNLTFLPQH 
FLQLSGMPRRCSD* PDAYTT* SMLSSVQ 
SVITLTAVIVLI IMIREAFDS KRTVL 


2977 


16878 


A 


2998 


349 


370 


HHFY*SITAFRPFQHMEERNFFFFFLFF 
FFFFFFFFVFLFIFLFLFYFLFLYEIYL 
ILFFI.*HHFYSAV 


2978 


16879 


A 


2999 


36 


139 


LCHCTPAWETFSKEKKKKEKRKKKKKKK 
KKKKKKKPGGGS * KTALSHDCATVIQPG 
RLFQKKKKKRKKGKKKKRKKKKKKKK 


2979 


16880 


A 


3000 


104 


373 


SADREXXSKTDNLLGH*TirarKCKVPRV 
IQSVFSSHSGLKLEINNRKMKEKSLNT* 
KINNISYWVQCSLYMLKSNSYTKSSDNT 
TTQYMY 


2980 


16881 


A 


3001 


1 


396 


LDCSKISSYLQKSSSHVLFFSFSFFFFF 
GGGGGFALENPYPPAGLGPQKKKTLSPP 
WLVGGPPFPEKAPP * GGHFLGKKPQKNP 
GGQNPKRPSGEGNPFFQPPGGEKKQIGP 
PTGFGGKPFFSF*PREPPGP 


2981 


16882 


A 


3002 


412 


104 


FFFFFLPVRQTFYPQPFSVFFPLFPFKF 
FFFP*AFNFFWGFFPIFSPPKFGFFSKI 
SRLVFFSLPFWEKFFFFLPPFFFAPLRF 


2982 


16883 


A 


3003 


400 


46 


LFFFFFYKLFPPPAFGGFFPPFPL*NFF 
FPPGPFFFLGGFPPFFPPPK*VFFPKIP 
PGFFFPPPF*KKFFFSPPPFFLPPPGFF 
FNPPPPFFFFFFFFFFFFFFFFFFLCW 

VQVEAY 


2983 


16884 
16885 


A 


3004 


411 


69 


YSPLPPLFFSSPPSKFPWPPFSLFFLTR 
VYKGFFFFPFFPFELFGPPGLSLGSKPP 
PVFLGGD FAFFS I PHPRVGS LPPPPNWP 
FIGPSFW*ALFPAPPLFFFFFLSSFFFF 
F 


2984 


3005 


3 


200 


DAWG*LFSTNHKDIGTLYLLFGA»AGVL 
GTALSLLIRAELGQPGNLLGNDHIYNVI 

VTAHAFVII 


2985 


16886 


A 


3006 


259 


146 


FFGLLNDILL 


2986 


16887 


A 


3007 


118 


414 


QNQPPQNKAIHWKIEKKEKPETKTVAK 
EHNKAKTAEKS EE * TKKEVKGGKQE KVN 
HTAAKVKEVQKTPSKPKEKEDNKKAAVS 
KHEQKDQYAFLRYM I 


2987 


16888 


A 


300S 


427 


111 


FFFFSPVGNFSPPQQFPFFFPPFPPKIF 
FFPPPL*FFWGGFPPFPPPPKKVFFPKS 
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nucleotide deletion, V=possible 
nucleotide insertion 














PPGFFFPPP*GKKFIFPPPP*FCPPPGF 
FLSPPPPFFFFFFFFFFFFFS 


2988 


16889 


A 


3009 


3 


302 


SLASCLSYL,VCVIFLjGQPKPTI *lqnst 
PHKK*NPTERYVKTCTQIFIALLFKKEK 
QPRCPSAGEWINKMLYACTIEYWLAIKR 
YEILIYATV*MYLEKI 


2989 


16890 


A 


3010 


1 


409 


RLHDATFP 1 1 EELI TFHDHRIjVI TFL I W 
LLVLCALFLTLTTKLTNTNISHAQE 1 ET 
V*TIlAAIILftLMVIiPSLRILYITDEDN 
DPSLTIKSIGHQWY*TYEYTDYGGLMFN 
SYILAPLFLEPGDLRLLDVDNRW 


2990 


16891 


A 


3011 


157 


2 


GRVDLKIQKLftRCGGSACLQSQHTQQNHL 
1IPGEKGCSES*LPPCPPD*VTKQ 


2991 


16892 


A 


3012 


2 


423 


ARAREflHIVTrySGKKEPFGYIGMG*AM 
IS IGFLSFIGRAHHIFT* * IDGHTRAYF 
TSATIIIAIPTGVKVFI*LATLHGSNMK 
*SAAVL*AIjGFIFLFTECHLTGIVLMJS 
SLYIVLHDTYYWAHFHYVLSIGAVFAI 


2992 


16893 


A 


3013 


2 


140 


ARANILLLTLTAQELWDPRftHNLI *LAY 
TLAFIVKKPLYGLHL*LPKftHVETPMDG 
PILIAAKLLKLGGSGl IRLTLILNPMTK 
HIAYPLLGLSL*GIIITRSICLRQTEIiK 
WLIAYS*ISHiaLVVTDILlQTP*SKHF 
TTNSHCPRTMGPQSQQLNMTSLHTSFYS 
KETSLRTPLMTP 


2993 


16894 


A 


3014 


2 


420 


PVLflAGISILLTDRDLLTTLFDPGGGGD 
PILYQHLF*FFGHPDDYII>1LPGFG1IS 
HIVTDYYGRECEPSGYIGMV*AMTPVGFL 
GFIE*AHHIFTVGinAHTEAYLTSVSIL 
IVIPTWRQVFS*LATLHGSNMT*YARLF 


2994 


16895 




3015 


6 


292 


AHHIFTV* IDVYTRAYFTSATIVIAIPT 
GVKVFS*LATIHGSNMK*SAAVL*ALGF 
I*LFTVGGLNGIV*SY*LLDIELHDTY* 
FVMGCPRKVYF 


2995 


16895 


A 


3016 


3 


422 


TPI I IGGFGN*LFPLI ISaPDMEFPRIS 
NISL*LLPPSILL]:.LASAIVEAGT*TD* 
TVYPPLAGNYSHPGASVDLTIFSLHLTG 
VSSILGAIHFITPI INIRPPAITQYQTP 
LCA*CDLMTAVLLLLSLPDLAAGITILL 


2996 


16897 




3017 


21 


490 


TPFPGRHLTMFSLHLAGGCS ILGAINYI 
TTLIN1RPPAITQYQTPLFV*SVLITAV 
LHLLSLPGLTAGVTILLflDQWLWTTFFD 
PAGGGDPILYQHLF*FFGHPEVYILILP 
GFGIISH1VTYYCGEKEPFGYIGMC*AM 
ISIGFLGIIV*AHHIF 


2997 


16898 


A 


3018 


98 


402 


lrsqhsksfqisgkpsqeewpqispdst 
DYI int*hfnaqmlkniy*hqphglhdf 

FPKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKftSSSSSKKDPPGGGPT 


2998 


16899 




3019 


399 


229 


PPPPGGGGPQGPPPPRGGFLPKSPGGVF 
YPPPRGGKIFSPPPGFFGPPRGFF*GAP 

p 


2999 


16900 


A 


3020 


2 


401 


SDAVl *ArjGFI YLVLEGGLTG IVI ADS F 
LDLELHDTYYVGAHFHYVLSIGAVFAI I 
GGFVD*FPLFSGYTLDQTYAENHFTIIL 
IGANVTFLPQHFLGLSGMPRRYSDYPDA 
YTT*NILSSVGSFITIiAAALL 


3000 


16901 


A 


3021 


413 


67 


PPPPGKIFFKKTPKKKIFPPPQF*IFFP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
Ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkiiown, *=Stop codon, /^possible 
nucleotide deletion, V^possible 
nucleotide insertion 














PLPPKKFFFSPNP*FFLGGFSPFFPPQK 
KIFFPKIPPNFFFSPPLKKKIFFFPPP* 
IFPPPRFFLKPPPPFFFFFFFFFFFFFF 
FFF 


3001 


16902 


A 


3022 


2 


332 


LTLSrLSIIAGS*GGLNQTQLRKILAYS 
SITHIG*IIAVLPYNPNITILNLTIYII 
LTTTAFLLLNLNSSTTTLLLSRT*NKLT 
*LTPLIPSTLLSLGGLPPLTGFLPKW 


3002 


16903 


A 


3023 


1 


412 


RGPPFFFFFCVFFFFFFFFFFFFNKGCR 
HSKRFFFFFKKKKKSSRPTRDRV*FYPK 
GWRSPLFLFFSPGGRGFFPLSHQVGFSir 
EVLWFKKNLELFRGSPARKKKKKKKHL 

E 


3003 


16904 


A 


3024 


279 


1 


LGRNTELWKSGKGMDlLEmCGKLANEP 
FRQPRVLGIGGEAPRAGSGPPSRftPPA* 
TPGPSSAGSWP* PPGTGEAPHGPAPSAP 
GARSPGRPG 


3004 


16905 


A 


3025 


151 


401 


KKPLGGPNLTGEGKKKFFSLKGGKKKPP 
GKFLKKTFFLGGEKMGKTPPKKLKP*GK 
KKIFKGKRGKKNPKTLAVKKFSKKKKK 


3003 


16906 


A 


3026 


416 


140 


YLSPLKKFFTPPPLRMFIiPPNPLKNI FF 
PPQLKI FWGGWAQNSPPPKKGFFSKNPK 
RVFLPPPIRKRYNFPPHGKILAPPKNL* 
SAPPPIFF 


3006 


16907 




3027 


379 


31 


PPPRRAGVFFFKNPKPKPPPPREGGRF* 
PFPPLKF*FFPKPQNFLGGGGAIPTPPP 
KRGLGQIPTERFNLSPPTQKRINFPPPG 
KGGPPPPLLKPPPPPPFFFFFFFFFFSL 
NSFI 


3007 


16908 


A 


3028 


28 


420 


MQQTTMAHIFLCajKFflNCAHVPRT*SET 
KPMSTPLQFD*TYKGEKSCKY7U3HERTW 
KQ*CVFSLYQIIPT*EKTWKafQCGTNF ' 
riQFFKQTTHL^NHTRENQICFSKIGLEY 
YYRITTRQHLLKHRTVCIL 


3008 


16909 


A 


3029 


1 


401 


LGNNGEAVSEKRKEKSQKEKSHNWVGF 
FFFFWGKPPFVPQAEGQGRNFC*PKPWP 
PGWGEFPGPAPRGGGNGRQKQPGRGNFG 
V*REKHGSMGGPRGA*NPGPKNPHPWPP 
QGPEITGGTTRPHPSGIFKKT 


3009 


16910 


— 


3030 


285 


397 


MFIKGDGLNKLRPGAflAYACNPSTLGG* 
AGRITRSGD 


3010 


16911 




3031 


265 


2 


KKESSCIKaTNSNSLFFLF*SVFFFFVF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFF*HFKSALLWVTDLGFIJSrSD 
PRV 


3011 


16912 


A 


3032 


410 


49 


GFSPPPP*KFFFSPKPLNFWGGGGPNFP 
PPKKEFFPKNPPGVFYSPPKKKKKFFSP 
PP *NLAPPKI FLKSPPPFFFFFFFFFFF 
FFLSMVSNGLTNMYILPCKDPSCPTTFP 

ILGSLISL 


3012 


16913 


A 


3033 


177 


2 


VTPPFFFFFETRSHSVARTGV* * SDRCS 
LQPHPPSIK*SSHFSLPSSWDYRSMPPH 


3013 


16914 


A 


3034 


75 


395 


MS YKHXXKKKKKKKKKKGGAP * KKPWGG 
PKLTRDGKKKFPSLKGAKKKP TWKFWKK 
TLILGGRBCMGTTPPKKLKPLRKKKIFKG 
* RGKKHP KSLP VE KFASRGR I KK 


3014 


16915 


A 


3035 


400 


83 


KKEGPARWSPVTPPLLEGPVGRSPQTR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/5J5,l 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

otide 

location 

correspon 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NF*PTPPTHRKSLPS*KSKIDWRGIPLY 
P PP PEVKP KKS FNP GNRRFC * TKI FP CP 
SPWAPKTHPPPQKKKKKEVYK 


3015 


16916 


A 


3036 


3 


196 


DSMPQT*NK3FAKARKKKKKKKKKKKKK 
KKKKKKKHGGPPLKKTLGGPKLPGGKKK 
NFFFFRGG 


3016 


16917 


A 


3037 


1 


411 


FCYDVCVESGCADYSIVIIMKKKKEK*K 
KKKKKKGGGPLKKNPGGAQNYPGVEKKI 
FSLKGGLKKTPRGNFEKKPYFGGGKNGA 
PPPQKNKPLGEKKKFKREKGGKKLQFPW 
GKKISLPGFYLKKIYPPGRGFFNFS 


3017 


16918 


A 


3038 


3 


429 


NFFFKKPRGGNFFPPPKKGFFSPPSPLK 
FFFFPPPFFFFGGGGPHFPPPPKRFFFQ 
KPPRGFFFP PL* EKNFFFPP PVFFAPP P 
VFFLTPPPP 


3018 


16919 


A 


3040 


1 


111 


IGLSGMPRRFSDyPnAYAT*NILSSVGS 
FMYLQQDNK 


3019 


16920 


A 


3041 


76 


967 


QliLKGGVSGVCPLLMFRCVRSPFLLVGS 
HSSLASGVKPQTFAVSVTVLKAARLELF 
I PPRGLWSLASGVKLQTFAVSVTAHKS 
SVDPKNSGAQLASPSGSHTRAAGGAACQ 
SRCRVPALLSPWWDGTGRRGAGGGiUlR 
GGSGRTGAHGVGGRLRHGGLHVPSPAPW 
KGS *GIi2«WRAQRRWAGTAGGPSTPSAA 
AGPGAKSLTALCEQGWPAAPSAGPTKPT 
PTRKSSWPASVARSPGSRSCLSLHTSLQ 
AEGVGSSLGQPSKGLPQCSGGAEGLLKC 
RQSGS PGRGGTESERGL*GLPQCSGGaE 
GLLKCRQSGSPGRGGTESERGL 


3020 


16921 


A 


3042 


39 


141 


LSIRGIil!IIIIKRQRL*DWIKQQDSTLCC 
P*EIH 


3021 


16922 


A 


3043 


2 


405 


LFSTOHIYIGTLYLLFSTRAGVLGTALS 
LLIPAKLGQPGNLLGNDHIYIIAIVTAHA 
FVI IFFIERPI IIGGYGN*IiGPLIIGAP 
DMAVPRINNITF* LLPPSLLLLLASAIE 
EAGAGTG*TVYPPLaSNYSHPG 


3022 


16923 


A 


3044 


3 


134 


HLNPGGR6CSEPRLHHCTPSWATE*DSV 
SKKKKSRKGWTGLFI 


3023 


16924 


A 


3045 


44 


187 


DPRVRQYQTPLFV*SGLIFAGLLLLSLP 
TLGAGITILLTDENLHTAVFDPDGGGDP 
ILYQHLF*FFGHPEVYILILPGFGI ICH 
I VTYYSSKKEPFSYIGMV*AMIS IGFLG 
FIAGAHHIFTGGIDVDTHAYFTSATIII 
AIPTGVKVFS*LRLIRPNLCRLTSPISP 
NPRGRHHYTTNRPQPPHRRV 


3024 


16925 


A 


3046 


434 


40 


, GAPPPPPGRFFFFLNPREDTFPPPPQKG 
GFSPPPPPKFFFSPPALFFFGGGWPNSP 
PPQKNFFF* KPPEFFFFPPFLKKKIFFP 
PPLFFPPPQIFF*TPPPLFFFFFFFFFF 
FFFFFFFRGCKINFIVRGF 


3025 


16926 


A 


3047 


1 


399 


LFTGGGLTGrVLTHSSLDIVLHDTY*W 
PHFHYGLSIGAGFAIIGGSIH*FPLFSG 
YTLDQTYAKIHFTIIFIGVKLTFFPQHF 
LGLSGMPRRYSDYPDAYTT*NILSSVGS 
LI SLTAfllLI IFMS *EAFASK 


3026 


16927 


A 


3048 


116 


367 


GASMILSSMIFLECIVGGFPVFSVYLFK 
LQILRQSSTMCFVLFCFFEIRSCSVTQA 
GV*RRGHGSL*PQPPGLSHPSSRDHGHV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

location 
correspon 

last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I^Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V--Valine, 
W=Tryptophan, Y=Tyrnsine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3027 


16928 


A 


3049 


6 


345 


SQLLRRLRKENCLNLaGGGYSHQR*HQC 
PLAWATE*DYLKKKKKNFSLFLKTGGFY 
PFLKKAPLGTTLSPKNKLPLSPLYKKRT 
GPWNWERKNFGKKKGGVIGLQGGKTIPK 


3028 


16929 


A 


3050 


261 


2 


NKKS PP'/NLWWIMGFFFKFAKRWLSWK 
GGGGFFFFSWRRSFTFVAQAGMQWRWLS 
SLQPRLPDLR*SACLGPPDC*DySREPQ 
YP 


3029 


16930 


A 


3051 


160 


2 


ICVDEQAGVQWRYLGSLQAPPPGLATLS 
CLSLMSSWECRQPPPLG* FFVCPR 


3030 


16931 


A 


3052 


89 


3 


Pr.TSGLIiLLT*QKIiAPISIIYQISPSLN 


3031 


16932 


A 


3053 


344 


2 


HVEAPIAGSIVIAAVLLKLGGYGIIRLT 
LIIJIPLTKHIAYPFLVLSL*GI I ITS SI 
CLRQTDLKSLIAYSSISHIALWTAILI 
QTP*SFTC3AVILIIAHGLTSSLLFCLftl!r 
SN 


3032 


16933 


A 


3054 


266 


2 


FQPP ISAYTKIS PSLNVSLLLTLS ILS I 
lAGS *GGLKQTQLRKILAYSSITHVG* I 
lAVLPYNPNITILNLTlYIILTTTAFLL 
LNIU 


3033 


16934 


A 


30,55 


3 


32 


KYWSLIMPTMIATITLIJJLYFYLSPLLY 
♦*SSCPP 


3034 


16935 


A 


3056 


3 


33 


KNNSLI IPTI lATI TLLNLYFYLS PLLY 
**SSSPPS 


3035 


16936 




3057 


57 


445 


ANVWAPHBPAKLTNKDNYHIWKSKRLKI 
ANMTIKKLNEVIGLTLPDFKTYVELVQ* 
RQNAID*RKHKQPVKQSPEA*PHSYSQL 
IEHWGAKANHGRKDSLFYK*CW*NWTIS 
SQKLNLHTDLTNFTKIN 


3036 


16937 


A 


3058 


311 


1 


RVGLLLKLNKISWPPPPFYGPS *EKEPL 
CFSQIGLFLTRTMVUSINIiHSPPVKTRPY 
KKIAPFRELFFFEKDRVS ICLPGWSAW 
^SQLTATSTSQASSDPGRV 


3037 


16938 


A 


3059 


138 


411 


WERPWKAQEAVFWI *VSAFWAPPPLMEK 
QI P PDLEQHYRNVPGVNRNQPFVS FFLR 
WSLTVAQAGVWWRDLGSL*PLLPGFKRF 
LCLSLLS 


3038 


16939 


A 


3060 


1 


189 


FCRVGQAGLKLLTS SDPPASAS *SAE IT 
GVSQRAWSKITILKSSSFSYFPKSCKMC 
FWLICLN 


3039 


16940 


A 


3061 


3 


406 


DAWADAWVLILPRFGRTSHIVTYYSGKK 
EPFAYIGMV*AMISHGFLGFIV*AHHIF 
TVGIDVNTRAYFTSATIIIAIPTGVKGF 
S*LATLHGSNiyiK*SAAVL*ALRFIFLLK 
KKKKRGAVLKVPWGGPSLPGCG 


3040 


16941 


A 


3062 


451 


82 


PPPTWYFSPPPAFLPGGGGPPRPPPPKK 
WSPPTPPPWIMPPPKKKKKFFPPPRGW 
GPPPKIF*KPPPFFFF*KKNPPFSPPGE 
NRGVFFSTKPPPPWGKKNFAAPGAPPPP 
PFFFFFPRGGG 


3041 


16942 


A 


3063 


430 


1 


FFPPKQLIFWGGGGPKSPPPKKKFFPKK 
PPGVFFSPPKKKKKFFFPPPLNLAPPKI 
FFKSPPPLSFLFFPFFFFFFFFFFFFFF 
FFFFPLSVQTLLKRTRAPPQPPLLD*EK 
APAPRVP*TGEGMPAV1WAFAPPPFYKE 


3042 


16943 


A 


3064 


1 


409 


PTRPRESTYQGHHTPPVQKGLRYGIILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
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SEQED 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HWPPTGITPLNPLEVPLLTT 


3043 


16944 


A 


3065 


1 


137 


HTFNFS I YQKATVI KTVWYWVWNRHTVE 
SPEINPYIYS*LIFFLF 


3044 


16945 




3066 




426 


KLEN*KMVI.KEIKEDLNKQTDILFS*LQ 
RLITVEMSILPKLIYKFSAIPIQIPA*F 
L* IKI I IKCMRKGK*TRIAETI FFFFFL 
SQSFILSPKIiDHRGGITANCTPPWAIKG 
KLIiLKKKKKPPKTKPEKPPIQKRTGAEG 
GVH 


3045 


16946 


A 


3067 


411 


187 


RNLPNVPPRPTHFVLLVKTGVSQVQQQG 
GKLLRSKNPPSPAPPKSWDYRGEPPRPA 
PRKFFF*LNKFKIYGGPKN 


3046 


16947 


A 


3068 


193 


471 


QCTCIKVHSGQKTGSTPLVIGELQIKIT 
LGCYYTPTLMAQIKKTDRTKCW*GYGAI 
GIMLILCWRECKIVQSL*KRVWQFII*LN 
lYLAIKLNI 


3047 


16948 


A 


3069 


323 


478 


FFXFXFLFFFILXYFYYF*KLFYLYLFX 
YIFKIYFIYNFIYI ILFYIIFI IF 


3048 


16949 


A 


3070 


1 


378 


GTRRFFFHSITLLILGLLSNTLTIYQ*W 
RDVTRESTYQGHHTPPCQKGLLNGIILF 
ITSEVFFFAGFF*AFyHSSLAPTPQLGG 
HWPPTGITPLNPLEIPLLNTCVLLASGV 
SIT*AHHSLIENNR 


3049 


16950 


A 


3071 


201 


2 


TTPIQLFLKHYHT*NFNYNFFFEIGSCS 
lAQAKVQWCDLCLLQPQTLGLKHSSHLS 
LQSSWDYEHA 


3050 


16951 


A 


3072 


261 


1 


EKAMGGGPFKVKKSPGEGPTTKGWPLKG 
PLEGGQRGLTGPFKSNGRLFFFFFGSNE 
VSLCCPG*SRTPDFK*SACLSLPKCWDY 
RHG 


3051 


16952 


A 


3073 


333 


52 


EIFKKKKKGGGRFFNKRVFPRPRVSNGR 
PRAQFFLETFFFFPERGFFFFFFFETES 
YSVTQAEV*WNYLGSQQPPRFKRFSHLS 
LPSSWI!JYRCK 


3052 


16953 


A 


3074 


7 


214 


SQLQENRLNPGGGGCGEPRSCHCTPVWA 
TE*DSVSKKKKKKKKRGGWV?PSLGGGP 
KKKNPFFNQEGGL 


3053 


16954 


A 


3075 


180 


419 


QKHLSILHYLFKRDVFFFFLKGSFWSQ 
VGGQGHNLG*LKAPPPRLTHFSCLTLRE 
TWKNRPRYFFCFFIKTGFHHVTRE 


3054 


16955 


A 


3076 


^371 


^08 


^KLSMGGSRL* S*PF*EAKAG*QLVSG^ 
GRGCSQL*SHQCTPAWVTE*RLVCKR 


3055 


16956 


A 


3077 


349 


3 


TFCWQKYTMCRSLCHPHPPRTWSTKKKR 
PPFQKGQGDAPPYKKVQRGNPPPPLKGR 
PSRGPPKKCKVFKAPVFLFPRFSPPPFF 
PPPLFFFFFF*DRVLFCCPGWSAVAQLT 


3056 


16957 


A 


3078 


124 


1 


ISSETPAVKSSFWPGVWHACNPSTLGR 
*GKEITRSRDRDH 


3057 


16958 


A 


3079 


205 


1 


CLVQNIWACVSHWRYIMSCKGC*RLGLM 
CVCVCVCVCVCVCVYFCCCCCCCYC*DE 
LSLCHSDCSAVA 


3058 


16959 


A 


3080 


3 


413 


GHHTQPVQKGLLYGIILFIT*EGFFFSG 
FF*AFYHSSLAPTPQLGGHWPPTGITPL 
NPL*VPLLNTSVLLASGVSIT*AHHRLI 
*NHHNQIIQaLLITILLGLYFTLLQASE 
YFETPFTISDGIYGSTFFVRTGFHR 


3059 


16960 


A 


3081 


1 


321 


NSLNPGDGGCS*PRLHYCTPAWGTELDS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQJD 
NO: of 

sequence 


M 
eth 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutaniic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
■p=Threonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ISKEKKYPYHVYHPSMKMTIYNYRQQNV 
NHIYIMLVEHSQTQENTCFMISGNFFFN 
LP I VLGEGEKNQHS I SFKLFFNF 


3060 


16961 


A 


3082 


117 


484 


VLKYFIDSEVNAVLFSISCSFVTDFVFL 
FFFFGKGVSFCPPAGIKGGGFGFLEPLA 
SGFKRIFFPNPLEKWE*RARPPPRGKFW 
NFFFFFFLKRELFFAPRWEGRGKI LVYL 
KGPPWGYPHF 


3061 


16962 


A 


3083 


381 


227 


CISRDGFYHLGQAGLELLTSSDPPALDS 
* SVGITGVSHRTRPLLLRLNVQF 


3062 


16963 


A 


3084 


2 


391 


SHAyHIG*PSR*PVTGALSDLLMTSGLA 
M* IDFHSITLLILCLLmTLSI YQ*WRD 
VTRESTYHGHHTPPVPKGLRYGIILFIT 
SEVFFFARFF*AFYHSSLAPTPQLRGHW 
PPTCITPRNTLDVPLLNT 


3063 


16964 


A 


3085 


2 


248 


IMRSGDRDHPG*HGETPSLLKYKRLAGH 
GGMRLWSQLLQKGGTADSHHHVLLILET 
FYSLRERRHLTSVPTLGMNYWAQDIR 


3064 


16965 


A 


3086 


276 


243 


EKWPD*SRAACPVLCRGNGQYSKGRCLC 
FSGWKGTECDVPTTQCIDPQCGGRGICI 
MGSCACNS6YKGESCEEAPRYIPEKE 


3065 


16966 


A 


3087 


3 


130 


GFYHVGQBGLELLTL*SACLSLPKCWDY 
RHEPPRPAHTPPHS 


3066 


16967 


A 


3088 


277 


2 


SSSSVFCLLVWTSSSSSSSAARLPPLTG 
FLPKWAI lEEFTKNHSLI IPTI lATITL 
LNLYFYLRLIYSTSITLLPISNHVKIK* 
QFEHTKP 


3067 


16968 


A 


3089 


404 


60 


FSHGKMRFFSPPSPKKIFFSPQSFYFLC3 
GGGAIMPPPKKRFFFKKTPRGFYFPP*K 
KKBTFFFPPPVKFGPPRGFLKSPPPFFFF 
FFFFPFFFFFFFFSPTFFTVFHLMLKSD 
ND 


3068 


16969 


A 


3090 


405 


186 


INKKPEAFTNTVDQMVLTNSHRTFYPTA 
TACSLSGAHRTFSRMDHV*DHKTSLNKF 
KTE ITLSTLSNHKLEP 


3069 


16970 


A 


3091 


371 


1 


SPPLVQKGLRYGIILFITSEVFFPAGFF 
* AFYHSStAPTPQLGGHWP PTGI TP INP 
LEVPLtNTSVLrASGVSIT*AHHSLIEN 


3070 


16971 


A 


3092 


263 


3 


NKRSPPVNLWWKMGSFFKFAKRVKISWK 
GGGGFFFFSWRRSFTFVAQAGMQWRNLS 
SLQPRLPDLR*SACLGPPDC*DYRREPQ 


3071 


16972 


A 


3093 


1 


392 


FFFADFKKMFILINHFKMELTTYFELKR 
NEATASENC*DAVKAVLGGKFIVLSTYI 
RKEERPRINNISFQIKHWTKKN*T*GKQ 

ILTGAWGPIKFLSFITRL 


3072 


16973 


A 


3094 


3 


367 


EM* IEITMRYHYNTARRLKF*KTDNIKC 
* *GHGTSGTLIYHWQE*KMVQPLWKIV* 

QLL 


3073 


16974 


A 


3095 


3 


257 


HEVSQDGLNLLTS*SARLGLPKCWDYRR 
E P P CLAWL I L PDD CV IFQKLKLLHHNLL 
NLSCIDVLMGIYSLSNFSQSNFPFFFFF 


3074 


16975 


A 


3096 


189 


414 


KGLVMVTS*CKMFFCLSIFFFFFERGFC 
FFAQAGVQGHNLSSLEPLPPQLKQFFCL 
TLPRS *KYRPAPPCPANFY 


3075 


16976 


A 


3097 


1 


299 


ENYRP IS FMWTDAKILWKILMQIQQCS 
KRITBJRDQVGFLPGMQGQFYI*KSIKSI 
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nucleotide 

location 
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residue of 
rSete 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniie Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stap codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














NAIHHI SRLKKRKHMIPS I EAKKAFNTH 
S*FMKKNKKLAFQET 


3076 


16977 


A 


3098 


362 


126 


FIHTHTHTHTHTHTHIYIYIYI lASFQW 
NAIRGRMKCINKP*KDMEELKCILPSER 
SQSAKATYLLYDSNyiTLEKAKL 


3077 


16978 


A 


3099 


3 


423 


RHEHAYHIVKPTP*PLTGALSAIiLMTSG 
LaM*FHFHSITLLlLGLLTNTLTIYQ*W 
RDVTRE STYQGRHTPPVQKGLRYGI I LF 
ITSKVFFFA6FF*DFYHSSLAPTPQLRG 
HMAPTGITPLNPLEVPLLYTCVLLASGV 


3078 


16979 


A 


3100 


392 


S3 


LRNTNCGHGAPFKNSPFFPQFLVKNS PA 

YFQNKEKDVGKPPFFFR/JDGAPFFKKK 
KK*GLALSSRLEYGGHISLELLGSSDTF 
ASAS RVART T GQCHHAGPT 


3079 


16980 


A 


3101 


81 


247 


GGWGGPPLKPPLF* KKTGKNFWPPFFKG 
KEKPPPNPRGGGKKGGPKPPPVIFFFF 


3080 


16981 


A 


3102 


3 


388 


HEKKEPF*YI6MG*AMISIGFLGFIV*A 
HHI FTVGIYVDTEAYFTSAS 1 1 lAIPTG 
GKVFS *LATLHG]SnsiMK*SAAVL*aLGFI 
FIFTESGLTGIVLSNSSLDIVLHDTCYL 
VAHLHYVLS IGAVFAI 


3081 


16982 


A 


3103 


3 


387 


HERHEELSNS*AKNLI*LAYTIAFIVKI 
PLyGLHL*LPKAHVEAPIAGSIVLAAVI. 
LKLGGYGI IRLTLl LNPLTKHIAYPFLV 
LSL*GIIITSSICLRQTDLKSLIAYSS1 
SHIALWTAILIQTP 


3082 


16983 


A 


3104 


3 


381 


heqshayhivkpsp*pltgali:allmts 
glamgfyfhfitllilglltytltiyq* 
mrdvsrestyqgqhtppvqkglrygiil 
fit*kafffaryf*afydfrlgptpqlr 

GHWPPTGITPIiNSL 


3083 


16984 


A 


3105 


412 


111 


FLGHPLPKKKMGGKKRGLPKMGV*HPPA 
PKGKPPPLKKKKKKPGGGGAPLYSPPSG 
GEGKKNPLTPEGGGPKNENSPPPPPP6G 
KKKNPPPLSPKKKKKK 


3084 


16985 


A 


3106 


298 


98 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 

FVCY 


3085 


16986 


A 


3107 


2 


417 


ATHVGLQDVTSPMIEQLITFHDIBILMNI 
LIICFLDLGALFLTLATKLTKTNM*YAQ 
E I*TD*TILPAI ILNLMALPS LRMLYI T 
DEGDDPSLTIKS IGHQWY* TYEYTDYGG 
I.IFNSYIRPPLFLEPGELRLLDVDNR 


3086 


16987 


A 


3108 


8 


430 


VGLVLFLSELPLNG6ILTFFHQGIYSPF 
PGGRTWALMVGSWGLVMfiSTDLLGPLCH 
AFTPATQLLLlirAVASPL*PAALRIGCH 
SKTTNALTHPLPRGTPTPPRPILVTIET 
MSLLIPPIAPDQRLAAGBTAEHLLLHLS 


3087 


16988 


A 


3109 


3 


453 


PRAIKFYRDWPGHERKRIAWKGDPCHMV 
LIKDEKGLMCQKKKKKTPFFWAPKIPLV 
FPPAQKNQGS*PNPPGGGGNPPLRPGP* 
RKKPPALMGPPPSSPGGINPKSFFYLNP 
GPAH*PRGEN*LGPVFENLFLPLLKKIF 
LWGELRCSQT 


3088 


16989 


A 


3110 


398 


2 


SLFQKWPNPLVG*KKKGKALPQGPPPLF 
PPLGGAGPGGSQGRGWGPPRPPWGNPFF 
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Predicted 
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nucleotide 
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ing to first 

stmino acid 

residue of 

peptide 

sequence 


Predict- 

otide 

location 
correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Ghitainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














F*NPKIWPAKGGAPFSPPSEKG*AGGFF 
LPRGGGFPLTQIPPPPPHLGGKSKPPFQ 
KKKKKNRQGTSWMMQGWGPQ 


3089 


16990 


A 


3111 


3 


371 


SSDPPTSAFQGSGTTDMCHHHHAQLI FN 
FFVETGSCYVAQAGVQWHDHGSLLSQTP 
GLR*SSGATVLNWAPALGPRRPDPTRME 
SLVLMKPRGSLIRSACPDCFLVFWFSFF 
HEAEGCASEC 


3090 


16991 


A 


3112 


233 


449 


FALFS*LATLHGSNMK*SAAGL*ALGFI 
FLFTAGGLTGIVLAMSSLDIGLHDTYYV 
EAHFHYGLS IGREFAI IGGF IH * FALFS 
GYTLDQTYPKIHFT 1 1 FIGGNI TFLPQH 
FLGLSGMPRRYSDYPDAyTT*NILSSGG 
SFNALNAGSIT 


3091 


16992 


A 


3113 


458 


2 


RGPPPPPPPKFFFF*TPGKNSPPPPPEG 
VFFPPSPPPNFFFSPPPLFFFGGVPPIS 
PPPPKSFSPQTPPRFFFPPPPLKKWFFS 
PPPLFLPPPPFFFYPPPPFFFFFFFFFF 
FFFFFFFFFFCGDLEGELPGTGMLACVl 
LLRANRKARTRG 


3092 


16993 


A 


3114 


1 


418 


LNTTFFyPDGGGDPNLYQEJLY*P*3HPD 
PDIHILPRPGIRSHIDTDYSGKKEPYAy 
VGMGWAMTSIGFLWLMVRARPLFTVGVG 
VDAQRYSSFASITIAIPTGAEVFS*LCP 
LPLSGMK*TSAAVWALGLRFI FTCSGR 


3093 


16994 


A 


3115 


1 


425 


PRINNISF*LLPPSLLLLLAYAIVEA6A 
GT3*TAyPPIiaGNYSHPGASGNLTIFSL 
HliAGGSSILGAINFITTIINIKPPAITQ 
DQTPLFV+SVLITEDLLFLSLPGLaflGI 
TILLTDLNLSTTFFDPAGGGDPILYQHL 
F*FFDPAGGGDPILYQHLF 


3094 


16995 


A 


3116 


2 


383 


GLSCTNHKDMGRLYLLLGARAGVLSTAL 
SLLIRAELGQPGYLLGNnHIYlIDIVTAH 
AFVIIFFIVIP'miGGFGI!I*LGPLIIGA 
PDTAVPRINNISI*LLPPCLLLRLACAI 
EEAGaGTG*TVYPPL 


3095 1 16996 

I 


A 


3117 


1 


259 


PTRPALVrrAILIQTP*SFTGAVILIIA 
HGLTSSLLFCLANSITYERTHSRIIILSQ 
GLQTLLPLIAF**LLASLaNLPAPTPTP 
HQ 


3096 


16997 


A 


3118 


3 


174 


LIRGQRGCHELRSRHCTPAWATRARTLS 
QKK*KTTNPKKKLCLIFPGGKKKKLKKG 


3097 


16998 


A 


3119 


155 




PDFFNKSMDKKKTARGWEDSSSSFCFFK 
RDRVLLCCPGWSAVAQS*LTAAS 


3098 


16999 


A 


3120 


420 


2 


PPPKFFFPPTPPFLGEGGAKTPPPKKIF 
FI.KKPPGVFFFPPLKKKKFFFSPP*FLA 

PPKIFFKRPPPFFFFFFFFFFFFFFFFF 
FFFFFFFFFLVFIVLFQVKVHFLKKCFN 
IQFPLASDNS*PSMIHEKFYCESNIEF 


3099 


17000 


A 


3121 


48 


387 


RDPVLQKKEKKKKKKPKNQKKKKKKGGP 
F*KIP*GAKIKPGKEKKNFSPKRGAKKK 
NPGNFE KKTNFGGGKKWGKPPPKN* RFK 
GKKKFLKGKGGKKTQIPWGLKIFFNGFD 


3100 


17001 


A 


3122 


237 


2 


PFPWLPPFPLKflSSSP*SL* FLLGGWP 
NLPSSPNKGSFPKFPSWFLFRPP*GKNF 
YLALPR*PWPPQGFL*TAPP 


3101 


17002 


A 


3123 


448 


130 


PPRFFPEFFYSGPPKPPFFKTPVFLGVK 
PGVFFFSPYQKKPTNFGPKMGAL*RrPL 
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nucleotide 
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sequence 


M 
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od 


SEQ ID 
NO: in 
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09/515,1 
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Predicted 

location 
correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D--Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=JLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FETPIWVFPIKGFHKKKPPVLN*PPTRK 
PPPDKILKKKKKDDCISRLARN 


3102 


17003 


A 


3124 


453 


3 


SSAREGGGVPPPPPKIIFFPPPPIFSWG 
GGGTKSPPPEREVFPKNHGGCFFSPP*K 
RGKYFSPPPRMGPPPGVFFKGPPPNFFF 
FFPFFFFFFFIFFFFFFFFFFTKKKMWF 
FFKAFRMSPKPVPPFFFFCNYRWFRPR 
VRPRVRPRV 


3103 


17004 


A 


3125 


443 


2 


YFPPFGRVIWGDSLGAGVLNPPRPHKGT 
PFFPKKIFSIRPGWGGGPPLPPPQRQMW 
GGPPPPLFGLFRPQGKIPFPPKKKIKPP 
GGVCVGVGVCV* KKKKNPFP P FLWAGLK 
EPCFFFFFFFEMEFCSCCPGWSAMARSQ 
LNCNLHL 


3104 


17005 


A 


3126 


3 


186 


PVIYSTIFAGTLITflLSSH*FFT*VGLE 
GGGL 














3105 


17006 


A 


3127 


171 


38 


FFFFFFFFFFFFFWSERSS*VA 


3106 


17007 


A 


3128 


16 


189 


I L GE V I W V * * f !■ * F I KKK ECKKKKKKKKK 
KKKKKRGAPFKKTPGGPQITPGEKKKIF 
ST. 


3107 


17008 


A 


3129 


401 


85 


LVNFFSPQEKRGFFPPPPPKNFFFSPRG 
FFFLGGGGPIFPPPKKSFFSKNPPGVFF 
SPP*KKKIFFFPPPLFWAPPRFFFKGPP 
PFFFFFFFFFFFFFFFFFSQF 


3108 


17009 


A 


3130 


2 


312 


ANNLI*]:iAYTIAFIVKIPLYGLHL*LPK 
AHVEAP lAGS IVLAaVLLKLGGYG I IRL 
TLILNPLTKHIAYPFLVLSL*GGGVF 


3109 


17010 


A 


3131 


449 


3 


FFFFFFFLGGPPEI FFFFFCPPKKPKPP 
LQGGKKKPPFF*NFPQKPLGVLGaVPPPP 
LCFFIKKKKGGGKKFFSPPLF*KGGPFK ' 
KFFFFPPPPKGPPPFLHSTLRGWVFFKPP 
PK3CKALSFFKKKKKKKKKKKKKKRAAAR 
DLELADAW 


3110 


17011 


A 


3132 


95 


448 


VINRE*KCVC*MKKKKKKKKKKKKKKKK 
KKKERAGKGGG 


3111 


17012 


A 


3133 


60 


442 


LGGFFFFGGKKGFCLWCPRWGAKftG I PV 
NGTPPRGV*ENFLAQPP*EGGITGPPPL 
PQ*F6FLREN6VPLRGPGGFEPPILGEP 
PPLPPQKGGKNGRNPPPPLKGFLVLFKK 
EFSSLVPSWKAEGDP 


3112 


17013 


A 


3134 


236 


45 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 














FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFGaaD*VILFRYFQ 

FEATIMGV 


3113 


17014 


A 


3135 


441 


88 


KQQTPPGLIFF*KAPREEIFLPPPIMVF 
FSPPSPFKFFFFLSPFIFFGGVLPFFPP 
PKKGFFFKNPRRVFFCPPLKKKNFFFLP 
PFFFGPPRVFFYPPPP IFFFFFFFFFFF 


3114 


17015 


A 


3136 


2 


328 


TMLSPKPQQLNQQNCSPEHYEPQLKTQR 
TWR*KKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKRGGGQKKKMVGGEKKKPG* KI FF 
FF I KKVKKKTALGDKiCKTQFWGGS 


3115 


17016 


A 


3137 


281 


2 


KATKSGTPIPSQGQQSLAWSWAGIGSAQ 
P PALLHS * PIGKI FKNOTPVGRKS PQLP 
RNTSWQLGAVAHPSNSSTLGGRGGRI TR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Li=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, V=Tyrasine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














SGVRDRTRG 


3116 


17017 


A 


3138 


448 


3 


FFFLPPS PFFPY* KRGFSLFGRWFNFP 
PPVFPPPWPSQIFGFQALIFLPPPPPLF 
FSFPVGFFQTALFPFNGFFPGFFHGLFP 
LLRFCPRK*VFWGGFPFFFFFFLRDKVS 
LCaPGWNAVAQSEFTTALTSKAQAPTRP 
PTRPPTRP 


3117 


17018 


A 


3139 


3 


436 


DR*LFSTNHKEIGTLYLLFGA*AGVLST 
ALSLLILAELGQPGNLLGKDHIYNVIVT 
AHALGKIFFIAIPI I IGGFGN*IiAPLI I 
GAPEMAIPRIHNIS*GLLPPSILLLrAS 
AIEEAQAGTG*TDYPPLAGNYCHP6ASG 
DLIIF 


3118 


17019 


A 


3140 


362 


2 


KPRRGKFFPPREGGGGFPPPPPKNFFFP 

KGGKFLGGGGGKNSPPQKKGFFQIOirPGG 

VFFPPPKKKKNIFFPPGKMGAPPGFF*R 
GPPPFFFFFFFFFFFFFFFFFGQSGQVK 
LKSPKCKL 


3119 


17020 


A 


3141 


440 


102 


PTPPPCCKFSFKRPPKKTLFFPTTNLVF 
FSPIPP*NFFFSPQALIFVGFIAPIFPP 
QKKFFFSKFPPLFFITPPLIKKFFVPPP 
















3120 


17021 


A 


3142 


1 


79 


FKLDYFSIIFIPVALFVTWSIIEFSL*Y 
INSDPNINQF*KLDYFSIIFIPVALFVT 
WSIIEFSn 


3121 


17022 


A 


3143 


3 


441 


FFFPPPLKKKKFFPPPPNIGPPPKSL*K 
PPPPPFFFFF 


3122 


17023 


A 


3144 


223 


3 


LPYWKLPYLKH***LQDTNQESRG*HFL 
RPRPFKNQMKSGTVAHACNPSTLGGRGG 
RrTRSGV*DHPGQHGEI 


3123 


17024 


A 


3145 


65 


414 






PKKKTRGGPC3NPPPKKKKFPPQKGGKKK 
PPLGF*KKTPPLGGEKIPPPPPKKNTPP 
KICKKKF*GGGGQTPPPPPPGKKFSPPRN 

KKiCK 


3124 


17025 


A 


3146 


410 


31 


RANQPCAFRGKPLCDLAVGKNLSS RTQIA 
LTI*KWINWTILKLRTSGH*KTPIKTIK 
RYPIEGEKISDEELYLQY*KVL* IGKKK 
PDNPVEKWANDLKRSFMKGNILTVFKGM 
QR*LGSMAHICSLS 


3125 


17026 


A 


3147 


182 


241 


SHPSHHSTINITNKGLL*TPLPIPNPLV 
NMLGLL F I LATS S LAWS I L * S GGASN 
SNYALIGALRAVAQTISYEVTLAIILLS 
TLLIRGSFNL 


3126 


17027 


A 


3148 


254 


1 


KTKKGLKI KDPLTRF * ISVC* ITKS I * F 
FKLLPFFPLLKEHIPLKYLFFFFFLFET 
EFRSCCPGWSTMV*SLLTATSTSQIQAI 


3127 


17028 


A 


3149 


T99 


2 


NTPPAAGGGCFFFFFFGEKNFPPPHPTP 
RFF PP P PLKNF FF F S RVFF FWGGGAQKA 
PPPKKVFF* KIPRGFFFPPPKKKKI FFF 
SPGFFGAPPGFFLSGPPSFFFFFFFFFF 
FFFFFFFFFFFFFFFFLINFLLSLQGFF 
LVSKELFLFGLTVKFTRGFRGFCGQ 


3128 


17029 


A 


3150 


256 


146 


KNAKVTQVGPEFNKGPG*HTHTHTHSHT 
HTHTHTLQ 


3129 


17030 


A 


3151 


333 


1 


TISCLCTRGEHPLSPRRAGPYTGSPLHC 
CVDWDKVFSSWKDLTDWPLGDLDIEYF 
TDGSS FILRGVCRAGYAAVTLDSAVEVL 
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NO: of 
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SEQ ID 
NO: of 
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sequence 


M 
etii 
od 


SEQID 
NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspon 

last amino 
acid 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleuclne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=iyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SVSAETSA*KAELIALTRALWLGKEQK 


3130 


17031 


A 


3152 


43 


379 


NKTSLFGWDYIWEWGAPEPETPPKRAAG 
AlfflSLAQPFSVALPPCFFDPCAPSLSPG 
♦HALRPLPLPGIASEIQTAPSWHVPPKS 
LSPAPQPCPIPTLVPVGYKTPP 


3131 


17032 


A 


3153 


398 


12 


NTTPGGGKFFLKKTREEKFFPPKKKRGF 

FPPPPPKNFFFPQGGKFFGGGGGQISPP 

QKKGFFQI^NPRGVFFTPPKKKKIFFSPP 
GK^^GAPPGFF-RGPPPFFFFFFFFFFFF 
FFFFFFLKKSVJRPLAI 


3132 


17033 




3154 


371 


105 


SPSPQVHFIKGPKPPPPK*IL*RAPNPP 
LPKKKFSNPPTWGPGPQPPPSKFSKFAE 
FPFLPPPFPFKKEPPKKKIFFPTKEGTV 
I*KNPPFSGFQSPDSIK 












371 


DVGADPILHTSTGL* LAMQY* PEA* TAF 
SSIAHIT*DVYYG*VIRYliHaNGA* IFF 
iajLLHIGRGLYYRPFLYSKT*KIG]:jIL 
LLATITTAFIGYVLP*GPI*F*GATVMT 
NLLSAIPYIGT 


3134 


17035 


A 


3156 




398 


IAIPTGVKVFS*IiATLHGSNiMK*SAaVL 
*AIiRYIFLFTEGGLTGIVLPKSSIJTIVL 
HDPYYWflHFHYVLSIGaGFAIIGGFIH 
* FPIFSGYTLDQTYAI IHFTI FIGVHLT 
FFPQHFLGLSGMPRRYSDYP 


3135 


17036 


A 


3157 


401 


46 


PPGGRNFFKKTPGKKI FSTKKKKGFFPP 
LPPKWFFFSPGGFFFGGGGGPNFPPPKK 
GFFSKNPRGVFFSPP*KKKIFFFPPG*I 
WAPPRVFLKGPPPFFFFFFFFFFFFFFG 
KKSVHA 


3136 


17037 


A 


3158 


404 


18 


FFSPPPPPPGGGVFPPNPKKNPFPPPPP 
PFFLGGGPPPPPPPPP*FPPPPTPPNVF 
FFSPPKKKNFFPPPPGPPPPPPKPPPPP 














PPPFFFFFFFFFFFFFFFFFFFLFFFEP 
MEKGPRPGDIGSNPPS 


3137 


17038 


A 


3159 


3 


192 


SLVIAGCPR*NLSSTLNLPTEPSKSPCK 
FNC * KKKKKKKKKKKKKKKKRGGALKKN 
PWGGKK 


3138 


17039 


A 


3160 


205 


80 


VQRDKFGFLQPSPSGVKLFFCLSLPNKW 
DFRCGPPNPG*FFS 


3139 


17040 


A 


3161 


3 


384 


LIVPTIILLPLT* LSKKHI I* I INTTTHS 
LIISIIPLLFFNQIKNNLFSCSPTFSSD 
HLSHP ILKKKKKKKKKKKKKKKKKGGGA 
FKRTPGGAAHWGGGGRETFFPKGGEKKN 
RPGVFWKQTFFWGGK 


3140 


17041 


A 


3162 


348 


70 


GPPPKKRVFSKTPKWLNKPPQKKKKFI 
FPPPVULGPPKNFLKGPPPFFFFFFFFF 
FFFFFYFFWMGCDRiCSWRHSSPPRLSG 
TPKCSPSVT 


3141 


17042 


A 


3163 


2 


353 


LKTIPLTSTCLTIGSLAIjAGIPFLTGFY 
SEDHII*TANVLYTNA*ALSITLIATSL 
TSAYSTRI I LLTLTGQPRFPTLTNI ETK 
KKKGGPFNRYPLGAQVYGGGQNEKFFLI 
GREII 


3142 


17043 


A 


3164 




221 


PTRPRDCSELRSCHCTPAWATEGDS ISR 
KKKKLSTRTAF*YTEAINSLIYSLNIGT 
FKTYFPKI KTYDRHFDF 


3143 


17044 


A 


3165 


26 


383 


I PFY Q * S LI * YTRKKKKKKKKKKKKKKK 
KKKKRGG 
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beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 
acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possibIe 
nucleotide insertion 


3144 


17045 


A 


3166 


251 


381 


GNLCAGWARWLTPWPALWEAEAD*SRG 
QEIGAIIANXVKPHL 


3145 


17046 


A 


3167 


391 


1,5 


LFKKISPHAGIWGFFSPLTP*NFFFSLE 
PFIFGRGLAPIFPPPK*RFLSKMPPWF 
IPPPLMGKPFPPPPPVELGPPIYSFKGA 
PPFFFFFF 


3146 


17047 


A 


3168 


94 


389 


SPGILGQKGQIGPIGNHVPGGIAAPVTP 
* FRFKPRIjP*GFGPKASPPIALKPERAQ 
VGGTPPPGPEGPNGQPPPFKENQPGLGF 
RFRGKLAEKRGFHL 


3147 


17048 


A 


3169 


3 


363 


WATALQPGQQSETQSQKKKKKKKKKKRG 
GPPFFFFFKKKIFFSPPGRKNKKGKFF* 
KPPKKKKKIFFFPPPPLKKKKKKKKIOCK 
KKKKKKMGGAFLKKPRGAPPPPGKRKLI 
FFFLKGV 


3148 


17049 


A 


3170 


2 


171 


KEPLGYIRMV*AMISIGFLGGIV*AEIHI 
FTVGIDVDTRAYFTSATIIIVIPTGVKV 
FS*LATLHGS]SIMK*SARVL*ALGFIFLF 
TVGGLTGIVLTNSSLDIVLHDTYYWAH 

fhyvij:.igg*fyuiyhkihcypeirrqs I 


3149 


1705Q 


A 


3171 


390 


27 


qslxvkspypwiliktkbhh*vmnfigl 
tryqsllcenphirsevcitijtpppllp 
vsespvkhscvqvldsvyssgpnl*dhp 
♦tsvdwelyvdgispsnpckvslkkmts 
papvtprs 


3150 


17051 


A 


3172 


127 


310 


KNPGGAKILRGGERKNFFLKRGGKKKHL 
GIFGKKTFFWGGKKWAKPPK*WRFFSPF 
SP*KFFFSLKALIFFGGFCPFFSPPKKS 
FFSKNSQVFFFSPPFKEKIFPFPPP*NF 
GPPRVFLKGPPPFFFLGVFFFFFFF 


3151 


17052 


A 


3173 


376 


3 


FFFFFIiRHQSLAVTLAGV*RCNLSSLQS 
PPPGIK 


3152 


17053 


A 


3174 


365 


14 


REaiFFSPEGGAPKNKPPPPPPPPGGKKK 
IFFQKKKKKKIFYPWKNFPPPPKKICKKN 
P*KPGPLKSQGFFFFKKNQNLPWGPPQ1 
SFPKKKKKKKAKTVQERKYNSNTQLVSA 
ETQLL 


3153 


17054 


A 


3175 


1 


377 


VPLHSSLGGKAHLHLRKKKKKNPGFLKN 
FGPLALLGMGVGNI KGLKGQKGKNPAFG 
AHTGGGCFSLRGRNTPFPKRAEGIICYN 
SPH*KET*BCALEPRGYKGL*QALftLPNL 
KSGKMEHILRGAP 


3154 


17055 


A 


3176 


352 


125 


GHEVLDSSDLPASASQSAGITGVSHHAL 
*EILLSMFETTWACDLLFQNISFIKSSI 
PCFIGLDFIMPHRYCRFFF 


3155 


17056 


A 


3177 


139 


366 


TAHTS*GY*VKNYINLSFCFFFFFLERN 
HSALQPGGQGHNRS SLQPWFPGLKQFS C 
LSL PRSWDNGLVPQHLVNFF 


3156 


17057 


A 


3178 


206 


1 


KGTLFKKDFFFKNFPKKVFLGVPKNSLF 
*KILTLPPVLNPYPFFFFFFFETVSLCH 
PGWNAWPRLEP 


3157 


17058 


A 


3179 


330 




IVSTLETCYIAYKEEEKDTFITLRIYVI 
GGN6KFLGI * I KQHI KKI IHHDQVGPI P 
GMQGWFNICRSINVIHHISRIKGKSRAQ 
WLTPVI PALWEGDAGGS PEVRSS RPA 


3158 


17059 


A 


3180 


3 


399 


HASAPLQSSLGYRARPCFKKKKKKKKGG 
GGGGFFFPKGGFGPLPKKGFFSGKGGLG 
KWGLGGAGKTPGIKKPLGKGPPKKRGGK 
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Predicted 

nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=iVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KGQKGPHF*KKKKKKGGKKLKKGPF* KP 
FKKPIiGRGVQP*NPHLLGG 


3159 


17060 


A 


3181 




204 


CPTACPFW*NKELLMPKKKKKKKKKKKK 
KKKKKKKKKRGGAPFKNSLGGPHFPGAG 
KKNFFFFWGGY 


3160 


17061 


A 


3182 


71 


377 


PKRGGQPKQKKIWGPPPPGAPPQKGMGF 
FNPRGFKKQITFFPPPPPPPEKKPPFF* 
KKKKKKKKSNCSD 


3161 


17062 


A 


3183 


151 


2 


FFFLEELVPLLLKLFQKIEEEGCSPNSF 
NEAF I ILIPKPFR* TTKKENF 


3162 


17063 


A 


3184 


152 


387 


yFQGFIYWaCICTStLFFFFFLKREFC 
FVTQVEVQGPNFT* LNPPLFGLKKFFCL 
TLQIGWHNRPLPPPQVIFCFFK 


3163 


17064 


A 


3185 


22 


336 


YEKCTALLQMVSSFIWMEREGTHQYSFT 
RKDFSIiASKVHIVSYYLSPIVE*FFFFL 
RGS PFAPQAGGQGPNLGSWKPLPPGLMP 
FSCLTLPGGWNCRPPPPGPVN 


3164 


17065 


A 


3186 


344 


1 


WVLKKIFFYPGRGGPPFIPPPLGGQGGP 

IPWARGF* PPRGPPPKNGF* KKKKKRGG 

GGPPPGl'PPPGGPRGGVJJi'FLGGGGPRK 
PKKITKKKNPGEKKKTSFICNQKRKTKIK 
TT 


3165 


17066 


A 


3187 


296 


■ 


NPKKILTLPKKTKVYKCEGENQVPI I FQ 
GIKNIFWKGIF*PKKEREVCV*SMRHVI 
PVFPKKRGSKRSNKSCCYKDTCTRMFIV 
ALFTIGKTWKQPKY 


3166 


17067 


A 


3188 


2134 


1 


GVAflHACNASILGGQGGRII*GWEFETS 
LflNMVKPC 


3167 


17068 


A 


3189 


1 


159 


LQDHPG*HGEPPSIVKIQKIARHGSLRL 
*S * LLGRLRQRMRQETCLNPGJiR 


3168 


17069 


A 


3190 


119 


340 


QIKKHRLVSflRGKNNKHK*IYKP*VDIF 
FKEDIQRUM3KQMKRDLISLI IREIQIKT 
TITCYLIHARMGTITRD 


3169 


17070 


A 


3191 


75 


1 


LSVIBSIFWPGTVAHACNPSTLGG»GG 


3170 


17071 


A 


3192 


343 


1 


IFlLGeGPeCSPVFfQtTGdGGGGfFLf' 
QNFFPPRGKIFRPFFF*KKKLKHPISIWGF 
F*NFnSIPPLGFFNLFFFFFKKKPKNFFF 

EC 


3171 


17072 


A 


3193 


105 


368 


KFKDPPFPPPFFWPPPKQLPPPPGKIGA 
PHF*TPKGPPPPKKKKC*KKKILKGGRG 
KKKKKTPPKHPQKIWGPSKKKNPWGGGK 
TPPL 


3172 


17073 




3194 


1 


365 


FCRDEVLLFCPDWS PTPGLKQFSHLCLP 
KCWSYRC*PPYPAQGVFLK*HLTKSLSC 
LKLFPIASLCLQDKVPAPQPCVKGLSKFF 
LCHLLSTLIPFTHSLFFLFLGWFLRQHR 
SVTQAGVQWR 


3173 


17074 


A 


3195 


1 


362 


GNQPKELNAGTYLLFYTLEGSVPLLIAL 
MYTHKTLWSLNILLLTLTGQELSNC*AN 
NLI * LAYTRAL IVMI PLYGLHL* LPKAH' 
VEAPIAGSI*LAAVLLKIjGGYGIIRLTL 
ILNPLTNT 


3174 


17075 


A 


3196 


137 


3 


KGQPRFPPISLKKGSQGKKGFFFFFF*Y 
RIPLCHPGWCPWQSW 


3175 


17076 


A 


3197 


1 


266 


EGRGCSEL*SCHCTPA*VTVRYPVSKKK 
KKKKKKGGGKGKKKGGEKNTLFGPKKGK 
LRGPQKRGKKIGPEKKVGNNLKKGI FFR 
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od 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing fo first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
lartaniino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidinc, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Thieonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















3176 


17077 


A 


3198 


217 


446 


YWSEHYAMTQVLEGFSYSLQDHFYFCFR 
SIRRIIFYSLIKPS IND*GERELEPITT 
SQALQIAGRAGRFS SRFKEG 


3177 


17078 


A 


3199 


345 


314 


QPGPEGKIRFFLKI PNLTPSGGKSLKFP 
LFKRVKPENCLSLRG*GCN* PI 


3178 


17079 


A 


3200 


2 


330 


SRHYTPAH*QSKTLSPKKKKKKKKKKKK 
PPFPKNTQKKPKKKKRGGLNWGVKTPPP 
LKTQKMGISPGKKLKKNLFPLKKKKPEG 
NPLF*FRGF*KKKKAPFKKKKNPPK 


3179 


17080 


A 


3201 




385 


FPPFLGGPQIPKFLKFFLRAN*NLFGLF 

i'LGGVLKKIi'LRKPi'SLWIKyPPTPFKG 

kkiffktffqkpl?ff*b;kkf?fcfppf 
fflsrgffvffspkkpffffffffffl 


3180 
"3181 


17081 


A 


3202 


355 


2 


FFFFFSHTESCSVAQAGVQWHYLGSG*A 
LPPRFTP 


17082 


A 


3203 


156 




LKSLLWEAKVGGLLEVRSSRPA*ATWRD 
FISTKN*KISQVWWYILVVVTTWE 


3182 


17083 


A 


3204 


352 


175 


QPRGRPAPAHPP*CPLRji:ALPC*CPCPA 
CCPPWAEASPSGVQASPaRAPACPAEAL 

USIE 


3183 


17084 


A 


3205 


280 


373 


QRGTRIFSDIiQT*KKKNKSPFKILLLID 
NALGHSRVLTERYKDIHRPANTTSFLRP 
MVQELIPTFQS 


3184 


17085 


A 


3206 


1 


367 


EIESIQIDGHTKNKFIiGIHLTYLTKEVK 
DLCKKNYKTLLKEIIDDTEiarDMLMD*N 
NIVKMAILPKAIYR5DTISTKLLMSFFM 
ELEKIF*NLy*KA*MAETTlSKKNKAGG 
ITLLDFKLYY 


3185 


17086 


A 


3207 


368 


12 


FAQKKKKKKKKKKKKKKKKKKRMPQN* 
RAPQKPLKPPPRVFLIPPPLGSPPPPAF 
FWRGGGPPPGPFSKKKKKMRLTGGGRLF 
LLA 


3186 


17087 


A 


3208 


2 


356 


KYLFSS I PECSKEICMKGIflllLtTSJt:! tSSS 
CPSLARDLDIQKTRHRNPYNLKKSSPQH 
I IVKLSKVKDKERILKTaRKECFITYKR 
TPIRPGMVAYACNPSTLGG*GGWIMRSG 

DQDHPG 


3187 


17088 


A 


3209 


359 


170 


INIKQLPRCGGTCLWSQLSRRLRRQDCY 
SLGGQGCSEL * SS QCS PAWATERDS VS K 
YKINWFL 


3188 


17089 


A 


3210 


295 


122 


VIKTV*NWNKDRHIDQCSRIESPKIHLH 
IDDQLIFDKDVKTPE*RNWGLFNKWCWE 


3189 


17090 


A 


32U 


218 


466 


ATFDTTLPSCYQNAPNHKFDDLSD*EQE 
IDTMTVNI ILPLRSLNIVITWPYNI *HH 
QHDLDYTYPDTTGQLVNCAYFINLLQP 


3190 


17091 


A 


3212 


340 


355 


LGLQG*REGNFGSLQTSPPGFKEFSGLS 
LLSSWDYRV 


3191 


17092 


A 


3213 


254 


40 


WPGTMaHACNLSTLGG*GGCITRPGDHD 
HPG*Hlin!raN 


3192 


17093 


A 


3214 


198 


25 


YNEKEKPVKMLKI lANSLGAVAHAYNPS 
TLGGQGGRIMKSGDRDHPG*HaDAWVET 


3193 


17094 


A 


3215 


376 


3 


PPGVF*RGPPGFRTLPPKSSSSSPPPKI 
LGGPPFLAFFGGAPPQKKPPSSSSSASS 
PPSSSSEAPLKKGGPFNPAGGFPLFGGP 
PRGGPPFSSSSSSSSSSSSSSSSGRRSR 
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NO: of 
nucleotide 
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SEQID 
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peptide 
sequence 


M 
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Predicted 
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nucleotide 
location 

inTto'S 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
kst amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SRTSRTRGRTRG 


3194 


17095 


A 


3216 


393 


260 


VSQDGLDLLTS * STRLGLPKCWDYRREP 
PRPACTLCISSTYNDP 


3195 


17096 


A 


3217 


3 


387 


HASASSGSRAPPFFFFFNKKTGSKELET 
TRGTVLKTCLFE'KKKKKVI.GGPKPPLGN 
PKGVGKPLKGFPGPNPWPPP*KGPGCHF 
*K1TGF*DPGPKKSHIPSWVGGKGSSLLP 
GPLGGPSEVYFSPLRA 


3196 


17097 


A 


3218 


77 


406 


RMARPELGLPGNLLSNDHIYNGIVTAHA 
FVI IFLIVI PI INGGFGN*LI PLI IGAP 
DMAFPRINNISF*LLPPSLLLLLASAIV 
EAGAGTG*TVYPPIiAGNYSHPQASVK 


3197 


17098 


A 


3219 


394 


2 


KRRYFPDGLNFFWGPGILKIFVKKKVSS 
LKKKKKKNFPPVFLQGWGGNKNFKGGGL 
KFSKPIILI*FFFPQKBAR*KVFFFFFRS 
PQKDPLREFFFFFLRQVSLC3IPDWSAVA 
RSQLTSASISRAQGAGRV 


3198 


17099 


A 


3220 


376 


157 


KFFFSP*KFFFSPKPLKFGGGVGPIIPP 
PKKRFFFKNPQGVFKKPPQKKKKIPFQP 
PVNFGPPRDFLKGPPPFFFFFFFF 


3199 


17100 


A 


3221 


394 


1 


VPPPQKFKTPGPPPPPREFFFF*KKKGF 
PPLGGFUSPAPKNPPPGPPKKVGFPGGP 
PPPPGGFFFSPPLSF*NPGERVFFGPK1 
PKRKFLLKWGKGGKFFPKKPIFPPPPKK 
KKKKKKRAAAEDLELADAW 


3200 


17101 


A 


3222 


207 


402 


SILM* LCCLFPLPGVTP IDGAPHRSYRE 
CYPVLLDGVMVGWVDKDLAPGIADSLRH 
FKVMREKRI 


3201 


17102 


A 


3223 


309 


3 


YPFFHLIDIAIHPCVCFTKFYKATVTQT 
AWSWYQIRYIDQ*NGTEISEIPPHIY]SrH 
VI CDKHDKNKQWGKDSLFNKWCWEN*IA 
ICRKLKLDPFPTPYAKI 


3202 


17103 


A 


3224 




382 


LDRERPPFFFWGARHMDIPQLVHLSINK 
GHWANFNFLGYKKKGGWEKKKKKKKKKK 
RGGPP*KKPPGGPK* PPtSGKR* I FPYMG 
GKKKPPGGFLEKPPPLGGAHLGNPPPQK 
YTPPGKKKNLKRET 


3203 


17104 


A 


3225 


133 


2 


FFFETESHSVTRLECSGTl SAYCNLCLP 
GSSDSPASAS*AAGI 


3204 


17105 


A 


3226 


349 


2 


AGVPPGNPPLWGGEGGGSPRGGGLKPGF 
PQRGNPFFFKKSQPTPPGGGPPLIPPPW 
GGGAGGSPLPQGQRFQ*TKIGPFPSPRG 
KKKKPPFPKKKKKKKKRKEKSEMPGFMV 
LNA 


3205 


17106 


A 


3227 


3 


239 


LNKVGRGCSEPRSRHCTPAWATE*DSIS 
EKKKKKPESRGILKVKGLTQAT.FYIiALW 
LYYLHPTAKQQIWGFFIYFSKP 


3206 


17107 


A 


3228 


205 


1 


IGLKIQNSCPLKDSLKKIKRQATDWRKY 
LQNTSDKASVFII YKEHLQLSNYKAVDP 
IK*WAKE1V!NKLH 


3207 


17108 


A 


3229 


174 


1 


VQMLEDKSFEETP*FSSETLVLKIPHVQ 
PGAVAHTCNP S TLGGRGGRI TRS GDRDH 
PG 


3208 


17109 


A 


3230 


2 


340 


KISrHSVYLLCVFSIPFPTFCFF*F*VFNT 
F* FKLN* IN* FTYIDRVliLCHPGWSAVA 
*S*LTAaiiNSWAQAVPCLSLLIiRHHHRM 
LI IFKKTAYI IHIWYVWYDTTYPFKVYN 
S 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
otl 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparaginc, P=Proline, 
Q=Glutaraine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptDphan, y=Tyrosine, 
X=Unknown, *=Stop cDdon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3209 


17110 


A 


3231 


1 


313 


KKSTPYQRGFDP I SPAEVPFS I KFFLEA 
ITFLLFDLEDALLLPLP *ALRFIFLFTV 
GGLTGWLANSSLDIVLHDTYYVAPHFH 
YVLSIQAVLAIIGGFIH*FP 


3210 


17111 




3232 


207 


355 


ELSPPRLKK?rLY*LC»IVAHACMPSTLGA 
RGGQIA*AQEFKTSLGNMaKP 


3211 




A 


3233 


3 


355 


TTQQ*LIKLTCKQTIAIHNTKGRT*ALI 
LISLIIFIATTNU,GLLPYSIjTPTTQLC 
INIiAMAIPL*ADAEVIGFRSKIKNALAH 
LLPQGTPTPLIPILVIMETINLLIEPIA 
LAERL 


3212 


17113 


A 


3234 


2 


355 


KYLIlIISrRLITNQQ*LIKLTSKQMITIHN 
TKGRT*SLILISLIIFrATTKHLGLLPY 
SFTPTTQI.SINLAMDIPI.*SGAiyiVIGFR 
SKIKETALAHFLPQGTPTPLIPILRIIET 
ISLLI 


3213 


17114 


A 


3235 


43 


370 


QGCVKGWVLBEQVRRGWILDSSEGKTDL 
KQRGSPGSWPEHVGGWSGVMG* SEAWTG 
QARWLTPI ITALWEAEVGGSUIPGVQWH 
NLGSLQPLPPRFKRFSYFSLPSSWD 


3214 


17115 


A 


3236 






LIQPSLKI.MISIHNTKGRT*SVILLSLI 
I FIATTNLLGLRPYLFTPTTQLSINLSM 
VIHL*AGAMVIGFRSNIKNALAHLLPQG 
TPTPLMPILVIIETIRPLILPIALAVRL 
TA 


3215 


17116 


A 


3237 




376 


GTRTNTLTIYQ*WGDGTRESTYQGHHTP 
PVQKGLRYGI ILFI TSEVFFFAGFF*AF 
YHSSLAPrPQL*GHWPPTGITPIJSPLEV 
PLIMTCVLLASGVSIT*BHHSLIENNRD 
QIIQALLITIVLQ 


3216 


17117 


A 


3238 


1 


358 


GTRNG*YTNA*ALSITVIGASLTSAQGS 
RI ILLTLTGQPRFPTLTNINENNPTLUJ 
PIKRLAAGSRFAGFLITNNISPGCPFQT 
TIPLYLKITDLGVTFLGLLTGLDLNYLT 
NKLIIKA 


3217 


17118 


A 


3239 


258 


390 


RQGLLMIAGLVLNSWPLQSSHLGFPKCW 
DYGREPPCLGN*LIL 


3218 


17119 


A 


3240 


2 


372 


ARAEFHHVSRDGLDLLT* *NTHLRLP KC 
WDYRREPLRPGKTFFLKKKKKNSIFLFF 
REGFKEKSILGIKFFRPIGGVLILTGNH 
GWGCKTGTELLVPSRFPGLAFKI CGLWA 


3219 


17120 


A 


3241 


3 


283 


HERLWGGWKTGAAGLGRTSSRPTASLTQ 
T*TTMTH*SRTTGCSTGSGRTWTRSRHS 
CAWGMAGLYRVAVASRGPRGMM*PTPPH 

WLCWAWKVP 


3220 


17121 


A 


3242 


170 


3 


I KSQAGLVGFLGP FS FQDSLNLVFVFGF 
FCF *DRVSFCS PGWSAWQSE FTAALV 


3221 


17122 


A 


3243 


2 


385 


ARADVDTPSYLTSDTIMRDIPTGGQVFS 
* LSTLHGSNMK*TAAPLLTLGFI FLFTA 
GGLTGLELTNSWLDIVLHDTYYVGPDFH 
YVLSIGAVFAIIGGLLH*YPLFSGYTLY 
RT YAQ IHFAIIFIGEN 


3222 


17123 


A 


3244 


1 


106 


GTRYVGQAHLKCLTSSDSPASTSQSAGI 
TGVSHSA*PASTSQSAGITGVSHSA 


3223 


17124 


A 


3245 


2 


360 


ARaNTLTIYQ*WSDVSLENTYQGHHTPP 
VQKGLRYRIILFITSEDYFFSGFL*AF* 
HSSLSPTPQL*GHWPPTGirPLIffPLKMP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=AsparticAcid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vah"ne, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, ^■=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LLNTSVLLASGVSIT*AHHSLIENNENQ 
IIQALLI 


3224 


17125 


A 


3246 


17 


160 


gg*gcscsel*sChcspAwVteqdfdsk 
kkpaiiiasclkhustplssh 


3225 


17126 


A 


3247 


236 


2 


WAHIYMTPLSPPSFLKPVQ*KNFYIYST 
ySLDN*NPSSPKAKRAPKKSYTLPYLHL 
CVCVCECV*VCVCVCVCVCVWV 


3226 


17127 




3248 


2 


2220 


FFGGGRPSPPQGYFLIJilNHSSPSPPVKL 
NPGPA* FYPPTKGKNFPPPQR* PSPPKN 
IKTPPPSFFFSS 


3227 


17128 


A 


3249 


2 


372 


AYTISFISKISFYSLH**LPKaHVEAPI 
VGS IVIA?UELLKLGGFG1 IHLTLIENPL 
TKHIAYPFLGLSL*GIS1TSSMWLRQTD 
LKSLIAYSSISHIALVVTAILIQTP* SF 
TGAFFLIIDH6 


3228 


17129 


A 


3250 


293 


56 


EGSPKVIFNKSPPHHLFLFLFFFFIFFF 
* FFFFFFFFFFFFFFFFFFFFFFFFLYL 
LAMFYLSFFFKQDNQRYRQYSIK 


3229 


17130 


A 


3251 


169 


370 


LIMTELRGAPASKPRGQKPPPHYPCHHH 
HHHHHHFL*VTKGQGPHHWPSPTRDPGW 
L*SPS*EDQER 


3230 


17131 


A 


3252 


22 


156 


GERIGLGLGGQGCSEP*LCHCTIjAWVGD 
TVRPCLiOKKKKKKGPPF 


3231 


17132 


A 


3253 


3 




QNQTPLLD*GGLITAVLLLLYIiAVIjTGG 
ITILLADRSLDSTLFYPAGGGDPILYQH 
LF*FFGHPEVYILILPGFGIVSHIETNY 
WGGKEPFGPVGMV*AMIAMGFLGFIG*A 
HHI FTVGVDVHTRA* FTSAT 


3232 


17133 


A 


3254 


373 


31 


REVGPPTP * KI FFFPKGLNFWGGGGP KF 
PPPKKKGFFKKSPVGVFPPPGG*KSGPG 

PGFKKPPQKGKNISFPAGGKIGPPRGTL 
KRAPPFFFFFFFFFFFFLLWWVQVERP 
TL 


3233 


17134 


A 


3255 


1 


379 


LNL IQRQ*R*V* KFL * LPPQT * KKKKKK 
KKKKKGGGAFKKNLGGAKFNGGRKKKIF 
FLKGGVKKKKAGGGFKKRGKGKKCYLGI 
FEKKP F FGGGKNWENP P KKI KGLREKKK 
F*GEKGEKKPEKAG 


3234 


17135 




3256 


42 


376 


FCY I SLVHHCI YMDLS FERKKNI FVPGQ 
INSISSIA*EAHCKNKSLLHAVKKKKKK 
KKKKKKKKKKKKKKKKEKKRGEKKKKKK 
KEGRSSLKKEK 


3235 


17136 


A 


3257 


353 


67 


CYPLSPLKFFFSPHSLKFl'rKGVGPIISP 
PKKKVPSQWSQEAGFPSPNVLKRHPGPW 
FKTTP*KEKKNPFPPPVKFGPPKESLKR 
PPLFFFFCGLQ 


3236 


17137 




3258 


2 


129 


FHRI SQDGLDLLS S * SARLGLPKCWD YH 
CEPPRPAKNKILLS 


3237 


17138 




3259 


205 


415 


QQKNRRLLHFKGARTHNSYNRGQPTTPS 
ITAHMYPRLRQSHTI YVLRVHHP *VPSA 
IEGPVSV*ALLHSSTIWAGNLLLIRFH 
PLAEKSPLIQTVTLCLGANTTLVAGGWA 
LTQNDSKRIAAFSTSRGLGLI IVTIGVN 
QPHLALLHICTHAFVKArLFMCSGSIIH 
NLSKEQDIRKIGGLLITIP 


3238 


17139 


A 


3260 


3 


393 


SFNLSTLITTQEHL*LLr,PS*PLAII*F 
I STLAETNRTPFDI.AEOEAELVSGFNI E 
YAAGPFALFLIAEYTNIIIINTLTTTIL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
etii 

0(1 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

residue of 

sequence 


Predict- 

location 
correspon 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGTTYDALSPELYTTYFVTKTLLLTSLL 
L*IRTATPRFRYDQLIHL 


3239 


17140 


A 


3261 


2 


4O0 


ISDLSEK*FKRLWKLIMEAPEKGKAQC 
KEIQICMTQEVKGEIFKE*IA*RKKKSKF 
QETUJTL lEMQSALES FSNR I KQVEEEN 
SELKDKIPELTQSMKDKGKHIRKYEQSL 
■< VAWD YI Q * PNI ,G IIGIPEKE 


3240 


17141 


A 


3262 


-^50 


129 


i«lafn*v*?;fcvi,aiklt,kvkrypikw 

EKHFANHISDKGLVSGIYKELFRLSNKQ 
AIDLTF* KWAAGIIGGSPL 


3241 


17142 


A 


3263 


238 


3 


KEKKIGLKKCLQGSHFSIHTAWSI I I YM 
FSPIiTIISRKRMGQPGIVAHTaiPST*G 
G*GRWITRSGVRDQTGQHGKTP 


3242 


17143 


A 


3264 


350 


3 


S PTLLGSiCDENLLGFRFPliWKKGKI ISA 
PLSLGLN*RFFSEVVLIP*KPPK1!TOPGG 
TFLWCFLKRGFPFLSQKKKKKNPG^VA 
HAGNPSTLGGRGGRSQGQELETSLANTV 
KTH 


3243 


17144 


A 


3265 


69 


200 


RLECSGVISMiailtNLPGSGDSPASAS* 
LAGITVMVKLPVIAK 


3244 


17145 


A 


3266 


223 


408 


GGFPFFPPdGGEGfiNPGELEPtPPeLRK 
FFCLGPPRRGD*GPRSSSPGSFWVFKKN 
GVSP 


3245 


17146 


A 


3267 


185 


420 


DQGLWGFriYFYRQSIiA*VHWlINPSSL* 
PRTPGLKHTPVPSLLISWDYGRTPPHLT 
NFCIFBT)RGSPFF*DRVSATHA 


3246 


17147 


A 


3268 


3 


392 


TGGHS IPQAGVQWHNHGLLQPQPPGLR* 
SSCLSLPSSWHY 


3247 


17148 


A 


3269 


1 


398 


KFSCISSKHQKLKLTPKPPKPPPKKSPL 
VLPIGKKIRETFWGAFKSPPPNQPKGAQ 
TLPLKIWDKMGGGGGIiALW*KAPPGNF 
KGPPGGKPMEQP * LGPGPPLKWKGGLPH 
QKBGFSKAPGEKKKGGEGRL 


3248 


17149 


A 


3270 


422 


183 


ESYRTPSRFrLMKTTSEHLIIILPKVNN 
TERIL*MQQDRGNNIQWSSGCSA 


3249 


17150 


A 


3271 


3 


35 


KHNSL 1 1 PTI lATITLLNLYFYLSPLLY 

**SSSPPS 


3250 


17151 


A 


3272 


155 




KDFFFFFLRQSFTLVAQAAAEWCDLCSL 
QPKPPGFK* FSYNSLSSSWDYIG 


3251 


17152 


— 


3273 


413 


71 


P PS TGLFLTEEYEVS FPPFPL*KFFFPP 
SGLFFGGGVPPFFPPPKKGFFPKYPRLV 
FKGPLLGGGGLPPPPP*rLPPLGSFLPA 
PPLFFFFLFFL 


3252 


17153 


A 


3274 


336 


45 


DRVLSCSPAWRA1MAESHDFG*LQLPPPR 
VKVFSCLSNPSSWDPRHVPPRKGNFVFL 
VKTGN PWWLGGQGCRERRLCPCI PAWGT 
DKDSVSKKKKKS 


3253 


17154 


A 


3275 


206 


366 


SVFFFVLFGFFYGALLCFPNPWLECSGT 
ILVHCNLHFPGSKDSSASPS*VAGT 


3254 


17155 


A 


3276 


334 


75 


ENTRRVERRRRTHI ITOIFF*EFMNRFF 
FFIFFDGRFS CVFFFFFLFYFFFFFFF F 
FFFFFFFFFFFFFWFSAESFIYFLFPRH 
VT 


3255 


17156 


A 


3277 


144 


17 


KAPPLFFFFFFFFFFFFFFFFW*FRG*T 
HWNGDACMCKLTKS 


3256 


17157 


A 


3278 


2 


109 


yHIVKPSP*PLTGALSALLMTSGLAM*F 
HFHSITL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

locafion 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcuclne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=pos$ible 
nucleotide insertion 


3257 


17158 


A 


3279 


404 


65 


NLVSSPKRSVFFISPSQKIFFFFFNWKE 
SKISLPKEGGGPLLLLIRQRTPIFFFFF 
FYFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFL*TACLF*TFKFTKTK 
R 


3258 


17159 


A 


3280 


226 


383 


GI*KNSMr)GCCLGPFWFETGSHSHPGW 
SIVWRHIiGSLQLLPPKFKRFSCLS 


3259 


17160 


A 


3281 


91 


26 


LFFFFFFFFFFFFFFFWSERSS 


3260 


17161 


A 


3282 


3 


321 


HTVIYYFGKMEPFGYIGVA*AMISVGSL 
GFMA*AHHI FTGGIDVYSPAYLTCATI I 
VAI PTGVKVFI *IiATLLGSGMKRSAAVL 
*TLGFIFLFTARGLTGIVLSNS 


3261 


17162 


A 


3283 


393 


112 


PPLRVFFPPFPLKHFFFSSERFFFWGGV 
APFFPPPKKGFFSKI PPGFFFP PPLKKK 
I FFSS PP *FWaPPGFFIiKGPPPFFFFFF 
FFFFFFFFF 


3262 


17163 


A 


3284 


360 


47 


PPLHRWVPPPPP*KFFFSPQTKFFGKVF 
SQNPPPPKKSFFPKNPPGVFYSPPKKKK 
KFFLPPPIFFPPPRFFPTTPPPFFFPFF 
FFFFFFFFFFLPLFFWSVSF 


3263 


17164 


A 


3285 


386 


157 


FFgpNPLltWGGGePKFPPPKKRffSKU 
PPGVFFSPPLKKKKFFFPPP*NLAPPRD 
FLKGPPPFFFFFFFFFFFFF 


3264 


17165 


A 


3286 


3 


385 


DAWVPCKLYLGVFFCRFVKISAI LILKI 
LrFNIENKSI*YLLKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKK3(KKPTPKKKKK 
KDSR 


3265 


17166 


A 


3288 


81 


270 


IIKI^KKKKKKKKKKKKKKKKKS*K:eHE 
KTRGPKKKVHRDVTKKILCYKI CPFNKG 
ISLIGT 


3266 17167 


A 


3289 


3 


145 


LGRLRQENHLNLGG* GCREPKSHHRILA 
WATEQDS I S KKKRKI GGPV 


3267 ! 17168 

1 


A 


3290 


404 


47 


GPGGK I FFKKNPEEKI PSTQKKKGFFP P 
FPPKNFFFPPGGFFFWGGGGPIFPPPKK 
GFFPKIPRGVFFTPPKKKKIYFFPPREI 
WAPPC-IF*KGPPLFFFFFFFFFFFFFLV 
LNDILLA 


3268 


17169 


A 


3291 


106 


362 


KKKKKKKKKKKKKKKKKKKKKKKRMKMK 
KKKKKGRGGGKKKKPRGAQ II RGKKKI I 
FFFKKVFK* KGGGDL* KKKSFCGENNLE 
NTHKE 


3269 


17170 


A 


3292 


3 


226 


NPLV]SILNLGLLFILATSSLAVYSIL*SG 
GASNSNYALI6ALRAVAQKKKKKKKKKK 
KKKKKKKKKKKKKGGGVL 


3270 


17171 




3293 


337 


2 


FFNRELLKSKNKETEKNHDYHKERIWHF 
DRPYKFYMLFFCFGVLIFFLYLPFFIYF 
FFFFFFFFFFFFFFFFFFFFLNSLNI*F 
FS INTHKCGNRGQGGGAIGPRLSNGRV 


3271 


17172 


A 


3294 


327 


21 


KFFFSPPNKYFFFVFPHKFPTPTKMFL* 
KIPPPPLISPPPKENHQCPPPPHIFAPP 
SHPFFGPPHFFFFFFFLFFFLWRTGSRY 
lAQAGLELLGSSYPPAS 


3272 


17173 




3295 


367 


101 


FFFSFF*KKKNFFPPPPLWPPPPNFL*N 
PPPPFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFLWSLCRRYRG*SLLY 


3273 


17174 


A 


3296 


178 


2 


KGPPFFFFFFFFFFPDSITLFIITKNEK 
KTRCTS IGE * I TWYTHTGE YYSVMKRNE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 


SEQID 
NO: in 
USSN' 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 

last amino 

residue of 

sequence 


C=Cysteine, D^Aspariic Acid, 
E=Giutaniic Acid, F=Phenyialanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ]VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptQphan, Y=Tyrosine, 
X=Unknawn, *=Stop codon, /=possible 
nucleotide deletion, \=p05sible 
nucleotide insertion 














LS 


3274 


17175 


A 


3297 


147 


1 


RGPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFTW* FALI FLV*NF*AI 
ITF 


3275 


17176 


A 


3298 


3 


347 


PFGEGETNGEFDLDTKKALVSKKKKKKK 
KKKKKKKKKKKKKKICKKKKRGGALKKKN 














*KKKKFLGGDNLAQTPPKKKNLWGKKKN 
FWG 


3276 


17177 


A 


3299 


405 


2 


WRPKPENSPSPGGPNPPGQKN*TPFPPK 

TPKLGRGCGGAPQFPLPVRARGENPPNP 

KEIKICGPRF WAGVL P TG3AEGES I PHLS P 
ASGSSRHPWCFMLCRPSRLCLL 


3277 


17178 




3300 


125 


375 


DQPVQNSETS3LQKKKKKKKLFPGGGGP 
PLFPHELGGWGGGGTRPPG* RKRGKSGV 
PQPPPPGGRVPPPPHPPKKWGNKGGPPP 
PGKSFFPFFFFWRDEVSLFCTGWSQTQT 
PGLKQSIHLSLPRWLDYGREPPHLAIHF 
FPTRPRTRG 


3278 


17179 


A 


3301 


348 


1 


PQRLKILGGGGPPNSPPPKKGFFPKTPR 
GF*NPPPKGKKKNFPPPRKrGPPQGFF* 
RAPPLFFFFFFGPWFFFFFFFFLFLENG 
VSLYCPGRSRTPGLKLSSRLLaSLRAGI 
TGVS 


3279 


17180 


A 


3302 




354 


GRVGGRVGSPTTLTT*NPHSHEKTPSCS 
YPKKKKKKKKKKJCKKKKKJCKKKKKKKKK 
KKKKKKKRKRKKKKKKT*GGQAYIKKKG 

KKTPH 


3280 


17181 


A 


3303 


362 


26 


SSQAAHPTKKKKKKKKKKKKKKKKGGAL 
KKNFGGGKNKGGEKKKIFFFKGGGKKNP 
GGNFKKKNFFGGGKIGENPPKKK*RFRK 
KKKFLRGKGGKKS 


3281 


17182 


A 


3304 


371 


3 


NQKKPPPPVFFFFFFKKKKNPPPATPPP 
GRP* FFPPQKKFFPEGGPPPFFKKGAFF 
PPPKKIirPFEPFFPGGVFFF*KKKNIKVF 




























PARDLEPHAS 


3282 


17183 


A 


3305 


2 


196 


DR*LFSTNHKDIGTLYLLFGA*AGVLGT 
ALSLLIRAELGQPGNLLGNDHIYNVIVT 
AHAFVIIF 


3283 


17184 


A 


3306 


4 


339 


LFSTIIHKDIGTLYLLFGA*AGVLSTALS 
LLIRAELGQPGNLVGNDHIYNVIATAHA 
FVIIFFIVRPIIIGGFGN*LNPLIIGAP 
DRAFPRINNISF*LLPRS ILLVRASAIA 


3284 


17185 


A 


3307 


2 


367 


KPPAITQYQTPLFV*SGLITGGIjLIjLFL 
PDLAAG I T I LLTDRMLNTTL FDPAGGGD 
PILYQHLF*CFGHPEGYILILPGFGIIS 
HIGTYYSGKKEPLGYIGMV*AMRSIGFL 
GFIE*AHHIL 


3285 


17186 


A 


3308 


243 


370 


KGHRPGMVTPVCNPSTLGGQGGWIRRSG 
I*DQPGQQGVTPSL 


3286 


17187 


A 


3309 


2 


363 


LVPLI IGAPDMVFPRINNIGF*LLPPCI. 
LLL lAYAI MEAGAGTG* TAYPPLAGNYS 
HPGASVDLTIFSLHLAGVSSILGAINFM 
TTI INI* PPAITQYQTPLFV*SVLITAG 
LLLLCLPG 
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SEQI© 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 

residue of 

peptide 

sequence 


Predict- 

iocation 
correspon 

last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVletliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3287 


17188 


A 


3310 




364 


LYITGPTriALTIALLL*TPLPIPNPLVW 
LNLGLLFILATS SLAGYS IL* SGGASNS 
NYALIGALRAVPQTISYEVTLAIILLST 
I.LIRGSFHLSTLITTQEHL*IiLPS* PL 
AII*FISTL 


3288 


17189 


A 


3311 


215 


2 


GMAKLKARFLPKKGGKRSLSPLLWGSGF 
PTSGTGETFFFFF*DRVLLCQPGWSAVA 
QS*LTTASTFWAQAI 


3289^" 


T7T90 


A 


3312 1 153 




MNKCICMCVCMCVCFRSHKHTVCAWVCV 
Y*V*TyTVCVCVCVCVCVCVC 


3290 


17191 


A 


3313 


149 


1 


KTPPGFFPGFKGPF*FKGPPLFFFFFFF 
ETEDGVIiCRPGWSAVSRDRA 


3291 


17192 


A 






69 


KIGKTAPFFSWPRRVPFLKKKKKKK*GL 
TLSSRLEYGGMI SLELLGSSDTLASASR 
VaRIIGQCPHAWLT 


3292 


17193 


A 


3315 


479 


60 


MMQHSIKHALVEITGWVRWLRPVIPALW 
EAEVGEFNSISKTNNRKRSNIVKMSILF 
NLIYRINAPFFKFLIALLNKNSKSKSIW 
NLKS**STQ*SSRNNVGGITISDF*THQ 
QATEIRQEFLHTGLVHRSTVSRFRFHA 


3293 


17194 




3316 


129 


3 


SFHOiCVWie>DAVAHaCSE>STLGG*GGRIT 
GSGDQDHPGQHGE 


3294 


17195 


A 


3317 


3 


156 


E FHRVSQDGLDRLTS * STIiGLPKCWDY 
RCEPPRPASFPSFLTVRISPFL 


3295 


17196 


A 


3318 


36 


340 


TGLVIAEHLLFFFFKKGFWFCPPGGHAG 
PQGNLMEFPVSGGKGIFCPGPPKSGE*R 
APPPSRGKFWFFKKKGGLPFWPGGV* TS 
DPRGTPPPLGPKGGEYR 


3296 


17197 


A 


3319 


326 


2 


ICaFFLNFKPHPGFA*GLKFKPNSLLPI* 
SFIRKT*AFLICL* IQLNRADGKIPLKP 
GVIPFKKCFPNFTPPLFFKKWCSSTPNL 
FFFFPFFFFGDGIS LCRPGWS AVA 


3297 


17198 


A 


3320 


175 


312 


QAGVQWCDLSSPXPPPSGLT*FY* CSLP 
RTWEYRCPPSRPCNFSIF 


3298 


17199 


A 


3321 


363 


51 


RWGPRLPVAWLPSVSRRAEECSPGRGAH 
GQGGGGGGQflE*REGPDEWTGGAPVGiyiG 
SPSLNRGAGRGPPQPGPSSKSRRGRVRC 
TPHSCLIGCGLSFFICKMG 


3299 


17200 


A 


3322 


344 


187 


LRQETCFNPGGKNFG*QKLPPCPPAWAT 
RGWVS KKKKKHKKRKEKIRI CVLT 


3300 


17201 


A 


3323 


3 


313 


TRRERERERERERERERERERERERERE 
REREFFFFFF*KKNLSPPPGGKKRGGGG 
FHSPPFWGEKDFLPPRPPKGGERGGRPP 
PPVFFFFFKKEGAPPGGGV 


3301 


17202 




3324 


365 


1 


SLGWWRVTPPPFPLGPRGGSPPRPDKRG 
SPGRVKFGGVKPFPKVKP*PPPKRLDPS 
GA*FFFSPKKICQGPPPRKKKKKEAELS 
ILCTKFPLQEEEVMQVPPPPCTCSSEPG 
INCICKRHC 


3302 


17203 


A 


3325 


1 


269 


ARAiSLSLSLSLSLSLSLEFFFFFLAPT 
QILGGGLFYPP KGGAHLSNHLGGGRVF F 
GPPGKKTAPLGGGRVIKKKTPPGTPL*L 
SPPPGGPFFFQGGQKKLSPPPAGLTGGP 
PPWGGKKAPPPKFGSGPKKKKKTLERER 
ERERERERERELV 


3303 


17204 


A 


3326 




316 


ARGERERERERERERERERERERERPFV 
SPPGPDTLQDRGGVSLFASDPTHEIYLP 
CPLRGPLSERDPEELHGVCFFSEPSHRW 
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Predict- 
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correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FPEEGVGDICQC*HRGECGSP 


3304 


17205 


A 


3327 


2 


369 


AHGDVTRESTYQGHHTPPVQKGLRYGI I 
LFITSEVFFFAGIF*AFYHSSLSPTPQL 
RGHWPPTGI TPLNTLEVPLLNTS VLLAS 
GV*IT*AHHSLIENNRNQIIRALLITII 
LGLYFTLLQA 


3305 


17206 


A 


332S 


1 


259 


GTRKKHSTILIIREIDIKFMPRFHISPI 

RLVKI*ATANIRC*QQCGKVG\-T,T,HC*F 
SKLVQSTGRAIWHALVLSLARSRALSLS 
LH 


3306 


17207 


A 


3329 


2 


352 


ARGTLLLLCLLFFALTIYPRWCDVTRKS 
TYQGHHTPFVQKEPLVLGGKLFITS*VI 
FFAGFF*AFYHSSLTPTPLLGGHWPPTS 
ISLFNPLKKPLLNTSGIiASGVSIT*AH 
HSI.IQ 


3307 


1720S 


A 








HEEPLGYIRMA*AMISIGFLGLIVRflHH 
IFTGGIDVDSRTYLTSATI IIAMPTGGK 
VFR*LATIH3SNMK* SQaSL*ALGFMFL 
FTVC3GLTGI VLTNS *LDIGLHDTYYWA 
HFHYVLS 


3308 


17209 


A 


3331 






GTSGDTRAYFTSATI I lAIPTGVQVFS * 
LATIHGSNMK*SAAEL*ALRFIFLFTVS 
GLTGIVLANSSLDIELHDTYYWAHFHY 
VLSIGAEFAIIRGSIH*FPLL*GYTLDQ 
T 


3309 


17210 


A 


3332 


2 


352 


A35GDVDTRAYFTSGTIIIAIPTGVKGFS 
*LATLHGSNMK*SAAVL*ALGFIFLFTV 
RGLTGIALftNSSLDIVLHDTYYGGAHFH 
YVLSIGAVFAIIGGFIH*FPLFSGYTLD 
QTYAK 


3310 


17211 


A 


3333 


85 


370 


QVSHRVRPCFERERERDRYRHTQRECER 
AKEIYSGFFLSASEMESCSLAQAGGCKE 
LRMCHCTPAWVTQGDCVS*NKK*HWEKK 
GLWFLGVNIST 


3311 


17212 


A 


3334 


362 


3 


QEVKVSMNRDHTHSLQbeEQKRirSVSKK 
KKKS SQEKLKRVHTQTHT* IFIAVLFIV 
AKNGKQFKFPSTCEWINNM*YIHTMEYY 
SATKNQLIHVTT*MNLNNIYTKFLKARQ 
KTCCTYSC 


3312 


17213 


A 


3335 


3 


351 


HEGTTLHHLFLLETGSNNPLGITFHSDK 
KTFHPYYTI KDALGLLLFLLSLMTLTLL 
* PDLLGDPDNYTLAHPLNTPAHIKPE * Y 
FLFAYTILRSGPNKLGGGPALLLSILIL 
AIIP 


3313 


17214 


A 


3336 


17 


350 


FIFTLTETNRTPFDLAEGESELFSGLKI 
KYAAGPLALLFITEYTNIIIIDTLSTTI 
VLGTTYDALSPELYATYFVTKTLLVTSL 
FL*IRTAYPRLRYEQLIHLL*KNFLPL 


3314 


17215 


A 


3337 


2 


349 


ARDSL IS PI LRLHPRPNLRQNAFHYHIH 
RGKSNFLPTTLSRP IRNAPTLLGLPRC I 
HHMraPIICRLIH*FPLFSGYTLDQTYA 
KMHFT I I FI GVNLTFFPQHFLGLSGMPR 
RYSDYPDAYTT*NILSYVGSFISLTAAI 
LI IFMM*EAFASKRKVLIVEEPS INLE* 
LYGCPPPYHTFE 


3315 


17216 


A 


3338 


2 


386 


ARAILNAMAFLILTERKILGYIQLHRGP 
NWGPYGLLQPFADAI KLFTKE PLKPAT 
STITLYISGPTLAIJSIGLLL*TPLPI PN 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














PLRNLNLGLLFHATSSLADYSIL*SG* 
ASNSNYALIGALRAW 


3316 


17217 


A 


3339 


175 


16 


ILGDLFPPAWLHPFLFLLLLFSRPSLAV 
TEaAVQWRIILGLLQPLPPGY*SSWC 


3317 


17218 


A 


3340 


1 


369 


GTSNLSLLFIVATCGLAA1!I*ML* *GGKI 
N*1YTLRVALAAIIHTILYEITLSIILL 
* TLLISGSFNLSTLITTQEHL* LLLPS * 
PIAII *FrSTLRETNRTPFDLAEGESEIi 
F*aLNIEYSAR 


3318 


17219 


A 


3341 


1 


363 


GTRGGTILPAMVLMLIDVPSLRILYITN 

K'»T^EPSLTIKSNGHQWY*TYQYTDYGGL 

ILNSYILPPLFLQPGDLRILDVDNRWL 

p I E 1 p :: K I :i NT * qd glhs *agptlglkt 

DAIAGRTOQ 


3319 


17220 




3342 


181 


315 


PKSPPTPPPPTXXXXXPGGNPPGPTL*G 
PFYWGGERGPPF*TGGP 


3320 


17221 


A 


3343 


3 




HEVSQSCRELLTSGDPPASASQSAGISG 
MHHRAWPVS*TFHIFNIYCGPGTVLKPY 
LNL 


3321 


17222 


A 


3344 


83 


354 


VLGARCPCaGVSRVSMEPWFLWQQACLL 
PWWRCCLSTLYAURADRDVP*TSRTGPV 
MVaHACSPSTLGGQGGWVTRSGV*DQPG 
QDGETP 


3322 


17223 


A 


3345 


124 


379 


GQGICFMWSLGLSHHLAHCRYTVGVNN 
LLCSSSLPFLLPLLPLPPPEGPGWDDIN 
I I I FI I FRDKVLLCCLGWSAVT * SQLTT 
A 


3323 


17224 


A 


3346 


3 


368 


YEPIP*PLTGALSALLMTCGLflM*tHLH 
SITLLILGLLSNTLTIYQ*WRDVTREST 
YHBHHTPPVQKGLRNGIILLITSEAFFF 
AGVF*AIYHSSLAPTPQLGGHWPPTGMT 
PLNPVQGPLL 


3324 


17225 


A 


3347 


1 


352 


GTSAGDVNYG.* IIRYLHftDGAKIFLRCR 
FLHSGRGLYYGSyVYSKT*NIAINLLIA 
TIATAFIGYVLA*GQILF* GATVSTNLL 
SDIPYIGTDLSQRI*GGYSVDRPTLTRF 
FTFHF 


3325 


17226 


A 


3348 


2 


360 


ARAEISPLHSNLGNESETPSQKKKKKKP 
PGGGGKNWNPYPSGTPPPNPPKGGE *RE 
EPPPLTKRVF*KKKVFFEGPRVKKNLGG 
KGKKKRGFPAR*KNPVFKRKGKGRNFGP 

QAFFFKK 


3326 


17227 


A 


3349 


226 


366 


PQSFCSVASWPAGDLMEI*AWQKFEYKP 
GKVPMDPEGCLLPLAKKKEEEEEEEEEE 
EEEEEEEEEEEEEEEEEEEEEGEEEELV 


3327 


17228 


A 


3350 


2 


374 


ARGGGYSPHRATLTRTVTFHS ILPFIIA 
GLAAIHLLFLHETGAYTPL*ITSHSEKT 
TFHPYYTMKDALGVLLFPLSLTTLTLCS 
PNLLGEPHDYTLTNPLNTRPHIKPE *DC 
LFAYTKMRSGPN 


3328 


17229 


A 


3351 


1 


266 


LGLVEHFLTLTTKLTSSNISDAQEIQSV 
*NILTAIILNLMVLPSLRMLYITNEVFD 
PSLTIKSIGHQWY*AYEYTDFGGLIFNC 

YILP 


3329 


17230 


A 


3352 


207 


362 


ILFLLKQKIYVLPGFSN*6SSIPELaHS 
DAYQTKEICSSGKVYKYLQCWEKW 


3330 


17231 


A 


3353 


178 


365 


NHGIDKGIiRYRIYTVQQEDNSVKR*I,Kn 
FNQHLSKDNIRSADKHMKRCPTSLVINI 
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sequence 
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ed end 
nucle- 
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location 
correspon 
ding to 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Hist)dine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MPIKPP 


3331 


17232 


A 


3354 


359 


3 


LQVRSLMPLGQFHPP*NFFCSFFPKPKI 
P FFFL FFLKGVF VF FPGWRARGQ FWVFG 
PPPPKFKKFFFFTLLRNWGFRDFPPSRG 
KFFFFFFFWYF* *RLRFTMLTRLVFNS* 
PQVISC 


3332 


17233 


A 


3355 


2 


354 


AREPSP*PLTGALSALLMTCGLSM*IHL 
HS ITLLILGLLTNTLTIYQ *WRDVTPES 
TFQGHHTPPVQKGLGYGIILFITSEVFF 
FAGPL *APYHSSIAPTPHLGGDWPPTGV 
TPLNAL 


3333 


17234 


A 


3356 


1 


364 


GTREMICCSALSPRIHLSFHRRWPDWHC 
ISKLITRHHTTRHVLRWSSLPLCPINHS 
WICHHRRLHSLlSPILHIiHPKPNLRQNP 
FHYHIHGRK*SAAVL*AIiGFIFLFTVGG 
LTGIVLANSSLDIGLHDTYYVGAHFHYV 
LSIGAGFAIIGGFIH*FPLFSGYTtNQT 
YAKIHFTIIFMGVNLTFFPQHFLGLSGM 
PRRYSDYPDAYTT 


3334 


17235 


A 


3357 




376 


ARGGQPEDYILILPSFGI IFHIVTYYSG 
KKEPFGYICMV*AMIS IGFLGFIA*AHP 
IFSVGIDVNTRAYFTSATI I lAIPTGVK 
VFSW1ATLHGSNMK*SAAAL*ALRFIFL 
FTVSGLTGIVLCag 


3335 


17236 


A 


3358 


357 


139 


AQPWFFFLETVFCHaAQflGLKS IGSSDL 
PTSaSQSVGTTGMSHHAWPEHSC*QliITF 
TYYVQSIYSDDIRKSMF 


3336 


17237 


A 


3359 


277 


1 


SCQKDISIGFLFVPGFHVKGVAQEHLVG 
VRMYVLVPTVGaDGKERER*RGIEHTPP 
PSERTOQRERERGRDRDREKERASQRKP 
EMDRETSC 


3337 


17238 


A 


3360 


261 


2 


TICSKCAWIiCFSKTIiLTlSTCASRSDLflH 
GP*CPNIiCSGIFSSAWTDISHLLLFFCL 
FVLRQSLTVAQAGVQWCNLCSPQPLPPG 
SC 


3338 


17239 


A 


3361 


199 


354 


HLPYWPMFMaTFHFFPFFETEFHFFPQA 
GGQWGDFG*LKPPPARLKQFSCLN 


3339 


17240 


A 


3362 


3 


364 


HEDRDTPAYFTCANIIIAIPTGVKVFN* 
IATLHGSNMK*SAAW*AUIYI FLFTVS 
GLTGIVLSYSSLDIVIHDTDYWAHFHY 
VLSIRAVFAI lGGFIH*FPLFSGCTmQ 
TYAEIHFT 


3340 


17241 


A 


3363 


3 


360 


HEETYI PKDQKYS FLHDSQTS FCFSNS I 
PTPSNMEETQQKSVSGCELP*AGMGETC 
GQGSRAAQPLPS LPLQNLKLLR ISLLLI 
EAWLEPERVLKSMFANNLGYDTSDSDDY 
HLLKDLE 


3341 


17242 


A 


3364 


352 


27 


VYLSSQRNSAASWRMTWTDRSLSPYT 
LGWHRPDRLFHFVFSLFVYTFVERGSQS 
lAQAGVQWCDHSS L * P*TPGLK* * TCSC 
LP*VAGTTGTHHYTGPLILLNLFN 


3342 


17243 


A 


3365 


347 


99 


GGPPLPRGFSPPPGARPIAPPWGPPQES 
PPPPPPKTAPQKKKKAPSIPAGQWLVEQ 
NPRPQPFKKPPGVGF*NPPPPLNMGF 


3343 


17244 


A 


3366 


3 


319 


HEECIAHTLCFFNLMS *RTTVI LHRMFS 
IVTAI*YLLSSQYVIGYYIYVCIHTHTII 
MYIHTHRHRTFICVCVCVCIYIYTSIC 


3344 


17245 


A 


3367 


3 


474 


YELLLUSIGPILNARAFLILTERKILGYI 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucin6, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Tlireonine,V=Valine, 
W=Tryptopljan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QLHKGPNAVS PYGLLQPFADAI KLFTKE 
PLKPATSTITLYITAPTLALTIDLLL*T 
RLPIPNPLDNLNLGLLFILATSSLPVYS 
IL*SG*A*NSNYALIGALRAVAQTI **E 
VTPTIILLSHYYKWLEN 


3345 


17246 


A 


3368 


2 


348 


ARGLQDATFPIIEELITFHDHALIIIVL 
ICFLGLYALFLTLTTKLTNTNISHAQEI 
ETV*TILPAIILVLIALPSLRMLYITDE 
VNDPSLTIKSIGHQWY*TYQYTDYGGLI 
FNS 


3346 


17247 


A 


3369 


64 


363 


KKITOFVPQQKKGGGRISVNGTPIFKGKG 
NFPP*PPGEGGTKGPGPKPK*IWIFKKR 
GGLI FWPKQFPTPGPKGTHPPGPPKGGK 
♦RGGPPSPGGPNFKRG 


3347 


17248 


A 


3370 


34 


355 


AKGKEYIETVAEKKKKKKKAPPPI*KAP 
♦NPGKKPPLALKPPPPSQKAPPPALFCP 
*KGPPPRVFPKKKKKFPPFGHKGFFFTG 
GKPP 


3348 


17249 


A 


3371 




374 


FLHSLHLSLRLLYLP*KFSMFI ILLKAY 
YTLCVCVCVCVCVCVCV* PLNKIHINHF 
TCSTVRGRSNPIPFPPLLN*RPFYTT 


3349 


17250 


A 


3372 


429 




FFFFLKKQKISRVFPRGPEFPGQRKPFW 
AKGPPRAPQKGGEKGKNPSPPPPLIFFK 
KGEKPQGGPLFKKGLGFFSPPKVFFKKG 
VFKSGPGGP*NFGFRGFSPPSPPICKGGK 
KRKPQS PGFFFFFFFFEILGCSGRSRSR 
T 


3350 


17251 


A 


3373 


2 


404 


LGTDLSLI IRADLGQTGDIiLGHDHIYNA 
IVTAHaFEIIFFrEIPIIIRGFGN*LIP 
LIIGAPDMflFPRINNISF*LIiPPSILLL 
LflSAIVEflGflGTSGTDYPPLSGNYSHPG 
ACVDLTILSLHLaSVSSILGAI 


3351 


17252 


A 


3374 


2 


417 


AFV*rAHITRDVYYG*IIRYnHAKGaRI 
FFMCLFLHIGRGLYYGTFLYSET*NIGI 
ILLIiATIATAFIGYVLL*GQISF*GATE 
ITNLLSAIPYIGTDLIQ*I*GGYSVDSP 
TLTHFFTFHFILPFIIAGPNPIHLTT 


3352 


17253 


A 


3375 


85 


1 


™pgavayacypstlgg*ggwimrsgvr 


3353 


17254 


A 


3376 


396 


1 


KEGyiKGPRGENFF*KNPERKIFSRQRK 
GGVFSPLPPKKFFFFPKGLI FLGGGCPI 
FPPPKKRVFSKNPPGVFINPP*KKKKIF 
SPPPENWGPPRVFLKGPPPFFFFFFFFF 

PFFFFQTLFLGGCGYNTKLR 


3354 


17255 




3377 


3 


118 


LLPPSLLLLLAYAILVAGAGTG*TDYPP 

ltgny*k:tka 


3355 


17256 


A 


3378 


301 


82 


ERKGPPQRGGKKGPKAQKPEKKKPKKRP 
PPPQN*FFSNPPKKKKGGKDPPSKKEGG 
GKGVPPPRKKKKKKKKE 


3356 


17257 


— 


3379 


2 


195 


DR*LFSTNHKDIGTLYLLFGA*AGVLST 
GLS LLIRAELGQPGNLLCSTDHI YWVI VT 
AHAFVIIF 


3357 


17258 




3380 


21 


411 


VFCTWH*DIGTLYLLFGA*AGVLRSflLS 
LLIRAELGQPGLLLGNDHIYWIVTGHA 
FVMNLFIVIPIIVGGSGN*LGPLEIGAP 
DKAYPRINNIGF*LLPPSLLLLLAYAIV 
EAGAGTG*TVYPPLAGNY 


3358 


17259 


A 


3381 


2 


421 


GRVGGRVGSTNHRDIGTLYLLFGA*TGV 
LGTALSLLIRAHIiGQPGMLLGNDHMYNV 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
F.=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histi(Jine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlcnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IVTAHALAKIFFILVPIIIGGFGNGLVP 
LIIGAPDMfl.FPRISWISL*LLPPSVLLL 
LESVIAEAEAGTGRTVyPPLAGTYSNPG 


3359 


17260 


A 


3382 


3 


185 


VI YST IFAGTL ITALS SH* FFT* VGLE I 
NMLAFIPVLTKKINPRSTEAAIKYFLTQ 
ATAS I ILLIAILFNNILSGQ * TI TNTTN 
QYSSLIIIMAIAIKLGIAPFHF*VPEVT 
QGTPLTSGLLLLT*QKLPSSISSRKQPH 
P 


3360 


17261 


A 


3383 


5 


398 


FFGHPEVYILILPGFGIISHIGAYYSGK 
KEPFGYLGMA*AMISIGFLGFIA*AHHI 
FTVGIDVHTRAYFTSATIIIAIPTGVKV 
FN*IATLHGSNMK* SAAM:j*ALGFI FLF 
TVGGLTGIVIANSSLDIVL 


3361 


17262 


A 


3384 




250 


LGNTGETLSLQKNV*KLftECGGTGLRSQ 
LLRRLK*EHCLTPGGRGCSEPRSCHSSP 
AWATGQDLSQQQQQKYPLRIFSAIIKS 


3362 


17263 


A 


3385 


456 


0 


I FFFSQVETGIHHIGQAGLQLLTSGDAP 
CVGLQKSWDYRPEPLHLAWLIHSLLSNF 
YYC*FGFCKLSVFILINVFCNKTCNSSM 
FLFS YS *NPCAQ * KNEGPLLNKBTP CR I T 
DAWADAWVAPVF 


3363 


17264 


A 


3386 


1 


225 


PSRWFLVLVDKLILKFI*KYRGPRTAKT 
TLKKKKKWRLTLLI FKSYYKTIVITIG 
WYGFQDRQVD*WNRIESLE 


3364 

"3365 ~ 


17265 


A 


3387 




400 


LLILGLLTNTLFIYPiHIGDATRESTYQG 
HHTPPVQRGLRYGIILFITSQVFFFAGP 
F*AFYHSSLSPTPQLGGHWPPTGITPIiN 
PLEAPI.LWTSVLLASGVSIT*AHPGLIE 
HNPNPIFRALLITNILGLYF 


17266 


A 


3388 


24 


407 


IASGRPFFFFFFFPKGKFFISRGGGPFS 
PPKI *KNWGKGKS FFPPPQREPKKRGVP 
KNPSLSFYPFGGKPPLGPPPKLVPPSS* 
PGKKAFFFF*TPPPGFLTPLGGGLSPLK 
GGYLLAPHFVPTEGPL 


3366 


17267 


A 


3389 


148 


413 


L^aNNNNNGKI!^HLLNTYSCT*LEEKKKK 


3367 


17268 


A 


3390 


157 


14 


RVSACCQVWW*MPVVPATREAEARESLE 
PGRHRLRSCHC33RDPVSKS 


3368 


17269 


A 


3391 


2 


398 


LFSTNHKDIGTLYLLFGARAGVRGTALS 
BLIRflELGQPGNLLGNDHIYMVIVTAHA 
FVINFFI VI PI I IGGFGN* LVPLIIGAP 
DMAFPR1NNISF*LLPPSLLLLVAYAIV 
EAGAGTG * TVYPPLAGNYSH 


3369 


17270 


A 


3392 


2 


395 


DQKDIGTLYLLFGA*AGVLGTAVSLL1G 
AELGHPGYLIGNDHVYNVIVTAHAFVl I 
FFIVIPIIIGGFGN*LVPLIIGAPDMAF 
PRIKNISW*LLPPSLLLLrjACAIVEAGA 
RTG*TVYPPIAGNYSHPGA 


3370 


17271 


A 


3393 


396 


54 


GYPPNLTFVTSSLTSVPSPSHPP SHVCS 
PTPAIPQTPRSPPSESTPPTEVSLTTPT 
LSLGAISSLASAGPQGASPASPGFWGTS 
PP*PYS*NRPQTAACGVFSQLFFrRPEV 
P 


3371 


17272 


A 


3394 


417 


60 


KGWVFQRILWPGPGGSPRYSRPFGGPRG 
L I PLCS GV* TPPGPHWETPFFENNQPI S 
GQGGGGPLIPLLGGVKPEKIFYPGNQSF 
H* PKIPPHPPPLGAKQNFPPKKKKPQKN 
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last amino 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptoplian, V=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TKGFWV 


3372 


17273 


A 


3395 


417 


3 


FFFPFFFQHHFGFEAAA*YWHFVDW*L 
FLYVSIY**GSYSFSINS 


3373 


17274 


A 


3396 


229 


1 


RSFWTKTMGFSCYKDFF* TJSTGNXDTTYQ 
NLWDTAKRVLRGKF lAUSTVP INKLERS Q 
ISNLISQVKEGQAWWLMPVI 


3374 


17275 


A 


3397 


344 


165 


ELKSPQPRKRNGCFFLAVDPDEGF*NLN 
FCGNGQGLNAPGGYPCEIPMGFVPNAYG 
KTF 


3375 


17276 


A 




T 


409 


HKDIGTLYLLFCARTGILGTALSLLIRA 
ELGQPGNLLGNDHIYNGIGTTQALVEIL 
FIVIPIITGGFGN*LIPLIIGAPNMAVP 
RINNISF*L*PPSLLLLVAYGIVEAGAR 
TG*TGYPPIiAGNYSHPGACVDLTM. 


3376 


17277 


A 


3399 


3 


396 


KDIGTLYLSFLS *flGVLGAALRLL IRAK 
LGQPRKLLRNDHIYNVIVTAHAFVIIFF 
IVI PI 1 IGSPGN*LLTLI IGAPDMALPR 
INNISF*LLPPSLLLLIjAYGIVEAGART 
G*TAYPPLSGNYSHPGftSV 


3377 


17278 


A 


3400 


49 


350 


KSNSHTEVQaGQEQKSFFRSKHKaGRAG 
SDPAPYVLflGCLCRSISSFLii'i'FFSLFF 
FFFFKRDGV*QGYSSGSNTPGVKQSSCL 
GLPKYWDYRREPLHPA 


3378 


17279 


A 


3401 




398 


KPPAITQYQTPLFV*SVLITAVLLVLSL 
PGLSAGITILLTDRSLNATFFDPAGGGD 
PILyQHLF*FFGHPEAYILILPGFGI IS 
HIETYYSGKKEPFGYIGMG*AMISIGLL 
GFIA*AHHIFTG*IDVDTRS 


3379 


17280 


A 


3402 


388 


42 


TPLFMPPGAKCVNPFFPKKLKKKMPSRA 
MGGPPLFPPLWGGKPKGFP*HRKSGAPV 
THFAPPAPPPGGKREPPPPKKKKKKKRK 
EEKESNEAVQIKKROIKKGRLPVLDPAE 
ESQ 


3380 


17281 


A 


3403 


413 


2 


S PHPLKNFFLPPKPPNSGGGLAPFVPPQ 
K*GSLPKIPRGRYKSPPFKEKTWflLPPR 
GKPGPPRASSKRPPLFFFFFRGRVPLCH 
PG*STVQCVQAQCTLELLGSSNFLTSSS 
PVTWGHRQGPPRPIjNTFFIFFRGRVP 


3381 


17282 


A 


3404 


433 


274 


LGWAHACNPSTLEG*GGRTAWSPGVQD 
QPGQHSETPISTNFFWYFRISYFHI 


3382 


17283 


A 


3405 


131 


427 


GPPPQPFFFWGKRGGPNPFFFFFSEKRD 
VF*QFFLRRRGPQKFFKPPFFCLQKTSL 
FSEKKKKKGFGPPLFPQKKKGCGGGPHY 
KKEGFTLNS FLFQGDGVSSVAQAEVQWR 
NISSLQALPPGFT 


3383 


17284 


A 


3406 


413 


137 


SPPPPPQGGIFFFFKPR*ETFSPPPNT,G 
FFSPPSPLKFFFFPKGFIFLGGGGPKFP 
PPKKRFFFQNSPGGFFFPPF*KKNFFFL 
PPVILGPPRVFF*GPPPPKK 


3384 


17285 


A 


3407 


319 


540 


IVFKDQCEESFTTLMVDIRNHQNLLDSL 
EQHGKGDLL*GAIIAYHCEKCNKKVRAVQ 
LCLILITFKGFRKNRFFW 


3385 


17286 


A 


3408 


2 


403 


SLHLGGGSCIVGAIYFITTI INIKPAGI 
TQYQTPLFV*SGLITGILLLLSLPVLAA 
GITILLTDRNLFTTFLNPGGGGDPILYQ 
HLL*LLSRPEVYILILPGFGIISHIGTY 
YSGKKEAFVYIGW*AIISIGC 


3386 


17287 


A 


3409 


2 


142 


KKENYRPVSIJ!IRDAKSLNQIQLHI*KII 
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residue of 
peptide 


Predict- 

otide 
location 
correspon 
ding to 
lust amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PlienyIalanine, 
G=Glycine, H=Histidine, I=IsoIeueine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 














HCNQVGFI PGMQGWFNIL 


3387 


17288 


A 


3410 


407 


48 


PPGGKIFFYKNPEKKIFPTQQKKGVFPP 
FPLKNFFFPPRGKIFGGGGGQNAPPPKK 
GFFPKIPQGVF*SPPKKKKKNFFPPGKI 
WAPQGVFLKGPPPFFFFFFFFFFFFFFF 
FFFLGVR 


3388 


17289 


A 


3411 


3 


402 


SYRLE*QLEDKaAPPQDFTQQIiRWPEKQ 
HRMVWPDKEDMHKQLVEASETLKSQAKE 
ri*DflHQQQKLALQEFM:FNELMfffiLYSH 
iraKVWDKEEEMEVAMHKADMMWQEI*RS 

KKLTKRMLFTQMRRITPVIPS 


3389 


17290 


A 


3412 


1 


224 


LLFQLLRRLRQENHLNLEGRSCSELKMH 
HW'_'yPP;GTE*DCL*LKKKQSAYLPNNTI 
KKLTGSRTIEDICTEYNK 


3390 


17291 


A 


3413 


202 


1 


I VMSWPGWTHA* I PSTLGGQGGWITRS 
GVRDPLASAFQSAGITSVSHCSQLIFVT 
VFYLSPLSHLY 


3391 


17292 


A 


3414 


407 


309 


PSFKJVSKDRLTLLLGftUaAGDLKFTPMF 
lYHSENPRALKNYTKSTLPVPVLCIRNI 
KAWMIAHLFTAtfLNEHFKPIVHTHCSGK 
KISFKILLLINKAPGHREAQMERHKMNV 
VFML2U!ilTTPILKLMDQGVF*TFILCLSI 


3392 


17293 


A 


3415 


404 


46 


riKKPPFQKFLKILKKKKKGGGGGPPP*T 
PPFGGGRGGVPPRAGVSKNPVPQGETLF 
FFKNPKFPGVGGPAP* FPLLRGVNKKNF 
FNLGGGGFKKRKFSPPPPPRGKKFFFPK 
KKKKKKE 


3393 


17294 


A 


3416 


23 


404 


SVLITGALLLLSMPVLGAGITILLTDRN 
LKTTFFDPAGGGDPILYQHLL* IFGHPE 
DNILILPGFGIGSHIUTYYSGKKEPF6Y 
IGMA*AMKPIGFLGDIAGAHHVFTG* ID 
VNTRSYFTSDTINMG 


3394 


17295 


A 


3417 


38 


469 


SGTTHaSALFDPAGG*DFILYC3HLF* IF 
GDPEVYILILPGFRIISHIVTDYSGKKE 
PFGYIGiyiA»flMRSIGFIjGFIG*AHHIFT 
V* IDVDTRAYFTSATIIIAIPTGVKVFT 
*LATLHGSNMK*SAAVL*ALRFIFLFTV 
GGLT 


3395 


17296 


A 


3418 


1 


419 


FSTNHKDIGTLYLLFGA*AQVLGTGLSI, 
LIRAELGQPSNLLGNDHIYNGIGTAHAF 
VIIFFIVIPIIIGGFGN*LDPLIIGAPD 
MAFPRI1SINISF*LLPPSLLLLLASAIGE 

AGAGTG*TVYPPLGGNYSHPGASVDLT 


3396 


17297 


A 


3419 


397 


107 


K1ISAPPFSPIFLP*GPKFAFFPPGKKVG 
KKI F* KFPAPGPQGVYLREGGGS PS PPQ 
TPNWPNKKAAFVLPPPPFFVFGNPKKKK 
KKAVAFSSPNRH 


3397 


17298 


A 


3420 


3 


253 


RGCREMRLCHCSSAWAIAGVSEEKKKKK 
KNWQ* CLLGICS YLDLWLPGLFKWLSQ I 
GRIVELQLLELKQYLRGITKQTLKILH 


3398 


17299 


A 


3421 


1 


393 


RTRTIKTPAITQYQTPLFVGCALITAAL 
LLLSLPVLRAGITILLTNRYLNTTFFDP 
VGGGDPILYQHLF*LFGYPGDYIVILPG 
FGIISHMVAYYSGKKEPFGYIRMD*AMI 
SIDFLRIIV*aHHIFTVEN 


3399 j 17300 

1 


A 


3422 


142 


377 


LS KKCYKPS YVGFVCQCKFFNQLS KFLE 
VKLLYCMVKLC* *EG*MAQPL* KQFVLV 
AIjKLNIVLPYNKAISLPGIYSTD*KTYT 



758 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
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Predict- 
ed end 
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correspon 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutam)c Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T-TIireonine, V=VaIine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DTQNLHTTVySSSLIIIKN*KQQRCPSV 
GE* INKCFI YTTEYYSLTKRKKIiSSHMK 

A 


3400 


17301 


A 


3423 


2 


435 


LILPGFGIISHIVTYYSGKKEPFGYIGM 
G*AKI3IGFLGFIV*AHHIFTVGIDVYT 
RAYFT3AT 1 1 lAI PTGVKVFR* LATLHG 
SNMK * SAAVL * ALRFI FLFTVGGLTG I V 
LANS * VD I VLHDT YYVGAHLHYVLS IRA 
VFAI 


3401 


17302 




3424 


2 


436 


LNTTFS'DPAHGSDP 1 LYQHLF* FFGHPE 
VYILILPGFGIISHIVTYYSGKKEPFGY 
IGMG*AMISIGFLGFIV*AHHIBTVGID 
VDTRAYFTSATIIIAIPTGVKVFS*LAT 
IiHGSNMK*SAAVL*AIiRFIFLFTVGGLT 
GIVLS 


3402 


17303 


A 


3425 


1 


193 


PTRPPrRPACSELRSRHCTPAWVTE*DS 
VSKKICKKKKFSPARGflHVCTPTFLGGQR 
GKKFCFAP 


3403 


17304 


A 


3426 


3 


446 


HKKFLFPKSFFNRVFPPTKKKNPRGPGF 
FFFFLKKtFFFPPGGI*SGGFGSLQLFP 
PRVKKFFSPPPPEKGGFKAPPPGPGNFF 
FFLKKRGFS FFGGGFLK*NPGFPPPPPF 

KKWGVPKGGAPPPPPFF 


3404 


17303 


A 


3427 


202 


1 


FSPGHMLGNTIKLPYTHSFILMHSQHNL 
EKYKVGGIPLSN1.'KTKYKATAIKTMWYW 
HMNRYMD*WNR 


3403 


17306 




342S 


i59 


1 


NPPHEKNFCSFFQMESHSADQAGVQWCD 
LSSLQPSLRGFKRFS*VSLVSSWDH 


3406 


17307 


A 


3429 


22 


298 


ESYASTAKATERDMSDEMLSASGMPREY 
SDYPnAYTT*NILSSVGSFISLAEGILI 
I FMI *EAFASKHKVLIVEEPS INLE*LY 
GCPRLRGR 


3407 


17308 


A 


3430 


267 


26 


GITFFFFFWLEI*LYYWLFLaNKLYFHE 
QIWFGLVFFFGLLLFDFSFYSDYFPLLL 
CLGSFSNF*CLIHCFFSVIIVLLF 


3408 


17309 


A 


3431 


274 


376 


NKG*ISCSISPPQHTVRRGLTLSPRLEC 
SGVISAHCRLCLPGSSDPPASAS 


3409 


17310 


A 


3432 


177 


47 


FYFFFFFFFFFFFFFFFSLLFFFFFFFF 
FFFFFFFFFFFFQIiSKSFIAS FKYHK*V 


3410 


17311 


A 


3433 


379 


83 


FKRHFTKEDLKMVTEHIKSCSSSLATRA 
MQIETTMAHCYTFMRLAKIKKYQYTKCC 
* R* RVTGTLTYCWWECKMTQPFWKT IVR 
RFLKQYIYHVIGYV 


3411 


17312 


A 


3434 


214 


3 


AASTFFFFTRSTY'fVCI FLKS KRMINK* 
NYIKLKSFCTTTTKINKMKRQPTDWEKM 
FANHINHHEGLILK 


3412 


17313 


A 


3435 


1 


353 


GGQGQPGQ*SETLSQ*KVSWAW*LQLHR 
RMRQDNLLSPGDQG*SEPC*CYYTPAWA 
RVRLCQNRSVNEKSLHDICNDKK*QNI * 
I I S VPEGKQHMKGLENLFNE I LDENYPS 


3413 


17314 


A 


3436 


29 


394 


VFMEVDFFFFPFFFFFFFGKRNFFFPPR 
GGGGGPF*IFGPPPPHGKGNFPPPPPGK 
GGKKGPPPPPGFFFFF*KKGGSPPGGGG 
FLTPPLWDPPPWPPQRGGFGV*TPWPGP 
GSDFLNFFFF 


3414 


17315 


A 


3437 


108 


2 


PVAVAHS CNPNTSGGRGGWIS *GHEFQT 
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E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl{nown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














SLANTVK 


3415 


1731G 


A 


3438 


215 


3 


LLLLREFFLPKHREVFIFVFYFMRQGLT 
LTEAGVQWRSHGSLWPQTPGLK*SSCLS 
LPSIWEYSAYHHIR 


3416 


17317 


A 


3439 


1 


186 


QTSSFSFRRSVITNFFKLKDDV*TH*KE 
AKWLEKRLDEWLIRIKSVEKTUdDLMEL 
KTSTRP 


3417 


17318 


A 


3440 


290 


3 


PGLGGGPRYSRFSGGLGPKI PUIPEGEG 
SIKIiNPPPAPPPGGPN*DSFLPPPPKKK 
KKKTHVLFAGSGNLLQPLQPGAFTTEVN 
GGLAQQSPKVL 


3418 


17319 


A 


3441 


163 


'2 


PAGIiPVKPPRFQDKLNFPFGPFFFFFFF 
* I*DKVIiLCHPGESATVHSSLGDRA 


3419 


17320 


A 


3442 


131 


1 


RPRRPLTIKSIGHQWY*TYEYTDYGGLI 
FWSYILPPT..FT..EPGS 


3420 


17321 


A 


3443 


162 


2 


PGRAHSTGHLPLRFHIJILGAVTHAXSNPR 
TLGGQGRWIT*GQEFKTSIiftKRVRP 


3421 


17322 


A 


3444 


2 


130 


WEKIFANHVSDKGLIYEIYNELQQLYK* 
KTNGPIKKRPKSTRP 


3422 


17323 


A 


3445 


2 


117 


IiAIiLGLKPGLK*ATCXfiLPHYWDYRHQP 
PHPACFSKGI 


3423 


17324 




3446 


350 


101 


KAFSSSLGSSRKKGDSDAGFCIHFKTVP 
FGFFWTLELKKGI PFFLNFFFFPFLRVL 
KGSKNPF*BtRGPFFFFFFFFDNFYIWN 


3424 


17325 


A 


3447 


106 




IiLQPNGQLGPGTVRHGCNPSTLGG*GER 
ITTSGVR 


3425 


17326 


A 


3448 


28 


333 


GQQEQNSISXKKKKKKKKKTPFKKGKRK 
IPGGKKKPLGKSEAPPPLGGGEKNPPPG 
KRAPEKI FKKKPRGNWETFKRMEKGFLK 
GQ* PGNP * KVKPLWGGKK 


3426 


17327 


A 


3449 


81 


339 


P I VSS CPGVAGALKQAMTLEFKVYQHHV 
VANCRALSEALTELGYKIGTGRDTDGVQ 
QALLWGLRLAS* PIVSSCPGVAGALKQ 
AMTLEFKVYQHHWANCRALSEALTELG 
YKIGTGRDTDGVQQALLWGLRLASQFV 
QPGCGPGSAAAAAMM 


3427 


17328 


A 


3450 


166 


322 


RDFSTTVLITDRTSRQKIPTDVSQLNLI 
DI YKTL* PTIGKYTFFSSTYRSLR 


3428 


17329 


A 


3451 


257 


354 


KFLSFFRDRV1LCCPGWSAW*S*FTAA 
LSSW 


3429 


17330 


A 


3452 


286 


3 


TDPFPIKRAKEREMLQITKIRSIRENFI 
TDQQKTKEL*EEQ L YAHRVDNLDEMDKF 
LEGYK*PSMTHGKIENLaiPITSKDIES 
VIKNLLTDPP 


3430 


17331 


A 


3453 


358 


353 


R**KLLMMPC*IETNKT*NTCTVIKEEK 
GQPGTVAHACNPSTLGG*GGRIT 


3431 


17332 


A 


3454 


108 


353 


THLGGNPVRLYAWLLLHYVASYEIFSVS 
HCKDQTYLPILL*NFLMELGTVAHTYNP 
SILGGQGRRIT*VQEFETSLM#IVKC 


3432 


17333 


A 


3455 


342 


50 


GPVTPNIPPVPPLGGQKELFPPKKKKF* 
KFWGSPPPLFFKKPKNPRETPPESCPKP 
KKTPIPVKV*GKKTP*TFSKKKKTPPIP 
ELPLLKKKKKLIN 


3433 


17334 


A 


3456 


343 


2 


KAPFFFFFKKFWSQKKRGLFFFFFKKFF 
IPGFPGPTFFFWGFFFFKKGPPVF*LKK 
FFGGPLPPLKDPPGVFPFSPF*FWDFPP 
PPPFFFFFFFFFETVSLCRPGWSAVSRD 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


3434 


17335 


A 


3457 


2 


259 


LTKPTESGTEDKKGRDSKESEGKTERTE 
SLRSQNGGENSV* S CPSTSSTAALNTAA 
AJMNPLALNPQVSIiSSLSCCP I S IKGS FV 
P 


3435 


17336 


A 


3458 




334 


AGGNKKlffiKEHM*NKEIF*WFiaKNWPGA 
GAHAC3JIPTTLGGRGRRIMRSG 


3436 


17337 


A 


3459 


207 




YFKPTLIdNDKGYLNETIDKFDPRDINR 
TSNLTL*EHAIiFPSTCGI FMRRDYIEGH 
EVNLNQFQRIRTQ 


3437 


17338 


A 


1460 


268 


3 


PLNPGPKQSS PPGLPKPWPFRGEPLPLG 
PWFFFVKNRGRP *AFFGTNPIiAFFLFFL 
*DRVLFCHPGWSVWQSQLTVQPPPLRL 
KQFS 


3438 


17339 


A 


3461 


350 


2 


AETAHNIlMNflFGPGSANKCTVRWWFK*L 
CKGVKRLEDKALGGWPSKVAINQLRAI I 
KGAPLKPTRKVRKEIiNIEPSAVIQHLKR 
IGKVKKLHKWVPHEIiHENKEHRFEVWSS 
LIIP 


3439 


17340 


A 


3462 


312 




HI VTYYSGKKEPFGYI(aW*AMIS IGFL 
GFIV*AHHIFTVGIDVr)TRAYFTSVPII 
IAIPPGVKVFS*LATIJlGSNMX*SAaVL 
♦ALGFIFLLPVGGIiTRIVIA 


3440 


17341 


A 


3463 


181 


3 


FLPLLftAFSGLLLFSGFTiGVFLFCFFET 
ELPRLECCGTISAHCSL*IiGSSDSCAS 
GTH 


3441 


17342 


A 


3464 


176 


263 


VGHDAHNPSILGGPGGWIT*GQEFKVnA 
A 


3442 


17343 


A 


3465 


239 


336 


EPGEVAYSCajPSTLGGRSeWlI *S!5ePE 
TSLT 


3443 


17344 


A 


3466 


88 


326 


QTSLLNTI **PLH*KNIPLPPPPLPPTIi 
TPPFSPPPPPPPPKKKKRKRK 


3444 


17345 


A 


3467 


4 


345 


EEERRRERKKKRRKKEEKKKEEREREEK 
ERKKKKKKKEKKKRRGRQEKRKGTG* IE 

RKQ 


3445 


17346 


A 


3468 


265 


335 


QMNPDRHLQF * DKVLLCHPGWGTVARS * 
LTATSRSSCLSPPSGWDCRRVPPCLASF 


3446 


17347 


A 


3469 


115 


342 


INKQINKIKDPTSYNILLSQSLI*FKAL 
TLLNSVKAQRGEKATEEKFEGSIS*FMR 
FKQIWHLYMIKVQGEKPSTD 


3447 


17348 


A 


3470 


56 


328 


RTYPTKPQKMRDPEDPTSRPRYI PRHLY 
I TDEVNDPSLT IKGIGHQWD * TYE YTDY 
GGLTFNSYILPPLFLEPSDLRLLDVDNR 
VEPATRA 


3448 


17349 


A 


3471 


3 


329 


RIGAEDE3SLAVTHKPRKIHRKLTLWVF 
VNDSLVMPRPDKNHQWAEHKNCFPLVDV 
GIDPYLVYHLRPHQAQGI I FLYE CVMGM 
RMNGRCGAI LAD *NGLGKTLQC I S L 


3449 


17350 


A 


3472 


154 


2 


PNLLYPKSYWKKKDIWTPMFIVALVTIV 
KIQKQPRCPLMDKSIR*L*YIH 


3450 


17351 


A 


3473 


132 


340 


ALKTEN ITTLCLDF*LI ETVR* * M CWFK 
KKKKKKKKKKKKKKKKKKKKD PP KKKKK 
IK 


3451 


17352 


A 


3474 


3 


345 


VIYSSIFAGTLITALSSH*FFT*VGLEI 
NMLAF I PVLTKKINPRSTEADI KYFLTQ 
ATAS I FLLIAILFMHILSGL* T I TWTTN 
QYSSLIIIMAIAIiCLGIAPFRRRGREFQ 
lY 


3452 


17353 


A 


3475 


1 


254 


FHHVGQAGFTPDLR*STRLGLPKCWDYR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
locition 
correspond 
ing to first 
amino acid 

sequence 


Predict- 

otide 

location 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














REALCLVHILLYLPMIGPFRIQKLLKAA 
VFVFLQLHQAFIQSLLTLFYFI SFPCNS 


3453 


17354 


A 


3476 


351 


186 


SVLSADNTEMGFHHVGQftGLELLSSSDP 
LAP7VS*SAGIRGVEHNTWPVLTITLN 


3454 


17355 


A 


3477 


330 


3 


GEKVFErPLTRDHVSEILRCPLMAYGVE 
PALCDRLRTQPFQAQPHQQKAAVLAFLV 
HELNGSTLII1JEMDKTI.ESMSRYRKYEW 
NVKKRLRRI.KTARA*RLGRSQVICK 


3435 


17356 


A 


3478 


373 


248 


NRLNPGGG6WSEPRSQHSTPTWVTEYDS 
VSKK*INKNKNWK 


3456 


17357 


A 


3479 


3 


92 


EGQGCSEP*SRYCSPAWVTE*DCVSKNF 
F 


3457 


17358 


A 


3480 


380 


2 


VEVPLL]SITSVFFASGVSIT*AHHSLIEN 
NRNQIIQALLITNLGGLYFTLLQASEYL 
ESPFTISASIYGSTFFVATGFHGLHVII 
GSTFLTMCFIRQIIFHFTSKHHFGLEAA 
A*CWHFVDWCIFL 


3458 


17359 


A 


3481 


225 




ERAERWEQVMIALFS PNYIRLSFP I S P 
TPWRHKFRFHLKGYIWPGTVAHTCNPS 
TLGSQGGQITRSGD*DHPG 


3459 


17360 


A 


3482 


332 




IKYFLTQATASIILLIAILFMNILSGQ* 
T ITNTTNQYSSLI I IMAIAIN* GIAPFH 
F*VPEVTQGTPLTS 


3460 


17361 


A 


3483 


277 


1 


LSMS S FS I SS IPSASIlSIS*ALLFSSaES 
NSLECTFSSSFSPNSSESFSSSFPSCSC 
AS * FSNSQSLIKASSSTEASPIIISSaSC 

FLPLPLVG 


3461 


17362 


A 


3484 


404 


209 


ALSVQPLWRPRQENGVNPGQK2iSR*PRP 
PPCGPAWATKGELVPKKKKKKISFLGHD 

QSLELLLQR 


3462 


17363 


A 


3485 


31 


347 


FCILERKAFLYW*CWKFFFFFFFFFLE 
KGFFFLPLLVFFWCYYHLKGASLP*VFL 
FFYLHCYSSLFTLFFIFLQSLTPHFFLV 
*TFFYYDPLMFFFLTLFLELV 


3463 


17364 


A 


3486 


327 


161 


QKKVRDPYEQL*ANKFKN*EEMPRFLEK 
HKLPKLTQEEIKNQ 


3464 


17365 


A 


3487 


311 


3 


PRRGRFQLTQNFALP FNLGNKTKIPFKK 
KKKKTEPLCPCTCVCAAFVCLCFHLCT* 
PCDCAHDCDNI CVCVNI CVYVCWTHLH 
TGLCSCVDCDHICVGTCC 


3465 


17366 


A 


3488 


245 


I 


RPRRLLALLLIIITF*LPQMrGYIEKST 
PYECGFDPISPARVPLSIKFFLVAITFL 
LFDLEIALLLPLP*ALQATNLPLIV 


3466 


17367 




3489 


2 


400 


AAP ILGMLGLRQRGKYKRQRQLLCLWSA 
ILLFLYFLNKLL*LTKKKKKKKKKKKKK 
KKKKKKKKKKKKGGPRGGNPQILKGKIG 
KPPGGFFKGGGGPKKIFFFKNRGALFFK 
KKP PQKKKLGPGP PGLKNFLKEKKKA 


3467 


17368 


A 


3490 


57 


366 


INVFAGKDQLQPLCACPSSPIISVESDR 
PDMYQCGLNHF lYL 1 1 YLRWS FALVAQA 
GV*WHHLGSLHPLPPEPKCFSC 


3468 


17369 


A 


3491 


25 


384 


IRGTF *FFWGGLGGLSKNNPL * KKNFS P 
PKLGVLPPPLFFPGGGFFFFFPGAPPGF 














KKKKPPPKKFFF*KKGAPPFFFFFFFFF 
FF 


3469 


17370 


A 


3492 


170 


41 


AQPDQHGETTFLLKIQKLTRHGGICL* S 
QLIGGLSFITQKITC 


3470 


17371 


A 


3493 


1 


240 


VLS PADKTNVKAAWGKVGAHAGE YGAEA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
ocl 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LERMFLSFPTTPS* FLDFFFFLEKIGFH 
HVGQS SLGFLTSRDPSAYNS LLGS 


3471 


17372 


A 


3494 


359 


204 


TQWLMPVILAPWEAEVGR* SETOSLRPA 
LNSPFFKRLWIMMYIMNIIILS 


3472 


17373 


A 


3495 


1 


178 


DRLSPGV*GSSEL*LCHCTPGWATEPDH 
VSRKKKKRKKVGLGKNKAKQKHPRGYYR 
AEL 


3473 


17374 


A 


3496 


361 


1 


LGPBCLLKKPFWISIGIKGVAGLPGFFPT 
PPQ*G*NFGLKRGGFFFYFFEAKGWFNT 
PGGKQRQPFRSLEPFPFRVKLFFPPNLP 
NNWQHKGAPPKPPFFFFFLRWSFALVAQ 
AGVQWCDL 


3474 


17375 


A 


3497 


276 


79 


AEVyKIKCELGAVaHRCNPSTLGGQGGR 
lTRS*VSASF'i'KKyWQLALMKELLNE 


3475 


17376 


A 


3498 


3 


476 


FTFDHLATPLLILTT*LLPLTSMASQRH 
LFREPLSRKKLYLS riiNSLQSSLI ITFT 
GTELIIFYIFFETTLIPTLAIITR*GNQ 
PERLNAQTYLLFYTLVQSLPLLIALIYT 
HNTLGSIJirLLLTVSAQE 


3476 


17377 


A 


3499 


581 


2 


AaACNtlSfTL6GRGRWIT*G0EFDTSLAN 
K 


3477 


17378 


A 


3500 


380 


135 


RHQSGQIGKTPSLVKIPKLTPPKGGVL* 
SHLTKRARQEKGLNPKSKGFN* PKLRPC 
PPTrVAKKQNPVSKKKKQTRQEPSA 


3478 


17379 


A 


3501 


1 


386 


GSR*GNHPKRTJJAGAYFI,FYTI,*GSLPL 
LIRLIYTDNTLCSLNILLLTLTGQELSN 
S * AHNL I * LAYTIAPI VKI PLYRLHL * L 
PKAHAQAPIAGS IVLAAVLLKLCGYGI I 
RLTLUJJPLKKHIDYP 


3479 


17380 


A 


3502 


2 


389 


EKQL * KDKQVYRATHRLIiVLGAGES GKS 
TIAKHMRILHVNGFNGEGGEEDPHAARS 
YIDGEKATKVQDILNNLKE6IETIGAVM 
SNLVPPVELAHPENQFIVDYILSAMNVP 
DFDFPSEFYEHAKALWE 


3480 


17381 


A 


3503 


2 


379 


PFGYIGMV*AMIS IGFLGFIG*AHHIFT 
VGIDVYTRAYFTCATI I lAI PTGVKVFS 
*IATLHGSWMK*SAAEL*ALGFI FLFTV 
SGLTGIGLTNSSLDIRLHDTYYWAHFH 
yVLSIGAVFAIIGG 


3481 


17382 


A 


3504 


281 


68 


FKMRFGWGHSQTVS ENYKPLLKE IRENE 
NK*KNIPCSWIGRISIVKMPTLPKVIYR 
FNAFPIKLPSPPPTL 


3482 


17383 


A 


3505 


139 


263 


ETGSHYVPQAGL* LLASSDPPATASQNI 
GITHVSYYVQPL*PIFWEAVAGGSLEflR 
S 


3483 


17384 


A 


3506 


2 


427 


LKTSMTMISTTLLT* LRPAWH * AQ YQT P 
FFV*SALITAGLLFLFLPVLAAGITVLL 
TDRNLNTTFFDPAGGGDPILYQHLF*FF 
GHPEGYILILPGFGIISHIGTYYSGKKE 
PFGYIGMV*AMISTGFLGVIG*AHHVFT 
GG 


3484 


17385 


A 


3507 


343 


3 


GWHSRSSGGASSPSS PSHRSRLRVSSGS 
LGRL*WSKSSKTSVSCGWARDTEKPPL 
S KI EASS SDSCES VMAKANAVFFTSTTG 
EKPAS SASLSLSVWASGSS PPGAANSSS 
S 


3485 


17386 


A 


3508 


3 


363 


GPGDKAAERIALENLDIT*KWGMRTHKC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinfe, I=Isoleucine, 
K=Lysine, l^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AKQLLSKQPF*SSRTGDLKLELQIELVR 
ETKRKC*RVL*IDRaLTSHLYSLLQTQD 
ALGDASADLIQKTS *LQEEFGYYAKTQK 
LLCRNGET 


3486 


17387 


A 


3509 


3 


434 


GPGDHVARGISGKKYDIGQKWGMGTYKC 
TKELLSQRFGLGSPTVDLEL*LQIELLR 
ETKRKYESVLHLGRMjTAHLYSLLQTQH 
ALGDAFADLRQKSPQLQEEFGCYAQTQK 
LLCKNGETLLGAANLFVSSIYTLGTKTM 
EDTL 


3487 


17388 


A 


3510 


1 


401 


GTRKCTSFLHDSQTSFCFSDSIPTPSNME 
ETQHKSNLELLRISLLLIEPWLEPVRFL 
KSMFANKrLVYDTSDSDDYHLLKDL*EGl 
PTLMGRLEDGKRRTGQILKQTYSKFDTN 
SHNHDALLKNYGLLYCFRKDMN 


3488 


17389 


A 


3511 


117 


2 


FKFIFSEMESHSVA*AGVQWRNLGSLQP 
LPLWFKQFSC 


3489 


17390 


A 


3512 


491 


90 


KI<VKIVIiAGFFFKPPPPKN*KKKGLK*R 
KGVGPRGYTWPHPCGYKPFGGPGVQNP 
PGQKGETPFFQKKNKKPGGGGGALIPPP 
KGGGRGVPLSPEKFGLKFPPGPPP*GKK 
KNRFQKKKKKKRQN* PQKS QC 


3490 


17391 


A 


3513 


2 


31 


GKGAPTTSLINVAGTKIIAKWEDNKLP 
GAICSLTCGGaHIGTPMftRDERVNLLSF 
TGSTQGGKQVGLMVHERFGRRLVELGGN 
NAI lAFEDAHLNLYVPSTLFAAVGTAGP 
KCTTARRQLIHESIHDEA*NHFPH 


3491 


17392 


A 


3514 


2 


390 


KEFHKTOPLENITLAPDPEGPDGLPPVA 
YNPWMDIRQREDGQTLNIREPYGPIPVD 
F*RKIRQSYFASGA*LDTQDGRLLSaLD 
DLQLANSTI lAFTSDHGWALGEHGEWA* 
YNNFDDDTHDPLI FYDT 


3492 


17393 


A 


3515 


3 


410 


RGTGNNWAQGHYTKGAELVDSVLDWRK 
ES*SCDCLQGFQLTHSLGGGTGFGMGTL 
riITKIREEYPNRIMNTFRVMPLPKV*DT 
EGEPYNATLLVHQLVGNTDETYSIDNEA 
LYDICFRTLKLTTPTYGDRHHLVS ' 


3493 


17394 


A 


3516 


1 


396 


GEDAANIYjaiGHYTIGMENIDLGLDRIR 
KIADPCTGLQGFLVFHSFGGGTGSGFTS 
LLMERISCDYGKKNKLEVSIYPAPQVCT 
AVFEPYimLSTHTTLEHSD*AFMVDNE 
AIYDICRINLHIERSTYTNL 


3494 


17395 


A 


3517 


2 


399 


EKIGYNPDTVAFVPISGWNGDNMLEPNA 
NMPWFKGWKVTRKDGNASGTTLLEALEC 
ILPPTRSTDKPLRLPLQDVYKNGGIGTV 
PAGRVETGVLKPGMEVTFAPVNGTTEV* 
SAERHHE PFREALPGDNVGF 


3495 


17395 


A 


3518 


1 


402 


GEDAAHNYARGHYTIGTEI IDLLLDRIR 
KUiDQRTGLQGFLVFHSFGGGTGSGITS 
LLMERLS IDYGTKSKLEFSI YPAPQVST 
ACSEPYNSILTTHTTLEHSDCAFMVHKE 
AIYDI*RINLDIERSTYTNLNR 


3496 


17397 


A 


3519 




408 


GSCLGQKLYNSTTKTAT*WSSNHTKKNP 
FSKFPKLQNVWTHPESHRDWTAPTRLYW 
ICGHHPYVKIPNQWAGSCI IGTIKPSFF 
LLP I KTGELLGFPVYASimKRS lAI GNW 
KDNEWP P ER 1 1 QY YE PATWAQDGL 


3497 


17398 


A 


3520 


3 


162 


ETVLLCCPGWL*TLGLRQSSCLSLPKCS 
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SEQED 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
36 


Predicted 

location 
ing to first 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DYRHATVPGQKYRFFNRYYYGSYS 


3498 


17399 


A 


3521 


184 


354 


ETGSSLLP*GSSELPVSHPYFFNTHSLS 
LSHTHTHTHTHTNSS ISNYHCPLYTNMS 
CQRLVAVYRQKGS FKDQCTDI 


3499 


17400 


A 


3522 


-2 


313 


YYLSPPMGDQPGQHGETLPLQICLAG*GG 
ARLRSQLLRRLRQENCLTPGIQACRELL 
GVIYKSNCAFCQQKPQLLCVNLTLYSSLD 

DFJ\EDSVSTKTKREGRGEK 


3500 


17401 


A 


3523 


13 


127 


LG LQ AETT 3 M E S S * G YRRAP P P C P ALL Y 
F*VETGFHHVGQAGLELLTSTRP 


3501 


17402 


A 


3524 


148 


1 


VCMFLEESCSVAQAGVQWYNHSSLQP*T 
PGLKRSF*LSLPSSRDHRLMT 


3502 


17403 




3525 


1 


98 


V1IPGGEGCSEPRSHHCTTAWVTE*DSIS 
KINK 


3503 


17404 


A 


3526 


138 


3 


AASTHDLG*LQPPPP6LTPFSCLRLLSS 
WDYRPVPLRPMTFLYF 


3504 


17405 


A 


3527 


10 


206 


PTTSLISFTSYSPCLKCSNVYEQWRMR 
VLFYKSKQRLGMVAHAYDPSTLGGRGSK 
IT*GQGRRG 


3505 


17406 


A 


3528 


345 


3 


vtbliqsnkaklkkKSkhkUbqslqki* 
dyakqpnlrisgvleeeekskslenife 
kiieenfpgltrdleiqiqkaqrpgkfi 
akrlsprhtvirlskvktkgrilraerq 

K 














3506 


17407 


A 


3529 


3 


90 


AVL*ALGFIFLFTVGGLTGIVIiaNSVDA 
A 


3507 


17408 


A 


3530 


3 


90 


AVL*ALGFIFLFTVGGLTGIVLANSVDA 
A 


3508 


17409 


A 


3531 


354 


3 


AVINSLPTKSPGPDGFTAKSYQRYKEBL 
VLFLLKPFQRIEKEGILPNSFYESSVIL 
IPKPHRGTPKESFRPISLMNLNAKILNK 
RIiAS *NHQH I KKVIHHGQVGF I PRTQGW 
FNKQ 


3509 


17410 


A 


3532 


2 


357 


FTRPRFSFTFFVLIRKYALSCGKISIQH 
S Y * KKKSKGKLGGL*GINFCFFEEKFHS 
VTQVGGQKHDPG*LKFPPPGFRGFSRLS 
LPSSWNHKRGTPRLANFCTPNKNGGRRG 

RKFQIY 


3510 


17411 


A 


3533 


76 


365 


KKKKKKKKKKKKKKKKKGGAGPE I PKLK 
MGKSEKPPGVFLRGVGEKIKFFFLKKGG 
PFF*KKKPLKWKWWAREPSFKKIFKKKK 
KKAHFFFGPKILKK 


3511 


17412 


A 


3534 


7 


353 


FFGFGGVFLVFLFFFF*KYPFFFPPPPL 
FFFP FFKWAPRRP P PPFFSS FFKNFFFF 
FPPPPPFFFFFPQKNFPLFFSPLFFFFS 
PKFFFFFFPLTKKKKKKKKKKKTKTPKI 
HPQTQRK 


3512 


17413 


A 


3535 


354 


41 


AGGVPPGIPPPWGGRAGGSPKGHGFGPP 
HPKRGTPPFPF* KKKKKKKKPGGPAIPA 
TREGEGGKSP*PWKPRVQPT*NAPPPSP 
PGEGAKPPFQKKKKSMFHKR 


3513 


17414 


A 


3536 


174 


364 


TFNSLAFTFFFFF*RETGSHFFLEAGVQ 
GNHSSLNP*PPGLRGSSCLTLPKC*DYR 

REPLPPA 


3514 


17415 


A 


3537 


38 


368 


PGRLRLQEAVMHCTLE PGQQ SV* KKKKK 

* TLKI RPKE IRNQGGSQQRKKAELGKLQ 
QTYPVLNFKaTFYGEQGDYY*TFFKP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 

ussn' 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predicf- 

otide 
location 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lyslne, L=Lcucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=lInknovvn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3515 


17416 


A 


3538 


107 


349 


SVTFPFFFFFLKQTGSHFW*AGGQGGN 
FD*LEPKPPWLRESSLLTLPSSLDYGPP 
PTGPSNFGIFFEKRGFCLVPQASFE 


3516 


17417 


A 


3539 


274 


3 


PRRWSLPLTQMGPLPSRMGPKTKLCPKK 
KKKKIPPRKILEPDVFTGQFYLMFKEIT 
SIFLKLFQQCEEEGTFSNSF*EAR1ALK 


3517 


17418 


A 






374 


ACfiliVRNDHlYYWGPTYALGELFFTET 
PIII*G*DN*LGPLVIGAHWALTLINY 
ISL*L* PPSVLLLLACAIPEARAGTGCT 
AYPPLAGDYSHRAAYVNLTILSLHLARV 
SSILWDTDSITT 


3518 


17419 


A 


3541 


427 


76 


RGSLSPGV*K*PGGQNQTFPKAPKLRPG 
APRCFPAGKGSIKPTPGSPPGGRRGTPC 
SKKKKNFRETPPSFFWGGPPFLKHRGGF 
MFWASKTKNGLNKTLGPNPGPPLGGLRS 
PFWVG 


3519 


17420 


A 


3542 


234 




FFFFFFFFFFFFFFFFFFSLFFFFFFFF 
FFFFFFFFFFFFFW** ILYFV 


3520 


17421 


A 


3543 


30 


419 


TFSFFPFLFFFGFLVLGGEKQGKWDPKP 
PGAEKPPTFSPPKI *KKGAPPPPRVNFK 
KF*KKGPLPFSPGGKI!ILKSFWGrWSIG]Sr 
PLWVTPPKNPQKKPPFFKGPKECNFFFKG 
PKPGGKKTQlFFFIiWEKK 


3521 


17422 


A 


3544 


159 


411 


VSRDISPVRLSPSSCFH*PFl.HrSSSVE 
GRFSSFAQFLLSCLNFCTFPLLFLLLRF 
* DRVSLCHPGWSAVA* SQLTATSTFWVK 


3522 


17423 


A 


3545 




351 


CLAHTSGVES *ASLPCGQISrSYIHyLKLL 
CMEELSFPPYLLIYSISYLY+HTIMVIY 
FILVL*SILIiNLIYLFVTQIFPALASGS 
SFNWLLOiFNTTLAVHVCVCVCVCVCVR 
VCAC 


3523 


17424 


A 


3546 


397 




KPPPPCSYGKKKKIFLLVFFRTLPTKPP 
GAGRKI FPRGPGFFLKNPKIiL* TTFAPP 
PFrQKKGPFLLQQA*IPFPVFFLFKKKP 

FLCSP*GGIGVSLFFFFFSETESCSVTQ 
AGVQSCDLGSLQPLPPGLKR 


3524 


17425 


A 


3547 


76 


354 


FLFNFMGKIFKNFRGGVFFFFFHGGYNI 
FLVFLKIFFFFFGGFFLPFFFFLKKFFF 
FLGFFFGKISQKIFFFFVN*IFLLXXGA 
PPLFFFFFL 


3525 


17426 


A 


3548 


197 


1 


INKMKQQRMD*EKIFAKHVSHKGLISQI 
CK * L I HLNS E KTNS PVRKWAEDLNRHFS 
KRDTKMVNM 


3526 


17427 


A 


3549 


175 


309 


KS TLKVKIL I S I S I FNFYFF* DRVLLCHP 
GWSAVA* SLLTAASTRP 


3527 


17428 


A 


3550 


3 


196 


GFHRVSQDGLDLLTS * SACVS FPKCWD Y 
SHEPPRPAHKTSLKKNKMRVRI IIMMTI 
TYPAPWC 


3528 


17429 


A 


3551 


3 


360 


RAVEIFCYDVCVESGCADIQLLLLCPTS 
IKVCFSSLSLDIVTVIPDIYICIYIYII 
FFFFFKQARS*CQCLNDKE*YCSLWPTP 
PGFKKLSCVSLPSIWDLRCAPSHPPNFC 

VFGKNGV 


3529 


17430 


A 


3552 


326 


1 


JCDNHRILKLMLAKSGKVEHYVFDFGKQI 
SKMSEKNIGVKPPIGKEKTKKPKKKPSR 
TEDPKICTYVL*PDKKKWRRFLNT*GIQ 
LG*KEIKLSLFADDMIVYLEISrPIV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 

Ofl 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

sequence 


Predict- 

location 
correspon 

fast amino 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V=Vafine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3530 


17431 


A 


3553 


186 


1 


RPRRLPIKLPMTFFTELEKTTFKFI*NQ 
KRVRIAKSILSQKNEVGGITPPDFKL*Y 

KDTVTN 


3531 


17432 


A 


3554 


3 


366 


KTSLGNMVKP'™fFV*TFYI* *KKEIKK 
NSIGVL*PGTEQSKHEFKKILPSKRNKI 
ASKRIKYL*IWLT*EGQr)MYTElIYKTI,L 
KEI IKDLNKYKDIPCS * IIRLDLVMMAV 
LTKLIYRFN 


3532 


17433 


A 


3555 


239 


1 


GRGMMLPVSLIJSriTRVTPAPCSGSIILV 
LIALPSLRILYITDEVNDPSLTIKSIQH 
QWy*TYEYTDYGGLIFlSrSYILPP 


3533 


17434 


A 


3556 


3 


123 


QAGLEV*TSSDLPTSVSQSAGITGGSHR 
ARPANIINSVP 


3534 


17435 


A 


3557 


237 


2 


KGFSPPTQKFGFLGVFPPQKKKTKKNPR 
PPKGLTPVS*KGMLTEPriEPFFFFFLRQ 
SRSVAQAGVQWGNLGSLQALPP 


3535 


17436 


A 


3558 


87 


379 


KKKKKKKFTLQKPVGFIiNPPNGEAKSQI 
INELLIiKIATKKKiaSfLKLRLKKKGKDLF 
RGNYNPLFKKIKDDPTNGKNFPCS*MGK 
ISFEKMTLLPKAI 


3536 


17437 


A 


3559 


390 


181 


KKKKKNPGGGGPPFFPTFRGG*GGGPPQ 
PGGGGAPKV*1SPPPPPPGGKKSPPQKK 
KKKKKKKGPFLPHL 


3537 


17438 


A 


3560 


2 


101 


VIVTAHAFVIIFFIVIPIIIGGFGN*LV 
PVDAA 


3538 


17439 


A 


3561 


2 


126 


INTLLALriLIIITF*LPQLNGYIEKSTP 
YECGFDPISRRRG 


3539 


17440 


A 


3562 


16 


349 


FAPQKITIAMEKATEAQTGGAKKACQGT 
NNIMKFLKKKCIPTTIFWISFFSFLFFF 
FETKPHFFNWPEGKGPNIiG*LKPLPLGL 

KQFCLTLPRKGNYRHVLPPPVNGFFFF 


3540 


17441 


A 


3563 


6 


130 


TLRRI LEDHFSLRS *GCGEPCLRHCMPA 
WLTE*DPVSKNIY 


3541 


17442 


A 


3564 


3 


152 


GFHCVRQDLDLLTS * SAHLSLPKCWDYR 
RKPPHPAGKGTYFYIDYSTIE 


3542 


17443 


A 


3565 


241 


3 


NLVSHVISMRKVINLVEQNVTSSNRSNH 
TWLS F IKLYFI 1 1 IS I VFGEQWFGYVA 
KFFSDDF*DFGVPITQTVYTAPN 


3543 


17444 


A 


3566 


71 


256 


SVFLLTVFLYPLATSFFFLNRVSLCCPG 
WL*TPGLKQFSCLSLPKRWNYRHEPLTP 
GSNFF 


3544 


17445 


A 


3567 


1 


392 


AGAGTSGTDYPPLSGNYSHPGACVDLTI 
FSLHLAGVASMLGAISVMTTVMWIEPPA 
ITQYQTPLFV* SGRNTAGLLLLCVPVLA 
AG I T I LLTDHNLNTTFFDPAGGGDP I LY 
QHLKGKRGI IICSHRDRI 


3545 


17445 


A 


3568 


334 


409 


LGTVAHACNPSTLGG* GGHI TRSGV 


3546 


17447 


A 


3569 


3 


467 


PKNPPGGFFSPL*EKKYILPPPP*IWPP 
PGFF*KAPPPFFFFFFFFFFFFFPTFQI 
YFITF 


3547 


17448 


A 


3570 


3 


420 


FFDPAGGGDPILYQHLF*FFGHPEEYML 
ILPGFGILSHIATYYSGKKEPFGYIGMV 
*AMI S IGFLGI I V*AHHI FTG * IDVDTR 
AYFTSATIIIAIPTGVEVFS*LflTL,HGS 
NMK*SAAVLSALGFIFLFTRRGLTSIV 


3548 


17449 


A 


3571 


1 


418 


NLIISIHSFIDLPTPSNIFA**NFGSLLGA 
CLILQITTGLFLaMHY*PDASTAFSSIG 
HITRDVHYG*IXRYLHaKGASIFFICLF 



767 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
jiucleolide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
Ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, Ij=Leucine, M=!VIetliionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














LHIGRGLYYGAFLY*ET*NIGIILLIiAT 
IATAFIGyVLPGGQISF*GATVITNLL 


3549 


17450 


A 


3572 


3 


414 


SSGLAM*FHFHSITLLILGLLTNTLTIY 
Q*WRDVTRES TYQGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSIjAPTPQ 
LGGHWPPTGITPLNPLEVPLUITSVIiLA 
SGVSIT*AHHSLIENNRNQIIQALL 


3550 


17451 


A 


3573 




423 


GTLYLLFGA*AGVI.GTALrLLIRaELGQ 
PGNLVGNDHI YNGIVTAHAFVI IFFIVI 
PIIIGGFGN*LVPLIIGAPEMAFPRINN 
ISL*LLPPS LLLLLASAIVEAGASTG*T 
VYPPIiAGNYSHPGAYVDLTIFCUILAGV 


3551 


17452 


A 


3574 


411 


117 


DGGPLLQLLRRARQENCFNPRGGGCS*P 
RLCPCPPAWGAKLNSLSGKKKKKKRHVE 
KYAQHDYI INVLCWCVHIKWLSLYMYS 
ITKSIDKICKKAS 


3552 


17453 


A 


3575 


2 


394 


ALANMWEQIRSIHQYAVHRIISLFSLLS 
KKHDRVLEQATQYLRGSLNTNDVPLPDY 
AQDLTVIEELIPMMLEIINS*LTNSIiHH 
I PNLVYALLY*RDLFEQFRTHPSPQDIM 
QNIDLVSVNEDIYYDSFLS 


3553 












GWREWTELIIFYIFFETTLIPTLAIITR 
*GNQPERLNRGTYFLFYTLVGSLP 


3554 


17455 


A 


3577 


232 


443 


PSVQTFPCCPLSEEGPWLLCVAMTPSPG 
VPVPTEEALYSMECAFHPLFS LTS GACR 
LDYRTPDNR*VQLP 


3555 


17456 


A 


3578 


48 


410 


gglllheaGcwcghadrradOpdssdph 
slhaaclpppgaahaa£3apspa*tpaaa 

CPAPVPAPCSEHRC*PPGLSQRPCIiPGL 

pdarppgaaahggws lptrlvlhdrshp 
eeaprrhed 


3556 


17457 


A 


3579 


422 


186 


VYSKFL* SQLLRRLRQENRLNPGSRGCR 
EPRS*HCIPAWVTQQDSL,SISSSTSHEI 
CGSHCSPGYLTVHI CDQPVFSL 


3557 


17458 


A 


3580 


442 


3 


GSAISSRCTQESTYQGPHTPPVKKGLRY 
GIILFITSEVFFLAGFL*AFSHSTLAPT 
SQLGGHWPPTGITPLNPLEVPLLNTSVL 
liASGVS IT * AHHS L I ENNRNQI I QALL I 
TILLGLYFTLLQASE*LESPFTISDGMY 
CGRSRG 


3558 


17459 


A 


3581 


428 




IRDIFMASSLGM*FHFTPITLLILGLLT 
T PLT I YQ * WRDVSRES T YQGHLT P P VQK 
GLRYGIILFITSEVFFFAGFF*AFYHSS 
LAPTPQLGGHWPPTGITPIiNPLEVPLLN 
TSRLrjASGVSlT*AHHSLIENNRCIAAA 
LE 


3559 


17460 


A 


3582 


160 


415 


RCLSHCLAFPMLSILFIPSQLLSLPFHV 
KWI FFFFFLNKKFNCVPQflGGQGKNLI * 
LHPPPRGLKKFSCLRITSS *NYGREPQP 

S 


3560 


17461 


A 


3583 


399 


2 


STHLGLPKCWDYRCEPPRPALLAIVKLL 
CKTIHVRINNVCCLPLLTGAASVIGLSA 
SLLFSVSPGTSLQLEAR*TFPWPHMSST 
CVWRF I FTSLGRAQPQHSDLQTCPRRDV 
DLAEPGLLFNGQMLVPPGRS 


3561 


17462 


A 


3584 


215 


1 


KSPGLLNAFSRGSQIYQASRAMGA*LNQ 
RFWPGTVAHACNPSTLGG*GARIMRSGD 
GDRNSKDGVSPCGRV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, JP=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=iVletliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 


3562 


17463 


A 


3585 


138 


19 


IiNQRFWPC3TVAHACNPRPLGG*GARIMR 
SGDVYRNSKDEV 


3563 


17464 


A 


3586 


262 


3 


GRPEVFYKPPPRGEFFFFFFFLFFFFFF 
FFFFFFVLFFFFFFFFFFFFFFFFFF*Q 
VRLYIiEKYELFFFFLFYPQEISLCCPG 
WS 


3564 


17465 


A 


3587 


381 




KKFTPPPPRKMGPPPGFF*RAPPFFFFL 
GGGGGPLFFLGGPPPPGGGFFFSPPPPV 
FF*KSPPLFSFF*KKRGPPFFKRGGSPK 
TPPCFFFKKKNPPKKKKKKKKKKKKKKK 

GRPSSKSDAWVRTRG 


3565 


17466 




35S8 


198 


1 


PKSPPR?LARGKE?Ni':H'SKKKKKKSI*N 
LNTNSKLDMVAHICNPSTSGGP.GG*PHL 
RSGVRDQPGQ 


3566 


17467 


A 


3589 


373 


1 


GIFFPPPKRGGFPPPPPKKFFFFPGGFI 
FGGGGGPFWPPPRKGFFLKKKKGVF* SP 
P*KGKKFFFFPGGQVaPPRQFLKGQPPI 
IFPFPFFFFFPFFFFFFFPATIFLITYK 
DLTCHPGTHFTH 


3567 


17468 


A 


3590 


506 


98 


HTSRGPPPPPKKKI PPTRPFFFCCFWW 
FFPPKKKFFFFCSPRSFIHSPPKKKKKN 
FFFPKGGGPPLSPFLTPPPPLFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF*LFVSLFSI 
FVLLIFFLFYNAITTI 


3568 


17469 


A 


3591 


407 


1 


LVRGQITNKKNLFNFLGEELREWRWFF 
LPQQRLFFFKL*LQ*KEIYLFI *KMTCS 
RGGAGEKSSLYFFFRSPPQWLFFFFFF 
I FFYFF* FFI FFFFFFFFFFFFCUDFMI 
YLYISPPLPILEKKRGGRSRSRS 


3569 


17470 


A 


3592 


380 


68 


FRRGVGPQWPPPQKKGFSQKPPGGFKRP 
PLKGKKITFPPPGKFGPPRGPFKRPPPF 
FFLEGKTLFLGPDP**NLGWGG*RGHBK 
♦GPANPRVPPVFFFFEEYSY 


3570 


17471 


A 


3593 


394 


45 


ALMFSTGQEGRQEKHPPPHQKKKASRGE 
EPRGGVRKNYPPPKKKSFPKKNPPGVTP 
PKKKKKAKLPPRPL*GGPQRRPQKTAPP 
PIUFYPPKEPTKVLSIFFSFFFFFSftND 
VELY 


3571 


17472 


A 


3594 


402 


62 


AMIVLIHSSLGNKaRPCLFKKKWAKDLN 
KHFSKEDSQMflNKYMKRCLI SLVIRNMQ 
MKI TVRCHF ILTRMAKI KKMVNS KCRKE 
VQKIPTLKHCMWECKIV*ILWTNLAARV 


3572 


17473 


A 


3595 


220 


425 


FISGFLNLQEEREIAFFFKVITIRLGAW 
HTCNPSTLGGRGRRIT*SPGV*DQPGQH 
GEPCLYLNLQKN 


3573 


17474 


A 


3596 


13 


412 


AEIAPLRSIHRAEUCLKKKGRKGRKNPG 
GDGGNPQIGPAFFGGGNGKTPGQKPHQP 
QTWGQGRHLKGGGGGPKRKGVPPKKKTP 
TIWGGTPIiAFGVEPNWWGHTGNPGGADP 
KPPGREK*SGGAQITGPHTGP 


3574 


17475 


A 


3597 


2 


293 


CQLSP*ELGTPRERERERERERERERER 
ERERLPPRDLREKRAQDVFATHPNYEFV 
CVIRHYRGCWYTPLTTAHSIDAHKMCV 
CVKKRPLYKKKKK 


3575 


17476 


A 


3598 


243 ■ 


9 


ILLRQEKQCWSKGTRFQLQDEQVLNT*D 
RTWEVMWPGAVTHACNPSTLGGRGGR I M 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

location 

ing to first 
amino acid 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

residue of 
peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Unltnown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RSRDRDSCRIHQGGSKPKSPR 


3576 


17477 


A 


3599 




329 


PSARRDWKNIPKLPPPRGKNPTGFFFLK 
PGGPQRGPGKKGGKNF *S FGKPF* KGGG 
PPQFCPPPPRGGFFFFFPGGPPGEKKGG 
VKGPPPPFFFFWPPPPPNLFFPPGA 


3577 


17478 


A 


3600 


206 


1 


LRALGIMERAAHTEVRLVFIQI LGSPL * 
YIF*RQDLSPSPRLECSGTIAARCSLDL 
LSASNSPASASQ 


3578 


17479 


A 


3601 


322 


99 


KRTGAFGFFFPPLFNKKKKVFLGPEIGPK 
IVKKKKKKEKKEKQNKNRKQTKKK*KTN 
NKQNKKHETDIFGVEDET 


3579 


17480 


A 


3602 


410 


25 


LFFFKKKTGGGGGGPPPFFHPFGGVNGG 
VIRVGELSPPCPPGGTRFFFKKTKNYPG 
G*GGPFFPPPSGGGLGPKKrr.SPRGGAS 
L*PNFGPSLPPKGKGGPPFSKKK3CKKKK 
QSRPRDS*SLRRGIRK 


3580 


17481 


A 


3603 


27 


349 


RSSIQCGGILLFYFLYFMVFNDIFWL* 
GFSAITSILI SMLFSCI * YYFFKMYCLL 
FNFRCHWI FYILF* MKTIAQIGN* IFFR 
DAYLGYFFLFLFFFFFFGGKLYL 


3581 


17482 


A 


3604 


88 


435 


RAI STCIjQNEQYKWLTTITSVEKSIJSrDL 
TELKTMV*ELHDKCTSFSSRFIQLEERI 
SVTEDQMNEMNNPPLIHGFIFYGSSYSW 
STMVRKQDKLRVKDAGIKHRHQNKGFTA 
EKSA 


3582 


17483 


A 


3605 


331 


428 


YGHYGQIEATFLPAITTEKETLKTRKLT 
* SS IISrLLK*T*NVRPGTVGH2\CWLSTLG 
GRGGQIMRSGIRDQPNST 


3583 


17484 


A 


3606 


3 


294 


GG*GCSEL*SCHCTPSSLG*lSfTVRKKER 
KKGKREREKERKREIIPGFQPCAHHKGA 
VTLGNLPPRVQVSQLVQRRKII ITVIPD 
RLGEIVLVSQSL 


3584 


17485 


A 


3607 


263 


426 


RKLAGTFPFLWKSGPFQTILAAAQRMGG 
NDPKETRVFHAGLSSSLGENK*VWKI 


3585 


17486 


A 


3608 


213 


439 


LSMACaAEWTVTQPANFPliU YILP FFLL 
IRESSKF*RHMATQLETAaSRPGWVHA 
CNPITLGGRGGRITRSGVH 


3586 


17487 


A 


3609 


2 


441 


MTDLESFPLESELSANSPVQCSGLCKPV 
LTIFQVISHKGHCHSKYYbLNANYKVFL 
HLSNSVLPRLLSGQRTETQKHFYTAVLS 
VOLS lYLSRDRVLLCYPGWSAVM* FYSL 
*PQIHGLKQSSCLSLPKC*DYQHGISPP 
WCN 


3587 


17488 


A 


3610 


102 


354 


EPGAWADRCWLGRVLLTHLLWPCCLYFF 
YFIIFILNSGDRVSLCCPRWSCLK*SSC 
LSLPECWDYRCQPLHPACaVFIENSKR 


3588 


17489 


A 


3611 


462 


371 


GLNFLTL*SARLGLPKCWDYRHEPPRPA 

WS 


3589 


17490 


A 


3612 


463 


319 


RNSHDGLELLTL* SAHLGLPKCWDYRCE 

PPRPVLC 


3590 


17491 


A 


3613 


49 


476 


PHWSITLLILGLLTNTLTLYQGWFDVTR 
ESTYQGHHTPPVQKGLRYGIILFITSEV 
FFFAGLF*SFYHSRLAPTPQLGGHWPPT 
GITPLNPLEVPLLNTSVLLASGVSIT*A 
HHSLIEEMRNQLMQALLITILLGLYFTL 
LQ 


3591 


17492 


A 


3614 


223 


467 


TPS * PLAII *FISTLAETNRTPFDIiAER 
ESELVSGFNIEYAAGPFALFFIAEYITTI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 


Predict- 
ed end 
nuele- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=]VIetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














MIINTLTTTIFLGTTYDALSPELYTTYF 
VTKTLLLTCLFL*IRTAYPRFRYDQLIH 
LL*ENFLPLTLALLr*YVSIPITISSIP 
P 


3592 


17493 


A 


3615 


2 


405 


AEGLQEAGAVQNDPVSRLFDHAMLQGHR 
AHQMAIVTYQEFGETYIPKDQKY* FLHE 
SQTYFCFLNS IPTPSSMEETQHRSSIiEL 
LRISLLLIESWLEPVRFEiiSMNANNLVY 
DTSDEDDYHLLKDLEEGIQTLM 


3593 


17494 


A 


3616 


328 


63 


PSKF*KPCLISPPPLGPGPPN*TPGFLK 
KPKFPPRGGRHP* S PliFRVRPGDPLNP 
RGQGFP*PKLPPGPPTRATKPNFVSQKK 
KKKE 


3594 


17495 


A 


3617 




379 


GTSVGIPTGGQEFS * FAALHGSNVK* SA 

AftT,*ALRIMI LFTVRGLTG I GLANSS LD 

I VLHDTD YVEAHFHYVLS IGAVFAI IGG 
FIH*FPLFLGYTLDQTYAKSHFTIIFIG 
VSVTLFPQHFLGLC 


3595 


17496 


A 


3618 


494 


17 


GSANPRNPRRSGSKTRETSPRHVMSLMS 
PMSPTLPMRPTSPGTPMHLTNSMGPMQP 
PNSSRPVNPASSATPVSPASPARPRKPS 
CFPAPQ* * SPPL* SPEPKHGLSCLSWVK 
KECETWHWSMHIVEDRHSKPYHGPGMV 
AHACNPSTLGG 


3596 


17497 


A 


3619 


350 


480 


LaSWDyVCKPPHLAVLLL*RQSFVLLPR 
LVTNS*AQVIHLPWP 


3597 


17498 


A 


3620 


2 


400 


AAaBPHGCYAKVLDEFKPLGEEP*NLIK 
QNCELFEQLGEYKFQNALLGRYTKKVPQ 
VSTPTLVEVS *NLGKVGSKCCKHPEAKR 
MP*AEDYLSWLNQLCVMHEKTPVSDRV 
TKLLTESLVNRRPCPSALEVK 


3598 


17499 


A 


3621 


3 


476 


LGYSVHPPQPLflQVLSPPLPGASRAGWL 
LRMRGPPSPHPPRTPAGLQAPHAALVPA 
HASPSTPPGKLREWAPALASPERGSHSA 
VGG*EAPQMPPKWEPRQGRCQEQARALR 

TASMLSPLSFTHPVTLSLLNFPTSLPPR 
NSPSWPWSVR 


3599 


17500 


A 


3622 


40 


390 


MKGALTQPGRAQQLGSQSQMDMASTPKP 
SPDDQGCSIHLWGFPFLVSLLPPSLSPS 
PPSPPRHSVGTHCAP*VLPRPLGALRVP 
HSSVPRPHAVPWGTTYRYSRGAWPLSL 
DFR6L 


3600 


17501 


A 


3623 


52 


475 


AVEFHLLFLQPLETTNLLS VCMDLP ILA 
NSPK*NQIICDLFVFFHLA*CF*DLFML 
EHVSVLHFMAE*FFFFFF*KGVSLFSPG 
WRAGAQFWLTKTPPHSPARLKQFSCLTL 
PGSWNYRRAPPSPANFFVFSGKRGFTIL 
P 


3601 


17502 


A 


3624 


333 


466 


S IIGKIRLWKMRPWPGAVAQACDPGTLH 
G*GKRITRSGVRDRPG 


3602 


17503 


A 


3625 


2 


13S 


LRRGNRLNPGGGGCGEP*SRHCTPASST 
E * DS VSKNQKTKNSQWH 


3603 


17504 


A 


3626 


3 


337 


LSLRPPALPPCPPRRLPTPGQGHPGSAG 
FPGRPTL*RLPPLPDPPPSLGLSPGDDP 
GLAREELKSPCPWGPHPLCHLPFP*CWA 
SSQPS INAYEWRKKKKKKKKKKKKKKK 


3604 


17505 


A 


3627 


2 


393 


LGTDLSLLIRFELGQPGNLLGNDHIYNV 
IVTAHAFVIIFFIVIPIIIGGFGN*LFP 
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residue of 
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last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalaiiine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leiicine, M=Methionine, 
N=Asparagine, P=PrDline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJnknown, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LIIGAPDMALPRINNISL*LLPPSLLLL 
lASAI VEAGAGTG* TGYPPLAGNYSHPG 
ALVDLT I FYLHIiARVCS I 


3605 


17506 


A 


3628 


2 


379 


QnATYPIIEELITFHEHALIIICLICFL 
ALDALFLTLTTKLTNTNISDAQEIETV* 
TMLPAIILILIALPCLRILYITDEGNDP 
SLTI K* I GHQWY* T YE YTDY6GLI FNSY 
ILPPLFLKPGDLRL 


3606 


17507 


A 


3629 


1 


400 


PTRPPTRPPTRPMTNTIiNSIHPPLPR*K 
KKKKKKKKKKKKKKKKKKKKKTAAGGGA 
* KKKNHGAKTHRGS KKLNFFFKKRNKKK 
PPVKl EKKTFFLGGKKKNKPPQKNNALK 
KKKNYFEGEGESLLLL 


3607 


17508 


A 


3630 ■ 


416 


1 


SQLGYSGVRDPLEEATCLFSDLKLHAGR 

TTPLFQAVRQGHLILQRFLQPFA*LCPA 

PRGGVYRGRQASLSCGGLHPVRASWPLC 
LPTQFSAMEGAPPLALLPRSSS ISDYCA 
SNEQGSVGIGPSEPCTGYNIjIjVCHUj 


3608 


17509 


A 


3631 


275 


S3 


RWGSCYWI1AGL.QLLGSSDSPALASQSA 

GITGVSHCTQPKHVI*KISHLTVAGPAA 
DKTPQTPR 


3609 


17510 


A 


3632 


186 


3 


FFFFFFLFFFFFFFFFFFFFFFFLFFFF 
FFFPFFPFFFLVCQVPnLCML*HLVYLF 
YMVL 


3610 


17511 




3633 


3 


235 


GIISFLLIR* *yARADflNTAAIRAILra 
RIGDIGFILALA*FILHSNS*DPQQIAI 
LPHPSDSKKYLYLSHIQLKKA 


3611 


17512 


A 


3634 


1 


428 


RTLKTTLFISrPDGG*DPILYQHLF*FFGH 
PEVYI IILPRFR* I SHIVTYYSGKNEPF 
GYISMV*AMISLGCLGVIA*AHHIFTV* 
IDVDTHAYLTFATIMIAIPTAVHALS*L 
APUIRSNIK*SAAKL*ALRLIFLFTVGG 
QT 


3612 


17513 


A 


3635 


2 


129 


PVIYSTIFAGTLITALSSH*FFT*VGIiE 
INMLAFIPVLGFWD 


3613 


17514 


A 


3636 


2 


422 


DR*LFSTNHKDIGTLYLLYGA*AVVLGT 
AIiNLLIRAELGQPGNLLGNDHIYNVIVT 
AirAFGIILFIVIPIIIGGFGN*LDPLII 
GAPDMAFPRIKNl SC*UiPPSLLLLLAS 
AIMEAGARTG*TAyPPLSGNYSHPGASV 


3614 


17515 


A 


3637 


14 


479 


QNCKIJMJM FPMYFLPHQSEFPRKSCKQQN 
T ICFRLSNQNTMKKTTENHRLLWIRKWQ 
T*KQQCKSNSGYTPSTPSRAKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKPGGGV 
LKKL 


3615 


17516 


A 


3638 


2 


449 


I II ICFVGLCALVLTLTTKLTNANILDA 
QEVETG*AILPAIMLVIIALPCLRILYI 
TDEVNDPSLTIK* IGHQWY+AYEYTDYG 
GLMFNAYILPPLFLKPGDLHLLDGDNRV 
VLPNEAPIRIVMTSQDViyiHS*AVRTLSL 
RTDAIPGRL 


3616 


17517 


A 


3639 


347 


95 


HFFLGGGKGGGFSPFPPQKFFFFLRGFF 
FGGGGGPNFPPPKKGFFSKKPQGVF*TP 
PKKKKKFFFFPRGFWGPPGIFFKGPPP 


3617 


17518 


A 


3640 


2 


426 


DR*LFSTNHKDIVTLYLLFGA*AGGLCT 
ALSLL IRAELGQPGNLLGNDHI YNVI VT 
AHAFVIIFFIGIPIIIGGCGN*LVPLII 
GAPDMAE-PRIKNISF*LLPPSLLLLLAF 
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NO: in 
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26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

Son 
correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, P=Plienylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AIVEAGAGTG*TVYPPLAGNYSHPGACA 


3618 


17519 


A 


3641 


94 


294 


GRGGGGVAGGRGAMWSKGRQHGRNGPVR 
GREGGRKSHFRKEG * KEGHKE P P P P P KK 
KSRGEKWQREG 


3619 


17520 


A 


3642 


1 


405 


RTRGFDLERFVLFDVAQQMVES GYVCEG 
DHKTMflNAIIDRVSLIKRKREQRQLVRE 
EQEKKKQEESSLKQQVEQSSASQTGIKQ 
LPFADTGIPTASTTSASVSTQVEPEEPE 
AD *HEQLQYQQPSLSVLSDGTVD 


3620 


17521 


A 


3643 


34 


405 






GPLKKMPGGAKHNRGGEKKI FS P *GGVK 
KTPLGIFEKKPYFGGGKKWANPPKKIKT 
LKEKKKF*RGKGGKKPENPff6EKDESSS 
PK 


3621 


17522 


A 


3644 


430 


81 


EFAQLSPPQIRGFPSPPPNFLPPSLFGF 
FSPFSP*KFFFSLKGFIFVGGFFPFFSP 
PKKSFFSKNPQLVFISPPFKKKIFLFPP 
PLNFGPPRVFFKRPPPFFFFFFF 


3622 


17323 


A 


3645 


399 


2 


AFFFTKKiOCBFFPPFPFKKFFFFSRVFF 
FLGGFFHFFPPQKKIFFLKIPRGFFLTP 
PKKKKIFFFFPR*FLaPP6FFL*APPPF 
FFFFFFFFFFFFFFFFFFSPCSWSCHQV 
FPHASNRIHNPSNSYPIiQQY 


3623 


17524 


A 


3646 


49 


326 


KKKKKKKKKKRGQPFKKTPGQAQKNPGG 
EKKFFFL*GGHKKPPR6IFKKKPFFGGG 
KNWTTPPQKNKAFGGKKKFLGGKGEKPP 
PKPRVKiaSr 


3624 


17525 


A 


3647 


327 


94 


QKKKKKFFPPPPPKNFFFSPRGFFFGGG 
GGPFSPPQKKGFFSKNPGGVFLPPPKKK 
KIFFFPPGGFGAPPGFF*RGPP 


3625 


17526 


A 


3648 


2 


301 


TSSVASTFIMSLFPTTIFMCLDQEQIIW 
N*H*ATTQTTQLYLSFKLDYFSIIFIPV 
ALFVTWPIIEVSL*YINSDPNINQFFKY 
LLIFLITILILVTGC 


3626 


17527 


A 


3649 




406 


STNHKDIGTLYLLFGA*AGILGTALNLL 
IRAELGQPGNLVGNDHIYNVXVTAHaFG 
IIFFIVIPIIIGGFGN*LVPLIIGAPDM 
EPARI1SNISF*LLPPSLLLLLGCAIVEA 
GAGTG*TVYPPLAGNYSHPGACV 


3627 


17528 


A 


3650 


3 


418 


HftYADAWVHPHS ITLLILGLLTNTLT I Y 
Q*WRDVSRESTYQGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSLAPTPQ 
LGGHWP PTG I TP LNTLEVP LLNTS VLLA 
SGVSIT*AHHSLIENNRNQI IQALLI 


3628 


17529 


A 


3651 


526 


2 


- WGGGEKGRAGGAGEETPHQESQRQGCAL 
PQEPPLYPPCPAIHPPPLPLGGLLLFPS 
LPPLP*AAWFPHPGQPGHNLNGESPTTR 
EKRLWGPPLPTPLPWPPTQPTPLSRRTG 
CPPCPREVPTPSPPWAEPPPLPGPPSEI 
AHLTTVRRGAALANVNHEN I S DTDENKL 
NAFVTA 


3629 


17530 


A 


3652 


464 


297 


RMS FHRVSQDDLDLLTL * S PCLGLPKCW 
DYRCEPLLPASYIDVS 


3630 


17531 


A 


3653 


162 


467 


NSKKQQFVLLSEITSYVYWLHKRDLDSH 
TIIMGDFNTPLSTLDRSARQKVNKDIQE 
LN*ALHQADLIDIYRTLHPKSTEYTFFS 
APHCTYNFLFQSYFSSQF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 
ed end 

location 
eorrespon 
ding to 
t»st amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=^GIutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Onknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


3631 


17532 


A 


3654 


372 


1 


LIWLSIFSCVHLAFVCFLWRREMSGVSL 
CCPDYSGAAAVHRCNHTALQPQTPGLK* 
SSCLCKRHLQLRPQVYTTMLGYLFLFLF 
LRQSCSVAQAGVQWHDHSSLQP*TPGLK 
R3NSPTLV*DLD 


3632 


17533 




^655 




^41 


VGQDGLLDLLTS *SACLGLPKCWDYRRE 
PPRIAY 


3633 


17534 


A 


3656 


466 


315 


VGHTGLELPTSSELPA*ASESAGITGVS 
HCTQQSGCAFCYSTMIPTKPYTC 


3634 


17535 


A 


3657 


40 


276 


EPD'I'AHAtVIIfi'FlVIPIIIC3<3fGN*LV 

PLIIGAPDMAFPRIHNISF*LLPPSLLL 

LLASAIVEKKKNEKKKKTGAPLV 


3635 


17536 


A 


3658 






FTTLPVTSASLPLKTLAWPGMVAHACNP 
STT.GG'GGRIT 


3636 


17537 




3659 


40 


182 


GIRKENHLNLGGRGRIEPGSHHCTPAWA 
TK*DSVKIKKTRIQKTPLI 


3637 


1753S 


A 


3660 


160 


461 


INGKDTDTPGMVAHAYNTSTLGGGGGGl 
A*GQEFQPSrjRN 


3638 


17539 


A 


3661 


145 


437 


YFSLSLCLLELMNSVILFFFLVLERGFY 
PPPLVEMQGPNFG*IJIPPPPGLKQFFCL 
TLPKSWNNRPPPPRGGNFLDF*KKRGFN 
YVGRVDLKLRTSG 


3639 


17540 


A 


3662 


255 


451 


LLSFNPFFFFFFFGKGGPPSLPPGGKGG 
GUSTFI *TLGPQGKRNSPPQPPGEPG*RG 
PTPHPGLFF 


3640 


17541 


A 


3663 


185 


56 


PLEQPIiRK*LYLFRDGVIiLCHPDWSflMM 
QSRLMATSAEF 


3641 


17542 


A 


3664 


42 


193 


FQLLTRLRQENHtNAGeTSCSEPRSHHC 
TPA*VTETQSQKQKVKVLGFSL 


3642 


17543 


A 


3665 


216 


2 


KSKGKCSFLHSRVNRGNSKELLFLRDGV 
WRYCPAWS*TPGLKQSPHFNIPKCWDYR 
REEFHHPGLWYEFE 


3643 


17544 


A 


3666 


43 


267 


GRLWSAMTPGKLKTLCKIDWPALSVGWP 
LEGSLDRSLVSKVWHKVTYKPRNPDQFP 
YRDT * LELVLD P P P P THS G 


3644 


17545 


A 


3667 


212 


2 


SQLLGRAEARGWLEPRSLRKINKTSQVW 
WYTPVLPSSCDYRHTPPRPANFCIFSRD 
GVSPC*PGWSRTPD 


3645 


17546 


A 


3668 


104 


334 


TVLFPPCFLLHIDFCMVLDFFSNFFVLN 
YF*NYRRVQKMYEEFPFFLFFFLIFSFS 
FLFFLSTFNFFLCPLCDLWYH 


3646 


17547 


A 


3669 


369 


260 


DGLDLLTL* STHLSLPKCWDYRHEPPCP 
ANMPSFKN 


3647 


17548 




3670 


111 


342 


NGVSFFPPKKGGIWANGNFRFPPKPGG* 
RNFLA*PPQKFGMAGPPPPPQPVGIAGP 
P PHARE I LE FFFFFFKKKGV 


3648 


17549 


A 


3671 


377 


3 


FAI IGGFIH*FPVFPGYAIAQTYAKIHF 
T I I FI GVWLAFFPQHFLGLSGMRRRYSD 
YPDAFTT*NILSSVaSFISLTAVHiIIF 














MI*EAFASKRKVLIVEAPSINLE*riYGC 
PPPSHTLEDPVY 


"3649 


T7550 


A 


3672 


39 


348 


AKIAPLNSSLDERMRVSKKKKKGHGLTP 
KPFLKKKLGKNFPLSQLEGVGKKNQKPF 
GFKKGLELAP FNGKGNGENPNTWRWG * T 
MSFFFFF*DSHPFIQAGVQWRDLGSLQP 
RPPRFKRF 


3650 


17551 


A 


3673 


327 


3 


SKDPVFPKVFKJCNPPTPNIGVFPFALPL 
KRGQFQPLFNPKGFPFFPFKPVQLGRGK 



774 



wo 01/64835 PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
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nucleotide 

correspond 
ing to first 
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peptide 

sequence 


Predict- 
ed end 

"tide" 
location 
correspon 
ding to 

acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ISPQFFFKKRFWG*TKSFFFFF*DSHPF 
IQAGVQWRDLGSLQPRPPRFKRF 


3651 


17552 


A 


3674 


379 


267 


EPRLSPCTPAWATE*DSISKQNKTKKPK 
LAINKKKMF 


3652 


17553 


A 


3675 


2 


270 


FCRDRVSLYCPVRS*TPGFKGSSCLDLP 
SSWDYRHAPPGLAKIGGLHVFCLPCCGS 
LLVCWSSHCSLAHQELLLENKHTDVWMQ 
GWMDR 


3653 


17554 


A 


3676 


3 


54 


PVYKWKWKRPDAVftHTCWPSTLGGRGGQ 
ITRSGDRDHPG*HL 


3654 


17555 


A 


3677 


291 


30 


YSLVLLIFFFYCFSISYFISLMNFCHNL 

YYFLLLDFGLVCSYFPSILWHNEIYFIG 
SMFFIFSVIV*IFFCWEMSCNIYKLHQE 
MF 


3635 


17556 


A 


3678 




298 


RDFSASASQVAGTTGMHRHAQLIFVFRV 
CVCVCVCVCERERERERDGVLLCRPDCC 
SVAI!T*VQGRDLGSLQPPPPGFKRFFCL 
KLPSSWGHRCWDAP 


3656 


17557 


A 


3679 


20 


376 


VPPPHL2UTFLNFILLFVDTGSYYVAWAG 
FKLLAPCNLSASNS * SVGITGVSHHALL 
LIiftLKCNQLFQFVIKILLHAVCIYDFIM 
VRFSLQCFPQFYDFPFCFILFVSSKAQF 
CFLRLPY 


3657 


17558 


A 


3680 


110 


334 


NTTDYFKLP* ILSGTR*DMYKQPL*HLN 
TVREYTSKYGWRPGWaHACNPSTLGGQ 
GERITRSRDQDQPCQHGET 


3658 


17559 


A 


36S1 


326 


511 


LTMMLQMKISVICCLLRNI FWPGALflHA 
CNPSTLGS *GGR lARSRDLENS PS 
QPSS 


3659 


17560 


A 


3682 


448 


248 


AKDIJ)ITFFTKE*TPIi3«srKHMKKGSARFT 
YRNVQIQTT*RCLGTPTRVTGV*MRNSA 
RHR*GCGGLV 


3660 


17561 


A 


3683 


486 


176 


PAPINPPQLRSLVQRSGIRFC*AGQVGF 
KLLVSSDPPTLASQSAG ITGVSHQTWP * 
NYS*NSGMPTHPRQASSWRLHEQWYHVS 
LLPCAYYKMGRQNSLL I 


3661 


17562 


A 


3684 


238 


27 


NKHILCYVGQEMGMGWGSHSHGWKPHFK 
S FQVCPSAVAHACNP STLGG * GGQ I TLV 
QPGKHGKRRIPALF 


3662 


17563 




3685 


1 


299 


LITQTCHVGCLKI KG I * TSKKKKKKKKK 
KKKKKAEGPLKKKGPLKTPEKPPGGP FK 
APPFWGGPPPRFFFGGGGAPPPVFLKKK 
KKNSPLWGGGGFFFR 


3663 


17564 


A 


3686 


494 


91 


PLGKRFPGGDIALTKLSRPWAWAPPG I P 
PPLGGPGGGVP*VTPFKPTPFRPGEPPL 
PLKNQTFFGVPGPPPYFPPLGGLGPE I P 
FTPGGPGSIICLAPPPSPPPGAPKRNFFP 
KKKKEVMKMLCFHFNLLVQNI F 


3664 


17565 


A 


3687 


17 


249 


YSRFGRHFGHDPGLYFQILRRLRWENHL 
NPGGRGCSKL* SHHCTPAWATE * DPVS * 
KKNVTTIAKRWKQLTCPLADG 


3665 


17566 


A 


3688 


296 


461 


FWLWVSSVLKHMNWPGWTHACNPNTLG 
GQGE*SA*aRV*DQPG*HGETCSAKEK 


3666 


17567 


A 


3689 


149 


422 


AS GRQGPDRGCWGGQAPRHPARGAKPAA 
DMGMSGQGYTKQAAP *GDPSQRAGTOLP 
REPTSRKASGGSGGHSRASGGAVPGQRV 
AQPSPAR 


3667 


17568 


A 


3690 




415 


GDPAGGGDPILYQHLF*FFGHPEVYILI 
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amino acid 
residue of 

sequence 


Predict- 

nucle- 
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last amino 
acid 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plieiiylalaiiine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metl>ionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LPGFGI ISHIVTYYSGKKEPFGYIGMV* 
AMISIGFLRFIE*AHHIFTVGIDVDTHA 
YFTSATIIIAIPTGVKVFS*IATLHGSII 
MK* SAAVL*AL* FI FLFTVSGLTGIV 


3668 


17569 


A 


3691 


1 


393 


GTRKYS FLHDS QTS FCLSNS I PTPSNME 
ETQQKSNLELLRISLLLIESWLEPWFL 
KSMFAIJNLVYDTSDSDDYHLLKDL*EGI 
QTLMGRLEDGTRRTGHILKQTYSKFDTN 
SHNHDALVKNYGLLYCERK 


3669 


17570 


A 


3692 


402 


2 


FLTQRWGAPLPIPPFWEAPFdPgp*APd 
FKPPLPPRGNPPFS*KTQIYPALGAEPV 
I PPSLGGGGGES FLPPGGGFP* PRLFPC 
PPPRGAKPNFFFQKKKKKCLFCKAPQAQ 
HRSACIiHECLPLGNLLKLCPR 


3670 


17571 


A 


3693 


3 


434 


IFYIFFGTTLIPTIiAI ITR*GNQPERLN 
AGTYFLFYTLGGSLTLLIALIYTHNTL6 
SIJSriLII.TLTGQELSNS*ANNLI*LAYT 
IAFIVKIPLY6LHL*LPKZUIGEAPIAGS 
IVLflAVLLKLGGYGIIRIiTLILNPLTKH 
lAYP 


3671 


17572 


A 


3694 


452 


208 


INKSARHGSMNL* SQLLRRLREVCLSPG 
VQGCIEPWSHHCTPAWETE*DKTLSQKQ 
TNKKTCQHLYDHRPSALEATFRLSH 


3672 


17573 


A 


3695 


464 


146 


LKQTRATKGNLFLP * KPKLTPPGGERPP 
FQPFWKPHPLGPPKSGVLDPPGPQGKTP 
FVLKKPNIiiGGVIPTLFFPLSGGLGRKl 
PSPRGGGGAKKRKPPRQKKKKK 


3673 


17574 


A 


3696 


2 


465 


VTRLRERPALLVSSTSWTEDEDFSILIiA 
ALEKFEOLTLDGHMLPSLVCVITGRGPL 
REYYSRLIHQKHFQHIQVCTPWLEAEDY 
PLLLGSAEPGVCLHTSSSGLDLPMKWD 
MFGCRLPVCAVNFKCLHELVKHDENGLV 
FEDS* ELAAHLEKL 


3674 


17575 




3697 


173 


5) 


TFISWFGEQWFGCMEKFFSGDF *DFG 
APITQG*SFLSI 


3675 


17576 


A 


3698 


380 


26 


KQRTNFFLFFFKGTGAFWFFFPPFFNKK 
KKVFLGPGGPKIVKKKKKKEKKENQNKN 
RKQTKKK*KTNNKQNKKRETDIFGVEDE 
TWRQRWGLGGGRVANSSAEMGKGNSTTR 
PSWNS 


3676 


17577 


A 


3699 


11 


414 


RLVVTAILIC3TP* SFTGAVILIIAHGLT 
SSLLFCLANSQQKKKKKK 


3677 17578 

i 


A 


3700 


1 


401 


SSSLTLSNNNKSFLNQFVTFNEMWILYD 
NW* *PAQWLDQEAPKHFPKPNLDQKKVM 
FAVWWSAAGLIHYCFLTPGETITSEKYA 
QQFKEmRKLQPLQSAMVNRKGP ILHDN 
A*SHITQLKLQKLN* LGYEVL 


3678 


17579 


A 


3701 


395 


140 


TPPKKKKKIFPPPPKIFPPPKKYKKPPP 
PFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFSIYIDFNLCICDNL*LQKTLVFL 
E 


3679 


17580 


A 


3702 


407 


371 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFLWYKDI r IKGKNKQWV*CS 


3680 


17581 


A 


3703 


36 


409 


LLKCEGIWKKEGKKKKKRREKKGGPFFR 
VLLGGPTFRGGAHKKSLLFRGVFFNLGG 
FFFEGPFFLGGALLGPPPPP I FTPLGKK 
KIFKGKRGPPPLIFLFF*IRGPPPPPSP 
PSTKPPPPPPPTP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

residue of 
TeTuitce 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl4nown, *=Stop codon,/=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 


3681 


17582 


A 


3704 


56 


415 


QAPWGKKKKKGKEKKKKKKKKKKKKKRG 
GPLKKWPGGGPTMGGGEKKYFSPWGGEK 
KNPGGWLGKKTPLGGGGKtfAPPPPKKKG 
AGGKKKI *RGGGGKTPLIGGGKKNGGPP 

RTSPR 


3682 


17583 


A 


3705 


108 


1 


NLAK * IQ* * IQTIMHHDQVS FI PGIQGW 

LNILKSIN 


3683 


17584 


A 


3706 


425 


52 


PFPH* * * L*KVDPI PLPPQNFLHPSNNI 
YFSPLSPKKFFFSLQPFFFFGEIFPIFP 
PPKKNFFS*NPNLVFISPPLKKKIFFSP 
PPPNFAPPKNLFKSPPPFFFFFFFFFFF 
FSGKNTLYWVDK 


3684 


17585 


A 


3707 


117 


421 


VLLVIQAEYLLPEVFGTRSVSDFGFFRF 
MNICIILSG*ASLI*KSEI*NALKNISS 
FKKKFIFILEPRSHSVTQAGVQWQDHGS 
LQPQTSGLKLFSHLSDP 


3685 


17586 


A 


3708 


308 


3 


RSPPWFFFGFFFFVLFFFFFFVFQLFFS 
KYNITLYIKKKTETFTETNDNGNATCQN 
LWDTAKAIPREKFIAISTYI *KVGKHQI 
NNLMKHLKEIEKQKQTK 


3686 


17587 


A 


3709 


94 


420 


KKKKKKKKKGGGAFKKKPGGAKI*RGKE 
KKNFFLKGGGKKKKVBNFGKKPYFGGGK 
KREKTPKKN*SLKQKKKFLKQKGGKTPP 
KPWA*KKWSS 


3687 


17588 


A 


3710 


413 


184 


LGECLSLRSKKLWS6PAGGGVCL*FQQV 
RRRKWEDHLSPGIRG*RALSSHRS IPAW 
TAEQDPVSRGLVWGLPGGSW 


3688 


17589 


A 


3711 


3 


104 


GGCSELRSCHCTPAWRQS«TSSQRKKKK 
KKKMY 


3689 


17590 


A 


3712 


2 


393 


GLFSTNHKDIGTLYLLFGA*AGVLGTAr. 
SLLIEAELQQPGSLLGNDHIYNVIVTAH 
APVIIFFI6IPIIIGGCtaT*LVPLIIGA 
PDMAFPRINNISL*LLPPSLLLLLASAI 
VKAGaGTG*TVYPPLAGISr 


3690 


17591 


A 


3713 


424 


1 


FFFFFFLHRSLTWQAGVQWR]SrLGSL*A 
RPRV 


3691 


17592 


A 


3714 


2 


130 


GRVGGRVG*NLKLYSQRLKKKKKKKKKK 
KKKKKKKKKKGGQPF 


3692 


17593 


A 


3715 


3 


197 


FHHVGQDGLNLLTL * S TRLGFPKC * DYR 
CEPPRPAAFFFFFLKKKKTLAPKKFKTL 
DLTTRERS 


3693 


17594 


A 


3716 


3 


497 


RSPPSGFAWPGPAGPWEAGAAAAGQPLG 
ARPHS PCTGGCRSAGTPP* PAGPAGPPP 
HAPAAAERKPRGAEQSQRRVQETGPPPT 
RAESNPSGLRGREVPGSAGCWSGSQESD 
FGAGCPAVT*GTYPYRIAGQVSSRAPHP 
AESRGCTAGLSIGPDGRQ 


3694 


17595 


A 


3717 


40 


379 


AITSPRS PPAVHGS CLARTHPNS PQCLP 
LSGTLLLPSSSLSLSHPATAAGE*IWDA 
EKKKKKKKKKKKKKKKKKKKKKKKKKKK 
SPGGALKKKNPGGPNPPGGGKNISSPPR 
G 


3695 


17596 


A 


3718 


445 


286 


KKKK*METRSCYVGQSGLELLASSDPAT 
LASQSVGITGVSHQAQPVNKLPRVQ 


36% 


17597 


A 


3719 


15 


383 


GYFLEAS CLIiHPLLHPMYQNCGHVACLA 
AKENGKFMQAACGQLNSPALSLKNGRLS 
AAYGGSDCVYLFPMSKMVLGTQ*ALNKY 
TLNGPGAVAHACtf PSTLGG *GGEMLRSG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y--Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VGDQPGQHGET 


3697 


17598 


A 


3720 


14 


472 


RSCaRNPGLTHASAQYQTPLFV*SVLIT 
AVLLLLSLPALAAGITILLTDRWLNTTF 
FDPAGGGDPILYQHLF*FFGHPEAYILI 
LPGCAIl rHIVTYYSGKKEPFGYIGRV* 
AMISIGYLGFIVRAHHIFTVGIDRDTRA 
YFTSATIIIAIPT 


3698 


17599 


A 


3721 


3 


298 


EAAIKYFLTQATASIILLIAILFNNILS 
GQ*TITNTTNQYSSLIIIMAIArKLGIA 
PFHF*VPEVTQGTPLTSGLLLLT*QKLA 
PISIIYQISPSLNV 


3699 


17600 


A 


3722 


86 


505 


AGAVTY1VTYYSL*KKDSFWIHSVWC*A 
MRVNWES *GLSV*AHHI FTVGIDVDTRA 
YFTSATI I lAIPTGVfCVFS *IiATLHGSN 
MK*SAAVL*MiGFrFLFTAGGLTGIVIiA 
NSSLDIVLHDTYYWAHEHYVLS IGAVL 


3700 


17601 


A 


3723 


432 


48 


FKTSFLPFLLPSFCPSFPS FSLFPFSFS 
CFLSCPSSLSFLFFFLLSFHPFFFPTLL 
ISF*VFLF 


3701 


17602 


A 


3724 


2 


406 


HHIFTERIDVDTRAYFTSATIIIAIPTG 
VKVFS * LATLHGSNMK* SARAL * ALGI I 
FLFTVSGLTGIVLANSSLDIGIiHDTHYV 
VAHLHYVLSIGAAFAIIGGFIH* FPLFS 
SYTLDQTYAIIHFTMIFIGVNLT 


3702 


17603 


A 


3725 


174 


158 


KFF6GGFPHFFPPQKKGPFPKNPPRGFF 
TPPLL6KNFFFPPPVNL6PPRGFF*GP 


3703 


17604 


A 


3726 


2 


392 


LFST1SIHNDMGTLYLLFGA«AGGLGTALS 
LLIRAELGQPGSLIiGNDHMYlirVIVTGHA 
FGIIFFIVI PI IIGGFGNRLGPLIIGAP 
DMARPRINNISC*VLPPSILLLLaSAIA 
EGGAGTG*TAYPPLAGNY 


3704 


17605 


A 


3727 


48 


474 


KKKKKKKKKTKKKKKKKKKKKKTRGPPP 
KKTPAGPPPPPPREKKNPPPEGGQKKPP 
PAH*KKPPPPRGGQQPPPPPPQKNPPPE 
KQKTMGREGGTPPPKPPRKKNGSQQlflD 
PPSNA?UaJRHPTQEPPPQSQCIPPPPTP 
PP 


3705 


17606 


A 


3728 


387 


53 


PPPKKGVLPSPPNFYTPPK*GVFSPLPP 
*KFFFSPKGLI FWGGGGPNFPPPKKKFF 
SKKPRGGFFFPP*KKKILISPPREKLGP 
PRIFLKGPPPFFFFFFFFFFFF 


3706 


17607 


A 


3729 


1 


397 


PTRPLQPLSPEFKRFSCRTSPDVAW*VS 

LPSPPPSGRSFFVFLiVESFCVWIKKRF 
TMLARLVLDS * PSDPPAWASQSAGI TGA 
SLRASLLYI Y* CFS IQLGLLPEFLI QYV 
QLLIVYPRIHHLYLSVPHLH 


3707 


17608 


A 


3730 


407 


0 


TLKHTKKKKKKEOCKKRGGLLKKTLGGPK 
*IIGGRKKKIFFLKGGKKKQTGEILKKKL 
ILGGEKKGKNPPKKKKPLGEKKNFKGEK 
GKKNLKMFGV IKLGTS S KKI I F 


3708 


17609 


A 


3731 


3 


200 


GGCSELRSCRCTLAWVTE*DSVSLKKKK 
KKKGPFKVLRGKGPSGNKREGAKKKIQG 
KPCPFWESF 


3709 


17610 


A 


3732 


3 


380 


TNHKDIGTLFLLFGRiRGVLGTALSLLI 
RAELGQPGNLLGNDHIYNGIVTAHAFVI 
IFFIVIPIIIGGFGN*LVPLIIGAPDMA 
FPRINNISF*LLPPSLLLLLASAIVEAG 
AGTG*TSYPPLAGN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

nucleotide 

location 

correspond 

residue of 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=l-ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3710 


17611 


A 


3733 


6 


394 


AMISIGFLC3VIG*AHHIFTV*IDVNAEA 
YCTSAT 1 1 lAIPTGGKVFS * LATLHGSN 
MK*SAAVL*A1RFIFLFTVSGLTGIVLA 
NSSLDIVLHDTYWRAHFHYVLSIGAGF 
AIIGGFIH*FPLFSGYT 


3711 


17612 


A 


3734 


27 


238 


GIPGSSHaSAHIG*IIAVLPHYLNITIL 
NLTIDI I LTTTAFLLLNUJS STTTLLLS 
RT*NKKKKKRGGPF 


3712 


17613 


A 


3735 


402 


249 


ASAHaSAPLFV*SVLITAVLLLLSLPVL 
AAGITILLTDRNLNTTFFDP 


3713 


17614 


A 


3736 


100 


430 


VTWPKSDS YEEVKI KLEPRQTDLRALGF 
CLSFTGKBaaC*KKKKKKKKKKKKKKKKD 
KRGG 


3714 


17613 


A 


3737 


158 


306 


FLSSVFSLVQGVLFSSLCUDKNLPDMMQ 
LWSEIFNK*LYLQELSII.C3VY 


3715 


17616 


A 


3738 


279 


3 


PGHCFYHFNLNFYFQIFFQNVLANSGSL 
LFLYQTLILSFNIFYILYLIIQISIVFQ 
SVIL*FIFSNSYSWWYGFLVCLMTFEGK 
KQTCAHV 


3716 


17617 


A 


3739 


268 


388 


HFFFFEMEaDSYAKAGV*lTODVGLQQPP 
PPGVKRYS*LSL 


3717 


17618 


A 


3740 


395 


153 


GWLMRFKERCGLQNIKV*GEAASaDaEA 
AAS * PEDLAKIftDEGSYT*QWIFYVGET 
AFYWKKMPSRTFIAREKSMPGFKL 


3718 


17619 


A 


3741 


271 


386 


TGPPPLKLWVYELGLL*RKNMRQGAVAH 
ACNPSTLKG*GGRISRSGDR 


3719 


17620 


A 


3742 


406 


190 


QLLKRIiRQENPWRWGG*GCSKPRSPPRP 
QAWVTEHDFVSKSKTKTNQPKQWGEHKV 
FFWFFFLRKPKVQFEI 


3720 


17621 


A 


3743 


272 


408 


MGLQACaNSPL*YVDIAIPCNNKGAHSV 
GLMWWMLAREVLRMRGT 


3721 


17622 


A 


3744 


211 


407 


KGKRMILFGPVKNIFCDTQFTSCVNFFR 
EEVSLDCPGWSQTPALK*SSCLGLPTCW 

D*RREQPRL 


3722 


17623 


A 


3745 


405 


189 


CVSQDGLDLLTS*STRLGLPKCWDYRKC 
WTNQLFVNGQKSLNS FVCYS FMVLDTTG 
FKEIVMLWHLVPLF 


3723 


17624 


A 


3746 


120 


362 


LKA*EISWLNGTIQTHQEKPEVQNQVIL 
EENIVFLGPQDRIFECQAVTAELEPEKK 
AIVAGCSDSHLYPSTLGGQDGWIT 


3724 


17625 


A 


3747 


401 


233 


NSGGRGCSEPRSLPCTPAWVTDRDPISK 
NICNKTNHQ*QKKCKR*TGKVIFMTRSIL 


3725 


17626 


A 


3748 


353 


109 


LGLANFCIFSRDGVFPCWPGWSRTPDLR 
* SflHLSLPKCWEYRA*ATMPGLLMCFL I 
KAJjDFQGCVGRWGGGSWTDIKLCQM 


3726 


17627 


A 


3749 


412 


221 


HGLKECLVYPKWCQNSNISLQGIHFSEV 
AM*QRAHNHGIHWFYHDSFPRKIWINWA 
RRGGSCL 


3727 


17628 


A 


3750 


1 


355 


FHHVSQDGLDLLTS * SPRLGLPKCWDYR 
REPLRPAENTVYVHFSSWSFTHHSGKIN 
*TLDQWKEQLPGWL* ILRGVFFFNPSIW 
NS*QKKASIYVFPLCWKRDIFLYHFPVS 
GTFRGF 


3728 


17629 


A 


3751 


423 


125 


QVCEKVLISLITEMQIKTTVRYHLTPVK 
MAFRQKTDDNKCWRGYGEKGTPVHCWWK 
CKWQPL*RTAWRFFK*RKIELPYDPAT 
PLLGI*PKERKSGQV 


3729 


17630 


A 


3752 


3 


171 


FQPIAFLYANSEQSGKKESTS I YSS YEE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YLGINLTKEMKDL*KENYKTLMEKIELD 


3730 


17631 


A 


3753 


2 


247 


FHHVS * DGLNLLTS * SAACLGLPKCWDY 
RHEPRRPALTRFLFVFLYLQRLDHLIRI 
NSFFIMSVNGMFTFWLKEiFNLIHA 


3731 


17632 


A 


3754 


3 


197 


GLELLTL *SACLGLPKCWDYRHDPLCPA 
QYLSFLYAINKLFSSTYLEIHNRLVCCI 
VTLLIYKP 


3732 


17633 


A 


3755 




103 


VSQDGLHLLTL* STLLSLPKCWDYSCEP 
PRPAW 


3733 


17634 


A 


3756 


103 


2 


FRPGMVMIACNLSTLGG*GGGITRSGVQ 
DWPGQ 


3734 


17635 




3757 


394 


244- 


ASTHPANFFCRNGVLLCCPGWSQMPGLK 
QSS*LSLPKC*DYRCEPPHSAS 


3735 


17636 


A 


3758 


341 


2 


AEIAPLHSSLGMKRDSVSKKKELKKLNK 
QKP]!INPLKK*AtCDMNRHFAKGIQGAKEH 
KIKCSTSTlSILREMQIICrTVKSLLI*VRM 
AI IKKSTKriSCW*AEREKGILIYTWWEC 
K 


3736 


17637 


A 


3759 


2 


322 


PPRPTRPLNNTINQMETICMYTKCYLI I 
AE*TLFPSAHGTLIKIGHILGHKASIjNK 
FKRIKI IQSAFFKHNGIKLENDKRKITG 
KHPNTWKLRKHTSKPMSQEMFF 


3737 


17638 


A 


3760 


444 


422 


VRLPHRPSPPPDPRVRFHHVGQRCLKLL 
TSSDPPTSASQSAGITVPSliQ*QP* SVR 
LP 


3738 


17639 


A 


3761 


266 


1 


GGGVPLGPVIPPLRPPQRVGFQALAPQP 
GPGSKIF*IFFGPQFKNRVFFKGRNLVF 
FPPLFSLLPFFFFF*DGVSLCHPGWSLE 
CSGM 


3739 


17640 


A 


3762 


70 


381 


HSITSYI*AFIWGFFLFCFVFFLIKEME 
SHYVAQADHELPASWAPPTSASQYAGI S 
GGSHPTQHYFIFYW3FYFTVFCGGGFLR 
RSLAPSFFAQAGAQWRDLGS 


3740 


17641 


A 


3763 


229 


390 


DIPYSFCFFFFFWKGSFVFVPQAGGKGR 
KLG*LKPPPPGLRGFSCLTLPGSWEN 


3741 


17642 


A 


3764 


119 


1 


QTSCRPGAVAHTYNPSTLGG*GVDHLRS 
GVENQPDQHGE 


3742 
3743 


17643 
T7644 


A 


3765 


400 


200 


ITAYCSLDLLQPHFLFFSHDKVSLCCPV 
WSRT?ELK*SPCLSLPSCWD*S*ATASG 
LKHFIKSLPL 


A 


3766 


376 


44 


TQPIFLGPGFQAPPGPMG*PNFLF*EKK 
LVFFPKKPKTAPAGGGGPKFPLPRKPKQ 
KKGPHPGSQNFQKPRPKFPPRPPTWGKK 
REFFSQKKKKKKRERDVKITDLNTST 


3744 


17645 


A 


3767 


403 


3 


DHFFFKGPPSQKGEFPLFLKTFPSELGG 
FG*ISLFPPPINFPPPPKKRIFPGPLSF 
FFFKKKKTYPAGKGLLFFPFPKKPGGGF 
KRWPGPPKKFFFPPPGFWACSPPFFFFF 
FFGAKVSLVAQAGARSRSRTS 


3745 


17646 


A 


3768 


407 


177 


FHHVSQDGLDLLTS *SAHLSLPKC *DYR 
HDPPRLARCIKNFCRGWRCNILRKSPLS 
LPPSAGELRAPFLDFLEDAQ 


3746 


17647 




3769 


292 


391 


RPGWAIITCNPSTLGG*GGQITRSGVQD 

QTGQY 


3747 


17648 


A 


3770 


395 


198 


LDMGFHHVGQAGPELLTSSDLPASASQS 
AAI TGMSHRAQP Y * VI SMCQQC YKHF IR 
IPFNLHWNV 


3748 


17649 


A 


3771 


1 


241 


RTRGRTRGRTRGLLSAHTGRYQKHPRIR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TPQIFSTGTTKAEAGELLEPGGGACSEL 
RSCHCSPARATRVKLHVKKFKKIK*KHP 
RIRTPQIFSTGTTKAEAGELLEPGGGAC 
SELRSCHCSPAEATRVKLHVKKFKKIK 


3749 


17650 


A 


3772 


112 


406 


FFFFFFFFFFFFFFQGEGVPPI*KPPPP 
K*FMFPKKKLPPPPPPALKLFFFSPPPP 

FFFFF 


3750 


17651 


A 


3773 


24 


197 


IFSVEMGFCKVGQCDLDLLGSSNPPASV 
S * S AGV I GV S I IHAW ?L VKL F VKRNKL S Y 


3751 


17652 


A 


3774 


85 


201 


VNFFHLI*FVNKRIKLEKKKKKKKKKKK 
KKKKINWKKKKK 


3752 


17653 


A 


3775 


1 


382 


WENHLSPGSRGS SEP* SHYCTPA* VTE* 
DSVSKKKKPNLQEAEKRLTLLLIKKAIF 
KNRGSPCFSIKWA6FKKIKQGWFFVATR 
MGDRSRGTPNLGENFLKSFLSFSIKIQR 
FYYLGLKNSPSRNIY 


3753 


17654 


A 


3776 


1 


181 


IKMATVTKATYRFCVIPSKLPMLLFTKL 
ETKILKLIWNIjERA*IPKAILS*KNKM6 
LSNS 


3754 


17655 


A 


3777 


70 


385 


LNKMQTKTFDSDTHTSICFTLHFINTKL 
QFIFLLAIYLLFVWFPFIFINFFEEPGF 
CFDDLSSCFLFY*Crp*FYYFFSAFFRL 
HLFFFYRFLSLLTLDLFSFLI 














KDDRNTEHQVHPNASDQEDEAFDPYENF 
FEDSDSPTKSSSTEPSPHIHPVDIQMTI 
FHCADNFARQYILAKLAKKKKKKKKKKK 
KKKKKDSSSPGGGPKKGAPPPPPRGEKN 
FFFFGPPLIFSGGGF*NG6GGGP 


3756 


17657 


A 


3779 


403 


176 


FLFPILFFFLRQSFTLVTQTGGQQRDLG 
SMQPPPPRFK* FNE6K6KRKLWGGA 


3757 


1765S 


A 


3780 


3 


248 


GLNQTOLRKILAYSSITHIG* I lAVLPY 
NPNITinNLTIYIILTTTACKKKKKKKK 


3758 


17659 


A 


3781 




330 


VDRLRTGVQDHPGKHSESPSLPKKKKKR 
*KILPKRGGVYLnPHFFERLRGENNLTP 
GGEGQRETESPPCLPAGGiaSKIFV*KIQ 
KKKRGGPFKGNKFFPRGGGKKIFF 


3759 


17660 


A 


3782 


399 


187 


FPPPAKGDFLPSHLFGLPPGFSPPPVFK 
PRPRNLILGPP*KKFYLPPPRP*TWFL* 
NGPPPFFFFFFFLDF 


3760 


17661 


A 


3783 


392 


11 


KKLKVNLPDHPAVPPLGIFPKEMKAGI* 
RAI CTLMFIftALLT I EK*WKQPNWLLVD 
K 


3761 


17662 


A 


3784 


324 


80 


PGPFFFFPPPAKRGFFPPPFFLVPPGFF 
PPPLF*TPPPDFFFLAPLKKFFFSPPPA 
LNFFFFKGPPPFFFFFFFFFFFFFF 


3762 


17663 


A 


3785 


354 


24 


PPNKIFFSPPPQKKYFPYKTPPPTFFFL 
SSFFFFFFFFFFFFFFFFFFFFLC*MVH 
FIVKYRKCKRGTRIILHTPTLRQLVLCC 
LSDVLLGCCIFIYVTMCDLCAPKLG 


3763 


17664 


A 


3786 


3 


369 


SCRGLVWVYGELLLHPCRGNQISRVMQK 
KKKKKFGASPPREGAPPGEPP*RGP*TQ 
KEPPPLKK*APPPGPGGGRPPPMGLGGG 
PAPPPFFWPKAFFKKGPFSGQEFGFPLK 
GLFFWKFPF 


3764 


17665 


A 


3787 




125 


FRHRFSRDGLDLL IF* SAHLGLP KCWD Y 
RREPPRPAHSIII 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unl<nown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3765 


17666 


A 


3788 


427 


251 


LTPKGRGCSEPR*CHCTPAWVTEGDSVS 
KKKQAKNRCSLRMKYERNEFTSSPC 


3766 


17667 


A 


3789 


250 


2 


GFPPVEKIKKNPFPIF*KKPGFFPFPNP 
FF*ISG6K6KIFFLGPPPFSFFSPFFF* 
DRVLLCHS6RSAVAQSQLTATSTFRR 


3767 


17668 


A 


3790 


412 


3 


6STPITTPFWGVGGALPPGGGVPPPPAP 

PGGTPFFFKKKKKKKFQNFPRPRCSRYN 

GGT.GRIvNFFTI.KAKGSTICPKFSPAPPPG 
GQNKT?FPKKKKKSR*MTLQSFPRPGTA 
PGAAM*PVSPPEPCGPPAPALSTK 


3768 


17669 


A 


3791 


10 


1S7 


GCSEL*LIHCTPAWVHSETLSQKKKGRG 
KGGFKKNGIPCFGKNKGGKHFGKSFGHF 
STG 


3769 


17670 


A 


3792 






FRVPLGRLSRGEHQQQ*QQQPPPPPPPP 
GPLHPIAGPSRKGSFKIRLSRLFRTKSC 
NGGSGG6DGTGKEPSGELAASAASLTDM 
GGSAGRELDAGR*DRPGSGRQTSFSW 


3770 


17671 


A 


3793 


1 


431 


FRKGSARCRAPKPAGADGVHWGHHRASP 
QTAQAPPVTCSARGSWMDFSHASGPQDR 
DLHSPSPVCPRGWAPGTHSPTGASGA*G 
RTPVGGGK*VAGVQQRIHPGLAPG*LVL 
PGIAVHITKKPGSKGTITPGTVAHAOTP 
NTLG 


3771 


17672 


A 


3794 


342 


48 


KRENMPGVAYNPSTLGGCNPS IVGGFCN 
RVCNPSaLGGCNPSFLGGFCUHVCNPSP 
LGG*GGRISRSAAQDHPRQHGEKMSPQK 
IQKISQACFARVG 


3772 


17673 




3795 


2 


189 


LNPGGRGCSEPRSCHCTQAWATE*DSVS 
KKKKKKGSRFQNTIPSALPGFPVSGELI 
LGGPNL 


3773 


17674 




3796 


1 


401 


LNPGCKGCSEPRSCHCTPVWVSE*DPIS 
QKKKKKKKKK 


3774 


17675 


A 


3797 


279 


3 


IIAKVYPMVNS FLEKIHYFLLVELCCFL 
QHVLYYLTFKTTETGSRSVAQAGVQWHD 

RSSL*PLPHGLKRSFFLSLPRSWDYMHT 
PPCLATE 


3773 


17676 


A 


3798 


229 


394 


VFVKS LLCQPGMVAHACNPSTLGG * GGR 
ITMSGD*DHPGKHGKVSTYNTKKKKGG 


3776 


17677 


A 


3799 


380 


206 


AGHM* S* LLRRLRQEDPLSPGVQGC SEP 
*S*PCPPAQVTVRISLKKKRLHEHDKSS 
AS 


3777 


17678 


A 


3800 


450 


3 


lYLLRGRHSLHYSRPSFYFSNYRGACSY 
MRPPQQATLLGRLNQGITREGPAIAHSP 
AVRPRLSIoAHAHVRRLCLPRPGAAPEFP 
AFFLPCLLVAQAGVRWRDLGSLQPPPPR 
FKRFSCLSLPSSWDDRRPPPLSS*FAFL 


3778 


17679 


A 


3801 


301 


10 


KKQPPKKKKRCFPPLFPPPKGFKGSPPF 
FFLTPPPLKNLPWTLKALPQIFLRTFFF 
F FFERGS P S VTQAGVLWHDLGSLQP * TP 
GFKHMPSCPVLLC 


3779 


17680 


A 


3802 


420 


51 


SASSPPASFFFFPPPKKKIFSHPFFFFF 
PPFFFPPPFF*PPPPFFFFFPPKKKKIF 
PPPPQKIFFFLKPPPPFFFFFFFFFFFF 
FFFCGFVWCQFSVLI 


3780 


17681 


A 


3803 


253 


73 


QTTPSKRYSYVLISrCCMS[*ITTCKRIKLD 
PFLTLDLKIN*KSFADIiHUEAKTIDTIK 
KVKG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, l^Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3781 


17682 


A 


3804 


385 


55 


AGTRPVFAALAAAAPAPRLLPLAPHGSA 
ATGFPPPPCPHGEGLRGAPAASSSPPLA 
HHSQTPNWVDCGESPIPDPQFPLL*NGP 
PALCLPQGLETVHVESLflLCFTSHGN 


3782 


17683 


A 


3805 


191 


2 


QKHLKAlJSrFLPRCVCVCVCVCVCVCVlStV 
TF*ACKICTCSDFP6SPRKKYLEFGTKI 
INIVDFN 


3783 


17684 


A 


3806 




367 


KLRQVI YLEDLPTI IDEGSYTKQWIFHV 
N*TAFTOKKMPLKTSVTREEKSMPGFKA 
SKDRLTVFLGANWSDFKLKPMPIVNSE 
NPKAIUNDAKSTLPMLYKBMNKACIAaH 
LFTAWFTEYF 


3784 


17685 


A 


3807 


3 


145 


LGPGGGGCSEPRSCHCTPAWATRGKLCL 
QEKKKKR*RLDPEAEVAVSRGRAIALQP 
GQQEGNSVSRKKKKKGRDKVSI.CCPAGF 


3785 


17686 


A 


3808 


362 


86 


NPPF-CSPPFFPPLWGPKQWPPWCEG* 
TPPGPPCEIPLPPKKKSKSPPGVWGPPF 
FPPPWGGGAPKFFLPPKWGFPQTQICPL 
PFGLGEKN 


3786 


17687 


A 


3809 


245 


377 


QPLPPGFK* FSCLGLP 


3787 


17688 


A 


3810 


3 


268 


KRNFLNLIKRRYEKPIANFVFNSKIVNT 
FHLRSSTS*GYPLLPLLIJiriLKVLPNEI 
RQQKEIKGRKNKGKKTISVYSIMLSANT 
EKPK 


3788 


17689 


A 


3S11 


2 


388 


SAENCLWPGQRGCGDLRWCHCTPAWVTE 
RDCVSKTNKKETEKYMRN* TGNMLTI ID 
AGHHYMRCHQTIFTFFFFWRGSLIFLAP 
PPSQGEGIFLPHPPEKWKKKVRITTPTY 
FCFLETDGFLPCGPGGF 


3789 


17690 


A 


3812 


292 


136 


PPTVIARKGKPRPJCLSSEPWQFSAEQLG 
LPSFLCFFVCVRCCCLLLFLLLFCRDEV 
S ICCPGWS * TPGLKQS SHLS FPNCWDCR 
CEPLC3?AIF*FFCLCTLLLFAFVSAAVL 


3790 


17691 


A 


3813 


359 


52 


KKGRFFFFEKNPFGDLVPKKKKKKTPPP 
PNPKKKKRGapPFFFFFFFFRH*VPLCC 
LGWFRTPGLKGSSCVGLPKPWDYRHKPL 
PLTKDDYYVYGLPFVSLQ 


3791 


17692 




3814 


382 




GGPGMEAPLGYHIKTPFPLKPQKMSTPGG 
GAKPWPPPPGG*AGKMGLPPRQRFPLT 
QKGPPPPPLGGKREPPPQKKKKKRVTAM 
GDP*NSECR*GCHLRVQKEMRLPGLRNS 
PPHYI PNRNTDMCS C 


3792 


17693 


A 


3815 


472 


340 


TGFHRVS QDGLDLLNS * SAGLGLP KCWD 
YRRGLFI r I FNMGEWS 


3793 


17694 


A 


3816 


247 


378 


PPPPGGGXGXI^LGLLKPLPPG*REIPPP 
PPRGGGNPGGAPPPPG 


3794 


17695 


A 


3817 


25 


465 


TPPLGG*GKRFPRGPGFSPPPPPKGNPP 
FF*KFKKKPRPGGGAP*FPPPGaGRPKN 
PLPPGPKGSH*PKFPPPPPPGETKRKPP 
PPKKKKKKRKKEKKKEI*AYKAYICFDC 
PLTGPGDLKSDKTLAFPSVIGLPRPCTC 
LSPR 


3795 


17696 


A 


3818 


475 


147 


PPKG*RHPPGPPRFRRGPGNPPPHRFFF 
VLRTPGYSFPF*KKRGPPPTFFFTLPGP 
PGFLKGPPS PGGGGQKI PFPRGGGGPLK 
*IPPFPPPPGKKKDFFFLTKKKKKKP 


3796 


17697 


A 


3819 


461 


57 


TNQKKAAPPPLLKKKFFFFFKKHTGGAP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 
Site 


Predict- 

otide 

correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PGEKKKRGGGEKKKRGVYRKGPRAPSRG 
CAPPLYYRGGLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFHVKNILFVF*DYIVIIADL 
IQSLRHKDFQELLRR 


3797 


17698 


A 


3820 




418 


HAYHI * KPSP * PLAGALSTLRMTSALAM 
*FHFHSITLL1LGLLTTTLTIYQ*WCDG 
TPESTYQaHHTPPVQKQLRYGIILFITS 
QAFFFAGFF*AFYHS SLAPTPQLGGHWP 
STGITPLNPLEVPLUSTSVLLASGVS I 


3798 


17699 


A 


3821 


385 


1 


LQFPPFSPLLFGVQRPFFSSFPSCPWWG 
PVPSKLLPLLQILGRGFYPQKSLLVSLG 
FFFFFFLI IKKHRNEVLLCCPGWSRTPE 
LK*SSHLGTPKCSDYRCEPPHPPDPWDI 
CSYVPFLQSTHFGLNY 


3799 


17700 


A 


3822 


72 


424 


KKKKKKKKKFPPRKKGAPPAPPLFGKaa 
GEKLFPPRGLIFKKPKKAPGEKKKNLFK 
KKKKRGGP*KKKKLKAGGGEKKIFFKGA 
QKKNPGAGFKKQQRGKNR6APQKKGQEQ 
SSS 


3800 


17701 


A 


3823 


154 


2 


GHHTQHTEPQRLNNSYRYAELKHLSHTK 
NNPI*KWAKDMNRHFSKED1C2M 


3801 


17702 


A 


3824 


422 


212 


MGSSNPSPTASWVDEPTGICQHT*FYVE 
MGFCJHVSKaaLGLLGLRDPPPSASQSAG 
VAGVSHCTRLWIIL 


3802 


17703 


A 


3825 


2 


417 


KRTOTRGFPPPPPFKTPPPKIFMGPQKK 
KNFSPPPPIKIFFF*KAPPSP 


3803 


17704 




3826 


423 


65 


ACGRGKYFKSGGRVKPFCAVDFPPSGKK 
ANQPTPKNPRKKQKPSGR6MSPMGKPPG 
CPCPMPKASPGPVS FNKGGVAPATKDFC 
PI6KTA*IPSKKKKKKSDPa*VKKaKPN 
LYSIYMK 


3804 


17705 


A 


3827 


285 


136 


FFFFFFFFSGA* KI FI lATIHY 


3805 


17706 


A 


3828 


425 


316 


QDGLDLLTS * SAELSLPKS WDYRHE P PH 
PAQTFFKQ 


3806 


17707 


A 


3829 


3 


169 


LSN3 * ANNIil * liAYTIAFI VKl PLYGLH 
L*LPKAHVEAPIAGSIVLAAVLLKLGG 


3807 


17708 


A 


3830 


6 


452 


IGFCQGFAKDFCSKKFQRNVRVSPRGGE 
GRKFLLFKGGGFQKFPPLPPGWGAKTPF 
LKPPPKKKKKKERKEKKEKEKKRNPGPP 
VSPSSWSPVP*APSEQRSGPQLPPSRGD 
*VVRLPSPPRPFRDCVRRSATRGNSPPC 
RGSQGP 


3808 


17709 


A 


3831 


45 


387 


NFGVJCVFS *LATLHGSNMK* SAAVL*AL 
RYIFVFTVGKKKKNKKKKKKKKKKRGGT 
PPGGEKRKFFSPKKKKKKGFLKNPQPKK 
NFFWGKPQKKKGGKTPFWGGGKKAPKKT 
P 


3809 


17710 




3832 


463 


288 


DGLDLLTS*SVRLGLPKCWDYRHEPPRP 
APCILMATFFQFEVLFSEFLETIVKSVL 


3810 


17711 


A 


3833 


294 


90 


FFFLFFFFFFLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFF*AGIYFI 
TCAPPPAYPRS 


3811 


17712 


A 


3834 


427 


37 


HACNPITLaG*GGRIMRSED*DH 


3812 


17713 




3835 


499 


53 


AKLMYFQSRENVAKR3YRRNQKA3MVHP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

nucleotide 

correspond 

residue of 
sequence 


Predict- 
ed end 

location 
correspon 

last amino 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, P=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoph9n, Y=Tyrosine, 
X=Unknown, *=Stop codon,/==possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NQEPAVIAGQGTIAi:iEVLNQVK*LSSyH 
CRLLVGMyHGCLLVSSVTPFCNIFSGLK 


3813 


17714 


A 


3836 


304 


3 


RLKITKTNNNQTEINS*YIKDLNARAET 
IRLLEEM[IGGKPHDIRFAMIFWI*HLKK 
TKNIYKLDyTKILDFCISKDTINKVKR* 
PLEWEKIFJiNHISDKG 


3814 


17715 


A 


3837 


475 


135 


MEGLFCPAPMVHPFIPPPLGG*GGGSPG 
PGIQTPPGYPGEPPFFFTLKKLPEEGAV 
FFFPPPLGGLGPKNVFTPGAGASLEPGW 

APSPPPGGPP*NPFPKKKKKKRLKIGWH 


3815 


17716 


A 


3838 


415 


245 


ETGFHHVGQEGLDLLTS*SACLSLPKCW 
DHRREPPHPDPQVSFIRMjFPFMRAPLS 


3816 


17717 


A 


3839 


460 


413 


WDYIKLTGFCTAKETRVKRQPTEWDKTS 
AKYISDKGLKTRLHKEQ*KMTNLI *NWV 
K*LGRySH 


3817 


17718 


A 


3840 


2 


470 


GGDAPHFPDGVAARRRGSSLLRWGGCQA 
QGLLTSQTGWPGRDAPHLPDGVTAGPRE 
SPHLRRWflAGQTRSSLPRCDGGREEALL 
TS*VGWRPGRr(APHFPDWAaRQRGSSHP 
RRWAAEQERSSLPRRSSGRAEAAI SNSA 
SLQRVSKVSVSCRPGG 














3818 


17719 


A 


3841 


54 


332 


QFGFHHVDQRJiPELLTSDHLPSSASQSA 
GITGVSHRARPSIiMSFTHSLAL'PTPA 
HSLIHWLTHSLTHSLSFPCSFSSPPSHT 
HSLARPLGV 


3819 


17720 


A 


3842 


208 


468 


ENPTPPPLWCKALGVL*Er)GFSLYSLSA 
HVCEIVFLYCPVLESARPLMFLWNLaHV 
CYYEWHLFSVSLEFHTVS 


3820 


17721 


A 


3843 


406 


2 


PSSSSSSSSSPPAAFFLGCPGVFPSPPF 
LKPAPGFFFWGPKKKNFFSPPRGVKFFF 
FKGAPPLFFFFFFFFFF*DGVLLCHPGW 
SAAHAS 


3821 


17722 


A 


3844 


370 


253 


VGLLEPRGQGCSELRSHHSTPAWATE*D 
PILKQNETEQK 


3822 


17723 




3845 


369 


73 


LRQDNGMNPGGRACSEPRSHHCTPAWAT 
E*DSISKKRKSQNSVFTESYFASEHCYS 
LTGGGVAILMEGPTGPDGAI FMCHLRNL 
CRCSPILPLALQCDH 


3823 


17724 


A 


3846 


198 


388 


FRKSINVRYHVNRINLKNHTI IP IDAQK 
P*EKLQQHLTMKYS I IMNSRKLSQPALR 
NLF*KPS 


3824 


17725 


A 


3847 


391 


20 


GFPPPRVF* HGPANF FLGP P KKKF I LP P 
PR**TWFLLKGPPLFFFFFFF*DGVSLC 
RPGWSAVSKN* QTNKKS I ILP IRGKF ML 
FVHSMLIKGFTCDLLIFKNRKVKESTKT 
IDHIFSRQLMA 


3825 


17726 


A 


3848 


227 


2 


KRRNPFHKAIAAIDSSDGPR*SKLKNFW 
RGFS lADAI KI FRDSWEVKIS ILTGFWK 
KWILTLMDDFEGFNRTRGR 


3826 


17727 




3849 


373 


2 


S SPQKYWGGPGFFPP PGFKTPPP P FFFW 
APTKKKI FFGAGRENFFFLRGPPPP I I F 
FFFFFFLFFFFFFFFFFFFFFFK*YNSF 
I *DIKTHVFLLHSVGFRVLVYKTRLYFT 
LYHSFLKNELG 


3827 


17728 


A 


3850 


394 


2 


EACSVTQAGVQWENLGSLQAPRRQS FCL 
RGSLLS PQH * EDRT*HIGGTQQTSMNE * 
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SEQH) 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
otl 


SEQ I© 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 


Predict- 

location 
correspon 

last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \°:possible 
nucleotide insertion 














MNLYLQWLCPLLYQLCSTSYT*EVIQLN 
lAFCVCLVGFFCFFVFLFFETASQSFA* 
AGVQWCHHHSLQPQLPKL 


3828 ' 


17729 




3851 


3 


164 


NHLNLKGRGCSDLRLCHHTLAWATEGDS 
ISKTKTVQ*RKI,QN?IIPHEHTCKWL 


3829 


17730 


A 


3852 


35 


390 


NFVSNSFFFFFFGNGAPEGPRGGGQGPN 

LG*GEPPPPG*KQFSFLTQPHGGNNGGG 

PPPRTOFGFLRIOSrGVPPGGPGGPQPPDj;, 
GTGPPPPPEGSPPPGPQFSKTKRTAPGQ 
PKGGPF 


3830 


17731 




3853 


6 


437 


PNK*KTPGLPPPPFFYFFPPPPPFLSFL 
KPFSFPPPPPPPLFFFLPPP 


3831 


17732 


A 


3854 


1 


235 


PTRPLRWLRQENRLNPEGQGCGEPRSCT 
YTPAWATE*DSISKKKKGGGRLKGPTLT 
SGGGQQLFFLRQPQKSTSGAAF 


3832 


17733 


A 


3855 


52 


351 


TVGIMEPQTTDVYVKUJKCTDFLLPDYI 
FNKQHyPKKKKKKKBOOaCKKKKKKKKKK 
KKKKKKKRTKQKKKKREKKKKRGGGAPQ 
*LTLSPPPPPLYYFFF 


3833 


17734 


A 


3856 


72 


420 


IITYGKKKKKKGKKKKKKKRGGREKEKLW 
EAGGGGTTRGL* KENSGGNMGKPRIKKK 
GGKaRGAPLGVSSTQLL*RGEGEKCLSP 
GGGGCSEHKTTPLAYPGLGKKKETQSGK 


3834 


17735 


A 


3857 


410 


107 


SSSSSPLARYFLGVAPGFPLPTLFKTGP 
RIFFLGPP*KTFSLPTPGR*T*SP*KGP 
PFFFFCRDRVLLW* PGWSEMa3r.KQSSC 
LCLQKCWRYRYKPPQLA 


3835 


17736 




3858 


419 


239 


TTRSQIiLLFLVEMGFHHAGQAGLKLLTL 
R*SA.CLPKCWDYRRESLYLADA*GFYNL 
EDPL 


3836 


17737 


A 


3859 


262 


1 


KEPPFFFFFFLFPFFLFFF* RRVLPCCP 
GWS*TPEFK*STCLGr.PK*WDYRHEPPN 
PAS 


3837 


17738 


A 


3860 


357 


133 


NTPPPGAGRNKENPLFPKKKPRGVGPLF 
PPPRKAQTEEWLYPGPK*FFFFPPPPPG 
GKNTKKKPPPQKKKKKKSL 


3838 


17739 


A 


3861 


253 


375 


TLSPQG*GGPPPPPPRGGGLEAPPPPPG 
LFFVFWGKRGVSP 


3839 


17740 




3862 


3 


482 


YITDEViroPSLTIKSIGHQWY*TYEYTD 
YGGLIFNSYILPPLFLEPGDVRFL 


3840 


17741 


A 


3863 


1 


129 


IjANSNYERTHSRII ILSQGLQTLLPLIA 
F**LLASLANLAKEG 


3841 


17742 


A 


3864 


289 


372 


NHLVKKWAKDLNRHFS*EDIQIAHRHMR 


3842 


17743 


A 


3865 


266 


31 


KI CFI FSPPQKKSVFFPKVRFPQKRKPP 
PALLFSAYFFFEAESSSVTQAGWQ*CN 
LGSPQPPPPGFKRFSCLSTLGG 


3843 


17744 


A 


3866 


1 


395 


NTSE FC : KP RAYNES EHHWDMVRRHL * G 
KEDNLTLDISKLKEQIFETSKAQLNLVS 
ETEAMVKAVDSLTNUSTPVTWVKTIGNST 
lANFVLILGCLASLLWYRYIQQLRRDS 
DQREGAMMTMAVLS KRKGG 


3844 


17745 




3867 


137 


2 


APLIKKHECLRAVAPACNFSTLGGRGGE 
ITR*GV*DQPDQHGETP 


3845 


17746 


A 


3868 


1 


108 


ENRIiNPGGRRCSEPRSCYCTPAWATERD 
SMS*INirK 


3846 


17747 


A 


3869 


198 


302 


IJJ!II]!IHISrYGGV*GCSEL*SHHCTLAWAA 
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amino acid 

sequence 


Predict- 
ed end 

location 

correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C^Cysteine, D^Aspartic Acid, 
E=Glutaniic Acid, F-Phenylalanine, 
G=Glycine, H=H!stidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
0=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unltnown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ASLHPKKKKKKRKKKSVKSQGAFYSHYA 
MAILNYIREKKLFLKWK*LLNINHNYGG 
VLEKKKP*SYTYNLKKSQNLPQLL 


3847 


17748 


A 


3870 


237 


419 


X3IQSKQGLRAICIPMFIAASLMIDKG* 
RQ P KCF WRE E * I WXLWYHILMAYIISALK 


3848 


17749 


A 


3871 


239 


232 


KAFLHTRVKRGIQPGAVAHACNPSALGG 
RGGRITESGDEDETPA'HGFSWGKCIiGIE 
LVGHMFNFLRNCQTVFQSGYIIYILPMP 
LDS 


3849 


17750 




3872 


417 


219 


CHVAQADLKLLSSSDPPASASQSTGIAG 
ISHRAWPRGTNPIHKGSTIiMT*LPPQRP 
HLLTLSPRGF 


3850 


17751 




3873 


45 


414 


NILFFFFFWKKSNLVPQGEGQGQSLG*W 
NPPPRGLRGFFNLTPPRTGNKGGDPPPQ 
ENFGF*RKGGVTQGGGGGSKTPALGGPP 
GLTPQRGGNNGQDPRPPAKKFSLGKKKK 
NRQLPFLGFFR 


3851 


17752 


A 


3874 


278 


2 


LIS lYVS I I YLYHSGFQLSKISPPQGQL 
AYLSIYLSVCHFLIYEIGVITMPASKID 
K*DLIKIKSFCTAKETIIRaiIR*PTEWE 
KIFAIYSS 


3852 


17753 


A 


3875 


233 


379 


QCDYFNELLRMEILGPGAVAHACaSrPNTL 
GGRSG* ITRSGV* DQPGQHGE 


3853 


17754 


A 


3876 


6 


247 


MLITSASYPEDLAKIIDEGGYNKSQIFW 
VDETAIiN*KKTPSKTFTARDKSMPGFKA 
SKDRLTLLLGANFKLKPMNDLILI 


3854 1 17755 

i 


A 


3877 


411 


3 


AKIPPLTSSQGiHRTRVCQKKREREREKE 
KEEGRKERERERKKGRKKERKEERKGS * 
VRiyiLFLDKVDFRSQKIITDKEGHYIMIR 
E * MHQ *D I ILNM YI SNNRALKYMKQKLT 
ELIGRVGRPGRSGTVPGRRNGVQ 


3855 
T856" 


17756 




3878 


174 


375 


SFQQ*TAELEIKNTTSFTLTPPKLKYLG 
INLTKYVQDLYK 


17757 


A 


3879 


1 


106 


FHRVSRDGLNLLTS * SARLGLPKCWDYR 
REPPS PA 


3857 


17758 




3880 


383 


24 


FALNNFPQSGAFSGPLTLSPQVGLFFAP 
GPPPPGGGFSCPLAQSFFRIFPQKSGPP 
ALSFSPLFKKNPWFFLRTQPLLPLFPQ 
PRVFFFPPFFFFF*DRVSLCCPG*SAVA 
QYGSLQP 


3858 


17759 




3881 


400 


71 


RIRQENRLNPGGGCYSEPRSRPRTTAWV 
TG*DS ISKHNNKNK*NLSFIYFYYYFLR 
WSLTLSPKL 


3859 


17760 


A 


3882 


282 


3 


IFNFCGYIVSVYIYEVQEIF*YKHAMCN 
NHIVENGVS IPSS lYSLCYKQSNYILLV 
ILKYAGWAQWLTPVI PALWEAEAGGSRS 
QEMETILA 


3860 


17761 


A 


3883 





175 


L YKWFFKLAEHGGVYQ I LRRLRQEDCL S 
PGG*GCSEFKSCHCTPAWVTQQDPVSKK 
FF 


3861 


17762 


A 


3884 


415 


233 


SQDGLDLLTP*SflRLGLPKCMHYRHEPP 
CPAANFFLKARYFWLMVARYFWLWCNYS 

ALPS 


3862 


17763 




3885 


237 


367 


GPVLGPGMVARTCNPRILGGRGRWIT*G 
RE FETNLANMAKP CL 


3863 


17764 




3886 


359 


108 


TGQGGGGPPPPPLGRLKKKNGWPEGKN 
SNKPKLAPS PPPGGKRKPPSKKKKKKKG 
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ing to first 
amino acid 

peptide 
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last amino 
acid 

residue of 
seq.*'ete 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIalanlne, 
G=GIycine, H=Histidlne, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamlne, R=Arginine, S=Serine, 
T=Tlireonlne,V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unlinown, '•=Stop codon, /=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 














G*HLPMLPRLGSNSWAHDMILPPYPPK 


3864 


17765 


A 


3887 


245 


377 


QFIKNI SWPGAVAYACNPTTLRG*GGWI 
KRSRV*DHPDMVKPHL 


3865 


17766 


A 


3888 


406 


2 


SLNI FFFFFFFFFFFFFFFFFKPGRVFP 
LKNFFFFPFKGCRIPFPPIFPLWLKVFF 
GPTPL*NPRG*KEGFCPPPFFFFFFFFF 
*GRVLLCRPGWNAVAQSRLTATSaHAS 


3866 


17767 


A 


3889 


3 


114 


EFHCVSQDGLDLPTS*SAHLGLPKCWDY 


3867 


17768 


A 


3890 


395 


1 


iyiVLlS*PCDPPTSASQSAETTGISHRAR 
QFPMLLSQ*LVCHSVTEVFLIFriLKLFY 
lYDI liYI lYHYILLI lYYLFLFLLFFE 
TGSCSVIQTGVSSTITAHYSLKSHPSGL 
KRSSHLSLPSSQDYRCAPT 


3868 


17769 


A 


3891 


2 


189 


GRVGERGGTLL*SQLIiRR*ENHLNLGGK 
GCSEPRIJHHCTPSWATEQDS ISKKKRSF 
IKALHM 


3869 


17770 


A 


3892 


410 


258 


GC*GCSEI*SRPCTPAWATE*EPVAQKE 
KRKKKRKEKKKHVLATLAISSLICL 


3870 


17771 


A 


3893 


244 


412 


GIEEACGNFRVIVLKSSLALEGEEPSGR 
DSS*EASGGRARWLTPVIPALWEAEAGG 


3871 


17772 


A 


3894 


208 


2 


risppitpfppwvtkktkg* pgfffqkn 
*ksfffflrwsi:alvpqagaq*ldlgsl 

lppppgykpfsc 


3872 


17773 


A 


3895 


407 


18 


PLLGHPPKKAGAPLIFNLPRRPK*KPEF 
PPPPLSPTQPPPPPPPPICPAWLPPPGV 
PSSFKG*MGGPP*RPEV*GPINPFWAPS 
LPVWPQNPIPFPKKKKWGEIHCMVCPLS 
FNETFSFFKMESLRDGK 


3873 


17774 


A 


3896 


82 


401 


KGFLKNSPKGEGLFLFSFPFFFLFLATW 
LWPRLKCSGAIIARCSL*LLGSSNPSDP 
PTSTFGEAGITMGHAQFPPPDSPPPKT 
RAHHVTQPGLKLPS*SSPPASA 


3874 


17775 


A 


3897 


263 


396 


VWWPGGLG*Y*TPPIiLGGQGGRITRSGD 
QDHPG*RGKPVSTKNT 


3875 


17776 


A 


3898 


2 


386 


KTGLELLGLSNPPASASQSAGIIGMSHC 
AWPNLFLYWNHHLYPHKK*VWPGTVAHA 
YNPSTLGG*GRRIA*AQEFKTSL 


3876 


17777 




3899 


140 


33 


IFFFHL*GEIGFTMNMLDIGGGFTGTEF 
QLEEVNFSVNDINKIIAFFVT*GVDV*S 
R I F TMAKTKP YN ISLKLCS*KTF 


3877 


17778 


A 


3900 


377 


2 


TPPKENGGLPKIKVPYFRGNKFKKFPLC 
F SKV P P I STMWVLKNQFQP P I P FLKKNF 
FKTPKKGVSHYNFPPQPKS * GVFFFS KK 
KNFYPFFWEEQKKQRGPNAFLFFLKKKK 
RAAARPTRPPTRP 


3878 


17779 


A 


3901 


2 


123 


RVSQDGLNLLTS * STCLGLLKCWDYRHE 
PPCPALRCVFYG 


3879 


17780 


A 


3902 


188 


3 


LRCTWCPGTDGWVSHLAHL* FKKGRPGA 
VAHACSPS ILRGQGSWISRSGVQNQLGQ 
HVETP 


3880 


17781 




3903 


422 


238 


ILHVQLPSSWDSKRKPFFFFL*R*GFTM 
LPKLLLNS*GSSDPLaSACQRLGITGMS 
HYTWP 


3881 


17782 


A 


3904 


414 


1 


STHLGLPKCWDYRHEPLHPASMSYL* * I 
FFIYLFTHWSVSLTARMQFAEDRDLIFL 
LHCYISSDSNSTSWL*YILVFVHSPWL 
VTTIALILGFYVFVFVFLFFFETGSHSV 



788 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
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correspond 
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residue of 
peptide 


Predict- 

otide 

correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unl<nown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQAGVQW*GLSSLQPSIPPGFK*FSC 


3882 


17783 


A 


3905 


396 


236 


DRVSLLSPRLECNGTISAHCNLCLLGSS 
NSPGL*NR<aiQIFSIEGQIPLQAMSLSQ 
I.LCSAI1WQKQPYTMYKGMEMA.VCL 


3883 


17784 


A 


3906 


377 


57 


RGLVFFFQNPQA* KNLGRGRVFFPSGAQ 
KNPQKGGWAFFGRPPPHFGGRPHGPGaE 
CPLNLTPGGPKGKRLGPPPWGWGSKFQK 
GVPLQWVGGNSGLFFSKKKHKL 


3884 


17785 


A 


3907 


341 


3 


TRFFFFSPLGKKGFFSKGFKLGTPRVFP 
LPPFUJPGPGIYFWGPIKKKKTFPPPGV 
KFGSFKRAPPFFFFLMFIAALFMWKTW 
KQPRRPSVSE * *NKLS YL*TTEyFFS IK 


3885 


17786 


A 


3908 


3 


162 


MMRKaiRGHL*NNPM.EKLLPHIRGNVG 
FVFTKEDMAP FQKPALLQHHSQ YI 


3886 


17787 


A 


3909 


1 


421 


PTRPPTRTRWGFTMLVRLLLN* KHSQAG 
AVMIACNPSTLGGQ6RQITRTGVQEQPD 
QHGETPSRTRGRTRG 


3887 


17788 


A 


3910 


2 


424 


PRVRTSPGLALSGLTGLKFSKWQKKEGK 
KPQTTKTQKKKKSPKKKKTPKKKKGGAP 
LKKKKFNflRGGENFFFF*GPKKKNPGaE 
FKKGGGGKTRGTPKKKAVGKTPFFRGGE 
KKKESPSQFYKKIKKKVPGGKKIFP 


3S88 


17789 




3911 


3 


236 


GFCHVGQADLELLATDDLPASAPQSAGI 
SGLSHCaWPV*PHVMSSLKSLLATRPSL 
HRVKLQKNGLRKTRELYMKNS 


3889 


17790 


A 


3912 


304 


2 ' 


VAKKGKSKPGGLKKNPPKIGLGYPQGGP 
QNQIWEKFFFLTPKNRWGAPL* GGGGPP 
SLPKKRGPPPPGWSNPPPPFFFFFFLRQ 
SCSVTQTGVQWRDLEL 


3890 


17791 


A 


3913 


91 


344 


HAAPVCGRRTLKVHPHAPSWPPSSWAHP 
P PTWP E P SGTTMAPE GT WTLGP P QE LS 
LRRGVSYHCPPP* PLIWGRKWPPPLPCL 


3891 


17792 


A 


3914 


328 


402 


KSWGGWPXAVAHVCHPSTLGG*GGR 


3892 


17793 


A 


3915 


16 


179 


GGGCSELR*RHCTSAWVTRARLSQKKEK 
KKKGPRKGEVATGSVWTRGELKSWAL 


3893 


17794 


A 


3916 


229 


46 


AGPELLASSDPLTSASQSSRITGIPFLT 
GLYSKDHIIETiySriSyTNA*AL*ITLRT 

RGSTR 


3894 


17795 


A 


3917 


42 


383 


AFPCFVIGRVISKGKECTLYFFLFLFFP 
FFFEKKFFFFSQGGKKGGEFGSMEPFPS 
GLKETPLPHLLEEGKFWAPPPPPVYFFV 
F* * KRDFTLVARWSNRLPSREKGLDYR 
P 


3895 


17796 


A 


3918 


402 


286 


PPPPF*TPPPPKIFGAKKKKKNSPPPPG 
ESFSF*KRPPP 


3896 


17797 


A 


3919 


349 


380 


KKSPFPQ*RNFGSLQPPFPGLKPSSPLS 
FPSSWVPR*TPPNPANLFFFKKMRISHV 
AQ 


3897 


17798 


A 


3920 


28 


327 


KTTLLNQAT 1 1 QAKAGTWGHP P P LS PRE 
NKQNQKKKKKKKKKKKKKKKKKKKKiCKK 
KKKKKKKKiasIGGGGP*KKKNFSGGGGEK 
FFFFW6AKKNFGGGGF 


3898 


17799 


A 


3921 


414 


1 


NFPKVSGPKLMSPSPPHFFFPKKSFPPP 
PLFLEKKNGRFSPPFPEKKV*F*KKPPF 
FFSPKKKPRFP PKVLKVGGKKRVLKNLG 
PGLLS KKGRGTPP PP PLFFFFFF* DGVS 
LCLPGWSAWRSQLTVTSASQVQVQE 


3899 


17800 


A 


3922 


I 


169 


LRSCHCTPSWATE*DSVSKKKKKKTLKN 
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SEQI0 
NO: of 
nncleotide 
sequence 



SEQID 
NO: of 
peptide 
sequence 



NO: in 

USSN 
09/515,1 



peptide 
sequence 



Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=lInknown, *=Stop codon,/=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 



LLSVWRIPCAQKKERICKRYECECFLIVF 
VFDLSEENSSYISFKPILCVCVCVCVCV 
CVCVW* VKNFRKS * *SSDRSICrKTIKKH 
SHSYLFFLSFFCAQGILQTERSHLLPHL 
EVSLFKE 



3901 
3902 



MLPLPGATDYPASGGGGCNEMRSCHCTP 
AMVTGAKPHLRKK* KKKRQMLKTISKEM 
GNFNiyrVYILDKITK*FLYFKQLILSFLM 
GELGVK 



qlgavahacnpst*ggqgqaehlrsgiq 

dqpgkhgk 

e tnf r^tny^ttllwnrqgials pri> sg" 
gaiitiigtlki-:ldssepptsds*vagts 

GTY-4HA 



VRPGAVAHACNPSTLGGGEG*ITRSGDR 
DH 



3905 
3906 
3907 



KEHAIWGTELNMVS KI FP I * LKQRKNL* 
PPNIKCYiraLLRP*LGPGTVAHACNPST 
LGSRGGRITRSGDQDH 



GQGGGFRFIQPPSSRGKRFFPPPPPPIW 
G*RIGPPPRMSrFFFLFF*KRGFFLFGRE 
IFFFPPPGVPPPWPPQNFGFQGGAPPPP 
PPFFFFFFP 



RGPPPLNPPLWEKNHBtQPPKVGQ*KPPP 
PTKKKPLFS *KTKI FPPGGPPPLIPPPR 
GGKPKKFLYPPKFRFREPKKGPFPPPRG 
TKKKTPPPKKKKKKKKVS 



LKEFFPSRNSRSVLHTSNF*HLFI*LII 
YFFEIESYSVTLAGVQWSDLSSLQPRPP 

GLKQ 



KILIKVEIEGTYRKITNAICEKPTANIV 
PHRKKLKAFPLRNGTR+GC PVS PVLFNl 
TLEVLARVIYSEK*IKGTKIQISIATME 
NSKKFPQKTENKTTICSSNSTTGCISKR 
KEMH IS KRHLHS P VYCWT 



FPPPPQTRGPKSSSRAHSPPGPGPGSTT 
GAR*PFRPLLLVRASLRPPLGLPTPTGP 
TGGMQRPEA*RTPATDKPQKISRRGP*Gft 
WGRPGCRCRGGA 



QWTAQTWSVQNELLLFCLKPHPPPSLLP 
PSSCPGPHLGTPWTPAFPSHPTSTLKAH 
PIGSGPSSPLLLTSPRMPPEIQTTATG* 
PASSLTPTVCSQHGNRFFKKGVRAGRSG 
SHLESQHFEGPEAGG 



PPPLLLWKIGAKKKAFFCPSPFGPRKIP 
CGQKTKKSVFPFRVP*DKSTFDCFLCPI 
VYVKKQIWLAPLHCVFSGRVAQGSKKSP 
LCLSLPFFFFEMESHPVTQAGVQWRDFR 
SPQPPPPGPKPFSCLGLPSS 



KIiLFLFLFlYFF*DGVSLCCPGWSAWQ 
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M 
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od 
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09/515,1 
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Predicted 

location 
ing to first 
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sequite 


Predict- 

otide 

location 

correspon 

last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireorine, V=Valine, 
W=Tryptoplian, Y=Tyrosire, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 














TLLTEPSAIPGAGGTPVSPl 


3917 


17818 


A 


3940 


230 


381 


LISSSYREYIiAKDLRFSSKQS*WLCSKE 
FRGLGAVAHVCNPSTLGGQGRW 


3918 


17819 


A 


3941 


1 


107 


CWPVWS * TPDLKQS TCLGLPKCWDYRQA 
QPHRKLL 


3919 


17820 


A 


3942 


2 


95 


DLLTS*SACLGLPKCWDYRRKPPRPALF 

AFS 


3920 


17821 


A 


3943 


163 


404 


CRHKAQWr,PQT,PGHLLT,SEKTLKYSSPK 
GLRSQE*KLLFLFCFG*TESRSVTQAGV 
QWRNLGSLQPLAAGSKRFSCLSLL 


3921 


17822 


A 


3944 


3 


200 


GLDLLTS*SAHLGLPKCWDYRREPPRIA 
YFTFIKKKKKDGCELLNLFPNSCMDTAH 
VHPAHLTGC3 


3922 


17823 


A 


3945 


423 


2 


LFFFFSEV*SHSVAQflGVQWYNLGSLHS 


3923 


17824 


A 


3946 


209 


413 


TTCIEQCMKTRRLGAVAYACNPSTHRG* 
GRWITRSGVRDRPDQHGETTKNTNISTT 
KKSTKNTKISCJA 


3924 


17825 


A 


3947 


425 


3 


SPSQGLRDKTPISLGQSTWGKGQMWAQR 
QQT*TSWPDGSEES SRSPSTFEQQISER 
SSSAKGQTNSSSGSLNSVSPDWETPPSR 
SQQTPHTGELQLASGRCSSGMNLPEEGT 
DSNLCCSATHAS 


3925 


17826 


A 


3948 


299 


3 


ASQTTFREGCFLFFSFFISVCLFVCLFE 
MEFHSCEPGWMECNGDRARLHLKTTITT 
TIEIVKDDTNKMKHrH*RLPNIKMSILP 
KVIDILFNAIPIKI 


3926 


17827 


A 


3949 


331 


374 


NAPKSERASKAF*AILEENKEKLPLDIN 
P WHPHGHI FKCVAPPHCLGWDLSAVPA 
SHALGFPQPQNSAWHVARSKSAASCLGT 
AVTSAGSGVLGTPNKMPQKVRERARLFW 

R 


3927 


17828 


A 


3950 


439 


69 


TNQKVWVPPKEKGLFKKKQLRPRVGGPA 
FHSPPLEGQGS*FP*GQRFKPRPPPWVK 
PLFSKNPKNYPGGGAPPFFPPSPGG*GG 
KFPPTPKAKVPFIQISPPPAQAGGPTQV 


3928 


17829 


A 


3951 


422 


3 


CFPALGIGPPHPPKRVG*KGGTPPPGLF 
FGKFLKEKNFGKWPGPGQGGPFFPPKKQ 
RGPFNNGRPGKVKKKGAPGPGVPKKKPP 
RVPPPPGFFGNPPGAPPKFFFSPRGGED 
QKRKNFFFFFFLRWSFTNEAQAGVQWR 


3929 


17830 


A 


3952 


223 


3 


ALFFWGRGTLPRVKGLNPKGGGGKFFFF 
FFF*ETESHSVAQAGLQHHDLG*LQPPP 
SRFKPFSCLRLPGSSLS 


3930 


17831 


A 


3953 




388 


EKGVSPGGRTCSEQRWCHCTPVWATERD 
SVSKKKGMSNNFSLILRSWNNKGQGGTY 
FQQKFIYPCFLKIKGHSYINCKIFKK* * 
KLSLHVHVTDVNVRIILQKLDCQIHHVD 
FNFELNILQCQGLGGCK 


3931 


17832 


A 


3954 


1 


396 


KLRPKDQDTWPTQQSAIWEFVPPSVYF 
VKSISEILFFFFLKKNFFLGPQLKGHGQ 
ILVKGNPCPGGKKKFPAPPPQRGGKKGG 
PPPPQFI FVFLKKTGV*KIGPGGV* TPA 
LGIAPPFFPKGGE*RGGPPP 


3932 


17833 


A 


3955 


2 


199 


PGGGGCGEPKTRGCTPAWATERDSVSKI 
NK*KLKLKNNSWLYAKSIjaSSPTGR*EB 
WGCFTNEHI 



791 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 
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ingto first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 


3933 


17834 


A 


3956 


393 


2 


KSFPLFPPGAPPWPLSGPFSVLEKPKCG 
GGWPGINTPPGGGGFPQGSLWAGDFP* 
PLFSPQIPFFKKQKFFFFLPRGGLWAPF 
FPPFGGPLFSQNF*KTGF*GGFFFFFFG 
DGVSLCYPGWSAVAPHAS 


^934 


17835 


A 


3957 


278 


390 


LVY1FKIRTCWPGAVAHACNPGTLGG*G 
GRVTRSGDQ 


3935 


17836 


A 


3958 


385 


31 1 


GPKNPPASASQEGETTEV*HHaWLIFFL 
FPVELWSSHVAQAGLKLPASSDQS*TLY 

I 


3936 


17837 


A 


3959 


410 


119 


S P PPRFKTPPPEFI FGAPKKKFFS P PPR 

PLNLILLKGPPLFFFPLPKAFKPHINFV 
FPPQKKKKP/*I1ILYFPPPKKENESS*GK 
HFALHCNPSCISK 


3937 


17838 


A 


3960 


" 


400 


TALP LKPKKKKKKKKKKKKRGGPLKKKK 
FKAPGEEKKNFFKGaPKKKFRGRV*KTG 
ERKKPGGNKKKTFGKKPPFPRGEKKKKP 
QA 


3938 


17839 


A 


3961 


2 


194 


CL*SQLLGRLRWEDHLSSG1I*GCSEP*S 
PHCTPAWVTE *DPVSRKRKKyP WNQXL 
RGFLSPAH 


3939 


17840 




3962 


52 


409 


NSKTSSQEKKRRVIRLSLRSRVGDDTKV 
NTRRWGVFGWHLKNLPTTLVLKKMEKIIi 
ETYFMTPDLHDPHTTSYFNGCX2LC3AIKV 
GGTITDLIVF*1VSKPNAAGLSLNENVN 
AKFNDLI 


3940 


17841 


A 


3963 


382 


96 


LSSQLLREAGER1IE*TGWLQQPLQMQIH 
KNPCIEPRSRYCTPAWATEQDPVSKNKQ 
TNKQTKKSLHL 


3941 


17842 


A 


3965 


3 


169 


DAWELRSHRCPPAW*QSETPSQKKKKRG 
KKKI FRFYLFFNKRKKFLNGGMLGVI P 


3942 


17843 


A 


3966 


1 


121 


AGFHRVSSDGU)LLTS*SARLGLPKCWD 
YWCEPPHLAENS 


3943 


17844 


A 


3967 


200 


2 


PPFRIYWGEVLTPLFLRVKPKFFKPWGF 
PFCLJ'FFFFFEMESRSVAQA*MQWRDLD 
SLHPPPPAFK 


3944 


17845 


A 


3968 


471 


440 


MELRFYHVGQTGLKLLTS * APPVMASQS 
AGI LGVSHSTW*EAPNSLAI TFLS SHLS 
PRQP*THLGC 


3945 


17846 


A 


3969 


214 


2 


EGQPE CQLQAGDWVSLWNLVCSGLEVRA 
WREAPRLEWS FCLFFVSEM* SHSVI * DR 
VQWHNIGSLQPLPP 


3946 


1 7847 


A 


3970 


53 


398 


GDLPKKQQQQKPLNFCFQLLLYARLFGP 
I FLV*H]aATYKNKHLLECVGKVMINCQA 
D ILNKME I S BCVGAQKKKKKKKKKKKRGG 
PLKKKKFIPRGGEEMFFFLGAPKKKTGG 
GFK 


3947 


17848 


A 


3971 


396 


160 


KRERIKEKEDNKEEKDKEKKASGKKERK 
KEKEIRDRESKEESSNLLLQEELSLCCP 
SPGHPPWSSL*DPYFATGNSLRC 


3948 


17849 


A 


3972 


416 


3 


FPFPRVPEGVPGGEGVFPPGPPGKSGVP 
FPRPRGGGAPI*KPPPFPGGPPMFFF*K 
KTGPPFSPPQGAPPAKRAPPPFPGGGGG 
KIFTPLKKGGLFFFGPPKKGVPFGPKKT 
LFFFQKKKKKKKGRPPRSHASAHAS 


3949 


17850 


A 


3973 


232 


381 


YKTFSLI F * LFFFRDRVS FCHPDRSAW 
LS*FTVASISWAQGILLAQPLE 


3950 


17851 


A 


3974 


396 


141 


CFFPPPPKRGFFPTPIFGGPPGFFFPPF 
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Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














F*NPPPKI FFGAPKKKKl FPPPGGKNFF 
FLKGPPPFFFFFFFFFFFFFFFFFFEDG 


3951 


17852 


A 


3975 


286 


422 


RKQLISPHSQDNSKQKEQSWRHHATCLQ 
TILQGYSNQNS * *APIAKTILSKKNKAG 
GITLPVFKLYYKATVTKTAWYWDQK 


3952 


17853 


A 


3976 


3 


116 


GFHRVGQKGFDLLTS*SACLGLPKCWDY 

RHEPPRPAR 


3953 


17854 


A 


3977 




399 


EGP3TQSFFDTETOSVTQDGyRRKNr,GS 
LLPFPFGIK*FFYLSLPSNWDYRRLPPR 
PVIJ 


3954 


17855 


A 


3978 


227 


422 


SFPSPCIFFSLVDLGVSLYCLGWSQTLD 
LKGSSLFSFPKCWDYRQ*ATTPGLPVYF 
FPFPVFTFF 


3955 


17856 


A 


3979 


519 


313 


KKNLLNPGDGGCRE IKSSHCAPAWVTEK 
DCLKKKKKKTSVCVCVCVCVCVCV*KHL 
SSLGLFPEPVGUSr 


3956 


17857 


A 


3980 


217 


414 


I YI FCGDGVSLCCPGWLKQS FCLSLPKC 
*DYRHVPPRPTSFHLLTNIGVLQFLKLQ 
ATLSFTLFLL 


3957 


17858 


A 


3981 


2 


409 


KLTFAV* DY* EKLLLSLAS LTSQAAFLF 
PEHPYTVISNPCSPANYIFTLSSGTTFH 
QFYAKS * CVTYE * KKKNLLS PPTGDKPN 
*GRSQTKKIQTKY* SEICFFVHLFETGS 
HHVAQAGLELLSSRDPPTSASQST 


3958 


17859 


A 


3982 


234 


416 


SVLPSTPLUSKL* KSKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKBCSRGGAPKBCK 


3959 


17860 


A 


3983 


414 


291 


GGRRCSEPRSCPCTPAWATERDSV* KNK 

QPNKQKNKKDLL 


3960 


17861 


A 


3984 


192 


415 


TNANKLDNLDEMDRFLETQNLLRLNNEE 
TEyQnRPVTREEIE*VIKWLLTEKKNPG 
PDGFTGEFYQTFKEKLIP 


3961 


17862 


A 


3985 


403 


145 


AFFFFFFPPGEKGD FS PTVLFGGPPGFS 
PPPVFKTRPRNFFLGPPKKKYSFPPPGP 
KNWFP*KGPPPFFFFFFFFFFWKLGIEV 
IS 


3962 


17863 


A 


3986 


255 


425 


GGRFKGSNFTSAGLQGFXXXXGPPKFIS 
RPGV*QRGEWKNPGVPNLFALSTSPFGR 


3963 


17864 


A 


3987 


247 


2 


EEAVRVSNIiYTKLDEEQ«WKKQLNYVE 
RLKK*ELFLQDLYRKVPLKI*IF*VKNL 
NS FWPDAVAHACNPS TLGGRGGWIT 


3964 


17865 


A 


3988 


1 


394 


HHHTWI I FKFFCKDGVSLCCLSWARTPG 
LKRSSCFRLTKCWDYRHEQLCPAFKHFF 
LY*HNFMSTEKMQE *HKNS * YPSPTVPK 

IVSLNHVCYSNEAPLSUMI 


3965 


17866 


A 


3989 


208 


402 


ETDRRLKSGGSLSKTLIKCMTCFWFRD 
GVSLCCPGWIQTPGLK*SSCISLPSSWD 
YRHMPPCPA 


3966 


17867 


A 


3990 


2 


408 


THRPYHATPTYLPFYTNNLIKGGKKKKK 
KGGKKKKKGGPPFKKKHFSPPGGEKNFF 
FYGPQKKKSRGRV*KRGEGKNPGAPQNK 
PLGKNPLFPGGEKKKKPRFAS *PPPPPF 
S FGKKRAPPTKYPGAGLHP 


3967 


17868 


A 


3991 


81 


406 


KGGLFLLPRGKGKGKIWVKGNLGLKNKK 
NFPA* PSEEGEKKGGPHQAGKILKFGGK 
RGLNKGGKTGQKLKPGGIPPPGP*KVGE 
KKEKPPAPGQKLFFICKRKGQKKNF 
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nucleotide 
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peptide 
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residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3968 


17869 


A 


3992 


396 


54 


YSSPPPTKKTDDSDASRGPFFFFPPPEK 
RGFFPNDFFWVPPGFFPPPGF*TPPPVF 
FFWAP*KKIFFPPPRE*KFFFLKGPPPF 
FFFFFFFFFEHLQVLLIC 


3969 


17870 


A 


3993 


367 


193 


KLWEKKKILWKKG-''GEIKSCHCTPAWAT 
RWLCLTKKKKKKKKKGGEGE I FKKKS F 
S FLGKQKKKRGPAFF 


3970 


17871 


A 


3994 


188 




KRANTLPDFKTYYKAIVLKTLWYWRKDK 
HRSMKQN*ESNEVELLYHNIHRNKLKMD 
HRLKCQ 


3971 


17872 


A 


3995 


2 


147 


QENHIiNPGG*GCSELRS*HYIPAWATE* 
DS VSKKNYI **TSHIFSGDF 


3972 


17873 


A 


3996 


23 


329 


RNSPIRSAFSKVLNKWLFLTSEKECKKK 
KKKKKKKKGGRKREYLTRRGAGENNKYF 
yK*KKKKKTCX3*KKKGWGRKREREKKRL 
GGKKKFPPFEEEGKKKPG 


3973 


17874 


A 


3997 


164 


392 


KPLGRAGLVPCLCSSFTGFLM*KTEPCS 
WQAGVQRGHLCSSLQPQSPMFKRFSLLS 
LPSSWDYRVLLRHPSWSAW 


3974 


17875 


A 


3998 


396 j 3 


FFFFFPPGGKWGISDSPQLFFSGVKNFS 
PPPPPV1IWGIJIWPPPQKNFFFFF*KKV 
FSFFWGGPPMPPFFFSLLLGGPPPPPIF 
SGEKIPPSLFFFFFFFEMESCSVAQftGV 
R*RHPGPLQPPPPWLKQF 


3975 


17876 


A 


3999 


383 




VPSPEKVGIKKIQPLDPFF*FLGDCQRK 
KKSPS *RPPPPPEGKKIJSI* IRLMGPPFN 
S KEKFF* KP*AFKPFFFPPKGEGI *GFF 
SRAHSFPNQRALFWFFFFEMESHSITQA 
SMQWHDLRSLQPPPL 


3976 


17877 


A 


4000 


213 


1 


RGRQNKTLPLTKPNQTKPNQTKPNQTKP 
TTFL*S VIE IRVKKKYKTTRKTTKNPTT 
SLTRSLGRYKSPTRP 


3977 


17878 




4001 


412 


129 


GSLSSPRLECSGTIPAHCBLNLPGSSSP 
CTSAPRVAGPE*MGQACMPPCLPVWFFF 
WGGGGrr.VETRSC*VAQADLEI.LSFSDP 
PASASKVLEL 


"3978 


T7879 




T002 




"293 


RDTRCLPP*LDNFVPLVEMGFHHVGQAG 
LEIiLTSSEHIA 


3979 


17880 


A 


4003 


46 


392 


QDFFFFFFPLGKGAPPPPGKKGGTPFPK 
KKTKGGGPRGPPLEPPPLGGPGPPEPGG 
GDIiKKPWPPQGTPPL*KKPKMGGGGGGP 
p*TPGTGGGGEKNPI,TPGGGPKNGPPPP 
PPG 


3980 


17881 




4004 


360 


i 


KCLGFQFFFRFWNI CI I FI SQLS lASQV 
QKSEIQNAPMSIFFEHHVSFKQFQILEH 
FGFWI F ILGMLSTYKTLI LLKLN* RFNA 
MPINIPWIFMEIDKLILKFTWKYKEPR 
RVKAILRN 


3981 


17882 ■ 


A 


4005 


152 


408 


NKFVSHFYWMGKVLSRYPQFT*QFSFCH 
VKLPSVF*YLYTRNKILIFPflHKAFFLD 
AKYVCVCVCVCVCVCVCVYFGIGEGNLP 
S 


3982 


17883 


A 


4006 


411 


67 


YLPPPSSPTPGVFFFFPPPKKEFFPNPF 
ILGTPRFFPFPPF* KPPPKFFFLGPKKK 
K*FSPPRGKKIFFFKRAPPFFFFFFFFF 
FFFFLLGGSLSFMYFHIKTYK 


3983 


17884 


A 


4007 


217 


2 


PLFFFF KHLKAYYS DVI HI S FAI LRYLL 
F*DRVSLCHSGWaAVARSWI.PVAGQNTF 
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nucleotide 
location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=iVIethioninc, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Uiiknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ITRQEAPYLLAI PRV 


3984 


17885 


A 


4008 


235 


406 


GRHRVS SNP I S I * RRRGLS LLPKLE CS G 
VS IAHCSFDLPG*SDPSASASRLLGRI,R 
Q 


3985 


17S86 


A 


4009 


833 


237 


SLDNRi\RPCLVSFIFKKGRSrAERCGAP 
GTTS PPFPRLSERGHP 1 1 PS *RGWERKS 
QLCSHLQSSRDPSGLGRAGCGGGGDIiRE 
SPGTFPGNGGSDLVPVPTAPIPIEPQIC 
QATCAVMKLSFDEEYRRAMNELGECPRS 
SGKPSSSPRTGGGSSAAKGTQAGSSELG 
LGIGARRGTDSLESHSTSHEWTVTLGPL 
SP 


3986 


17887 


A 


4010 


276 


2 


VEVKGNNSFSEGSITLIPNPDKQPKETT 
DQPLMSTYAKIESPGKNPSHDG*LIFDK 

GAKI IQ*VKNS I FNKWCWDNWDFHVKTM 

NPHFTLI 


3987 


178S8 


A 


4011 


18S 




REKFQIYTII2JDKGSITTDTTER*KIIR 
D Y YEHL YAHKLENLEEMHKLI G I PP 


3988 


17889 


A 


4012 


1 . 


202 


CLHQKVIISN*L*ATTQTTQLSLSFKLD 
YFSI II IPVALVVTWAIIKFSL*YINSP 
PKKKEAEKDLN 


3989 


17890 




4013 




403 


CLDQEVIISN*H*ATTQTTQnSLSFKLD 
YFSIIFIPVALFVTWSIIEFSL*YINSP 
PKKKKKKKKKKEKKKRGGAPLKNSRGGP 
NFWGGGKNIFFFFGGGDKKPPRAFWKKP 
LFLGGGNLGPPPPQKFTPWGKK 


3990 


17891 


A 


4014 


67 


439 


TELIIFYIFFETTLIPTIiAIITR*GNQP 
ERLNAGTYFLFYTLVGSLPLLIALIYTH 
NTIiGSUHILLLTLTJ4QELSNS*ANNLI * 
LAYTIAFMRKKTKKKKGGGLLKDPWGGQ 
ILAGREKIKFFP 


3991 


17892 


A 


4015 


3 


260 


LI VPTI ILLPLT*LSKKHI 1*1 INTTTHS 
LIISIIPLLFFNQINHNLFSCSPTFSSD 
HLSHPILLKKKKKKKKKKKKKKKKGGGA 
F 


3992 


17893 


A 


4016 


258 


3 


lkkifspptiUlffppfplKkfffplsl ■ 
*fflgvfspfppppkkgffpkippgvff 
sppfkkkkfffpppfyfapprvffkgpp 


3993 


17894 


A 


4017 


287 


2 


KKQKHFIFFFKNLSFFGR6AFFPPLFPP 
ILRGKGGDFFYPRGLKPALGNQ*NPFSR 
F* IFFFFFFFLRRSLAPVTQARVQWHDP 

GSLQPSPSGFK 


3994 


17895 


A 


4018 


228 


487 


S YDVKQEKPN6I *ASKYWAYVFHSFFLS 
FFFFFFKKKNFFFCPQGGGQGPNLSLRE 
PSPPGVKKLFGUHLSKSWE*QNCPPPPI 

IF 


3995 


17896 


A 


4019 


35 


228 


EELNHLNQGGEGCSEQRSHHCTPAWATE 
* D S S QKKNKFWY IHPCNR I VTTTNTPMT 
ESPKHYAE 


3996 


17897 


A 


4020 


2 


402 


ARGNLNTTFFDPAGGGDP ILYQHLF* FF 
GHPEVYILILPGFGIISHIGTYYCGKKE 
PFGYIGMG*AMISIGFLGFIV*AHHIFT 
VGIDVDTRAYFTSATIIIAI PTGV*VFS 
*IiATLHGSNMK*SAAVL*ALG 


3997 


17898 


A 


4021 


1 


420 


TRGATELIILDILVETTLIPTLAIITR* 
GGQ P * RLNAGTY FLFYTLVGS LSLLMGL 
IYTHKTLG*VWILLLTLTAQEL*NS*AN 
NLI*IAYTIAFIGKrPFYGLQL*LPKAH 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VEAP lAGSIGLAAGLLKLSGYGI IRLTL 


3998 


17899 


A 


4022 


409 


3 


LEMASCLI ICNNKNPFFYQIVTCDEKWI 
LYENW*1jPAQWLH*EEASKHSPRPNLHE 
KKIMVPVW*SADGLI[reSFLNPCETITS 
E*YAQQINVlSai*KLQGLQLTFINRKGPI 
FLHOTTTKLHVAQPMLQKlJSrELA 


3999 


17900 


A 


4023 


464 


374 


HLPPWEDKARRHLPDRRTFILAFLRSRN 
VRNKFFSLQATQFLVLYASSTKITKPLK 
TNQKKRLKKSEQSLRNMWTPIKRTNICI 
VGI lEGKEGEKGAERI FEEI VPQTF*NL 
VKDIK*KVQ 


4000 


17901 


A 


4024 


58 


323 


RHKLLCEHVKKCSAVIiI IREMQTKTTLK 
YHFLLVTIAKIFNLIHF*RSVGETIiICQ 
WAGRlirAKS*PFWRRISQYLTKLYVCVCM 
YTHS 


4001 


17902 


A 


4025 


480 


150 


RNSLLRRLRQED6MSFGCQGCSEL*LHH 
CTPAWVTEQDPVSKKKKKKKAYPDPIPK 
KTDKSNGKKGPDFSRKN5PKNSPPPPWQ 
RSTAFRGRAAKSCPTH 


4002 


17903 


A 


4026 


204 


I 


KKKFGJ^FFFFMQGLTLLPRLEYR*WCD 
HGSL*P*PPGSSDPPTSVSQIflGTMGVH 
HEFLRSGUVLSK 


4003 


17904 


A 


4027 


116 


472 


MPPLPKPVSGPRPHSCSPLGGPQLSTPL 
TGPRPASLSLPISEGICGCITAILWAVW 
YLVLQMLGLSLFMA*RGflGSSCSAHKKW 
WLGLASPSC*TFFK]SIRTKGRARWLTPVI 
PALWEAE 


4004 


17905 


A 


4028 


355 


444 


VSYKKK*SRPGAVAHACKPST]:.GG*GGR 
IT 


4005 


17906 


A 


4029 


410 


181 


ASI ILLIAVFFNNILSGQ*TITNTTNQY 
SSLIIIMAIAIKLGIAPFHF*VPEVTQG 
TPLTSGLLLLT*QKLAPCF 


4006 


17907 


A 


4030 


49 


474 


PXRXXFCHIVTYYSGKKEPFGYIGMV*G 
MISIGFLGFIV*AHHIFTVGIDVDTRAY 
FTSATIIIAIPTGVKVFS*]:jATLHGSNM 
K*SAAVL*ALGFIFLFTVGGLTGIVLXN 
SSLDIVLHDTYYVXAHFHYVLSIGAVFA 
II 


4007 


17908 


A 


4031 


274 


427 


FFF*EMESHSVTQAGLLGRNHSLLVPQT 
PGLKRSSLLSLLSNQYYRSGPHT 


4008 


17909 


A 


4032 


205 


393 


TWGGFKNPSPGEG* IKKMHDFNEMFLQN 
CWKILFFGGGFFTGQIFPHAAPPINKNI 
LVWGGGK 


4009 


17910 


A 


4033 


2 


386 


SEPLSRKKLYLS ILISLQISLII TFTAT 
ELiIIFYIFFETTLIPTIjAIITR*GNQPE 

TLGSLNILLLTLTAQELKKKKKKKKKKK 
KNIGGPPLRAPSGAPN 


4010 


17911 


A 


4034 


3 


169 


GLFLAMHYSPDASTAFSSIAHITRDVNY 
G*IIRYLHANGASIFFICLFLHIGRGL 


4011 


17912 


A 


4035 


3 


317 


HASADAWAFIGYVLP*GQISF*GATVIT 
NLLSAIPYIGTDLVQ* I *GGYSVDSPTL 
TRFFTFHFILPFI lAAjjAALHLLFLHET 
GSNMPLGITSQKKKKKGGPF 


4012 


17913 


A 


4036 


446 


194 


KKKNQPGGPPPPPAFPVGGLIKKRFPPT 
PGBCKFICFR*KFFPPKGGFFVEDPPPKPG 
CWEREKKKKPLLNTDKWVILEGSEAGLF 


4013 


17914 


A 


4037 


213 


1 


GMAKVKARFLPQKRGQRVLNPPWWGKGF 
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G=Glycine, H=Histidine, I=Isaleueine, 
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PNSGTXETPFFFF*DRVLLCQPGWSAVA 
QS*LTTASTFWAQAI 


4014 


17915 


A 


4038 


307 


36 


RKGFFFSKCGEFKKLPGPPLGGWYLKLI 
FCQKKFFHPKFPGFPG*IFLPPPKIFFI 
FSFEMEFCSCCPGWSAMGRPRLTATSAC 
WAQANS 


4015 


17916 


A 


4039 


2 


372 


NDTSLTVKS IGHQWE * TYE YTDYRGLVF 
NCYILPPLFLEPGDLRLLDVDNRVILPI 
EAPIEIINTSQDVLHS*AVPTLC*KTDA 
I PGRLNQTTFTGTRPEVYYGQCSEICGA 

HRRSMLIVLELI 


4016 


17917 




4040 


107 




LLLLft'lv.vi,J3KSLAK0.PHWECRGKI*GH 
CKLRLPS PCHS PA* AS PEAGTTGWTYGC 


4017 


17918 




4041 




351 


GSHYIAQAGLKNPGSSHLPASAAHSAGI 
TGVSHHAWLKI FFKESVSDYS PHLNSRK 
NAVYVSFFFFWKKS FVFVPQAGGKGLNL 
GSLKFSPLRLKQFS CLTLPRS *E5fGLAP 
PPPV 


4018 


17919 


A 


4042 


1 


385 


RPRRPDI E * QERRTQEVLQAVAKKVKEE 
SQLPGTGGPKNVLQPVPRAKAERPKKQA 
EASGLKKETDWLKVDAQEAKTEPFTQG 
KGGGQTTPESFEKAPQVTESI*SSELVT 
TCQAETLAGVKSQEMV 


4019 


17920 


A 


4043 




282 


TAILIQTP*SFTGAVILIIAHGLTSSLL 
FCLANSNYERTHSRI I ILSQGLQTLLPL 
lAF* *LLASLaNLALPPTINLLGELSVL 
VTTFSVDAA 


4020 


17921 


A 


4044 


2 


407 


GTRLE IMSENHG I F P FTLE I FKDNE FE E 
PYREALPTLKLRDSLYFGIEPEEHVSG* 
ESLEESCFATPTSKIDEVLKYYLIRDG* 
VSDDSVKQYTSRDHLAKHFQVPDFKFEG 
KDHKEVILH*RDLVCGVMDERSR 


4021 


17922 


A 


4045 


4 


380 


ENWLTIIRQAWHEADRNUMTTFFDPAGG 
GDPILYQHLF*FLGHPEGYILILPGFGI 
lSHIVTyYSGKKEPFGYIGMG*AMISIG 
FLGF1V*AHHIFTVGIDVDTRAYFTSAT 
I riAIPTGVKVFS *LATi:iHGSlIMK*SAT 
IIIAIPTGVKVFS 


4022 


17923 


A 


4046 


2 


94 


ADENUJTTFFyPAGGGDPILYQHLF*FF 
GHPEGYILILPGFGIISHIATYYSGKKE 
PFGYlGMV*AMISIGFIiGLIV*AHHIFT 

VGIDVDTRAYFTSATINIAIPTGVKVFS 
*LATLHGSNMK*S I PTPIL IFRPP 


4023 


17924 


A 


4047 


376 


138 


LLARQRQENGVNPGGGACGEPRSRHCTP 
AWATE*DSVFLLWREALIjTQTAPFNRLQ 
LLALPSEFFHFRILIVLQPHHSQ 


4024 


17925 


A 


4048 


145 


226 


EPIQACSSRRVCEPIQACSSRRVCVCVT 
PS RHALPGCVCV* E P IQACSS RRVCVCV 
RAHPGMLFPACVCV* EP IQACSSRYVCV 
RAHPGMLFPACV*AHPGMLFPACVCVCD 
PIQACSSRLCVCV*ERaHPRLLF*ACVC 
VCESPSRHALPGVCVCVRAHPGMLFPVC 
VCESPSRHALPGVCVSPSRHALPGVCVC 
V 


4025 


17925 


A 


4049 


2 


414 


ADRNLNTTFFDPA*GVDPILYQHLL*FF 
GHPEVYI LI LPGL6I ISHI VTYYSGKKE 
PFGYIGMV*AMIS IGFLRVI V*AHHIFT 
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VGIDVDTRAYFTSATI I lAI PTGVKVFS 
* LATLHGSNMK* SAAVL *TLGFIFL 


4025 


17927 


A 


4050 


472 


228 


LTRDGGACPFSKLFRRVRQKNPLVLGE * 
SCSEPKLCPCPPAKKTKDYPVSKKKKKF 
IWI INRTKIPRKRKKNKAYHKVKYF 


4027 


17928 


A 


4051 


238 


3 


RRHGSFAPLPPGLPLCISYIWLFITILC 
NILYLYFETGSRYVAQVGLKLLCSSEPP 
PST3*STEITGMSHGARPHNIP 


4028 


17929 


A 


4052 


225 


172 


lARYPFHIQSCTNMPSPNKSITSKEEDN 
GPGAVAHACNPSTLGGQGGQITKSGVRD 
HPG*HGLV 


4029 


17930 


A 


4053 


471 


30 


GPGGIKISGALFPLNKVKRVPWFPRPP 
LIFPQKGVFNQNGGDPGGGFWPGETPGR 
VPPPPGVFFVKSPFIiSLPFPPPPV*KTA 
VKTGFPYRPCGVPPRGLVFPKKKKSVLBT 
RPDLSMGPQPE+SLPLPGEPSVSHAGCL 
PQAWRG 


4030 


17931 


A 


4054 


425 


41 


NWGSGTLCPQKLGVFPPPPPGEGGVPGA 
PPPARLFFFFF* KKGFSPVGRGGFKFLP 
PKNPPPLPPQKVGIPEGGPRAGPIFSPP 
QLFFFFLTWSLTPSPRIBCSGAILAHCT 
PAWETEQDSISKQTKK 


4031 


17932 


A 


4055 


312 


1 


IKNARAHLLPQGSPPPLIPIWI lETIS 
LLIQPIALTVRLAANITAGHLIMHLIGS 
AAIiAISTIILP*TRIMFTIIIERTILEI 
AVALIQAYVFTLWSLYLHA 


4032 


17933 


A 


4056 


648 


123 


DPDPVPSARGCVPSQAPGRQAQLKKAWD 
GALA*LPPCLCRQELPSFSLGTGGPAVG 
LSVNS ERDGRLGEVS INAEFTVAAFLKG 
TQSRRGINGMPLPQREPSSRYCLGLGGE 
SSAQGTSGLTGQRV*QNSDLPELASGDD 
KGQRSSQGGAVSQSSPRGRQIPPSSPPA 
ALFNFL 


4033 


17934 


A 


4057 


135 


464 


QHNSRLMQQAKKGVTVLflRVI DLD YEDE 
ISLLLHWGGKEE*AWNTGEPLGCLLVLP 
CPVINVNGKLQRHNPGRTTNGPDTSGMK 
VWVTPAGTKPQPAEVL2VEGKENKECG 


4034 


17935 


A 


4058 


3 


575 


LRSRPLHIILSCGLLVTPRSLSPPPPPQ 
RUILCRPSRGAAEFFFSLRTKLHFATLP 
LRIEGSKTL*KCC*LYHGC*IS*IHHGG 
LFDVAAKNI IHEEVEKYDKQYRGKELLG 
FATYKTFE I IVHQY I Q*LVEPALSMLQK 
AMETIQQAFINVANKHFGEFFNLNQTVQ 
STIEDIKVKHTAQAEYMIQLQVR 


4035 


17936 


A 


4059 


389 


3 


TFKGPKVFRYPTFFLCPPPRGFFSKNIW 
TPPPPPFFGVFPEREKIFFFLKGALPLF 
SFRVTPFFGLKRGFFLSPTPPFFFPPFF 
WPPPFFFFFFFFFPLRRSFALVTQAGVQ 
WRDLG*LQPLPPGFKR 


4036 


17937 




4060 


397 


1 


KRRGNFFFKKGISLFI*TPPRGPLWVFF 
F*KAPKGFCAKFFPFFYPPLPKENGKTH 
RQKTPFPHFCQ IFKNGFWVLHAFSLGDF 
PARVWGPPFKEFFKGIFKEFPPYGSPPP 
KGPKKKKRTAARDLELADAW 


4037 


17938 


A 


4061 


38 


410 


AWISI *ERPFTLVRVFKISIFLAISGLLFS 
YFRINFLYVQIHIHTFFRKLFVLSYVFR 
FTDVKFFFYILPFFPPHFLFGWICSYAA 
L*FFIYFCFLPFFFNGPTSP*IGLFLLI 
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FFFKKPTFGFGG 


4038 


17939 


A 


4062 


405 


81 


GGANVFPQKSPPFGNKKGGHFFSYKILN 
NPGHPPFLKKKKNAPGGWGWPWFS PPWK 
GaL*PKKKNPPKKKKKSPPPKGGKKKNP 
FPPKKKKKKKKA*GDNFDMGTGN 


4039 


17940 


A 


4063 


415 


239 


RPSFFFFPPPKKSCFSTFFFFLSPGFFF 
PPPFFFTPPPFFFFFPPKKKKYFFPPPG 
KKFFFF*PPPPSLFFFFFFFFFVFFFFF 
FFFFFFFFFFFFF+GLTSPHKQIGLVLA 
FLLALSKI THAS I PVPVSS PSKKKKKKK 
KKKKKKKKKKK 


4040 


17941 


A 


4064 


95 


241 


SPCRSPHRWVNS*ANNLI*LA,YTIAFIV 
KIPLYGIML+LPKAHVEAPIACSSIVLAA 
VLLKLGGYGIIRLTLIUSPLTKHIAYPF 
LVLSL 


4041 


17942 


A 


4065 


28 


288 


SASQSVSITGGSHCAQP*VTILFTTCTL 
HVNG*I*CIKFLrHNIPILVISLQYIFG 
FFL 


4042 


179'I3 




4066 


338 


134 


PGGGCSEPRSGHYSHCTPAWQ*SETPSK 
KNKKKKITYFSVKNWGIRVFIEI I LNS F 
TTVRP 


4043 


17944 


A 


4067 


26 


365 


WVSLMTSALAMRVDFYFITLLILGLLTS 
TLTIYQ*WRDVTRESTYQGHHTPPVQKG 
LRYGriLFITSNDCFFAGFF*AFYHSSL 
APTPQLGGHWPPTGITPUSfPLEVPLLNT 
S 


4044 


17945 


A 


4068 


491 


174 


TQLKTH*GSTTDNRTEVEVEPRVETNYK 
DLLKFL*S KGYDFESETETETIAKLVKY 
MYDNEESQDTSFTTLVERVIQQLVLSHI 
FKDNYflNINAEKLSGN 


4045 


17946 


A 


4069 


210 


402 


NVSKGr.HKVPSTQ*VINISCRPGWARA 
CGPSTLGG*GG* ITRSGVRDQPGQHGET 
PSLGPGAV 


4046 


17947 


A 


4070 


202 


1 


DSAIALQPRQ *EQIISVSKKKI YHT IDLR 
SEYFARWSFTLflARAGVQWRNLGSLQPP 
PPGFKRFSHAS 


4047 


17948 


A 


4071 


412 


143 


FLaAGV*PELGPKG*TPFLLKNQKLIGH 
GGGALYSQYFGGWGRSNSFNPGGKGFTN 
QNFPPSLQTWGKKGVSFSKKKKKERKTR 
KWGGE 


4048 


17949 


A 


4072 


2 


324 


RRGNFCMF*SDRVSSCCPSMS*SPGLKR 

SSCLSLPKCWDYRFEPLYLARLVLMRCY 
S THNT Y I MYCQMQRLFQGHRYLS VSTS S 
ATRPYLDNFFFFCKNKVMPCCQG 


4049 


17950 


A 


4073 


2 


326 


RRGRMLPRTP6RPGNDAQQPQKHSLVD* 
LFIYLFYFLWTGSHSVAQAGGQWRNH6S 
LQP*PPSLKGSSRQSAGRVKGVSHCAWL 
LLDFLSFGKPYTYKKVDKWSFEA 


4050 


17951 




4074 


422 


116 


EIKQEKNPGPFFFFPPPAKRGFFPTPLI 
WGPPGFPPSPFLKRRQGGFFLGPL*KGN 
PPVFPPPKVFFFWAGPPFFFSFKKKFFF 
LVPQSTFIHFIFHICYLL 


4051 


17952 


A 


4075 


3 


294 


VFCHVGQAGLELLDSSWLPSVASQSAGI 
TDTSHLT*PDLSFCCKTKFD 


4052 


17953 


A 


4076 


60 


389 


PSAQLGSTKHIHMAVQP*PPPSPELLHL 
PKLMLCPHSAHTPRPPPNPGTVCTYSPR 
SSPAFPQEGPSSGEGLGGLVSLFSSPPN 
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nucleotide insertion 














HVHP APTS PE SGRGRGGRVPREGAYV 


4053 


17954 


A 


4077 


93 


385 


KPLFAPGVLFGGPGEPKREKKGGAQGGP 
KKFSPPPPPFSFPPPKTLWVAPLFQETQ 
GLRLCPPFFF*RGKGaLF*KRGGGRFKP 
PPPPPGLLGPPTP 


4054 


17955 


A 


4078 


2 


365 


RKVEGRVS*DEDLKLTELLRYYMLNIEA 
AKOLLYRRTKAIilDYDNSNKALDKARFK 
SKDLKSDCMIPRDG*RA*APYFLFAKNE 
LWPTGEQRHFLQDVPLIVQRTLTLRTP 
DHTSLPLSL 


4055 


17956 


A 


4079 


5 


298 


PLAIMGGFFTLAETNRTPFDLAEGESEL 

VSGFNIEYAGGPFALFFIAEYTNI II IN 
TLTTTIFLGTTYDAPPHP * LSPSLFYYE 
PPSPYPTPWSTST 


4056 


17957 


A 


4080 






CTPAWQEGKTPSQKNKKVLLIALKV*SI 
FHLMES*KYLKPGFSLDLITVIPTYFDL 
FRC 


4057 


17958 


A 


4081 


347 


1 


VLKPRPGNI I FS PKKKKNLPPPPGRNYF 
FFFPPPPFIiFFFFFFFFFFFFFFFFFFF 
FFFFFFFF*RGGLLLFRLKIQLPPTPHP 
YLSNTLRGWNPGEGf5CWHSEC3GRGEDCL 
QDH 


4058 


17959 


A 


4082 




354 


STFI ISLFPTTIFMCIiDQEVI ISN*H*A 
TTQTTQLSLSFKLDYFS I IKKKKKKKKK 
KKKKKKKKRGGGGLKKKKFNGGGGGKKF 
FFWGSKKKNWSGSVKKPGGGKKRGGKKK 
KG6GEK 


4059 


17960 




4083 


384 


85 


FFFPKEFFGCPRFFFSPPGFLTPPPLQV 
LEFPIKKKLSPPPPPLKNFFFYTPPPFF 














FPFFLLM*AKKFISWR 


4060 


17961 


A 


4084 




122 


PHRFSRDGLDLLTS*SftELGLPKCWDYR 
REPPRPAFLPQP 


4061 


17962 


A 


4085 


223 


401 


WGKLVSVFFGFCFCFFEM*SRCVRQARV 
QMRIFSSLHFLPPGFERISCLSLLSRWD 
YRC 


4062 


17963 


A 


4086 


!60 


408 


ALPPRRGGGGAPSFNFCLLPPGP*GKKG 
GYGDAPSCRSGLGQHGETAAPLKAQKSF 
GHGGRGYQVRRRVGPPGPGGRRQKLNEG 
APPPPKLGGKAAP 


4063 


17964 


A 


4087 


261 


81 


YDIDGRIFCQKLESSHYSPGAVAHACNP 
SPLGG*GGQITRSGVQAIHPSWPPEVWG 
LQA 


4064 


17965 


A 


4088 


428 


140 


QPLKNPPLGGRAPPFLRGKKLPPPPPQG 
EPPPF*KKKKKKRGGGGPPFFPPPPRGE 
PKKSL*PQKGGGKKKKKSPPPPPQKKKK 
NPPPKKKKKKKK 


4065 


17966 


A 


4089 


1 


205 


FRLRQENCLNPGGGGCSELRSHYCTPAW 
VTEQETLSQKJCNFFN*KTFHKAKSTGNF 
HMPNTSIQQLPI 


4066 


17967 


A 


4090 


154 


410 


I TGCS II KYFYRAMREREKSE VGYFPS S 
SIMTPLEHELSDGVLDSRITSLSTHSPA 
PFIKSKNRPGTVAHACNPSTLGG*GGRL 
TK 


4067 


17968 


A 


4091 


428 


57 


LPHPPPPPPPLSSSSSSSSSSSSSSFPP 
P PP LL PGQGRTLTRP LPAPAGQLG SRE S 
CGAPPSSLGASLHVA*WADGRQ*RAALS 
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TPHGPLTGSPAAWRGERBQDLLLLLCS 


4068 


17969 


A 


4092 


431 


3 


APTSNKGSTIPILQIRKLRATPCSSIGP 
SPSTLPVKLEL*SVLLPGQWLNFSFCHF 
FPRPKCSNRPRYTLPVPKLPSSGLPPRP 
SPSQQVTAFPPLTSRKEPSSPSTPASDA 
FPFPPAPVSAERQEVICESIPSVTMHE 


4069 


17970 


A 


4093 


151 


1 


S IMPMWMnAVaHACNPCTLGG *GGWI I K 
SGVEDPPDQHGBTSSPPKTQKL 


4070 


17971 


A 


4094 


35 


363 


LQSKIYCTFFFFLERGFFFSPRVGRPGA 
HPGLGAKTTGVKaNPPP*PPKGGDPGGV 
PPPPGCFFWFFLGKGGFFLLPGGG*NFa 
KKKKPLVWPPKGGELKGGTPGPPPY 


4071 


17972 


A 


4095 


376 




LGKSPPFLGAPFFLKKGSLLGPLLGK3F 
TPP *LGGPHPI.EKGPLLGQW]SrPNQKVWK 
KRGFPQKGGENPPFFFFFFFFRDGASLC 
CPGWPQTPGLKQSSNLStPSSWDYRCLS 
KSLFKENTETLRS 


4072 


17973 


A 


4096 


65 


446 


PFFLFFLNKGPLFTPKKQGKPRNFPT* T 
PCPRGKKNPPPPPP*GGEKRGGPPPPGK 
FFFFPGERGFPGGGVLFLDLGTPPLGPL 
KGaEKRGRPPGPGPKKPFL*KRKKNSPP 
TGGERGRGGKKHCLP 


4073 


17974 


A 


4097 


l4 


387 


I KEADEYVLTRLACVFVGDME PLSS PHR 
LHTRIGS SP STTCCTPACPCS SW* HGAS 
VIPSQTAYEDWFITLYNVLYTSLPVLLM 
GLLDQVGASHKQGHFWTDENALEKSQAN 
VSMHSSGVYFFL I KP YYNSWRHQGSWL 
CFLFOJKERGF 


4074 


17975 


A 


4098 


80 


422 


ILHVI.YTCIILCFVLQS*ILWFVLQNYI 
LHVLYTCI ILCLHVLYTCI lYMCYIHV* 
FCILALFAGLS*DSPYHESL* * SFFMEA 
DISLCRCVITDQLNFICSLLQH* ELRMF 
QF 


4075 


17976 


A 


4099 


74 


402 


lYLLSTHLHYQFSWISITFDISFFFFFF 
LKKESLFFPQRGGGGGNLGLLKP PP PGE 
SPFPPPIFRGGGKKGPPPPPWVIFFFF* 
KKt3GSPIYPGGG* IFGPGDPPPPPP 


4076 


17977 


A 


4100 


3 


335 


DAWASAWVTEYDSVSKKKKKKRGGGGKK 
KQFPKKIGPRGSLFKPRGGEKPLSLKTP 
PPPFF*F*KPPFKKGGAEPWEPFLKPPF 
CLKDRGGAPKIYFFPNPPGAPRGAAFIK 
RGGGK**GPPGGLLL 


4077 


17978 


A 


4101 


406 


1 


RFP KP FP PKPKMS KWALFK* KGFFP PRE 
LLIGPPPPPPKKKKFFPTLFFAKVLPPR 
FF * NFNQVSKKRPTFPLKVGQ * P PPMLV 
QKKPTRGFPALFKKKKRSSLVIREI KRK 
PTMRYHLTPIRMAVFHKSKNNRC 


4078 


17979 


A 


4102 


2 


183 


WPGGGACSEPR* CLCTPAWVTERDSVS 
KKRADNDKQGALRSFFLGFPFFLPLHPT 

SFAF 


4079 


17980 


A 


4103 


119 


391 


DNLQCFFFFFWKRGLLFLPRGGGFKSTG 
PFSFGVPGTPPPPPPGKGGLTPKPPPPG 
* ILFFLEKGDFPLLARW*IIRQKKKMPP 

PSPPKGG 


4080. 


17981 




4104 


413 


64 


GFKPPPPKDPPPPPPQKGGFMGWRPPPR 
PFAPLFS PP PPRGGGGI FP *GGGGGAKK 
FHIWGPFG*KKPPPGGG*NSGGQKNPPG 
GGGKNPPQKKKKKKQPTTANIWFLCLE I 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LSGM 


4081 


17982 


A 


4105 


2 


196 


RGRVGKVLISLFNTDLLWLLCILICFHK 
TQ*LCNG*CHSRMKCVCVCVCARVCVCV 
CVCERERD 


4082 


17983 


A 


4106 


1 


335 


FLVET*FHPVGIiAGLBLLTLGDPPR*ST 
LGDPKCWDYRCEPLDPASVSFFLCLKLF 
IFVLLKLFFFFKPPLYSKIiTRGRDNNYlT 
YFPPPNPGPLPHWRADKSIIGGNERLL 


4083 


17984 


A 


4107 


1 


148 


GERGCSEriRSCHCTATWaAE*DSIYKKK 
KKKEPPKFFLAGFSGGGKKTP 


4084 


17985 


A 


4108 


331 


41 


GLNFLAQKFSGRFSPPPPGRVPLGPKPF 
FFLGKPPILEGFPGP* KVFFSP ILPPKE 
VLTFFSPPGEKIGPPFSGGACSGFPPKG 
AKKKKKKRPISSN 


4085 


17986 


A 


4109 


137 


3 


TKKEENINRGPGAVAHTCNPSTLGG*SG 
RITRSGDRDHPGQHGE 


4086 


17987 


A 


4110 


3 


269 


GFHRVSQDGLYtLTS *SACLRLPKCWDY 
RHEPPHPGFLGFFCFFFFFRDSVQPGQQ 
SENPSHKKKKDEGPFWKGAGKI FLFKPN 
SSAL 


4087 


17988 


A 


4111 


196 


330 


GAIARGRSNCDCI *GS VGHVL*NDPTLN 
CILDQDTLYGSIMINEYYBRASDAWALL 
PDEEQLPEGDQTVIA**A*DMSGGQQHR 
VSIiftEAVYSGADVYLLDDP 


4088 


17989 


A 


4112 


426 


241 


LLKRVRHEILFTIRGKGFSEPRSPPCTP 
AWATERISVSHKEKKRKVLVRP*VDLSN 
SSLSVL 


4089 


17990 


A 


4113 


394 


1 


GPGVFSPSGCLYHARAQYFWPHKKKKYF 
PHPGQKITS VFLKGRPLFGFGFLLI FFFF 
LVEMGSCCIAQAGLELPTSKNPLTSASQ 
RAE IKDVSHRSQ * FFVFLFETGSCS VTQ 
A*VQWCNHSSPRTPGLKQF 


4090 


17991 


A 


4114 


328 


2 


KKRAHPIWPDTGIEVPNSPSPNIRFLGA 
PESFPPKRNLPQGDPLEKPNPCYFFFQG 
VTTPPFPLFPPPPKI PFFFF* DGVSLCR 
PGWSAVARSQLTATTHLPGSSDSP 


4091 


17992 


A 


4115 


220 


3 


FKKGEPGDPPPVWFGLGWWVWVGFFFF 
ERHSFTLVTQVGVQWCNFSSLQPPPRGL 
KSFSHLSLPSS*NYRH 


4092 


17993 




4116 


342 


2 


PINFLYLSFNP]SIQV*RSPNFSKILHFCF 
KILSFFYTLAFLPFSSQLIRQSFFRVYL 
INT I CFLQDLYYLYILNLYL I LFLFFHL 
ASRLLDFCLFFVFETVLLCHPGWSTVAQ 
S 


4093 17994 

1 


A 


4117 


124 


352 


TSGFT*THMKSKIPT*RSFIIDLLEAKM 
KEKFLKAARETQLITYRGTSIQMPVDFS 
WRKTEAKAQWNLMFEVLKEKNCETRSLH 
PATMSFRNEGKTKMFSDERKLRDSVTLP 
LKD 


4094 


17995 


A 


4118 


11 


387 


KTGKLI , ATS APGDG I MVMVET F CL WS F 
FL S VS KKKKKNPLPQ IKNPLPKPCWVFK 
RAPTPNH*KAFPGPPKTPP*KQPKKQIP 
CRPPKKGPFCI*PNPGINPSPLFFEKKN 
KGKAPPFKNSKQK 


4095 


17996 




4119 


2 


407 


NTFQDQSGSSSNREPLLRCRDT^RDLEL 
AI GGVLRAEQQ IKDNLRE VKAQ I HS C I S 
RHLECLRSREVWLYEQVDLIYQLKEETL 
QQQAQQLYSIiLGQFN*LTHQLECTQNKD 
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E=Glutamic Acid, F=PIienylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
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X=TJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














LANQVSVCLERLGSLTLKPEDST 


4096 


17997 


A 


4120 


157 


2 


LTGSHSVT*AGVQWCKHGSLLPRPLGSK 
RSSSLNFLYSWDHRCGPPHSMJF 


4097 


17998 


A 


4121 


3 


168 


IM,KSGRKVDIRELGHWCPGWS*TPEL 
NQSTCLSLPRCWDYRREPPVfLALNMF 


4098 


17999 


A 


4122 


-^r 


376 


ETGFHCVSQDGLCLLTS *STRLGLPKCW 
DYKMREPPRPADGFINIEDFNPLLRTST 
CCCEIVKICniFASPFTMIVSFLRPPQPCR 
TDPIQNTLPLVIMSPVAILGCDSFSAFP 
VLMNLPFSSDCTK 


4099 


18000 


A 


4123 




221 


GTQLLLRIiRQEDLLNLGRGQCTEPRLHH 
CaPFWETEPDPV*EKKQAAKAKNSLLTA 
ALLKTQVKKDSVAW 


4100 


18001 




4124 


2 


125 


AI IKKTKSNRWW*GCR*RGMLVHCFVIKC 
KLVQPL*KAAWRL 


4101 


18002 


A 


4125 


425 


152 


TPVILflLWEAKaGRSLEPRS*RSAWATW 
QNPISTKKYKK*GGRITSAWGG*GairGL 
*WCRSTPASATETPSQRKKKK*EKNLFT 
LILKVIK 


4102 


18003 


A 


4126 


3 


99 


CQaGLELLTL*SDSIiRLPKCWDYRCEPP 
RPA 


4103 


18004 


A 


4127 


396 


58 


ENCNPLSWSGGIISGLLLVSGYSNKFPS 
FLVCLLPVIGPPENTAGPSLVPGTKWHD 
HSTL *TSQTPGPSLVPGTKWHDHSAL*T 
SQTPGPKLSSRLSLPSSWDYHHKTPYPA 
H 


4104 


18005 


A 


4128 


31 


549 


LGQSYLLLLRKCFSEHFQVGDLDISYIN 
lEGITATTSPESRGCTLWPQSSKHTLPT 
ETSPSVYPLSENVEGTAPP*AHQSFMSP 
PSWGGSPLHLFFGGGGFEKEQSPLKKKS 
FTLYPL6PPSEGEGHF*PSPLCFPPGFQ 
SPPKTGIPSGDELGFF*TTPGPDKKK 


4105 


18006 


A 


4129 


97 


362 


RK*SACLSLPKRWDYRHEPPRPACSC 


4106 


18007 


A 


4130 


376 


279 


DQDDLDLIiTS*STHLGISKCWDYRHEPP 
HRAS 


4107 




A 


4131 


133 


344 


IIFFFF*NFVLQAGGQGG]SrLGSLKPPPP 
GLKQFLGITFGRSWDHGPTSFTRANFCI 
FRKKRVLLCCPGWS 


4108 


18009 


A 


4132 


164 


3 


M]SIEMSYESSLLDHLLKRQ*NQCCLGSSE 
PSPPRLKQFSHLSLPSSWNYRKTSK 


4109 


18010 


A 


4133 


355 


2 


GKKQNRKTGNFKTHSASPPPPPKERSSS 
RATEQSWMENDFDEMREEGFRRSNYPEL 
REDIQTKGKEVMTFEKNLEECITRIPNT 
EKCLKELMELKTKARELREE CRSLRS * C 
DQLEE 


4110 


18011 


A 


4134 


193 


357 


HHCNPSLCQNDLFWHLVLSPPTGVQWYN 
HGSLQP*PPGLE*SSCLSLPGS*EYRH 


4111 


18012 


A 


4135 


357 


104 


LRRL*YENHLNPEGGGCSEPNLLRCTPA 
WATEQDS ISKIIIII* CNFLH* QNMQKI 
KQNPKTNSSKILTEIKISLKGNLRELRE 


4112 


18013 


A 


4136 


179 


381 


DTSLHIIQIIRLSISKKKL*TLYFIY*DR 
VPLCCPGWSAWQS*FTAALTSLKRSSC 
LSLLSNWDCRC 


4113 


18014 


A 


4137 


89 


318 


CLEISCTKIGQDLYTENYKTLLGKVCKC 
PNK*NTTFMDQAEVSILKISFLPRLIYR 
LRI IPVRIPAGIFVEILKFG 


4114 


18015 


A 


4138 


315 


60 


KKLPAEIHAPLAEEATD*AELKPAGETH 
AEVQAPTEQTPAAEAATTIAEASVKVQP 
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nucleotide deletion, \=possible 
nucleotide insertion 














PPAEEASLAKFPAE IQPPADQERGSED I 


4115 


18016 


A 


4139 


377 


51 


GFTS PFPED* RI PGLFSCIC3WTLQSFHF 
KMIGSVPFS PPGFFFFPQEMRVLLCCPG 
WSQTLGLKPSSCLSLSNSWVYRCAPLVP 
GMP I F*LS CFLAIGFLEFVI YIGY 


4116 


18017 


A 


4140 


413 


175 


NFGIFCRDGVLLCCPGWS * TPRLKPS S C 
liPKCWDYRREPRTVPGLPFLFLNtiRKDG 
NSVLEIYLBYKITQIIVQRLIiVL 


4117 


18018 


A 


4141 




394 


EIFVTKRWSGLGWAHftSNPSTLaG*GG 
QITHSGV*EQPDQHGSPERRGLKERRGL 
MVMK&TFLGLRHHLEHIVGHRELEDLHN 
NSKKKISSHPLSTPHVPGTVLKPC3JLIL 
CFKHHQKPVRKPPNSPSYR 


41!8 


18019 


A 


4142 


22 


376 


LVWKGKTIFFFLKRGWLPPKGKGRGIW 
TVlirGSIJSrFRGQGNPLA*PPKKPGTKGGG 
HNPGEI**FFGKKEA*QCGPGaSGIPGP 
RRPSGLTLQKGGTYGREPS PLPQPTKEN 
PRLTKT 


4119 


18020 


A 


4143 


161 


1 


PIPAKYEHFCFFVSLF*DRVSLCHPG*S 
AW*SQLTTaSTFQGSSDP]:.TSATP 


4120 


18021 


A 


4144 


18 


159 


KHLDPGGGGCREPRSCHCTPAWVTTAKL 
HLKK*K*VELKSPSVIHT 


4121 


18022 


A 


4145 


383 


109 


LFFYKIQNYTGH66PCLLTPFLQRVKQK 
NCFNPGG6GCR*PKLVFCPPTW6KKQGF 
VFQKRLKKPKPKPKPKPNWRQLPLPPFS 
KRPKPKI 


4122 


18023 


A 


4146 


374 


164 


QCQLLRSLRWEDCLSWGG*GCSEP* SCP 
CIPAWVTARSCLQINKSLPAEIVI PSLY 
NQIAEG 


4123 


18024 


A 


4147 


25 


375 


RKiCKALFFCQGGSQPPPSNIiMDPPPPGE 
KKI SWLNLFPKKEKKKF*KKKKFF* KVK 
KNPFLKPKGRPPPGEKKKGKKGKKPTPQ 
KGKGFLFAPPPKKPIKKSPHFKIPKKQK 

NFWQR 


4124 


18025 


A 


4148 


376 


202 


hynskvfpgGpkrsflfflkalsfffap 
apipfwlhskifffff*drvslchpgwk 

LR 


4125 


18026 


A 


4149 


3 


144 


LFCPDWTPTPGLKQSSHLGLPQCWDYRH 
*VTVPGLPISFVFFFFPP 


4126 


18027 


A 


4150 


3 


188 


QLQSQLLSSLRPEDHMHLGVEGCGKLSL 
HHCTLA*MTE*DSIS*KKILKKKCF*KK 

HSFLY 


4127 


18028 


A 


4151 


83 


358 


GWARWLTPGIPALWEAKRIKSLEDLNA* 
A*OTYAPS*EESLHPIMNTLIRDPVWT 
SFRVTVDTSTIAEHWLFCTfDLLQASIGH 

ILILGPG 


4128 


18029 


A 


4152 


162 


1 


TVWYWHKNKLSNQWNKI*FLELDPITYK 
HLVYDKTGI SNHWVKDALVNTQCRAG 


4129 


18030 


A 


4153 


369 


20 


FQLLARQRQENGVNPGRGAFG* PKSRPC 
PPAWGTK*DSVSKKKRKKKSCEFVHRIL 
KLEEAYHSLGSNTAQRLFVSCFHKKILC 
LSISIPDPYLQFKMWPVPPSKPLYQIDW 
VGLL 


4130 


18031 


A 


4154 


117 


2 


QIFFFFFFFKMESHSIAQAGVQWRHLGS 
L*PLPPRFKR 


4131 


18032 


A 


4155 


369 


217 


FFSPPPFFKPPPPPKKXFPP*KKKNPPP 
PKKKKIFFFFFFFFFFFFFFQTP 


4132 


18033 


A 


4156 


214 


283 


ASPSEGQGLSSGP*GLIW*SDKPSVTHS 
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C=Cysteine, D=Aspartic Acid, 
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X=l]nknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PKPHCETA*GGTSFLSSSGSSPQAPKSP 
HHGPDSPLSDSIPMVISPPSPPPLRAKG 
CPQGPPGASWWGTT 


4133 


18034 


A 


4157 


319 


712 


QDFVSKQHSYCLKNLKSSSLIQVAMNRA 
QVCLISSSKSGERHLYLIKVSRDKISDS 
NDOESANCDAKGNNFI CIMLYKIWLN*K 
RLMK*HWTAI*LKIYFSNKMKFSNLNLS 
SVINSFDVKKERLNANTLN 


4134 


18035 


A 


4158 


183 


369 


LKRKQSQRGEVNFLKVTQPECGKRGILF 
I YYFFEMESHSVGQTCJVQWCNLG*LQPL 


4135 


18036 


A 


4159 


190 


336 


DPISIKNFWPGTVYFLIETGFLHVG*AG 
LELPTSGDLSTIASQSAGITG 


4136 


18037 


A . 


4160 


10 


390 


QGILLPCFSMSSEERGRIYSNTFSFFFF 

FWKGSPINPQGGSQAKKPR*REPSPSGL 

TPQRPGNGGP P PPPGQNFFFKKKGG* PG 
G PG3A* T PGPSGT P PLGPPKAGNKRGGP 
KGHAKNFLKNQNGPT 


4137 


18038 


A 


4161 


53 


376 


YLFAFFFFSKREAWQGGQPGMEGAKKKF 
MAP *PSKGGGGEFSG1:jTPPNQWNPRAQP 
PPRGELILILEKKRPAPNAQPGPENLGP 
REPSPLAPQKGGNKSENPWPPPP 


4138 


18039 


A 


4162 


T 


356 


GVFILVSFNFWAFFLSFFFLGKQILVLM 
PQPESQGQNLAPQNPPPPG*RGFPALTP 
PRTGNKGMAPQAPQFFGFLKKGGFSHGG 
QGGFKTPTLGNLPPLAPQGLGNKGLGPW 
PPPFFF 


4139 


18040 


A 


4163 


345 


26 


APGFKPALGPPGGPFFSKKKSKNLPGLR 
GSTLQAVGPTFLGG*GKRSLKPPGG*AS 
RGPGWGPSPPPGGPN* IPVSEKKEEEGK 
KKKGREKKKETFSSNLLIFLSF 


4140 


18041 


A 


4164 


18 


284 


TLQGTSGIFEGNFFFFLGTGISLYCPGW 
RAWGGKHGSLQL* PPGSSSPASTPLGRS 
WDYKHVPPGWASFFFFKTLPNGVTQAGL 
ELWG 


4141 


18042 


A 


4165 


377 


30 


fsgkkkffqlgwgcgppplipplwgv* p 
gqpprvggfnpplppwknpvffkkpkqp 

GVGGASPYSPFLGKFRPRIPFTLGPBGS 
G*PNFPPFPPPWAPKKNFSPKKKKKFYN 
CLW 


4142 


18043 


A 


4166 


305 


1 


BSRQVLTSRLTLLSIHEETSIYLLMNSTYK 
HITSLPINIRQYGQDLKTLKKETKDLNK 
*GNIPCLGIGRIN1VKMSLLSKLIYKFK 
AMPVKIPGELFLRNQQA 


4143 


.18044 


A 


4167 


394 


78 


egkkogrkkgrregs*egrrerektr*k 
aekegmkeggregkkegrkegrrkgrke 
grkeegqkggrtdggwwvlrsldtsqli 
sfasgpkwreasslvwaslcl 


4144 


18045 


A 


4168 


1 


431 


CNTCVR* CAYWCV*MCGERLCWCVCSCN 

rvccv*vervcvlvcmsvcwcvcccarq 

CVSVCHSRIESSRPSSGPVAPSKCQTHL 
YLPKLSAPPAAGLWDLPSEATASLSGSL 
CQPSCPRTRSFATLVSSMYHRPVTGELQ 

SIT 


4145 


18046 


A 


4169 


3 


423 


YRARE I EAAAGRDHATALQ PGYRVRPCL 
KKKVLKLGQAWWLA* F*NFFFETGSHSV 
PRLECSGVISACCSLIsajPGIV 


4146 


18047 


A 


4170 


118 


402 


QICTCTPKPKMKVKKITMLSNNMRYLYI 
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E=Glutamic Acid, F=Piienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S^Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HLTQDI * DLD IDL* MTADMNLKDLNK*R 
EIHCLRI*RLNIIKTSI FLKLPYRVHAI 
PFKISARSFVE 


4147 


18048 


A 


4171 


404 


285 


RGCGEPW*HHCTPAWV'rE *DLVSKNKMK 
QNDNLSHLSLA 


4148 


18049 


A 


4172 


274 


431 


ESLVGQRNWVAKYFYRITAPGRAQRLMP 
AIPflPWEflEVGGSQGLEIETLL*NNSSR 
PGAEAHACNPSTLGGRGGRITRSRD*DP 
PGQH 


4149 


18050 


A 


4173 




366 


FLCLFHTCVLAFPVQQLCSVGWGCKVEN 

ADREEKQPCDS QAREE PRLCRRGFNRAP 

NINTRIIEKLFFVCLF*TGSlrSVTQAGVQ 
GCGH S S P Q P QL P GP KR S SHT , S T ,RS SWDT 


4150 


18051 


A 


4174 


397 




TEPVLANETWGPWDRGPLGLGGLGPENT 
DNSHAVGADQLLKEPPFPLSTRSIMEPM 
SLNRWLDSPHREMQAGTPLSLCGDTYET 
QVM*SWGIRGILPQQPWKMGASLSLTAS 
LSLSFAHTLKHKHTHTHTLY 


4131 


18052 


A 


4175 


90 


387 


KGRWLFFPQGGERGHNG*TAPPQTGGKE 
NPPPPPPKDRGKKRNAPPPGGIWYLKKN 
GFFPIGC2GGPEPPPPRGPPPPAPPKBGE 
KRGGPPHRPGKKLLK 


4152 


18053 


A 


4176 


398 


2R0 


RRLKWEDPLNPRV*GCSKP*SHRCTPAW 
VTE*DPVSNFF 


4153 


18054 


A 


4177 


253 


395 


LFNFFFLVKMGSRYVAQAGLKPLR*GNP 
PASASPRAGITGGSHHTQP 


4154 


18055 


A 


4178 


366 


48 


PWASPGISLSFSFLRTKPTVKVR*YHVG 
PQRQPCPSRWAPPACPFSMFPPAWVSPP 
GSAASRPPRHAGSQVGATASPPPAQGIiG 
APAWTAGLGEKQKLAARMRGT 


4135 


18056 


A 


4179 


^4 


413 


GDGVNSAFFFFFFLEKKVWFIPPGGGPK 
PEFGFRAPPPPGVKKIFGFTPLRTGE*R 
PLPFPPGKGWPFKKNGGSPMWPGGV* PS 

DPKGAPPPGPPKGGE*RGEPPWVGWNFA 
LLKERKPLFKE 


4156 


18057 


A 


4180 


396 


235 


HEVSLCCPDWS*TPGLKYSSCLGLPNCW 
DHRCRNHGPGKTQLEQGHRQVA6LSR 


4157 


18058 


A 


4181 


85 


408 


YTCSLRITQDHFFFFFFFLGKKIFFFPP 
GWGAGGQILTSPKTPPGVKGIPWLNPPR 
GGGPGGPPPPPTLFLGF*KKRGFPGGGQ 
KPRPKGDPPPLPPKGGGLKGGPTP 


4158 


18059 


A 


4182 


1 


149 


NHLNL*GRGCSEPHLHHCTPAWVTE*DS 
VLKKKSY IHLGQNPTVRKVLL 


4159 


18060 


A 


4183 


310 


3 


ATAPSQFIRIN*RNHCHYIEVQSEAASA 
NLETAAnsniPEWLGKIMDEGG*MKKQIWK 
EMTSRIFIVEKSMPGFKTSKAQALILGA 
NIAVNFRLICPMLIYHLEN 


4160 


18061 


A 


4184 


1 


359 


PTRPLVLDRDRPPFFFFLGSGKKKNPGV 
FLYFGGGWTPPKKKKNSLGWGGGGLKKG 
GGGVFFKPLPFLGGGPFLKNSGGVDKKK 
RAPPFFTKTPL*KKPPPPFPPKIFKaRG 
FW*KMGEPFFFCPPPPSFLKKGPPPKKE 
GV*KKPPPPPFLTPPPPNQGNFFFFWGG 
STPPQNIKKPLGFFSSQTPKKKKRAADR 
DLELADAW 


4161 


18062 


A 


4185 




362 


HFTKYLFISC*K* *LLCWMQDCCiCKGIP 
MNTNVIQEKATSLYGSLKHKEGEGPKAG 
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EFNASKGWLDNFRKSFGFKKVKITGEAA 
SMFQKAADKFPDAIKKIIEEKGYLPEQI 

FNAGESAL 


4162 


18063 


A 


4186 


366 


47 


AALFLVDPPGRGGVKGPPPPPGPFFVFF 
QHRGVPPLSPGGFFFRPPNSPPPGPPKG 
GGSGGGPRGPA* THGVFLKKNPPNCGAG 
KKKGFFLKKKKKKPVFWFSVK 


4163 


18064 


A 


4187 


394 


112 


KNTEBTLKISWAWWHVSMVSAPNEAETGG 
STEGTGCSKL*SCHCTAWVTVRETIjSPR 
KFFiCKRANYLTFRI*KVCSLYHRKTKNS 
FKKHTQGRKF 


4164 


18065 


A 


4188 


230 


372 


KPMFLPFNPAILIJ,GIYLKEYKSF*HKD 
TCMRMFITALFTITKTWNQ 


4165 


18066 


A 


4189 


149 


2 


NRSLRFI)aRFKEGSHLPPl*VQGKAASAD 
VEAAASYPEGLGKTINKGGST 


4166 


18067 


A 


4190 


2 


159 


MHQPWLLRRPKQENCUStPGGGGCSEPRS 
HHCIPA*AIERDSLSKKKKKGGAF 


4167 


18068 


A 


4191 


180 


415 


LGFLQLLLFVGRSMFIFALELSHGWLGS 
SLSF*PVCVKQSSHLSLSSSWDHKHMSP 
YPANFCIFCRDKTLPCCLGWAQ 


4168 


18069 


A 


4192 


424 


41 


PPLLRAPPKGLL* PKNSS'PPGAPP* IPF 
PPKKIKIFKFTPGGGGAPLFLLPRQVKA 
EGSLPPRVFGPPPFFFPPLPSP LGAKPN 
PFFFFKKKKKKEWLHKSSLFYSAHFVLC 
LKYSLSFFYQSVKTK 


4169 


18070 


A 


4193 


105 


424 


ELKRLVIKLIRGIPEKGKAQCKEIQKIiA 
QEVKGEIFMEIGSLKKKQ*KIQETLDTL 
LKMQNALESLSNRI EQVEERNSELEDKV 
FKLTQSNKDPSQIKKKILYNV 


4170 


18071 


A 


4194 


3 


240 


LCLQSQLLGRLRQENPLNLRI *GCNEP* 
LHHCTPAQVTERDPVSKENKGSFI PMKl 
GKLVSLTLNRMCNCKNSEKENV 


4171 


18072 


A 


4195 


385 


1 


KMI I LTKKME I KHKDEKELQKTEVDLKI 
ENNTTRARLTNN I KKKPPESEKTNTKNH 
TKKSL*DQDTKKETRPQKKKNPFKSR*V 
RHWRMI LNLESRNRGKKNPSGVQKRRDS 
VPTLITQTQETWWWCF 


4172 


18073 


A 


4196 


3 


272 


LFYLIWALKHKSSDVGNSNIYAKEKLES 
ASFKERI *KSYVEVAKIYGKNEYCIYEI 
VKKEK*IMHSIYRVQYYLQFQTSLEVLE 
RIPHG 


4173 


18074 


A 


4197 


236 


379 


GGRFKGSNFTSPGLQGNSFFMGPPKLNS 
RAGV* QRREGKNPGVPQFNR 


4174 18075 

I 


A 


4198 


375 


1 


NFKIPAPPPQNKGPLRVGTPPFLIFSPP 
GSPPKSFPKAKWAERDRPF*NPGETLQE 
G*VFFSGRFFFFFFFRDRVLLCHPGWSA 
VAQSRLTTSSDSRVHTILLVGRGCSEPR 
SHHCNPAWVTRVK 


4175 


18076 


A 


4199 


1 


270 


PSRTGPQIPRRPTRSNCPNQFLGF*GCG 
EPRSRLCTPAWATEQDSISKKERHGLLV 
FLKGFIHYIRLLINLFAARGRKRVLFAL 
GGREMW 


4176 


18077 


A 


4200 


388 


1 


PPGIINLFDPGL*PPPPWVMGPPPS*KL 
ISPQKKKWKVPPPVPPINIFSGPPPPTL 
VFWVLFFVSLKRQLIL* *KSGVSAASLV 
RHDGAPRLTSSAGKGHAEPTFQPRPTHH 
RNACS FVLI HI QS YKVF 


4177 


18078 


A 


4201 


298 


158 


FKI VINFQFLTPI FAPSEGGRGG * GRGE 
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N=Asparagine, P=ProIine, 
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LRSCHCTPAWVTE*EPI*KRKIKGKKRT 
EQESRRQNLA 


4178 


18079 


A 


4202 


128 


358 


BCLMIPPSPGQPPRPGGRWGGHPPARPAA 
PSGR *GAPLPGRPY * EKRI PSARTPPRL 
GGWPSMSLRTGHDDNGGVWE 


4179 


18080 


A 


4203 


406 


1 


FFFFFLRQSFTLVAQRGVQ*ENIiGSQQP 
LLPGF 


4180 


18081 


A 


4204 


322 


2 


KPRPRKTPGGPFPT *HGGGAPPPRGGAP 
KGGSPPPPPFPPPGGKKQGKKKHNPPGG 
*KPWGGFFFFPPPF*TPPKKRGLFLKKK 
KTQNPiOCKKKGGRSRYRTSPRV 


4181 


18082 


A 


4205 


3 


127 


GFHHVSQNGLYLLTL*STRLHLPKC*Dy 
RSEPLCIACFLFL 


4182 


18083 


A 


4206 


1 


216 


GFRRVGRDGLDLLTS* SflCLGLPKCWDC 
GR* PPRPAHLGGNSNAKEPGLPACPLLS 
GHMHRAVWVRWATATL 


4183 


18084 


A 


4207 


427 


0 


LFFTTNFFFWARVFFLPPFF*NPPPGFF 
FLPHKKKKNFPPPPGFFFFFFYAPPPFF 
FFFFFFFFFFFFFPFFFFFFFFFFFFFF 
PWLKTWSC 


4184 


18085 


A 


4208 


2 


404 


PRVRieiWCKEbLFliCyPFFFLEftCSFCF 
LAQGGGPGGNFG*RAPPPPGEKKIPGPP 
PRGGGEKGPREPGRKIFGFLKKKGGGPF 
GPGGVGPPKKGPPPPKPPKGWGYGGKPP 
AQKRGGFFFGKKEAPFFNWVNG 


4185 


18086 


A 


4209 






PLFCLSEKPRYFEYAIYLMLSLKFLESC 
IMSVNSSAVSASVYY*W*KIEREPLTSW 
TLEKLDPEVEHQKFAFT*RAKAILRKKN 
KFEGLTLSGFKTHYKTTIITTVWY*HKD 
TQIDQWMRIESS 


4186 


18087 


A 


4210 


219 


392 


HFFFLNYNFLGKGVSFCPQAEGQGRNLG 
*LHPPPPGLKQFFCLTLLRNWNHHLVPP 

PP , 


4187 


18088 


A 


4211 


409 


2 


LERKTVFFSPPFIFFFAPVFFLSPFFYT 
PPPLYIFCPPKKKKIFPPPPGKIFFFFK 














IVFNISYNLIKVDSHLYLFKDKKILLCT 
ISSDAW 


4188 


18089 


A 


4212 


24 


399 


ADAFSTTNLHGLGPDLFTPTTQT.YINLG 
MAILL * AGAVN IGYRSKIKNALAHFLPQ 
GTPTPLl PILVI lETINIilLATALWH 
LTAIITAGHLLMHLIGSASLAGSTISLP 
STLTVFTILILLT 


4189 


18090 


A 


4213 


411 


197 


SWLTAPS FKRFSCLSLLSS *DYHLPVPH 
PDNFCIGSRDGARMVSIS*HHDPPASAS 
QSVGFTAPKVLGLQT 


4190 


18091 


A 


4214 


218 


126 


KEPEPLLFRSSWSVGAIDLKNVMRRVLI 
KSCTPGWAHTCNPSTLGGRGGQITRSG 
DRDHPG*HKSALGSCASCFIYLWDLLLH 
IVFE 


4191 


18092 


A 


4215 


1 


332 


MAPSLCLSNKKRGFIGPGFAGAPKHPGR 
GQGHPREKLAFFKKGPPKPGLKSFLVLK 
TPPPKPFFFLSPKGGALKTEEEP*TKKK 
GGAE KKPPPGEKRPLGPFCPKKPNWG 


4192 


18093 


A 


4216 




185 


KLYLSILISLQISLIITFTATELIIFYI 
FFETTLIPTLaiITR*GWQPaNFLYLLV 
SFHEN 


4193 


18094 


A 


4217 




388 


LRFCWETLFPLP3fNPFSFHFSFLFFSFS 
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FFFFLGGKFCFCPQPGGGGPNLNLRDPP 
PRG*KEF*APPPPHGGENQGSPPPPPNG 
GGF * KKGGLNHGGGGGFKPPPQKKPPPL 
ASQRGRNKGGAPPLLKF 


4194 


18095 


A 


4218 


3 


195 


VIYSTIFAGTLITALSSH*FFT*VGLEI 
NMLAFIPVLTKKINPRSTEAAIKyFLTQ 

ATASHPA 


4195 


18096 


A 


4219 


261 




EPERRERKERKEKKKGATPSRI CMKRER 
EERFS *ERRERGEEGERGEKEERERGER 
ERERSEEERERERERPKN*MRLNGRTRR 
TRG 


4196 


18097 


A 


4220 


5 


375 


DEMLHLKFTYIIiISr*TLBmTIIPKWSIENL 
YN*DF]liNSKVQGTPPPPRPSRPPSSPSP 
DPPPGPPPAGAGRRJ^PGRRGSPPGGP 
PAPPPAPPPRAGEAAGQRGRPPARPGGR 
GPAARPAGGGG 


4197 


18098 


A 


4221 


258 


267 


GGC3aLKKKIYCGG<3dClNPFF*SPPPSFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
lYFIFILIF 


4198 


18099 


A 


4222 


127 


330 


KKKKKKKKKKKKKKKGGGPFKKKKFLPR 
GGGKNFFF*GGQKKKLGGGVKKKGGGKK 
PGAKKKKKRGEK 


4199 


18100 


A 


4223 


1 


377 


RRFHLRRENLQEKQTWMGIFFFFGVAGP 
PaPGGKSFSPPSFFF*GKNTFFTPPPGG 
GGQNLVGENSNSNPPPPRDPPPSGPSEK 
I *KKTPPPFLVGPNTKNKKHFGGGGSPK 
KKTLLPPPPHDPF 


4200 


18101 


A 


4224 


223 


431 


IRKTGPLGFGGQNNPLPKV*PPPGLTFF 
QKBPPPPPFPPFQKPRFIiKFYPiCREGPP 
PPECVGLTPPPGLL 


4201 


18102 


A 


4225 


173 


2 


RCKLHCLFPTIAIITR*GtfQPERLNAGT 
YFLFYTLVGSLPLLIAIIYTHNTLGPTR 


4202 


18103 


A 


4226 


11 


377 


FI,TWFICISLFGRLMGI*FFVNFFCEL 
FFHAFFSLFLKTDIIFSIDLLEIIIYAH 
VCHICCNLIFSLYFVFYFNGSLKYIKC* 
NLFAI ILDFADVSFLIFMFRTGKRQNDN 
QFSLKILIGF 


4203 


18104 


A 


4227 


1 


412 


KITSKVKNATDLLKNASESPHSRIDEAEE 
RI SELEDRLFENTKSEETK*KRI KK* SM 
PTDLENSLKRANLRVIGLKEKVGKEL*V 
GS FFKDI I * QNVPNIiEKDNI QAQEAYTT 
LSRFNPKTTSRHLAIKLSKVKDKER 


4204 


18105 


A 


4228 


373 


163 


IGVFLGGAPFFFFFFFFFFPQGPKRGGG 
KGGASGKTG*GGGGLKIjGGGKQGWPPFF 
FLRTWPTFTRAPGL 


4205 


18106 


A 


4229 


392 


3 


PPC*NEAPGFNFGGPFKKIYSSPPPRGK 
FGSLKGPPPFFFSAPPGGPKFSKGGGPL 
FFGAPDF* GGFGGKNYFWGQGFP * PHS P 
P P P FS WGP KRRAP FS KKKKKKRRRGWGE 
GEGQGSNHLLKHGYFQI 


4206 


18107 


A 


4230 


211 


400 


YALIWLYCILNWYALHSVFFWCCFCFLI 
FKFIYVFLNLLFLFftF*LVLCLLKVQNH 
SGSLSIVSYFPLLEACRIFPLFPLF*NL 
*MI CFNMWLYFKQLCSALSVFLVLFLF 
FNF * I YLC I FKFI VFVCFLRRNPTLAQA 
GVQWHDLGA 


4207 


18108 


A 


4231 


69 


386 


KRI FFFLGPRGGGKGKMWVNGNP P P KGK 
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K=Lysine, L=Leuclne, M=Methionine, 
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GNFS A* PPQRGGPKGPGPKNKVI FGFKK 
KRGFYGGGQPPNLGKELAGPPKGGEKRG 
KTHPGGKFYAFLGGGLFSKKRN 


4208 


18109 


A 


4232 


394 


287 


FKEKKKPPPPPLFSPPFF*KRGGGDPLF 


4209 


18110 


A 


4233 


442 


112 


LLGRLRWEDGLS PQGRS CSEP * S CHCTP 
AWA7Q*DPVSNTNNKNPNPQNACSSLNP 
SLPWTAAVWRTLGEHCNGREAIiPARAKN 
ATKAPVLTTHHICQTVRTISCALLPH 


4210 


18111 


A 


4234 


415 


58 


SLQKGHFPPFQELDPONPGRSGWSPVLP 
TTGLFFSLDFIRFT*FMAGVLVGYKHCW 
EWTAALVLLLFLRDRVSLCFPGL IHTLG 
LKRSSCLSLLSSWSYRHTIPHPAST*FF 


4211 


18112 


A 


4235 


406 


168 


HGETPSFLKIQKLAGRGGTLL*SQLLGR 
WRQGNHLNWGGRSSVYIYTEKHTVHTQQ 
SVTTPFMSMSQTLAYSPLSLKSS 


4212 


18113 


A 


4236 


247 


397 


TVCFSVYIYTELLPPQVK* FPCLTLPSS 
MDYRSVPPLPAYFCIFSRDGVS 


4213 


18114 




4237 


36 


405 


RLYCFIKGLMVKNKTFRIVFFFFLETNF 
PFGPQGGGEGAWSGFPEPLALGVKKGPR 
PPPPGGGE*GAGPPGPGNFGLLKKKGVP 
RGGGGGPKPPTQRFPPGEPPQKKKNFFG 
PALGKKKVFFF 


4214 


18115 


A 


4238 


407 


3 


KKKKKLTRPGGGGPFSPLPKRVRQKKGG 
NPGGGFSKKKKSPPPPPPRGKKKKPFSK 
KKKRFLSGNNTGKNRLGKDPQS *PAGNA 
RKKPGWERALftRGRAPGMTDRKPQHSGL 
DGSQQSGTGQEPGNSGPVPKVH 


4215 


18116 


A 


4239 


144 


405 


PTVKLVLLYLVLLKVAVSKNLR* *GGLT 
V*QQRQILWKRILSNTHFLYLRQSKMMH 
LVL*PTVNTLKSLKSGQARWLTPVIPAL 
WEA 


4216 


18117 


A 


4240 


369 


40 


PLLQGTPASWPRWGRaGDRPRKEG*LHP 
RETRVPPALHHQPAPAGVSLTS PGPPFW 
LSLQRLRQHSSSDSLPAVCQYSGSWREE 
KAAAEAPALTPAICTCTIGSVLTWL 


4217 . 


18118 


A 


4241 


397 


1 


ISKSLFFPIPFWKKKKKGFSPPLNFLGG 
VFFFFFLKIFEIFFLKRFFFPIFLGAPG 
FKI FKKYFPFPRGFKTPPKKKIFLNPFF 
KKKGGGGGALFFFFFFETGSRSVTQAGV 
KKHDHGSLQP*PPRLK*SSDP 


4218 


18119 


A 


4242 


230 


393 


AERPGSPLPSAPETGSHSIAQTGVQ*H1T 
YGS P * PLTPGLKLS SCLSLPNCWDYR 


4219 


18120 


A 


4243 


434 


104 


KQPTFQSTTGGSPSAGRPPPPRAKKKKP 
PFF* KKKKKSGNIGGPLFPPPPGGEEKK 
NLFPRKGEGSHKTDSSPPH*RGGKKKNL 
SPKKKKKKKAWCWDYRRKPLCPATYF 


4220 


18121 


A 


4244 


2 


425 


FVFKIILKNLSEIQENLQFSKIKKTIHD 
LNKKFNKETGNI TKNQTE ILELKNSMSE 
IKNTVKS FWNRLNQAEERI SELEDRS FE 
MTQADKKRKKGEES LQDVLYTMK* TNIL 
WSFQKEKRKEKGLQWILNBIVAENFPSL 


4221 


18122 


A 


4245 


12 


424 


IYYKTLNCT*IF*FGGAIiACFHFSYPFY 
IHYVGFVP3PVYLFSHVIISVWTSGYLL 
YSLGCNPILSLFIFLLKLFHFGYLTIQV 
GIYVWILFVCLFLSNSFINI*FTYHAIP 
I P I Y I PLKVYNSMF YS IFTTK 
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nucleotide deletion, \=possibfe 
nucleotide insertion 


4222 


18123 


A 


4246 


215 


425 


LSIQISQIHNLIYLFINIMGKILRAKRK 
*GTSDAWNQTI FLRPRTVAHSCNPSTL 
GGRGGRITKSGDRE 


4223 


18124 


A 


4247 


413 


2 


WEVESFSLGKAFMWGCATENPGPPHPKF 
LLFIMKAPPPWRVGFQDCFNPKDGRFI * 
PRTPPCPRARGAKKNSVSKKKKIKIWKR 
WPPTPLPPPGNGGTRKFPSSPPPPPPPT 
TPKITPPGGERQL*PQRAEGAGDTK 


4224 


18125 


A 


4248 


61 


281 


ITWKDFDSFSVLRVSKAFHLALPAFIGN 
CIKCHIKMGIYPEEKFKNIHKYLYLEID 
VCGH*TWKKIFFLFFFLRQRFTLFAQAG 
GQWLDLSLLQPPPPGFKRFSGLS FLSK* 
DFDS FSVLRVSKAFHIiALPAFI GNCIKC 
HIKMGIYPEEKFKNIHKYLYLEIDVCGH 
IRLGKKFFFCFFF 


4225 


18126 


A 


4249 


23 


13 


RVRPFASPGGREVTVCGGLGREGQERRL 
RWHRKPFLPLAATSCAWPESVTPSVTS 
GSADGDGQSLAVGAGTLATVGGLELLNS 
NDPPASASQGaGIAGVSHRAWP*T 


4226 


18127 


A 


4250 


182 


342 


KGFFFIiPPGGGGGGEF*FNEPPPPRVKG 
IFPPPPPGKGEKPPPPPFPGYIFVF 


4227 


18128 


A 


4251 


2 


219 


PSLRKMQKLARHGGTLLWSQLFGELRRE 
DCLSSGGRGSQTHGSEL*SYHCIPAWAT 
DGDPVSKNTKFFFFPF 


4228 


18129 


A 


4252 


223 


324 


LGAVaHiaJPSTLGQRGRRITR*GVRDQ 
PGQHGE 


4229 


18130 


A 


4253 


165 


336 


ESDAQGHQVAARVSHVLEKDALLVFRSL 
CKLAMKPLGEGPPDPK*ADSSSWPPSVQ 


4230 


18131 


A 


4254 


3 


233 


ETAFCHVGQAGLELLTSSDPPASGLQHA 
GITGPATKSAP* WS * VSGPHLGAFDSLA 
MLIiAQDWRALLESAKFFPLL 


4231 


18132 


A 


4255 


394 


44 


AKIFPPGVKKFFASTPPGGGKKRGPPPP. 
PVNFFPFKKGGGFPPWPGGV*NPAPINP 
RPWPPKRWGFKGEAPPPPPKLHLFFPQG 
GGGKNFFGKKKGAPTGGPPFFFFEMEFH 
CRPD 


4232 


18133 


A 


4256 


152 


2 


YWSPFYCRSCCIRQGTVAHTCNPSTLVG 
QGARFTRSGVRN * HGQRGESPR 


4233 


18134 


A 


4257 


404 


190 


AQL I RGWRQ * DHLS LGGGGCNE PS SCHC 
TPAWITEPMQSLSQKTQKQKMLIPN*YG 
*TVSPPKSHPELQFS 


4234 


18135 


A 


4258 


288 


407 


ENCRPRAVAHACNPS * ENCRPRAVAHAC 
NPSAFGGRGEQIPRSGEQEQPVQHG 


4235 


18136 


A 


4259 


406 


163 


GWGVQTHPGHHGETLFFLKKKK*VGGGG 
PTRYS Pi FGGVGPQKGLNPQGGGSKNPK 
LNPLLPPWGQKGVFFSKKKKKKERD 


4236 


18137 


A 


4260 


101 


276 


LTSIVPSLWEAEGAKLRGLGRPACRESS 
LCHCTLPWVTG*WLVS KKKKKRGGALGF 
FF 


4237 


18138 


A 


4261 


2 


414 


WVAAT PNLVFVGRGNCQDW FTGP FT ,TRG 
GWGGVLFPPVKRVIKGLVAPFSGPIPIR 
KNHQKVGGFLE PNKTNFKE *NFVENKNT 
RRGIKKHVPEKKKPQKGIPLPPQIFNEN 
QTWGDFDFFFFAKKKKIINSFLGWA 


4238 


18139 


A 


4262 


337 


382 


RDARFLHCSLV*LIT*LQKEFFFVGQAG 
VQWHYFGFLQPLPSGFNQFFCLSLLSRW 
HYRGPPPSLGKFLVGWLVGWLVFETKFW 
FCWPGWRAITGFWVPATYTFRVQAI FL 
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sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last^amino 

residue of 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possibIe 


4239 


18140 


A 


4263 


406 


184 


QLLGRLRQENCHLGGEGCS*LRSPPCIS 
AWATKLDPVSEKPKTKNQKPTGLGGLNQ 
GPASLLKKIRKLGKKRPG 


4240 


18141 


A 


4264 


233 


440 


LPVHHGWGSLRKLTILGKGEANPSFFTW 
GQQGKAKQKGQKQPGAGAHAClSrSSPLGG 
QGR*ITRSGVQDQ 


4241 


18142 


A 


4265 


2 


230 


GARL*SHLLVRLRKENHLKLKGKNCREL 
KWPHCTPAWGTKKDSAAKNKNQPLPPPK 
GVKSPPQREHGLIVTCMFVQ 


4242 


18143 


A 


4266 


1 


349 


HKTNIHFKLVMVSNVFTFFFFGKGFSFC 
APGGEAGPLFGLVETPPGREKAFFGPNP 
PREGE*RAMAPCPGKILVF*KKTGFPRG 
LNFWPGGPPPFGLQRGGNNGGNPWPGPE 
FFWF 


4243 




A 








RSEQPAGSASSGrrEGLSTRflSGCGGCTG 
SPNSaSPPALCSISRRALAAFPRGRRED 
LQPAMPEPPTHSVGS CAAKPPR *APPPA 
PRRPVPSTTQGLRSAGAQRGTGRHLHLQ 
PQCRIHWVKPAG 


4244 


18143 


A 


4268 


377 


3 


TPAWMTERDCIWRRRTSAPGGSWPSGPV 
PSPGAQ*RPPSQGLGLWWflAAAAPRC*T 
APGPRPPPHGPGSPQGASPPTRPPRCRP 
HPRAGSAGPTGATPPGSTQGQRRRHSHQ 
LPGHPGHRVALG 


4245 


18146 


A 


4269 


1 


294 


LEDWGGRGARAHYDGFSLEPESDHYRLR 
LGQYHGDAGDSLSWHNDKPFSTVDRDRD 
SYSGNWALYQRGGCWTHACAQ* ILDAVy 
ATGPKYQVHCEALH 


4246 


18147 


A 


4270 


2 


218 


TGRII ILSQGLQTLLPLIAF* *LLASIiA 
NLALPPTIl!irLLGELSVLVTTFS*SlSriTL 
LQKKKKKKKPDKKKIFF 


4247 


18148 


A 




32 


443 


LHSDVDQLAGLVFPGCPWPLASPARRAP 
AGPWPREAAAPPS*DAPAPRLAVSAGSP 
AWPPPST*GLPAPAAVVASPASASPTSA 
RSRS*ASPTARRCRPGPGTAAPTSASRQ 
CQWRP*PCQRSPPSGTCSSAAAAPT 


4248 


18149 


A 


4272 


434 


57 


HLSFPPASAAPKPLPQAASLWSSPPSIV 
LAQVPPMNTDPCPQP SA* PAS ELSPEMP 
PARPQAPPENWSHP*GCGAQTVAPLVPG 
ATPAHEPAS PS PVTLPVLTAGGHCF YLY 
TYQIFLKFHYIKR 


4249 


18150 


A 


4273 


288 


424 


GLSLSVAQAWQWCDLVSL*PPPPRVKQ 
FSCLSPPRIWDY*HPPP ' 


4250 


18151 


A 


4274 


1 


429 


NTRGAAGPPQMPHPPRASAFPENPCGRK 
N*GQVSGP*ASGSSPVKMAGPAGTWRKG 
GLGWGPTRVRGRGPRRRPGAS SGYARQQ 
GPGHPGFPPSPRRRLSVPACALCPGQTS 
GTRAGLAVWPQIGLCLKAQGAESCPRDH 
SSD 


4251 


18152 


A 


4275 


253 


3 


PHQFNAVI YFPSPVSRGCPNNAL YS PFV 
IWLFCFVKTGVSVTQAGV*WHDHSSLQ 
P*TAGLKRFSSLTILSSWDYKSTPPCI 


4252 


18153 


A 


4276 


406 


3 


PMVSQGGVCAK*QARAI PREG I FRRSAD 
TQWREVQQKYNMGLPVDFDQYNELHLP 
AVILKTFLRELREPLLTFDLYPHWGFL 
NIDESQRVPATLQVLQTLPEESYQVLRF 
QTAFLVQISAHSDQNKMTNTCI 


4253 


18154 


A 


4277 


373 


3 


DGVSLCPPGWSAVAQSWLTAVL*ALGFI 
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SEQH) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

residue of 
peptide 


Predict- 

location 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, IfcPhenylalanine, 
G=Glycine, H=Histidine, I=Isoleueine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=UnlcnDwn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FLFPVGGLTGIVLANSSLDIVLHDTYYV 
VAPFPYVLSIGAVFAIIGGFIH*FPLFS 
GYTLGQTKAKIHFPIIFIGVNLSFFPQH 
NLGLSGMRRCI 


4254 


18155 


A 


4278 


71 


176 


AGGRGCNESRSHHCTPAWATE*DSVKEK 
KKGLFFW 


4255 


18156 


A 


4279 




379 


SPFVPLQSSLGNKSETPSHGRKKKRRKE 
KRTDElSrQP*KHPLRARPGPGAPRAQGGA 
QGSGGGRRAGPCRKLVHRGHTGNPKREP 
GLI PQGEGRSLGIHPSNGCHKPCPSRGR 
PPPRPKQGRGRMQN 


4256 


18157 


A 


4280 


2 


338 


CDMPRRKLDPLSGRNTLGFVCWWAQTS 
GLK*SSCLCLPKCW0YRHEPLRPVWVYF 

SLFCSCLPPCCSRVTLVGSESSCLWPAfi 


4257 


18158 


A 


4281 


407 


■ 


EERGVATGHTAERGRADNEEER6EVANK 
RGGREVQPEARRMTTTAT6TTAATRGAR 
T*TAATTATAPVTRTAAPATTASSTRTL 
RTPREARTASVLWARCRSRWSATWRASR 
TCDCRTASHSRPTATRSRTARMA 


4258 


18159 


A 


4282 


367 


225 


PCDSACLGLPKCWDYRREPPRPAGVLFL 
*ASFS*QSKEVYVCKLTHA 


4239 


18160 


A 


4283 


452 


1 ~ 


NTC6LQSVCRGTQRGGPKRCPCHPRERW 






DLGDEQESSKEGRPGREGVQGQGLLGAT 
PGAQ6SH*GL*AEAAMSSGHAL6PGQVP 
LTPLSPSPPNPHRHPRAGEASRQSRBST 
EAQRAVPSQGflflPGWETDWGSSHQWQPC 
QAQQEGRTGR 


4260 


18161 


A 


4284 


436 


22 


CHVTGTQPIKVSWAKDSREIRSGGKyQI 
SYLENSAHLTVLKVDK6DSGQYTCYAVN 
EVGKDSCTAQLNIKERLIPPSFTKRLSE 
TVEETQGNSFKLEGRVPGSQPITVAWYK 
NNIEIQPTYN*EITFKNNCIAAARRI 


4261 


18162 




4285 


301 


403 


LSIS*PCDLPALVSQSAGITGMSHHAEP 
RVKSLI 


4262 


18163 


A 


4286 


380 


2 


AQVLYS SREQERRQDLPEQVI QAEGE * V 
KASACQLTFEDEEAMESGPAALDKDFQC 
TRKHHFAEVQGSPRCMSSRYLVDGCPKT 
FAEAQKVCS*CCEANLVSIHAFTFILRI 
QWCTS TVNQAQVC I 


4263 


18164 


A 


4287 


453 


3 


YIYEGSIMEEPRKPVPKGPLGLHCPGKF 
QG*RNSYNHHAVRVGTRCAPEGVKDLTS 
SLQSVITKPEQNIQELlflKHFKKEKSEAE 
NHIRTLKAESLEEKMMAKIHRGQLEKLK 
SQCDRLTEELTQNENENKKLKLKYQCLK 
DQLEEREDV 


4264 


18165 


A 


4288 


52 


400 


LDLYFFYRQCLALSPRLQRSSAILAHCN 
LKLLGSGDPPTPASQSKEITSMRYHI*P 
NL*NSMC*NCVFDTTHRIGENIWESYI* 
*EINIQNACRIPKTQQQKTHFQNKQGLH 
SKLN 


4265 


18166 




4289 


2 


263 


IHGAHLWSQLPGRLKWEDHHSPGSHGCS 
EL * SHHCTPVWVTEGDPVSKKEINI CKQ 
NLNEKTRLTVDTPRHS FTHKI KS LNHFQ 
MFL 


4266 


18167 


A 


4290 


3 


142 


YLGVHRLSIDGLDLLT* *SARLGLPKCW 
DYWRDPPRPANFYFILFK 


4267 


18168 


A 


4291 


324 


19 


EMSKRHEQKFHRREDQMANKHYKRTSLA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=ls()leutine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IREMKIETTMKYYHTPIQMA*VRNNDTS 
QVQWHTLQVASPLVAEVGGWNEAENLRL 
Q I VKMVPMNSHCTPAWVT 


4268 


18169 


A 


4292 


2 


204 


IHFCRDRDSLCCPGWPHTAELKQSALLG 
LPKCWDYHH* ATKPGLQSVFLLGLKI FL 
ILLVRGVDRYL 


4269 


18170 


A 


4293 


427 


3 


RRKTVTVPRTGERGI I SNEDNGLCKAVG 
NI PGSPLCNVSKLNIMKHAPGGHAARFC 
IWTESAFRKLDELYGTWRRAASLKSSYN 
LPMHKMINPDLSRILKSPEIQSALHAPR 
KKIHHRVLKKNPVLRPL*RIKLTSRHKR 
R 


4270 


18171 


A 


4294 


24 


372 


FICPLQDYVICSDVTSCWKKILATTGR 
RLRDDSVQFS FFFFGKCSGSLIiAPRVKEQ 
GQDLG*WNPRPPRLREFPGLALPRCWM]Sr 
GLAPPPPIilLVFLEKEGFPLAGKMGLNL 
LHSR 


4271 


18172 


A 


4295 




332 


IHSS YFFPQS *FFGTINTSDKPLRILIK 
EKKREGTN*HIMHEPWKFG1DPEDIIKV 
KKKYYKQLCTHKFDNLEEMVHFLKKKTl 
HLI*NIl*FE*LYKYYRN 


4272 


18173 


A 


4296 


407 


30 


WSVIYGEN*ESLGWGAHPSHVANIRVTG 
LRYLFSYGRHALDMLESSQDNMRSWWVS 
QMSSEIDVDNLGHISLaSrAVQ*IRlJLNP 
GLKTSKIELKFKELHKSKDKAGSEVTKK 
EFIEVYHELYAVG 


4273 


18174 




4297 


350 




YKTVVMNKEKW*YVGYN1EQEQKLALKT 
TVLDEWYTLPDGRI IKVGGERFEAPEAL 
FQPRLINVEGVGVDELLYSTrQAaDVGT 
RSEFYKHIVLSGGSTOYPGLPSHMEREL 
KQHV 


4274 


18175 


A 


4298 


623 


1 


SRRGCftATCDGS ITAWPQRC2AAQGKPSV 
HSKLEAEAKPTPGDHAASESTGFSCLGP 
GGVHKTAHAHARNMPGDSNTGSGQSPAG 
RRWEARGSAPRHHPTQPDWTHPDALRQA 
iyiAENPASSF*SF*RCCTAASPPVPTPPS 
PVIiMRGPVPGGGGGQGKIRPLQEEAPPP 
SSPWLSR*PQaGTPSSPAVSSLYHBGL 
SPTGRQDRWGR 


4275 


18176 




4299 


469 


3 


PVRNCLGRRFRVSGRAflHEiaL*QSASTL 
QGDPRTKRQAl SA* SSAFDVQDVSHVTL 
PFYPKEAQSKDLIKEAILDNDFMKNLEL 

SQIQEIVDCMYPVEYGKDSRIVKEVDVG 
S CVYVMEDGKVGVTKQGVKLCPVGPGKV 
YEELAILYICRRPCI 


4276 


18177 


A 


4300 


506 


321 


KPLSLLKIQKLAGHGGSCL*LQLLRRVR 
QENCLNLGGEGCSGLPKFWDYRCEPPCP 
DCWPS 


4277 


18178 


A 


4301 


454 


1 


NGEKPTYSGKKYVFLF*NTRRPTLFTWG 
KTIsr*GERPRSSVYSCSSWRLSSSSLGRS 
*NS VGSLNI ILATLTAQELSNS * AUNL 1 
* LAYT lAF I VKI P LYGLHL * LHKAHVE V 
P lAGS IVLAPVLLKLGG YG I IRLTLI LN 
PLTKHIAYPLY 


4278 


18179 


A 


4302 


3 


463 


AVSLPLPLPHFGRLPPLRVGWCPLRCH 
ILRTQHLEGAPSR*QTaAKQAGVWEELH 
SRLSLGPELGRIRPYSGDPGQVTDSSGA 
LASSLERGCYPAALLVGEGEGEMYLGPY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HPT*DGLKRLSLAKPPGGETHLTRSGHT 
FWGE6IPPSAVCL 


4279 


18180 


A 


4303 


469 


2 


FFFFFSETESCSVTQAGV*WHDLGSLY 


4280 


18181 


A 


4304 


194 


3 


FFFFFSETESCSVTQAGV*WHDLGSLY 


4281 


18182 


A 


4306 


517 


254 


WEETAFRLIGKTGIGLVTSGDLAASAS 
QRGGMTGVSHCNRKEFVLMNtlVMSSGL 
TFLPFSPDEV*GS*GIiSFLPSPLftliNTG 
LRN 


4282 


18183 


A 


4307 


276 


3 


KI101EGIPLHIPNPLVNLNLGLLFILAT 
SSLAVYSIL*SG*ASNSNYALIGaiiRAV 
AQTISYEVTLAIILLSTLIISGSFNLST 
LITTLV 


4283 


18184 


A 


4308 


234 


398 


KKFFFFPPLEGGGPITTIWSPPLPG*HE 
SPAPPPPHGQIKGLAPPPNLFFFLDKH 


4284 


18185 


A 


4309 


403 


100 


QPSTEGGaFWPTQGPKPTGI.PPLPG*LN 
PRNPTPP*WPPPPPPFKTRSKGPHTPSP 
TPLC3PPPKNPRLFFFF 


4285 


18186 


A 


4310 


399 


85 


WKPIPLAPEWRGVIRFRALPPCRPIHCP 
TLKLTAPSVIHQRT*VHWGFLVGVSLSV 
FliERESCSFTQVECSGTKLALHSLELLG 
VSDPSISAFQRAGIAGVSHHA 


4286 


18187 


A 


4311 


230 


2 


WTEEDTRRCLVLFS FFLS FFLFCFVLRQ 
RPACCPGWSAVTQSQIiTVASTSLAQAKR 
SSHi;.SPPQRS*DYRHTPQCI 


4287 


18188 


A 


4312 




420 


NTSWQVGELSLIVIVINMLLPYVWLPKG 
KFFFPQNSRFPSAPPHSSPPGLRSDFSH 
2GGLFFHLEVLWGLPLPPPPPPASTHVH 
RPIiGTQGCP*LAWYIHLISASYQKANAA 
PQLSCILQDCIRSKGDII,IFTFFTLCr.S 


4288 


18189 


A 


4313 


403 


207 


ESSE6* LNPLAHIiAMRYKGCPFKDVREK 
SEFILKSIQVRKSILNTRMSLLQLDFTC 
RYSRCVCFY 


4289 


18190 


A 


4314 


469 


328 


TSEGGGFlteLRteHCTPVWLTK*DSVSI 

SQRIRKKNLKTQSLAA/RCP 


4290 


18191 


A 


4315 


1 


212 


KTILPGVEVLKRLRWEDSLNP ♦ GQGCREP 
* SCHCTTAWATEKDPVSKKKKKKKNPGG 
FRPHSLSQPFGPPK 


4291 


18192 


A 


4316 


63 


559 


SNLTFWQCAVPAVETTPSSLCGPVPSPT 
S SATPKPI PS *AACPPPDCALAAEVRAL 
PAAGRPRFSEACLTPQPNPCVLWPCIiWD 
HSPTSCPTGSS*APHSCaFLPVPRLGGG 
MTTVHPTVHLP IHS SLTFAWE * P S P PAC 
LTAPRPTGHCLLSVPPSAYSSSSPS 


4292 


18193 


A 


4317 


135 


405 


PAPPSPHPKPDSVSCVIPPS*PAPPSPH 
PKPDSVSCVIPPSHVAGPSGLPEMTLLE 
PRGPCPPEVPPTSPFVG*AWWPHPPARR 
ASGRMDGRTDGRGRHLDLRSTHS FLTP 


4293 


18194 


A 


4318 


413 


1 


LYLVGRA* SFHRSFPGQERIHPGGKPYD 
CKECGETFISLVSIGRHMLTHRGGVPYK 
CK\rCGKAFDYRSLFRIHERSHPGEKPYE 
CKQCGKAFSCS3YIRIHERTHTGDEPYE 
CKQCX3KAFSCCKYI RINERTHTGGV 


4294 


18195 


A 


4319 


479 


342 


GVGVDNDGILVLGATNIPWVLDSAIRER 
*DNTVNI LFLTADVTSR 


4295 


18196 


A 


4320 


400 


26 


HSEAVTTVRSHHSPVGHIHSLNAPPTVA 
LTP*YNKPLNIFV*ARAQKIQGQFLTHT 
IPHILLLVITRDTSRACVCVCVCVCVCV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

location 
correspond 

residue of 
sequence 


Predict- 

nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CVYAYIWVCMCVYIYVCVYIYICNYTFI 
PTLVHLPINKMY 


4296 


18197 


A 


4321 


370 




SDRGQGKAGQATRLGSRQTGQDEDKGTE 
KS I P SWDKGP VNNE FGKS VWS SNLVTQ 
E PSPEETSTKRS I KQNSNPVKKEKS CKC 
NECGKAFSYCSALIRHQRTHTGEKPy * C 
NECEKAFSRNV 


4297 


18198 


A 


4322 


342 


82 


GLGLTTNWLLTGKSAGRGCaPSSGGLWL 
AVODS VY YHRETEGGKPSPGRPCC* PS * 
AGLAPPSPPPPGCPEAPOSCCIiHYRCIi 
TAH 


4298 


18199 


A 


4323 


284 


3 


ITKKFPRVIGGGCFSPFFMGLSKKNGLF 
PEGJVLSISLIFGPAIPPGGAQVISSAKE 
KKKVLSRPGVVAHA.CNPSTLGG*GGRIK 
RSRVPDQPV 


4299 


18200 


A 


4324 


365 • 


3 


QVEVYVVE1IHPFRLEESIYQ*CRLEGAT 
SVAGEQISEYNISMRASDGGSPPLSTET 
HITLHVIDINDNPPTFPHLSYSAYIPEN 
NPRGASIFSVTAQDPDSNKNARITYALT 
EDTLQGVY 


4300 


1S201 


A 


4325 


253 


3 


PSFLSHD*SHIHKRLECSGMISADYNLH 
LPGSSYPPTSPSQVAGTAGPSMSLQEGR 
ASHDITSSSRSIGAKELLRPADHPQCI 


4301 


18202 


A 


4326 


307 


95 


FLFLLKDKILLSPRLECSGTIIAHCSLK 
LLDSSYPPAVAS*VAGTSGMaiYTWLRW 
KNHLSPGIQGCSEI 


4302 


18203 


A 


4327 


371 


2 


WAPNHISPTPRSGSTTRIWAPTPPSNSP 
RPCQDPGHRTDPWYPEEEFLLTNPDPPR 
APASWSFPFQEKRLHFPALPCP*HLDSS 
LGPTTLFSFSFPPTSIRPSQTNHSKGTP 
PPLSYAQHDCI 


4303 


18204 


A 


4328 


409 


1 


RLLEARQPELEMAALIFFTLKYKHVERE 
QKYHQLQDEYFTSAWLTLIIiAALFGLV 
YLLIFPQSVWnLLLVLCICFLVACVLY 
LHITRVQCFPGCLTIQIRTVLCIFIWL 
IYSVAQGCWG*LPWAWSSKPNLY 


4304 


18205 


A 


4329 


262 


2 


LHGAYLVLDIIGAQNHKPRYSFKMRVGQ 
FFPLVFSPRDSLTLLPRLECSGTVMMC 
HLDLLGSGDPPTSAS*I*GTTGAYHHTQ 
HV 


4305 


18206 


A 


4330 


1 


399 


NTF*FLKGMGLVNHVFTEDNLKKLYVSN 
LGIGHTRYATTGNV 


4306 


18207 


A 


4331 


408 


3 


S SVGIHVRTHTGE KPYECKHCGKAFSCH 
S S LREHVRTHSGE KP YE CNQCGKAFSHA 
QYFQKHVRSHSGVKPYECTECGKAYSCS 
SSLRVHVRTHTGERPYECKQCGKTFHYL 
ASLQAHTOTHAGA* I YKYSGHV 


4307 


18208 


A 


4332 


110 


"426 


GLSPGTWSDMTGGPAVTAFPCTLTGTHA 
GHIIIDGIDIAKLPLHTLRSRLSIILQD 
PVLFSGTIR*APPPLRPTQPQAGSVPSD 
LEHKEEGVGGDQDPRGVSCSC 


4308 


18209 


A 


4333 


133 


1 


EPCCPSALAAPEVLGPEKYDKSCDMWSL 
GVPMYIL*VPSPPPLY 


4309 


18210 


A 


4334 


165 




1 1 LTFFKKNERKWKAVQYW* KNR* VDQ 
WWRIAS PE IDPHE * SQLISDKGAKE CI 


4310 


18211 


A 


4335 


42 


440 


SARRAGDPARGAPSRHNASLPDPRELTG 
RPAGGLQ * GNCPAPPTPQLPS PSVSRPV 
SGRGPPPPPSFGDPHANRPQNPKGAKW 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

imi*n1>tid 
residue of 

sequence 


Predict- 
ed end 

Son 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=PlienyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LTVQMGEPGE IRRAACPQLSWAMTFPGS 
LPEPKRTGPQIQKSPPSRRGG 


4311 


18212 


A 


43 36 


1 


160 


tJTCGGGAQL*SQLLRRLMWEDLNSGSQG 
CSELRSHHCTAa*ETKGDPLSINEy 


4312 


18213 


A 


4337 


236 


1 


QRSLLCYIMEIRTVAVHIVAIKGVESEV 
Y1ANSEEGTVYAKKECNEYGIF*ELVLE 
SHYYTYAAAVLRPVCCIESKVS 


4313 


18214 


A 


4338 


408 


3 


CQS S VS KKERTNGAQNPGAAKQGNNELR 
DSTEQFQEYYRQRLRYHQHIiEQKECXJRH 
I*QQiyiLLEGGVNQEDGRDQQQNRNEQFL 
NRS IQKLGEUJIGMDGLtaSTEVSAL IQQC 
NGSKGNGSNGL*VNSFDTPPHV 














RSQSSLPKSFKRKISWSATKGVPAGTS 
DT*GVQPGWQ*RWGASTATTQKKPSISI 
ATESLKSLIPDIKPLftGQEAWDIiHTmD 
SRISEDETERNGDDKTHDKGLKICRTW 
RARYSINEV 


4315 


18216 


A 






353 


DVFLDTIjARPLRHS CNFFLCYLL* DFFK 
FIPQFFYWMFYFDYIFISERNFYFLLLT 
HFFTLPLFLVIRIQCLHLFLFFSFFFFF 
FGKGNPFLPPGWRARAQFWVHGSPPLRV 
NALLP 


4316 


18217 


A 


4341 


376 


3 


LELREGGFLPHIADKVREERSPALLDDR 
AGRCQGQPRIQVCLTPKSMLFIiAFHM*T 
CEHCLEQCFSTSL* PIEIRTLH*D*GCV 
CMCAYVCICVCVHMCMCAYICICVAEMA 
KRPLESIHTSCT 


4317 


18218 


A 


4342 


158 


2 


LAFFFFCETQSCCQWHDLSSLQPLPQRF 
R*FSCLSLPSSWDKKHTSPHPTCI 


4318 


18219 


A 


4343 


145 


2 


IFGEQWFDYMNKFFS6DF*DFGASLTQ 
AVHTVPNV* SVIPCHPPRV 


4319 


18220 


A 


4344 


1 


434 


RSLIFRATAYEYRYIDQADLKLPASKRS 
S CLGLPKGWDYRHE PPCLARFI FWQI F I 
AYKLR*KHYKSRQRSLPSRGSC*CACDT 
FVSDPSH1«IGT^4AQILGGWITLGQVQRSC 
LAr)QPLLPLLPPTLAPCLP*NDVLLFFF 
TQSQ 


4320 


18221 


A 


4345 


4 


475 


KHSCRSLEHLDRPLPPALQETCPVRAEP 
LLLVRINASGGLILRMGAINRCLKHPLA 
RDTPVCLLAVLGEQHSGKS FLLNHLLQG 
LPGLVHAGRGRREAGSKDGGSCLGEAGS 
GIPVCGDKEPTN*CSPFSPLQESGEGGR 
PRGGEASLQGCRWRANG 


4321 


18222 


A 


4346 


313 


442 


DSTLNFFFFF*TESYFVAQAGVQGHNLS 
SLKPPPPGFKRFSCL 


4322 


18223 


A 


4347 


437 


338 


AEVQEFYGDYIAVNPHLFSLNILGCCQG 
RNWDPAQLSRTTQGLTALLLSLKKCPMI 
RYQLSSEAAKRLAECVKQVITKEYELFE 
FRRTEVPPLLLILDRCDDAITPLLNQWT 
YQAMVHELLGINNNRIDLSRVPGCMRPL 
*RIHPGTLERSIRLLFMPSSSWTMA 


4323 


18224 


A 


4348 


409 


118 


GLHDASFVAAIPSTQWYFFFEETASEF 
DFFERLHTSRVARVCKMDVGGE3CLLQKK 
WTTFLKAQLLCTQ PGQLPFNVIRHAVLL 
PADS PTAPHI YAVFTSQW* AAGPSGRWS 
TS SAAAFRRPRHS CRL 


4324 


18225 


A 


4349 


2 


418 


YTIiPDFPHHPHRLSHCELLOSRTAETYQ 
SPGACTPISQGHPGQRSSPCSSSSNPAC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
corrcspon 
ding to 
last amino 

residue of 

seTnete 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonlne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














*SSSSCGSTGFRCPGSAHPGCCSYVL*M 
GR*RSSQAPRASPIAQGPRSAPAGRAGH 
PALPSPLP*GAPPPPSAASSPGL*SPA 


4325 


18226 


A 


4350 


401 


276 


DPLNVGGUGYS * PRLCHCSPTWVTEGDL 
VS KTNTHTNKICHL 


4326 


18227 


A 


4351 


446 


277 


AQDFVLIKDGEVIGGICFRMFPSQGFTE 
IVFCAVT*NEQVKVSAAPRPQPQSTAHS 


4327 


18228 


A 


4352 


408 


3 


PKS S LFTSRPNGKSWSTKRPAI SLAYSN 
ENACJGSWNGDQDAGPLLVENPRDYSTEL 
SVTIAVGASLLFmVLAFAALYSRKDTR 
RQ*PLCQLSPHRGIGASELGHSlPQ*ELP 
ALLLGP IRHECEAGPPHDTLPV 


4328 


18229 


A 


4353 


62 


280 


VRIGCLTSVPEaSSCGLPTTPGCC*PLG- 
TLPCRSRSRTRSFRSRSRTQSRSRPKTY 
SPGRRRRSRSRSPTPP 


4329 


18230 


A 


4354 


364 


3 


PVGEEGKSLVPKSPVEEEGKS PVSQS PV 
EEKAQYRVPKSPVEEAKSKAEVGKGDQK 
EE * EKEVKQAPKEDKVEKKEEKPKHVPE 
KKKAESPVKEEAVAEWTIAKSVKVQLE 


4330 


18231 


A 


4355 


3 


558 


lgptllwrrgkvpkanspiptslrrtlp ■ 
raspgpsrgplfmrqagstpmtlprstp 
smspml*mvwpptavpvp*kplmwappa 

PLV1,LVT1LTEIQEPATPAPLTQF*KPT 

slmvsrpvcpwqgprttrstlcattia 
pshatlrpglstttsplwqtlslllegq 

ASLPKG*NTSITLPSVS 


4331 


18232 


A 


4357 


258 


1 


lgfhdpgpagwsqphggrsaqaq*ppqp 
ptvprtsladvsdaplppcghsrpqhhp 
frlphtsfstpglsavclivhpaarvsr 

CI 


4332 


18233 


A 


4358 


97 


343 


drdlgdeygkkqvhgdvfrpsshplifs 
sligsgcqifavsliviivamiehlyte 
♦vptlncimsllflnsfnhwnllsqs 


4333 


18234 


A 


4359 


182 


2 


KDYFGSSVENGGEEELKQEDQLGGYSVI 
*AEDDEDEDDDDDDDDEDEEEDDEDEDD 
DDCl 


4334 


18235 




4360 


88 


2 


rgsrtdtg* rgsdrgtg* RGSHTGTR * R 
gsrtdtg*rgsrtdtggrgsrtgtggrg 
srtgtggv 


4335 


18236 


A 


4361 


2 


250 


ELRDEGICASSAKQRLKCAGLQKFGERAF 
KAWAVARLSQRFPKAEFAEVSKLVTDLT 
KVHTECCHGDLFGCDEESAGLAKYIM 


4336 


18237 


A 


4362 


169 


456 


EQLLCAGQGQLS CEPLHLPS CPVGPQTP 
AHRPGRMAPGPPSFLSPGSPCGLGLAIP 
CSCQPWVAGVPDHHLFPSPMGRAGLSPS 
ERSAGEGVIPWP 


"4337 


18238 


A 


4363 


1 


419 


PEFKLQKLKRSQNSAFLDIGDENEIQLS 
KSDWLS FTLE IVIMEVQGLKSVAPNRI 
VYCTMEVEGEKLQTDQAEASRPQWGTQG 
DFTTTHPHPWKVKLFTESTGVLALEDK 
ELGRVILYPTSKtSSKSAELHRMWPKW 


4338 


18239 


A 


4364 


375 


448 


. PGAVAHACMPSTLGGRSGRIMRAG 


4339 


18240 


A 


4365 


J 


444 


DFLTNHYLHFLH I AGSQLTGLGTAVQLY 
S AYEENNRTFLLAAVKRNHNQYTOP SGV 
ATFFESIKEILLRQSGVKVESVDHDSCV 
HGPCQNGRSCLRKLAVSSVLKSRESLPV 
I rVANEPLQPFLCKCLPGYSDSWCE I D I 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO; in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ingto first 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanlne 
C=Cyste)ne, D=A5partic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Hisf idine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Meth ionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














DEWLPFPW 


4340 


18241 


A 


4366 


52 


402 


CPQEVWFHSFCHCTSAFMKDLYFLCLVR 
FGDLKEDKVTRHDGAS SDGHLAHI FRHA 
AKELFNEDVEEVTYRRLRCGAVSTACLC 
LWLAEVHGVPVWI CACGGSMDGVYTSQN 
QKDFF 


4341 


18242 


A 


4367 


119 


247 


WVliLS FQLHGVAVLGI AL lAMGEE IGAE 
MALRTFGHLLRYGEP 


4342 


18243 


A 


4368 


2 


430 


GGTWDGQPGFHGDSLSKAPGMNSLEQG 
MVGLNIGDVSSSAVKTVGSWSSVALTG 
VLSGNGGTNVHMPVSKPTSWAAIASKPA 
KPQPIMKTKSGLPPSPIKHNMDIGTWDN 
KGPVPKAPVPQQAPSTQACPQSQQVGQT 
LP 


4343 


18244 


A 


4369 




406 


ETSTPEGEAGPIQRLDIPVENPVESKNI 
FLGAPLIIGHVIDKESPLYDISATDLflN 
QDLEVIVILEGLGETTGISTEARTFYIA 
EE IQWGHRFVS lETEEQRPDSVDYSKFG 
KTDLVTTSRCMARELDEKPS ILI 


4344 


18245 


A 


4370 


2 


284 


GTGTLCDLTALLSARYDGVETCILPCWK 
TISTIPVAIiLIHYHQNQaSCGKEailLE 
TRQHRLFCADPKEQWVKDAKPHLDRQAA 
ALTRNGGTFE 


4345 


18246 


A 


4371 


3 


184 


ED raiLLS IMKGTTRP INKE lEALNDTV 
DQLELTDLCRTLTPTIAEYSFFSSAPKV 
LIQP 


4346 


18247 


A 


4372 


44 


304 


GLPHS IGCQGLPRLPEWPTWVGGCHRGL 
IMPSVPTQVSBRPLMFLUDTPGVIAPRI 
ESVETGLKLALCGEPGLGLGLGPGPLPP 
HL 


4347 


18248 


A 


4373 


1 


422 


LHLFNPAASAPSRSLFSGPILDPLSRAL 
GVGPGICGLASSPGISEGWDQIRSWTHP 
PDPDHLSGFCRSQVYMHPSLSPPSTMIL 
SGGTALKPPYSAFPGMQPLEMVKPQSGS 
PYQPMSGNQALVYEGQLSQAAGLSASQM 


4348 


18249 


A 


4374 


2 


423 


NSGTHTPGLLEDLSKNGRLPEIKLPVNG 
CSDLEDSFTILQSKDLKQEPLDDPTCID 
TSETSLSNQNICLFSDINLNDQEWQELID 
EIJiNTVPEDDIQDLFNEDFEEKKEPEFS 
QPATETPLSQESASVKSDPSHSPFAHVS 


4349 


18250 


A 


4375 


292 


429 


LKISVFPSSTEISLQQRLEYIARAILTA 
KSSTGIASISADGESLRE 


4350 


18251 


A 


4376 


544 


32 


ALHACACLGLSPEACFLHLLQLTGTSPG 
PVDGQGLDNQGFRLLKAILGVSPAPEPV 
HGASEHNADTGVS VARACVLPTS PTLAR 
TAAAKDESPPLAGLTMGPSREPGGSCLP 
LPSGLLSSPTRLALPRRPPSVSVWQDAG 
S SGSMPTEPLVQCGHHF YGTLVS WAVMV 
LK 


4351 


18252 


A 


4377 


378 


232 


PAKEAEEELLLHDTRCWLNEGAMPEARH 
PRTGASALHVAAAKGYIEVMR 


4332 


18253 


A 


4378 


291 




QMLSVDVAS RYRAPS TYVUSIS LKEGMDG 
LHGESCSSFLLGPSVAMNMQTAGLEMGI 
CDGHFRQKVGCGYVLKPDFLRDIQSSLH 
PEKPISPFKGQSL 


4353 


18254 


A 


4379 


171 


458 


RGPLFLTLHIVNSLQGHNFRREYIVTQG 
PLPGTKDDFWKMTOEQNVHNIVMVTQCV 
EKGRVSKQLSWHQSFAFCHLDRILPHHQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

nucle- 
otide 

correspon 

peptide"^ 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, Lf=Leucine, M^Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=posslble 
nucleotide insertion 














SETPQNNVGIM 


4354 


18255 


A 


4380 


214 


410 


LDHLP THP S P TGE I E VKWFEAVQTGLPM 
CILGAFFGPIRLGAQSLQVIDSELIPWA 
VQN6RIAPC 


4355 


18256 


A 


4381 


2 


74 


IGDSGVGKSKLLSRLTRMEFNLGL 


4356 


18257 


A 


4382 


290 


119 


lATVSUSIIVKMPRLPKAIYRENAIPIKI 
PMTFFTEIDKTILKCMWMHKIIPGNSPA 
H 


4357 


1S258 


A 


4383 


2 


423 


LERVCWIKDIWaVTGEMMEVMKSIIQK 
YQHKRISLVEAaVTRHRSIFNQLKALAE 
DQINSKLSKPEWI IHDAVHPFVEEGVL 
LKWTAAKEHGAAGAIHPLVSTWSPSA 
DGCLDYSLERARHRASEMPQAFLFDVIY 


4358 


18259 


A 


4384 


356 




VnrPCCLLASLICVFRGLFYLSLCLVLRV 
PVPLCLQFPGSLLGSLLLFSLLRQGLAL 
WPRLECSSAIRAHCSLELGSNDSPASAS 
QSAGITQMSQCAWPLSIiSVSPRVPGLS 
SVPLIF 


4359 


18260 


A 


4385 


1 


125 


SGSKKTLRSSLEEPTILDIEKFHRESFF 
YTHLINFSGECRYC 


4360 


18261 


A 


4386 


53 


215 


WQLYWTTWCQKISRNQGLAHVPRMECSS 
MIIAHCNLKLLGSSDPPASASQIDPS 


4361 


18262 


A 


4387 


2 


390 


PSLAARVLARGYGNFSVFVLWLRKNDRM 
HGECAPNVSVAVSTSHTTIIGGGIRGGG 
GGGYGSGGSSYGSGGGSYGSGGGDGGGR 
GSYGSGGGNYGSGCAQSGHGSYCSGSSS 
GGYRRGSGGPGSCRSGG 


4362 


18263 


A 


4388 


2 


441 


I KTR I SVIHECEE FGLTP lEGAI EDMKNK 
TLQLAPAINQEPSDAKMLQMVLKGSEGA 
TVNQGPLEVAQVFLAEIPADPKIiYRHHN 
KLRLCPNEFIMRCQEAVEKKKRLITVDQ 
MEYQQELKKNYNKLKEHLRPMI ERKI PE 
LVKPIF 


4363 


18264 


A 


4390 


242 


24 


KLGNFLGGCVKESPFVPQGGMQWPDFGS 
LQHPPSRFKQFSCPTLPGNWNFRNAPLR 
SGKFFPS PVFLVETGY 


4364 


18265 


A 


4391 


1 


244 


VDQQMRQNLFTTGAHHLQQAHIQFRTDI 
ARTEYLSNADERLRWQASSLPADDLCTE 
DAIMLKRFTRYE LG CQGEAWE E CGW 


4365 


18266 


A 


4392 


2 


146 


LDLLSQPCHAVY I FAKKND I P FE LRI VD 
LIKGRSSLGFGEPKSQEGDR 


4366 


18267 


A 


4393 


204 


489 


GAASEHP IP KGESHGTSKAS S LCADAGP 
AGHFASVYNPLAWTVTTIVTLTVGFPGV 
RVTDEAGHPVPSQVSGIQRCLQGMKAPP 
VTVTVSGEKGS 


4367 


18268 


A 


4394 


2 


171 


LSDGFQPSRSIIFASWSAGDFGSVGATE 
WLEVFFI IPSHIGHELVGLGCCPEVMFL 


4368 


18269 


A 


4395 


2 


197 


WLELLKFYTLDCALEEYVICVRIQDILT 
RENNNWPKRRIAIE6EMICCISFVEYFI 
CATHYLIQC 


4369 


18270 


A 


4396 


2 


285 


TVDFP FQPYKCQQE YMTKVLECLQQKVN 
GILESPTGTGKTLCLLCTTLAWREHLED 
GISARKIAERAQGELFPDRALSSWGNAA 
AAAGDPIGPS 


4370 


18271 


A 


4397 


366 


461 


TSNFSCSLVLEEEASDYLELDTIKNLVK 
KYSQ 


4371 


18272 


A 


4399 


157 


3 


DVLRNNFARSAVL YLF IDRVTLWNAPRL 
ECSGAIIAHCSLKLLGSNDPPTS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


4372 


18273 


A 


4400 


2 


343 


DIDFKYHFDFFSVNFNEELVALYGGSLQ 
KQTKFVHECIKTILKLYKGQEFAPKSVA 
IIGHSMGGLVARALLTLKNFKHDLINLL 
ITQATPHVAPVMPLDRFITGSTGWLKDH 


4373 


18274 


A 


4401 


214 


430 


ELGNILNYWRWEAEAELSNRWFNGQAV 
HGELSPVTDFRESCCRQYEMGECTRGGF 
CNFMHLRPISQNLQRQ 


4374 


18273 


A 


4402 


3 


227 


LTQVSPQMTGHRQIiNTAQaaQMAKVSEL 
KHFQYFLPMRGLSLRVMRKMGPRGPLLF 
FLLGSPRATHAWKRQISF 


4375 


18276 


A 


4403 


158 


313 


NRDEFOIVAQAGLELLGSSSPPAaTSQS 
ACITGVSYHVRPTLSNKFGCSSIi 


4376 


18277 


A 


4404 


3 


528 


HCIREGGQDVPSNKDVTSLDWNSEGTLL 
ATGSYDGFAEIWTKDGNLASTLGQHKGP 
IFALKWNKKGNFILSA6VDKTT1IWDAH 
TGEAKQQPPEHSAPAtJ)VDWQSNNTFAS 
CSTDMCIIIVCKLQQDRPIKTFQQHTNEV 
IS-AIKWDPTGNLLASCSDDMTLKIWSMKQ 
DNCVHDLQAHNKE I YTI KWS PTGPGTHN 
PNANLMLASASL 


4377 1 18278 


A 


4405 


35 


180 


MCVDYHYFFCLLGPNNKNPQTSAVRTPT 
QTNGSNVPFKPRGKEFSPGK 


4378 


18279 


A 


4406 


1 


438 


DFQRPRDDHGDVDWEKIiVELLTDCSNLQ 
DQTYILYILYVIKGPSWDTNLSQQNQVT 
VQNLLGELYGKTGLWQEWGLI PYI SSLF 
RKKVDVLAEACTDLLSDQKQLTVGLSDE 
PREKI lYAPLPPKELTKLI YEASGQDII 
IGV1.TQ 


4379 


18280 


A 


4407 


1 


428 


TESVNAYFKGaDPTKCIVKITGDMTMSP 
PSGIIKVPTSNPTPAVLCFRVKNISRLE 
QILPNAQLVPSDPSQCDSNTKDPWMNMQ 
AVTVYLKKLSEQNPARSYYNVDVLKYQV 
SLNGIQSTPUStLATYWKCSASTTDLRVD 
YK 


4380 


18281 


A 


4408 


174 


383 


KNSWPGAVAHAOSrPSTLGGKGRQITRSL 
YRDHPVQHGEIPLYLBOiTALAVTFLGLL 
TGLHDLYLTTNLT 


4381 


18282 


A 


4409 


1 


288 


LMAEKDSLDPSFTHAMQLLTAGKGASRV 
PLGRPAI PGMSGPGFVPtiASRPLGTEAV 
GSSGLILIFGSLDSTFPPQPGVRGRISR 
LQWRPEGDFAIP 


4382 


182S3 




4410 


292 


422 


LVHS S CFQVS S KI LELARKQRMNTD IRR 
NIFCTIMTSEDFLDA 


4383 


18284 




4411 


412 


238 


FFSRHRVSQWDQAGLELLTLSDLSAIjAS 
HSAEITGLNHHAQPLTTNFNKYCWISLQ 
VH 


4384 


182S5 


A 


4412 


1 


405 


VTSSDKSLKVLLDAEDKVFNEIRNEHFS 
SIVFGFLSQKARNLQAQYDRRRGMDIKQM 
KHFVSQELKGLKQEHRLLSLHIGACES I 
MKKKTKQDFQELIKTEHALLEGFNIRES 
TSYIEEHI DRQVS P I ESLRLMCL 


4385 


18286 


A 


4414 


2 


149 


GFHHVGQAGLEVLTSDNL.PVSASQSAGI 
TGVSHCAQLRSDGFK3CRSSPA 


4386 


18287 


A 


4415 


1 


366 


D I CHRKLGVE CP SHASI NRLWQWS S I 
TASLRFEGPLNVDLIEFQTNLVPYPRIH 
FPMTAFAPI VSADKAYHEQFSVSDI TTA 
CFESSNQLVKCDPRLGKYMACCLLYRGD 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=GlutamicAcid,F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=]VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 














WPKEVNAAI 


4387 


18288 


A 


4416 


1 


164 


RRRCEMSTMFADTLLIVFISVCTALLAE 
GITWVLVYRTDPFGTLAGSEGPQQRY 


4388 


18289 


A 


4417 




141 


DIAI PCSMKGAHSVGLMWWMIiAREVLRM 
RGTISREHPWDLTSSVMG 


4389 


18290 ' 


A 


4418 


136 


1 


EQTKAFGQLHKVLGMDPLPSKMPKKPKN 
ENPVDYIVQI PPKDPMQ 


4390 


18291 


A 


4419 




280 


GNYCKYYGYRGPSCEDGRLRVIjKPEWFR 
GRDVLYLGCNVGHLTLSIACKRGPSRMV 
GLDIDSRLIHSARQNIRHYLSEELRLLP 
QTLEGDPGA 


4391 


18292 


A 


4420 


3 


446 


TVSTSQVRC3NYHRDSQAAINRQISLELY 
ASYVYLSMPYYFDRDDVALKNFAKYFLH 
QSHEDRDHaOKLMKIjQNQRGGRSLLQDM 
RKPDCDDWESALNAMEWALHLDKNVHQA 
LLELHKLTTDKNDPDLSDYIETHYLNEQ 
VTAINELG 


4392 


18293 


A 


4421 


6 


182 


DPSFCSEPRSCHCTPTWRTEQDSISEKK 
KKRNILKNliJAKNYITLESEKERKSKQRS 
RR 


4393 


18294 


A 









GP I S YVWMPKKRQALVEFEDVLGACNA 
VNYAADNQIYIAGHPAFVNYSTSQKI SR 
LGDSDDSRSVNSVLLFTILNPIYSITTD 
VLYTIOJPCGPVQRIVIFRKNGVQflMVE 
FDSVQSAQRAKASLNGADIYSGCCTLKl 
EYAKPTRLNGSK 


4394 


18295 


A 


4423 




257 


FHHVGQAQLKLLISVDPPALDSQSAEIT 
GVSHCAQPVLCILNQLCAKAVLSPDSHP 
YPPKSGRHEGQVREQKMSPCSYIYFKRT 


4395 


18296 






106 




CCCFKYFFSNVKICFYRDLKPENI LLNE 
DMHIQITDFGTAKVLSPBSKQGV 


4396 


18297 


A 


4425 


3 


364 


MDEIEKYQEVEEDQDPSCPRLSRELLDE 
KEPEVLQDSLDRCYSTPSGYLELPDLGQ 
PYSSAVYSLEEQYLGLALDVDSEYLTVK 
VISLHLVFQIGVIFLFQVPLTDPRDVIA 
AGRTYGRI 


4397 


18298 


A 


4426 


2 


241 


HHVWLICFVFFVCVETGFCHVAQVGLER 
LGSSDLPTSVSLSAGITDVNHWAQSKPS 
PSKGTGMNRPIGLRVGKGVWSRL 


4398 


18299 


A 


4427 


2 


438 


TRLIGPNCPGVMETPGKCKMGIMPGHIHK 
KGRIGIVSRSGTLTYEAVHQTTQVGLGQ 
SLCVGIGGDPFNGTDFIDCLEIFUroSA 
TEGI ILIAEIGGNAQENAAEFLKQHNSS 
PNS KPVESFIAALTSSTGRKMGHAGQLL 
AEEKV 


4399 


18300 


A 


4428 


34 


427 


AGMLPAVGSADEEEDPAEEDCPELVPME 
TTHSEEEEKSGLGAKIPVTIITGYLSAG 
KTTLLNYILTEQHSKRVAVILNEFGEGS 
ALEKSLAVSPGGELYEEHLELIITACLCC 
SWGNGLIAIENLMRQKGE 


4400 


18301 


A 


4429 


330 


8 


YSDRI SHWIDMCPSKYIFRGPVHPPHI S 
TFLHTAHNLVARGNYENMSPHDEEKGAY 
QRKEKCCDDPMAGAARIHFLCFFEMGFR 
SVSQAGVPWPHITSMQPQPPRIK 


4401 




18302 


A 


4430 


446 


222 


LTSSDPPASAPQSAGITGVSHRPQPIWG 
VSIQLYFSGCPDTPGQGRLTREHCDGQD 
GEQGGPCSESVAQFCHDWS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalaninc, 
G=Glycine, H=Histidine, l=Isoleuclne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4402 


18303 


A 


4431 


3 


243 


AREDSADFVSFFPDFVWTLRDFSLDLEA 
DGQPLTPGTSQKDJMFILPRLCIRKFTS 
LIRTSRSYQSPDTVSVASSRGAR 


4403 


18304 


A 


4432 


433 


0 


PPPSPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPSPSSPSSSS 


4404 


18305 


A 


4433 


25 


185 


ARWVACSELRLRCCTPAWATEPEPVFKK 
KKKVQQQGGWPGGLTPVSYRFGRLT 


4405 


18306 


A 


4434 


2 


329 


VRDIKEKLCYVAVDFEQEMATVEFSSSL 
EKSYELPDGQVITIGNERFRCPEABIQP 
SFLGMESCGMHETTLNSIMKCDVDIRKD 
LYANTVLTGGTTMYPGIADRMHKEI 


4406 


18307 


A 


4435 




374 


WVTFI SLLFLFSSADBRGGFEHDAHTSE 
VAHRLKDLPEENFKALLLIAFAHYLHQC 
PFEDHVELLNEVTEFAKTCVADESAENC 
DQSLHTLFGDTLCTVATLRETYGDMADC 
CAQQEPERNECF 


4407 


18308 


A 


4436 


2 


332 


KVRRIDSRIiAELDREIRNPQI SCQLIYS 
PSLFLGAFFLFFFFGFFFPPPYLtAPKN 
QFNPPPLCTGGPVPRPKPPIPPWEPPRP 
APPLFSPQWDLGFQIAPFLWVINGGGPS 
GGSRA 


4408 


18309 


A 


4437 


2 


369 


QSCETLFHSWKDVEVCSSALSCLSQVSV 
HLQGLSSLFILPGMEVEQRDSQMALVES 
LEYVEGEISKflMADFTTWKTHLLTSDSQ 
GGNQMLDEGFREDFSEQME lAIEAILCA 


4409 


18310 


A 


4438 




231 


SDTSRPQLPCPAAHSiRFPDGPLSLRPLP 
FFLSQVTNEQIHDLLEPKBPLAIREDPD 
KGVWQGLSFHQVWDmRVG 


4410 


18311 


A 


4439 


1 


365 


MAHAMREVKKCLGPDMMDDlCHEQFIiEL 
SYIiWQVPEPSRGRGVPVRGREAAPPPPP 
VPRGRGVGPPRWALVRGTPVRGAITRGA 
TVTRGVPPPPTVRGAPAPRAITAVMQR I 
PLSPPPATK 


4411 


18312 


A 


4440 


92 


510 


LAALPDGGQELHIPRSRAGPAGELSWAV 
VGTGAIiSSLSILFQMAROlPWPPLRPNS 
PLQGWIFNWPWLVAIPASLFSGFLSDH 
LINQGYRAITVRKLMQGMGLGLSSVFAL 
CLGHTSSFCESWFASASIGLQTFNHR 


4412 


18313 


A 


4441 


2 


266 


FPPHLPELGKWECPWHQCDECSSAAVSF 
CEFS PHS FCKDHEKGALVPSALEGRLCC 
SEHDPMAPVSPEYWSKIKCKWESQDHGE 
EVKE 


4413 


18314 


A 


4442 


2 


279 


KGAELVFLPSNTRSLMYPLDQGWRTFK 
AHYAGYSMERIVSAMEENPGREKIMKKL 
LKLSS PKEI PAGEKLCPNDAHDFTELDD 
RVIHENHE 


4414 


18315 




4443 


2 


371 


KWVTFISLLFLFSATDSRGVSRLDAHTS 
EVAHRFKDLGEENFKALEAIAFAQYLQP 
WPFEHHVELGNDVTESAKTCGADESADN 
CDKSLHTLFGDTLCTVATLRETYGEMAD 
CCSQPEPCEKG 


4415 


18316 


A 




2 


372 


WVTFISLLYLFSSGYSRGVFRRDAHKSE 
VAHRFKDLGQENFKALVLIAFAQYLQQG 
SFEDHVKLMNEVTEFAETCVADESSDDC 
DKSLHTLFGDTLCTVATLRETYGEMADC 
CAQQEPERNEC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 

acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4416 


18317 


A 


4445 


1 


283 


FRKNAEDILTMEGLRATMKHELEAAQKK 
HSLWELLRIPNICKRIWILSFVRSAGVC 
WRSTPDPVCLGITSGGCRTAQI PACS FL 
WKLHSIC3RPA 


4417 


18318 


A 


4446 


173 


2 


KKKKCVLYPEC3HC3VFHALVCVCVCVCVC 
VCVCIQVNFRNFCFISWINLSVQVSLLD 
Y 


4418 


18319 


A 


4447 


104 


22] 


SCHLHYSLQDKSETLSKNHHYHHHHHNH 
HHHPAGYPKMK 


4419 


18320 


A 


4448 


241 


1 


WVSWDKA&SWVReLeSGSRflGVWGGGSP 

PHRVTDVSCSFLPDGRHWEMEAKTDRD 

LCKPVSAFGAAGATVGPQGLGGRC 


4420 


18321 








349 


WICRSHSFGLDWAHACNPSTFGGQGGRI 
MRSGVQDQPGQY 


4421 


18322 




4450 


76 


189 


TESKPAATTRSSGGGGGGGGKRGGKKDD 
SHWWSRFQKV 


4422 


18323 


A 


4451 


40 


359 


SGEPLFPFFFSLARLIPGVCFVEMHTRV 
RLL IQLKIWEKKVPQVSTPTLVEVSRNL 
RKVGSKCCKHPEAKRMPCAEHYLSWLN 
QLCVLHEKTPVSDRVTKCCTES 


4423 


18324 




4452 


2 


371 


KFQNALLVRYTKKVPQGSTPTIVKVSRN 
LGKVGIKCCKHSEAKGMPCftEDYLSWL 
NQVCVLHEKTPASDRVTKCCTESLVNRR 
PCFSALEDYETYVPKECNAETFTFHADI 
CTLYENERLFQ 


4424 


18325 


A 


4453 


118 


369 


GRYILLLKTKKQTANNNIKTPQYLSNMS 
KKFRHSEFFFPYLKWSLaLQRCafLGSLQ 
PTPPRFNQLSCLSLPNSWDYRCVPPNLV 


4425 


18326 


A 


4454 


364 


3 


GHLSLQRLLLPFVWLCPAPRGGAYRGRQ 
ASLSCGGLHPVEASRLLOiPNQACTMAG 
APPPASLLPCSLISDCCRSNQRDSVGVG 
PSEPCAGYNLLVCRFLSPSEKRSPSAGV 
MRFSRCRL 


4426 


18327 


A 


4455 


1 


379 


AAFTECCQAADHAACLLPKLDEiLRDEGK 
ASSARQRLKCaSLQKFGERAFKAWAVAR 
LSQRFPKMFAEVAKLVTDLTKVHTECC 
HGDLLECADnRSDLAKYICENQDSISSK 
LKBCCEKPLLEKSH 


4427 


18328 


A 


4456 


1 


352 


LRPARSLVFPWFAPGGSGLRGLKLLEAK 
CQGDGVS YEETTI PRPSAYHNLFGLPL I 
SRRDAEWLTSRELDSLALNQSTGLPTL 
TLPRGTTCLPPALLPYLEQFRRIVFWLG 
DDLRSWEA 


4428 


18329 


a 


4457 


250 


330 


EWWTHLWLNEGFASWI EYVCVDHCFPE 


4429 


18330 


A 


4459 


96 


319 


YLTIPFSQAMEWCKDARLTTSIGAYNFJM 
HRLLEMILNKPGLTYKPVCWQVSALSLL 
SFLFFMPLFLSYCQVSIH 


4430 


18331 


A 


4460 


50 


339 


NSQTLKLSNVQCLKTRALFFFPKGIYPQ 
IKTLPPCPTSGPDPYVRVNLLPKGKGPC 
LKKI SGNRKTLKPLFDKTLKFLVPMEKV 
PKSPLNVPVKNI 


4431 


18332 


A 


4461 


2 


357 


GYDGVKRWTRDGDIFHKELLLIPIHLEV 
HWSLISVDVRRRTITYFDSQRTLNRRCP 
KHIAKYLQAEAVKKDRLDFHQGWKGYFK 
MYCKHLALSQPFSFTQQDMPKLRRQIYK 
ELCHCKF 


4432 


18333 




4462 


182 


325 


IRLSSWLCDCASCQVYEFRVKESSI lAP 
APAEDVDTPPRKKKRKHRL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 

peptide 
sequence 


Predict- 

last^aniino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pljenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Met)iionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine,V-- Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4433 


18334 


A 


4463 


1 


338 


SRHSPGPTPQPDCSLRTGQRSVQVSDTS 
SCSQLSSSSGSSSSSVAPAAGTWVLQAS 
QCSLTKACRQPPIVFLPKLVYDMVESTD 
SSGLPKAASLLPSPSVMWASSFRPLLSK 


4434 


18335 


A 


4464 


2 


343 


IIDFFLGAYLKDEVLKIMPVQKQTRAGQ 
RTRPKAFVAIGDYNGHVGLGVKCSKEVA 
TAIRGAIILAKLS I VPVERGYWGNKIGK 
PHTVPCKVTGRCGSVLVRLIPAPRGTGI 
VS 


4435 


18336 


A 


4465 


1 


371 


INSTLKMSFVGENSGVKi^GSEDWEKDEP 
QCCLEEIFFALASSLSSASAEKGSPILL 
GVSKGEFCLYCDKDKGQSHPSLQLKKEK 
LMICLAAQKESflRRPFIFYRAQVGSWNML 
ESAAHPGWPIC 


4436 


18337 


A 


4466 


246 




LCFTPLPSYILFFFETKSHSASPKLECS 
GVITATSEVILLGGGGCSELRSCYCTPT 
WVTKIiDSITKKRRKRKRKDLY 


4437 


18338 


A 


4467 




355 


WVTFISLLFLFSSAYSRGVFRRDAHTSE 

VAHRFKDLGEENFKRLALIAFAQYLQQC 

PFKDHVKT.ANEVTKFAK-^CVADESAENC 
DKSLHTLFGDKLCTVATLRETYGEMADC 
CAKQE? 


4438 


18339 


A 


4468 




361 


EDHVKLViiEVTEFGKTCVADESAENCDK 
SLHTLFGDKLCSVATIRETYGEMADCCA 
KHEPERNECFLQHIOJDNPNLPRLVRPEV 
DEMCTAFHDTDETFLKKYLYE lARRHPY 
FYAPELLL 


4439 


18340 


A 


4469 




373 


SSQPSSPSESVSGTVSVSPSSLPSSPCL 
SLSVCLLVCVSLSMFFSLPVCLSVPLCV 
SPRPHPSVSHAHPHHASALVNCPWGPAP 
TSAPVSFYKPQSPALKTGQASPHPHPHP 
FPTPSPISIQSQKK 


4440 


18341 


A 


4470 


236 


371 


KVLGAVLKDKGGLFCFVFGDRVSLCHPG 
WSAWQSQLTAAVTSQA 


4441 


18342 


A 


4471 


345 


201 


AWMTAHLFMAWFTEYFKSTAQKKKFVFK 
TLLLIDNFDPLWDSKEGKL 


4442 


18343 


A 


4472 




144 


AGFHHVGEAGLGLLTSRDPPASPCKRAG 
ITGLiniHTWPFSVILTYTYY 


4443 


18344 


A 


4473 


3 


333 


KNHLEASIGEHPFFPDHSPSCPHLLSSS 
NYHSLSCFLGGFFSETGSAPSPRLECSG 
VIAAHCSLKLLGSTSPLTSASWVAG ITG 
MCWLYSPAVSPRPSPDLTCTSQLENI 


4444 


18345 


A 


4474 


288 


1 


TSAVKTRIS IKSQADQEATEQKLRDYSHQ 
GIQILAWQKFEYKPGKVTMDPEDCILQL 
AKKKEEEEEEEEEEEEEEEEEEEEEEED 
QEEEEEEGEEEE 


4445 


18346 


A 


4475 


3 


124 


PRLECSGMIMAHCSLDLLDSSDPPTLAS 
QVAGTTKKISLH 


4446 


18347 


A 


4476 


196 


2 


SCYEHPGRIVYGFLFLRQDLAVTQAGVP 
WHDHGSLQPL PPELKRFSGT SHLL SS QG 
YRRAQPRR 


4447 


18348 


A 


4477 


3 


162 


LLERNKELEGSLQQMYSTNEEQVQE lEV 
RALHVLAHTPRGPLGLCVPLCNLE 


4448 


18349 


A 


4478 


167 


2 


ACG I S P F S L S S S C FGHVRHAS L PLAFHH 
DCGFPEASPAMLPVQPSQLNLFIKYPV 


4449 


18350 


A 


4479 


195 


26 


WVSMLDIHDNGPHEVAVICFVCFLSQSL 
ALIiPRLECSGVISAHCNLQPGRQSQTPS 


4450 


18351 


A 


4480 


175 




NRSATKCDWGEKRTSQELNLFFWFFVCL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

s^'etce 


Predict- 
ed end 

otide 

lociition 

correspon 

last amino 

Sdueof 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FEMESHSVAQAGVQWRNPGSLKPLPPRF 
KR 


4451 


18352 


A 


4481 


122 


357 


YHAFLYQWYNLKREHKRFRCSLL I I FLG 
GERKLQSIQLWNISHYIQIRRYVLEIPS 
GLGAVRHaCNPSTLGGQGEWIT 


4452 


18353 


A 


4482 


137 


^ 


FYSYLKILEEMGPHYVPQASLELLGSRD 
PPDSAYQSTEIIGMSHH 


4453 
4454~' 


18354 


A 


4483 


347 


208 


GQAGLELLTSSLPASASQSAGITGAGHH 
TRPRIYFLKKYLAFVLGS 


18355 


A 


4484 


411 


268 


RLNRRGGGCBEPRSCRCTPAWATEQNS I 
SKKNQQKTTKIjPLCCRLILL 


4455 


18356 


A 


4485 


2 


102 


PCLLKMQQLAGHGGMHLCSQLLRELECF 
LHSYP 


4456 


18357 


A 


4486 




113 


HSAHSTHAGHAGllTSLPKCWDYRStePPR 
LAKFLNTRY 


4457 


18358 


A 


4487 


163 


353 


QVAATLWNLAVLYGKHGKYKEAEPLCKR 
ALEIREKVQKKGAWFFEIFVLCVCGFF 
FFFKRDP 


4458 


18359 


A 


4488 


408 


307 


RGCSEPRSHDCSAAWLTERDSVSKKlSrKlSr 
KSPKKQ 

YSINFSiiLIFTiFTLGVLVWAAIINYHR 
LHGLHNKHLLIMIWMVGSPRLSCG 


4459 


18360 


A 


4489 


If] ^ 


T79 


4460 


18361 


A 


4490 


229 


345 


LENETLGRARWLTPVIPALWEAHASRSR 
GQEIETILANM 


4461 


18362 


A 


4491 


180 


365 


ENNLKTLVALDMSFYFILFLFYFILFYF 
ILFYFLRWSFALVAQAGVQWRYLGSLQP 
LPPRFK 


4462 


18363 


A 


4492 


269 


385 


GNLQNLEKVFYKIQHPFMIKTLNKLVID 
RTYLNIIKAIY 


4463 


18364 


A 


4493 


103 


336 


QLLVSLINLFLQAFKLQIDCGHTCMHFW 
RQNLCLLPPTLPTPRRKLTLSPSLECSG 
TIWAHRKLRLPGSHHSPASATR 


4464 


18365 


A 


4494 


305 


379 


YIKIENFCLGWAHTCNPSTLGGRG 


4465 


18366 


A 


4495 




224 


ETATKTQRRWETALVPQHPHRPIjAPVTW 
VKTTGNS T I ANFLL I LVYLS FLFLVYS C 
IQQ 


4466 


18367 


A 


4496 


3 


129 


ITLNPGGRGCSELRSCHCTPAWATERDS 
VSKKICKICFPPPPF 


4467 


1836S 




4497 


396 


112 


LPELEFEAAVSHDCASELQLGHQSKTLF 
QKKINRIINNNNNSHIAISLLGIYPKEF 
KAGTQRDICTPMITASLFTIVKRQKQPK 
CLATNELVYRM 


4468 


18369 


A 


4498 


1 


152 


LSLPGSQTWWRAIIVPATWEAEVGGSPE 
PRRSRLQVAMITPLHSSCGRRG 


4469 


18370 


A 


4499 


92 


2 


KRPGWTHACNPSTLG6RSGPITKSWR 

DQ 


4470 


18371 




4500 


116 


1 


AGIAGMCLHRQLIFLYFLVETRFCHLGQ 
AGLELLTLWS 


4471 


18372 


A 


4501 


99 


1 


QGTPVHRWWKCKLVQPLWRTVWSFLRKV 
KIDNS 


4472 


18373 


A 


4502 


140 


288 


RQPHYVAQAGLELLGFSNLPALTSQSAG 
ITGASHHNQPTSEFLKEKKIG 


4473 


18374 


A 


4503 


330 


181 


FFFFLFFFFFFFFFFFF}{XFFFF?FFFF 


4474 


18375 


A 


4504 


171 


2 


VETVSKESSLRICLQDIRYFFFETKPHS 
lAQAAVQWHYQGSLQPQPLRLKQSSRLS 


4475 


18376 


A 


4505 


345 


74 


SVERTCHSPKPLMLFLPLVFOaRDDIUr 
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SEQU) 
NO: of 
nncleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 

peptide 
sequence 


Predict- 

location 
correspon 

last^amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
JE=Glutamic Add, I^PIienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=]Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GSHPVSFEKACEFQGFQaQIQFQPHVEH 
KHKPG FLEACHP YQTAEPASALTLDFPA 

S3TVQK 


4476 


18377 


A 


4506 


258 


102 


XNiaELWFGAVAHACNPSTPGGRGRWIT 
RSGBVJLNLRTCEHVILQQKGKFRFLIS 


4477 


18378 


A 


4507 


1 


67 


GLPKCWDYRREPPHPARKICKYF 


4478 


18379 


A 


4508 


339 


180 


ANFFVFFEEIVIGFHRVAQTGLE VLGS SDL 
PILASKSAGITGISYSVRPQTNTE 


4479 


18380 


A 


4509 


174 


3 


NEPKWIYI IPGDPLFFLLLLFFETESH 
YVTQAGVQWHDLQSLQPLPPRLQRFSCV 


4480 


18381 


A 


4510 


84 


3 


AHHIFTVGIDVDTRAYFTSATI I lAI 


4481 


18382 


A 


4511 


127 


2 


LLIKIHCWPGWAEiACaSPSTLGVRGRWI 
TRSGVQDQPEQYG 


4482 


18383 


A 


4512 


109 


345 


EIFLIQGHEGSCLCFLFGSFMCFVATGF 
PHVTWLILTLSNLPTSSYSRAGITGVSH 
HARAMNLFCPGSMPSHGITGLISrF 


4483 


18384 


A 


4513 


290 


13 


RDLRPTPQKGPKMHTAPNPLQRAQTHVK 
KGRGPTPPTPTPSKDPGPHPPLRKPKPW 














RKIQHSRF 


4484 


18385 


A 


45)4 


234 


362 


DGDVLSKFVNFFS FKWEGTPLIDGRRL 
KYRLNGDFLFLRLKL 


4485 


18386 


A 


4515 


190 


330 


CLALQSECFRRWSFALIAQAGVQWRDLS 
SPQPLPPVFKRFSSLHPFE 


4486 


18387 


A 


4516 


2 


199 


.cvcvcvcvcvcve I FFFFLSKKWGPPPL 
CPPGGPKTLGPKESPPLAPKWRGNSGLT 
PPPPPTFFF 


4487 


18388 


A 


4517 


94 


339 


SAYLGLKFFLFFFFFFFFFFFGVFPPFF 
FPNKKKLKPPQIFFPPQKRPPIFFFPPP 
PGGKWFFFFGKFFFNPREKGPPKWLG 


4488 


18389 


A 


4518 


95 


3 


LYLGPGAVAHACNPSTLGGRGGQITRSG 

VQ 


4489 


18390 


A 


4519 


14 


121 


LIFFGRNRISLCCPVWSQTPGLKPSSWL 

SLPKCSD 


4490 


18391 


A 


4520 


207 


2 


I SREKLCTVRIPDMHVIEDS ILFYYLFY 
SLALPPRLECNGTIIVIAHCSLHLPGSSNP 
PASASQVAGTTS 


4491 


18392 


A 


4521 


169 


1 


ANLTQVGSYYSLSLHTLIQVTFFKNIRA 
TOPGTVAHACNPRTLGGPDGWITRSGDR 


4492 


18393 


A 


4522 




82 


GFILCVCVCVCVCVCVCVCVLYFIFF 


4493 


18394 


A 


4523 


300 


3 


LFLPPSPLPSQIPGGQKPLPLIFPPRVP 
KKFIPPNFWAPPLISPRPFWVEFLGPPT 
QGENYPLGPGPPPPPPFSLGFKKSPNPF 
FIKKKFSPPPKKQK 


4494 


18395 


A 


4524 


348 


94 


KPGEG6K6PQI!TPPP6GXiKP6NPLNPQGQ 
GSPEPNFPPWPPPRGAKKKTRPQKKKKK 
KKGRKESRLHYGMEKRMKEKEKVFLSAL 

N 


4495 


18396 


A 


4525 


313 


161 


RGHPKNLKSKEVNIRREflPGHPRRNKKK 


4496 


18397 


A 


4526 


118 


3 


FFFFLRGSFALVAQAWQWRYLSSLQAL 
PPGFKRFSCP 


4497 


18398 


A 


4527 


153 


2 


KDAINKMKKWPGWaHA(iNPSfLGGQGG 
WITRSGAQDQPSHYGETPSLPK 


4498 


18399 


A 


4528 


258 


2 


GGQKRGLPGFIKKPPFRKIFLrarLGEGP 
GKNSLSKFFFFFFFETESHCLTQAEVQW 
LNLSSQQNKPPRFKQFIiCLNHPSSWDYR 



827 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














R 


4499 


18400 


A 


4529 


296 


39 


SIRIINIQILLGFFETGCHSVAQAHLKL 
LGSNDPPASASESTEITEMQHCTRLSIQ 
IFVAFPySWIKLSTKKEPILIKLLLYWC 
K 


4500 


18401 


A 


4530 


292 


37 


SIRIINIQILLGFCETGCHSVAQAHLKL 
LGSMDPPASASESTEITEMQQCTRLSIL 
IFVAFPYSNNKLSTKKESILIKLLLYNR 

K 


4501 


18402 


A 


4531 


3 


138 


GLWNGSCLCTCVCIFMGMGACLVCiCTC 
LYCCVPVNTCLCMDGR 


4502 


18403 


A 
A 


4532 


380 


2 


GGPPKKRESAKAPGGHNPQGGGREKTPP 
PKKKRPPQKGGGKKTPPGGGGGPPRGGG 
KKKPPPKREKECHRGGRGGGGPPWTPK 
EGPPFFFFFFFFFFFFSLPFFFFFCFFI 
FILFFPNIVINGFY 


4503 


18404 


4533 


3 


191 


KKKKKKKKKKKKKKKQGGQFKKTLQGHN 
FSRGLE 


4504 


18405 


A 


4534 


370 


48 


GPAPEVSSNGGKGGGCPKGLGKPLGSQF 
NGVKGYPSPKSSPLFQAPLGGQGFPKPR 
GNGGTPPFFPKKKKKKCALGDGTPTSGC 
PRRSWMHPETQDGCGGWGSHVPR 


4505 


18406 


A 


4535 


147 


1 


KGPPLFFFFFFFFFFFFFFFFFFFFFFF 
FFFFSFYFYIIYLFFLSFLRL 


4506 


18407 


A 


4536 


198 


1 


NPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFLSFIFIFLYFNFI 
SSWIVflGFY 


4507 


18408 


A 


4537 


2 


356 


QNRTTGWSKTQSASPPPKECSASPATEQ 
SWMENDFDELREEGFTRSNYSELREDIQ 
TKGKEVENFEKHLHECITTITNTEKCLQ 
ELTELKTTARELRDECRSLTSRRDQLAE 
RVSALE 


4508 


18409 


A 


4538 


347 


220 


KCSGTIIAQCNIKLLGSSNPPVSASQSA 
GITGMSHHVQPKYC 


4509 


18410 


A 


4539 


308 


2 


FSISCPHDTPAAASQSAGITGVSHCARP 
IFSFICFFFICIMCIKYLLRTGDSVMYF 
HGLGCFGLCCYLESYIRKLSHMKKLSIL 
ITSREKCKCILQINIGDR 


4510 


18411 


A 


4540 


2 


194 


I KQWSTALSPRLECSQTITAHCSLKLP 
GSASASWARTTGTCHHAQLIYFSETGS 

HYVDVDAA 


4511 


18412 


A 


4541 


101 


2 


MEENLGNTIQDIGRGKDFTSKTPKAMAT 
KAKID 


4512 


18413 


A 


4542 


1 


84 


AEEAEAEAEAEEEEHEHEBEEEEEEIKY 


4513 


18414 


A 


4543 


3 


183 


KIQILFFVSS KTRLRPSAWHPRFVFCFL 
RWSLALAAQAGVQWHHLSSLQPLLPGFK 
VDAA 


4514 


18415 


A 


4544 


-J 


335 


RKKKKKEKKKEKKKKKQGRKKEICHERKE 
EEAKKKTKNKDK 


4515 


18416 


A 


4545 


104 




QKKKKKKKKKKKKKKKKKKIYDFFLQIN 

CMCSHK 


4516 


18417 


A 


4546 


5 


108 


FKRLSCRSFPSSWDYRREPPRPAMFCIL 
SRDGIS 


4517 


18418 


A 


4547 


131 


1 


RPRRPQVQLQDATSPIIEELITFHDHAL 
IIIFLICFLVLYALF 


4518 


18419 


A 


4548 


156 


3 


KGGQGMSCLSRSPSALLKFECRVISSYW 
AWWRAPWPATRGAEAGESLEP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 

last amino 

Br' 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoptian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4519 


18420 


A 


4549 


2 


340 


LNPGSAGCRELRSHHCTPAWTTERGSVS 
KKKKKKKKKKKKKKKKKKKTFFGGGPLW 
ERE FLLGNFWEKKGGGAM FF PLENPNG P 
QI.KTWGKKftFFPRKGGGPLNPPQGFYQG 
A 


4520 


18421 


A 

— 


4550 


165 


335 


PHYLPRRS I IHRPFKSHSPEPGAVAHAC 
NPSTLGGRGGWITSGQEFETSLINMEKS 
L 


4521 


18422 


— 


4551 


123 


3 


GRVDRLSCLSLLNSWDYGHLPLHLMJFC 
IFSRDGVLSSW 


4522 


18423 




4552 


3 


131 


HPEVDIPILPGLGI ISDIGAYYSGISEP 
FGYVGMDVSYDPVRL 


4523 


18424 


A 


4553 


87 


364 


ILASQSWFRYAASAASGGLLETESQPS 
AaGTASAAVPSLEftQTPAAFRPTPPGLA 
PSSPPAPGPPAPTGKVCGRHFLRSESSA 
SGPLRALL 


4524 


1R425 


A 


4554 


114 


0 


LTWAPHSLIETIRNQIIQALLITILLGL 
YFTLLQRSE 


4525 


18426 




4555 


91 


3 


TTTTYIAIATACVCVCVCACVCPCVCVC 
M 


4526 


18427 


A 


4556 


239 


364 


CIKITFSRPGAWHACNPSTLGGQGGRI 
AMSHDRDHPGQQGK 


4527 


18428 


A 


4557 


250 


1 


STEHVKQLRNEFMKISVILIFLEILLYI 
KVHLKLCTLTTDHLIiGIKSLKILFRPG 
AVAHACNPSTLGGRGGRIPRSGDRDHS 


4528 


18429 


A 


4558 


2 


90 


QDGLDLLTWGSACLSLPKCWDYRSEPSL 
F 


4529 


18430 


A 


4559 


65 


394 


DPVSKKKKKKKKIKNEKEKKFFGKKKPL 
KKKLRADLILKPKKKNFLKKREWKKREK 
GGDPFWKNKPQVGKTGNNLNliGQNGLGK 
GPSDLKKNTPETKLNLVQKKFQIFF 


4530 


18431 


A 


4560 


93 


2 


GI I SAHCNLCLPGS EDSPASASQVAGIT 
GQ 


4531 


18432 


A 


4561 


125 




GAYILYLNKKSKSIRPGMVAHACNLSTL 
GGRGGRITRSGD 


4532 


18433 


A 


4562 


3 


84 


RSCHCTPAWATERETLSPKKKKKKNF 


4533 


18434 


A 


4563 


145 


3 


GFYHWPGGVEQSPCLSLPKYWNYHCEP 
PPLASIGFFFFFFETQSRS 


4534 


18435 




4564 


125 


388 


IRKPQILYSPQSSENMQHLHISVLMGVT 
GKRGKFMDILDVTERCJRFSLKKCTGIQE 
DLNKWEGIPCLWIIRLNTVKMAVFSKLI 

CRFS 


4535 


18436 


A 


4565 


59 


325 


VARKGNLVSFLVIFYPLFDIEEVIEPLW 
VLISSFDEGEKVELEFFFFFFFKKKSPF 
VTQAEFQWPNLSSLKPQPPRLKQFFCLG 

LLIT 


4536 


18437 


- 


4566 


325 


3 


PEKEFRRLVIKLIREAPEKGEAQCKEIQ 
KSIQEVKGEIFKEIDRIKKKQLKHQETL 
D I LLVMQNALE SLSNRI EQVE ERNS ELE 
DKVFDLTQSNKDKAKRIRKYEQS 


4537 


18438 


A 


4567 


333 


209 


EQAFDPYGTLSQGI FSPKFHEPFHKLMA 
RPAVALPFQVMLK 


4538 


18439 


A 


4568 


125 




HNQSMSRSIDCSDPILGHCNLCLPGSSD 
SPASASQVAGTTG 


4539 


18440 


A 


4569 


212 


2 


GVWHACNPNSLVGQGGRFMRSGVLEQH 
EQHAETPSLLKTLK 


4540 18441 


A 


4570 


142 


14 


GRVDRSNPGRFLSTSNSSLYERPRE IRP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 

residue of 
sequence 


Predict- 
ed end 

otide 

location 

correspon 

last^mino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylaianine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TSQAPSPVNHIIST 


4541 


18442 


A 


4571 


2 


419 


ARDS FLHDSQTSFCFSDS IPTPSNMEET 
QQKSNLELLRISLLLIESRLEPVRYLRS 
MFANWLVHDTSDSDDYHLLKDLEEGIQT 
LMGRLEDGSRRTGQ ILKQTYSKCDTNSH 
NHDALLTNYGLLYCFRKDMDKVETFLR 


4542 


18443 


A 


4572 


3 


194 


HEGRLQSQLLRRLRQKNCLNTGGGGCS E 
PRSHHSFPAWVTEHGFISQISLKPLFIS 
HPEPFLL 


4543 


18444 




4573 


2 


286 


CRDGLTYNDFLILPGYIDFTADHVDLTS 
ALTKRITLKTPLVYSPMDTVTEAVMAIA 
MALTGGI GF IHHNCTPEFffiVHEVRKVlIK 
YEQGFITDPW 


4544 


18445 


A 


4374 


44 


325 


RQWLQEEAYITKEQKYSFLHDSQTSLCF 
SDSIPTSSNMEETQQKSNLEIiHISLLL 
lESRLEPVRFLRSTFTNNLVYDTSDSDD 
YHLLQDLEEG 


4545 


18446 


A 


4575 


374 


33 


LRLAERSGASLC6SRSGGPPTCANICPR 
YAPLIPPOITSLLEAQGLVMWGPDFRV 
RLVTSSERRFRYQPPPPGFNRFPCLSLP 
SNWDYRHRPPRLANFNSLFFFFFLVETG 
Y 


4546 


18447 


A 


4576 


400 


225 


PIFFWFFLFVLFLVETHLCaiWQAGLE 
LLGSSNLPISASQSAKITGMSHPAWPRA 
EF 


4547 


18448 


A 


4577 


2 


372 


ARETIFCFSNSIPAPWNMDETRQKSNLQ 
LLRISLLILLYRTRliTLPCITQLHILQT 
YBCVNARCSHFHSSSTCGPLYSCTLYVSL 
TGLDMKLCHSTLLKQNSRLMSLLKIQVir 
LNQVSPVFNiffl 


4548 


18449 




4578 


349 


174 


GCGEPRSHHCTPAWATRftKRCLKKKLKL 
QLKNWKESKHQPLQNFSHLTTSTINTQ 
EN 


4549 


18450 




4579 


3 


124 


HELLNPEGEGCSE PRSCHCTLAWVTRAR 
LRLKKKKKKKKN 


4550 


18451 




4580 


181 


1 


MSSYYFSPCSRLLMATVYNLHLPSSSNS 
PASASQVAGNAGARHYAWLIFVLLVETG 

LCSC 


4551 


18452 


A 


4581 


227 


3 


HRKENRCVSRTALLFSLGDGVS PCLKI I 
1 1 IKVLFWLGEVAHACSPSPLGSRGGRI 
TRSGVEDQPSQHGETPSC 


4552 


18453 




4582 


257 


2 


RVEQLGAFSLSPGSGSFSKILGALKIMG 
PPRAGPVGSPRYSRPLGGPGRQI PRSGD 
PGPPGLPRGNPVPLKKNPKRSLLILPAR 
A 


4353 
"4554 


18454 
T8455 


A 


4583 


3 


94 


HEIAMAFLILTERKILGYIQLRKGPNW 


A 


4584 


345 


50 


I YI YI CI FLNRNRVSLHCPGWSKTPGLK 
QSSHLGLPKCWDYSHEPLCPAKTSHFKA 
QDRIPKNDPSCTPSPAQMSTNLIGWASS 
YLQSYSPAPYNNKN 


4555 


18455 


A 


4585 


338 


19 


FQLSRRFRPKNGGNPGSGACSEPRLGPC 
PSAWVTKRDSVSKKKKEYHTLSGLSTTG 
lYFSQFWMLGVQGASRFGVWGGAVSWSI 
DGAVLLRPHMAEGARQLSGLGF 


4556 


18457 


A 


4586 


2 


224 


ARGVGRAGLELQTSGDPPASASRGAGIl 
GVSHRVQLLGDRAKFSLKKKKRGI J LPG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 


Predict- 

otide 

location 

correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


4557 


18458 


A 


4587 


344 


168 


PPGTWHTYNPSTIiGGQAGRIGSLGVQD 
KPGQHRQGLSLGVEDKPGQHSETPSVQS 

NP 


4558 


18459 


A 


4588 


212 


328 


GLAVAQAGLELLSSSNPPASASQSAGIT 
GVSHHTQPYF 


4559 

T56(r 


18460 

18461 


A 

"A 


4589 

4591) 


no 


323 


LSNQKNQLYNPNLLIYCWRKLKSSYDK 
QAEVQGSKLGSLQALPHGFTPLSGLNLP 

SSWEYTRPPPRPTNF 

ARGCS EP3SRHCTPAWATEQVS KKKKKG 

REKTFGQFFFWVI 


4561 


18462 


A 


4591 


211 


2 


ARGSLEAGSSTPAWATINQSTNKKTDKT 
ALYWiCKMPSRTYKRRQKSMPGFKSSKDR 
LTLFLGAKVSCLV 


4562 


18463 


A 


4592 


3 


120 


HEKTKTMNPIKBCWAKDMNEQLSKEDNDN 
KINGWPGAVF 


4563 


18464 


A 


4593 


388 


227 


GHVGQDGLELLTSDAPPSLflSQSAGITG 
VSHRAQPKWLLFQHRSPCLPEAAGK 


4564 


18465 


A 


4594 


1 


348 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PLPPPPPPPPPSSPSSSPPNPPATTSVA 
C 


4565 


18466 


A 


4595 


2 


138 


ARAARGFRHVGQAGLELLTSGD P PAS AS 
RSAGITGVSHHTQPTNF 


4366 


18467 


A 


4596 


315 


2 


ALKPKYSSTSMNIKARLPQYNINNLNLA 
PLYSSTSNQINIIVIILINKENNHTLFP 
CLVSNSRLKQSSHLGLPKCWDYRRELLC 
LACDKFFISYSAPAIQLLSC 


4567 


18468 


A 


4597 




146 


GTSVDQGSLELLTSGDPPASASQSAEIT 
GVSHRTQPPLT I IQLYLGYC 


4568 


18469 


A 


4598 


293 


3 


GGTPWPRKKLGQRKFPHRPPGGSGNPDF 
SLLFFFFLRQSFALVAQAGVQWCDLGS L 
QPPPPVNPFLLRKQFGFSTSCFPILPAS 
GSFFGREASASC 


4569 


18470 


A 


4599 


174 


2 


RSVAPFLKSARPQTYRKEPTPDTGSPQP 
PPPRFKRFS CLTRLS SWDYEHAP PHPAS 
C 


4570 


18471 


A 


4600 


59 


252 


LAP FGHE LGTS E SAHQTDNRHRL EGYQK 
RLDASGLERASYPLAAEFKVGGRGCSSQ 
PLTLCGYP 


4571 


18472 


A 


4601 


233 


366 


IHYLPDAVAHACNPSTLHGQGGRITRSR 
DRDYPGQHGETPFLLK 


4572 


18473 


A 


4602 


182 


325 


LFVFCLFLRQCXSVAQAGVQWHDHCSLY 
PQPPSLKQSSRLGLPSSWDY 


4573 


18474 


A 


4603 


202 


2 


GGSSSGLLRPCAVLGLKQSSYLSLVRRW 
DHNSWLKRSTHLGLPKGWDYRREPPHLA 
CILKFSINLV 


4574 


18475 


A 


4604 


317 


2 


FRLGPQVFFSP PGAGFFFCGPVWGP GP F 
SRSLQLLTFGAKFFFCLSLPESFGAPLR 
GPPSGFMAHCFFFLVLLFFFFFFHDSVL 
LCCPGWSAVAIHRRDPTTARA 


4575 


18476 


A 


4605 


233 


71 


FFVLLVE lEDQLNEKKQEGKIRKKKRMK 
RKKQSLQKIWDYVERPNLAGHSGSRL 


4576 


18477 


A 


4606 


114 


I 


HPRPARVSSCCPNWSAMVWSRLAATSAS 
RVQAILQSRA 


4577 


18478 


A 


4607 


348 


3 


LKFQNFRGWHGARCPS FFGGLGRGI P F 
TPEVEVLVWLGPAPALPFWPPSQTLFSQ 
NQKKNQKKKQERKREKERKEERKKKKER 
KKECEKERERICEGRKKGQEKKRKI TAKSL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 

correspond 

residue of 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

re!fdueof 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop eodon,/=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














LG 


4578 


18479 


A 


4608 


174 


3 


TWMWQQCVFVCLLKQGLALS PRLECT 
GI IIAHCNLKLLGPSDPPTSVEKTKKLV 


4579 


18480 


A 


4609 


171 


334 


DPHSSLNIRFSRSVAQAGVQWCNLNSLQ 
PPPPGFQRFLCFSLPSSWDYRCa.PP 


4580 


18481 


A 


4610 


2 


164 


ARGSRGHIWSAFCHVGQAGLELLTSRPA 
LASQSAGITGISHRAQPPLNTLYSMS 


4581 


18482 


A 


4611 


2 


164 


AREPRGHIWSSFCHVGQAGLELLTSRPA 
LASQSAGITGISHRAQPPLSTLDSMS 


4582 


18483 


A 


4612 


180 


334 


CCLMGSSYLLPRKPNTLRTAGFLQQRKS 
LIRPGMWHaCNPSTLGGQGGWI 


4583 


18484 


A 


4613 


189 




SGFSRWENGLKCKGQKGGQSLVLLPIR 
VFFFFFFEMKSHSFAQfiEVQWGNLGSLQ 
PLPLGFK 


4584 


18485 


A 


4614 


216 


358 


KSTPRLGTVAHTCUPSTLGGQGKRITRS 
GDRDHSGLHKGTPRLSKNI 


4585 


18486 


A 


4615 


179 


393 


EKQEEEGSWGWAKTEDGVRLMLSEAIGS . 
CWIKAFLTDAETE-rFHADICTLSEKEKQ 
IKKQTM.VELVKHKP 


4586 


18487 


A 


4616 


2 


126 


ARVQRYLFLFFTLDGPLPDPSMIRGSVP 
NQMMPRITPQSGK 


4587 


18488 


A 


4617 


338 


181 


RVGEAGLELLALSDLPASDSQSAGITDI 
SHHTRPGSPFLEHHLYPRYRISGK 


4588 


18489 




4618 


3 


197 


HEKRPEVQNSTVEFIASSDYMLRPPLPL 
PFCFFLFFFKIKMGVSPCCPGSCQTPSL 

KQSSCLKV 


4589 


18490 


A 


4619 


, 1 


339 


GTRTFGSGNGPTKPDLLQELRVATERGL 
VIVNCTHCLQGAVTTDYAAGMAMAGAGV 
I SGFDMTS EAALAKLS YVLGQPGLSLDV 
RKELLTKDLRGEMTPPSVEERRPSLQGN 

T 


4590 


18491 


A 


4620 


3 


306 


LEPGDGGCSELKSCHCTPAWA/TERDS IS 
KKKKKKKKGGPLKGTLGGPKFNGGWQRK 
IFSQKGGQKKPNLGILGRNLNFGGEKNW 
NKFLTKI KVLREKKNF 


4591 


18492 


A 


4621 


397 


29 


NTKEKILSARKKWQVFPPHTQKKNFFPK 
KRKNFREGIPPNFPPPKKSPPQKNHPPQ 
KKKPPKEKKKTIARPPKKRAPPGAFKKK 
PPPFFFLSSLCISYSFIYFFSLINLFFF 
FFFFHDSTVY 


4592 


18493 


A 


4622 


2 


235 


VSLCHPGGTISTHYNFCLPGSRDPPTSA 
S AYALL I FVF FVE TGFPLVAQAGP KLQG 
TNS I PAS ASQSAGI TGMNHCT 


4593 


18494 


A 


4623 


209 


324 


ILRVLWSGTVAHACNPSTLGARGRQIMR 
SGVRDQPGQY 


4594 


18495 


A 


4624 


52 


287 


NPFSFLFFSFLSFFLFFFFLEKGPRRGA 
RAQKGKPHPKKKKKKKEGGGKKKKTQKP 
FSPLKGPRGPPRGKGQTTPKGP 


4595 


18495 


A 


4625 


102 


1 


THSFPVSISLSLSHT'HTHTHTHTbTEIY 


4596 


18497 


A 


4626 


125 


1 


DKHVPQHTANFFWLIEMGFLHVGQAGL 
KLLTSGKPLVPLV 


4597 


18498 


A 


4627 


3 


167 


ERLRQQNCTNPHSEGCSEPGSPHCTPAA 
WATEQDSVSKKEKEKRKKEKKKKIW 


4598 


18499 


A 


4628 


135 


3 


AASTLSRGTITAQGNLCLPVSSNSPASA 
SRVAG ITGAHHDTWIi 


4599 


18500 


A 


4629 


320 


210 


WFLGI IFRRVENAESVNIDLTYDIQSFT 
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peptide 
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residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, if=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
l^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *^=Stop codon,/=possible 
nucleotide deletion, \=possible 














DTGKTPSS 


4600 


18501 


A 


4630 


98 


326 


S S P TLAFQFQYFFS CATKEAVKLS FPKS 
LLGTGSRSVAQPGEQWHNHGSLQPQPAQ 
LRLS SCLTLVSGWDHRCAPP 


4601 


18502 


A 


4631 


275 


2 


RVKLLLFYMRQSCSVIQVESSSTVNGSL 
QPQPPGPRQSSHVSLLSRWDHKRMPPCL 
ANFFLFFWRAKGLAILPVHSNLVMTESP 
LISIKSN 


4602 


18503 


A 


4632 


180 


1 


KGLTLYFYFFIFKMRRGLftMLRDWSQTP 
ELNQFSCLGLPKCWDFRHEPPHPVKEFK 
PFHR 


4603 


18504 


A 


4633 


234 


5 


KPLRDKPLRSFSNLPKKNLNFFWPFFPQ 
KIWGFFFFFFETESHCVTQftGVQWLDLS 
SLQPPSPGIiRQFLCLSLSSS 


4604 


18505 


A 


4634 


171 


2 


GRVDrLTPWQHFETLFSFFSFFWQGLSV 
TRLECSGVIMAYCSLDLLGSGNPPTSAS 


4605 


18506 


A 


4635 


324 


211 


LRNI I FMPVTMNPGMYTGGCMCVCVCVC 
VCVCLCVCVC 


4606 


18507 


A 


4636 


95 


"1 


MRQS INIiSPRLECSGTIPAHCNLCLPGS 
SNS 


4607 


18508 


A 


4637 


234 


1 


lYKVWKNSPNYSSLTHHINNIKPSFTRE 
NTLMFIHLSP ILLLSLNPDI IT 


4608 


18509 




4638 


2 


243 


WGKVNVDEVGGEALGRLLWYPWTHMYF 
EFFGAFYISFCLFFLIFPHLFFVSSIHI 
VSSFYLLFYNFPIIFFTLFCHFFL 


4609 


18510 




4639 


118 


2 


GRVDLQCSGAISAHCKLHLQRSRHSPAS 
ASQVGAHATT 


4610- 


18511 


A 


4640 


50 


168 


NKINKMHLITFLKKKKKKKKKKKKKKKK 
KKKKKKKKKKN 


4611 


18512 


A 


4641 


315 


87 


G I FLRDRiSTLALLPRLECSGTI I AQCNLE 
LLS PKDS PAS PS PVARTTGVQPRPDTLT 
LFELIYIRYKGMIFSQGDC 


4612 


18513 


A 


4642 


383 


2 


KFKKKKFAGGKAPARLSPPLWGPKGGGP 
PRAGVLSPPGPPGETPFFLKMPKFPGHG 
GGPRKSPFFGGLKGENSLTREKGGPINP 
NLAP PAFP PKGQNPGPFPKKKKKKKNRS 
SPAWHWDWIGTNAGK 


4613 


18514 


A 


4643 


3 


205 
[i 


LKRFFCFSLGGQGCSEPRSCQCIPAWET 
QRDS I S KKKKKKKTRPGGLTLE PPPFGG 
VKPKGPLGAGI , 


4614 


18515 


A 


4644 


149 




FFP IKADLVFFFFFLRRNMDSQGAVQW 
HDLGSLQAPPPGFMPLSCLSL 


4615 


18516 


A 


4645 


474 


344 


AIHEHYLPESSASGDCDEDLQAACTILV 
NALMDFHQRLAAHTV 


4616 


18517 


A 


4646 


165 


357 


LCNALSHLLPQCTPTPLISILCIIDTIS 
LLIQPITHAVRLSANITACHLLMHLILL 
STLSICTL 


4617 


18518 


A 


4647 


2 


339 


ARGENDLSPGGGGCSELRLHHCTPTWAT 
EVM CVSQKKKKNQPKPKPNQTKPNQTRK 
GSPGFPLLWLKVISPEKSIMPNILKTAP 
KVTFRUWLTPE I PPFWKMEEKGS FDPR I 


4618 


18519 


A 


4648 


313 


187 


LTSGDPPASASQSAGITGVSHHALPPLV 
QFPSHSLTPTPLFC 


4619 


18520 


A 


4649 


236 


324 


GQVMWPGMMAHACNPSTIGGRSRWIVRS 
R 


4620 


18521 


A 


4650 


225 


313 


GQVMWPGMMAHACNPSTLGGRGRWIVRS 
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peptide 
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ding to 
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peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, impossible 
nucleotide insertion 


4621 


18522 


A 


4651 


240 


76 


FCVHAHTHTHTPSFQASGSSEHMHTYTH 
PSGSPEHTHTYTHTTFLASRSAEPQLP 


4622 


18523 


A 


4652 


273 


379 


KQLAAQCVRPLS PSQAIASPVTCTYMPR 
WPEVTEE 


4623 


18524 


A 


4653 


369 


189 


GTVKQAGAQCRNLGSLQPTPPPGFKQFS 
CKSLYSIMICYSKDCTMSTTLGIRJaOTO 
LHL 


4624 


18525 


A 


4654 


168 


346 


DRICRASSYTIHLMLYFVFLIFLFYWYI 
YFLFFVLFHFTIFIILFYSFIILLLLLY 
Fm 


4625 


18526 


A 


4655 


56 


354 


VJKRTFPFFFFLETGSGFASRPCTLLLFL 
YSLGPNPRVLIRVFLFLSSSRCYYYYLF 
LGLSLLFLFFYIMYLLFIRLFFFVLS6L 
r.rjjLSSYYRFIIFFF 


4626 


18327 


A 


4656 


50 


186 


SACLGLPKCWDYRHKALRPAMYLLTGSS 
YVAOAGLNLLGSADPPE 


4627 


18528 




4657 


131 


366 


DHFVASS ESVKEEGLFFVFVFFFFIFI F 
FFILYICFFYPKFIFIILLYIIIIPIFI 
FYFFYLYYLILIFFLIFFIFIG 


4628 


18529 


A 


4658 


290 


155 


LEYIVEKGFHQDDKASPELLTSRYLSDS 
ASQSAGI TGHRAWPTKG 


4629 


18530 


A 


4659 


163 


372 


LRLFALFAFLFMFFFYFLFLLYVFCFVF 
FLFFFFTFFLYYFFLYFFLYFLCSLVFL 
FFICFIFFWFFVFF 


4630 


18531 


A 


4660 


227 


378 


FIFFSNCTSTALVHPLFYLSELVLSEaSTH 
NKFIGRARWLTPVFPAIiWEaEA 


4631 


18532 


A 


4661 


3 


141 


rsrhctpawQqse ilsqkerkkerkkkk 
erkkekerkkerkkkkkk 


4632 


18533 


A 


4662 


346 


3 


TFKFLGGVCPFFSPPKKRVFSKMSPGGF 
FLPPFLGENFFFAPPVKFGPPGGFFKGG 
PPFFFFFFFFFFFFFFFFFFFFFFFFSQ 
DSFDNLGIFVIPYKIQDDFSSSMSNATS 
ILIKIAL 


4633 


18534 


A 


4663 


302 


3 


GDINRRSHSPFVWSKPFAPCFMSSLDVF 
YKS SDFFYFFFETEFHS CCPGG I SAYCN 
liCLPGSSNS PASASQVAGI TGVRHHTQL 
IFASWETFKITYKN 


4634 


18535 




4664 


2 


76 


RLAPTPQLGGHWPPTGITPLWPLEV 


4635 


18536 


A 


4665 


1 


210 


S PTRS PKPTPPYY'QTTLAKPFTKKKKKK 
KKKKKKKKKKKKKRGGAPLKKS PGGPKI 
NRGKKKKFFFLKGG 


4636 


18537 


A 


4666 


2 


356 


GSRPGGGKRTARRWTRPWRLRGACLGTG 
MAFAFHHQVIYKIKSFLRKKKKKKKKKK 




























GGFLKKLWGGKFFMGGGKI PPFFIRGGK 

KKMWGG 


4637 


18538 


A 


4667 


1 


323 


RTRGIFCYDVCVESGCADIQLLLLCLKK 
KKKKKKKKKKKKKKKGAPLKKNPGGAQH 
NPGKKKKNFSPKRGPKKTPPGNFEKKPF 
FGGGPNGAPPPQKNKTPKEKKKF 


4638 


18539 


A 


4668 


167 


446 


ELNKGLCKPKKKKKKGGGRSQFKPTRPK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KS 


4639 


18540 


A 


4669 


321 


52 


GVFSPFSPKKFFFSPKGLNFWGGFAPIF 
SPPKKRFFSIQJPQGGFKyPPLRGKILIW 
GPRENWGPPGGFLKGAPLFFFFFFFFFF 
FFFFWG 


4640 


18541 


A 


4670 


82 


359 


NAYGIILIKGNKDTKTVKWKSSCQLANT 
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peptide 
sequence 


Predict- 
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last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=rsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WEFHIRALLGFNCFFLIFKRRGLTLLPR 
LDCNGVIIIHNNLESLGLSNPPMLASQV 


4641 


18542 


A 


4671 


249 


60 


LLIFIYYFYFFLFFFLFFFFFFFFFFFF 
LFVQVHLPVHLPCYDLSPLYKCVGVLVK 
CPFKYT 


4642 


18543 


A 


4672 


2 


319 


AAASTLVFVFSFFWVLGFFFFFFFFFWG 
PPKKI LKKKGGGGGGGGGGPRGGPRGGG 
KRNPGGQKKGI GGANGGQKKERNFQGGG 
P PRGE KRGGGGGPGN P KGGGL K 


4643 


18544 


A 


4673 


3 


451 


HES FNPVI I SQLERGEDPWVLDHKGAKK 
SQGLWSDYSDNLKYDHTTACTQQDSLSC 
PWECETKGESQNTDLS P KPL I S EQTVIL 
GKTPLGRIDQENNETKQSFCLSPNSVDH 
REVQVLSQSMPLTPHQAVPSGEKPYMCV 
ECGKCFGRS 


4644 


18545 


A 


4674 


446 


189 


SISLSTLVNVCQLISLSDPASGFVDHVC 
CIFTFYSINFALILIFLFYLGVLWAAI 

TWYHRLHGT.NTKHLLIMIWMVGSPRLSC 


4645 


18546 


A 


4675 


26 


170 


HM7WLVI\fMIWA33RQI ISHWKNCTRiT" 
DCPVCLPLKNASDKRNQQSK 


■4646 


18547 


A 


4676 


3 


378 


HEDEFDKMGNQHQALLEAMEQQS I SLAK 
AGWCILPARTSIIAVAHSFAGLYLIFN 
TVSWFFFFSNFlFlIIiSLFWYFCISiy 
ILFLIIFIFIFISSLIVFFCLISFRFLL 
FIFFSALSRIFIY 


4647 


18548 


A 


4677 


18 


242 


PIGLALSPRLECSGVIIflHLLGSNNPPT 
SGSRVAGTTGIHHHAQPHCENCIFPMSS 
EVBCLFICLLLIHILSRfl 


4648 


18549 


A 


4678 


288 


2 


PKYRWVPEKPTTASENAEIPSERQKEGI 
KLTIRISSRKKKPDSPPKVLEPETKQEK 
PEKEEEKTNVGRTIiRRSPRISRPPAKVa 
QIRDPKftHKKK 


4649 


18550 


A 


4679 


340 


208 


SVGQAGLELLTSGDPPATASQSAGITGL 
SQCYPPDSVPETFLPL 


4650 


18551 


A 


4680 


4 


379 


EENTSll!ILHDLeFGNePLDMAPKA.QATT 
AKKKKKKKKKIKIKSFLFLGGLFKKGKK 
TPPKGEKKFGNFKFWGGDLTPKI FGDPG 
ASTKKKKPFLKGGRGLKKTFFKKKKKRG 
PHPLESPSPFSVF 


4651 


18552 


A 


4681 


1 


350 


SRFSGAKEAKMEKKITGYTTVDI SQWHR 
KEHFEAFQSVAQCTYHQTVQLDITAFLK 














MAMKDGELVIWDSVHPCYTVFHEQTETF 

SSLW 


4652 


18553 


A 


4682 


365 


45 


SPKWFVASYDPYAGPGRPTRPDNSRDET 
GSHSVSESSSGSDSKRESTSSDREANEP 
SHRAS PE PE PP PTtJKWQLDNWLIMKTO 


4653 


18554 


A 


4683 


238 


372 


SATPLISSFSFLFFFFFFFREGVWLCHP 
GWSAVSRDCTTAAVWPG 


4654 


18555 


A 


4684 


3 


462 


KT KESREAVE KE FE P LLKWMKDRALKDK 
I E KAWS QR LTE S P CALVAS Q YG WS GNM 
ERIMKAQAYQTGKDI STNYYASQKKTFE 
INPRHPLIRDMLRRIKEDEDDKTVLDLA 
VGL FETATLRSGYLLPDTKAYGDRI ERM 
LRLSLNIDPDARG 


4655 


18556 


A 


4686 


410 


0 


FFFFFFFFFFFLLGKGGFGI I KFL 
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beginning 

nucleotide 

location 

correspond 

Imi'notfd 
residue of 
peptide 
sequence 


Predict- 

location 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4656 


18557 


A 


4687 


151 


2 


IFFFFFDRVQWHKLGSLQPLPPRFQQFS 
CLSLPSSWEYRHAPPRVRPRV 


4657 


18358 


A 


4688 


342 


183 


FFLFLRWSFALVPQAGVQWCDLGSLQPL 
SPGFKRFSWAEVGGLHEPRSQRLQ 


4658 

^659 


18559 

"18560 


A 


4689 

4690"" 


126 

— 


404 

12S 


MCQVWPLRFTGHSVFSVHPQWFILFYCF 
ALYFIFLEMGSYSVTQAGVQWHSIiGSLQ 

FDGKVPWV 
"SESLFYIV3TRD'A^J<ERDQDDSDWL' ' 
LEKKKYEVPLILLL 


4660 


18561 


A 


4691 


407 


2 


YFFFPLPRMKIFFFPHQKGFFFGGCPI I 
SPPPNEVFFSPKEGGFFFSPPKKHVFFH 
WPPEGFFPPPPFFMSPPPPFFFFFFSLF 
FFFFFFFFFFFFFFFFFFFFFFFFFSFR 
IILLNQKKNLSACVELRSCCTFM 


4661 


18562 




4692 


1 


176 


SRRLNPC3GRGCSEPRSHHCTSTWSTERD 






SVSKKKKBCRFFALKFLGEMSrVKPTIGGQ 
KN 


4662 


18563 


A 


4693 




403 


SKAEIGRELGLLRQTVSQWNAKEKFLK 
EVKSATPIMNTRMIRKRNSLIADMEKVLV 
VWIKDQTSKNIPLSQSLIQNKALTLFNS 
MKAERQVEAAEEKFEASRGWFMRFKERS 
HFHNIK2\C2GEAASADVEaAASY 


4663 


18564 


A 


4694 


343 


425 


HCFLSLQIiNSFEQLCINYTNEKLQQLF 


4664 


18565 


A 


4695 


395 


2 


KNKGLFFFFFIiRWSFTLVTQASNVTTAV 
CNVTILAHCNIiHLPGSSNSPASASRVAG 

ITGSCHQG 


4665 


18566 


A 


4696 


429 


0 


VTKWCYITKKKKItKKKKKKKKKKKKKKK 

KKARG 


4666 


18567 


A 


4697 


371 


217 


IPVFKQSSCLSFPRSWDYRHMPPCVTQK 
KKI E YQHI SLYKKP SKRCALFL 1 


4667 


18568 


A 


4698 


415 


59 


LQQQCECFISELTKKKKKKKKKKKKKKK 
KRGGGLEKKKKKGFFAPYPPH 


4668 


18569 


A 


4699 


431 


0 


LKKIFPPGGRVLSPSPPPPPPPPPPFPP 


4669 


18570 


A 


4700 


371 


245 


TLGD P PASAS QS AG ITGVSHHAQ PLFG I 
FIYKFTYICRKTRT 


4670 


18571 


A 


4701 


410 


0 


KQCWFTQKKKKKKKKKKKKKKKKKKKKK 
DRG 


4671 


18572 


A 


4702 


340 


402 


P FAGGG YRLGAAPE EES AYVA 


4672 


18573 


A 


4703 


188 


2 


TSLPKCWEYRHHPPCPAHTLliIFLFFIF 
VDMGSCHVAQVALEIiLGSSDPPTSASQS 
AGIIGM 


4673 


18574 


A 


4704 


423 


34 








KKMVGGGNYKNVGGAQILGYKKIiCFSFF 

YRVF 


4674 


18373 


A 


4705 


2 


297 


PRVREREKEMASMKEEFTRLKEALEKSE 
ARRKELEEKMVSLLQEKNDLQLQVQAEQ 
DNLADAEERCDQLI KNKI QLEAKVKEMN 
ERLEDEEEMNRSLP 


4675 


18376 


A 


4706 


2 


179 


QENGMNPGSRACCEPRLRYCTSSWATER 
DSVSKKKEKKRS2iSIiRRFDILFGALTGR 

Kas 


4676 


18577 


A 


4707 


432 


578 


PRIVCFQELRKPSYAEICQRTSKEPPSS 
PLQPSSSSSSSS 


4677 


18578 


A 


4708 


419 


35 


GSFFFFFLKDQPKKGRKKEGCPPPPPKK 
I lYPPPATKWGGGGGKKPPPPKKRGPPK 
KNRGRKKBCPPKKKKKKFPPPPKKRAPQK 
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location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

iiucle- 

iocation 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
TC^GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histicline, I=Isoleucine, 
K=Lysine, L=Leucine, M=Meth ionine, 
N=Asparagine, i»=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKPPPPIFFYLYFFKNFI FFFFFFFY 


4678 


18579 


A 


4709 


147 


31 


NFFFFFFFFFXFXFFFFFXXXKFFLFFF 
*FFFLKLKIFF 


4679 


18580 


A 


4710 


3 


229 


HAS AHAS DQ PE RLNAGT YFLF YTLVGS L 
PLLIALIYTHKTLGSLNILLLTLTAQEL 
SKKKKKGGAVLKNPWGAQS 


4680 


18581 


A 


4711 


1 


158 


PTRPPTRPRSCSELRSHHCTPAWVTERL 
CLKKKKKKFPNLSVQFKILGILKQ 


4681 


18582 


A 


4712 


163 


404 


KKKKKKKKKKKKKKKKKKKKKKKKKKKP 














KKGGCSAQNRftGGKRRPSRGGGEKQN 


4682 


18583 


A 


4713 


3 


378 


FEEPVYIKSRQKRKESNPPKLVSSQPHG 
LHDFFKKKKKKKKKKKKKKKKDGGGPLK 
KKPGGAKNKPGGEKKNFFLKGGGKKNPW 
GFFKKKTFFGGGKIGAKPPKKKKSLKKK 
KKFLRQKGQKKT 


4683 


18584 


A 


4714 


2 


208 


VSNPAVSVPHLPFSVyKSSPMftSMTFSK 
KKKKKTKBCKKKKKKKKKKKKKKKKGGPQ 
KKKPWGAQNKPG 


4684 


18585 


A 


4715 


290 


56 


SAPPPIFFLFFFLFLFIPFIFPFPFKlQI 
LEKKGGPPPFFLFFFFIFFFFFFYRMGC 
DRWWFILIQGTFHRKQKPVLKV 


4685 


18586 


A 


4716 


50 


424 


GGFKIKFLFTQFLFIiFSPSMFLISKSP 
AYLWQSSPKQVPMWFGKTNFLVSIECK 
KEGIYFFCIPFVPFGFPHPKILCSLLPF 
LAPLRHSLALLPRLECSGAILTHCaiPCL 
PGSSNSHAWA 


4686 


18587 




4717 


161 


2 


GRPGPADFRVRPQLLQRFLFIYLFTEME 
SCSVTQflGVQWCNLGSLQPLPPGLQ 


4687 


18588 


A 


4718 


2 


115 


VYTTAMAIEEYPEAWGSGVEVLVSVLVG 
LflMEVGSTRP 


4688 


18589 


A 


4719 




416 


GNQGGGYGGGYDNYGGGNYGSGNPSSSS 
SSSSSSSSSSSSSSSSSSSSSSSSSSSS 

sssssssssssssssssss 


4689 


18590 


A 


4720 


1 


454 


QPDSGISSIRSFKTILPAAAQDVYYRDE 
IGNVSTSHLLILDDSVEMEIRPRFPLFG 
GWKTHYIVGYNLPSYEYLYWLGDQYALK 
MRFVDHVFDEQVIDSLTVKIILPEGAKN 
lEIDSPYEISRAPDELHYTYLDTFGRPV 
IVAYKKNLVEQ 


4690 


18591 


A 


4721 


3 


173 


DAWAGFHYVGQAGLELLTSSHLPASASQ 
SAGITDVSHRTQLDVPFSFSLSPLPHQC 


4691 


18592 


A 


4722 


2 


201 


LKPGGGGCSEPRSCHCSTPAWRQSETPS 
QKKKKGKWGLERIGWGGSTTWNGAKKQR 


4692 


18593 


A 


4723 


2 


183 


LKPGGGGCS EPRS CHCS TPAWRQSETPS 
QKKKKGKWGLE 1 1 CWGGS P TNKRATNS P 
VEKF 


4693 


18594 


A 


4724 


1 


395 


QDATSPIIEELITFHDHALIIIFLICFL 
DLHALSLTLPSALT 


4694 


18595 


A 


4725 


388 


3 


SFSPLEENLGNTIQDIGRGKDFTSKTPK 
AMATKAKID 


4695 


18596 


A 


4726 


223 


44 


WLFFFXPLFFFFFWGGI IFFLLKKLYI F 
FFFMSKIFFFFFFFFFFFFFFFFFFFFF 
FFFF 


4696 


18597 


A 


4727 


41 


374 


KKKKKGKKKKKGGGGGGNPKKKKGKMEK 
PPGGNKRGEGGKKKNFLKKKGGGLFKGK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Trypfoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KPPKKKNGGRKPWKKKNFKKKKKKAPFF 
LGPKI FKKKKKRGLKKKLGFFSQSVGP 


4697 


18598 


A 


4728 


323 


69 


FFFLRQSLGSASQAGAQGHNLGSLQPLP 
AGFQLFLQPPPPGSKQ FGRLNLLGNKDL 
WGLPVLRAHPQAPPLAFIL 


4698 

^9 


18599 

T8600 


A 


4729 

473(J ^ 


3 


197 


CLALLHLSRQRQYALPLLGLPGCPHIHT 
HSKTHMDHTHYTS PHAHRHAHTHTHMH 

TLPYTTHCI 
^=LTLS?RLE^SMM^YSKKLLGSSDH 
PTSASQSGGIIGMSYMA 


4700 


18601 


A 


4731 


3 


172 


DAWAFFNGPGVSLCCTGWSRTPQLKRST 
CLSLPKCWDYRPEPLSPAPKTLFSYVLL 


4701 


18602 


A 


4732 


98 


232 


QTVPLLIWLIALCSPGVLQTVKRFLEFR 
CIFSLGIVTFICLYRN 


4702 


18603 


A 


4733 


238 


1 


LGPPPPSPLGPPPFLVIFRGFSFPKTPL 














VTQAGVQWRDLGSLQPLPPGFKE 


4703 


18604 


A 


4734 


121 


424 


LIQGHRWIYFSQLRKVRGRQACDSTLL 

MPVDWLVDQHHWRYQTGKWVQCGKAEG 
TMPGAR 


4704 


18605 


A 


4735 


319 


8 


NPPQKKKKKIFFPPKKKTPPKKIKKNPP 
PFTIFFFFFFFFFFFFFFFFFFFFFFFF 
FFYFQHVWFCFEIIHVIFFLIEVLFLAF 
HVGDS lADAWSTKNSNRTK 


4705 


18606 


A 


4736 


297 


442 


TQGFFGGGPRKIGFFFRGLKNGGGGNPK 
LRFLKPGQLVGRGKTKKNQG 


4706 


18607 


A 


4737 


430 


0 










4707 


18608 


A 


4738 


137 


1 


IHY YFFLKQGFPLLPRLECSGAI lAHCS 
LEFPGSSDPRPSASQVA 


4708 


18609 




4739 




3 


ISPKALNPRREVGPIFPPPKKKVPPQNP 
PGGFFTP PHKEKNFSLPPPVNLGPPKDP 

LKRPP 


4709 


18610 


A 


4740 


188 




TSLPKCWDYRHEPPCPAHTLLIFLFFIF 
VDMGS CHVAQVALELLGSSDP PTSASQS 
AGI IGM 


4710 


18611 


A 


4741 


410 


I 


RRPGGGFCPPPPPKIFFSPPPPPFFVGV 
LPQTPPPPKKFFFLWPPPGFFFPPPKKK 
KIFFFSPPFFSPPPIFFFPPPPPFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
GKDRVSLCCSDWSRTPGPKPSSLL 


4711 


18612 


A 


4742 


225 


444 


ENLKLKSFPCSHFHWDRWFNLTIGALSR 
S FDEAALLFFQNDTYWPGTVAHACNPST 
LGGRGGRITKSRDRHHS 


4712 


18613 


A 


4743 


2 


452 


VSNPAVSVPHLPFSVYKSSPMASMTFSK 
KKKKKTKKKKKKKKKKKiOCKKKKSGGPQ 
KKKKR 


4713 


18614 


A 


4744 


3 


144 


LAAVLLKLGGYGI IRLTLIMPLTKHIA 
YPILLLSLNPDIITGFSS 


4714 


18615 


A 


4745 


206 




IQNPQTERVNSTSKLKISSKDTINRAKK 
QPMDWEVICKKHIFGKELISRMYKELQL 
NNKKQIVPPTRP 


4715 


18616 


A 


4746 


523 


178 


RHRRFFMDIEREQVKEQQRQKEQKKDIE 
KI KDKREQECYAAEQRILRMNFHEDPYS 
GEKLS E ILAQLQLQE IKGTREKQQREKE 
YLRYVEALRAQ IQEKMQLYWI TLPPLC 


4716 


18617 


A 


4747 


3 


518 


ENRUXLEGGGCGEPRSRHCTPTWAT 
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SEQH> 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspon 

last^amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nui^leotide deletion, \=possible 
nui^leotide insertion 


4717 


18618 


A 


4748 


491 


145 


LCHVGQAGLELLTSSDLPTLASRSAGIA 
GVSHRTRPHSHFYFYFLYSSSFFFFFFL 
AQRGNRAPRGKKTPFGLATLNI KS WGGV 
PWFLFLGFGFLKREPTFALGLKDRGGNL 
V 


4718 


18619 


A 


4749 


6 


362 


NKLPNGTEFLSSLVPNTFFFSFLFFPFF 
FFFFFWGKGGPPRGGGAKFYPLEFPPPG 
VPPPPKGGKKGGPPPPTNFLFFKKEGGS 
PFGPGGVKTPGPKGFPPPSPQRGGKKGQ 
WAQKPP 


4719 


18620 


A 


4750 


380 


3 


SLLLAFALLCLPWLQEAGAVQTVPLSRL 
FDHAMLQAHRAHHLAIDTYQEFEETYIP 
KDQKYSFVHDSQTSFCFSDSIPTPSNME 
EPQHKSNLELLRISLLLIESWLEPVRFL 
RSMFANNLVYDPS 


4720 


18621 


A 


4751 


1 


345 


LFYIFKKTWSLFLCFLWMMCVFIFFFFL 
liBVFFLCLGGILVSFIPPGGEKREKKKR 
RGGEKKKKKKNWFKKKKKNPWGGGGGL 
KKQRARGGKKKKKKGGGGSGGGKKKQSP 
LRG 


4721 


18622 


A 


4752 




407 


QEFEETYIPKDQKYSFLHDSQTSFCFSD 
S IPTPSNMEETQQKSNLELLRISLLLIE 
SWLEPVRFLRSMFAKNLVYDTSDSDDYH 
LliKDLEEGIQTLMGVRVflPGVTNPGTPL 
ASRAGGEKYCCPLFSNKaRDQEN 


4722 


18623 




4753 


357 


1 


PLFFFKRNLFFFFFFTKKKEKKAPVFFF 

FPPPSREWGGFFFRVGGVNTQFLHPFFF 

SQNTSVFFSTLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFLFYCFIYFSR 
DRGLTLL 


4723 


18624 


A 


4754 




207 


LGLAGVEEEVASRLNAGGRGCSKPRSHD 
WTPAWAMEQDSMS KKTKMEYQRRQLIHL 
QSSTSGFLYVHEQ 


4724 


18625 


A 


4755 


2 


336 


HEERERERERERERERERERERERERES 
RAQFRARKNSQAPHPRVCETLWTEEGGS 
FYPLTEKDFAQCERAHFCVLGVSHPPGW 
GLeKKMNPECERATHSLHGDADISGVL 


4725 


18626 


A 


4756 


230 


446 


VGFSSLPSRYCTILISASSLTKELCEKK 
RSFLFCFAFWLLGFLRQSLALSPRLEYN 
GAISAHCmHLPGSSD 


4726 


18627 


A 


4757 


3 


390 


QTSFCFSDSGPTPFHMEETQQKSNLELL 
RISLLLIESWLEPVRFLRSMFATOH.VYD 
TSDSDDYHLLKDLEEGIQTLMGRLEDGS 
RRTGQILKQTySKFDTNSHNHDALLKNY 
GLLYCFRKDMDKVETPL 


4727 


18628 


A 


4758 


234 


387 


LAKIFLSFSSATGWIENRPQSPATGRTP 
VFVSPIPPPLPPPFPFAFVT 


4728 


18629 


A 


4759 


396 


0 


I XTHTHTHTHTHT Y X YMYVFM 1 1 


4729 


18630 


A 


4760 


176 


384 


MVI PLFNKDTQHI I LFWGYVL IFFFLKR 
NFVFVAQAGGQGRYFGSLKPPPPHLKPF 
SCLSLLGTWDYRR 


4730 


18631 


A 


4761 


419 


208 


ITPPPPPWEPKGGSFLKKKKKKKKPGPG 
VPPFIPPPLGGQAGGFPNSKIQIPPAPQ 
GKPLFFQKNQNYLA 


4731 


18632 


A 


4762 


158 


2 


GNQLNPPPPSGSAFCFLFFLRRSIiALVT 
QARVQWHDLGSLQPSPSGFKRFSC 


4732 


18633 


A 


4763 


489 


398 


A6FELLTSSDPPASASQSVGITGMSHCT 
QP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

otide 

location 

corrcspon 

last amino 

rrifdueof 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4733 


18634 


A 


4764 


420 


1 


FFFFFLGEIFPPGGKGGVFPPPPLKKFF 
FS PRGLFFLGGGGPNFPTPKKRVFFKKP 
EAGFFPPPGVLKHGPGPGFLTPPQKEKN 
FFFPPPGVLGPPKGFFKGGPPFFFFFFF 
FFFLGKKKKKKALFFTKKKGGGGTVRKA 


4734 


18635 


A 


4765 


167 


3 


lYFFFFLKKI FFYKKKINKLKKKKKKKK 
KKKKXKKKKKKKKRAAARDLELRHAS 


4735 


18636 


A 


4766 


424 


222 


FQNIHKKNFPSPPLLKTGPAPLFKKPPK 
KKKKYF?PPP3KLGPPREFLKGPPPFFF 
FFFFFFFFFFF 


4736 


18637 


A 


4767 


254 


423 


VIKSWARRNKNKWDWPGAVALSCNPSTL 
GGPC3QANPLRSGVQDHPC3QHGETLSLLK 


4737 


18638 


A 


4768 


363 


0 


KTTTSOaEGPTSVIPAAQETEAGDSSNP 
SGYGSYSSSSSSSP 


4738 


1S639 


A 


4769 


51 


169 


YYTHTHBRTCFVKKKKKKKKKKKKKKKK 
KKKKKKRGGGF 


4739 


1S640 


A 


4770 


188 


3 


ISPKAUUPHREVGPIFPPPKKKVPPQNP 
PGGFFTPPQKEKNFTLPPPVNLGPPRDP 
LKRPP 


4740 


18641 


A 


4771 


1 


131 


KLDKENALDEAEQAEADKKAAARGKQLE 
DELGSLQKKKKGGPF 


4741 


18642 


A 


4772 


363 


452 


RLECSGMIIXHCSLNLPGSSDSPTSASQ 

LA 


4742 


18643 


» 


4773 


121 


1 


RPRRPVAQAGVQWHNLSSLQPLPPGFKQ 
FFHLSLLSSWDY 


4743 


18644 


A 


4774 


31 


226 


MI SSQLS I I PTPTQEQGLKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKGGGLLKKILG 
GAKFLGGEKKNFFFF 


4744 


18645 


A 


4775 


161 


^ 


KPPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFLFFFFLKKSWRP 
WGLKSAPAL 


4745 


18646 


A 


4776 


394 


67 


SPPRGEGGVAPPPPPKNFFFPRAVFFLG 
GGGGPMPPPQKRGFFPKTPGGVFKPPPK 
GGKFFFPPPGEGGPPPEFFKGPPPPFFF 
















4746 


18647 


A 


4777 


2 


409 


GVCIFSGHKHNKAPLPFSVLYSIQYINS 














KKT 


4747 


18648 


A 


4778 


456 


0 


FFYKKKKKKKKKKKKKKKKKKKKSSS 


4748 


18649 


A 


4779 


438 


0 


LL KI DKKKKKKKKKKKKKKKKKKRTSKR 

GG 


4749 


18650 


A 


4780 


58 


330 


S P FYQLKKKKKKKKKKKKKKKKKKKKKK 














KKKKKKKKKKKKKKKKKKKKKKKKKKKK 














KKKKKKKKKKKKKPQQKGGEKKKNPGEL 
GKKNCGGGGVLVGKPHKEKKRGGGKKKN 

FSG 


4750 


18651 


A 


4781 


435 


73 


FFPPPPRGKFFFKKPPGGKLFSPPELGV 
FFPPFPLKIFFFPSTPFFFWGGFPPFSP 
PPKKVFFPKFPPGFFFPPPLKKKFFFSP 
PPFFLPPPGFFFSPPPPFFFFFFFFFFF 
FFFFFFFSP 


4751 


18652 


A 


4782 


420 


109 


LFKKFFSPTEREGVFPPQPPKNFFFSPN 
PLFFGGGGAQNFPPQKKNFFPKNPPGFF 
FSPPKKKKKFFSPPGKILAPPGYFFKGP 
PPFFFFFFF FFFFFFFFFFF 


4752 


18653 


A 


4783 


230 


2 


LGLWGALYSPFLGGLGGRKFLLPWRPR 
VPLTP ISPPPSRLGGP PKKKKSQAWWHA 
PWPTTREAEAGGSLEPHHL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=A]anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Jsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib]e 
nucleotide insertion 


4753 


18654 


A 


4784 


271 


440 


VLYSKRDNNKSFKNNTYTGIGPGAVAHA 
CGPSTLGGRGGQ IMRSGVRDKP YRHGET 


4754 


18655 


A 


4785 


2 


266 


KKYGIIIFCSFFLSLFLKDQGLAPLPRL 
ECSWFAAHRGLGLLGWSDPPKSAFLVA 
GITGMHHHAWLIFKVQFSNLMGPWSYTW 
SAVD 


4755 


18656 


A 


4786 


216 


435 


HRSGDSSLKKKKKKKKKKKKKKKKKKKK 
KKKKKIWEFWKGPFLWGGKKRKSGGG 


4756 


18657 


A 


4787 


249 


437 


MNNEWPLKTETSRPDWaHACNPSTLG 
GRGGQITRLGVQDQPCQHGEAPSLPENR 
KYLAWSV 


4757 


18658 


A 


4788 


107 


368 


AHGSLDFLPGGPKKSHQWFPKPKDGKPP 














GGYPRGGDPGGPFFFKKPGQKKKTPFLL 
RRD 


4758 


18659 


A 


4789 


1 


225 


PTRPFITNNIFPASPFQTTIPLYLKLTA 
LAVTPLGLLTTLDLNYLTNKLKIKSPLC 


4759 


18660 


A 


4790 


2 


216 


SGRNSRVDGRQEAACRDYQSSLEDIjTFN 
SKPHINMLTILAEENLPFAKEIVSIjIEA 
QTAKVFIHPftASYYF 


4760 


18661 


A 


4791 


389 


30 


ekksppqkkkkkffppppqkkffsppnq 
ktcggggpkkpppkkkgspqknpkgvkk 
pppkkkkkifpppqkkgpppkilkkppp 
pfffffffffffffpffffyfslffsgg 

FSYKIFL 


4761 


18662 


A 


4792 


164 


3 


lfpgffnpgvkgekgisfppffffetes 

hsviqargqwrrlgslhplppgfkr 


4762 


18663 


A 


4793 


550 


413 


aemgfhhvgqaglelltssdppalasqs 
agitgishraqphrpic 


4763 


18664 


A 


4794 


456 


352 


rsqdglylltssfaclglpkcwdyrrep 


4764 


18665 


A 


4795 


242 


3 


kpfsggggafqtaffitfgkkfwsrlg 
llkkkvflkflpqaffffffeteshsva 
qagvqwrslgsmqpsppgfkrls 


4765 


18666 


A 


4796 


218 


346 


gfscfvfetvslcrpgwsawrsrltat 
sasrvqaiilpqppe 


4766 


18667 


A 


4797 


383 


80 


KFFFNQGIGGFFPPFPPKNFFFSLKPFF 
FLGGFSPFFPPPKKIFFSKIPPGFFFSP 
PLKKKIFFFPPRLILAPPRFFFKAPPPF 
FFFFFFFFFFFFFFRPI 


4767 


18668 


A 


4798 


1 


126 


ILIKLSSTWEGIQAGKELEEQHGIHCNM 
TLLFSFAQAVACAE 


4768 


18669 


A 


4799 


81 


356 


ILFFFFFFPGGGTPKKSQVFFKNPQNPI 
GKPTRMGKGEKPPPPRGKKKKKGGPPTL 
GKNKNPVPGWPGAFLPPLEEKKKPFPGP 


4769 


18670 


A 


4800 


1 


99 


GCS EP RS RHCTLS WATRAKLHL I KKKKK 
KKKGA 


4770 


18671 


A 


4801 


238 


5 


NHQNSVQNYFKNLRIFQLIYSLSMNLPR 
SSFKLKSMTHS ISFSFLFSFFEIQSHSV 
AQDRI KWHDLGSLQPQPPGLK 


4771 


18672 


A 


4802 


166 


62 


I FLFSFFLPLRDRVLLCYVGWSVMVQST 
QTPGLK 


4772 


18673 


A 


4803 


102 


6 


aastteSpSvahagvqwcdlcSslkpgsr 

RFQ 


4773 


18674 


A 


4804 


17 


374 


FGSRAATLFFFFFFLGRGGFFLGPRGPK 
KKTPGGEGFTRFFDPGWGKGPEKKTPPP 
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SEQID 
NO: of 
nncleotide 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

s'equfnce 


Predict- 
ed end 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WGGKGGPKKMNLWGAGKIKFLEIKGKSF 
PRGTTPNFFGPPLFFSPPKTQGPGNFLD 
FSFKLKN 


4774 


18675 


A 


4805 


59 


175 


SFYFPLMMKKKKKKKKKKKKKKKKRGGA 
LKKKTGGGPN 


4775 


18676 


A 


4806 


406 


300 


SSGVIYSRAPTYFCGQTLTFEQVDIKTS 
HLPQEAE 


4776 


18677 


A 


4807 


123 




KSIAWLGTMAHSCtrPSTLGGQDGQTTRS 
GVRDQPGKHGETP 


4777 


18678 


A 


4808 


319 


406 


LYKSYSSWPNWAHTC3SIPNTLGGQGGQI 
I 


4778 


18679 


A 


4809 


7.36 


351 


GFWPAPGVHAOilPSIIiGGQGGRVTGPKL 
GNLLGQHGEI 


4779 


18680 


A 


4810 


173 


2 


EVLFLGFRSltlSLLQMNYLLYLIAKFA 
FVAQflGVQWHNLSSLLPPPPRFKRPSCL 
R 


4780 


18681 


A 


4811 


2 


94 


TRILKVWQKRKCSVKNGFLTISHGTVS I 
LF 


4781 


18682 


A 


4812 


358 


2 


FFFFFSETESCSVAQAGVQWEDLGSL 


4782 


18683 


A 


4813 


362 


3 


YSEVCIISLQQAHDSLSVINFHLAECVI 
QLPVNIiRELPGAVAHSPLGSELCFSDSG 
SVPSHTRSFPKAAIRLTANHRRGCTESG 

WPFSLVFLPERCMWPGTVAHTCNPSTLG 
AQGGQIT 


4783 


18684 


A 


4814 


341 


1 


WNQKRAHITKSILSQKNKAGGIMLPDFK 
LYYKATVTKTAWYW YQNHD IDQWMRTE P 
SETMPHIYNYLIFDKPDKNKKWGKDSLF 
NKWCWENWLATCRKLELDPFLTPYTKIN 
S 


4784 


18685 


A 


4815 


2 


283 


VYTGEEI LQKKE I T IRDQEAP YLLRNLS 
DHTVAISSSTTLDCHANC3VPEPQITWFK 
NHHKIQQEPELYTSTSPSSSSSSPLSSS 
SSSSSSSSS 


4785 


18686 


A 


4816 


266 


3 


AASTLQAHRAHQLAIDTYQEFEETYIPK 
DQKYSFLHDSQSLRQPLHTHTGSVGTRT 
LAASLSSGRTAHTRTHMHTQHTRKRTAH 
TNA 


4786 


18687 


A 


4817 


365 


1 


SFFFFFSETESES IGELEFSGTl SAHCN 
LRLPDSSWSPASA 


4787 


18688 


A 


4818 


87 


372 


ASYKKDKYCMIPLIRVIKFPQTERKKWL 
SGLGGEGNKELLFNGCNMAVFLVLGGGF 
ETEPPSVTQAGEQWCDLSSLKPLPHRLE 
RVSCLSLPSGW 


4788 


18689 


A 


4819 


5 


175 


FAPLTIPLSRLFDNAMLRARRLYQLAYD 
TYQEFEEAYILKEQQDSFLQNPQTSLCF 

S 


4789 


18690 


A 


4820 


235 


351 


FFXFFFXFFXFFFFFFFFFXFFFXFXFF 
XFFXFFFFFXF 


4790 


18691 


A 


4821 


2 


155 


IFLLLIESWLEPVRFLKSMFAWNLVYDT 
SDSDDYHLLKDLEEGIQTLMGRL 


4791 


18692 


A 


4822 


90 


2 


LFFFFFETESPSVTQAGVQ5JHDLGSLQP 


4792 


18693 


A 


4823 


247 


3 


KVKGSPSHRRAMVAKGNLSLHERMKSPG 
ISNYMGNSKILFSYFKNCWPGTVAQ&YN 
PSTLGRGGWITRSGVRDQPGQYSET 


"4793 


18694 


A 


4824 


93 


407 


QHPKKKECKWGGRGPLKNKEKVFKHKTFN 




























GLLKKFWGGPKFPGGKKKFFFPFGGGKK 
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SEQID 
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Predict- 
ed end 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 


od 


nucleotide 
location 

ing to first 

peptide 
sequence 


otide 

correspon 
ding to 
last amino 

residue of 

sequence 


E=Glutamic Acid, F=PhenyIaIanine, 
G=Gly.cine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NPPGVFLKKNFFWGGEKKKKK 


4794 


18695 


A 


4826 


2 


129 


TTAHCSLNLPGSSDPPTSRSLVAGTTHV 

Ci-INAWVTJFVFCRRG 


4795 


18695 




4827 




140 


CVGGVQKFTFPPPPQLSRSLFFLFFFAP 
PPETDNPFFFS 


4796 


18697 


A 


4828 


1 


259 


GGGCSEPRSCHCTPAVJVTEYDKKKKKKG 
RGKGKKKIGKGGFQRKFKSPNPHGRFLR 
ESHNSPKKGFPKKFFGNPPPGTSKKVPL 
PH 


4797 


18698 


A 


4830 


125 




KGTRTFETESRSVAQAGVQWHDLGSLQP 
LPPGFRRLTCLSL 


4798 


18699 


A 


4831 


1 


323 


ARGEREREREREREREREREREREPPPL 






SRGGSESYLKSAWRERIiLERDPLFFFG 
GKMGHLFPPPPPPVGRGliTGGGLITGGP 
FWWGECVSHTHHTHTHTHTYMSV 


4799 


18700 


A 


4832 


3 


297 


TRRERERERERERERERTL 


4800 


18701 


A 


4833 


1 


150 


AREREREREREREREREREREREKKKKIir 






HSRGGGPGPQFPLGGGVKKFGGGV 


4801 


18702 


A 


4834 




98 


ARGERERERERERERERERERERERERE 
RERGAGGGPT 


4802 


18703 


A 


4835 


2 


56 


HEERERERERERERERERERAAR 


4803 


18704 


A 


4836 


2 


73 


PEERERERERERERERERERETAR 


4804 


18705 




4837 


146 


2 


FGLVYSISQFCLFFVRRSLALVTQVGVQ 
WRNLGSLQALPPGFTAFSCL 


4805 


1870S 


A 


4838 


354 


0 


PPPPPPSPPPPPPPPPPSSPPPPSPPSP 
SSSPP 


4806 


18707 


A 


4839 


62 


358 


GLVHDKHSPHGGCGSHHVLPSLNPSFPT 
FSTKS PSVPPAKALWPPUSRWAJjQVGHL 
GLTPTTRQESAQAPRRKLALLEGSFQGQ 
AQWLT PVI PALWEAE 


4807 


18708 


A 


4840 


121 


13 


RPGRPPPEFTPFSCLSLPGSWDCRRPPP 
RPANLLYF 


4808 


18709 


A 


4841 


3 


221 


LVMGFVGFSSKPSPIYGGLALIVSGWG 
CVI ILNFGGGYMGLI VFLIYLGGMMVGF 


4809 


18710 


A 


4842 


3 


84 


CHCTPAWTANLDSIKKKKKKRKKFKN 


4810 


18711 


A 


4843 


33 


230 


LTVTI SKAAEVLKHKKKKKKKKKKKKKP 
KKKTPKKNPNPPPQKGELGKKIFFFFWE 
RFLPQKKRP 


4811 


18712 


A 


4844 




368 


HELNPRGRSCGEPRPCHCTPAWATRAKV 
HITKKKKKKKGFNLFKPGGSLSNFFPKK 
KETPWSPQNFEKKGPRRFSPRGKNPPLA 
LRGKKKGGKSLSGKI FPPFPRGKTPPWA 
FMGTQKGEYS 


4812 


18713 


A 


4845 


163 


2 


HHGHSISKFLLTVSSPKSFWQLRPDAVA 
HACNASTLGGRGGW ITRSGVQDQLV 


4813 


18714 


A 


4846 


387 


234 


KGCGEPRSCHCPPAWATIAKLLFKKKKT 
KPKKNPKKQEKTFPYNLNRVCY 


4814 


18715 


A 


4847 


189 


349 


KFSRFSGQKALLLKDILRLGTAAHTCNP 
STLGGQGRWIMRCQVFKTSLAKMVK 


4815 


18716 


A 


4848 


3 


95 


HENFYCPGSALGRNECailFPNPEATFVK 

Eiyi 


4816 


18717 


A 


4849 


2 


325 


DSHTI IMGDFNTPLAILDRSTRQKVNED 
IQDLNSaLHQADLIDIYKILHPKSTEYT 
FFSAPHRTYSKIDHKiySQGKHETDCRy 
GKKGGGWKDERFPDVDCEKFKSQ 
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od 
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nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, F=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threanine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4817 


18718 


A 


4850 


241 


348 


SFFSQTQASEQLTKSKQGLALRTLPPVQ 
FPPPPPRP 


4818 


18719 


A 


4851 


2 


176 


LVETGFCHVGQAGLELLTSGECWDYRCE 
PPSPAYCFLHTIVTIYPLLCLAFFSYLC 
DC 


4819 


18720 




4852 


2 


191 


VGRVGLELLASSDLPTSGSPSAGITGMS 
HCPWPNFLLIFIGYFFTVLTPCEYYSGG 
MTVPLWL 


4820 


18721 


A 


4853 


67 


376 


IKCTTFNIKKKKKKKKKKKKKKKKKKGG 














KHGPGRVLKKGEREKIFFWNFLKKNFFW 
GGGNLGQPPPKKLRALGKK 


4S21 


18722 




4854 


278 


363 


IWPD WAHTCNP STLGGQGRWTMRSGVR 


4822 


18723 




4855 


3 


282 


RGCSEPRURYCTPAWATEQDSASKKKKK 
KKTQ I FWEGPGGGVSPFKPHFFKGPGGK 
KPWGQKFKNPPPQWEKPLFFFKKKKTKF 
SPGGGGPP 


4823 


18724 


A 


4857 


382 


224 


PVIQLIGGVRQEKNFNLGVKGFNEPKSP 
PCPPTWVAKQNSLSKKKKKKIGKK 


4824 


18725 


A 


4858 


392 


102 


FFFFFFIiRQGLSWOaRVQWCFCSNLGL 
LGSSNPPTSASLSCFIiNKKEYRWENVTN 

KV 


4825 


18726 


A 


4859 


3 


111 


FHRISQEGLILLTLGSAHLGLPECWDYR 
REPPRPA 


4826 


18727 


A 


4860 


282 


83 


AQIYJWQNWGSSGNKCrCVCVCVCVCVC 
VCIYIYMLCMSRFKYKCIVGCMNIDSYV 
VYIPNIQCYI 


4827 


18728 


A 


4861 


3 


417 


NHSNLGGRGCSDPRSRHCTPAWVTERDS 
ISKKKKKEOCKKIFFGLREKKLTLFPFLG 
KKNENSGFPLNPNFFFGGKGKKATLGPL 
GLVLRGVPFFRGGBTQNGENQKPGTTLE 
KFEKHFWVGPPPLEKISPHKGKQTQK 


4828 


18729 


A 


4862 


1 


88 


MEEMYKKRHAAILENPVYEKKPKKKKKK 
K 


4829 


18730 


A 


4863 


274 


369 


VSLCCAGWRAAVESQITAALISRAQAIL 
PPQP 


4830 


18731 


A 


4864 


37 


444 


DPRVRSEKKIPFKIIjLLIDNAPSHPKAL 
MKI YEEINVI FMPANTTS ILQPMDQGVI 
STFKSYYLRNTFHKALAflMDSDVSDGSG 
QSKLKTFWKGFT I LDAI KNI RDSWEEVK 
LSTLTGVWKKLIPTLIDDYEGFKT 


4S31 


18732 


A 


4865 


341 


210 


VYMCHVGLLYHQPVIYIRKRIEASEITP 
HIYNYLIFDKPSHQF 


4832 


18733 


A 


4866 


206 


412 


QLCLSQACTLAEGNAKGFLKYIHRNNVS 
MPSVAGHTWGPDQLVKGQGETCPATIHF 
SDSPSLKYTMYPM 


4833 


18734 


A 


4867 


248 


78 


SHFEAGLAAQLTVSWFPQGTPVFVHaGP 
FANIAHGNS S I lADRI ALKLVGPEGFVG 


4834 


18735 


A 


4868 


637 


2 


EFDGRWPRQIVSSIGLCRYGGRIDCCWG 
WAHQSWGQCQPVCQPRCKHGE C I GPNKC 
KCHPGYAGKTCNQDEHIPAPLDQGSEQP 
LFQPLDHQATSLPSRDLNECGLKPRPCK 
HRCMNTYGSYKCYCLNGYMLMPDGS CSS 
ALTCSMANCQYGCDWKGQIRCQCPSPG 
LQLAPDGRTCVDVDECATGRASCPRFRQ 
CVNTFGSYICKCHKG 


4835 j 18736 


A 


4869 


276 


426 


VSFFLFFFFWGAKPPFVPQAGGQGGFLG 
SUIPLPPGLKHFFCLTPPSSGN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

ImiloTcid 
sequence 


Predict- 
ed end 

otide 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=CysteinB, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I>=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, y=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4836 


18737 


A 


4870 


100 


485 


EERJMREMTERCSVTQAGVQWHDLGSLQP 
SPPGPKRPSCLSLPSSWDYRLECDGiyiNL 
AHGKLRLPGSSESPASASRVAGITGMCH 
HAPLARLVSNTLPQVILPPLGLQQAEEL 
PRRQEVEHNTPSLVPI 


4837 


18738 


A 


4871 


718 


408 


EVLTLQIAHFPGFLGAHWWNQQDAALGR 
ATDSKEPPEELCPDVLYRTARTLHGQET 
YTPRLIIWDLHYDVSLGALVLTAAGYRA 
SSCSQSVAVCTGSVLEARA 


4838 


18739 


A 


4872 


392 


246 


RGGCSELRSCHCTPAWATRVKIjHLKKKE 
KRKCKIRPEGNE ILVNNAKG 


4839 


18740 


A 


4873 


1 


156 


GGGGYSEPRSC3ISTPAWTTRAKLRLLKK 
KKKKKGGVCFGVGFLGPRPKGGFF 


4840 


1S741 


A 


4874 


246 


2 


AHGISIYASIMIAMSIFSRENVSFRRAGQ 
VKAVFPVSWFGPLHEHHLGEHFLFFLR 
QSFTLVAQAGVQWCHLGSPQPLPPA 


4841 


18742 


A 


4875 


72 


2 


CFPAAPDEDSTTNITKKQKWTVE 


4842 


18743 


A 


4876 


375 


221 


GRLRQENRIJfPAGGGCGEPRSHYCTPAW 
VTEQDSISNICIYINIFSWHGQ 


4843 


18744 


A 


4877 


151 


2 


RAPFFFFFFFFVAQAGVQWRVLGSLQAP 
PPGFTPFSCLSLPSSWDYRRP 


4844 


18745 


A 


4878 


401 


70 


PHRREGGCVPPPPPKNFFFSPGGYFFW6 
GGGQNPPPPKGGFFPKTPPGFFFSPPQK 
KKIFFFPPPERGPPPGFFIiRPPPPFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFF 


4845 


18746 


A 


4879 




152 


LPYSTPSTPFRAKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKRGGNFK 
KR 


4846 ■ 


18747 


A 


48S0 


267 


1 


TPPKKKKKKIIPPPKNGPPPHIFKKTPP 
PLFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 




























RGR 


4847 


18748 


A 


4881 


2 


408 


LQDATSPIIEELITFHDHALIIIFLICF 
LVLYALFLTLTTKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKGGGGPKKKNRGG 
PKKLG6EKKIFFFFKGGKKKTRGGVGKK 
KIFLGGGNFGPPPPKKKKPRGKI 


4848 


18749 


A 


4882 


413 


69 


LPPPPPGFKHFWAPPPPGRGVPGPPPPP 
RVNFVFLGKKGVS P FGPGGPRPLFPNPG 
GVGAGDPLDPRGQGFHGPKFMP CP PARG 
TKREPPFPKKKKKEKEEKKEPERKEKKK 
EI 


4849 


18750 


A 


4883 


46 


394 


KQYQVSVKLLFVTQNLGTKKKKKKKKKK 
KKKKKGGALKKKPGGGQKKPGGKKKNFF 
LKRGGKKKPPGIFEKKTLFGGGENWGTP 
PQKKKPPGKKKKILKGEGGKKTLYFGRG 
KNFS 


4850 


18751 


A 


4884 


399 


3 


FFFKKFFSPNEFWFFFPPFPLKIFFFPP 
RLFFFWGGLAQFSPPPKKVFFSKFPRGF 
FFPPPLGKNFFFFPPGYFWPPPGFFLRA 
PPLFFFFFFFFFFFFFFFSVSIRNAIAI 


4851 


18752 


A 


4885 


391 


47 


PHFFWGGVFPPPPPKKKKFFFLPQKGGG 
GGAPKKKKKKKKFLPPNFSGGFFFFS PP 
GGPPRFFFFIFFFFFFFFFFFFFFFFFF 
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sequence 


Predict- 
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correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucinc, iVt=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFLSKMFTFILKYTLKQNE 
FY 


4852 


18753 


A 


4886 


32 


379 


LLPVKKKKEKKGEKKKKKRKKKKKKKKK 
KKQKKKGGGPSIKIIGGGHLSSGGRKNN 
S FSHRGGLKKPLFFLGRTYFLGGGGLWP 
HPPPKKIAGGAKNIILRGWGEQKPLLFP 
PGEK 


4853 


18754 


A 


4887 


178 


33 


IFKRKWFFFRFFFFFFLYYFFFKFFFF 
FFFFFFFFFFFNICHHPKLY 


4854 


18755 


A 


4888 


238 


3 


KPPtJPFKRKKIGPQKKKKKFPPFLKINP 

PKYKTQNINIYKYIKIFFFFFFVRGSDS 

VAEAE VQWHHLGS LKF AS PG FK 


4{i55 






4889 






DGILLLSPRLKCSGTIS.flHCHLCPPGSS 


4856 


1S757 




4890 


6 


219 


lEKAHKALGTVPGSTFVRYHHYHHHNHY 
HHHCHHHHPHQnUSEWEKAQSLEPDKLF 
LNPGFATSQLRGLR 


4857 


18758 


A 


4891 


1 


364 


TSSRSRAQRPMDFGFFFKKKGHREPPGG 
VEKKKKNWGSRKTPPSGVKNFPGFDPPK 
MWNKGGPPTPPVNFFGFKKKRGFHKGLK 
PPPPEIAPtNPQKGGEKKGGPPPPPTPF 
LGGKFQGKR 


4858 


18759 


A 


4892 


387 


81 


GVFSPPRGGPPPGGKTPPFFPRGGPPCG 
PGGRGPIFPPLSHKKFFFS PPPPLLGGG 
GGPKSAPPPKGVFKEFPENFFWGAFIKK 
SFFLKKGGFFGPPRGFF 


4859 


18760 


A 


4893 


213 


14 


KTPPPFFFFFFFFFFFFFFFFFFFFFFF 














FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFMIjEVMFLVNHRGKICRVPFTFFNL 
Sli 


4860 


18761 


A 


4894 


384 


10 


PIiPPFIPPPPPPPPLFGGGVCPYFSPPP 
HLGSPSPPPGGRPSPPPPPPRSTPPPPP 
PPFFPPPPPPVSGAPPPPPHSCLAPPFF 
FFFPPPFFFFFFFFFFFFLPFFFLKKSD 
FIFSLQMKYFMLI 


4861 


18762 


A 


4895 


240 . 


355 


LI ITPALFFKI FVLWLGAVAHACNPS tl 
GGQGGQIMRS 


4862 


18763 


A 


4896 


339 


3 


GVGWHLFFPPQKKFFFFFQVTGASCAAK 
SSKESHKGLHLNIYLPKIFFYRPPPPFF 
FFFFFFFFFFFLFFFFFFFFFFFFFFFF 
FFFLWREGYNQHFRGMGPIGHVGGRV 


4863 


18764 


A 


4897 


56 


335 


KKRGGPLKKNLGGPKKNRGGKKKIFFFK 
GGEKKNLWGFFEKKPFFGGGKICPTPPQ 
KKKPPGEKK 


4864 


18765 


A 


4898 


7 


206 


LCLQPDNRWWDDQSKSFLIEKKKKKKKK 
KKKKKKKKKKKKKGGGPLKKWPGGAPNN 
PGGEKKNSPP 


4865 


18766 


A 


4899 


74 


250 


INICSEKLPKKKKKKKKKKKKKKKKKKK 
KKKGKKIKEGLNSQTSLLKKREKKKKRG 
KI 


4866 


18767 


A 


4900 


151 


312 


KYHFHKNYFFSLLYFSQNVSQLSPDGPL 
PQLPLPYINSSATRVFFGHDRRPADG 


4867 


18768 


A 


4901 


1 


364 


LNLGGRGCSEPRLRYCTPAWATEQDSAS 
QKKKKKKWPNF 


4868 


18769 


A 


4902 




96 


GGGGCGELKSCHCTPAWVAEQDSVSKKK 
KKRG 


4869 


18770 


A 


4903 


3 


225 


GFHRVGQDGFDLPTSWTARLGLPRGWDW 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienyIalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop eodon,7=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RREPPRPAVWAGLQLLTSSDPPASDSQR 
AGITHLSHLTLFCHLRF 


4870 


18771 


A 


4904 


384 


194 


HWVGRPFFNLWGQKNPPPPPPKRLEYQG 
WGPPPPPPFFFFKKTPIFSQVFNPPPFF 
FFFFFFG 


4871 


18772 


A 


4905 


3 


387 


TIITPILLTLFLITQLKILNTNYHLPLP 
AQYPLPQtOCKKKKKKKKKKK 


4872 


18773 


A 


4906 


1 


327 


PTRPIEPIPENPKFSVPPITPHPKKKKK 














KTWGGPKKRGGGKKIFFFLRGGKKKPRG 
EPLKKTSFWGGENLATTPPKKKKARKKK 

KNF 


4873 


18774 


A 


4907 


134 


2 


ALYSLKVKWPGAVAHACNPNTLGGQGEW 
IRRSGVRDQPGQHSET 


4874 


18775 


A 


4908 


59 


449 


atrhytimlaytSvgipitawlgkcjai 
kgarfqvrrttenydkqqlihkgggrpp 
gchsthtvllppvtwvkttgnstvanfv 

LILVCLSFLLLVYRCIQQLQRDSNQREG 
AMMMWVLSKRKGGYAGK 


4875 


18776 


A 


4909 


337 


3 


LKTAWATRGNPPLYKKNTQISGARGEGP 
PIPLIGGGLSQKNFFTPGGENS INPDPP 
PSPPPGAKKETPPPGKKKKKKPTTKELG 
KDWTLELWDSDSQRAGKLAEVPGESSP 


4876 


18777 


A 


4910 


14 


162 


AESGAKRETAFPFAPQQGATDKRLNI lY 
AAKBOHMKrCLPSLAIREMQIK 


4S77 


18778 




4911 


331 


2 


LPLLAPKEGQGKILGYKKPPPPGLPPFF 
GSTPPRNGEKGGPPPPRGNFWVFKKHGG 
FPRLGGRGPFPGPQKPPPRPPQKGGKQG 
GNPAAGPFFFFFETEFHSCCPGWSA 


4878 


18779 


A 


4912 


116 


316 


ACaHAHTLFFFFFLKKKPPFVPLLGGKG 
GNLGYLKLWLPGLKKVCLTPPRTGNYGG 
APPHRIIFCFL 


4879 


18780 


A 


4913 


93 


16 


SPPGWPGTVHHACHPSTLGGRGGRIT 


4880 


18781 


A 


4914 


316 


2 


KPRPGNPLRARVFNPPGPPGETPFFPKN 
PKFTRGGRGGPIYNPFPGRVRPKNFLYP 
RGSRFHWPNS IP CPFAWGTKPNFFSKKK 
KECKRKKKEKLCFSASVPDAQ 


4881 


18782 


A 


4915 


3 


289 


TSCNPSTLGGRGGRITRVGVRGQPDQHG 
ETPSLLKNYIYIYICVDVYLYVCMYVGA 
YIYICICGYIYIWRYLSLTSEHTHTRAP 
GQYRVIYFPCG 


4882 


18783 


A 


4916 


1 


222 


ARGERERERERERERERESRHQGGGAGP 
PTHLWGEEGWLSSPQTGVREKNAPIWG 
CflAPVLAPRVGRTPFGVD 


4883 


18784 


A 


4918 


1 


281 


ARGEREREREREREREHEREREREREHE 
RERERERERGAPPPPPRGGVREHGTPCP 
S ARGARALS PTI S CGDPHRGPKI YRGVG 
VHTQRYFSVGSSLCINTHT 


4884 


18785 


A 


4919 


1 


259 


ARGERERERERERERERERERVFVEKGD 
PWTLPAGIYHRFGGGEKKYTKAVRVCVR 
KPGWRAHNRPADHFGARGPHVKFLAQTV 
YE 


4885 


18786 


A 


4920 


797 


910 


NTMQPGWTHTCNPSTLGGQGGRITGSG 
VQDQPGQHGE 


4886 


18787 


A 


4921 


2 


67 


LEERERERERERERERERERERSS 


4887 


18788 


A 


4922 




698 


TLLLAELGTICDPYRSCSISEDSGLSTA 
FTIAHELGHVFNMPHDDNNKKKKRGGPF 
KGPNFSPPVRASKNFFFGPPNLNSWARF 
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ding to 
last amino 
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residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienylaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=iVIetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unl{nown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














LTPGEGKTPGVPPFNPFATKAEL 


4888 


18789 


A 


4923 


323 




LVCRGT IHKFRCVPHLTGRRFEHGVTDC 
YTLFRDAYHLAGI EMPDFHREDDWWENG 
QNLYLDNIEATGLYQVPLSAAQPGDVLL 
CCFGSSVPNHAAIYCGDGELLHH 


4889 


18790 , 


A 


4924 


3 


225 


HEEALHLFQTLMECMKRKELITVFHIGS 
DEHQDIDVAILTALLKGKPLRTFLFVSP 
FILCMYMCVYVCICVCI 


4890 


18791 


A 


4925 


371 


169 


HTQSISWLVERRFHHSGQENPIiNPGGR 
GCGELRLCHCTPAWATRAKSRLKKTKQT 
KKDAIKCSLCai 


4891 


18792 


A 


4926 




352 


ARaaRAGRIIKELPPFFFFCaTLKKKKGL 
FWWPGGVKKPDLGNPPPWPPKGGGITGL 
TPRPGPMGVIFKKFGKKGSPSPTGPKSL 
GPREFFGLALQRGGDLGLNHRGPPFFFG 
VLETM 


4892 


18793 


A 


4927 


2 


379 


ARANMEETQQKSNLELVRISLLIiIKTGL 
BPVRILRSMFTNKLVYDTSDSDDYHLLK 
DLQEGIQTLMGRLENGRRRTGQILTQTY 
SKFDTNSDNHDALLKNYRLLYCFTKDMD 
KVETFLRMEQCRSV 


4893 


18794 


A 


4928 


175 


31 


ILGDLFPPAWLHPFLFIiLLPFSRPSLAV 
TEVAVQWRHLGLKQPLPPGF 


4894 


18795 


A 


4929 


368 


247 


VDRLFPCCPGWSPSLEIiNQSACLSFLKC 
WDYRCEPLCSVS 


4895 


18796 


A 


4930 


2 


no 


ARGEPRSHHCTPAWAMSETVSKKKKSGG 
LFFLPRLV 


4896 


18797 


A 


4931 


150 


1 


KWAPCRPLFLVLSPCRESCFPFPFCHD 
GKFPES FSEATIMFFIiQPJ^PHa 


4897 


18798 


A 


4932 




155 


ARDDLNPGGGCSELRSRHCTPAWATERD 
S I SKKKKNPQNIWGNLKKKWNLK 


4898 


18799 


A 


4933 


144 


249 


ETLYPQPPQSLNASCVCVCVCVCVCVCV 
CLCVCV 


4899 


18800 


A 


4934 




153 


ARES FVEKGFRHVGQVGLKLLASGNPPN 
LATQSAGIISISHCTQPWRQGL 


4900 


18801 


A 


4935 


1 


355 


GTS QEF EGRKDRLAS PLETGNAGAGRER 
LAEVS TCPSGAS KPLQTPRPGGQEGTPG 

lgghrmrhsagpspnpawesxrdpslts 

TALLVACS I FHIHBDP ILHDVL I SRCMS 
IKTSHR 


4901 


18802 


A 


4936 


112 


358 


niikfskafffsfyifilinsvglastf 
mwggelkfsftsgiliifqpfkikghpq 
rgi lrtgtfkedpddpegnkvs sllq 


4902 


18803 




4937 




258 


GTS pyvtqqffsgwgrqgltls PKLEHS 
elrlchctpswvterdsiskkkkkklfy 
lagptffpfgggnfnlalgrllkrggge 

KK 


4903 


18804 


A 


4938 


380 


1 


fffffs etepysvaqagvqwc 


4904 


18805 


A 


4940 


255 




lgnniagykirglysislrnnrhpspvi 

S L CGNL FYVIRFFDMESCS VTQAGVLWC 

nigssqaltpgiapfsfislpstrqnsr 
p 


4905 


18806 


A 


4941 


10 


173 


KYIEYDTNKWKHNSCSQIGRrapVKMSL 
PPKaiYTFSVFPIKIPSVYLPVHLSI 


4906 


18807 


A 


4942 


144 


3 


dkcirrpgtvthtcnsstlgdqggrvmr 
sgvrdqqgqhsetpsvrv 


4907 


18808 


A 


4944 


484 


274 


PSSQES PRTflRMIALSTHPTTRSMKFCTI 
VAQAGLKLLGSSDLPASASQNVGISGVS 
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E=Glutamic Acid, F=Phenylalanine, 
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Q=Glutamine, R=Arginlne, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=posslble 
nucleotide deletion, \=possible 
nucleotide insertion 














HHTWPRFYYLS 


4908 


18809 


A 


4945 


103 


412 


KKKKKKKKKKKKKKKKKKKKGGGPLKKN 
PGGAKKKRGRKKKIFFFKRGEKKNPGGI 
FEKKPFFGGGKNWAPPPKKNKPFGEKKK 


4909 


,88,0 


A 


4946 


309 


81 


KFFFFLKGLIFLGGFCPIFPPPKKIFFF 
KIP PGVFFS PP FKKKNF FFPPPLIFGPP 
EVFFKGPPPFFFFFFFFFF 


4910 


18811 


A 


4947 


18 


264 


NYTQRFKVEYLNNIINQIiDLTDIYRILy 
PTVEYTFFKNPHGTLCRTDHMLGHNTGL 
NKPKRTECTQSMSSDHNGIKYHSAP 


4911 


18812 


A 


4948 


418 


293 


QLLRRVRQENCFNPGGGGCSEPRSPPCP 
PTWVTKQVFFSKP 


4912 




A 


4949 


3 


252 


DSAKHLGLKHWYCGLENVKRLTDGKLE 
VPHFDSKGEVEEYFWSIGIPMTSVRVAA . 
YFENFIiaAWRPVKASDGDYYTLGKTK 


"4913 




A 


4950 


434 


270 


KRGFPPKTPPGFKKPPPKKKKKIFPPPP 
KIGPPQKILKKPPPPFFFFFFFFFFF 


4914 


18815 


A 


4951 


399 


1 


VGFSLFPPPQKKGFSPKPPRGFLFPPPK 
GKKKIFPPPGKIGPPQGFFKRPPPLFFF 
FFFFFFFFFFFFFFGPPSVTLLFLVTIL 
LQIQTTALGRKSWGKGFRIPKPVCPVPA 
QQTKQGAKHLLGQWEINSCa 


4915 


18816 


A 


4952 


235 


56 


FTWPPPKKKKKNPPGGGGPVPPPWGGK 
NKKIFLPPKFKGPFSFGRPLPFPPGQKK 
KPL 


4916 


18817 


A 


4953 


13 


194 


SILISLQISLIITFTATELIIFYIFFET 
TLIPTLAIITKKKKKKKKKKKKKKKKKG 
GGAF 


4917 


-188T8 


A 


4954 


177 


1 


PPQKKKKNIFSPPGKIGPPQGFFKRGPP 
LFFFFFFFFFFFFFFFFRFEGECSDLDG 

RV 


4918 




A 


4955 


450 


0 


LPNKftDKKKKKKKKKKKKKKKKKKAR 


4919 


18820 


A 


4956 


29 


158 


AMIVPLYSSLGDFFKKKEKKKKKKKKKK 
KKKKKKKKKKGGGGF 


4920 


18821 


A 


4957 


397 


0 


PSSPPPPSPPPSPPPPPPPPPPPPPPPS 
SPPPAPPPPPPPSPPP 


4921 


18822 


A 


4958 


257 


397 


FFFLSFTFVAQAGXQGXXYGSLDPPPPG 
FNLFSCPSLPRIWNYTAPP 


4922 18823 


A 


4959 


419 


266 


RFHHDGQDGLELLTSSDLPASASQSAGI 
IGVSHRAQS IKSLAPQYWYFPSF 


4923 18824 


A 


4960 


300 


190 


FFFFFFFFFIVAFFFFFLFCFLFFFFFF 
FFXIFHNIF 


4924 j 18825 

I 




496, 


3 


179 


DAWVLPFIIAAIiRALHLLFLHETGSNNP 
LGITSHSDKITFHPYYTIKDALGLLLFL 
LS 


4925 


18826 


A 


4962 


2 


182 


RVNAKDSKWLTPLHRAVASCSEEAVQVL 
LKHSADVNARDKNWQTPLHIAAAMKAVK 
CAKL 


4926 


18827 


A 


4963 


273 


440 


KIHIQHCWWEYKLLQSLWKAAWHFLKEL 
KAELPFNPAIPLLDIYPEEYKSVYLKDT 


4927 


18828 


A 


4964 


179 


3 


SLPFCCNLTPKKKLFFKFCRKNGFFFFK 
MESHSVTQAGVQMCNLNSLQPCPHGLKQ 
FS 


4928 


1 SS29 


A 


4965 


268 


364 


TVCVIELLTSGDRPASASQSAEITGLSH 
HAQP 


4929 


18830 


A 


4966 


374 


2 


QNFFLKKKKKKRVGPGVPPFNPPPFWGQ 
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sequence 
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location 
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last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, F=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W^Tryptophan, Y^Tyrosine, 
X=Unknon'n, *=Stop codon,/=possible 

nucleotide insertion 














GGGFPRSILGNP IKPLFFLNSPKISPPL 
WPGPFFPLPWGVWAGKNPLPRGGGFPLT 
KFPPLPSSLGKKKKFRPKKKKKKKGQAR 
WLTPAIPTPREA 


4930 


18831 


A 


4967 


79 


346 


DKIFSNISCGIVIDLLAIYMWDAYATHL 
GIYKHWDFIIIDKDTSSCRLSFSSYPHF 
LESLDDFYILSSGLILLQTTNSVFWKTL 
LKQVI 


4931 


18832 


A 


4968 




93 


FLHVGQAGLGLPTSGDPPASASQSVGLQ 
A 


4932 


18833 


A 


4969 


158 




SKSNFAYIFPIKDNGLTGKKTINRWTK 
NWANDLNRHFSKVNIQMAKRYMKK 


4933 


18834 


A 


4970 


2 


266 


VGQAGVELLSSSDPPASAYEGAGIPGVS 
HHTWPKHFFPALFVACISSLVNCLFQLF 
ACIS IGLSFFFFFFREGDLGNLLKIAGL 
TFLG 


4934 


18835 


A 


4971 


3 


96 


GVRHHAHLIFVFIiVETGFHHVGQDGLNF 


4935 


18836 


A 


4972 


"253 


348 


NFIFLLFIYFEMESHSVAKAGLQWHCLS 
SLQP 


4936 


18837 


A 


4973 


326 




PMEIPQHKLSLWPGFAISVSYFERKLLF 
SADVSYKVLRNETVLEFMTALCQRTGLS 
CFTQTCEKQLIGLIVLTRYNNRTYSIDD 
IDWSVKPTHTFLKRUGTEITYVDY 


4937 


18838 


A 


4974 


3 


244 


DASLVFKVAETANEEEVKKMCMYKYPGN 
KCGREGGREKKREEERREGRKEGEEGLR 
EGETDEEREEGEEGLS YS PFKNS I 


4938 


18839 


A 


4975 


6 


79 


THFSLTITSLQPEDIGTYYCQQYD 


4939 












PGEAGNCLNPGGGSCSEPRSRHCTPAWA 
TERNSVSKKKKKKKKKKKI PRGRGLPPV 
SHPFWKAGGADWFDLGTLEPPWPTGETP 


4940 


18841 


A 


4977 


204 


275 


I lEVDPDTKEMLKLLDFGSLSNLQ 














RRP S PHGL VGAVS VGG2iGVMAVETLS PD 
WEFDRVDDGSQKIHAEVQLKMYGKFLEE 
YTR QLRR I EDALDDSIGDAWDSNLEPMA 
WRRLPYEQSSVLELIKTENKVLNRWTG 

YAGL 


4942 


18843 




4979 


219 


3 


AASTMAMSFEWPWQYRFPPFFTLQPNVD 
TRQKQLAAWCSLVLSFSRLHKQSSMTLM 


4943 


18844 




4980 


142 


3 


FQGFFFLRDKVLLCCPGWSRTPELKHSS 
CLGLPKCWDYRRELPRLA 


4944 


18845 


A 


4981 


351 


2 


FFFFFSETESRSVAQAGVQWRDLD 


4945 


18846 


A 


4982 


133 


3 


SFFFFLFETECCSVPQAGVQWCDLHPLQ 
PLPPGFRRFSCLSLP 


4946 


18847 




4983 


3 


161 


LGSLHDTANTLWPRLERSRT IMAHYSLD 
LPGSSDPPTSASHVRGTTGMRSTRP 


4947 


18848 


A 


4984 


363 


3 


GPSNKKRGRSKKAHVLAASVEQATENFL 
EKGDKIAKESQFLKEELVAAVEDVRKQG 
DLMKAAAGEFADDPCSSVKRGNMVRAAR 
ALLSAVTRLLILADMADVYKLLVQLKW 
EDGILKL 


4948 


18849 




4985 


101 


1 


LQVCYGRRWMNSREYGAWKQQVESKNM 
PFQDA 


4949 


18850 


A 


4986 


341 


1 


PPPMSPPDVIiKTSPVADAAGWVDVDKET 
LQHQRYPNVFGIGDCTNLHTSKTAAAVa 
AQSGILDRPISVIMKNQTPSKKTDGYTS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutanilc Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysinc, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CPLVTGYHRVILAEFDYKAEPLETFPFD 
Q 


4950 


18851 


A 


4987 


86 


1 


KCSGTISAHCNVELLSSSNSGPSASRAI 


4951 


18852 


A 


4988 


124 


3 


LPWLCSKFETKTFQFQIYYEQUCGVKTVW 
YSHKDRNIDQWN 


4952 


18853 


A 


4989 


305 


150 


WLLNYRYSHSAEVQVGFQFLTWVFSTLG 
SPDHFSKFFNQYFFKICFDYWKN 


4953 


18854 


A 


4990 


64 


340 


KKKKKKNLLKFPGQREKPLSPNFLGGLG 
C3GKHGPPGGKPSRDPDWPPGI KPGEKKK 
NPLKKKKKKKKKPLSKGQGGKKTPLKKK 
KKPPPKGP 


4954 


18855 


A 


4991 




158 


PSLVQTRLRHAGQAGLKLLTSTDSPASa 
SQSAGTTGVSHHftMPKDCSLNSN 


4955 


18856 


A 


4992 


116 


66 


FFFS'FFFffFFFFt'FFXEl 


4956 


18857 


A 


4993 


3 


107 


GQAQWLTPVIAALWEAKVGGSFEVRSLR 
PAWPTQ 


4957 


18858 


A 


4994 


1 


105 


KLDRLARHGLYEKKKTSRKQRKERKNRM 
KKVRGTW 


4958 


18859 


A 


4995 


110 


3 


RRIGRYKTVFLCTQLEILMflEYPLPPSP 
KPIKIKN 


4959 


18860 


A 


4996 


109 


3 


DEVSILCPRLEOfGTlIAHCSrRRLPGSS 
DSPALAP 


4960 


1 8861 


A 


4997 


1 


148 


ACCPFCTIYLLPMFMIIKAPLMGTSKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
K 


4961 


18862 


A 


4998 


3 


110 


VILQGSNDVELVAEGNSRFTYTVLVDGC 
SKKTSTRP 


4962 


18863 


A 


4999 


3 


330 


PIADRGAEYVSAREWMTICFELLQLLKA 
HKKAI RRATVNTFGY lAQAIGPHDVLAT 
LLNNLKGHERQNRVCTTVAIAIGAETCS 
PFTALPALMWEYRVPELNVQNGVLK 


4963 


18864 


A 


5000 


3 


110 


VILQGSNDVELVAEGNSRFTYTVLVDGC 
SKKTSTRP 


4964 


18865 


A 


5001 


25 


329 


NSRRRRNDQGSPNLCQTFTFMTLPYLPE 
HRSLLLKIRSCAERETKKKDDIPEEDKG 
NIKQCEINYVKKFQSFQDHKLKISKEES 
KILKKAQKDGFLHETLLN 


4965 


18866 


A 


5002 


278 


345 


EDEEGYNDGEVDDKEDEKELGG 


4966 


18867 


A 


5003 


101 


3 


VRI I I SGTGKKKKKKGKLPKNYDPKVTP 
DPER 


4967 




A 


5004 


18 


350 


VSHECLI FHIQNVLTGLVISLSCPSVPS 
HTHKHTHIHTHTHTHTHTHTFS 


4968 




A 


5005 


26 


147 


KEEVRMALFANDMIVYLENPFVSAp-fVL 
KLISCFCKGRLG 


4969 


18870 


A 


5006 


85 


1 


SASRVAGI IGKHHHTRLIFVFLVETGQS 


4970 


18871 


A 


5007 


370 


3 


EAHKPLLWVLKKI WS PWI YKKKTRINFQ 
KEKKSPFFSRPFFLFKKGGFLPPGVFFS 
RGFLKKIPPFFFFFFETESRSVTQFGVW 
SVQWCNLGLLQPEPPGSKQLLCRGLPSS 
WDYRCLSTRP 


4971 


18872 


A 


5008 




85 


IRILSKIKNALTHFLPQGTPTPLIPILF 


4972 i 18873 


A 


5009 


218 


93 


FFFFFFFFYFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


4973 1 18874 


A 


5010 


3 


250 


RPRREELRQENHIiNLGGEGCSESRSRHC 
TLTWATEQDSVSKKKKKRVFPPPHPRGK 
FFWAGGPTFFPPLKTVPFPGGGKPNF 


4974 1 18875 


A 


5011 


336 


0 


SSSSPPSSSSSPPPPPPPPPPPPPPP 
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nucleotide 
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Predict- 
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last amino 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsolcueine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4975 


18876 


A 


5012 


2 


80 


SNQNGTGGE S I YGE KFEDENFHYKVK 


4976 


18877 


A 


5013 


53 


324 


FLVFFVPENSEKISLQLHLALTSNSSWV 
QSPSHLELMNLGRHINIRVDPTGLREGM 
HYTEVaSYDIASENARSLFRGPITAVIA 
AQTOES 


4977 


18878 


A 


5014 


329 




FFFFFSETESRSVAQAGVQWRYLGSLQA 
PPPGFM 


4978 


18879 


A 


5015 


193 


32 


PLLILCSERLCHCTPAWAIERDSVSKTN 
KTKQQQQQQQNQGTKCMFIiALLGQI 


4979 


18880 


A 


5016 


354 


235 


CGEIGMLIiHCWWECKIiVRPLWTNLFLRD 
GSTLTVGSKSSF 


4980 


18881 


A 


5017 


348 


2 


RLLVGKLMELHGEGSSSGKATGDE 


4981 


18882 


A 


5018 


76 




IFFLSQVSPLSKEDAGEYECHASNS 


4982 


18883 


A 


5019 


68 


356 


YFGGVGGFFFFFFFFFWFGPSGGVFFVG 
VQAIFFFFFFFLGGPFFLVRDYFFVAVF 
WWGLLFFFVFFFWFCFVFFFFFGWKRK 
NIFYGWVFLFCW 


4983 


18884 


A 


5020 


1 


288 


FFLNLEKiMIKMLSSYTDNGIVVATAEDF 
MQNFKNLVGYHNS I TEENLPQLGANENL 
ESQSGNFSWFIFFNADRKRGMVLLLPN 
NEMTILKPRTSV 


4984 


18885 


A 


5021 


133 


1 


WATETMWSTKPKMWPGTGAHTCNPSTT, 
GGQGGQITSPQKFKTS 


4985 


18886 


A 


5022 


2 


337 


RRSDPNFKNRLRERRKKQKLAKERAGLS 
KLPDLKDAEAAQKFFLEKIQLGEKLLAQ 
GEYEKGVDHLTNAIAVCGQPQQILHVLQ 
QTLPPPVFQMLLTKLPTISQRIVSAHSL 


4986 


18887 


A 


5023 




325 


VDGCPANLLSSHRSLVLRAETISLGEHP 
CDRGEQVTLFLFNDCLEIARKRRKVIGT 
FRSPHGQTRPPASLKHIHLMPLSQIKKA 
LDI lETEDCHNAFALLVRPPTEQA 


4987 


18888 


A 


5024 


135 


2 


ATMFLNSKVSKYSGYLLGFHECREKGWM 
TWDGERDPSPGILQLQ 


4988 


18889 


A 


5025 


2 


94 


KTATKLIGGXHYDSXNIKAIRDGLLARR 
HAL 


4989 


18890 


A 


5026 


2 


362 


QELERSMAQRCVCVLRLVAMLLLVFPTV 
SRSMGPRSGEHQRASRIPSQFSKEERVA 
MKEALKVFPTWSTSFIQHEWEEYSHL 
FTIQGSDPSLQPYLLMAHFDWPAPEEG 
WEVPPFSG 


4990 


18891 


A 


5027 


222 


84 


AASTFCHVSQAGLELLTSDDPPASASQS 
AGITGMSHRTQPMVHLY 


4991 


18892 


A ■ 


5028 


366 


0 


SGRSKKKKBCKKKKKKKKKKKKKKKKKKK 
KKKKKKRGGR 


4992 


18893 


A 


5029 


231 


68 


AGMGSRALPKPLLMSHSSLKAVELPDSF 
SPELRSLLEGLLQRDVNRRLGCLGRG 


4993 


18894 


A 


5030 


2 


366 


LNLGGRGCSELRSCHCTPAWVTKTPFKK 
KKKVPHGGYFVAQLLPPRKMRTHRRPQA 
QGERTGPGVJGAPVHTGLLPRRGQAFTQP 
PSAFKGFFHPTNKSSSEAKKKKKKPRVG 

RKKm^GYIL 


4994 


18895 


A 


5031 


241 


368 


QVEMfFKSQSGAEAHICDLSTLGGCGGQ 
ITRSGVHDQPCQHG 


4995 


18896 


A 


5032 


279 


396 


DGGMWPGTVAHACNPGTLGGQGRWITRS 
GVRDQPGQHGE 


4996 


18897 


A 


5033 


173 


398 


SNESSIiRVlIPFFFFFFLKTNFSFCPPGG 
RQGPNFSLIiDPPPPKVKEIFCLTPPKRW 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 
location 
correspond 
ing to first 

residue of 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EYRPAPPPPRNFGFLIKTG 


4997 


18898 


A 


5034 


- 


143 


SNPPALTSQSMGITSMTRHALPIACFLV 
FVFLFFEMESHSLAQAEVQ 


4998 

"4999^" 


18899 
18900 


A 


5035 


350 


2 

"100 


QEISSVQTSTQLFNGMTVKARATTREVM 
ATYTIEDIVIELI IQLPSNYPLGS I IVE 
SGKRVGVAVRLFIFITQKSFIFLFSFLT 

LCLCLQHFHNDFLLLIVPILIAMaFLML 

T5RK 


HSCGSSSCRTLSVQEAAAYLirv'SKEIRI 
LIAIF 


5000 


18901 


A 


5037 


247 


334 


LQIDISAWAYTIAVKEDGELNLMKKAA 
S 


5001 


18902 


A 


5038 


2 


296 


DKAPMLKVIVNSLKNMINTFVPSGKVMQ 
WDEKLPGLLGNFPGPFEEEMKGIAAVT 
DIPLGKVHLEALKKKVIKFFYKFPLRCD 
IHTAKCTGLRYTAQ 


5002 


18903 


A 


5039 


152 


335 


RPIWSYYLLFFFLFWGLYLDWKRIFTF 
ILSEKYFDMKKNQCKEGIiDIYKKFLTRM 
TRISE 


5003 


18904 


A 


5040 


336 


1 


GGLTISSLLKEKEGSEVAKFTLEQLCLI 
CNIMSTAEYCVAATQQLEEKLKEBCVDVS 
LFDRINLSGDMGTFS PVISSS IHLLVQD 
VDAACDPAMVAMSKMQCQNVQHVGVKSS 


5004 


18905 


A 


5041 


3 


204 


LNLGGGGCSEPRSCHCTPSWVTARmLK 
KKKKKKRGKNPLKNGGKENFKFLQILVN 
PKNSLENIAV 


5005 


18906 


A 


5042 


382 


148 


WCNHRGPRSRKKRSEGSTKSRRLGATIR 
MVTPHTTRTCaRDPELTSKEKCVYIEEH 
THTHTHIYIHTHTHTCIYVKTH 


5006 


18907 


A 


5043 


137 


1 


RPRRRYMKKFSTSLI IKKMKI KI TMKYH 
LSHLIPVKMAITKKTKD 


5007 


18908 


A 


5044 


198 


34 


KPASRFCHVGQADLELLTSSDLPASASQ 
SAGITGVSHCSQPNFITLCLWTDHFF 


5008 


18909 


A 


5045 


11 


357 


LLTYFIIMFKILEIYEKEEQLIIKQISE 
GQEKVKELRQFKEHRKAKDSALQSIESK 
MLELTNRLRESQEEIQIMRKKKRKNMR 
GPYWKFKYSQRWKSPPGPFQWGWGPRKN 

FFV 


5009 


18910 


A 


5046 


2 


193 


GLQLLTSSDPPASASQSAGIAGMSHCTQ 
PQVHLMPSLYHFRFLQVDTKDLLRASAD 
LIHRGIT 


5010 


18911 


A 


5047 




196 


KLCLVMNSPMKAAPRNFSCAPSLSLLPF 
SFRKTTTTLTSNTADDFAYFDLCKMEPC 

SMVLFFVFC 


5011 


18912 


A 


5048 


386 


276 


AQAVLELLDSSDLPPSASQSAGISGVSH 

CTQPDSIF 


5012 


18913 


A 


5049 


1 


324 


VDfiAAEKLEASTGWLMRFKERS CLHNI K 
MHGEATVADTEAAAGYPEDLAKITDKGV 
YTQQQ ILNGDE lAFCWQKI PCRI FIiftRE 
QAVPGCNASKARLTVLLWftNAAGD 


5013 


18914 


A 


5050 


1 


129 


PRS CHCFPAWVTEQDSVSQKKKKKKKICE 
KEKKKNPVEKSAKVI 


5014 


18915 


A 


5051 


193 


356 


RSFIPSASASASTLCLKCSQNGQPGAVA 
HACNPSTLGGQSRQLTRSGVQDQSGQY 


5015 


18916 


A 


5052 


192 


341 


AKRVKRKHFFFFFLKQFCSVPQAGGRGP 
DPGSLKPLPPGLKGFSCPTPLN 


5016 




A 


5053 


340 


1 


KRIPDKPQKELRRLATKLIREAPEKGQA 
QGKE IHKS I QEAKGE I FKAIDRI KKSQF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
nucleotide 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaianine, 
G=Glycine, H=Histidinc, I=rsoIeucine, 
K=Lysine, I^Leucine, lVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














DRQKALDILLVMQNALESLSNRIEQVEE 
RNSELEDKVFDLTRSSKAKGKRIRNYDH 

S 


5017 


18918 




5054 


2 


383 


WKVATQPADWPLDVLSRKLHLGPIWGR 
DVPRLSLPGKLVFPSSTGSHFFMLGIGD 
IVMPGLLLCFVLRYDNYKKQASGDSCGA 
PGPAlIISGRMQtvVSYFHCTLIGYFVGLL 
TATVASRIHR?iAQPC 


5018 


1 891 9 


A 


5055 


2 


383 


GSVLSKKGDYLKYKYNASLLDGTLLDST 
WNLGKTYNIVLGSGQWLGMDMGLKEMC 
VGEKRTVIIPPHLGYGEJV3VDGEVPGSA 
VLMFDIELLELVAGLPEGYMFIWNGEVS 
PNLFEEIDKDGNGEV 


5019 


18920 


A 


5056 


2 


383 


AVIDEVRTGTYRQLFHPEQLITGKEDAA 
NNYARGHYTIGKENIDLVLDRIRKLftDQ 
CTGLQGFLVFHSFGGGTGSGFTSLLMER 
LSVDYGKKSKLEFS I YPAPQVFTAWE P 
YHFILTTHTTLEHSD 


5020 


18921 


A 


5057 


26 


452 


ICLMSLRLRQAWHEAAIDEFRTGTYRQ1.F 
HPEQLITGQEDaANNYARGHYTIGKENI 
DLGVDRIRKLADQSTGLQGFLVFHSFGG 
GTGSGFTSLLMERLSVDYGKKSKLEFFI 
YPAPHVFTAWEPYHFILTTHTTLEHSD 
CA 


5021 


18922 


A 


5058 


2 


385 


AVIDEVRTGTYRQLFHPEQLITGKEDAA 
NNYARGHYTIGKEI IDLWLDRIPKLftDQ 
CTGLQGFLVFHSFGGGTGSGFTSLLMER 
LSVDYGKKSKLEFYIYPEPHVCTAWEP 
YNFILTTHTNLEHSDC 


5022 


18923 


A 


5059 


3 


390 


GDAANNYARGHYTIGKEIlbLVLbRIRK 
LADQCTGLQGFLVFHSPGGGTGSGFTSL 
LMERLSVDYGKKSKLEFSIYPAPQVSTA 
WEPYllS ILTTHTTLEHSDCAFMVDNEA 
lYDICRRNLDIERPTYT 


5023, 


18924 


A 


5060 


103 


3 


KIFFFLRWSFALITQAGVQWRGLGSLQP 
LPRAT 


5024 


18925 


A 


5061 


262 


412 


KAiSpFRTGTAFGNGKTSDYLLLGNFGYT 
FGGITGCLKAGLETSYWTWFTH 


5025 


18925 


A 


5062 


293 


406 


VUGS IFEVIWAVIKPGTSPGISVLRAL 
RLLRIFKVTK 


5026 


18927 


A 


5063 


419 


15 


WEEEGPLPKKKKGGFSNKEGNIIMGPFL 
KRPPEKKPPPPPRKINPFFFFFKRKGPP 

PPRVKGKGAIKGSRKPPLFGSRDFFSPA 
PQKSGAPGGPPPPPVNKNQFFFFFFLVE 
TGFHHVTQAGPELLSSSSPPTI 


5027 


18928 


A 


5064 


2 


216 


GLTNLFIFCRDRILLCCPGWSQTPSLKQ 
SSHLSLPKPWDYMYEPLCLYFLYPWFCL 


5028 


18929 


A 


5065 


259 


36 


SQPHWDYRASRAEEIATFLEVWLQAIFV 
CCLFLRQSPFIAQAGVQWHYLKSLQPPP 
PGLKQFSCFSLLLVPRLA 


5029 


18930 


A 


5066 


2 


404 


GKGAPTTSLISVAVTKI lAKVLEDNKLP 
GAICSLTCGGANIGTAMAKDERVWLLSF 
TGSTQVGKQVGLMVQERFGRSLLELGGN 
NAI lAFEDADLSL IVPSALFAAVGTAGQ 
RCTTARRLFIHES IHDEWNRL 


5030 


18931 


A 


5067 




400 


GENMITGTSQADCAVLNVAAGVGEFEAG 
ISKNGRTREHALLAYTLGVKQLIDGVNK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspai tic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IW^iViethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 














MDSTEPPYSQKRYEEIVKEVSTYIKKIG 
YDPDTIAFVPISGWNGDNMLEPRAKIPL 
VTGWKATP I D CDAS GTTLLDA 


5031 


18932 


A 


5068 


41 


392 


GS PHHP CAHI ERKKKPYWSWIGFYTKRir 
ALRVAEWMDDYKSHVYIAWNLPLENPG 
IDIGDVSERRALRKRLKCKNFQWYLDHV 
YPEMRRYNNTVAYGELRNHKAKDVCLDQ 
GPLEN 


5032 


18933 


A 


5069 




395 


GTPTRPHILLQALFIRAMLPCPDYDTDT 
KTGLDQALKI CQAMLDEAANQGWL VTVL 
NNTNLIQMAIQGRWVKDSSLLTLPYIEN 
HHLHLFKKRKPIMKGPHAKSRTSIECLP 
ELIHACGGKDHVFRSLVQQ 


5033 


18934 


A 


5070 


3 


405 


PRASEVCGFSCHITCVNKAPTTCPVPPE 
QTKGPLGIDPQKGIGTAYEGHVRI PKPA 
GVKKGWQRALAIVCDFKLFLYDIAEGKA 
SQPSWISQVIDI4RDEEFSVSSVIiflSDV 
IHASRKDIPCI FRVTASQLSAS 


5034 


18935 


A 


5071 


3 


393 


ITRQEFIDGILASNFPTTTIEMTVGADI 
FDREGDGYIDYYEFGAALHPNKDAYRPT 
SDAPKTFHQQTRQVAQCI CAKRFLVEHI 
QENKYRPFLCNHFQDSHQMRLVRILLST 
VMVLDGGGW1«ALDQFLTT 


5035 


18936 


A' 


5072 


3 


394 


ITRHEFIDGILASKFPTTKIKlMTVdaDl 
FDRDCSDCSYIDYYEFVAALHPKKDAYHPT 
SDAYKIEDEVTRQVAQCKC3UCEIiLAEHI 
GENKYRFFLGNQFGDSHQLRLVRILRST 
VMVPVGQQWMALDEFLVND 


5036 


18937 


A 


5073 





393 


GEDAiUJNYSRQHYTIQKEIIDLQLDRIC 
KLftDQCTGLQGFLVFHRPGGGTGSGVTS 
LLMEHLSPDYGPKSKLEFSIYPAPQVPT 
AVHEPYNSMLTTHTTLEHSDCAFMGDNE 
AIYD I CRTNLD lERTTYTN 


5037 


18938 


A 


5074 


39 


482 


LGLHSAWRDDKIGYNPDTVSFVPISGWN 
GDNMLEPSANMPWFKGWKVTRNDGNASG 
TTLLEALDCILPPTRATDKPLRLPLQDV 
YKIGGIGTVPVGRVETGVLKPGMVATFA 
PVMDTTEVKSVEMHHEALSEALPGDNVG 
FNVKNVSV 


5038 


18939 


A 


5075 


1 


390 


GVSMAVSLVI I YQYWRNMPDPHISrLP IV 
AGWKKYPLFFGTAVFAFEGIRWIiPLEN 
QMKESKRFPQAUJIQMQIGTTLYVTIjAT 
LGYMCFHDEIEGSITLNLPQDVWVYQSV 
KILYSFGIFVTYSIQFYV 


5039 


18940 


A 


5076 


330 


462 


VNPFFREGGTESPSIAQRHVPWCSLNSL 
QPLPPGFKRFSCLSLP 


5040 


18941 


A 


5077 


1 


403 


GGW6VBAEDFEYAPDVEPLEPTLSNI IE 
QRSLKW1FLC3GKGGVGKTTCSCSLAVQL 
SKGRESVLIIFTDPAHHISHAFDQKVSK 
VPTKVKGYDNLFAMEIDPSLGVAELPDK 
FFEEDNMLSMGKKMMQEAMSAF 


5041 


18942 


A 


5078 


348 


1 


RPQGNNIERWQGGEVLI IQLLRIAKSVK 
NVQLLWKTQFLIKLHILLFYDPRIPFVS 
NYPKEKKTYA/HKKTYAQIFISLPRTFLF 
LRQSLMMLPRQDLSS 


5042 


18943 


A 


5079 


248 


3 


PGQVLEREHTCRRPWAWNVCICLAGPA 
KPNREDVSSSQEESLQLNSIPPTPTLTS 
TAVKSRQPLWGLKEMEEEDGSELDF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Predict- 

location 
correspon 

last amino 

relidueof 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutaniic Acid, F=PlienyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=]Vlethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopiian, Y=Tyrosine, 
X=llnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5043 


18944 


A 


5080 


3 


192 


GDC3SELRSCHCTPAWLTERDTSSQKKKK 
KE KKTWWPAPELFFFLLLEKKGQNQGKP 

WMKKVL 


5044 


18945 


A 


5081 


10 


171 


CFCLGPVGVCRSLLKPQLCFLC3FPWLQD 
ARQKFRSVLVEATVKLDELVTRSTRP 


5045 


18946 


A 


5082 


94 


1 


DFWPGKVAHACNPSTLGGRC3GWITRSGV 
QDT 


5046 


18947 


A 


5083 


142 


3 


AaSTGSHSVAQAGVQWHDLSSLQPLPPG 


^047 


18948 


^A 


T084 


"257 


"177 


FKRFSFLSLPSSWGRKIA 
TSLOEVAHACNPSTLaaRaGQITRPG 


5048 


18949 


A 


5085 


143 




GQESHSDTOaSVQWCTLGSTQPPLTLTF 
KQFSCIiNLPSSWHYRCVPP 


5049 


18950 


A 


5086 


3 


87 


HVGQAGLVLLASCEPPSLASQSDGITV 


5050 


18951 


A 


5087 


375 


220 


LHSLSQHIATRSKKPAENLFAFGTLSPV 
STNWVGFNPISFEGLMLIIPPFFP 


5051 


18952 


A 


5088 


40 


203 


IPLPLLLVSFFFYILTWGPTFFWGLTLI 
NDLLNFFFGINSGIS FSWFGSRAGELG 


5052 


18953 


A 


5089 


1 


145 


MUMRVPAQLLGLLLLWLSGAHCDIQMTQ 
SPSSIiSASVGDREDGDVDAA 


5053 


18954 


A 


5090 


146 


328 


FMPFPLLESLEIFSSKLFMPPNLFFFFE 
TESCSVAQDGVQWYDLGSLQPPSPGFKR 
LSCLS 


5054 


18955 


A 


5091 


338 


1 


PNLPSVQLPPTTSCTPLIQDLLVARAHF 
AGWKHQGINSPSLPMAPREGMEAPRNPF 
HPHIFLCLIHGFHLAVSKLYSLFYYYYY 
YFSRWSFTLVAQAGVPWRDLGSLQPLPP 


5055 


18956 


A 


5092 


2 


327 


PPLTPPIFPSSPKWKNPPPPHYNPPPPP 
FRTPPPPPYPPLSPPKSPPPPPRVDPPL 
PYSIFPRPNLISPPPYSPFYLLPQLQAF 
P 


5056 


18957 


A 


5093 


133 


3 


AQtCTPSTQINSKWITDLlWKGKTtKLL 
KDNIREKLDDLGCGE 


5057 


18958 


A 


5094 


170 


350 


AGGQGGNFYSLQPNPPWIRESSHSTLPK 
FWNYRHTPTGPPNFGFFFLKMGFGLVAQ 
TIFN 


5058 


18959 


A 


5095 


133 


274 


RDTTIFI.ESGTKGYFLFFSDKITSELVS 
KIGDKNWKIRKEGLDEVAG 


5059 


18960 


A 


5096 


2 




REMQIKTTVRYHLTTARMVLIKKSEKNR 
CWHGCSEKGTLLHCWWECKLLQPLWKIG 
WRVDAA 


5060 


1 896 1 


A 


5097 


235 


8) 


FPKKKKAQIKKPQAAQSLNLSLSLYIYA 
FTYTHTHRHTHTQTHTHTYIHIS 


5061 


18952 


A 


5098 


355 


169 


KILLLITDSCAHIS CRFSECMKYFNFFF 
HPHSVYVTDDKIRLLEEQLPHVFSNKME 
PFKVCN 


3062 


18963 


A 


5099 


160 


1 


FSAS YFNS KKKL I FLLLKLRMNFFVCFF 
GMESHSVAQMVQWCHIjGSLQPLPS 


5063 


18964 


A 


5100 


364 


151 


QMGFCHIGQMDLELLTSSDPSaSASQSA 
GITGVSHYASQEFLKKEFHSAHLIPLQG 

TCVQGKTAQPYSEAL 


5064 


18965 


A 


5101 


205 




AASPGVQWPDSRLLQPRPFGVKQFFPLS 
FPSSWGYRGAAPPPGHFFFFFPSRDEIS 
VCLPGWSPTPK 


5065 


18966 


A 


5102 


349 


207 


DOaSLCCPDWGLELLSSSGLPALaLQNP 
EITGVGHHTQPWTLSLLIY 


5066 


18967 


A 


5103 


425 


0 


CEFFKNKDFCFLSKKKKKKKKKKKKKKK 
KKKQSSSQTSYS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 
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09/515,1 
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nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
scid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylafanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5067 


18968 


A 


5104 


437 


184 


LFLFFFPPPKKVFSQKTPRSVYSPPPKK 
KKKNSPPPLKLGPPQEIFKTPPPPIFFF 
FFFMFFFSFFFFFKGHRPRFISLWSPPL 


5068 


18969 


A 


5105 


401 


10 


MFSPPFGGPRGGVPLRPGFKNPPGPKGK 
TPFFLKNQNYLGPPLYSPFLERLNKKKA 
FTLEAKGPNNPNSSRS PPPGAQKKNPPF 
KKKKKKKRKIIFMPQAHPAVYWCSGNQ 
LP ITFPYS I ISGPLSSFT 


5069 


18970 


A 


5106 


413 




PNKKPSPPPFSPPHKTPSSPSPPPPPNK 
PTPTPPPPNKKKFPPPQPPPKKKNPPQT 
PPPQKKTPPPPKKKKHPPPPHKKTPPPP 
HTKNPPPPPPLPPSLSIPPLFPLYINSP 
FFIPFFFFFFFKIFFYLGRVGGRV 


5070 


18971 


A 


5107 


269 


115 


GFFFFFFFSFFFFFFFFFFFFFFFFFFF 

FFFFFFIKKCHFIIFKYIGSTSN 


5071 


18972 


A 


5108 


238 




QKLPLTOR I Wy I LL ?FDS LNLKMFPQ YQ 
I G LQ KI KVI':^!WT T F YF L PS F FLRWS FT L 
VTQAGVQWCDLGPLQPPPPRFK 


5072 


18973 




5109 


496 

_ 


0 








KKKKKKKKKSSK 


5073 


18974 


A 


5110 




243 


DHLSLGGRGCSEQRSCHCTPAWVTERDS 
VS KKKKSFRALD I FFFNGKKILGTLRGK 
NQKPPREVSSGNYFHFKGLFGPCYK 


5074 


18975 


A 


5111 


78 


4 


LGMVAEIACaiPSTLGGRGGRITRSGV 


5075 


18976 


A 


5112 


2 


96 


AGVQWRDLGSLQPPPPGFEQFS CLSTMP 
NFL 


5076 


18977 


A 


5113 


128 


6 


lYKDVLEPGVLRLLDVDNRWIiPIEAPI 
RIIIPSQDVLHS 


5077 


18978 


A 


5114 


132 


2 


DDLSSLQPLPPRFKRFCOiSIiDSWDYR 
RPPLRPAIQEAEAAE 


5078 


18979 


A 


5115 


1 


146 


AVETGFCHVAKAALELLHSNDPLASASQ 
SAGITGMSHRAMPSLFSILS 


5079 


18980 


A 


5116 


213 


3 


GAVGFSSGCFLLYLYALLGPFPPLPLLQ 
LKFI CWPGAVAHVCNPSTLGGRGGRI TR 
SGVRDQPDQHDET 


5080 


18981 


A 


3117 


407 


0 


LGWAWWFTP VI PTLRBVKAGGS PEVRS 


5081 


18982 


A 


5118 


215 


414 


KIRPKEIRNKGGSQQRKRAQLGKLEQTN 
PVLNFKAPFFGEQGDYYKSFFKTCLDNL 
PRRGKGVFFF 


5082 


18983 


A 


5119 


40 


326 


KKKKKKKKKKKKFKKTGGAKWAPFLNLK 
KKGIKPQKKKGFFQGKKiCRGKKKKIPKI 
LIQGFFQKKKNFSKKKISQKKKKNGFWP 
LGKGFCPKKKT 


5083 


18984 


A 


5120 


99 


2 


GRSTRHSPAHTHTHTLRHTGTLLHTHSH 
RHSP 


5084 


18985 


A 


5121 


3 


219 


ELRLCHCTPTWATRVHLSQKKKREKKKK 
NFVGKKKGKKGKKPPNPKKKGSGLGFKG 
FLFSKFFKTRGTTII 


5085 


18986 


A 


5122 


262 


351 


TRPGTliAHACNPSTLGGRGGWIMRSGVQ 

DQ 


5086 


18987 


A 


5123 


340 


0 


KKKKl SQAWWWVP WPATWEAEVG 


5087 


18988 


A 


5124 


94 


2 


LMP 1 1 PALWEAEVGGAPGQEFQTSLANM 
VK 


5088 


18989 


A 


5125 


310 


68 


LSPFYKLGFWGRVIYPPPPPKVSPPVFG 
APPPPEKKKKPPPPPPKSSIKKPTEKKI 
CLAASGKGLEVCFCKKPLIQSYFL 


5089 


18990 


A 


5126 


343 


131 


SRPRRPGLELLTSDDLPAPATQSAGITG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

location 
correspon 

last^amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonin6, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSNHTQIWPISVGDLMNTGWIPEIIRIV 
SELLREIHFLRICL 


5090 


18991 


A 


5127 


2 


145 


FSLLGSSVGRLEGTGTISTHCTLPASRD 


5091 


18992 


A 


5128 


1 


130 


SRSOTRLECSGTISAHCNLPGSSDSPAS 
DYRLTPPCPADFLYF 


5092 


18993 


A 


5129 


' 


338 


VWKEGIRFEKEKSKDFKNHVIKYLETL 
LYSQQQVCKLWTFI 


5093 


18994 


A 


5130 


346 


3 


HTLFLKNTPPRGRIFFFLPPLLLWGGWA 
PPPPPKKNFFFFFFFFFFFFFFFFFFFF 
FPFFFFFMPPTQSLFKDQEGPVQGGEEA 

PV 


5094 


18995 


A 


5131 


3 


114 


FLHIGQAGLELLTSDDPPASASQSAGIT 

GMSHHAQP 


5095 


18996 


A 


5132 


292 


180 


AASTDSIEGHGASLPSKKTPSEEDFETI 
KLISNGAYG 


5096 


18997 


A 


5133 


338 


216 


RGENRLPPGGRGFSGPKSHFCP PAMATE 
RDSLSKKKKWPV 


5097 


18998 


A 


5134 


337 


3 


IWLGNGKGIIHNNQIFSNKEAGIYILY 
HGNPWSGNHIFKGRAAGIAVNENGKGL 
ITENVIRENQWGGVDIRRGGIEVLRSTL 
ICFGYSDGVWGDEGKGLIEGNPISAK 


5098 


18999 


A 


5135 


227 


343 


NYVKEKL I PTWNWMVS IMDSTEAQLPYG 
SALTSWDPGQ 


5099 


19000 


A 


5136 


166 


375 


ATFVSLGVFCSAVILLYFKNMMKLDLSL 
TPHTTINSKWIKRLHTRPEAIKFLEVNI 
GKKFFDIGHEIIL 


5100 


19001 


A 


5137 


90 


2 


KTTLWPGAVAHACNPSILGGRGGRITRS 

G 


5101 


19002 


A 


5138 


213 


379 


PTFQSVGETGSLLKMHILGPGAVflHAOI 
PSTLGGRGGRITRSGVQDQPGQYGETP 


5102 


19003 


A 


5139 


384 




FFFFFFLSVMESCSATQAGVQWCDLSS 


5103 


19004 


A 


5140 


41 


317 


TMSRDRPSDKTWTYHRSNWMPDDGAPF 
RYSFSALKDRHKAVEVNWIDPNMGWETA 
TELVEDTQAIARYGENVTKMDAFGCTSR 
GQAHRAGL 


5104 


19005 


A 


5141 


29 


523 


VKAEAAYKKaDDIWNLRKDDYFVNDEAR 
AUYWDDREKARLALEARRKKAEQQTQQD 
KNAQQQSDTEASRLKYTEEAQKAYERLQ 
TPLEKYTARQEELNKALKDGKILQADYJM 
TLMAAAXKDYEATLKKPKQSSVKVSAGD 
RQEDSAHAALLTLQAELRTLEKHAG 


5105 


19006 


A 


5142 


432 


144 


GGFLPKAFLPPKKKKGAFFPPPGKGAIP 
PLFWGGWTVFLI PEKNPPRGFPPPKRGE 
KPVGGPFFRIFAPPPQKKIGGGGFFLER 
FFFWDWGKTGP 


5106 


19007 


A 


5143 


403 


6 


FFFFKKQIGPRGGGPPQKSPPLGGGGRG 
GSPRPGVLTPLGPQGKHLFFLKKKKKLG 
GGGGPRNPNFLGGGPGKSLYPGGQRFQG 
PKILPFFPPRGKKKKIFFPKKKKKKSHR 
KLIQTLILQTGKLMKMLWRT 


5107 


19008 


A 


5144 


115 


427 


KKKKKKKKKKKKKKKGGGPKKKKNFSPG 
GGKKKFFFNGAKKKKPRAPVKKTGEKKK 
GGKKKKKCFEKNPFFGGGKKKKKKCSSS 
YPCPKKQ 


5108 


19009 


A 


5145 


2 


79 


GCSEPRSCHCTPAWVTEGDSISKKK 


5109 


19010 


A 


5146 


317 


444 


IHQPVCVAHAYNPSTLGGRGGHITRSGD 
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SEQID 
NO: of 

sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 

peptide 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanjne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Ptienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














RDHPGQHGETPYPL 


5110 


19011 


A 


5147 


3 


413 


SFYRGFIPVLQTVTVLIIGDPSLHGDAW 
Sl-JMEFFLTVIFIALVJMLPLIGLTIWNA 
IWFQDIADLAFEVSWTKPHPFPSVSQII 
ADMIF PLLLQAI FL I QGIAVS LFP IHLV 
GQLDSLLHMTLLYSLYCSEYRLVIN 


5111 


19012 


A 


5148 


409 


247 


QTK?RH\'AQAGPHFLGSSDVPTSASQSV 
GITGMSLHAQHQHFFTASMGRAFGEN 


5112 


19013 


A 


5149 


284 


1 


AEASRAHMEAHAETQGTPGSKAEGVGPV 

PPHPANFVFLVETGFFRVGQAGLEFPTS 
GDPAASASPV 


5113 


19014 


A 


5150 


439 


120 


FGSPRLECSGVISAHCNLELLGSSDSPA 
SASHVAGITAEPNARI IKAFGAPSTCTV 
VWERHEQQTCIQRSDLPAGRQKKTGPKN 
RPGLVAFPFNASPLGGPGGRIA 


5114 


19015 


A 


5151 


415 


0 


FFFFFFFFFFFFLPFVQSnflYIYMYI 


5115 


19016 


A 


5152 


67 


179 


I CLSLFLTVHVCV-CVCVCVCVCVCVCEM 
VFGVSIPCN 


5116 


19017 




5153 


280 


2 


KTPFFUSLKEKTRAPFFLKKTFFKKKPF 
KIFFSKKRNFFFFPPPGKRPPTLFKKKF 
FFFFQKKKNFFFSPKKKKKKKKKKGRRS 

KSHTSPRV 


5117 
5118 


19018 




5154 


296 


406 


DRVLLCHSSCSWAESRLTAASTSWAQA 


19019 


A 


5155 


114 


402 


V'WRrKASVEKKKKKKKKKKQKKKKKKKR 


5119 


19020 


A 


5156 


110 


1 


QRFLLTLKSLI LRLGAVAHACNPSTLGG 
QGGRIMRS 


5120 


19021 


A 


5157 


382 


95 


SSSSPHFLTPSQIiRLFFPFFPLKIFFFP 
KAFNFGGGVFPIFSPPKKKFFFQNSQVG 
FKNPPQKEKNFSFPPPVKFGPPKGFFKR 
PPPFFFFFFFFF 


5121 


19022 


A 


5158 


2 


350 


TLQPGRQSETPFQKKKKKKKKKKiOCKKK 
KKKKKKRGGGFLKNLGGGPKIWGVEKFF 
FFFFGGGYKKPLGFFLEKIFFLGGGIWA 
PPPPKKISFWGKKKFFWGGGGKTPLFFC 
GGKK 


5122 


19023 


A 


5159 


1 


208 


PTRTLYLVKLGLMGPAEIFLSSFQEIRL 
I DYDKMVDHRGVRWAFGQWAGVAGKYA 
RAYITHFUdWLGE 


5123 


19024 


A 


5160 


1 


208 


PTRPLYLLKLSLMGPVEIFLSSFQEIRL 
IDYEKMVDHRGVRWAFGQWAGVAGKYA 
RAYITHFLNWLGE 


5124 


19025 


A 


5161 


419 


25 


AQKKKKKKKKKKKKKGGGPLKKNFGGGK 
NKRGKKKKNFFLKRGGKKKTGGILEKKN 
FFGGGKKGEKPPQKKKAIiKEKKKFLRGK 
GGKKPLKWGGKKKW 


5125 


19026 


A 


5162 


475 


150 


KPRTSGTARVPGD PPPRCLDDKRLPSRP 
VI FVFLAETECDRVGQDGLQLLTSGDHP 
HLTFPNCYNYRREPPRPTSSGFQHDIET 
LHYRRGKADRFKAHLPK 


5126 


19027 


A 


5163 


237 


409 


FHNQKTYLDSALCTLVYFWFVLRDGIF 
LLPQLECSGTVMRHCIII.KLI1GSRSPPAL 

A 


5127 


19028 


A 


5164 


373 


476 


LGFILLS FLETRSHS VAQAEVQWCDHGL 
LQPQTP 


5128 


19029 


A 


5165 


374 


132 


IFYSPFLYYHMLGKYHKIKNLKIKHMWL 
nAVAHACWPSTLGGRGGRITRLGDRDHP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutaniic Acid, Ifc=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=!VIethioninc, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vflline, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GQHDETPSLLFLWTFASSLSGFPL 


5129 


19030 




5166 


332 


2 


RPQGIiAWCFPGMiP PTKSEGFPGQSPGF 
QIWGGKAFFLRGFPFi'PQVESKGAVSPP 
CKVPPPGLRPFSAPTPPKNGDKRGPPPG 
RATGFFFFFVFLVETGFHHVSNDGLY 


5130 


19031 


A 


5167 


^38 


-j 


SRIGVGNLYRLMRGYATGGYVGTPGSMA 
DSRSQASGTFEQWNHGSAGESRG 


5131 


19032 


A 


5168 


I 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5132 


19033 


A 


5169 


3 


127 


DEDFSDFDEKADDEDFVPSDASPPKTKT 
SPKQVSYLIWVLS 


5133 


19034 


A 


5170 


20 


254 


lAPPAHIRHEERERERERERERERERER 
ERERERERDTRAEWRARLFFFFFFFLPA 
DQCFFIKPSGQRFLPPGGGGVP 


5134 


19035 


A 


5171 


36 


479 


TXGXDNKKDLLISVGDLVDRGAENAECL 
ELITFPWFRAVRGNHEQMMIDGLSERGN 
VNHWLLNGGGWVFNLDYDKE ILAKALAH 
TADELPLI lELVSGDKKYVICHADYPFD 
EYE FGKPVDHHQVIWNRERI SNSQDGIV 
KEIKGADT 


5135 


19036 


A 


5172 


278 


448 


RFLCPLLHPFFSSETKSLTLLPRLGCSG 
VI lAHCSLKLLGSSNPPTSASRVAGMTG 


5136 


19037 


A 


5173 


205 




HARLIFVFLVETGFHCVGQAGLELLTSG 
DPPSASQSETPSPLKILKSAGCGGANST 
ALASGLLFSLGP 


5137 


19038 


A 


5174 


409 


194 


FFFLRWSLALSARLVQWRDLGSRQAPPP 
GFTPFSCLSLLSISFQPSATYMKFLNTI 
TIVEYSFAYFPISQP 


5138 


19039 


A 


5176 


339 


472 


AKFTCISSNIKLSNTRPGTVAHACGPST 
LGGRGGQITGSGDGDH 


5139 


19040 


A 


5177 


426 


148 


GGRGPPPLSHPFaGGGGGGPLRAGGQKN 
PGPKGETPFFGQTQKKNRGGATPPFSQK 
LGGEKHKNSFTPGKENSIKFDFPPAPPT 
WGEKKNFFF 


5140 


19041 


A 


5178 


328 


474 


NNTEYNKNRLGAVAHACNPSTLGGRGRW 
ITRSRDRISP 


5141 


19042 


A 


5179 


448 


330 


BTGFCHVGQAGLELLGSSDLPASaSQSA 
GIIGVSPHAWQ 


5142 


19043 


A 


5180 


259 


14 


LRYKAILCSWIRTTNIVKMftELPKVIYI 
FNAIPIPKIPDMFCRIARFILKCMWNVK 
VSTIDKTIFKKEimVGIPPRVLMW 


5143 


19044 




5181 


2 


423 


BCFYATLVRYVGDRKULVCHEMSMALLSN 
LAQGDALAARAIAVQKGS IGWLIS FLED 
GVTMAQYQQSQHHLMHMQPPPLEPPSVD 
MMCRAAKALLAMARVDENESEFLLHEGR 
LLD IS ISAVLNSLVASVICDVLFQ 1 GQL 


5144 


19045 


A 


5182 


182 


312 


FFIQVGFHHVAQAGLKLPS S S S P PHLAS 
QSAGITGVSHCAQPG 


5145 


19046 


A 


5183 


301 


472 


GIPFFFFFLRRSFALVAQDGVQWRDLGS 
PQPPPPGLKQF FRNS TAWTKGLQP FPRL 


5146 


19047 




5184 


216 


413 


KPXXKKKKKKKKKKKKKKKKKKXGGAVK 
KKGPQKTPVKPPGGFFFAPPLGVSPPPG 
VFFAGGGAPP 


5147 ! 19048 


A 


5185 


357 


465 


IKPTKILAPGPGAVAHACNPSTLGGQGG 
WITRSGD 


5148 : 19049 


A 


5186 


467 


0 


FFFFFFFFFRFLFFFFFSEKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 

correspond 

amino acid 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

relillueof 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=lInknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5149 


19050 


A 


5187 


346 


3 


RFLPLGQGGVEI LTPRSTCLGPPKWWDY 
KGEPLHPAQKCKFFKKKRHHKKVQNSVA 
VYMVYHSMCGENKLRWRWWGIFTHTHTH 
THTHTQKKQNILSRTHKKLVITTGSEEG 
TL 


5150 


19051 


A 


3188 


11 


109 


GETPSLLKVQNIXWAWWHMPVIPAMWEA 

EARES 


5151 


19052 




5189 




151 


QLLRRPRQSMHLNPGSRGCSEPRLHHCT 
PAWATVQDS I SNTNNNNNKCP 


5152 


19053 


A 


5190 


2 


373 


AVAADKGVPLYRII lAlILAGNPDLILPVP 
AF SV INGGSHAGNKLAMQEFMI LPVGAS 
S FKEAMRI GAE VYHHLRG VI KAKYGKDA 
TNVGDEGGFGPI ILDMMDALELVKTAIQ 
ARGYPDKGAIGR 


5153 


19054 


A 


5191 


55 


421 


AR VADVCE SMKEHLLVLVERAKYIPGFC 
ELPLDDQGALVTAHAGEHLLLGATKRSM 
VFKDVLLLGNDYIVPSHCPELAEMSRVS 
IRILDELVLPFQELHIDDNEYAYLKAII 

FFDPDKETEA 


5154 


19055 


A 


5192 


163 


466 


TCFL.GSQSAGI'.'SVHHCPSREVFFLKLI 
HKRQGGQVALLVATPHSPCCPQYRLAPI 
PR RRHDFACASL T FAC TliVHVLLMPRV 
SAGRGVGVRPAGIQAGR 


5155 


19056 


A 


5193 


470 


0 


E CLWPGTVAHASNPTTLGGRGGR IMK 


5156 


19057 


A 


5194 


491 


281 


RGLALLPMGQCSGMMLAHGSLDLLDS SD 
PPASASQSVG IMGVSHHAWPSLVNISFV 
CLIHRSPKTKPEGR 


5157 


19058 


A 


5195 


3 


222 


LS FHSLHRCLYMLGTTSKNVS PFSLLEL 
LSRLATLLGDYCGSLSEGTISENVALVY 
ELLDEVlfiLESRCVAQA 


5158 


19059 


A 


5196 


20 


191 


STWWNSSRGGGCSEPESHHCTPAWVTER 
DSVSKTKECKINNKPALALPAVHPTSHPG 


5159 


19060 


A 


5197 


460 


350 


RRLRRENHLNPGGGGCSELRSSHCTPAW 
VTERGPSD 


5160 


19061 


A 


5198 


174 


41 


SLGELLTECLFETKSHSVTQAGVQWRHP 
GSLQPPPQGFKRIPPH 


5161 


19062 


A 


5199 


131 


454 


FEYFKNRKPFFFFFFFGKGVSFCAPGGK 
AGD P FKLREP PPPRVKGFFGPP PPSGGN 
NGPPPPPRVFFWFFKKRGGSPFWPGGVL 
TPGGQPPPPAPQRGGINGLDPPAR 


5162 • 
5163 


19063 
19064 


A 
A 


5200 
5201 


168 
414 


51 
497 


TVKTPIRPGAVAHACMLSTLGGRGMWIT 
RSGAEAQPNR 


5164 


19065 


A 


5202 


381 


614 


VQPGAPPEWVALSTCPSflAPEGSQQPYI 
PPTFCFFPVKWLLSVTWLYLFIYFLSWI 

SVAQAGVQWCNHGSLQPRPSRL 


5165 


19066 


A 


5203 


377 


485 


PKEPGPQPCAPQPQPHAPQPQPRAPQPQ 
PCAPQPQP 


5166 


19067 




5204 


54 


434 


MRTPENLELTtTPQEFGSSWAAVECPDTL 
DPRDMCVLNPLRE P FAKKE CSILLSEVF 
EICHPWDVTWFYSNCLTDTCGCSQGGD 
CECFCASVSAYAHQCCQHGVAVDWRTPR 
LCRECPRQSPEGTE 


5167 


19068 




5205 


201 


418 


GDAGAHAVLRLLPGHPGTSHLPFSSLTT 
SPLGLKRVTGGFNSKNRCDARTYCYLLP 
TFAFAHKDRDVQDETY 


5168 


19069 




5206 


27 


177 


NSFHHAGQSGLELLTSSDPPASASQSAG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 

otide 

correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptoplian, Y=Tyrosine, 
X=lInkno'wn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ITC5VSHRAQPKAGTFLSPFHR 


5169 


19070 


A 


5207 


6 


TT75 


DLC3PHTPAWTRPKREDLVYQSTVRLPEV 
RISDNGPYECHVGIYDRATREKWIiASG 
NIFLNVMAPPTSIEWAADTPAPFSRYQ 
AQNFTLVCIVSGGKPAPMVYFKRDGEPI 
DAVPLSEPPAASSGPLQDSRPFRSLLHR 
DLDDTKMQKSLSLLDAENRGGRPYTERP 
SRGLTPDPNILLQPTTENIPETWSREF 
PRWVHSAEPTYFLRHSRTPSSDGTVEVR 
ALLTWTLNPQIDNEALFSCEVKHPALSM 
PMQAEVTLVAPKGPKIVMTPSRARVGDT 
VRILVHGFQNEVFPEPMFTWTRVGSRLL 
DGSABJ'DGKELVIERVPAELNGSMYRCT 
AQNPLGSTDTHTRLIVFENPNIPRGTED 
StJGSIGPTGAELTLVLALTVILELT 


3170 


' 1907F 




5208 . 


401 


3 


FFFFFSETESHSVAQAGVQWHHLGSLQG 
PPPGF 


5171 


19072 


A 


5209 


402 


229 


GFFKKGPPFFFFFFFFFFFFFFFKMVWL 
RLSGSKVEWVWGSQKLYMNFGCCRGRSW 
FT 


5172 


19073 


A 


5210 


198 




PPQAKILSSSSPPIRPPPKKGFFPKNPQ 
VGFYSAPHKEKTFTLPAPVKFGPPKDPL 
KRPPP 


5173 


19074 


A 


5211 


516 


-m 


GSTLRLTQRSMPASSSTMAPSSKRLLLP 
RPERSVPAAAGTAGHHBASRNCGRGGAG 
ADEGPATKGDSHPKPGYCRAAHPSAAPW 
PPGPEKKFMRVG 


5174 


19075 


A 


5212 


103 


203 


DKVSLRCPGWPQTPGLKQSSCLSLPKCW 
DYKCE 


5175 


19076 


A 


5213 


409 


249 


RRNQVGGPRQUSrPGKRIKSPGIDPYIYG 
QLIFDKAVRAIWWRKGSSFIiQKALE 


5176 


19077 ■ 


A 


5214 


2' 


170 


ERKILQYIQLRKGPNWGPYGLLQPFaD 
AIKLFTKEPLKPATSPYPLQSPAFPLKP 


5177 


19078 


A 


5215 


270 


464 


GQLNKNVWGWDPGKVTFRDGVSLYCUHW 
NAVAIHGHRHSVPQPQTPGFKQSCLRLP 
SSWDYMPLY 


5178 


19079 


A 


5216 


156 


248 


PDKSYI ICVCVCVCVCVCVCLCVWVCME 
CP 


5179 


19080 


A 


5217 


412 


2 


PPPQIFFFSPPHFFSPPPKGGFFPPPPP 
KIFFFPPPLFFFWGFFPPFSPPPKKFFF 
PKSPPFFFFPPPKKKKFFFFPPLFFFPP 














FRMGCDRWH6EFWILRDGFD 


5180 


19081 


A 


5218 


165 




QVKYHKTKKTHNPIKIGKRSEQTFHEKDT 
QMANKYMKHSPTSLRIKGIQIKTIMRY 


5181 


19082 


A 


5219 


408 


88 


PSSSPHFFSPPPLGGVFPPFPFKIFFFP 

PGLFFFWGVFPLFSPPPKRVFFPNSPPG 
FFFPPLFGKKFFFSPPPFFWPPPGFFLS 
PPPLFFFFFFFFFFFFFFFF 


5182 


19083 


A 


5220 


3 


259 


NLLLL I VP IL lAMAFLMLTERKI LGYI Q 
LKKKKKKKKKKKKKKKKKKKRGGAFKKK 
PRGGQNFPGEKKKKFFFLREVKKKPRGN 
F 


5183 


19084 


A 


5221 


10 


479 


KLMPAESDGRHRERERERERERERERER 
ESSSAEAECVSLSLDRYKNRVYMKVAEA 
LCECRLLAYISQAPTQMFFLLRLINIIH 
AHTLTQENDMCLHTTLEI LTtARQRERV 
PLYLRVIQRMEHKKKHPGFVLHNSHNLV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ 10 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 


Predict- 

otide 

location 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionlne, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possibIe 
nucleotide insertion 














RLWQQLYLHRDRDSIC 




19083 


A 


5222 


142 


56 


KFQPGAVAHTCNPSALGGWC3GQIKRSGV 


5185 


19086 


A 


5223 


14 


471 


LEMRSEIDSQNYCNCDAGRNEWTSDTIV 
LSQKEHLPVTQIVMTDAGRPHSEAAYTL 
GPLLCRGDQSFWNSASFNTETSYLHFPA 
FHGELTADVCFFFKTTVSSGVFMENLGI 
TDFIRIELRGKLATLDKSQGTHYIiAIKA 
LTQKKKKKKTRG 


5186 


19087 


A 


5224 


480 


0 


FFFFFFFFFFFLIVAQFYFL 


5187 


19088 


A 


5225 


16 


422 


VRRTARI RHEGKP YECNACGKAFNRSAH 
LTEHQRTHTGEKPYVCKECGKTFSRSTH 
LTEHLKIHSCVKPYQCNECQKLFCYRTS 
LIRHQRTHTGEKPYQCNECGKS FSLSSA 
LTKHKRIHTRERPYQCKKKKKKK 


5188 


19089 


A 


5226 


99 


436 


GHP S FE I PAIMTAAP ASFP PQ VPWEDVR 
YLFDEIMYGGHITDDGDCKLCRVYLEEF 
MNPSLLAALSGTHACGEKPRLPANSHVS 
EPSWKWILQPQSSLQMTMLMTILTATS 


5189 


19090 


A 


5227 


473 


9 


FQITATPHIAVYDPTVQFEFWFSEKQIA 
D IRQVEASTRYLGTALYWIAAS INIKPG 
HDYYFYIRSVNTVGKSAFVEAVGRASDD 
AEGYLDFFKGKIAESHLGKELLEKVELT 
EDIASRLEEFSKEWKDASDKWNAMWAVK 
I EQTKVGEEDVTSG 


5190 


19091 


A 


5228 


1 


473 


PPPIDRLPNCTACRNSARVMTDAGQPHS 
EADYTLGPLLCCGDKSFWNSASFNTETS 

YLHFPAFHGELTADVCFFFKTTVSSGVF 

MENLGITDPIRIELRGRLATLDKSQGTH 
Y LZ\ I FJvLT Q KKKKKT HGGAGP P FP L IG S 
R I T IHGP P FNNAAMREK 


5191 


19092 


A 


5229 


170 




KTPLLGPGTVAHACNLSTLGDQSGCIMR 
SGVRDQLRQHDETPSLLKRI 


5192 


19093 


A 


5230 


465 


0 










ARG 


5193 


19094 


A 


5231 


419 


55 




5194 


19095 


A 


5232 


79 


451 


AGSNIiQEHRGLRPESPENPGQGGCREPR 
RHHCPPAWAKEAKLRLKKKKHGENQEVA 
QGRHGPFFPGLGPEGFLPRRPGSRKRRK 
GRNVPGPELKGPGAVLFSRHRRGGADLR 
PPRKGPRGGGGG 


5195 


19095 


A 


5233 


39 


457 


VQQQQRAQEQQQQHPVLHLQPQQIMQLQ 
QQQQRQISQQPYPQQPPHPFSQQQQQQQ 
QAHPHQFSQQQLQFPQQQLHPPQQLHRP 
QQQLQPFQQQHALQQQFHQLQQHQLQQQ 
QLAQLQQQHSLLQQQQQQQIQQQQLQRM 
HQQ 


5196 


19097 


A 


5234 


194 


45 


MI LKFFFNFKIHFFFFLRQSLALVAQAG 
VKWCNVGSQQPPPPGi''KRNST 


5197 


19098 


A 


5235 


204 


475 


ASITVHLWYVEKCYSQSTEKVTPMPSSS 
WFGPQPDAQLKHGLRPGAVAHACWPSTM 


5198 


19099 


A 


5236 


59 


357 


NHWRKIYCKVYWRKKIKHNTTNTTKHYQ 
NFQRNRTKSSFTIFRIQSHYNKHIKKHI 
QRECRQSIETDLEMTQMLELVWIFKWIV 
ILTKIKTPKTLKING 


5199 


19100 


A 


5237 




166 


ELLAFWQTQYNNMNAVWLFFFWI KAPG I 
PTPSFASRRTHSPFFSFLFFLFFFLGKK 
GFPLCGPGGPKSLWLKTPPLGPPKGGK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

"eptidr' 
sequence 


Predict- 

location 
correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lyslne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=pos$ible 
nucleotide insertion 


5200 


19101 


A 


5238 


34 


296 


LSVAEFIjAFWQTQYNNMNAVNLFFAWIK 
VPGI FTPS FASRRTHS PFFS FLFFLFFF 
LGKNGFSPCGPEGPKFLNLITPPLGPPK 
GWE 


5201 


19102 


A 


5239 


340 


472 


S LNE TH VRS PAHHR PGATAHAC NP I T LG 
GRDGLIMRSGDQDHPG 


5202 


19103 


A 


5240 


460 


0 


PKKKIFTIYFFYFFFFFFFFFFFFFL 


5203 


19104 


A 


5241 


161 


46 


SFYSTHRKPCHWPGSVAHACNPSTLGGR 
GGRITRSRDR 


5204 


19105 


A 


5242 


175 


36 


LKY YLI FI YLLS FFFLRQSRS VGQAGVQ 
WRDLSSLQPPNSTAT,QYR 


5205 


19106 


A 


5243 


11 


442 


DTI RWGLPTLGSKS TTNEKKREKRRKKK 
EQQQSEANELRNLAFKKIPQKSSHAVCN 
AQHDLPLSNPVQKD3REENWQEWRQRDE 
QLTSEMFEADLEKALLLSKLEYEEHKKE 
YEDAENTS TQSKVMNKKDKRKNHQGKDR 
PLTV 


5206 


19107 


A 


5244 


459 


333 


FLRVTQAGLKLLSSSDPPTLASPKCWDY 

RHEPLCPAQWSVS 


5207 


19108 


A 


5245 


229 


^6 


1?NS TWRKRC^LGGF lE3^"Li^^ 
WSLTYVLCVSFFVCL?LRQSHFVTQAG 
VQWHNLGSLQPLP 


5208 


19109 


A 


5246 


227 


30 


VFFARWVFLRQDLDLSSRLECSDVILVH 
CNTOLflSSRGTPSSVSKVAGTTGMLYHT 
MVFFYEFHR 


5209 


19110 


A 


5247 


173 


442 


FLGSSSKAAITLYYCQYMEFFFFFFFFE 
KNFFFSPPGGGEGPNFILLEFPPPGLKK 
ISCPTPPRGWDYGVAPPPLVNFWFFKKN 
GVPPPW 


5210 


19111 


A 


5248 


255 


358 


TGPGTVAHTCDPSTSGdRGGQlTMSCiDR 
DHPGQH 


5211 


19112 


A 


5249 


3 


130 


QPQLAAPSISWAPTSASQVAGTTGVCHH 
AWLIFLFWVESRRG 


5212 


19113 


A 


5250 


90 


1 


S TRLEOTTGTMSAQCNIiRLTGS SNSPNSA 

SR 


5213 


19114 


A 


5251 


358 


447 


SFISYRRRSPSPYYSRYRSRSRSRSYTP 


5214 


19115 


A 


5252 


55 


202 


CWNTSMRGP IWE2\EAGESLE PGGRRS CG 
EARTCHCI PAWWRAKLVDAA 


5213 


19116 


A 


5253 


330 


3 


EEMGFSHVGQGGFKItASGDTPAWAFQK 
GGISGVSPRAWAGFIIFIWHFSGGPQTG 
FPFYPFRGI FFPEAKINYIRVLUPKGLC 
QKGVFFFFFSGFPHYRCRLCQISV 


5216 


19117 


A 


5255 


175 


3 


MYLRFPLGLSLSERPGMVAHACNPCTYW 
ESKAGGSLSLGVQDQPREHGGTLCLQKS 
K 


5217 


19118 


A 


5256 


263 


1 


RPRRRFMRDKTALLQEMPFRTFTALGKL 
ISDFKDSKDGLTLYFVETASRSWLAGL 
ELLTSSDPPTSASQSTGITGVSHFSQP6 
LAF 


5218 


19119 


A 


5257 


1 


103 


CHVSQAGLELLASSDPPASASQSVGITG 
VSHRTW 


5219 


19120 


A 


5258 


128 


2 


IHKWRDCKMVQPLWKRAWQFLMWLNTHL 
SCDPTVLLLDILTQ 


5220 


19121 


A 


5259 


381 


2 


FFFFFSEMESPSVAQAGVQWCGLGSLQA 
PSPGFT 


5221 


19122 


A 


5260 


2 


173 


ENYI4NMGGGGCSEPSLHHCSPALAIEED 
SVSTTTTTTKQQQQNNQLGLFSYPFNPL 
I 



864 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nude- 

location 
correspon 

last^amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine,lVI=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5222 


19123 


A 


5261 




175 


IKGNQIGKKEVKLFLFTDDMILYMENPK 
DSAKNYYKQSKTFCLPWLGRSCKRTCP 

LA 


5223 


19124 


A 


5262 


2 


145 


QVSLKLLGSSDPPALTSQGAGIMGVSHR 
TRPRUTDFKNMTQKSKNTQ 


5224 


19125 


— 


5263 


182 


3 


STTS fLNFFLTEMVSHYVSQAGLKVLGS 
SDPPSSASHSAGITGMSHHTRPQQPLLN 
LKV 


5225 


19126 




5264 


183 


369 


WNSLDFFFFFPSFFRGYSSMSKVQDAFI 
FYRRFIDKSKPRAKTWGSIGGLYQQQIH 

PMDALR 


5226 


19127 


A 


5265 


329 


207 


RGCSELRSRHCTPWVTEQDSVSQKRKK 
KSPCHLQLGTSQ 


5227 


19128 




5266 


177 


13 


TLTILLNFFFLRQSYSVAQAGVQWHNLN 
SLKPPPPGFKRFSCLSLPEFQQGHNIR 


5228 


19129 


A 


5267 


474 


122 


FICSPHPGGFLTLGPKLNILGPSQPLFP 
LRGGSKKQGIPFYSYKXKGPPGAGPPGS 
ARYSNPFGGYGGRVPLPPEFPDPPGEHK 
PRFFIiKKKKKGHQVPTSKYIiPEYLFAYY 
NASLE 


5229 


19130 


A 


5268 


55 


311 


GIQTFQKNVWAaRSKNVGMPIAMLLHT 
DGEHERPGGDATVTIAHRYTPKEQLKIH 

TQLADII IVAAGSLSISHAGVQWRNHGS 
L 


5230 


19131 


A 


5269 


238 


133 


NKNIVn/QKKCVFYIYLPLICVCVCVCVC 

vcvcvcv 


5231 


19132 




5270 


336 


517 


PVLYILEHSPYNASIYQLALKKYQSRPG 
AViUHACaDPNTLOGRGGRITRSGDRDHPG 
QRGE 


5232 


19133 


A 


5271 


262 


467 


VI.SMRPRIHGSAAEEEDEHPYELLLTAE 
TKKWLVDGKTKGTFPTTPGRTNSKGIF 
KVCDPEWKGKMS 


5233 


19134 


A 


5272 


213 


2 


MKIYINVYILLFSLKKRQGPTLSPRLEC 
SGMI IVHRSLKLVGSSDPPALTSRIAWL 
laSEHTPDCSVCKF 


5234 


19135 


A 


5273 


284 


105 


YSCSYSYEDEPVELRNSSFERWUHSSDS 
YWKKKSSKDTEPVLKPPGYSARYECKTV 

GSS 


5235 


19136 


A 


5274 


290 


449 


IFYFSGRVHAKLSAPLAGMGNAKADSRG 
RSRTKMVSQSQRMYCLSALLYLSLC 


5236 


19137 


A 


5275 


248 


423 


NKVWITLMTKLEKDTTKKENYRPISLLN 
IDMKILSKILQNQIQQYIQSIIHHDQVK 
LL 


5237 


19138 


A 


5276 


425 


30 


TRRGMPHRQGS PRKAPVS VRRQRVREE C 
GREPLLWFLQSQLLGRLRQETHLNWGGR 
GCSEPRS CHC3 PAWATRANS I FCNFQAS 
S VEVRRSARKKLFSD ILKRHNTITWRVS 
GLLLVDSYFGRLATPVRTQ 


5238 


19139 


A 


5277 


25 


185 


SLFSVAEFAFNAYKVQTEKEEZEEARSK 
YKEAKES FQRFLENHEKMTSTTRYK 


5239 


19140 


A 


5278 


263 


483 


VFT WFFQLFCVYKLI FSFLTECRS VAQ 
A!UIAGAHWHNLGSLQPPLSGFKQFSCLS 
LPEFPRMYSVLSPSP 


5240 


19141 


A 


5279 


25 


252 


LETLGLMTLVQCGGIPGFCHVGQAGLKL 
LTSGDLPASASQGAGIAGMSHHTRPILL 
HFIYFILLYIIPPCLAYFIIF 


5241 


19142 


A 


5280 


326 


203 


lETESCYVAQDGLELLGSSNPPTSASQS 
AGITGVSHHTWTT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peSr' 

sequence 


Predict- 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3242 


19143 


A 


5281 


210 


459 


NKNNKKFNEKKKKNYKKWARELYRHFSK 
DD IHVAKRYMKRYPTSLI IRKI QI KTSM 
RYHLTPIRIATIPR 


5243 


19144 




5282 


262 


454 


LLFTDYHE(3MLHSVKLLFCFLGWSFTLA 
IQAGGHWRDLGSLCPLLPRLKQSSCLNL 
LSSWDYKR 


5244 


19145 


A 


5283 


451 


303 


IIGMGHHAQPKQYSFVILTVQ 


5245 


19146 


A 


5284 


217 


489 


PVSISIILRCLSLPTPGSRVLCMLEGMD 
GISLGWVPLPSLPVPLHPLDLSLPVCRQ 
VGGTKTGWRYVGETDFAKGEWCGVELD 
KPLGKND 


5246 


19147 


A 


5285 


247 


99 


FFFLKNKTQPYFIYKRNILSM 


5247 


19148 


A 


5286 


2 


117 


PRVRPRVRPRVRPRVRPRVRPRVRKKKK 
KKKKKKKKKKGGGF 


5248 


19149 


A 


5287 


121 


3 


KFWPGAVAHACNPSPLGGRGGWITRSGD 
RDHPGQHSETP 


5249 


19150 


A 


5288 


416 


48 


IGKPGSFFPPPPSPRGAPPPPPKIFFLP 
PPPFFFRAGFPLFPPPPQIFFFPKIPPP 
FFFTPPLKKKFFFPPPPIFESPLGLFFF 

PPPPSLFSFFFFFFFFFFFFFFFFFFFF 

FFFFFFLGVR 


5250 


19151 


A 


5289 


503 


0 


Ft't't'FFffSEt't r't't'SSSffffft'Ffff 


5251 


19152 




5290 


70 


236 


I F CN SQ'L TS FHFJ-IQ KKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGGDFKKKIGGAK 


5252 


19153 


A 


5291 


443 


287 


QFTKKKKKKKKRKKKKNKKKKTKQKKTI 
IPKPPLF 


5253 


19154 


A 


5292 


334 


115 


BaJPNALFFFFFFVDRISLLPRQWHNTiGS 
LQPLPPGFTQFSCLSLPNSWDYRCEPLR 
PALLFFSSKSKIHIKPL 


5254 


19155 


A 


5293 


413 


3 


SSSPPIFFPPQKKKIFSPPPPKKFFFPP 
KTFFFLRGFFQIFPPPKJOSIFFPKKBQNF 
FFYPPKKKKIFFFPPPKIFPPPKIFLKT 














LEKGFPDAWMIAWAIIAW 


5255 


19156 


A 


5294 


291 


11 


YINVPGFC3HKGFI I IGSSQKAQKMIFFF 
LDRVLLCHPGRSTWQSQPTAALKNWAQ 
KTLPPQPSE 


5256 


19157 


A 


5295 


2 


401 


NPRSTEAAIKYFLTQATASIILLIAILF 
NNILSGQKKKKKKKKKKKKKKKKGGGPK 
KKKIGGGKKKPGGKKKIFFFIHGEKKNP 
RGEFEKKTFFWGGEKRANPPPKKKTLGE 
KKKILGGGGGKKFFFFVGKKK 


5257 


19158 


A 


5296 


49 


363 


RLQKKNKEKKKKKKKKKKKKKKKKNKRK 
KKKKKKKRRNGGGGKLLWKGGGQILTKE 
KILILVAVSYTCWIIMNLGRQRKWIIGGG 
KSEPNYE FKRNIGAGRGGSLL 














EYLNTCVRICHSHAHFLPPSYVSFALLP 
RFLFKWLLYMQVFGEGTEAVKKSLEGl 
FDDIVPDGKVKINVCFLF 


5259 


19160 


A 


5298 


595 


0 


VCVCYCMCACICIC 


5260 


19161 


A 


5299 


412 


141 


TVDTSYSEKKILFQILLLIDNGPGHLRA 
LMEMYKE INWSMPANTTS IMQLMNQGV 
I SI FKSSYLRNAF CKAIAAIHSDSFDES 
GQSNLL 


5261 


19162 


A 


5300 


216 


2 


SQDWLRKFCYHPRVFQRQPIQKLMGLFF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 
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eth 
od 


SEQID 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 

last^amirio 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tr>ptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FCLPNMVKNGVFFFFEMEGRSWQAEVQ 
WHDNSPLQPKPPGLK 


5262 


19163 


A 


5301 


368 


3 


IFFFPQGKEGGGLFYIFFPPKKKGFFFK 
KKIFFFYPPPKKKKKIIRAAEKRGPPLF 
FLKKTRPNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFi'FFFFFFKFFLVREK 
VFNSTISTQ 


5263 


19164 


A 


5302 


389 


2 


AMAQASGSCLCSRLLVDGFIPWGVFRYF 
I VEAGS CYVDQGGLAILflSSYPTTLVSQ 

VPKANVFSLLEKPTGLTQALRITLANVS 
NAPGQTRSGLEGWGCFF 


5264 


19165 


A 


5303 


84 


333 


FHDCCRPERSRILRHFLLSSAGIQSIAD 
AYRLALPQGLLYGPTNFAPI INHVARFA 
AHARHQGTASVRWGGSGSGGNQGLP 


5265 


19166 


A 


5304 


485 


93 


PPLFRRFSPPPPPKFFFSPRAFFFGG,GF 
FSFFSPPKKSFFFKNPPGFFFFPPLKKK 
KI FFLPPLFLAPPKDSFLAPPPFFFFFF 














I I IFIYLFRDRFLFCCLG 


5266 


19167 


A 


5305 


157 


2 


PPPPFFFFFFFFFVWWVITLNVELEPSF 
SPNTESQIGPEEAMERLQENRVE 


5267 


19168 


A 


5306 


1 


133 


TQESIMDLENDKQQLDERLKKKKKRPAV 
LKKPMGGQSLPGPIARE 


5268 


19169 


A 


5307 


239 


2 


PPRNWGFFSPI.SPr,KSSSPPKAFNFWGG 
VGPNFPPPKKRFFSKNPPGGFFYPPLKE 
KNI FSLPPVNLGPPRVFFKSPPP 


5269 


19170 


A 


5308 


454 


116 


FRLPSSSNSPAPASRVAGISGFRCVGRA 
GLQLLTASDPPASASQGAGIADGVSFTQ 
CSMVPRLECSGMISACYNLHLPAACLGP 
PKCHDCSLCMATPSGK 


5270 


19171 


A 


5309 


123 


1 


CSVSKWYPIAFLKNIIMLWEAaaOGSRG 
QEIKTIIjySITVKP 


5271 


19172 


A 


5310 


153 


33 


WPGWAHTCNPPPLGGRSGLITRSGVQP 
QPGQHSESPSI 


5272 


19173 


A 


5311 


350 


3 


NSHNLILTQEHSPEEKNEFLFSLPLQSV 
SMNTTHSPLLSNSLPHFI IS FLPTRFLV 
GPSPTRSFPPAQSPDRISFSSRLECSGT 
ITAHRSliNLPGSSDPLISVSQVAGTTCM 
CHH 


5273 


19174 


A 


5312 


1 


392 


RTRGRTRGICKSITIIHHINRTNEKKNH 
MI ISVDAESAPLQIHSAKLKK 


5274 


19175 


A 


5313 


1 


239 


LKTSFHHVGQAVLEFLASSDLPAIiASQS 
AGITCMSHCAQPYISINFIPAAETQQSS 
PELVPLPASQKTPSFSTPLFSLP 


5275 


19176 


A 


5314 


345 


237 


LYFYIYFLKSACVI ILSTLCVCVCVCVC 
VCVCLCM 


5276 


19177 


A 


5315 


268 


412 


GMISSNNSSDSFGNCLNPGGKGCCEPGS 
YHYPPAWATDTFSKKKKKKL 


5277 


19178 




5316 


112 


224 


PWMLENELGLHASYLAMSTPLS PVEI EC 
ASKKIFTFC 


5278 


19179 


A 


5317 


377 


139 


KPPPRIFYLGPQKKKKLFYPPPFiasrCFF 
T,GPPPPPT,FMIFFFFFFFFFFFFFFFFF 
PQCrayiFLFFLLSASNQNSPSWSP 


5279 


19180 


A 


5318 


216 


385 


RGVLLCCPAWSQTPGLKGSSCLSLPKCW 
DYKHELPRPATSCNSILFFNLQGAFMYH 


5280 


19181 


A 


5319 


376 


3 


RIDFGGPKKKILLPPPPAVKIVSLKGPP 
LFFFFLNSFFAPAGGQWGVFQLISSGDP 
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nucleotide 
sequence 


SEQ ID 
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peptide 
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nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-^Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPPAPESVRIPGVSFCAGVPQGFFFFFL 
NSSVAQAGVQWRDLGSLQPLPPRSKRVS 
CLSHPSSWDYSH 


5281 


19182 


A 


5320 


377 


287 


ELLTSSDLPALASQSAGITSMSHHTQPG 
SF 


5282 


19183 


A 


5321 


381 


65 


SSSPSSRTSGPFFFFSPPKKKKNFQKKK 
NFLYPPFFFPPPFYKNPPKKKLKGPKKK 
KKISPPPPKKNLSLKMPPPFFFFFFFFF 
FFFFF 


5283 


19184 


A 


3322 


132 


396 


EGGFVLKI IREGIGPHFLGLEEEGSKFH 
EHIFWEKHRESFPKQGPIPHFMELGTWG 
LSKNPYFRVKQRVEHIEGFKNFFNEKKD 
FLKE 


5284 


19185 


A 


5323 


366 


1 


GFLKKGVPFFPPGKRGGGNQTPPKPFVL 
GGKQPPPPPPLKKKNPKGGPPQQKKIFF 
FLTRGGVSLFGPGGGFPFFFQPPKKPPP 














CSCRPGRTRG 


5285 


19186 


A 


5324 


131 


1 


DGALSPRLEWCSGTILAHCSLRLPGSSA 
SCVSASRVAGITGVC 


5286 


19187 


A 


5325 


112 


1 


DGILLLLPRLESNGAISAHCNLHLPSTS 
ESTASAFLS 


5287 


19188 


A 


5326 


404 


18 


PVSPHGKNHEKHTVTCVGSTSGAFLHGE 
EHCHGNTHHPFEPSNPQTACQSQANRIR 
HGNS FKNLWRSWAWWRTPWSATQDAEM 
GRPLEPRGLSLAWATQQDPSLKQKQKQK 
KPTHLWSYILGHHQLP 


5288 


19189 


A 


5327 


109 


3 


KCVLTPVIRALWEAETGGSRGQDIETIL 
SDTVKPR 


5289 


19190 


A 


5328 


1 


117 


ETGFCCVAQSGLQLLDSSNLLALVSQSA 
GVTGMSHRSRP 


5290 


19191 


A 


5329 


3 


120 


DAWVAAEAEGQNDTIEEPNKVQKRKRGI 
NDNVPflGQAH 


5291 


19192 


A 


5330 


105 


2 


RSGWAHIQTPSTLGGQGRWITRSGVQD 
QSVQDG 


5292 


19193 


A 


5331 


204 


411 


SKKKQKKKKKKKKKKKKKKKKKQDKKG 


5293 


19194 


A 


5332 


185 


2 


FFFETESHSITQAGVQWRDLSSLQPLPP 
GFKSLPSSWDYRCLPPCPANFCIFSRDR 

VSLCW 


5294 


19195 


A 


5333 


561 


325 


WSTLHS FP IWLLNTGYYFKE LTGFFFFL 
TVALSPRLECSDTI lAHYSLRLLGSHDP 
PTLVFQSTKITSVSHCTQPSVRF 


5295 


19196 


A 


5334 


3 


96 


EGKAAFSQEKSRRVKEENPEIAVSASTI 
PE 


5296 


19197 




5335 


375 


228 


LSPWLECSGTILAHCNLRPLGSSNSPGV 
FGFYFFLTLILGSGVHVQLC 


5297 


19198 


A 


5336 


403 


0 


NISVPASiCHISPFPKKKKKKKKKKKKKK 
KKSQKKKSRG 


5298 


19199 




5337 


397 


SO 


IIITSKKKKKKKKKKKKKKKKK 


5299 


19200 


A 


5338 


406 


34 


KRTLKDSKKKRKEKKKKKRKKKKKKKNR 
EKKKKPKKKKKKKKRGC3AQKKKiraHPAG 
GKAYFFFLGGTKKKRGGGDTKPLGEKNS 
PAQTKKLGGNNPPFVERGKKKKTPGI 


5300 


19201 


A 


5339 


410 


0 


KKQKKKKKKDAPGGGGGGRKKKGGPQKK 
KKGSHLKTPKKEKTGGA 


5301 


19202 


A 


5340 


306 


392 


IQYKILPQKENDWDKIQGQLAIPVSSLN 
F 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=C}'steine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iVI^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Try pto ph an, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5302 


19203 


A 


5341 


3 


173 


TFTATELI IFYIFFETTLI PTLAI I TRG 
GNQPERLNAGTYFLFYTLVGEGSLLfiRS 

R 


5303 


19204 


A 


5342 


247 


426 


GRAWWFTPVITPLWEAEAGVQWRDLS SL 
QPLLPGFKRFSCLRLPNNWDYRHVPPRP 
ANFL 


5304 


19205 


A 


5343 


3 


149 


SWFLHFGEAGLELLTSGDPPALALPKCW 
DYRLVSNSWPEVILQPQPPK 


5305 


19206 




5344 


464 


147 


LRRLRWENRLRPGGGGCSEPRSHRCTLA 
RATTESLSRVQCQRDPRRLAQTSRRPGAS 
SFPFAGPAFSAVNSTYQRALPPAQGAPW 
TPRLPLGPHGTTS ST3S YLPV 


5306 


19207 


A 


5345 


228 




VIRLEETYFLLFFVLFLFLFFRQTRSGS 
FAQAGVQSSHLSLPSSWDHRYTPHIANF 
CIFLQKQSLAMLPRLVSNTK 


3307 


19208 


A 


5346 


2 


106 


FVLELLGSNNSPSSASESAGITGVSHCG 
QPLFN? 


-im 


19209 


A 


5347 


140 


436 


FLCDMRVLYVTFFFFGKGVSFFPPGWRA 
GGEFGLIEHPPPGFWPFFLLNLPKKGEL 
RAPPPAPrFFFFFFEKRGFTELTQEGLN 
FWTWEPPPLTLPEGG 


5309 


19210 


A 


5348 


1 


465 


LESRRFEDGTRVPGSTISWDPPEEAVCP 
FSDLQLRAGRTTTLFKAVRQGHSSLQRI 
LLPFVWQCPAPRGGVYRGHQASLSCSGL 
HPVRASWLLCFPSEAWAMSGTPPPASLP 
PCNLISDCCASNEQGSVGIGPSEPGVEY 
NLLLRHLLKPEEKRS 


5310 


19211 


A 


5349 


183 


58 


EHDYEEGPGAVACaCIPSTLGGEGGWIT 
RSGDRDHPDEIVDP 


5311 


19212 


A 


5350 


288 


404 


lEXNGMILAHCaiFQLPGSSNSPASASPV 
AVITGMNHYAR 


5312 


19213. 


A 


5351 


413 


305 


AEAGLKPLSSTYLRASASQSAGITGVSH 
HTQPLLFF 


5313 


19214 


A 


5352 


91 


356 


TVPKRNTFSPKMVAITTMTVSTSVLAL 
GNRESALPTRLGWALCCFIFWGGRGHRV 
PLCHPGWSTVARSRLTATSTSQVQAILL 
PQPP 


5314 


19215 


A 


5353 


2 


410 


FVQWQFLKTGSDRSHKRVESLGYEKVEK 
AMTVNYSFRSLNTKGKLELLGKISVNLY 
AKWKHFREKGRRYTNTLGPTATAAFYST 
EKTEFLPWEQTCKPFIDSFNKYLLPGA 
ADHAOfPSTLGGRGGKITRSGDRD 


5315 


19216 


A 


5354 


3 


288 


SLDDDLLKLLLPLMLQYSDEFVQSAYLS 
RRLAYFCAERLSLLLSDSPNLLAAHSPH 
MMIGPNNSS IGAPSPGPPGPGMSPVQLA 

FSDFLSCAQH 


5316 


19217 


A 


5355 


187 


3 


KKILWQTPGVNLFSFKGAPLFFFFLRRV 
LLCCPGWSAVARSPLTATSASRVQAILL 
PQPPK 


5317 


19218 




5356 


383 


261 


GYKINGQKSQAFLYTNNRQTESRLWYCY 
DSPQRLIHLSIK 


5318 


19219 


a 


5357 


367 


241 


FFFRHVAQAGLQLLGSNNPPASPSQSVR 
VTGVSHCTQPLCFV 


5319 


19220 


A 


5358 


217 


1 


SCDRENTGLKLLSILLLIHCETDTRISL 
LCISLSIKHEIWLGAVARACNPSTLGGR 
GGRITRSEDRDPGETP 


5320 


19221 


A 


5359 


255 


412 


TQLCRKRHAFFFLATNHSQKGQSRWGTV 
AHACNPSTJjGGEGGRI trsgdgdh 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 

location 

correspon 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5321 


19222 




5360 


391 


271 


RS OUAQAGLKLLAS SNPPASAS QNAE I 
TGVSHHLTPICC 


5322 


19223 


A 


5361 


232 


397 


EGLKEPAGLFTMEYYTARKRNKIMPFW 
TWMQLEAIILGESTQEQKTKFCMFSPV 


5323 


19224 


A 


5362 


213 


392 


LFIKVEGKFFLRSINILNESLLFLSSNG 
PGLLKKLGVWPGWAHACNPSTLRGQGG 

RTA 


5324 


19225 




5363 


306 


418 


S DG I TGTLP LASQE S AWEDLL YVLVGV 
DGRYVSAQP 


5325 


19226 




5364 


347 


48 


KKRWL KPGGGKNP P KKGGKKGGT P P FFQ 
KKGFFFPPFFFFLEKRDLSPQAGGKGEK 
KKPPQPWKAEGKKNPWGNPPQRGGYKGN 
PTRPPPQGFFFFFFL 


5326 


19227 


A 


5365 


412 


172 


PDSNLSFGAHYQASLLGSFSLMPLLLYS 
KCSGQWDVAESAECTSKSSQNKERKKER 
KKERKKERKKERKKTRKQKKNPLT 


5327 


19228 


A 


5366 


400 


296 


LVEGGFHHVGRAGLELPTSSDPPTSASQ 
NLGTiQA 


5328 


19229 


A 


5367 


162 


1 


APSSLDETPTLLKKTLFGPGLVAGICNP 
STLRGRGGRITRSGVRDQPDQHGETP 


5329 


19230 


A 


5368 


152 


403 


TTVLCKLVQPLWKTVWRFLKELEVDDPF 
NPAVPLLGIYPTEKKSYEKGTCTHVFIA 

AQFQIARIWNQSKCPPINEQKKKKGAPV 


5330 


19231 




5369 


170 


367 


IVFEILVrSYFKSFPQKTTiiV/NTB'FB'HP 
FKVFMWPSTyaHACWPSTLGGRGRQITR 
SGDRDHPAR 


5331 


19232 




5370 


396 


1 


QMCVRAHTFAGGRARTHTHTHTHTHTHS 
WATPRVSHNHCVCVCVCVCVC 


5332 


19233 


A 


5371 


139 


3 


PKLECGASI lAHCSLELLGSKDPPPSAF 
QVAETTGCAPPCLAFFF 


5333 


19234 


A 


5372 


1 


341 


PLINILSHL ILLCHNYFSFVKFCrRHTS 
LLYWISLFHVLLCWSFLTDREAHNAS 
VTYGFVLPVTFKWSPLPGAViyE£ACWPST 
LGGRGGQrMRSGVQDQPGQAESSRPALP 

T 


5334 


19235 


A 


5373 


280 


449 


ICHKLQLCPSKDTIEKVKRQPIEWEKIF 
ANHISHKGLSS 


5335 


19236 


A 


5374 


170 


1 


QKRGFSMGTPMVSIFRPQDLTPLGFPKA 
WDPRVNPWGPWPCPLFFKGNPQLFQAPG 


5336 


19237 


A 


5375 


462 


155 


PPTRFLHWQTGLELPTSGDLPASASQG 
AGITGVSHHAQPRGHVFDVLLEYSQQGV 
FTLLSFLGFSVFFEPLFC 


5337 


19238 


A 


5376 


3 


451 


PRAKLGTRRGLRNLDTIGVAVDLILLFR 
ELRVEQESLLTAFPCGLQFLEITIASRHE 
DSQSIGWVHAFPPPVWPCLSLPPPHTGS 
SSSITLPPLLPHRPHPPPHSHPPMATLS 
RPpyPPPPHPPPHPLPPSLSPPALSPPP 
PPPPPPPSL 


5338 


19239 


A 


5377 


460 


300 


KVLLCCPRLECRGAIMAHHSLELLGSSN 
P P TS AS QS AGVTGVSHCRS P WE FL 


5339 


19240 


A 


5378 


452 


191 


DSRFHHWQADLEHPTSSDLHASASQSA 
VITGVSHRTRPDIAFAPSISSYFSCVAE 
ISLLHLSESFPKQFL 


5340 


19241 


A 


5379 


240 


395 


LATS S VAITLRNTF YLKLKSSVLGVEAHS 
CNPSTLGGQGVRITKSGVRDQPGQ 


5341 


19242 


A 


5380 


210 


411 


TSVNSAPNSGQKASVEECFGQVTKVKIQ 
NVFLGWAQWLTP V 1 PALWE AGAGGS RDQ 
EIDTILAHTVR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Prcdict- 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=:Lysine, ]j=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5342 


19243 


A 


5381 


406 


300 


FHYVGQPGLE LLTLGS TCLGLS KCWD YR 
REPPHQA 


5343 


19244 


A 


5382 


364 


479 


INDGISKISLFLSHLKDQAGQVWWLTPV 
IPALWEAEAG 


5344 


19245 


A 


5383 


182 


395 


HHLRGSAASFLKSAQEPTDFRHTGFKRF 
SCLSLPSSWDYRCPPPRRGGGVLCLPGW 
RTWQSWLTTTS ISQ 


5345 


19246 


A 


5384 


37 


478 


GGIHSYPVICLFILMGSSVSLPFSSVAC 
LLAWLLPSFLPSLLPSFLPSFLPSLTGR 
AFL 


5346 


19247 


A 


5385 


201 


458 


WNPP FT .SPRKWAAB ADTEWGVGKTAQE P 
GLSSRLQAYEERQRHWQREREALREDCA 
AYAQCAQRAQQLLQLQVFQLQQEKRQLQ 
DD 


5347 


19248 


A 


5386 


206 


407 


FFSPGGRAGEKFGFMEPPPPGFKKTLWL 
NPPGKWKQRAPPPIPVYFLEFKldGGWP 
NMVQVGLNPRP 


5348 


19249 


A 


5387 


468 


0 


NFFPPRPAPPFFFFFFFVAQAGVQWRVL 

GSLQAPPPGFTPFSCLSLPSSWDYREPA 

RVSPHC 


5349 


19250 


A 


5388 136 




LFFFFLFETGrCSYPHLECSGTIIAHCS 
LKLPGSCNSPTSVSQV 


5350- 


19251 


A 


5389 


397 


0 


ILYPSQKKKKKKKKKKKKKKKKKKSQ 


5351 


19252 


A 


5390 


299 


3 


AQPLAPFPGSFILMYFFMLLLFGIPLLY 
MEVIMGQWLUVDNIHVMKQLVPWLCSMS 
YAHSLVNEGPSPKPSPISLCGLQVSAHR 
VHPQVCflSVSLYNS 


5332 


19253 


A 


5391 


264 


202 


FFFFFFFFFFFFFFFFSENKI 


5353 


19254 


A 


5392 


2 


150 


PRWFLIETEFCHVGQAGLKLLTSVDPP 
TSASQSAGITGVCHLVRPEKS 


5354 


19255 


A 


5393 


382 


0 


NSRAQAWULMPVISPHWEAEVOGSLEPT 
S 


5355 


19256 


A 


5394 


395 


239 


SLPSSWDYRHAPQCWANFIFLVETGFSS 
VILCSFRLGGFGLSLNILCSLKSL 


5356 


19257 


A 


5395 


24 


265 


LLPHRKSRNYRTRPTRSLRQKNGMNLGG 
GACSESRSHHCTPVWATERDSVSKKKLF 
T lELVYVFLKICNSKGEPLPKVLGL 


5357 


19258 


A 


5396 


174 


118 


FFFFFFFFFFFFFFFFRSS 


5358 


19259 


A 


5397 


406 


134 


PSAPSSPPPPPLIWGPPGFFPPPLFKNP 
PPKFFFGAPKlOCrYLPPPRALNFVLriKG 
PPPFFFFFFFFFSFF 


5359 


19260 


A 


5398 


294 


64 


IFFFFFFFFFFFFFFFFWELHILWGALF 
LLKSRI PPGKKKWTCKIFLCTPLTGY IP 
LLKCFQKQNKKSQSQNTELKS 


5360 


19261 


A 


5399 


1 


211 


LLLPTPYHHINQDSSNKYHETKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKRGEKKKK 
KKKKGGGGGYKKKR 


5361 


19262 


A 


5400 


305 


1 


SPPRPPFLLPFLFFLPSFFFKAPHTIWH 
FFSPPPLFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFLKLYT.FYKMHRINyTSWI 
LKINIFVFSSPGSGGRR 


5362 


19263 


A 


5401 


416 


3 


ENTERDRGRQRREDRSRAETQETNRAGQ 
QESTPPPQKKCNTYSIPSSSSSSPTPQK 
GKKGGPPGGPP P PAGEKGGP KKKKRGP Q 
KKKIRPPPPGKKKPPPKTPPLFFFFFFF 
FFFFFHKPQTFIVGVNTYILLRSVS 


5363 


19264 


A 


5402 


154 


416 
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SEQID 
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nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
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od 


SEQID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=C3'steine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKDSGGGAF 


5364 


19265 


A 


5403 


327 


2 


PPPQKFFLGGVFFFPPFFYNPPPPFFFF 
SPKKKKYFSPPRGKIFFFFFPPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
LRWSLALVAQAGVWWLNHGSLQSL 


5365 


19266 


A 


5404 


372 


181 


YKGLAHTRCS lENFLNFILI lEMGSRFV 
AQAGLEPLGSSDPPTSASQSDGITGVSR 
RTRPINIC 


5366 


19267 


A 


5405 


110 


~ 


PVPVSSTEVKKLVLWPGTWHACNPKTS 
GGQGGRIT 


5367 


19268 


A 


5406 


2 


401 


GPHLTTYGNRKQMQRHVREASARPLQGT 
PPPGHALEWTLENVLLAAPRPRKAQVLF 
AIVASETSSWDREKLWTLSLEAKCKGIT 
LFVLALGPGVGTHELAELAELVSAPSEQ 
HLLRLQGVSKPEVNYAQGFTR 


5368 


19269 


A 


5407 


3 


181 


QLILFHDKLSPKLLSWGNVDILINFFLS 
HCSPERARKLQAKMMTNLVMAKDRLQLL 
GIN 


5369 


19270 


A 


5408 


371 


^2 


SVEMGFHHVAQPSLEPIiGSSDPPASASQ 
SAE ITGVCHCARPNHDSYRRS 


5370 


19271 


A 


5409 


204 


416 


NQVGEWVnRKGTLGnLIMSMLFHFLWR 
SKDEKNTKIWPGAVAHTCOTSTLGGRGG 
QITRSGVRDQPGRHG 


5371 


19272 


A 


5410 


236 


308 


AfikPGTVTHTfiNPSTLGdQdGRiT 


5372 


19273 


A 


5411 


375 


216 


RKEKKKKLLTTENSKKftLKRNGYL 


5373 


19274 


A 


5412 


1 


381 


RFHHVAQflGLNLLGSNDLSASASQSAGI 
TGMSPRTQP 


5374 


19275 


A 


5413 


144 


3 


RPPPIFFFLAKKLRALFFFFHSTTnOAGV 
QWRNLGSLQPLPPGFKQF 


5375 


19276 


A 


5414 


412 


32 


LFSKKKKKKKKKKKKKKKKKKKK 


5376 


19277 


A 


5415 


414 


2 


NNNSFVFFSKPIFLKTVGGRPPPSPira 
PIPIFDYIYPPHKWGGVYIPTHMyGGGD 
IWGYMYTPPPPPLFIYFFPPPPFYIEKK 

KKTVFCFFAQGGVRLCLKKKKNKDRGWG 
DTMTMGHGGRLEMVANVCGLKRYW 


5377 


19278 


A 


5416 




240 


GGEGCSEPRSCHCTPAWATERDSISKKK 
KKKGGPFKRTHFSSPGLPSFFFFFGAPK 
FNSRARFLTPREGKNPGLPPFYPA 


5378 


19279 


A 


5417 


2 

322 


63 
1 


LCLKNKWHISnsnSnsnmNNNNKD 


5379 


19280 


A 


5418 


WPLCFSRCSKKRGSPLGFPFSPPLGQSK 
KFVRFACKTPSAFKPFPSLSFFQRWAGP 
PPFLQGLGELPKGLFFQQCFFFFFFLRW 
SLTLVAQAGVQWHDPGSLQPPPL 


5380 


19281 


A 


5419 


1 


392 


FRMAGSYFEGAPAILADKRQQFGSRFLS 
DPARVFHHNAWDWVEMSEEQAAAAERKV 
QENS I QRVCQE KQ VDYE INAHKYWND FY 
KIHENGFFKDRHWLFTEFPELAPSQNQN 
HLBCDWFLENKSEAPSQKK 


5381 


19282 


A 


5420 


273 


91 


RWYIVLDNLIFILENICCETSLKIYLGP 
GWAEACNLSTLGGQGGQITRSGVRDQQ 
GQHGE 


5382 


19283 


A 


5421 


1 


149 


GGGGCSEPRLRRCTPAWATERDSVSKKK 
NLKKFFCPGLGLYQKQWTWTL ■ 


5383 


19284 


A ■ 


5422 


381 


179 


SSCLGLPKSWDYRCEPLCPARTALNLKT 
IKKSKDNRRWRGCGEKEMLMHCWWECDM 
VMLCVPTQISC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
Dd 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

Ing to first 
amino acid 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5384 


19285 


A 


5423 


195 


369 


CSLSPIKECKLHISGLQDSSISLVNLGL 
AtRNlRNKPGWAHSCnSTPSTLGGQAGWI 
T 


5385 


19286 


A 


3424 


253 


414 


KKKKKKKKKKKKKKKKKXKKKKXKXKKK 
KKKKKXXRGGAAKKKPHSPAPAERDL 


5386 


19287 


A 


5425 


160 


2 


WLKGSAPFSKKTSPPFLVKENPLFFFFF 
EMESLSVAQAGVQWRDLGSLQPLP 


5387 


19288 


A 


5426 


53 


426 


ASSSFFPFFFLGKGGSIFPPKGGGEGKN 
FFLWKPGAPGGGGPPPPTPRGGGKKGPP 
PPGQKIFLGMFFEKKRlf'HHGGGGGlfKLR 
GKKKPPPLAPKRVKNKGGNPGGGQPLKF 
KKTLPEKMGLKK 


5388 


19289 


A 


5427 


60 


1 


KWPGTVAHACNPSTLGGRGG 


5389 


19290 


A 


5428 


87 


247 


KKKNPQKKKKKKKKKKKKKKKKKNKKKA 
PAGAGDLSLYYPRAGRNNLFFNTLP 


^390 


19291 


A 


5429 


209 


405 


KLNLV15FSSLSSLKAMISAILLKLSVPI 
FPMKIVILSTLNSGSWPGAVAHACDPSA 
LGGRGGRIT 


5391 


19292 


A 


5430 


324 


237 


VMNAFVCVCVCVCSCICVCVCVCVYFLF 


5392 


19293 


A 


5431 


409 


3 


FFFFFSETESRSVTQAGVQWRDLGS 


5393 


19294 


A 


5432 


270 


402 


MKI IPHLIRITGWAQMLTPVTPALWElAE 
TGGSRGQEFETTVKPC 


5394 


19295 


A 


5433 


107 


277 


YNFNNHHYREaiQKWDEftLQLrPNDATIi 
YEMKSQVI IMKNFTFLMKLCYVCSVSEQ 


5395 


19296 


A 


5434 


95 


1 


RNRKNNGQSGTVAHACNPSTLGGRDGWI 


5396 


19297 


A 


5435 


412 


105 


GGEGCSELRSCHCPPAWATRAELHLKIK 
KKERWDLAMLLRVGYSGHRQVHCTDHGT 
LQPQPF 


5397 


19298 


A 


5436 


276 


488 


PLPQPNKDSRLLPLLVCLGFLFAPLLML 

GHGPKEARLPIFFPQEAYFITFRLVFAE 
KKGYRGPLTMCT.APT? 


5398 


19299 


A 


5437 


179 


2 


G I SPLKGGT PSLKKKKIfAKDFWRRVS KK 
DIQFTNKHMKRCITLLIIKQMQIKITMR 
YLP 


5399 


19300 


A 


5438 


3 


156 


TPLYSQI LI SLRQEHYFKAEGRGCNELT 
SYHCTPAWAKDGDSVSKNKQNT 


5400 


19301 


A 


5439 


47 


173 


SLQLFLQTEERKDSDDEKSDRNRPWWEK 
RFVSAMPKGNFIKY 


5401 


19302 


A 


5440 




122 


TTLTLTS LI PPI LTTLVNPWKKNPPKL I 
RILVLFQTTEK 


5402 


19303 


A 


5441 


390 


489 


KYKSYSLQPPP PKFKHFSCVT LP KHWD Y 
RREPP 


5403 


19304 


A 


5442 


86 


275 


RQEFETSQQGETPOiLKIQKLAGRAHLN 
PGGRGCSEPRLCaDCTAAWVTRAGLSQIK 

KIKKKKK 


5404 


19305 


A 


5443 


424 


166 


CFFVSPIOUCSQGWLGSPPPPPPGLKHF 
SAPTPRGLFLFFFFPRGVPPLGKGGSKS 
PPPGVFPPWPPQKGGDSGGYPPPRAKIF 
LF 


5405 


19306 


A 


5444 


356 


3 


AFFWFTNPFPPGFKGFFCPHPLSGISKG 
GPPPRGNFFFFGKAHFFSPFFKFLPPIF 
LPFLAPQKVGFPGFGPPPKPSQGVFLFF 
FFFFFSFLFFFFFETESSSVAQAGVQWR 
DPDAW 


5406 


19307 


A 


5445 


198 


86 


P WPLGAQAI LL PQS S CLKL PKCWDYRRE 
PPHTALLNS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 

correspond 

rmim)a'cid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cj'Steine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5407 


19308 


A 


5446 


198 


86 


PWPLGAQAILLPQS SCLNLPKCWDYRRE 
PPHTALLNS 


5408 


19309 


A 


5447 


143 


3 


TMIYFYFYFFLSQSLALLPSLEGNGTIS 
AHCNLRLPGSSDSSADAW 


5409 


19310 


A 


5448 


324 


1 


KKNSATSFFSVEPFSTGGVLANKGALTQ 
EKILPCFKIKKIRKRVGAVAPARNHFLF 
FSPFFKEGLQGIVYFFFFFFELESRSVT 
QAGVQWHNLCS PQPLPPGFKRTRG 


5410 


19311 


A 


5449 


403 


0 


LFFFFFFFFFFFFPKENH 


5411 


19312 


A 


5450 


114 


409 


LNIWTL.KirLISJ<HLLIVCI.yi)TII.SSLC 
LIKYlGLQGKVLIFFFKQFSAIDKIiGQN 
I AWGKFGFAHYS LLTKKWKLFGNI TQE 
QNMIVTGGLSWWND 


5412 


19313 


A 


5451 


192 


406 


RNFFLGHLKYVKRFEQIGWGRDFMFKTP 
KaRATKNKMDKGDLIKliKSFCPTUCETTF 
RGTGHPTKGEKI FAT 


5413 


19314 


A 


5452 


254 


48 


HAFGWLCVayALQFIDKSVKlSrSVCWLeA 
VAHACaiPSTLGGQGGWITRSGVRHQPGQ 
YGETLRTRASTR 


5414 


19313 


A 


5453 




288 


PLLFlIiIPSPFSFPHPFCGPIKFILPRE 














KKKKKKKKKKKKKKKKKKGGGPFKKKKl 
FPRGGKKKIFFFWAQKKKIRGGVLKTGG 
RKKPGGKKHKSG 


5415 


19316 


A 


5454 


285 


415 


TSGCCFVFLVLKtSl FCKflKVLNAlEDN 
GIiKNSTFTYFTSDHG 


5416 


19317 


A 


5455 


372 


125 


ITPAFSFSPMGKKGVFPPPKFFGFPPGF 
PPPPFFKPPPGIIFLGPKKiCKFFSPPPR 


5417 


19318 


A 


5456 


3 


159 


AHASflHAYDTIKDALGLLLFILSLMTLT 
LFSPDLLGDQKKKKKKKKKRGGP 


5418 


19319 


A 


5457 


109 


392 


NIIUKRSTSLFLRPIFPPHLFPPILYL 
FISLRTLLFFFFFKRBFCFLPQVGGQGP 
FLNSLKPLPRGLMQFSCLTLLRSGNYGP 
PPPPPINFFF 


5419 


19320 


A 


5458 


243 


402 


FLKIGFIHKYLKEFWGILNIVFNRFNS 
HERAPHAIVFVNGTTIEGHVEKCYW 


5420 


19321 


A 


5459 


394 


105 


IPPPSPRPGGPGAAPPPGGGGSTTPPPK 
KKPPFFPKNKNQPGGGGGAPPSPPPRGL 
GPKIPLTPGTKVFKETKPPPFPPPRGAP 
PKFLSKKKKKET 


5421 


19322 


A 


5460 


56 


379 


VCFSPSIYGAWLNKSPLRSHGSSSTSG 
WGGLPSPFPAQRLPLASQVQEHWGHVFA 
SYQVS IPQSCEQCLSYIWLMDKALLCSG 
EWLPHQAPKPSMSPTTPHCSRGLQ 


5422 


19323 


A 


5461 


184 


420 


FFFFFLNNNLFFFFFFFFFFFFFFFFLG 
RN 


5423 


19324 


A 


5462 


295 


384 


LGAVAQARWPITLGGRGGWITRSGVQDQ 


5424 


19325 


A 


5463 


413 


257 


FLNPGGRGCSEPKSHKSTLAWVTERDLI 
SKKKKKEKCWIPQRMPERGQWSPD 


5425 


19326 


A 


5464 


2 


181 


SLYCSGWSQTPGLKQSSCLSFPKCWDYR 
CELPRPVYVLLYKTSFPKRYVFGIHLCF 
MSQ 


5426 


19327 


A 


5465 


1 


229 


TGSCYVAQAULELLSSSDASTSVSQNAG 
TTGVSHHVQP YTS SFSQVPPLRAQSAAT 
E TTS I LWVQKLRLQ WRCSV 


5427 


19328 


A 


5466 




368 


KEHETKKEREEERRMRRRREGRKEKGRK 
ERKKEREKERKKREEKKRNWLGQSAGIT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














GMSHHTRPKSHWVLSKFTNLCWAAFKAI 
LGHMWGRLDKtNVEQCFSTELSALFS 1 1 
ATSHMRLOTI 


542S 


19329 


A 


5467 


183 


3 


VSLCYGEKFSPQMNESQLYFRPGAVAHA 
CNPSTLGGRGRQITRSGDRWHPGQHGET 
GHV 


5429 


19330 


A 


5468 


1 


128 


GGAGYSELRLCHCTRAWATRAKLHLKKK 
KKKGPLRALKKIGG 


3430 


19331 


A 


5469 




155 


NKFPVLISNVINIIGMCHCTPAWATEHD 
SVSKRKKKEKKRSKIiaRHSVLCL 


5431 


19332 


A 


5470 


113 




FFFFFFQKTSSRLGTVAHACNPSTLGGR 
GGRITRSGD 


5432 


19333 


A 


5471 


407 


281 


NGMI PGGRACSELRSRHCTPAWATEQDS 
ISKKQNKSKNPGLK 


5433 


19334 


A 


5472 




177 


GGGCSEPRSRHCPPAWATERDSYmKMK 
QSKTKGERMEVACYCI IILFMLTDftLEV 
LMA 


5434 


19335 


A 


5473 


76 


267 


KBRREEKBKEKRKRSERKGRKKEGRRDR 
EKEKEERKDERREGEEGEEGRKEKKEKD 
IKRRGWP 


5435 


19336 


A 


5474 


133 


2 


LCRMWLFCFEAGSHSVTQAGVQWCGLN 
SLQPLPPGFKQFSCL 


5436 


19337 


A 


5475 


5 


394 


YGVGTKGFRAGVKEFQALFLLITVHSQL 
CHIEGTRHHSRHIIWVDFINQIENGTF 
FPLVLSNGCIWLLIVYLFWNIFTRQEGQ 
FFYGKLSEIYS VFVFCFLRRS FALAQAGV 
RWCDL6SLQPLPP6FKRF 


5437 


19338 


A 


5476 


163 


2 


SE KPWWALTLMTFSHFYFF I YFYYI F I 
FQMESCSITQAGVQWRDEWSLQPLP 


5438 


19339 


A 


5477 


111 


2 


FLFLKQDLPGAVAHACNPSTLGGRGRQI 
TRSGDRDH 


5439 


19340 


A 


5478 


232 


0 


HLVWRMESGSVAHAAVQWCDLSSLQPPP 
PRFKPFPSS 


5440 


19341 


A- 


5479 


388 


225 


ALFSSKKKKKRKKERKRKKEKKKKKESM 
QRKGGHLPR VCCGASMVS PGHKLS L 


5441 


19342 


A 


5480 


150 


2 


HRVAVIYEVFFVCLFFYFLFLRQSLTLV 
TQPGVQWRNLSKLQPPHPGFK 


5442 


19343 


A 


5481 


156 


395 


DSVCTCVLHTCHSSGLSQGRSCLHCSLG 
LPNSNCPCVCVCVCVCVCVCVCVCVS FC 
PLHQGVCVCVCVCVCVCVFLSAHFTKGC 
VCVCVCVCVCVCFSPPI 


5443 


19344 


A 


5482 


391 


115 


RACPDCRVRFPHTS PPCLPCGPEAEPGP 
GPALRELVQPLPGQLQPPFGMPPSGLGA 
APS PAPPACAWTRPPHLHPS SFSSSVPQ 
ISSLFLCF 


5444 


19345 


A 


5483 


392 


2 


FFFFFSETQFRSCCPRLECNGAISVHCN 
LRLPGSMNSP 


5445 


19346 


A 


5484 


381 


282 


PVPWGQAIFLPPPPPYLGIPGPPPPPGL 
FLFP 


5446 


19347 


A 


5485 


407 


3 


FFFFFFLRRSLTVAQAGVEWCDLGSLQA 
PPPGFT 


5447 


19348 


A 


5486 


2 


417 


QENHLNPGGVGCSEPRSCHCTPAWVTEY 
DKKKKK 


5448 


19349 


A 


5487 


3 


413 


VYLHPSLQPSLSSLVPDQALSHDIWLKK 
KKKKKKKKKKKRGGISPPSPVGGVGPRE 
TWGVWGPGAISILFQRGGGVRELFSQKKKG 
EKGQRNPEKKPPHKRGPGGPKEKFIFFE 
KKKKKGGKPFGKKTKPPPPRVUPGG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
add 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, Lf=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


5449 


19350 


A 


5488 


229 


421 


IHGPGFKNGGRGKTLGLPNLMALKKIPF 
SPGGYKPKNP 


5450 


19351 


A ' 


5489 


190 


50 


YHHHHH CHHHHHHQ CHHRHHHDHH YHGD 
MDYS SHHSRSTCHVAVTV 


5451 


19352 


A 


5490 


615 


402 


RWCLTLSPRLECSGMISAHYNLHLPGSK 
QLSCLSLPSSWDYRCPPPGKTTFSRGIR 
DALKPRKPKEINLI 


5452 


19353 


A 


5491 


338 


53 


KGPPFFFFCKKGGLFCSNWFLTPGGKWF 
SPPTPPRSGGKRVPPEGGGKGFSLFFFV 
EGGS CHIAQAGLXLLGSSDLVTSASQSA 
GIMGKSPHAWP 


5453 


19354 


A 


5492 


266 


412 


NPNFFERKSCSVSQAGVHRRELNSLKAA 
PSGYTPASCVSLPSSWDYRRP 


5454 


19355 


A 


5493 


285 


412 


RDGVSALSPGLECGGMIIAHCNLELLGS 
SDPSALASQSTWNY 


5455 


19356 


A 


5494 


1 


1350 


MGFMTGICIEDNWGPSPESGSVLTVGG 
VRIQMLDRCHTAHCPVIPGLLRGQLWTE 
NLAERNSHDRRPFPTWCSHFAQDLWPE 
QSIItDSYQKVILRKFEKCGHGMLHFKKG 
CESVDECKLHKRGYNGLNQCLTTTQSKI 
FQCGKYVKVFHQFSNSKEHKRRHTEKKP 
LKYIEGDKAFWQSSTHTTHKKIDTGEKP 
YKCEECGKAFWRSSHLTTHKITHTREKP 
YKCEECGKVFKYFSSFTTHKKIHSGEKP 
YICEECGKAFMYPYTLTTHKIIHTGEQP 
YKCKECDKAFlvlHPATLS SHKKI HTGE KP 
YTCDKCGKAFISSSILSKHEKIHTGEKP 
YKCEECGKAFTRSSHLTMHKIIHTGEKP 
YKCEECGKAFTWSAGLHKHRRTHTGEKP 
YKCEECGKAYTTSSNLTEHKTTHTGEKP 
YKCKECGKAFNWSSDLNKHKRIHIGQKP 


5456 


19357 


A 


5495 


4 


146 


PPTRPTCYYLSFYVSYPTGYKEDTRMVIT 
KHMKRCSTSLVYLT.WINFY 


5457 


19358 


A 


5496 


5 


388 


CQCFFFSPPPKKKVFPPPFFFFAPGFFP 
PPLFFTPPPQFFFWAPKKKFJJSPPPPGK 
KIFFLKGPPPP 


5458 


19359 




5497 


231 


163 


FFFFFFFFFFFFFFFFXXTNKWF 


5459 


19360 


A 


5498 


163 


2 


SVKFTLGKVSQFLKKHTPAGWAHTVIPA 
LWEAEAGGSRGQEIETILTNTVKPH 


5460 


19361 


A 


5499 


229 


388 


RTHLCJASSDLFCLYSLKTGS CCVAYAGV 
QRHDLSSLQPPPPRFKRSSCLRPPE 


3461 


19362 


A 


5500 


405 


289 


HVAQAQLKLLASSDPPTPASQSAGVTSV 
SHCTHPRCLFV 


5462 


19363 


A 


5501 


286 


375 


GDFVRFSFWPGGGAHSaSPSTLGGRGGQ 
VT 


5463 


19364 


A 


5502 


20 


374 


ILQFQMFWIVLFKNFYHPGKFLIYILWW 
FSGLFMYYFSELSCFSLSSKINILNLLS 
GILNVFFSISGVRKKIFLIKRYCWLGTV 
THACEPSTLGGQGRQITRSGDRDHPGQR 
WKIQN 


5464 


19365 


A 


5503 


1 


185 


FHIIIGQAGLILLTSSNLPASASQKMIT 
GMSHCTQPIIIFLYSLFLQLSFSWQDFV 

MRKKF 


5465 


19366 


A 


5504 


202 


402 


NSFSSTAKLREKYRQFLYTPIPTGTQPL 
PLLTFCTRPDTVAHACKHSTLGGQGGQI 
TRSGACNQPGQ 


5466 


19367 


A 


5505 


118 


7 


I KKENNWPGAVAHACNPS I LGGRDGH I T 
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nucleotide 
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correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 

dingto^" 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=-Cysteine, D=-Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RSGDRDHPG 


5467 


19368 


A 


5506 


205 


422 


ANHFTSIRMAIHTONKQTENHKCWQGQG 
EVGTLVHCWWECKMVWMLRQ I VWS FLKK 
LNIQLSRDPATLLLAI 


5468 


19369 


A 


5507 


3 


425 


RSCHCTPAWATERDSVSQKKKKKKKKK 


5469 


1937Q 


A 


5508 


3 


86 


EKKLQVLLIICMTE I YYQFKKDKAERSK 


5470 


19371 


A 


5509 


395 


108 


GVLE CNDT I TAHCS LDLVGS S DPPTS AS 
QVTGTTGTCHHAQLGLPKCWDDRHEPRL 
PAWNKTLRRECNWHSLLPWPAIQELEVG 


5471 


19372 


A 


5510 


421 


59 


FFFFFFLRRSLAVSQAGVQWWDLGSIiQA 
LQA 


5472 


19373 


A 


5511 


168 


3 


EICKMCFPVPFGVFFLWIVFVFVFVFVL 
RWRMESLSVSQAGVQWCNLSSLQPPP 


5473 


19374 


A 


5512 


446 


149 


FFFFFFFCFFETES CSVftRLESSGMISA 
HSSHNLCI 


5474 


19375 


A 


5513 


74 


398 


DRI IDAGPKGNYARFMNHCCQPNCETQK 
WSVNGDTRVGLFALSDIKAGKNHFRILQ 
LTSEFQGFCFLQTASSDHSTFKHFyMYRS 
LELSSFTFLCKHHHHPSPELFSSS 


5475 


19376 


A 


5514 


73 


335 


EGGSLFCPGWSAGAIiFWLTAPSASRGWG 
GAPGVPWGEVGQKNCVSINPGGGGCNELG 
LHHCI PPWGTGKTLS PKKKKKKGATDMG 
GAS 


5476 


19377 




5515 


416 


0 


WLGPGAVAHTCNPTTLGGRGRWITRSRD 


5477 


19378 


A 


5516 


242 


423 


AGSPLIHQGGQAI ILTQNPTPGWGTMVF 
QPVLRPVKGLYNMHVTSSPVASQPIIF 


5478 


19379 


A 


5517 


2 


151 


GRVGFVGMGSHCVAQAGLELLGSGGPPT 
SASQSAIIAGVSHRAWPYFPS 


5479 


19380 


A 


5518 


312 


405 


GQVRPGAVAHACSPNTLGGRGRITRSGV 
PDQ 


5480 


19381 


A 


5519 


3 


387 


ACSGPRSCHCTPAWATHQDSVSKKKRKK 
KREKPFLLEKKRGPTLTWGE PKKG I PLG 
ERNGPPSHFKKGPPLAPKPPGGKLHPEI 
AQGPGSPTDGNKKGDLKPTPVFYKFKTP 
EQAIHWAAPPFPSEK 


5481 


19382 


A 


5520 


292 


140 


KKGPPRARPGGGGSPPLFFFFFFFFFFF 
FFFFFWESTLKKRELHRSDLCV 


5482 


19383 


A 


5521 


1 


399 


KKWLFLS SAQI PTKLSRRRRNRETWPKE 
RSKIKLHKP ILKKQRHELPDKEFKI AL I 
KMLYELKVTMHEQNENINKETENTKKKK 
KKKENPPLFGTGKNQSPPEKFQMGGQPP 
PPPPQKKKPPPPNRFFVFSPP 


5483 


19384 


A 


5523 


316 


399 


MLLYWPGTVAYACNPSTLGGRGGRITRL 


5484' 


19385 


A 


5524 


400 


0 




5485 


19386 


A 


5525 


3 


429 


TTNIGRAMGATTS IVGSDTSQAERPGGT 
TWSPGASSTSQSSRPGTSVTPDSSASBf 
SETVTTKEFSGTTAISRTSHTGTPAASG 
GQATGSLTATTGVAPGTTVAPGSSNTEA 
TTSVGEKKKDQKKKKASSP 


5486 


19387 


A 


5526 


135 


3 


GGKICTL I KQSVI QESWPGTVAHACNTS 
TLGGRGGWIMKSGDR 


5487 


19388 


A 


5527 


105 


5 


KKKKKKKSGRWPGAVADACNPNILGGQG 
GQITI 


5488 


19389 


A 


5528 


411 


31 


GQRVPLSEGVAKKKKFi:,WNPGGGAGKNG 
GKNFLKKKPPGAPPPSGGPKKGPGPPPP 
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ing to first 
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peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, lVI=]VIetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














KKPL I LVS P P PRKGGAHNMHFRQKKS GN 
KGGEGSTKGGKEKI FLGKFVQKHFFFFF 
DPEFHSCCLGWSAMP 


5489 


19390 


A 


5529 


411 


142 


PRFSFLFSPPRKKEVFPPPKILGGPRFF 
PPPRFLKPPPHFFFWGPKKKKKFPPPPG 
KKFFFFKGAPPPFFFFFFFFFFFFFFF 


5490 


19391 


A 


5530 


174 


401 


QVQ PQQKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKAS PGGGGKKKKK 
ERGGGGERKNI I WVP LLFP P 


5491 


19392 


A 


5531 


410 


0 


QPQSKKKKKKKKKKKKKKKKKKKKKKAS 
QKKKKKKKKKKKKKKS SGGGFFKE V 


5492 


19393 


A 


5532 


383 


145 


GGFFFNPPAKKGSFPQPHKLGGPPVFSP 
PPVFKTPPRNIFFGAPKKKKFFPPPGAK 
KYFFKKAPPPFFFFFFFFFFFFF 


5493 


19394 


A 


5533 


418 


328 


KTGFRHVGQAGLKLLASSDLPAIASQSA 
GL 


5494 


19395 


A 


5534 


169 


3 


I ILI IHPCSHVPRRETLGLKACTFKNLP 
WPGMVAHACNPSTSGGRERRITRSGDR 


5495 


19396 


A 


5535 


1 


254 


PTRPCKPPQTSPKSGKLSPSSNGCMKTL 
HISSTNTVGEVIEALLKKFLVTESPAKF 
ALYKRCHREDQVYACKLSDREHPLYLHD 


5496 


19397 




5536 


43 


404 


QGGLTVLPRLVLKSWPDKRSSRLGLPEC 
WDYRYEPPHQAN 


5497 


19398 


A 


SS38 


2 


212 


KENHLNPGGGGCSELRSCRCTPAWATER 
KSVSKKKKKNFRGGGLKGKGPGFYPLEK 
KFPGVGFFPKTNLV 


5498 


19399 


A 


5539 


2 


387 


I FQEHKNCGEMSE lEAKUKYVKLARS LR 
TYGVS FFLVKEKMKGKWKLVPRLLGITK 
DSVMRVDEKTKEVLQEWPLTTVKRWAAS 
PKSFTIDFGEYQESYYSVQTTEGEQISQ 
LIAGYIDIILKKKKKKD 


5499 


19400 


A' 


5540 


317 


407 


GLSHHVFFEEGKLKKPRIKDKDKK7PEP 
DN 


5500 


19401 


A 


5541 


120 


3 


KFFFFFLRRSFAIjSPRLECSGTI SAHCN 
LHLPGSSDSP 


5501 


19402 


A 


5542 


430 


0 


QKAQSKGPEKQEKRGVIQNFKRTLSKKK 


5502 


19403 


A 


5543 


172 


2 


FIFNFLHIFSFWFITKFGCFIVLFLRQS 
LALLPRPGVQWRDLGSLQPPPPRVRPRV 


5503 


19404 


A 


5544 


149 


2 


EKDIQELNSALHQADLIDIYRTLHRKST 
FFSAPHRTYSKIDHIVGSKAL 


5504 


19405 


A 


5545 


84 


276 


TLKKRPKE IRNQGGS PQRKRAQLGKLQQ 
KYPVLNFKAPFFGERGDYYKS FFKTCLD 
MLPRRGKG 


5505 


19406 


A 


5546 


2 


914 


RPAWHEEQKSKKEPEDTEDKEGCQMDKE 
PSAVKKKPKPTNPVEIKEELKSTSPASE 
KADPGAVKDKASPEPEKDFSEKAKPSPH 
PIKDKLKGKDETDSPTVHLGLDSDSESE 
LVIDLGEDHSGREGRKNKKEPKEPS PKQ 
DWGKTPPSTTVGSIISPPETPVLTRSSA 
QTSAAGATATTSTSSTVTVTAPAPAATG 
SPVKKQRPLLPKETAPAVQRWWNSSSK 
FQTSSQKWYMQDSSSSSSSSSPQNQQQQ 
PQSSQGTRYHTRQAVKAVQQKEITQSPS 
TSTITLVTSTQSSPLVTSSGSMST 


5506 


19407 


A 


5547 


2 


244 


TRL I FVFLVETGFRRVGQAGLKLLTSGY 
VGTSASQRMTAHLCLPRCWDYRCEQLHP 
AQHFFKRMLSVAVDAALLHTSGGM 


5507 


19408 


A 


5548 


406 




FFFFFSETESCSISQARVQWCNLSSLBCP 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
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26 


Predicted 
beginning 
nucleotide 

location 

correspond 
ing to first 
amino acid 

sequence 


Predict- 

locat/on 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucinc, 
K=Lysine, L=Leueine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5508 


19409 


A 


5549 


417 


43 


I R PARKGCS SGLFGAPLAGGGAP P P P P P 
RQTSPPPPHTKGGGGGPHKPPPKKKRPP 
PKKKGGGKKNPPKKKKKNTPPPPKSAPP 
PAAKTDPPPPLTVPPFLPPYIFTKKIPF 
LKSYFFFFY 


5509 


19410 


A 


5550 


3 


136 


QPRFPTLTNINENNPTLLNPIKHLAAGS 
LFAGFLITNNISYHSP 


5510 


19411 


A 


5551 


75 


194 


NKELSRNKKKKKKKKKKKKKKKKKHGGG 
LKKKNGGGAKT 


5511 


19412 


A 


5552 


343 


493 


IIFSFFFFFFLKGSFVFAPQAGGHGGNF 
GPLKPPPLGLKEFSCLTLLSSW 


5512 


19413 


A 


5553 


148 


3 


DAVSLFCPGSLEHSDTIIAHYSPDLLGS 
S SLSASTSRVAGITGI CHHA 


5513 


19414 


A 


5554 


407 


252 


KKKFLKKLAGQGGMRLQSLLLRRLRQED 
HSSSGVQSCSELWGGHLIYSKNR 


5514 


19415 


A 


5555 


3 


231 


HEFVFLVETRFHHVGRAGLELLISSYPL 
ASAFREAKSRREDLLSPGVKDKPGQYRD 
TLTLKKKKKSFFIYLGECW 


5515 


19416 




3 


131 


SVWSWEESKDAKELVGYYIEASVAGSG 
KWEPCKNNPVKGSR 


5316 


19417 


A 
A 


5557 


3 


131 


SVWSWEESKDAKELVGYYIEASVAGSG 
KMEPCKNNPVKGSR 


5517 


19418 


5558 


462 


255 


RSHHSAQAGLELQASSDPPALTSQSAGV 
TNVSLRAQPILACSSVEVSSRCQVOjHV 
RRGKCPRflESYR 


5518 


19419 


A 


5559 


445 


29 

Ins 


KSSNILILQKVENGDLSNKILKITDFGL 

AREWHRTTKMSAAGTYAWMAPEVIRASM 

FS KGSDVWS YSVLLWELLTGEVPFRG ID 

CST.AVAYGVAMNKTiALPIPSTCPEPFAKL 

MEDCWNPDPHSRPSFTNILASCQAWR 

GAVESSCTEEHMWIPEKAVEGSLTPTPE 

SRSVAIIAGVQWUNVSSLQPPPPGFRRFS 

CLSLPSSWDYTSAGEGCaaSYRSICVLK 


5519 


19420 


A 


5560 


5320 


19421 


A 


5561 


24S 


142 


NKNIWVQKKCVFYIYLPFICVCVCVCVC 

vcvcvcv 


5521 


19422 


A 


5562 


262 


456 
















KKKKKfiSSPPPLIKRGPKN 


5522 


19423 


A 


5563 


314 


444 


LKSMFCFVLFCFFEIESRSVAQAGVQWC 
HLSPLQPPPPGFKRS 


5523 


19424 


A 


5564 


220 


486 


LNSSHPESPLPLFGLVLQFFYPSNPKSF 
NKKNLEIRMLLTVDRGPNHPDSLFFVHS 
NVDTVFLPQEGPSFLQPLGQGITEAFEH 
NQAYV 


5324 


19425 


A 


5565 


176 


492 


CTKTAVAS KAEYLQALLYALAS VTI AAT 
ATRLASISGASPIFRSFPYLQPTAPGLV 
LPS ILILVDVGLEKEKGPDTVAHACCSPG 
TLGGRGGRITRSGVRDRPDRH 


5525 


19426 


A 


5567 


24 


470 


PMDXKXPXXXETPLNGGAGENQDDMFAK 
LKEKLFNE INKIPLPPWALIAIAGVAGL 
LLLTCCFCICKKCCCKKKKNKKEKGKGM 
KiaaMNMKDMKGGQLPQDDDDAKTGLTEG 
EGEGEEEKEPENLGKLQFSLDYDFQiUSTQ 
LTVGVLQAA 


5526 


19427 


A 


5568 


45 


485 


GXDGRLLRGHNQYAYDGLDYIALNEDLS 
CWTAADTVAQITQRKWEflAREAEQLrAY 
LEGLCVEWLRTHLENGKETLQRADPPKT 
HVTHHPVSDHEATLRCWALGFYPAE I TL 
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SEQIB 
NO: of 
nucleotide 
sequence 


SEQIO 
NO: of 

sequence 


M 
eth 
od 


sEQ ro 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalaiiine, 
G=Glycine, H=Hlstidine, Msoleucine, 
K=Lysine, L=Leuclne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=pos5ible 
nucleotide insertion 














TWQRDGEDQTHDTELVETRPAGDRTFQK 
WAAVEVP 


5527 


19428 




5569 


221 


422 


FFFLTGRVRQSLADLEQKIEVHSMPPFE 
SLALSPTLYCRGAILGATSASQVAGITN 
I CHHAQPS YLF 


5528 


19429 


A 


5570 


401 


477 


I FSRDKDVAMLPRI VLNSWAQVILL 


5529 


19430 




5571 


102 


36 


EVENNDDISHHHHHHHHHHST 


5530 


19431 


A 


5572 


106 


407 


AGRQGWLEGCRPAAWPTLLTPPATSSR 
GASRPPTATCPTPDPAPATTTWQPWPPR 
PLWAGVQWHDLGSSQPPPPGIKQPSCLS 
CPSSWDYRGRRAVCHH 


5531 


19432 


A 


5574 


253 


363 


YHSNQCVGLRKMPGAWHACNPSTLGGR 
GGWIMRSGA 


5532 


19433 


A 


5575 


1 


132 


MECHYVAQAGLELPGSSDLPASASQSTG 
ITGVSHCAWPSPFFFP 


5533 


19434 


A 


5576 


18 


378 


TPGGGGCIEPRWHHCTPAWATRAKLSYK 
KKKKKKK 


5534 


19435 


A 


5577 


21 


232 


ILRCTTIETLLKVQDNHLNPGGGSCSEP 
RSHHCTPTWARERDSVSKKKKKGGAALW 
DPGGGC2PSAGSQTR 


5535 


19436 


A 


5578 


377 


2 


LSL IQT I TLCLGAI TTLFAAVCAVTQND 
IKKIEAFSTSSQIGLIIVTIGINQPHLA 
FLHICTHAFFKAILFMCSGSriHNIiNNE 
QDIRKIGGLLKTIPLTSTSLTIGSLALA 
GIPFLTGFYRTRG 


5536 


19437 


A 


5579 


2 


227 


lESWLEPVRFLRSMFANNLVYDTSDSDD 
YHLLKDLEEGIQTLMGRLEDGSRRTGQI 
LKQTYSKFDTNSHNHVDAA 


5537 


19438 




5580 


113 


3 


NFPFFFFLRQSLAVAQAGVQWCDLGSLQ 
PPPPWFKQ 


5538 


19439 


A 


5581 


184 


3 


IMPPHSRLSPTARFCLQKKTUmiNTPIE 
MASIQCPGWSQTPGLKPPSCFVLPKCWD 
YRCE 


5539 


19440 


A 


5582 


381 


3 


FFFFLGDRAPPGGEKNPLGGGLKTPGKP 
PGGWGVSPPPQKKKMPPPNFFPPGKKKS 
FGGGGPI FFPGBCNFPPPTQKKKRGGGGF 
FFFFKKPGGFFFFFFPFFFFFSSFFLGK 
SFIKISFKKRRRG 


5540 


19441 


A 


5583 


314 


389 


FSCLSLLSSWDYRCVLPHLGNFLYF 


5541 


19442 


A 


5584 


131 




FPLWTSEQGVGRNKQTYVTWQADCKENA 
GGDYYWTFFPQPTFV 


5542 


19443 


A 


5585 


484 


3 


MTEILLKQAMVGIVGSIGSAIGGAVGGG 
ASASGGTAI QAAAAKFHFATGGPTGTGG 
KYEPAGIVHRGEFVFTKEATSRIGVGNL 
YRLMRGYATGGYVGTPGSMADSRSQASG 
TFEQNNHWINNDGTNGQIGPAALKAVY 
DMARKGARDSNSAGDQI FHH 


5543 


19444 


A 


5586 


136 


48 


NTMQPG WTHT CNPS TLGGQGGR I TGSG 
R 


5544 


19445 




5587 


306 


479 


DSFFPGVGSAFLYQVAAWTTKYFKKRV 
ALSTDIARSEMRLTFLLAPFTKFLIDVC 
DW 


5545 


19446 


A 


5588 


201 


2 


ILETGFHHVAQAGLNLLASSNLPASTSQ 
SAGITGVHHHAWPSFLIQGTKHSFKNSK 
PVFHIKLSKN 


5546 


19447 


A 


5589 


30 


477 


TPSLftRGVYPCWLYTGFLHDFLFSPKIE 
ERFMDVGVLRATEPSNSLFAI lYSTPIE 
LAIKSWHIIGDGLSIELVAVERGNRrTr 
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NO: of 
nucleotide 
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sequence 
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od 
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26 


Predicted 
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nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

otide 

location 

corrcspon 

last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plien)'lalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, F=Froline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, '■=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














ISRLPEFEKSSLSDQSLVTLTSGYFAVY 
RVKLTAKKLEGIHEGAFQITTDYE ILTI 

PVKflRNALC 


5547 


19448 




5590 


208 


91 


KGHFWVTGKKKVPCPPPRFFFFFFLDGI 


5548 


19449 


A 


5591 


325 


467 


ADLCSPCDILQLDFRHIRKTVDTLLALG 
EKRPPPTSALESEDI.TSFCF 


5549 


19450 




5592 


332 


403 


LSLCVXHTHTHTHTHTHTREIFIY 


5550 


19451 


A 


5593 


292 


26 


FLLDMEGAEAGAELGLYHCLLCAWETPS 
RLAVLQHLRTPAHRDAQAQRRLQLLQNG 
PTTEEGLAALQSILSFSHGQLRTPGEEE 


5551 


19452 


A 


5594 


387 


51 


LKKIFLGHGQGVSLFFSPPLGGPGGRFL 
GLEIGALFGLPGKTPPFPKIKKNYPGLW 
GGALFPLPLEGLGGKMAFPRKGGAPFSQ 
KNPPPPPLGGKTKP P PPLKKKKKKKVL 


5552 


19453 


A 


5595 


88 


362 


AETSPGQGSYQDGLRRPGFAEGPPAGRN 
PRRKKTQTDRRGGSAGSQPQACADGVRL 
CTRLQCSGAISSHCNLCLPGSSDCPASA 


5553 


19454 


A 


5596 


92 


366 


DQPWPGQLPGRSPEAGIRGGSTSRKKPQ 
EEENSDRSPGRQRG I PASGVRGRCARLY 
QAAVQWCHLGSLQPLPPGFKRLSCLSLP 
SSWDYSH 


5554 


19455 


A 


5597 


3 


138 


TRPRTRGVGQAGLELLTSSDPPTLASQS 
AGITGVSHRARPQSSS 


5555 


19456 


A 


559S 


T50 


3 


SCVIYSWMKFFSFLRWSFTLVTQAGVQW 

CDLSSLQPPPPRFKLPSCPG 


5556 


19457 




5599 


148 


3 


TQHU^ILAKKSQITPGMVAHSVAHAGRQW 
CDLGSPKPLPPGLNRASCLP 


5557 


19458 


A 


5600 


195 


1 


MVYFKKFCGFFLFVRSRIACSVAQVGMQ 
WCNHSSLQPRTPGLKQSSHFSFPSRWNN 
RHMQPCPGA 


5558 


19439 


A 


5601 


228 


415 


VNVS I PKDVFLL III ILFFFGPAPQAGG 
RGCNEKSLLPWPPGLKQFSCLTLPRSWN 
YGLPPP 


5559 


19460 


A 


5602 


405 


2 


AATGVDRHAQRTQAGLPLKTfQTPdPNF 
TSKARGTPSAQHPSCTRQRHQftSNPTQR 
RRRRPQAEHSIRTDAHGPGLPASGHMAG 
ERPTRKAQSRDVSVLRQRIiftLLPRLECT 
SAISAHCSQNLTGSGNPPVSCL 


5360 


19461 


A 


5603 


3 


236 


RQENGMNPGGGACSEPRSRHCTPAWATE 
RNSVSKKKGrKIKGHCSSAMHHQYLSPF 
GWRQVCSHCVCPKPWLLVRGH 


5561 


19462 


A 


5604 


184 


2 


RPFLRIQGKrRMFSGSRFIFNFLXLETW 
SHSVPQARVQWCHRSSLQPKPLVFKSSS 
YLLP 


5562 


19463 


A 


5605 


261 


2 


GWFLKGPPLFLGPKKPGGLGFFQKKLVL 
QKKUPPFGIGPPKNLKKTPLFFFFFLRR 
S lALS PRLECSGTI S AHCNLRI PRTSNS 
PA 


5563 


19464 


A 


5606 


134 


2 


GSPFKNLVFFLGQSLTLSPRLECSGTIS 
AHCKLRIiPGPHQSPAS 


5564 


19465 


A 


5607 


3 


269 


RQESHSNPGGGGCGEPRSRHCTPAWLAT 
ERDSVS KKKKRKRNFLNFIGKKKKPCLP 
KPKLTKKKKWGGITALVLKLYKRAIAPK 
TGGY 


5565 


19465 


A 


5608 


36 


380 


PPFXXPPGGETFGGTPRGPPPKKKKSPR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KHGGGGKKKPPRGAPPPKHLLGGKWRGG 
GTPPPPRKTPPLNPKKKRGGGKRGVFFP 
PKGGIFQKRGFFFSRGGGGGAPPPHPPP 
GGG 


5566 


19467 


A 


5609 




411 


SSRSRAA^UjFYFFFFSRRGQKEKEPFLG 
GKT P P VFVFFPGGVGKKNRGFP P P P PAG 
KGPF SGGGGREQTP IWS PPPPPRGKWGK 
PPKPVTPNFNFLGRVGFLGKNPSPGGVC 
PPPKTPIFFFGRPPPGGFFWGGGG 


5567 


19468 


A 


5610 


364 


483 


ISWVLFFLFLKWSFVLVAQAGEQWRGLG 
SLQPLPPGLKGF 


5568 


19469 


A 


5611 


399 


285 


ggcsepksrpctpawgterdfvskkknk 
nnylvkksn 


5569 


19470 


A 


5612 


408 


3 


ke vfs fp pglehggvf ys l p pl p p rvkg 
fspppppggggprgpppppgyfllfpkk 
ggfpplpggfsspqiwpprppqkvgvpg 

GftPRPGPNFFFFFFRESFPFIiARGFPKK 
KKKTCHWVTISHSASVQSSVLL 


5570 


19471 


A 


5613 




112 


lAQHVFFLLNTS lASPADSSELLMEVHG 
NGKRPSPER 


5571 


19472 


A 


5614 


2 


389 


FVREGERKKEKRKKEKKRKEGTKEKGKE 
EGRKERKKEKVRKKTVDDRRRPQVTPQA 
VNLESEGAPGNCGVTAPPGGHIWLKQVE 
VQTHLENRCHRLKATVTTTTRPGAEPRS 
PPFIITCGEIMNVFSDM 


5572 


19473 


A 


5615 


385 


3 


KPPPGDPGKTPFFLKSQKITGGGGGAPL 
FPPPQKVKAKKRGYPRRQSFKRARLGPC 
PPSRGGKKKAGFKTNPPTKKKNQTKKPM 
KAKTSKLLQENRSGPGTVJiHACNPSTLG 
GQGGW IMRS GDRDT K 


5573 


19474 




5616 


218 


402 


RSYSFSQLVGTVIKARKPESSCWLCAPM 
RKNAFWPGAVAHTCNPSTLGGRGGRIMR 
SGDGD 


5574 ' 


19475 


A 


5617 


2 


235 


KSFGCASRLQMHGRTHTGEICPYKCKQ CG 
KAFGCPSNLRRHGRTHTGEKPYKCNQCG 
KVFRCSSQLQVHGRAHCIDTP 


5575 


19476 


A 


5618 


347 


421 


IYLXXKKKKKICKKKKKKKKQKKKKK 


5576 


19477 


A 


5619 


92 


1 


ENLALLPRLECNGTILaHCNLRLPGSSD 
SP 


5577 


19478 


A 


5620 


3 


402 


APPPSFFFKKGLIMAISPTLGAIKLINF 
LFFFKKGKKKKKPPLEKKKKKKKKKGGA 
PLKKICKFSPPGGGEKIFFKGAPKKISRA 

RVKKQRRGEKTRGNPKKKRGKKKRPFPW 
TVSR 


5578 


19479 


A 


5621 


420 


3 


HPPPQGSANFFPPPRKIGINGPPPPAGI 
IFVFLECRGFPQVGQGGFKIRPPNKPPP 
PPPKNLGLQKKSPPPPPIFFFFFLWRWG 
LMTLPRLVSNSWLKQSSHLGLLKHWDYM 
SEAPHLAKNFLKEYEWSLSFHVEGS 


5579 


19480 


A 


5622 


246 


132 


EVAPHLLGC IS PTMCVCLCVCVCVCVCV 
CVCVCLCSW 


5580 


19481 


A 


5623 


219 


424 


I LNVMVLG I GTFGRLLGLEGRALTNG I K 
ALIRRDMGPGAVAHTCNPNILGGREGWI 
MRSGDGDHPGQH 


5581 


19482 


A 


5624 


32 


454 


GLEKVHSMTXDRNSRLSVQLRDWDGGGE 
LLQFSVHLDGEDTAYSLQLTAPVAGQLG 
ATTVPPSGLSVPFSTWDQDHDLRRDKNC 
AKSLSGGWWFGTCSHSNLNGQYFRSIPQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QRQKLKKGI FWKTWRGRYYPLQATTML I 
Q 


5582 


19483 


A 


5625 


166 


2 


LFGLVKLTALKGLKMI LPSQVWWLTPVI 
PTLWEAKAGGSQGQEFETSLANTVKP 


5583 


19484 


A 


5626 


120 


406 


GEAATQENLAELRPEPELLS PS TVLSRE 
PELPSPSTVLSREPELPSPSTVLSRKPD 
LLSPSTVLSRKPDLLSPSTVLSRKPDLL 
SPSTVLSRKPDLLSPSTVLSRKPDLLSP 
STVLSRKPD 


5584 


19485 


A 


5627 


324 


16 


LSPPPPLFKTAPRPQKKNPPPKKKKPAP 
PKKKNIjFFFFFFFFLVFFFIWCSWRLAE 
WNETLGECDMRWGQAVGTGTSHELRPGG 
SRSQPRRLAGKELGPHCAP 


5585 


19486 


A 


5628 


378 


72 


GRGGERPPPPQKSGAKKKPGVLNFFPQR 
GKRSGPPGPPGFFKIFSQEAPGFFSSFF 
WGRSIiALFPSWGARGGFLVPANLGPPAG 
APKGNPPSKKKICEKIED 


5586 


19487 


A 


5629 


3 


204 


QKSCLNMGGRGCSHPRSCHCTPAWATHK 
DSVSKKKKKWKFGGFYEVIGNFLHLFPK 
WSYPPSFRIF 


5587 


19488 


A 


5630 


177 


1 


SFHEMRFWKLIFIIRIYVHFERRSCPVP 
QSRVQWHDLGSLQPLPIjGFKRFSCLGLL 
SNW 


5588 


19489 


-A- 


5631 


154 


3 


KWFQMGSHSVTQAGARQCDHSSLRPPTP 
QLKRSSCLSFPGGWDYEHVPPC 


5589 


19490 


A 


5632 


239 


17 


EGVRNYLlLQPRSLCRIJSrCSFFPNREKE 
LCHHSSFSSPLAAAKESQGEKGRLLSQD 
EGLLLWEVFVEDVEINS 


5590 


19491 


A 


5633 


2 


110 


GCSEPRSHHCIPAWVTEGDSVSKKQPPP 
QKKKCIAG 


5591 


19492 


A 


5634 


433 


0 


FFFFFFFFFFLKKI PPPEGGGF 


5592 


19493 


A 


5635 


306 


433 


FSISIERSSSGRAQLLTPVMPALWEAEA 
GRSRSQEIKTILAN 


5593 


19494 




5636 


2 


285 


QSRTSVIFWSEIOiERGEETTEEKFEASR 
GWLVKFKKRSCLHNIKVQSEAI SADVEA 
AASDPEDL&KI IDEDGYTKQHI FGQALC 
LMSNPTTLGG 


5594 


19495 


A 


5637 


160 


1 


LYGGKIYWKFIRLNFFLSQGLTLSPKL 
ECTGTITAHCSLKLISPSNPPARAA 


5595 


19496 


A 


5638 


3 


261 


KIQLNPGSQGCSGLSLCHCS PAWATETL 
SQKKKKKNPPFGi'KIKGEIEPLHEFGTK 
KKKGGFLKKFPPHPLGFFLEPFTMGGFW 

N 


5596 


19497 


A 


5639 


2 


121 


SELRSCHCTPAWRQSKTTSHKKKKNIKK 
KNKFFLPFREN 


5597 


1949S 


A 


5640 


370 


ISO 


vemefchvdqsglellisgdtptsashut 

AGITGVSHRTWQPSFFSFFKERRNLKSV 
TSLNVSP 


5598 


19499 


A 


5641 


254 


1 


FYILRKVFVFLFPAVSYGKGTYPAVDAS- 
YSAKDTYSKPDSNGRKHMYWRVLTGVF 
TKGRAGLVTPPPKNPHNPTDLFDSVTNN 


5599 


19500 


A 


5642 


292 


106 


MFVILFYLFKOGSHYVAQAGLEPLSSSD 
PLATASQRAGITSVSHRflQTKQAETYW 
PFSLFT 


5600 


19501 


A 


5643 


3 


100 


DTFFEVEVEMEKEVCRDLMCTSPKDEEG 
FLLI 


5601 


19502 


A 


5644 


1 


282 


EKVPKAKRLYGKRGDPFYEAQENHNLIG 
VANVFLECLFCDVKLQYAVPI ISQQGEV 
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SEQ IX) 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
lociition 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib]e 
nucleotide insertion 














STGQPWRTMWQVFCFGLFWFSFYLSYV 
LEVDQETFCAN 


5602 


19503 


A 


5645 


142 


67 


NKMRPGTVAHACYPNTLGGQGGQIT 


5603 


19504 


A 


5646 


350 


213 


GLELLTSSDPPASASQSAGITSMSHRPW 
PPLASRLDLCCGTLYLI 


5604 


19505 


A 


5647 


3 


180 


EHNAERAFLAQAHLGPGAPCTPGFPS FI 
PTQFPPFRSSGLPNIPVQTISRAAAEKL 
FG 


5605 


19506 


A 


5648 


366 




RSWFTTYKERSLLHIIKVQREAASADIE 
TAASYSEDLAKITHESGYPKQQIFNVDK 
TAFYWKMIPLSTCIAREEKSISGFKASK 
DRLTLWLGANAIGDIKLKPMIVYHFKTP 
RALKNYASSL 


5606 


19507 


A 


5649 


1 


355 


QIQSSQSGKGLLLTSFIDEETKLFFFGN 
KVWFCIPPGPGGKLCFKKKKIFGLYPGG 
GGFWGPQTPPIAPTFPPGFPPPPGPNFS 
PRGPFPKMGPALRGPGKKKKPPEKTKPA 
KGGTGP 


5607 


19508 


A 


5650 


348 


124 


RAPPVFFWVFLGQFPKKKSTNSPRGGGP 
PRS FQPFSLPPFFRNVFKKKPPPFFFEG 
GRGPPAPPFKMPFFFFSRL 


5608 


19509 


A 


5651 


194 


43 


KKKFFFFFYEMESHPVTQ2WVVQWHHLGS 
LQPPTPTFERFSCLTLSLIAIP 


5609 


19510 


A 


5652 


27 


244 


feQQSiLFLFYFLSPlLFFFLVKTGSHY 
AAQAGLELRflSSNPNTSVSLPKSWDYGH 
EHTVPSPTWFLAAKTP 


5610 


19511 


A 


5653 


38 


429 


GLFLSCLFFFFLEKGVPLTPPGATGENP 
APKKKBTGQKLLKNRDFFLKKAMFGKQK 
QKKFFFAPTITGDQRflPRVFKGPGAQKE 
PFAQKKKERGGGKKTR?«iPNKKKKPQAF 
FSLKKTPLWAGGGPPPFF 


5611 


19512 


A 


5654 


391 


68 


NLYPHGGVPGVSPGKGSLPKAPNPLGGI 
PPSGEPPPLPPQGFFPCLKPKKRL6KAP 
GPFFFFFLAILLGWFQISNTDSCVQVFI 
IQTIWIIFTTIIKBCHLHFTFLF 


5612 


19513 


A 


5655 


159 


34 


GARFLFFSPVLSPLIEHCVCVCVCVCVC 
VCVCVCVCVLCaCMCT 


5613 


19514 


A 


5656 


88 


388 


S IFCFIYVFFLRQSFTLVAQAGVQWRDL 
GPLQSPPQKKKGKETFSFLQENRYLQPK 
NFPTFLTQVCFFEKIPPNGLINGBCRPKW 
IPGCEFLKTLWGPW 


5614 


19515 


A 


5657 


176 


371 


hfyfcfsdinlaaepecvnrgkagVkrSa 
aemygsvtehpspspllrsgtllfital 

cpsvgifsf 


5615 


19516 


A 


5658 


3 


113 


CSEPRSRHCTPAWATERDSVSKKQQTKN 

LSYKKIHW 


5616 


19517 


A 


5659 


409 


67 


sssnkpptrpvfilpffghgvprlppgk 
kppgpppgppppqveqmygrkggfalnl 
pprrqnknmlyspelgek 


5617 


19518 


A 


5660 


2 


377 


ORIS csrntplrhrslmlivcglrmpls 
gflpwvgqvflali pvnpqps s lgvrti 
mdmlrccrdfiygqfwgqfmaelwgacs 
qqyteffrfkiifissllfnivlevlar 
airqekelngmlq 


5618 


19519 


A 


5661 


256 


381 


nfntgqawwlmpvilaiweaeagesrgq 
eietilantvkphl 


5619 


19520 


A 


5662 


19 


313 


lysslailgflpllinfiiiiiffffer 
dpgqqnqnfvslkkretppppfslgvpg 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
corrcspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GlLtamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KHSLLPPPPGPKHAGPPVLPAGLWGGPE 
PPTWGRGAKQKREI 


5620 


19521 


A 


5663 


78 


365 


ETGRKRKETRWNLKKVCLLKMSLQGEKE 
ERKRE KYRRTEMKRQKNSEKQRGREKGK 
KSKYQRRRKKEKKGENQTKPKPKVEESS 
SRQ 


5621 


19522 


A 


5664 


159 


28 


DTSTLPTPTGFKRFFCLSLPSKWDYRRP 
PICLPNFFFFLWTGY 


"5622" 


19523 


AT" 


5665 


350 


151 


IRCLCPDKFLAFCRDGVLLCCSGWSRTP 
GLKRSPCPSLPKSWDHRLCLADIYIYIF 
LKDTSTIMSF 


5623 


19524 


A 


5666 


394 


248 


LTPGGGGCGKLR LRHCTPAKATRANLPL 
KTKKKRKHFLFTL?^TSKVY 


5624 


19525 


A 


5667 


1 


194 


RLNPGGRGCS E LRS RHCI PAWATRTKLH 
LKKKKKGDQGFRGDHTKNIWLFPRVFLP 
LHSLNTLK 


5625 


19526 




5668 


2 


312 


RGQNSARGIIIIAILKKVCVYIYRCGLT 
LSPKLKCRGTIIAPCSLELMGSCDPSTS 
ASQIAETADMHHHTRLIIASLTFCLLKK 
KKNVFCGTYNHCPPKGFYL 


5626 


19527 


A 


5669 


1 


245 


RLNPGGGGCGEPRSSHCTPVWATRVKLS 
LKKKKKKRP P FFFFGIRQP PQGGPFQTL 
TPLNLLKNLTTLLGAKKFFFFTGGK 


5627 


19528 


A 


5670 


1 


135 


ARTPSPSLSESSEDEKPTKKHKKGKALR 
LKRRFWWLMSALPCIH 


5628 


19529 


A 


3671 


97 


2 


KKFTHVWPGWSHACSPSTLGGRGGRIT 

RLN 












173 


VFSPPPPFKTPPPPPKKIPPPKKKICrPP 
PKKKKLFFFFFFFFFFFFFFGTRIVIDN 

ILSGNRV 


5630 


19531 


A 


5673 


237 


13 


SPGGGGGFAPYWMFFLDFERQSCSVaQA 
GVQWHMLSSLQPSPPRFXFFSHDGVKMR 
PVPRYAPPS PKGI S LRPYK 


5631 


19532 


A 


5674 


245 


14 


ANFLKAHIVSTLGLQI IWSLLQLUSIFAI 
VFYFLRQSFALVAQAGVQWRDIGSLQPL 
PPRFKRFACQRFSWEAEVAVS 


5632 


19533 


A 


5675 


2 


249 


SHPl-HSaPQHLPLTPlfWNWSSVPVDFIF 
RKAPPVFPHWQHHRAGPGTSLIPTDAPQ 
EflSPPPPYPSSPSSTPTIPRETFLRQ 


5633 


19534 


A 


5676 


115 


241 


SSTLGGQGSITLAQELRSCHCTPAWATG 
QDSISKKKKTLLGP 


5634 


19535 


A 


5677 


460 


2 


RLGVTGRWQKLKPQERMRAWRENGKGPK 
ALPASWEGPTVNLLWKHEHRAWVSCHCD 
SGFCAWCQGLGSEQSGLPSLIPSWDGLV 
LTSPPNPTGPP P PLKTRSVLRGDDVLLP 
CDQPSNIjARALWLLNGSMGLSDGQGGYR 
VGVDGLLVTDAQ 


5635 


19536 


A 


5678 


2 


346 


ENCLNSGGEGCSKLRLCHCTPAWVTERD 
S ITKEKKKKNFKPVLEKFGEI FFLKGWK 
FFQLNPLKSSKEMEFFUTTLPLEPCFNG 
NKPFWAAKIWTLNFFQTQKGSIFKWGGR 
KK 


5636 


19537 


A 


5679 


181 




PKEVRTLFGRFFFFFHMESHSWQAGVQ 
WCNFSSPQPPPPGFQKFSCLSLPISWNS 
RHM 


5637 


19538 


A 


5680 


549 


410 


FFVQTDFCHVAQAGLQLLGSSNPPASAS 
PKYWDYRREPVCWPSSGF 


5638 


19539 


A 


5681 


111 


403 


SRQINGFFWSRTKVAIGQFLVGPSVKI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartie Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=pD$sible 
nucleotide insertion 














FENMNE I CM I KS I QS YI VLIIHKHLKVYS 
SNIWQLKNILLRGGAVAHACNPS ILGGR 
GRRITRSRDRDHP 


5639 


19540 


A 


5682 


189 


3 


NHNRKASWKKKDRVLLYCPGCNWCDLGS 
LQPLPPGFKRFSCLKLPKCWDYKHEPLH 
PAPHV 


5640 


19541 


A 


5683 


2 


227 


IHVCVCVRVCVCILREDSAQPARCVCVC 
VCVC VC VC VCLDPRRELS PTS QLES S CH 
PLYVPSGGSRINRPYPGGC 


5641 


19542 


A 


5684 


318 


421 


DVPCFPVAFCCDCKFPEASPAMLNCESM 

KPLSFI 


5642 


19543 


A 


5685 


213 


398 


SLCCLLFCCCYFETESRSWQAGVQWHD ' 
HSSPQPGPPRLKSSSHLSLWRHDHSSPQ 

PGPPRL 


5643 


19544 


A 


5686 


398 


1 


SFPTSSTYSSSVQGGGQGAHSYKSCTAP 
TAQPHDRPLTASSSDAPGQRVQNLHAYQ 
SGHLSYDQCQQQQQQQQQQQQALQSRHH 
AQETLHyQNLAKYQHYGQQGQGYCQPDA 
AVHTPEQYYQTFSPSSSHRE 


5644 


19545 


A 


5687 


2 


288 


SDRPRSPSFSFPSSPAGVGRASSIFPIL 
SAILLLLGGVCVAASRVYKSKRNIILGA 
GILFVAAGERQREGATGAAHLGflHVCVC 
VCVCVRARARA 


5645 


19546 


A' 


5688 


314 


2 


CRPGSQLQELGDGGGGESFVGGHGQQHP 
Ga.PQRQRYLQPLLMCGaSPHASPRQPIiA 
ILAAGVKFRVLEHEAGRPLRLLMQINPL 
PYSRWHTYRLPSCGCLTCI 


5646 


19547 


A 


5689 


122 


3 


MGGGEFSKKMNINGQGWAHACNPSTLG 
GRGRQITRSGV 


5647 


19548 


A 


5690 


144 


1 


EAVKKYTCPTWAGAVAHACNPSTLGGRG 
GWITRSRGRDNWGKHGEIPR 


5648 


19549 


A 


5691 


377 


65 


LGVFPPFFFLRGEGLSPPPPGGLVKNQF 
FFPPGQKGPPGTPER3FC2GFLGPKFFP P 
LPPQGP KG I FFLKKKKKKKI P I CftLAVL 
TYQHFCYRTGQERVDGIJiEL 


5649 


19550 


A 


5692 


12 


147 


YERRFCHIGQAGLELLTSGNPPASASHG 
AE ITGVSYHALMRNIS 


5650 


19551 


A 


5693 


183 


395 


GHRFGFLHPPPPGFGGFERPPPPKTEGK 
RAPPPPPRFFFFLGGGGFSGGGPRGVLF 
LEHENPPPGAPK3VG 


5631 


19532 




5694 


177 


283 


THFMTRTDGRRWSLaSLPSSaYGTNTPS 
STVSVSA 


5552 


19553 


A 


5695 


266 


460 


LSQRKYAFFPFFFFFGKKIFFSSPGGE I 
RAQFFLTEPS PFWVKGFSPLPLPKTWDY 
IGRDPPPLG 


3653 


19554 




5696 


1 


412 


MQPLKVNLLEQKNLGLEFVAPFTWIHCP 
SYWS lAYNVYFSQYSFIWVFCTMFITVN 
IPSKKKKKKKKKKKKKKKKKKSSSPGSP 
GYAPKKAPKNKKKK 




19555 




5697 


379 


0 


FFFPSQKKKKKKKKKKKK 


5655 


19556 


A 


5698 


1 


187 


PTRPPTRFRFENGVNLGGGGCGEPRLCH 
CTPAWVTERDSVSKKKKKKKKAVGGPWF 
FQKKKV 


5656 


19557 


A 


5699 


374 


0 


DFLLQKKKKKKKKKKKKKKKKKKKKRGG 


5657 


19558 


A 


5700 


219 


413 


PFCPQGGGGGPNFVFLGPPP PGLKE I PP 
PPPRGGGKKKPPPPPRGGFFFFKKKGVS 


5658 


19559 


A 


5701 


381 


249 


FHRAGQDGMDLLAPGSAPLGLPKCWDYR 
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Predict- 

nucle- 
otide 
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correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=LeucinE, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=irnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














REPPRPAFFFFFFF 


5659 


19560 


A 


5702 


382 


197 


LANFCIFSRDGILPCCPGWSGTPGLKQS 
AHLCLPKCWDYTHKPPCPAPQVIKPS IN 
ILIVN 


5660 


19561 


A 


5703 




381 


ERGSKHPPRGLXHIRGRTCFVLKGCMIQ 
LCWPPGQGKKKKEGGGEKKKDKKREEE 
GREKKKNGNQGERVKKKKSKRKKKKRSS 


5661 


19562 


A 


5704 


291 


167 


NGPPFFFFFLVKMGFCHVGQAGLEFLTP 
GDPPAIiRSRLLGL 


5662 


19563 


A 


5705 


367 


23 


KNPCPPPPPEFFLAPKKKGPPPRKKTRF 
VPPPLFLIKGPPFKRGKGPPPGGFPPLF 
CPLGGKRGGVFLSPRVGKPRGKGGNPLF 
PKKKKKKKRRRRKKEQKKQRKERQGYIH 

TV 


5663 


19564 


A 


5706 


411 


116 


FLKRWGSHYAAQAGLELLDS SDLPVSAS 
QSAGTTGVTHCSRLSYSALACQLCQPVW 
RIPWRGRGVGRETSFRGWGRRYGRAWSS 
SGGQAESLPGLSPG 


5664 


19565 


A 


5707 


162 


2 


SYCFPNSDDFIPFQPLPHTWFLWPRPFF 
IFKTAH3WARWLTPVIPALWEAEAG 


5665 


19566 


A 


570S 




416 


LPLLYKIPPPPAILFLFIKDSPPPPPPP 
PFFFFFFPPP 


56^6 


19567 


A 


5709 


403 


142 


LSKfildCWGWPRGFSPRlFEKPPPPKlFF 
APPKKKIFPPPPGEKKFFFKGAPPPFFF 
FFFSFFFFFFDCYMSSTWNSAWHIRCSI 
SIC 


5667 


19568 


A 


5710 


339 


461 


PSTFILFILRRSFALVACX3GVRWRHLGS 
PQPPPPGFKRVPA 


5668 


19569 


A 


5711 


408 


42 


TQARIIFSGPPKKTYPSPPRAFIILSFK 
RAAPQLIYFYLNLFAYKRLVKLTTVIMR 
RRPTFFIiLFKTFLFVEPGFCHVaOAGL 
QFLGSSTCGSLPQCWDYRCEPQHLAKRS 
TFIKSESAV 


5669 


19570 


— 


5712 


277 


9 


GVCVCVGGVWPPMGLQTPSAPWGLSIA 
PPLGTLCSVOWLAESimiCICQALVE^L 
RHQLYQVPVRKLLIASSIVSGFGDSIWN 
GSYLR 


5670 


19571 


A 


5713 


411 


0 


FLQILLFLIDIAFFLVFFFPFWKK 


5671 


19572 


A 


5714 


205 


400 


HYLIQSSSSMASESVQLDWMKIPLTWP 
GAVAHTCSPSTLGGRGGRITRSGDGDHP 

GQHDEAPSL 


5672 


19573 




5715 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5673 


19574 


A 


5716 


176 


392 


GVHLVLEWKLSMVCAEDDELFPLCSWKK 
FFFFLFLRQSRSVTQAGVQGRNFGSLQP 
LPPGLKRISCLTRPSS 


5674 


19575 


A 


5717 


1 




FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5675 


19576 


A 


5718 


306 


42 


GSGLRRCALNSVKCLNTLECILLGIYLR 
QIKICVYTKTCTCKFIVTLFI lAKKYEY 
PSVHQLINKMWHIQIIFLVIKHKEGCGG 
SCL 


5676 


19577 


A 


5719 


1 


263 


PTRPGRFFLILGDRGCSELKS CVCTPAW 
VTEGDS I S KQNKKQVELRS PAAHLLGGL 
VPNRPQASTRIGDPCTKKLLELISKFSK 
AAA 


5677 


19578 


A 


5720 


3 


278 


NQE ILKNVESSRTVQPHFLEFLLSLGWS 
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sequence 


M 
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od 
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26 
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correspond 
ing to first 

residue"? 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, !VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VDVGRHPGWTGHVSTSWS INCCDDGEGS 
QQEEVISSEDIGASIFNGQKKSMHIIKQ 

IKYFLSH 


5678 


19579 


A 


3721 


127 


3 


KKKKKKEEEEQGEGRRRiOlKKKKEKEKK 
EKEEDGEGKEKGE 


5679 


19580 


A 


5722 


204 


96 


vtsrsslfstpgikrasClcSlpkcwdyr 

PQPPHLA 


5680 


19581 


A 


5723 


3 


261 


QIVRQWKYHDDDDDQLFyTRVYLDPGLR 
VGKAEEWaHPAftLWEWRPQaEWDLGSRG 
GCKEKNVCLGWJiHRCHPRLLGGLGRRI 
A 


5681 


19582 


A 


5724 


134 


358 


PPPPPPPXLLFFVSAVPPSPSPSLSFPP 
HRPCYSPPSPTLSSLPPPLLFPPPPSSP 
PPPPRPSAPSPPPSSPFPS 


5682 


19583 


A 


5726 


248 


470 


LLNVQNGIRNVCSCPHYTPFHKWKYLS 
VFLTEVDSSQLRRQLCGGSCjaaiBRMIH 
FGREKVQILHLALIYFLV 


5683 


19584 


A 


5727 


1 


281 


NQHNRGFFDSCDGFFTNYKWREEHLERM 
LGQAGERRADVYIiGVDVFARWNWGGRF 
DTDKGGGGFLSKGQRPSASP6TSFPHGP 
SIPVCTSKE 


5684 


19585 


A 


5728 


460 


3 


SQTRTLTLDREITLPSSGTALISLVDGS 
GNPVSVKVQSVTDGVKVKVSRVPDGVAE 
YS VWELKLPTLRQRLFRCVS IHEISDDGT 
YAITAVQHVPEKEAIVDNGAHBDGEQSG 
TVWGVTPPAVQHLTAEVTADSGEYQVLA 
RWDTPKGGPVTV 


5685 


19586 


A 


5729 


70 


465 


SQCSPHHSVLCQKVQDLESGTLQSNWKK 
ITPWDEHVCVCVCVCVCPCVSMCVYVCV 
YVCLCVCVCVTVCMCVCLYIKGLLFLLA 
KKETAGSFFHSQLTCLGVPCAFTIHPVP 
FLPQCRSHCVSGGRFWPGVL 


5686 


19587 


A 


5730 


281 


427 


SWLTAAPNFQLQMESRSVAQAGAQWHDL 
GSLC3APPPGPQSAETQKC2MIV 


5687 


19588 


A 


5731 


418 


194 


TKFHHVGHAGLBLVTSGDLLSSASQSAR 
I TDMHHCSWPASVLSMNRPSGSTRCLRN 
GIFGPLKDFKPNDLTVQL 


5688 


19589 


A 


5732 


229 


2 


PaGAPPLVRSGFPGWGG!iPPFVFVPNGE 
PPKNFFFFFDGVSLC3lPGWaAVAGSQLT 
ASSTSHVHAILLPQPGIVA 


5689 


19590 


A 


5733 


435 


138 


PPRYPPPLWGAPPHLSRGGGSFTPLTPQ 
GKPRFFLKIKINPGGGPPPYSPPFGGGG 
PRMFFSPGAKGPLVLISPLFFPPGDQKK 
NSFFKKKKKERERE 


5690 


19591 


A 


5734 


113 


409 


KDVCGGVLTGLSGVLTSPEYPNNYPNSM 
ECHWVIRAAGPAHVKLVFVDFQVEGNEE 
CTYDYVAVLGGSSPTPVHQYCGSISAPY 
LFFLAKTrlMSYSLPN 


5691 


19592 


A 


5735 




122 


GGWQVQFVSEPCSLSINGVIFGLTSTDL 
LFHLGAEEISR 


5692 


19593 


A 


5736 


14 


428 


FLiALRWGHIRLPASGPRDTATLFSTLDT 
QLLMTLYVGKDETGFYVSKALVHTGVAL 
VPRGLTLAPADGPTTDEVTLQVS6EREG 
SPSTAVEYPSGSVALPSQWLLIGHHELP 
PVLHTTMLRVHPTLGSGTAETRPPEN 


5693 


19594 


A 


5737 


415 


1 


QHINTLGQFWTTKAFLPRMLELQNGHIV 
CLNSVLALSAIPGAIDYCTSKASAFAFM 
ESLTLGLLDCPGVSATTVLPFHTSTEMF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 

last^mino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Aspardgine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib)e 
nucleotide deletion, \=possible 
nucleotide insertion 














QGMRVRFPNLFPPLKPETVARRTVEAVQ 
LTQALLLLPWTMHALVILKSILPQAA 


5694 


19595 


A 


5738 


335 


152 


GVEMGFHHWQAGLEFLSSSDPPASRSQ 
SAGITGVSHQASPFFLPLSGITLPLSIY 
SSGSD 


5695 


19596 


A 


5739 


35 


462 


IDASDFGECSQAAINDNWIVYELLEEM 
VDNGFPLATESNILKELIKPPTILRSW 
NSITGSSNAGDTLPTGQLSNIPWRRAGV 
KYTNNEAYFDflflEEinaLIDKSRSTVFA 
EIQGVIDACLKLSGMPDLSLSFBONPTLL 
DD 


5696 


19597 


A 


5740 


429 


325 


NPGGRGCSELRSHPCTPAWVTEQDAILG 
GRSGRIT 


5697 


19598 


A 


5741 


344 


423 


FWPGAVAHACNPSTLGGRGGRITRSR 


5698 


19599 


A 


5742 


431 


103 


RGPFFPRPRQKKGGSPGGGPPQNKKNGP 

PLHTGGAKGGSFFKKKKKKKVPKKGPGL 
FFPGP QG PRKG F VG AF P VL I PNRLGP PR 
IPPGWEPGRQIFFLAPVFFFPSKPK 


5699 


19600 


A 


5743 


31 


224 


EEMGFyHAEQADFELLSSNDPAALAFQG 
AGIASMSHCSWPGKNFNMFCCIWKMKRL 
AIHWECIA 


5700 


19601 




5744 


'217 


400 


HSVMGGSPSFFFFFFFFBGEKGLFFFCP 
GGPKLGKGNPPPPGKKGFPPPPPPKGGN 
NGAPP 


5701 


19602 


A 


5745 


4 


455 


DGLEFPGRRFRGQRCSSPPHRGRGQVEA 
LLTSQTGRRGRGAPHVSDDGRRGRDAPH 
FLD6LVAGKRRSSLPIWDGGRAETLLTF 
QTGQPGRGAPHVPDDGRPGRKAPHFPDG 
VAAGQRLQSRHFGGC3GRRLGGGGCIEPR 
SRHCTPAWAP 


5702 


19603 


A 


5746 


381 


212 


PRGGGGGRYSPPPGGG6RGNSFYPGGGG 
LNKPHFGPCPPPGGPKMKPPPQKKKKSL 


5703 


19604 


A 


5747 


334 


432 


MFCEMYKEIAWPQAVAHSCNPNTLGGRG 
RHITC 


5704 


19605 


A 


5748 


190 


1 


I PPKKGGLNFFSPPTPKKGVKSKPPC2GK 
VFFFLRRSFTLVTQHGVCJWKDLGSPQPP 

APGSKHE 


5705 


19606 


A 


5749 


425 


297 


E SAFHHVAQAGLELLGSCDLPVSASQSV 

GVTGMSHHTRLYCS 


5706 


19607 


A 


5750 


425 


0 


KKKQRKKERKKERKKERKKERKKEKKRT 
KSDE 


5707 


19608 


A 


5751 


136 


3 


AQSSRPAWLIFLFLVETGFHWVGQAGLE 
LLASGDLPTSQVPRRP 


5708 


19609 


A 


5752 


3 


237 


GCSEPRSRHCTPAWVTEGDSVSKTNKQK 
TQRGLKS lALFPAIHSLLPWIKLAMPI 
IiAMDIFPHPIVEFSYHVWEQM 


5709 


19610 


A 


5753 


1 


358 


GDRGCSELKSCHCIPAWA7TEQDLPSKKK 
KKKKGGKKTRNGPQTCG6DLPPPPLGGG 
KNFPKGETFWFHSGKKKRFFSLGKGEKC 
FFIPRAPKGPGKTLGGPNRGGGAKKKLK 
HPPGPVY 


5710 


19611 


A 


5754 


2 


191 


QGCSELCSYHCTPAWEtRaRPVSKKKKP 
KTKNQTKKNRLGKKPPNLIFPPFQKKKK 

WVQIPGK 


5711 


19612 


A 


5755 


42 


408 


KHAAPPASLSLSLLLHHGQKRACFPFAF 
CRDCQFLEGSPAMLPVQPAKLTPRSTPP 
HPCSLENDSAHFFN 


5712 . 


19613 


A 


5756 


179 


410 


LQAFLRPSPPVLFLLYADLSNQNLWPGM 
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od 
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nucleotide 
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ing to first 
amino acid 
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Predict- 
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location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, ]yi=iVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VflEACNPTTLGGRGGRIT 


5713 


19614 


A 


5757 


166 


2 


GRPGPADFRVRPQLLQRFLFIYLFTEME 
S CS VTQAGVQWCNLGSLQPLPP GLPD 


5714 


19615 


A 


5758 


334 


3 


EEKVFGLPSPILLSKGQGGAASGPPCPI 
S PRAMGPQKKGWETMVF ITKAP P SFGFV 
LVKQSPGXGPVPRISIFFFE'IjRRSLALSPR 
AGVQWRDLGSMQAPPPGFTPFSCLNL 


5715 


19616 


A 


5759 


373 


470 


SSCWSLRNSSPGAVAHACNPNTLGGRGG 
RITR 


5716 


1961<7 


A 


5760 


210 


384 


EALDTTIFFFFFFETKFCFVPQAGGQGQ 
NLGSLGAPPTGLTPFSCLTLRKTWDCGP 
PP 


5717 


19618 


A 


5761 


257 


400 


SLGSEYTWVCFCVFETESRSVGQVGVQW 
CNLSSLQPPPPGFKRFSCLR 


5718 


19619 


A 


5762 


400 


2 


ARAVGSVGEDPGLGTGPSPARETWEWAV 
VCRACRGSQLLWGKLFILRHWDLRKVLA 
RGNGTEDCLKQLWSQCGAGMGRGPFQGL 
KGKPVGCLCSSREVFYLFLYLFRDRVSL 
CHPGWNAVKQTRLTAASASQ 


5719 


19620 


A 


5763 


34 


440 


RVRAPLGWEEGNYFVLR2VKPRS GKRS CL 
PFLRCGHQEPLLWGISTASPLRHLVGAS 
QICCVLFLCFQVSLKPAGSWKWIGNNG 
ERKAQRLCPURATRVEGIFCFRFVFLRW 
SFTLVAQRGVQWRDLGSLQPPPL 


5720 


19621 


A 


5764 


297 


468 


LVGAGVPTGCGQENGVWSHKSVQAQWL 
MPVIPALMEAEAGGSRSQElETIIiMTMV 

T 


5721 


19622 


A 


5765 


1 


205 


GFKLTDSFASGAICAILRSQLLGRLRQEN 
RWNLGGGGCMGLKWHNCPPAWAIEGNLL 
FKKKKKKGGGTF 


5722 


19623 


A 


5766 


307 


1 


KERDPPPTPFGKTFFFFFFKRNFCFFPP 
KEGQGGILGPRHPFFSRLSFLGRGVPRG 
FPPGPRNFFFFWir.VKMGVPHVGQftDFK 
LLTSGDLNKTASQNAGIK 


5723 


19624 


A 


5767 


2 


319 


LLGNGEELSRFPFQRSSQAWVFESSAGI 
FGGKAGATGLEKLTGSFQQLTGHPDPRT 
PEELGDPEKSSASEKTRGLQEANGVPEI 
LKAVTYTQAWPKEAKUDPITP 


5724 


19625 


A 


5768 


787 


1118 


EAAWRELEAERAQLQSQLQREQEEIULAR 
RKAEKEQLSEEIAALQQEHDEGLLLAES 
EKQQVREPWRGLCCSLSCSSSGAGPCSA 
TWQLGALGQATALLGASVSIiPAGGED 


5725 


19626 


A 


5769 


2 


452 


NGAGTMSVSLEADENPFAQGALKSEDCF 
ILDHGKDGKIFVWKGKQftHTEERKAALK 
TASDFITKMDYPKQTQVSVLPEGGETPL 
PKQFFKiSrWRDPDQTDGLGLTYLSSHIAN 
VERVPFDGGNIiDTFTAMAAQHGMDDDGT 
GQKQIWRIEG 


5726 


19627 


A 


5770 


2 


130 


GTQGKIVDLVKELDRDTVFAliVMYIFFK 
GKWERPFEVKDWKS 


5727 


19628 


A 


5771 


422 


8 


HIARWGARAPQGGWALLKAELCLNPAPP 
PIGftETAPLWPGCCFSLRPVFLLiiFPFF 
SLFIQSFRILYPQPWPLQGRRSTTLSRM 
GTTDHVI VLAS TWHAD I LDGALMRP GRL 
DRHVFIDNPTLQERRKCMRRLGASRN 


5728 


19629 


A 


5772 


432 


3 


YSLNLKKHWPVLTHNSVHYYKSSEKSAL 
KLGTWINSLCSWP PDEKI FKETE YWN 
VAVYGRKHCYELYTKLLSEASRWSSAIQ 



890 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 
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nucleotide 

location 

correspond 
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Predict- 
ed end 

otide 

location 

corrcspon 

last amino 

peptide 
sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=IIistidine, Wsoleucine, 
K=Lysine, lj=Leucine, ]Vl=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide Insertion 














NVTDTKAPIDTPTQQLIQDLKENCMTSH 
WEQIYKRKKILRYTHHPCIASAVEDHD 

ND 


5729 


19630 


A , 


5773 


20 


432 


SRAAALLBAVTETLFYYEVAEKIWSNRA 
NRQCADCGS SRPDWAAVMLGWI CKQCA 
GQHRALGS GI SKVQ SLKLDTSVWSNEIV 
QLFIVLGNDRANRFWAGTLPPGEGLHPD 
ATPGPRGEFI SRKYRLGLFRKPHPQ 


5730 


19631 


A 


5774 


433 


2 


RGAFQTLQKKDLQAFVLSSVQLQVLTGS 
CFKLRTVHNIPVPSNKVDDECPGLYGFL 
HVIVHFAKGFKQSATLSCALEVDSFGYF 
VSKAKTRVFRDTAEPKWDEEFEIELEGS 
PS LR I LWYEE YKUKSKVNWLYSAAVGFS 
AWQ 


5731 


19632 


A 


5775 


133 


436 


MLISLPHPSTHLTSLPFCADFGTRQTTG 
AASHVLTLSAHS S VCS CPPLMGEATTSR 
AQDLPADEHTAFAHELEAPALSKGQGLS 
ILEALTGHLPLGQEVNL 


5732 


19633 


A 


5776 


397 


58 


KGTIS IGKLEEKLRGAAEQALADAI I EF 
QLLPASLCTEDTPTGMQVKRSLGSRECH 
RSKQDSVMERAKVGFRGSTSENYYIRT 
TYSQLSVSLSGHHPGATLaaWGCSCSP 
T 


5733 


19634 


A 


5777 


1 


324 


MRRCVSAGGVELEEYYPAFLDMVRSMLE 
GS IDPTQYEDTLREMFTIHAYVGFTMDK 
LVQNIARQVSEAGVGPALRTAEMALPSF 
HS P LGRGP WS TANLS CF F L I E TK 


5734 


19635 


A 


5778 


1 


398 


RLGSRPS LHDQS PLELRSE IQELKGDVK 
KTVKLFQTEPLCAIQDAEGAIHEVKAAC 
REEIQSNAVRSflRWLFETRSLDVIWQDP 
SQVRLIRGISLEEGARPDVSATRWIFDT 
QSLDAIREI LVDE KDFPPS P 


5735 


19636 


A 


5779 


281 


12 


VACNLGWRVRTPTPASYPPCVPQLSVRG 
KNRKELLGGFLRWIVKSADEALITGMSG 
LKEVDDFFEHERTNLLEYHTLYCGRSED 

HATSR 


5736 


19637 


A 


"5780 


406 


23 


DSQATGPGISRGRRISSPSSLCTRFSTS 
I VKCS PLEHVLKYLDPPPPPTRCSRPAS 
LR3SPKHCVQQQFPIACRPGAAAPPPPP 
AQCPNPGPAAASAHHEPKWGSRTEGGPA 
RARRREGTGRRKKHR 


5737 


19638 




5781 


415 


18 


TSQAGDDLLHGSLLRRGPEMGYLPGYDP 
DPTLATPPAGQTLAVPSLPRATEPGTGP 

LTTAVTPNGVRGAGPSAPELLTPPPGTT 
APPPPSPASPGPPLGPEGGEEEPPPTII 
TTTTVNTTVTSPMYCGRSEG 


5738 


19639 


A 


5782 


168 


432 


GMRRQTGLDQQSRGVGKNWRVDFFFPSS 
LSFFFFFFGKESYFVLKFGIHGLKFGYL 
EPLSPGLKEFSRLSPPKGGHLKCRPPSP 


5739 


19640 




5783 


1 


423 


TDDDLNWLDHSRTFREQGVDENETLLLH 
RKFFYSDQNVDSRDPVQLNILYVQARDD 
IIiNGSHPVSFEKACEFGGFQAQIQFGPH 
VEHKHKPGFLDLKEFLPKEYI KQRGAEK 
RIFQEHKNCEKISERSAKV 


5740 


19641 


A 


5784 


273 


400 


DSWLLMLSPGVCPHVQVWPISCTLGAT 
FGY^/AGLVISPLWI 


5741 


19642 


A 


5785 


445 


28 


DTNHAPNHMEIKGSGGTEAKVHGADSID 
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SEQID 
NO: of 

sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

Dtide 
location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleuclne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














KEAAGDETMEAETMEAEATGAKPTGVEA 
TGAKVTETKPTGAEVREMESTEEEANME 
SKPTGAQATDTETTGVEAMGVEATKTKA 
EEAEMQAYGVGAGQAEPPVTGTVLRPL 


5742 


19643 


A 


5786 


380 


2 


QRQQESPEASSLHILERQVQELQQLLVE 
S QEEKES LGREVES LQSRLSLLEiSERGN 
TSYDVTTLQDEEGELPDLPGAEVLLSRQ 
LSPSAQEHLASLQEQVAVLTRQNQELME 
KVQIMENFEPCDECI 


5743 


19644 


A 


5787 


663 


2 


FGVERHATRTIRLTRTVLDLYSFLAGVS 
ENLRHATQDDASRTRAPGLSSQHPKPDT 
TVSGDTETG QS PGVFNTKE SGMKD I KS E 
LAEVKDTLKNKSDKLEELDGKVKGYEGQ 
LRQLQEAAQGPTVTMTTNELYQAYVDSK 
IDALREE LMEGMDRKLADLKWS CEYKLT 
GLQQQCDDYGSSYLGVIELIGEKETSLR 
KEINMLRARLQEPSAQANCCDSVY 


5744 


19645 


A 


5788 


398 


1 


TRCYHSALRYGGSFAELGPRIGRLQQAQ 
LWNFHTGSCQHRAKVLPPLEQVWNLLHL 
EYKRNYGAKRGGPPVKRAAEPPWQPVP 
PAALSGPSGEEGLSPG6KERRGCNSEQT 
GLPPGLPLPPPPLPPPPPPV 


5745 


19646 


A 


5789 


116 


371 


RHPKIWGLRGPPGLTLEKEGKGEEPLDP 
RSWGFKEAETPPRPFTRGKKGTPRLKKK 
PKERKTRERGGKKEGKKKGKKNEGKTNP 


5746 


19647 


A 


5790 


406 


284 


RRGFFHVDQAGLELPTSGDPPTLASQSK 
PGPLCPALLHTF 


5747 


19648 




5791 


200 


3 


GDALIYMEPEKQVMSRSSDECWALCDQ 
WLVSYSKKIPLVKLLTFDLKKKTISICPL 
P6TWIMEMY 


5748 


19649 


A 


5792 


372 


278 


LQIQSPAGQSRCGGFLVHEDFVLTAftHC 
WGR 


5749 


19650 


A 


5793 


1 


167 


VAPPKLDPHSQRVRAQAGLELLTLCDPP 
ASSSQTAAITGASHHTRQVLGSYKALS 


5750 


19651 


A 


5795 


112 


3 


SSWGKVGAHAGEYGA3ALERMFLSFPTT 
KTYFPHCI 


5751 


19652 


A 


5797 


356 


238 


FLFCNNGVYSLEHFRCIAEINAGDDAFVK 
DVTVLQNTDGR 


5752 


19653 


A 


5798 


379 


60 


ViaRHCHTMVSMSLLKKIRlILEEEQVE 
GNYSFCVLDNQNLQQLWDWDHRSLTIKR 
GKMYFAFNPRLCVSEIYRMEEVTGTKGR 

QSKGD INTRSNGERASCE 


5753 


19654 


A 


5799 


377 


3 


KVAFPGEGR(3GESWSAFQGHTRWSDLK 
WAVYEPDLMVTSSVGTYIYIWDI KDTRK 
LTVALFWAGASQVKWNKKNANCLATSH 
DGDVRIWDKRKPSTAVEYLAAHLSKIHG 
LDWHPDSEHNLY 


5754 


19655 


A 


5800 


143 


316 


CPTSPPRLPYTEFFGGVSGLTVEQFRKI 
NGFPNAFWGWGGEDDDLWNRYLCRSFGV 
F 


5755 


19656 




5801 


390 


2 


LFFFFFFFLRQSLTLVPQAGVRWRDLSS 
LQPPPPGFKRFSCLCV 


5756 


19657 


A 


5802 


342 


465 


VIFFFKLKQGLTLSPRLECSGTIITHCG 
LDLLGSSESPTFT 


5757 


19658 


A 


5803 


1 


378 


VYSSEFENIKEEYRGRGFPTICYFEKGR 
FLI QYDNYGSTAEDIVEWLKNPQPPQPQ 
VPETPWADEGGSVYHLTDEDFDQFVKEH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: In 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 

correspond 

residue of 
sequence 


Predict- 

Stion 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leiicine, M=Methionlne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Vallne, 
\V=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=:possible 
nucleotide insertion 














SSVLVMFHAPWCGHCKKMNPEFEKASEA 
LHC3EADS SGVLAAV 


5758 


19659 


A 


5804 


152 


~j 


KPE I FFFESRSVTQTGVQWCNFGSLQPP 
PSGFKLFSCLSLPSSWDYTABtV 


5759 


19660 


A 


5805 


207 


31 


LYLQLLQQTASSGNLNTLSSIiHPMGGKC 
FTAAEEQQHPGQQTQLTTVLGVWHRFCE 

NIF 


5760 


19661 


A 


5806 


199 




GSECSCLLSRGGEMHVSLSSFKVTFSCD 
AIIHFSFLFFWRQEFPSVQAGVQWRDLG 
SLQPPPP6V 


5761 


19662 


A 


5807 


385 


239 


QDQPGIjQKMCVCVCVCVCVCVCVCVI ic 
KSCIRLSYMKMPINFGLKNL 


5762 


19663 


A 


5808 


39 


184 


GRI 1 1 KFFGEMRSHHAAQTGLELL6SSD 
PPTWASQSAAITGVSLRIRP 


5763 


19664 


A 


5809 


376 


2 


WABPTSFHVCITSYTQFFRGLTAFTRVR 
WKCLVIDEMCJRVKGMTERHWEAVFTLQS 
QQRLLLIDSPLHNTPLELWTMVHFLVPG 
ISRPYLSSPLRAPSEESQDYYHKWIRL 
HRVTQPFILRSV 


5764 


19665 


A 


5810 


271 


120 


FLCFIGKTSER6SFSLYSRDTGLPGCQV 
SLMIKKSNEMVFKHINKSSINR 


5765 


19666 


A 


5811 


367 


3 


DKSFKEWHKMLEDANAPTEERDERIAGL 
YEEIERDLMLLQASAVEDKLQEGVIETV 
TSLSLANIKIWVLTGDKQETAINIGYAC 
NMLTDDMKDVFVIAGNNAVEVREELRKA 
KQNLFGRCI 


5766 


19667 




5812 


294 


37 


FRGENPDDGVRGSPPEDYRLRQVASSLF 
RGEHHSRGGTGRIiASLFSSLEPQIQPVY 
VPVPKVSHWAFFPNSSLEPLTYKAI FLP 
TA 


5767 


19668 


A 


5813 


184 


3 


I IS YLKNYLLFSPLQRQNFVLLPRLECS 
GTITAHCNLKLVGSGDPPAPASQVARTT 
GMRH 


5768 


19669 




5814 


468 




DDMEYEGSVSVFTQVPRKSVMKGIiDYTL 
MVAGEFGLGKSTLVNSLFVSDLYRDRKL 
LGAEERIMQTVEISKHAVDIEEKGVRLR 
LTIVDTPGFGDAVISTECXSKRKaMY 


5769 


19670 




5815 


24 


314 


QAPFGPKCENVNLCFSLSLSKGEPEFHY 
lAGAHGNEVLGRELLLLLVQFVCQEYLA 
RNARIVHIjVEETRIHVLPSLNPDGYEKA 
YEGVMRLAHLTQ 


5770 


19671 


A 


5816 


474 


3 


TTLRKGYSRHKGLLHSIRQEVQATPLEG 
IIVSGQMSTMNLSSTLWPSPKRVRRLCI 
GRTLRSLRIRPCKRTPLQTRRMPQKMRA 
LYGDPGDSPPLRSRAIPASSPGSTGSFR 
ARPPHPLHPPPHLTREGEPPLLPVLQPR 
RGPPLPLHSQASRPIV 


5771 


19672 


A 


5817 


511 


389 


GMYCTEGRE WPT FRNEI E lEEDHCGRV 
SASIILTHCSNR 


5772 


19673 


A 


5818 


510 


2 


PLSCEVPWPLFPVPGVRiyilQHHIEACAQ 
VRGMQEIVLIGFYQPDEPLTQFIEAAQQ 
EF IL PVRYLQE FAP LGTGGGL YHFRDQ I 
LAGSPEAFFVLNADVCSGFPLSAMLEAH 
RRQRHPFLLLGTTANETQSLNYGCIVEN 
PQTHEVLHYVEKPSCIAAAIEDQASSRQ 
R 


5773 


19674 


A 


5819 


282 


152 


FSLFLFLLQSERHFIMQWCEATQCPDT 
RVSVRLYEKSVLHLS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Phenylalaiiine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Trypfophaii, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5774 


19675 




5820 


347 


1 


EPHPVTPPAVGTSRYNPLIMMDRNRSAV 
TPPSHPPQQPSSMQTG^^NPSAIyIQGPSPP 
PPPPSYMHIPRYSTNPITVTVSQNLPSG 
QTVPRALQI LPQI PSNLYGS PGSI YIRQ 
TKV 


5775 


19676 


- 


5821 


482 


13 


lYEKLITGCYNILAHHADPNSGLDESIL 
EECLQYLEKQLES SQARKAMEE FFSDSG 
ELVQIMMATANENLSAKFCNRVLKFFTK 
LFQLTEKSPNPSLLHLCGSLAQLACVEP 
VRLQAWLTRMTTS 


5776 


19677 


A 


5822 


608 


7 


FRFAEKWEGMFIIVNSITIKIHSKAFH 
ASFELWQLQGYSVNPNWQQSDLRLTRIT 
DPCRGEVLTFKEITWQTLRIEADATDMG 
DQDP VTTPLRL ITNQGRIQ lALKHRTKD 
CNVISSKLMFLIiDDLLWVLTDSQLKAMM 
KYAESLSEAMEKSAHQRKSLAPEPVQIT 
PPAPSAQQSWAQAFGGSQGNSNSSSSPV 
LRPL 


5777 


19678 


A 


5823 


328 


3 


AE VAS ED CS LP CFLAVWNR 1 1 E P VAAMR 
KEADMLRLFPEYLKGEELFGLTVHAVLR 
lAESLPGVESCQNYLFRYGRHPLMELPL 
MINPSGCARSEHKILTHYKRPSLY 


5778 


19679 


A 


5824 


69 


308 


TSVPSCVRCRYIILRTSSALTNILGTHS 
NS S F B'HAS S S ALHS C CFPF FS WLQTLD I 


5779 


19680 


A 


5825 


424 


2 


LQRAFSESIRKLRGYQGADRKQIYHRRC 
SFAPTHSVRPSRDEKCNSNFFEQRHGGSH 
QSSKWTPVGPAPSTSQSQKRSSGLQSGH 
SSQRTSAGSSSGTNSSGQRHDRESYNNS 
GS SSRKKGQHGSEHS KSRS SS PGKPQTV 


5780 


19681 




5826 


107 


2 


SSLTAGVRMGVPAQSTQGTVNGSSPQMS 
GTAALTS 


5781 


19682 


A 


5827 




437 


FPTEDSRTSKESMSEADRAQKMDGESEE 
EQESVDTGEEEEGGDESDLSSESSIKKK 
FLKRKGMTDSPWIKPARKRRRRSRKKPS 
GALGSESDKSSAGSAEHIGPCDSTGDME 
VSSGLPGSRRPENPVFLCIWVAVRAICP 
GFLWS 


5782 


19683 


A 


5828 


487 


22 


MPEPVFPLSHFRQFIAAIKLQFQARLSR 
CVRDLVRSLAAPNHDTLRMLFQHLCRVI 
EHGEQNRMSVQSVAIVFGPTLLRPEVEE 
TSMPMTMVFQISIRWELILQQCADIFPPH 


5783 


19684 


A 


5829 


2 


871 


RGICSQRWRREGSQSRGPGLVITSPSGS 
LVTTASSAQTFPISAPIjllVSALPPGSQA 
LQWPDLSKKVASTLTEEGGGGGGGGGS 
VAPKPPRGRKKKRMLESGLPEMNDPYVL 
SPEDDDDHQKDGKTYRCRMCSLTFYSKS 
EMQIHSKSHTETKPHKCPHCSKTFANSS 
YIAQHIRIHSOaKPYSCNFCEKSFRQLS 
HLQQHTR IHS EMHTETI KPHKCPHCSKT 
FANTSYIiAOHLRIHSGAKPYNCSYCQKA 
FRQLSHLQQHTRIHTGDRPYKCaHPGCE 
KAFTQLSNLQ 


5784 


19685 


A 


5830 


463 


1 


LPESRLPS PHKEEEGSR&RVIMTSYPIE 
PHERRKGSLMWATLKQKKLEEMTRTE 
QEDSSCMEKLLSKDWKEKMERLNTSELL 
GEIKGTPESLAEKERQLSTMITQLISLR 
EQLliAAHDEQKKIiAASQIEKQRQQMDLA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

nucleotide 

conrespond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, IfcPbenylalanine, 

K=LysUie^'l^Lruc?ne'"M=Mrt^^^ 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon,/=possible 

nucleotide insertion 














RQQQEQIARHQQLY 


5785 


19686 


A 


5831 


462 


2 


MILFDSSLYFLCGWYLSNLILGTYGVRK 
PWYFPFTASYWKSVGFLVEKEQYFLSSS 
LFFFNEIIB'DNKGSSLQNREGELEGSAPG 
VTLVSVTKEYEGHKAWQDLSLTFYRDQ 
TTALLGTNGAGKTTIISMLTGLHPPTSG 
TI I INGKNLQTVY 


5786 


19687 


A 


5832 


396 


2 


VPSTPASKRKGIVPRCDIFIYDRGGALP 
KDIHVAGILFASAWMNVWPLLSAIEFHS 
AWAMQMGVNFFVASTHHVSLNMSGSGRY 
APNGRKVYHyDMKTEMGELLLSEVDSHP 

LS SrjAYPTADNWMAYATLY 


5787 


1968S 


A 


5S33 


384 


137 


E E I CLiNP G AGG CS E P RIaWHCT P AWAT KR 
DSSKKKEKNCLSKNKKEKEKEMFACVCV 
CVLFSIYAFPSKAQELSNPCKEYGNF 


5788 












FVEGINISGNFYRNKLKYLAFLRKRMWT 
NPSRGPYHFRAPSRIFWRTVRGMLPHMY 


578P 


19690 


A 


5835 


278 


404 


S VLI KFE KKTLKYKAI WPGAVAHACNP S 
TLGGRSRGITRSGE 


5790 


19691 


A 


5836 


417 




SKGKEGRCEWQYDLPSGTHAVFQFFLKQ 
QGRVPGLTQAVQVAKMSLIDLAGSERAS 
STHAKGERLREGANINRS LLALINVLUA 
LADAKGRKTHVPYRDSKLTRLLKDSLGG 
NCRTVMIAAISPSSMTNEDTYSTHV 


5791 


19692 


A 


3837 


409 


230 


AGRAYCYNGMCLTYQEQCQQLMGPGKAL 
LGSTALPSPPLPSQPPLALRWWNLGFK 


5792 


19693 


A 


5838 


3 


409 


LRSVPCKDYLTQNHYITSPLSHEEAAFP 
liAYVMVIHKDFDTFERLFRAIYMPQNVY 
CVHVDE KAPAE YKE S VRQLLS CFQNAF I 
ASKTESWYAGISRLQADLNCLKDLVAS 
EVPWKYVINTCGQDFPLTTTRPV 


5793 


19694- 


A 


5839 


385 


174 


GLAVEIGSRRIAEDGLELLASSDPPTSA 
SQSAGSTGVSHHAWPDTVSSVYWCFLCC 
T KQVWGALiS I VS L 


5794 


19695 


A 


5840 


260 


2 


PLPRYTAACSRLLVQYKAAFRQVQGSEI 
SSIDEFCRKFRLDCPIAMERIKEDRPIT 
I KDDKGNLNRCIADWSFYITVMDKLRQ 
CI 


5795 












FSGIKYICIIFLFVCFFLYLLGIGSPYV 
AQAGLTLLGSSDPPPSASQSPGIIGVSH 
YVQ P IHT VAQLS PS S I FRTFS S FQTET V 
PIKHTSHSPSPSLY 


5796 


19697 


A 


5843 


391 


2 


APHAPAFLRARGEPQDPLSHPRVPAVSA 
NCRMWKHLPVHSSPTPRLTPLWKLQARW 
LLPQLVYLQGWGSYSLLRPAALISMVLL 
AREFLYPAKMSVSEVCSSGLSSPLLEQH 
KTWL I F YASGD I CSAW 


5797 \ 


19698 






479 


59 


FVGMEPGVGHASLAMHGLHIRSLGKIKL 
RKEKCTLFFMTQEKSARHRGHLLGENI Y 
LLLFAIALRILNCLLVQTSFVPDEYWQS 
IiEVSHHMVFNYGYLPWEWTERLRSYAYP 
LICASIYKVLHLLGKDSVQMLVSGNKSS 


5798 


19699 


A 


5845 


406 


548 


SVQTKPFFETESHPVTQAGVQWHNLCSL 
QPLPCRSKQFLCLRLPSSW 


5799 


19700 


A 


5846 


2 


394 


TLCHRAKDPTVHHBSGRIMYLNGLCFLM 
GPAQLTQRLSVSRQGEC 


5800 


19701 


A 


5847 


472 


201 


LSILSDTSLCLGRFFTYENGCAYFHEEE 
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SEQID 
NO: of 
oudeotide 
sequence 


SEQID 
NO: of 

sequence 


M 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 

correspon 
last\mino 
residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutani!c Acid, F=Phenylalanine, 
G=GIydne, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valinc, 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














REGLftRICRLS IHSRYEDFWDGENVLY 
NKKPVIYRSAPARPQLGQSLCNQVMWYQ 
VAILNK 


5801 


19702 


A 


5848 


127 


3 


SSNVKAAWGKVGAHAGEYGAEALERMFL 


5802 


39703 


A 


5849 


238 


) 


MaETHPNPTGLPIAPALPOJVGSATHPC 
LFSPKlSLDVnADRDGWEKNWPKKVPS 
FQGRHHPEPRASEDTAPDPAGMY 


5803 


19704 


A 


3850 


423 


2 


LS PLGNWSSAWQKWFDSVKPSRKDLFA 

QATQAAGDWS3NMWGSDGILSHSAPLTV 
PQPLLTGPQGPCLCRPLPNQQTSFPALL 


5804 


19705 


A 


3851 


326 


682 


PSYSLLSLLQVKNEVEKLPRQQRKESMK 
QKI1EEHTQKKQLLVSPWPQLGSAGAGAR 
PGMVTRPPLCLCPIPAPQIPRSEYLNIA 
SS FMAPLS PPPCSLHRLSCLHLEVRNS I 
SGYPNCI 


5805 


19706 


A 


5852 


263 


30 


HEKTDDERGPGQSARSGAITKPPGPPLP 
lEPHETTPEHPAPSGTIPEPPLPVELHE 
TTPQHPVPSGTIPBPPYLLSQ 


5806 


19707 


A 


5853 


389 


2 


GLPTQREKFGASMRTRMTIQSIAWTSF 
VCLVGEGNlSrVQGFRAESRCWRYDPRHNR 
WFQIQSLQQEHADLSVCWGRYIYAVAG 
RGYHKDLNAVERYDPATNSWAYVAPLKR 
RCMPTKAKRWRGRCIS P 


5807 


19708 


A 


5854 


379 




GRSLRYSGSCSGEENSTHHSAGQSRAVI 
AAAARRRGNSHNEYYYEEAEHERRVRKR 
RARLWAVEEAFTH IKRLQE EE QKNPRE 
VMDPREAAQAIFASMARAMQKYLRAAKQ 
QNYNTMESILQHC 


5808 


19709 


A 


5855 


374 


2 


SDAGAPVNIYEFRHRPQCLEDTKPAFVK 
ADHADEVRFVFGGAFLKGDIVMFEGATE 

LWPAYNLTEQYLQLDLNMSLGQRLKEPR 
VEIKKSTIPPCI 


5809 


19710 


A 


5856 


516 


18 


PYECKECGKAFNCGSSLVQHERIHTGEK 
PYECKECGKAFSRGCHRTQHQKIHRGET 
PIKCKECGKAFSWGSSLVKHERVHTNEK 
SYECKDCGKAFGSGYQLSVHQRFHTGEK 
LYQRKEFGKTFTHGSKLVHERTHSNDKP 
YKNKECGEAFLWTTYSNEKC lASAK 


5810 


19711 


A 


5857 


392 


3 


CERGMGGSPEALGRHWTSFLKLRLNCSV 
PGDS TFYFDVLQALTGPVNLHGRSALFG 
VFTTQTNS I PGSAVCAFYLDEIERGFEG 
KFKEQRSLDGAWTPVSEDRVPSPRPGSC 
AGVGGAAKFTSSRDLLY 


5811 


19712 


A 


5858 


391 


3 


ARRTTGMVPKAGGGKGRRGAVFRSYIRE 
lEELRSKLVESEAMNESLRRSLSRASDR 
S P YSLGAS PAAP AFGGS PASSMEDASEV 
IRRAKQDLERLKKKEURQRRKSPEKEAF 
KKRAKLQQENSEETNGE 


5812 


19713 


A 


5859 


409 


3 


SLPGEGDPWWAGQEELLFVQEGKLSLPV 
AEKVLLEIACEYDPRSNSWAEIAPMKKC 
REHFVLGAMEEYLYAVGGENELRQVLPT 
VERYCPKKNKWTFVQSFDRSLSCHAGYV 
ADGLLWI SGGVTNTAQYQIJRQCI 


5813 


19714 


A 


5860 


405 


135 


NLSRLSLCRQPMTLVTWVLLFYDSEEKA 
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nucleotide 
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SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutannic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














FWLLVALWKRMLPDYytlTRWGECPRVL 
PRLGLQVPAGSMRVARLPRPSSRLHVGS 

LGVRG 


5814 


19715 


A 


5861 


389 


2 


YYSELSEQLTDEFNAELNRVPLKRLDLI 
FVTFQDSRMAKHVRKDYKYVQCGVQPQQ 
S3VTT I VKS YYWRVTMAPHPKD I IWKHL 
SVERFFVIWARFIAINTFLFFLFFFLTTP 
AIIMNTIDMYUVTRPCI 


5815 


19715 


A 


5862 


525 


0 


SHLPESERIHHTVGQKEQVLMDTSKTRP 
NI>roVPEPPMPIADQV3NDDRPEGSVEDE 
EKKESSLPKSFKRKISWSTKGVPAGNS 
DTEGGQPGRKRRWGASTATTQKKPSISI 
TTESLKSLIPDI KPLAGQEAWDLHADD 
SRISEDETESKGDDGTHDKGLTIC 


5816 


19717 


A 


5863 


399 


2 


ATRRNRNRVPSGMTRTfJVEEMIAAVGPG 
PSPYPLPPPPEGTSSIEYSNQGNTCQGH 
GNFDFPHGNPGGTSMNDFMHGPPQLSHP 
PDMPNNMAALE KP I S HPMQE TMPHAGS S 
DQPHPSMQQDKRENTPQPMY 


5817 


19718 


A 


5864 


3 


714 


RRPFIALCLSNVAFMLPWQFAQFILFTQ 
lASLFPMYWGYIEPSKFQKIIYMNMIS 
VTLSFILMFGNSMYLSSYYSSSLTMTWA 
IILKRNEIQKXGVSKLNCWLIQGSAWWC 
GTIILKFLTSKILGVSDHICLSDLIAAG 
ILRYTDFDTLKYTCSPEFDFMEKATLLI 
YTKTLLLPWMVI TCFI FKKTVGD I SRV 
LATNVYLRKQLLEHSELAFHTLQLLAFT 
ALiAILILRLKLVL 


5818 


19719 


A 


5865 


423 


1 


APPVS TAVAQSNS SEEEAHEVGS PAQE F 
KYQKSLPPRFQRQQQQQQQEQLYKMQHW 
QPVYPPPSHPQRTFYPHHPQMLGFDPRW 
MMMPSYMDPRITPTRTPVDFYPSALHPS 
GLMKPMMPQESLNGTGCRSEDQNCVPPC 
I 


5819 


19720 


A 


5866 


497 




AVGAGQKGGRGGGGRELMPPQKIIVGGGA 
FTNNAHVGPLKIMPRLIRTGMLEEIHLE 
PGWQGLFPCVDELSDIHTRFLSQLLER 
RRQALCPGSTRNFVIHRLGDLLISQFSG 
HSAEQMCKTYSEFCSRHSKALKLYKELY 
ARDKRFQQFIRKVYCGRCRGSQQGR 


5820 


19721 




5S67 


382 


74 


LALSPRLKCSGQI IAHCSFDLLG3SDPP 
PSASRVAGTTGARRHARLP YS LRIIYHFV 
LRLFNPSKKSKLKHFHPILIRNPGKVGS 

YSFQMSDLLQSRRi'vRQQS 


5821 


19722 


A 


5869 


378 


3 


SFSHSAI\'LISHQRIHTGEKPFQCAECGK 
S FSRS PNLI AHQRSHTGEKPYSCPECGK 
SFGWRSSLNTHQGIRSGEKPYECKECGE 
SFSYNSNLIRHQRILTGEKPYKCTDCGQ 
RFSQSSALITPV 










375 


3 


IHVIVFNQATGHVMAKRVFDTYSPHEDE 

AIWLFLNMVAP GRVL ICTVKDEGSFHLK 

D - AKAL LRS LG SQ AGPALGWRDTWAFVG 
RK3EAECHWADTELNRRRRRFCSKVEGY 


5S23 


J 972.'! 


A 


5S71 


373 


] 


Q ?E E VS GAL S PPSASAYVKLVLLGL IMC 
VS LAGNAI LSLMVLKERALHKAPYYFLL 
DLCLADGIRSAVCFPFVLASVRHGSSWT 
FSAIiSCKIVAFMAVLFCFHAAFMLFCIS 
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sequence 


SEQ ID 
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sequence 
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od 


SEQID 
NO: in 
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09/515,1 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£>=Glutaniic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 














VTRNMAIAHHLY 


5824 


19725 


A 


5872 


382 


2 


KSFSQKSQLIIHLRTHTGERPFECPECG 
KAFREKSTVIIHYRTHTGEKPYECNECG 
KAFTQKSNLIVHQKTHTGEKTYECAKCG 
ESFIQKLDLIIHHSTHTGKKPHECNECK 
KTFSDKSTLIIPHV 


5825 


19726 


A 


5873 


370 


3 


MGRVRAQNI SGVMSGPQKMLMQSQFPTQ 
GQQGFCEGKEPYQAMSQNMGNTQDMFSP 
DQSSMTMSMVGATRl.SHMPijPPASNPPG 
TUHSAPMEGLGRRRSDLTI S INQMGS PG 
IGHLNSTTCI 


5826 


19727 


A 


5874 


362 


2 


GGKFLVLGHLPSKLEBSMVQYyRLrVTAA 
SLVRGQISEYNISLRASDGGSPPLSTET 
HITLHVIGINDNPPTFPHLSYSACIPEN 
NPRGASIFSVTAQDPDSNBNARITyALT 
KDTLQGVY 


5827 


19728 


A 


5875 


369 


■ 


RIHPRPT7\RLAS7ii?TLHEVSLQES IRYA 
PGDAVEKWLNDLLC LDCLN I AR I VS VCP 
LPEACDLSYVNRDTLFWCHKASEVFLQR 
LMALYVASRFKNS PWDTiQMLS DAPAHRL 
LNKCLLCPPPV 


5828 
5829 


19729 
19730 


A 
A 


5876 
'5877 


119 

461 


39 


viqdyttppneklsrdlvnklkpymr 
"rrgwtsrrkpkedpsgaavpempkkssk" 
lasfipkggklnsakkepmapshsgipk 
pgmksmpgkspsapapskegersrsgkl 

GPGGTTLNHS I ss qtvsgs vgttqttgs 

NANSVQLPQPLCI 


5830 


19731 


A 


5878 


503 


3 


pspqvpgecpspkklgaraafttpdpap 
lspqsrvassgseqteeqgssensfqed 
gsgmkdvpswlkslrlhkyaalfsqmsy 
eemmtlteqhles QNVTKGARHKIALS I 
QKLRERQSVLKSLEKDVLEGGNLRNAIiQ 
ELQQI I ITPIKAYSVLQATVAAATLY 


5831 


19732 


A 


5879 


421 


34 


LVHKVLSASNVLVDAEGTVKITDYSISK 
RLAD I CKEDVFEQTRVRFSVHAMHYRTG 
NKGDVWRLGLLMLSLSQGQECAEYPVAI 
PSGLPGGFQDFLKKCVCLDDKKSMESPA 
WETKQNKSPANVLRPL 


5832 


19733 


A 


5880 


379 


3 


AQGLIjLaWPNLNRQHCPCSCSHNLSTFI 
QWWKSPGI,SLKTRAPGQGSV1,SSSDLPR 
AVLTGTFAVMSVMVGSVTESIiAPQALNG 
SMINBTAEDAARVQVASTLSVLVGLFQV 
WNSQEYVPPTTCI 


5833 


19734 


A 


5881 


429 


1 


SLVSYMDTESTAEKLGKELGLQAKELSA 
VHSSHHEIGVNDSNLFSLEMREPLESSN 

T KS FHSAVE I PCWR SQHE jPCFQHHG IDT 

PTSLQKRSACCPP SLLRLEETAS SSQDG 

dpqiwddt,pfsesln:<ft,tvi,h;seiavt 


5834 


19735 


A 


5SS2 


436 


3 


WSLVSYMDKKSTAEKLGKELGLQAKELS 
AVHSSHHEI6VNDSNLFSLEMREPLESS 
NAKSFHSAVEIKNRSQHELPCFQHHGID 
TPTSLQKESACCPPSLLRLEETASSSQD 
GDPQMWDDLPFSESLIKFKTVLESEIAV 
TQCI 


5835 


19736 


A 


5883 


195 


353 


DSYSYVRSTAPAVAVDSKQYYQQPTATA 
AAVAAAAQPQPSVAETYYQTGGFS 
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NO: of 
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sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 

ding to'' 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leiicine, M=Mefliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5836 


19737 


A 


5884 




1 


LSAKVP VAS VSDQAAAMHLS QC P KNLAT 
SLAELRTASQKAHKLCGPME IDSALNTV 
QTLKKELQDAKMAAVESQLKPLPGETLE 
KCAQDLGSTSKAVGSSMAQLLTCAAQGN 
EHYTGVAARETAQALKTLAQAARGVAAS 
TTDP 


5837 


19738 


A 


5885 


313 




GKS PHLL ILQTS PLSKAPQPL ILLLLYW 
SWVGWSVTAASRETEAGIQVSSEKPGP 
RNVS PHSRLQDETNLRLEAENNLAAYRQ 
VREVQGRDREGRKGLMATLY 


5838 


19739 


A 


5886 


432 


3 


GSGFCSPILCLWWVEEAVGIIRLHLWLR 
ELGCWHSFSLPGTYSLEFLSTCTQVLPY 
SWSGLLFASGNYLSLYLVPSRAETDSRG 
SQPLPVARHGVGACKCKPRQTRPGLSPQ 
VCVE KLMPLS S PCS AFQQHT YNKQPMS R 
PV 


5839 


19740 


A 


5887 


417 


2 


ASLMVAPDYAEISPLAMPAPSSGWCTP 
IMSTSSSEAMSTPLMLAPDSGELSPILM 
QDMNPGVMSTQPVPAPSFEAMSPLQITD 
EDTBAMSKVLMTALASGEISSLLMSGTD 
SBAISSLIMSAVASGGTSPQPTSTLY 


5840 


19741 


A 


5888 


420 


3 


KFDL IKKLDLKTMSSYDLDIEASDGGGL 
SGKCSVSVKVLDVNDNFPELSISSLTSP 
I PENSPETEVALFR IRDRDSGENGKM IC 
S I QDDVLFKLKP SVENF YRLVTEGALDR 
ETRAEYWITI TI TDMGTPRLKTEQS V 


5841 


19742 


A 


5889 


321 


414 


Fl*lRIWPGAGAHA<aSPSTL(SeRGGRITR 


5842 


19743 


A 


5890 


415 




PGLPGTSVKGIPASKQSPHESPRTUiLK 
TSPI IQQIiGLYLSHTAIRYHPQETLKEF 
VQLVCPDRGQQAGQVGFIiWPWGSSQDKV 
HNPFLPTPMLPPPPPPPMARPVPLPVPD 
TKPPTTSTEGGftDYPTSPTYSTPSLY 


5843 


19744 


A 


5891 


193 


3 


KPSSKVWSRHDPPLLSSVEQPIKDAVIT 
VPVPFNQAERRAVLQ?iflRMaGLKEMQLI 
NDWTACI 


5844 


19745 


A 


5892 


412 


2 


KHQVSDWKDSFYVSLYPDFSSLSRAILD 
WQFFKWKTVTWYDDSAGLIRLQELIK 
APSRYKLRLKIRQLPVDTKDAKPLLKEM 
KRGKEFHVIFDCSHEMMGILKOaLAMG 
MMTEYYHYIFTTLDLFALDVEPSV 


5845 


19746 


A 


5893 


413 


3 


ELLLCSNTSCCRCFCVECLEVLVGTGTA 
AEAKLQEPWSCYMCLPQRCH6VLRHRKD 
WNVRLQAFFTSDTGLEYEAPKLYPAIPA 
ARRRPIRVLSLFDGIATGYLVLKELGIK 
VGKYVaSEVCEESIAVGTVKHEWR 


5846 


19747 


A 


5894 . 


424 


3 


CSGRREPSVRGVQCKGDQGHHSARMAPS 
EAPGTRSGTPSHGQNTAiiEATPAQKTPA 
KWYVFSTEMaNKAAKAVLKGQVETMVS 
FHIQlilSSSKTERSTAPIMTQISAIiRND 
PKPLPQQPPAPASQDQNYSQWTHLQPCI 


5847 


19748 


A 


5895 


400 


1 


ASVQKPALRLVTREEFAIMQTPAGEIiYD 
KS I IQSAQQDS IKKMMKRENKAYSFKE 

QIIELELKEVSTRRHLRKVAAELGGLKL 
VAFPTAHMLFTWNILRPMAHSSDAPDQC 
LVHWHTFPS PLLHS FFHPQLY 


1848 


19749 


A 


5896 


421 


3 


VSS IQMGAVNLLGGGLDSLLGSDLGGGI 
AGSPAVGQSFIPSSVQATCAPSPTPAW 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
efh 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SSGLNDLSELSTWIGMAHGGYVAPKAVW 
LPAVKAKGLE S SGT FTRRQGHI YMJMD F 
TNKALQHMTDSAIQFTKNSFGVIPSCI 


5849 


19750 


A 


5897 


384 


1 


prasrfiheagivplpmrthtpkkpftc 
evagvqfsrnvkvkihmrkhtgerpysr 
qscsvrfvhsydlknhmhlhtgdrpyec 
hlch kafakedhlqrhlkgq i clevr tr 
rrrkddap phyp ppci 
^diMwqsltqvITtpIswrkMWtepq" 
ksqerykgiyvkeklyrearhdeslnwt 
scdhhesqeckgedpgrhpncgknlgmk 
stveqhhwhvlpqpftcntrcgvafadd 
tdprahpsthlgeksykcdqygkilsqs 

LY 


5850 


19751 


A 


5898 


429 


2 


5851 


19752 


A 


5899 


424 


1 


GTMLQALRAHDAGSRAHVLLSLSQQDGI 
EQHMDFDSRYTLLELFAETTSSEEHCMA 
FEGIHLPQIPGKLLFSLVKRYLCVTSLL 
DQLNSSPELGAGDQSEPCATREKSRGQR 
E LEFSMAVGNL I S E LVRSMGWARNLS EH 


5852 


19753 


A 


5900 


135 


459 


YFMLKAEHVISYVCSENSLMVWNMRFC 
LIGLIHIQEFYFFEMEPHRAIQAGVQWC 
DLCSLQPPPPGLKRCSCIPPSSSRDYRC 
APPYP 


5853 


19754 


A 


5901 


418 


10 


GRSPKPGDRLWESVHLSFPGPADSLSGG 
SAPASSYEPSQRSSFSSNRSQRGSTSTR 
NSSQKGSSVLSIKQKGKRELYMEKLQEH 
LIKAKAFTIKKKFSQNLVAQLWYFVKCV 
YFGLSAYQIPRGYPTRVLGNFLTK 


5854 


19755 


A 


5902 


402 


2 


GYRHPELE I KSVDGFQGPEKQP VILSLV 
RSNRKGEVGFPVEDRRINVDVTRARRLV 
TVICDYRTVSSHAFLKPLVEYFTQHGEV 
RTAFEYLDDIVPDNYSHEDSQGSSQAAT 
RPQGPGTSTRTKKQRQEGGMY 


5855 


19756 


A 


5903 


346 


1 


GICPLSHIKMTDARCQIYEMKGGEKSPK 
DTGKEPGHSEAKTGPPQVLAGVPAQPEA 
PQPGPNTTAftPVDSGPKaGLAPETTETP 
AGASETAQATDLSLSPGGESKAWCSPED 
PMY 


5856 


19757 


A 


5904 


139 


212 


EVENNDDISHHHHHHHHHHSNSKS 


5857 


19758 


A 


5905 


2 


345 


PCGCWGRCAIiLLISaAAK&KSKCGPTFL 
PCaSGIHCIIGRFRSKGFEDCPDGSDEE 
NCTANPMLCSTARYHCKNGLCIDBCSFIC 
DGQNNCQDNSDEESCESSQGVGYVCGFK 

KA 


5858 


19759 


A 


5906 


432 


17 


PQTTPHRTFGGGKAAWLLAVGGQFLLC 
WSPYFSFHLYVALSAQPISTGQVESWT 
WIGYFCFTSNPFFYGOiniQIRGELSKQ 
FVCFFKPAPEEELRLPSREGS lEENFLQ 
FLQGTGCPYKSWVSRPLPCIAAALRV 


5859 


19760 


A 


5907 


407 


1 


PYACGECGKSYRQSSSLVSHRRIHSGVR 
PHHCDECGEFFSRKYDLLIHQRVHSGER 
PYKCSECGKSFSHSCSLIflHQRIHTGMR 
PYECSECGISFIHSCSLITQQRVHLGTR 
PYMCSECGKSFSQSCKLIKHRSV 


5860 


19761 


A 


5908 


419 


3 


VIiAKKTIITKSiyiDCHEFGNILHLSTNL 
VASIQRPDKHESFGKHMVDNLDLFSRSS 
AENKYDNGCAKLFFHTEYEKTNPGMKPV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

location 
correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
C=Cj'steine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L.=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, '''=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














GYKECGKGLRRKKGLSLHQR I KNGEKPF 
ECTACRKTFSKKSHLIVHWRTHTGVY 


5861 


19752 


A 


5909 


441 


57 


KVMG I FKS S LFQALVD I QE F YE VTLINS 
QKS CEQK lEEANQVLQKWEKTSLLAPCH 
DRLQKSSELQSLIGQHSSLETTDGPQPW 
LFPRGSEGHRGKTYDHCHSIGLQIAVDQ 
RKMLHVLSKIKKIRV 


5862 


19763 


A 


5910 


3 


317 


CLLMLWRRWGSRGSRQRRLQATPSWEST 
KTADGQGDRRE SG S QRHCS SPHPQSKST 
TTSSVPAAREGAQEAAQGGPRKGQTSRP 
QVSTHQCAMPPARHRKLENDA 


5863 


19764 


A 


5911 


183 


1 


QNSDSKDSLNRVASRDHAKPNLTCHVSP 
AIQGTGSISESSIPSVSDTSTPRRSRRQ 
LPPCI 


5864 


19765 


A 


5912 


395 


1 


KTIQFNICVKVFHRFSNSNKDKIRYAGD 
KTFKCKECGKS FHVLSRLTQHKRIHTGE 
NPYTCEECGKAFNWSSILTKHKRIHARE 
KFYKCEECGKGFTRSSHLTKHKRIHTGE 
KPYICEKCGKAFNQSSTPV 


5865 


19766 


A 


5913 


449 


2 


FGSHLEKEDEKKQELVDKAIKPSIEATL 
ESIQRKXQYKEAESSRPEDIKDMTKAQI 
ANEKVALQKALLYYESIHGRPVTKNERQ 
VMKPLYDRYRLVKQILSRANTIPIIEEE 
EGSEADSNVKPDFMVNKKTDFSARMVKR 
PLGGSARPD 


5866 


19767 


A 


5914 


464 


23 


SAHMTETRS KS FD YGSLSLTGPSAPAPV 
APPftRVAPPERRKCFLVRQASLSRPPET 
ELEVAPKGEQESEEPQPSSSKPSAKSSL 
S QI S SAATSHGGP PGGKGPGQDRPPLGP 
TVPYTEALPVFHHPVAGTHKHEKPYLPP 
LYCGRCG 


5867 


19768 


A 


5915 


366 


1 


LVSFINFFTSVLATLWFAVLGFKANIM 
NEKCWENAEKILGYLNTNVLSRDLIPP 
HVNFSHLTTKDYMEMYHVIMTVKEDQFS 
ALGLDPCLLEDELDKSVQGTGLAFIAFN 
EAMTHSPACI 


5868 


19769 


A 


5916 


246 


366 


TSVKEQQMPGAVAHAC3SIPSTLEGQGGRI 
TRSGVRDQPCQH 


5869 


19770 


A 


5917 


400 


3 


NDNAPEFYQSVYKVTVIiENAFNGTIjVIK 
LNATDPDDGTOGDIVYSERRPVWPAWY 
APT INPKHGEIRTKGKLDFEE KKLYE I S 
VEAVDKGNI PMAGHCTLLVEVLDVNDNA 
PBVTITSI.SLP IREDTQPC I 


5870 


19771 


A 


5918 


410 


200 


CTPPQPVKCRFLNHDRVSSCCPGWSQIP 
GIKESSYLSLPECWDtRPVILLFCFLSH 
ISSKQPYFLPPSYR 


5871 


19772 


A 


5919 


214 


407 


MYFYIDHASLLSPRLECIWTIAHCSLE 
LLCSRDLPASASQSAGITGIEHHTWLKT 
HFYSSFKT 


5872 


19773 


A 


5920 


476 


39 


VPYGLFVGSTGRLGLPYLRGTSHPLRGC 
LHAAALNGRRLVQPLTRNKHEGCaEEPS 
AHDDVALGFCGSHSLAALPAWGTQDEGT 
LEFTLTTQSWQAPLAFQAAGWHGDFIHV 
DIFEGHLWSMVEKGQGTVLLLNSVPVTD 
AQPHM 


5873 


19774 


A 


5921 


430 


2 


SliRPSSTGPSPSGGLSEEPAAKDLDNRM 
PGLVGQEVGSGEGPRTSSPLFNKAVFLR 
PSSSTMILFETTKSGPALGKAVSEGAEE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ IB 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

correspond 
lug to first 
amino acid 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, "=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AKLGVSGSRPEVAAKPALPTQKPAGTLP 
RSAPLSQDTKPPVPQEEAGQDHPPSKAS 

RC 


5874 


19775 


A 


3923 


411 


2 


MSLKVDVEALENSAGATYIRKKGGKVTG 
DSQPKEQGQGVLKKKKKKKKGMVPKHYD 
PKVTPDPERWLPMRERSYYRGRKKGKKK 
DQ I G KGTQGATAGAS S ELDAS KTVS S P P 
TSPRPGSAATVSASTSNIIPPRLY 


5875 


19776 


A 


5924 


402 


2 


LQSGAIMNKFYQPHEAHIPYLLQLFIDY 
NLYGMNLINLAAVKFRKARRKSNTLHAT 
GSCKHHLSGNSLADTLFRWEQDEIPSSL 
ILEGVEPQSTCELEVDAVAADILNRLDI 
EAQ IGGNPGLQAIWEDEKQRR 


5876 


19777 


A 


5925 


215 


403 


AETSTYFPFFLERQFPFAPQLGGHGRIIL 
NSLNLLPRGLREFSRLSLPGGWIYGGAP 
PSPTFFC 


5877 


19778 


A 


5926 


320 




PGRRDRWIWGQGECLEYLLASMTQWCER 
EISSIAPGELCCLLLSFLPQEECQNYVR 
VLI VAGRKVFMCGTNAFS PMCTSRQVGN 
LSRTTEKINGVARCPYDPRHRV 


5878 


19779 


A 


5927 


400 


1 


WATDGGLPLLASSATVSVALQDVNDNE 
PQFQRTFYNASLPEGTQPGTCFLQVTAT 
DADSGPFGLLSYSLGAGLGSSGSPPFRI 
DftHSGDVCTTRTUDEDQGPSSFDFTVTA 
VDGGGLKSMVYVKVFLSDEMY 


5879 


19780 


A 


5928 


398 


62 


HNTEVLVGESVTLECSATGHPPPRISWT 
RGDRTPLPVDPRVNITPSGGLYIQNWQ 
GDSGEYACSATNHIDSVHATAFIIVQGG 
YPSSVPEPGLSPILTPPLARAAIPTDAP 


5880 


19781 


A 


5929 


403 


1 


LASDGGDPVLSGTSRICVKVLDANDNAP 
VFTQPEYRISIPENTLVGTRILTVTATD 
ADEGYYAQWYFLEKSPGETSEVFELKS 
TS GELT 1 1 KDLDYEDATFHE IDIEAQDG 
PGLLTRAKVIVTVLDVMDNftUV 


5881 


19782 


A 


5930 


408 


3 


VASDGGDPVLSGTSRICVKVLDANDNAP 
VPTQPEYHI S IPENTLVGSRILTVTATD 
ADEGYXAQWYFLBKSPGETSEVFJiLKS 
SSGELTIIKDLDYEDATFHEIDIEAQDG 
PGLLTRAKVIVTVLDVNDNAHV 


5882 


19783 


A 


5931 


126 


391 


PVLTCflHFHFSAGAGGSYFMISRSLGPE 
FGGAEGLCLYLATTFATiiMYILGAIQIL 
LVSQAEPWSTLAMIKQFLWTPNRRmVL 
I PTC 


5883 


19784 


A 


5932 


436 


359 


NKEWLPGSKVPEKS INDVKNTSGLL 


5884 


19785 




5933 


365 


3 


LIIGMLTAIIGDLASHFGCTIGLKDSVT 
AWFVAFGTSVPDTFASKAAALQDVYAD 
ASIGNVTGSNAVNVFLGIGLAWSVAAIY 
WALQGQEFHVSAGTLAFSVTLFTIFAFV 


5885 


19786 


A 


5934 


394 


3 


FQYFSRTDRVLKHERMCHENHDKKLNRC 
AIKGGLLTS BEDS VFS TSPKDNS LPKKK 
RQI TEKRS S GMDKE S ALGKCDLKKVKND 
YSPLYSSSTKVKDEYMVAEYAVEMPHSS 
VGGSHLEYASGEIHPPCI 


5886 


19787 


A 


5935 


393 


1 


REDLIAGIDEFLDEVIVLPPGEWDPNIR 
lEPPKKVPSADKEKSVPSLAELGQMNGS 
VGGGGGAPGGGNGGGGGGGSGGGAGSGG 
AGGTSSGDDGEMPAMHEIGEELIWTGRF 
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nucleotide 
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residue of 

peptide 

sequence 


Predict- 

otide 
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last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=I,ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FGGLCLDIKRKLPWFPSCI 


5887 


19788 


A 


5936 


410 


126 


PICYYKKAGRAEWLTPVIPSSWDYRHQA 
QLIFVFLVETGFHRVAQAGPELMTSGDP 
LAYNFLCSYPNEVFRSQASASDLQFVLR 
NLEPPDAGNV 


5888 


19789 


A 


5937 


426 


■ 


DATFHEIDIEAQDGPGLLTRAKVIVTVL 
DVNDNAPEFYMTSATSSVSEDSLPGTII 
GLFNVHDRDS GQNAFTTCSLPEDLPPKL 
EKS VDNYYRLVTTRftLDEEQFSFYNITL 
TAKDGGNPSLSTDAHILLQVADINDNAP 
CI 


5889 


19790 


A 


5938 


438 


3 


ADPTKGLLRNGSVCVEAPGGVSQGNSVU 
LKNSKQACLPSGGI PSLNMGTFSPPKQW 
SKESKAEQAESKRVPLPEGCPSGAASDL 
QSKHLPKTAKPASQEHWCSAIGTGESP 
KESAQLSGASPKESPSRGPAPPQENKW 
SPW 


5890 


19791 




5939 


3 


191 


YSVCLWSQLLRRLRQEDHLS PGGGGCTE 
PWNTERLSKNKKTKKREREKQCGFHSM 
WYCNLK 


5891 


19792 


A 


5940 


399 




QYSDDNDMSWKVTRAAAKCLDAWSTRH 
EMLPEFYKTVSPALIYRFKEREENVKAD 
VCHAHHSLFKQTSPVQSWLCGLDAMDQG 
GTPLAMLQSQVAKIVKaVHKQMXEKSVK 
TRQCCFTLLTELVNVLLGAVY 


5892 


19793 




3941 


411 


3 


SPCEGPRRFQCKSGKRVDGGKVCDVQRD 

CRDWSDET,I,KVMCGACI,RPLAGLSLLPS 

PSWYLGSRPSSAPCPDTFCSGPLFGFMC 
RFtiASEGAFRPQASGLHLYIO/LRACPSQ 
VLKK YVFSHKLGLS SFLPRS DHV 


5893 


19794 




5943 


431 


26 


KAWGIPGDMGPPGITWPGYNGLPGNP 
GVQGQKGEPGVGLPGLKGLPGLPGIPGS 
PEEKGSIGVPGVPGEHGAIGPPGHQGIR 
GEPGPPGLPGSVGS PGVPGIGPPGftRGP 
PVGQGPPGLSGPLVIKGBVSRVR 


5894 


19795 


A 


5944 


396 


3 


PLPVELIRVPAFUJLFMQSLFKPGARIS 
QDLKHKYIHILAYAASWETCKKNKRVS 
INKDELKSTSKAVETVHNLCCNENKGAS 
ELVAELSTLYQCIRFPWAMGVLKWVDW 
TVSEPRYFQLQTDHTPTV 


5895 


19796 


A 


5945 


465 


32 


ERVTUyDITWCTLLWLYKQVLEPSFRQ 
AFPNTNRWFLTCINQPQFRAVLGEVKLC 
EKMAQFDAKKFAETQPKKDTPRKEKGSR 
EEKQKPQAERKEEKKAAAPAPEEEMDEC 
EQAMAAEPKAKDPFAHMNKSTFVLDDCI 
AflAL 


5896 


19797 


A 


5946 


414 


2 


ATPPVRCIRKEIENWYVDIQPVQEPKaQ 
ACGNGHGI 11 lAETSTGCLFAGSSLGKR 
GVNADKVAIEaaEMLriaNLRHGGTVYEY 
KQDQLIVFMALMTGVSIIKTGPVTLHTQ 
TAILYDEQIVKVNCKENHSVDEQVY 


5897 


19798 


A 


5947 


411 


1 


EPCWHRIADSSVQTDDEDGESRYLLSR 
RRHARRSADCSVQTDDEDSAEWEQPVRH 
RRSRLPEHSDSGSDSKHDATASSSSRAA 
TVRAMSS VGIQTI SDCSVQTEPDQLPRV 
SPAIHITAATDPKVEIVRYISAPCI 


5898 


19799 


A 


5948 


153 


409 


LPFFPILLPFI3FFRYRVGLL3PKLKC3 
GVIIAHCSLELLGSSNPPTSASRVIFLY 
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peptide 
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od 
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nucleotide 
location 
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ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=I$oleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
■W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LSLCLFVPEMESCSVAQSGEQWRDLGSL 
Q 


5899 


19800 


A 


5949 


21 


447 


RAAAMSSRNFRFLSSHRELYFGVDLPSG 
NLWEEPADREQLCRAKAACVLTYDLVL 
EDPLELHKIHIHVLDTNDNSPLFPAGDV 
QLHIPEFLTPGARFTLPNAQDDDEGSNG 
ILSYSLSPSQHFRLDMGSRVDGSEYPEL 
VL 


5900 


19801 


A 


5950 


268 


19 


QDNWAYYCTVPEYCKTKLNILAWWLVIN 
YLSGDFYFFFIFFLDGVSLCHPGWSAVA 
RSQLTATSSSQVQGRFSCLSHCIAAAL 


5901 


19802 


A 


5951 


389 


3 


QQMDLARQQQEQ lARQQQQLLQQQHKIN 
LLQQQ I QVQGHM PPLMIPIFPHDQRTLA 
AAAAAQQGFLFPPGITYKPGDNYPVQFI 
PSTMAAAAASGLS PLQLQQLYAAQIiflSM 
QVSPGAKMPSTPQPMY 


5902 


19803 


A 


5952 


2308 


307 


RRRPARKKTWVRDGGPHQGL/ YPQLPSA 
\T1jQPTQPGHGPRALGKGRELESWQGKRP 
GK/GQTHRRKTRGTAS PAVYFSVEWGD \ 
SGGCPMTGKGAKQQRAGLGAAPNPELSG 
EQGTGRGRAQDSQGHGGGKGRRGQHPPG 
ND/PRGVSGMGQESVHSPEARPTGGGTG 

/emlgrtwrhqqglvggtgnlletgsws 
gsgswgrgwlrpqegqvttgmglagr/ 
yqparqravelspgaqglrqrrggwsgp 
pqeteqsgvpgg*gswpleshqgeq\pa 
agqkqslqmlpggfpcws/sgmgggpqq 
llesegagpspgggrhhgkgrvavtttp 
regd\rgqspgghtlqlfpcplws*etq 
gqgsrtvqhqewggegregegagsdqsr 
algia*icphtlrptsmepqstapapgq 
ppgppswghrghqgmgyggrcpgq/iqg 

RGRAQLGSTVG*RDG/QRG*MQGRDQQG 
PRSNGAGVGSCSHTSQK*IPPSSLCT*N 
SSHGPASG\QLWWSSPFIHSPGETNIPH 
TLTEPHS VPGWCVIDTLRRHGAGQGHPGM 
ARSGTGEGQ/QRGRY*ERGRRGRRQQKQ 
KKQGLKEPG*RAAPTLKGATRPLCRCLR 
KVQKPKQDGDVGS *LLKVFRAPGALGTK 
PQRTCRGPADFFHALSGLSNVHRSYCST 
WSGFGACM 


5903 


19804 


A 


5953 


461 


117 


HPRDI IGPVHQGLPPSPQPPPPCKQAPT 
TPSS\M* P*TNRHDTGLSGPVQVPGADW 

KPLNAP*LPEIiflFGEEPCRPGVPDPPYV 
WLPDSAPEHCVTWATSPGLSDLLSYPRR 
KQA 


5904 


19805 


A 


5954 


845 


610 


FFETGSY/SVAQVGAQWCNFGSLQSQPP 
RLKRS CHLSPPSSWDYRGAPPRLARFFF 
FCIFYRNGVSPCCPGWS *TPELQ 


5905 


19806 


A 


5955 


1 


378 


KTPVSDR \ATKCCSESLGNRRPCFSADE 
AYETYVPKEFNAETFTLHRDICTLSDKE 
RQINKHTALflELVKHKPKaRQEQLKAVM 
DDPADPVEKRCMaDDKETCFVEEG*TLV 
AASQAALGLLHRIK ' 


5906 


19807 


A 


5956 


723 


343 


GCQEGICPSPCPVIPATREABAGESLEL 
RRRRLH*AEIVPLHS\KAQRSLCSSDFI 
RILVI FSGMFLVFTLAGALFLHQRRKYR 
SNKGESPVEPAEPCRYSCPREEEGSTIP 
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ing to first 
amino acid 
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peptide 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IQEDYRKPEPACSP 


5907 


19808 


A 


3957 


55 


491 


AQHFNPHNWPVT * LLPHFTNEGGQVTEA 
RCLKWQAR\CLVPMQKVENAMPGT**PS 
AEVTPPVCWQHHWPQVEGIPSS/APAFQ 
WPLTMGHPC\WESAPRQDASTSPRPGTA 
SHC PSAVAFQERGS SPCPPHHSDSGLCL 
LPRLLPAP 


5908 


19809 


A 


5958 


2 


413 


KKPDQ\AE\REHliCTSL\WSGRNTDiasrG 
EELHGGKRVMERLKKAVKIAKTQCMDPSL 
QMQLYIEI LNRY I Y F YEKE YDAGT I QVL 
NQLIQKIREDLPNLESSEEPEQIT*HFH 
NTLEHLRLRRESPESEGPIYEGLILYE 


5909 


19810 


A 


5959 


871 


564 


WMCHATFGSHLN*AWRLRPV1ISG/LLM/ 
PSEGKAKKESTLMVGVTIS1PD\MKGKIRL 
LLHSDGKDEYVWNAGHPDILG\HPLTLQ 
CPIIKIKKKL**PKSGRTADGPDHSRIK 
V*VTPLGKEPTTDEVIAEGKENME*VTE 
TC* IDMN*VLQGQTWNLQR 


5910 


19811 


A 


5960 


390 


271 


GINFFSRN/RSLTT*SRLV*NSWAQVIV 
LPWPSRVLGFQA 


5911 


19812 


A 


5961 


333 


922 


GLSCRVFGRTLSCPPCILRHLSEVETLQ 
TLQKKEIEDLYSRLGKQPPPGIGAPAAM 
LSSRQRRLSKGSFPTSRRNSLQRSEPPG 
PGIMRRNSLSGSSTGSQEQRASKGVTFA 
GDVGRM* IQNRSHVSPPHQGPPWSLCSQ 
NLMLSDQQN*ARKI PTLKG* KA\GGHGE 
CSS IIVKSQTYVNCLLCGGVSSAAYHLH 
L 


5912 


19813 


A 


5962 


705 


387 


CVAQTGVQ*HDLGSLQPLPPGFKQSSHL 
SLPSSWDYRRVTPRLPNF\*FFVETDFC 
HIAQAVLQLLXSSraCLPASASQSAGVTG 
VSHRARPTLTLDIQAEIPDLKPSG 


5913 


19814 


A 


5963 


425 


182 


ASNQ*DSVGVGPSEPGAGYNLLVRFLKC 
\LEKHSSWVGVTQFSRCHLSPLSLTRKG 
NSLTPCTSRVRQCLALLWLTHGSRTH 


5914 


19815 


A 


5964 


388 


14 


PPMYTQLCSIRSTQAFCFVFFSRDE/ SL 
ALLPRLVLNSWVQAILLPWPPK:ViQG*QA 
QWLTLVIPALWEAMAGGLLEPRSSRPAW 
ATTQDLI STKENKTKSIiSASDAAELCVH 

WRLLKI 


5915 


19816 


A 


5965 


1 


373 


DDQRVKSVINLLLaAYTGDVSALRRFAL 
SAMDiyiEQRDYDSRTALHVSAAQGHVEW 

kflXleackvnpfpkdrwhntpmdetlh 
fghhdvfkilqeyqvqytp*gdsdngke 
hhtvhkhldgls 


5916 


19817 


A 


5965 


3 


329 


HEETSRYEIYELIHVHPYAVKQSFDLEE 
YSLSQSTLEQVGHF*GFVSSMVyvyKTF 
PGSCRGSTALFSSLYFLPPFQVFLELSK 
EQELGDFE/EDFDPSVKMKLLPQEEP 


5917 


19818 


A 


5967 


2 


47 


MEDRRLVRGIPCPQHNARQCPAVPPGIQ 
AYGAAPFEDLQVDFTEMSKCRGNKYLLV 
LGRTYSG*VEAYPTRTEKaHEVTRVLLR 
DLLPRFELPIiRlGSDNRPAFVADLVQKA 
AKILRITWKTiHAAYWPQSSGKVERMNQT 
IKNSLGKVCQETGLKWIQALPMVLFKIR 
CTTSKRAGYSPYEILYHHPPPILRGLPG 
TP*ELGEIELQRQLQAIiGKITQTI/YSP 
SKWPVSLFSPVHPLSPGDRVfflKKWNVA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 

correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=llnknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














SLCPLWKGPQTWLSPPTAVKVE* IPSW 
IHHSHVKP*TA 


5918 


19819 


A 


5968 


2 


150 


KSQS I *QSSLLSYFEKLPQPPQPSA\TS 
TSKQDPPLATRLWLAEGSDDH 


5919 


19820 


A 


5969 


3 


371 


HEGKSGPMTGWLVAVGEVAMKILLLCL 
CLILLRVESCRRKAARAALGMEAADAVT 
D* SPDSRLLPD\ PPHPVPPQSKRPCFLC 
RLCMVCPLLDSPDPSFPCHPTSPQTLRA 
KDSAALDTLHV 


5920 


19821 


A 


5970 


396 


58 


YW VL S KGPVT CMLLNS VF SFQAS S CLS F 
LINS*AP*PGIFLGMSVFILPFFVFVGQ 
AGLELLTSGDLPDSASQ\AEITGVSHCA 
QPHTALSQYLEQCLAPSRQLISVSLILR 


5921 


19822 


A 


5971 


442 


120 


I CMKQGIALS PRLDSS SAFSNT VQ PM P P 
RFKSSYGLSLP/RSWDYRCTPPHMAESG 
NGV*P\FMVETGSW*AAQGGLELLSSSN 
LPP/ SAFQSAGI TSKGHCTRPAVFLVC 


5922 


19823 


A 


5972 


459 




MWPLPWFSISSPSSPPPT*VSSNNHPCA 
VPRAYPGADIPGPGWQRLPPPHRGLSGP 
PBSCLLTTILCSKHPNGCPLTQNPRSLP 
GPRPCLIYSTPSPPP*AQL/YAAPSP*L 
YTWREALIR 


5923 


19824 


A 


5973 


681 


1758 

• 


VAITKCQIPDIKAKTYCICKTKEKRSYLK 
T * NLHRGF \ I KKQ I EE FN I GKRHLAMMM 
GEDPETFTQEDIDEAIAYLFPSGLFEKR 
ARPVMKHPEQIFPRQRAIQWGEDGRPFH 
YLFYTGKQSYYSLMHDVYGMLLNLEKHQ 
SHLQAKSLLPEKTVTRDVIGSRWLIKEE 
LEEMLVEKLSDLDYMQFIRLLEKLLTSQ 
CGAAEEEFVQRFRRSVTLES KKQL lEPV 
QYDEQGMAFSKSEGKRKTAKAEAIVYKH 
GSGRIKVNGIDYQLYFPITQDREQLMFP 
FHFVDRLGKHDVTCTVSGGGRSAQAGAI 
RLAMAKALCS FVTKDEVBWMRQAGLLTT 
DPRVRERKKPGQEGARRKFTWKKR 


5924 


19825 


A 


5974 


1232 


980 


SLSLSPRLECSGVTAHCSLRLPASSNSC 
FSASQVAGTIGACHHIWLVFVFLVETGF 
HHVGQTGLEIiLT\*VICPPWPPKVLGLQ 

A 


5925 


19826 


A 


5975 




259 


LTSYDYRRALPCLVNXFCVYFIiKREELA 
LLPKLFWNSWA*ATLLP* PPKIFGLQAQ 
VAMLRLKFPACLPACLPAYLPPFLFSFI 
PP 


5926 


19827 


A 


5976 


24 


223 


PYLTQISTNESSVCSHARAEFLDHQNQL 
/WLGTVTYTCNPS/TLGG*QGRIT*GQE 
FKTSICNTAEPCL 


5927 


19828 


A 


5977 


395 


136 


GSWBYTHVPPRSANF/LVETGFRm/A*A 
GLELLGNPPTSASQS7VKITAMSHCTWPS 
SYY*CEYNARFGSLHKRSGMTIYIKSYK 


5928 


19829 


A 


5978 


1 


785 


GTRLKSGTNTKKALQAVYSMMSWPDDVP 
PEGWKRTRHVIILMTD/GSEGTSLLSQP 
PHLLRPACGP*VHL*HYTHGWGPECDS* 
LAYHLS CDPS *GILPKPCDQLSLTLPQL 
AHPAMCI PAFSQFI FLISYPHQPVSSAQ 
LHNMGGDPITVIDBIRDLLYIGKDRKNP 
RED YLGE * PA*DP APHFLRAWTLI LP FY 
PSDVYVFGVGPLVHQVHINALASKKDNE 
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nucleotide 
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ing to first 

peptide 
sequence 


Predict- 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














QHVFZVKDMENLEDVFYQMIGREIQGNK 
EPNSPQVPLK 


5929 


19830 


A 


5979 




419 


KAIAPIDSDSSDGSGQNKLKT*KEFIIL 
NAIKSIYDSWEEVKISTLTGVWKKLIPI 
LIEDFEVFK\ADLVEIARERELEVEGEL 
/LQSQDKT 


5930 


19831 


A 


5981 


T ' 


154 


KKC*MMLTWWCMPVG/ PTPRAEAGGSPE 
TGNLRL* CTMIMSVNSHCTPAWAT 


5931 


19832 


A 


5982 


12 


311 


RKSLKFVEGCNI RLNKCRHTNFVPQT I F 
FFLLFL*LGQC* *VWFSFFFFFFFCFV 
LFFVFVCWFVFFCFCFF/GCLFWFLL 
/YFYELFCIVIIMCYYLR*YV*IVYMLF 
FVCWIFVLYLCFHY*LFLFFFVFVFLLF 
ICGVFWFYELFCIVIIMCYYLR 


5932 


19833 


A 


5983 


3 


203 


IGQKEASEDTT*GSAD/PKKSSAGPKRD 
ARQIYNPPSGKYSSNLGNFNYERSLQGK 
YRWPRLGHTRKS 


5933 


19834 


A 


5984 


368 


138 


STVGKMLSGQHPCYRE/II,/REQKSQSM 
*QISLLSYFKKLPQPFQPSATTALISQQ 
PSASRQNPPPAKEL*LAEGSYDH 


5934 


19835 


A 


5985 


2 


155 


LANF* lYFC/RDKCLPVLPRLVSNSWPQ 
VILPPWPPKVLGLQGMSPCAPRHS 


5935 


19836 


A 


5986 


325 


403 


FIYYLFFLSSLYLI ITYALFLSSF*R*V 
VYHMVMVLLKWICHHFLTLFFSFRSLI 
LYLFFILFM/YIiLI PVLFFIQFSFYLFF 
FYLFYYLFFNDYCFDLIFFLCIiIYYYHF 
QSYYFIS*FIYyLFFLSSLYLIITYALF 


5936 


19837 


A 


5987 


81 


651 


KLQVAGCRTLSQIDVKFKPSVHSSLAVC 
LRANYFTSQPHVIMQVTUiTRCVAHSKH 
LGTVSYSSWYIKLLFPKSHFIYLFILRR 
SLAPVAPGVQWLDLGSLQPPPPGPKRFS 
CLS/LPR*LDYRHPPPRLaNF\ *FLVEK 
GFCHVGQAGLELLASSDLPTSASQNAGI 
TGVTPAPSPQKATLNLILGGQFHI 


5937 


19838 


A- 


5988 


301 


341 


FCFCPCKNRVSLW*WHAPPCPANF\*LF 
lYLFLVETGFSHVGQASLKLLTLDDLPA 
AASQSAGIPGMSHHDGLFL 


5938 


19839 


A 


5989 


71 


1252 


TKEQRSLRSQQLGVRQQRSHRLKFGGGG 
APSARGVGGSWSLATRLGPVSTANMSRP 
VRNRKWDYSQFQESDDADEDYGRDSGP 
PTKKIRSSPREAKNKRRSGKNSQEDSED 
SEDKDVKTKKDDSHSMDSEDEKEDHKN 
VRQQRQAASKAASKQREMLMEDVGSEEE 
QEEEDEAPFQEKDSGSDEDFLMEDDDDS 
DYGSSK/NEKQKDG*EVQT*KKEKENAQ 
TQTKGYSnAKSSERQRESGSP\QASKAS 
KEKTPSPKEEDEEPESPPEKKTSTSPPP 
EKSGDEGSEDEAPSGED*K* *WSGERFY 
*KKKEKKRKKREEKKEPT*DRTWFWLWL 
DSWGFQCFFPFVESNISLSLSFFPFFFK 
AHHCMCKCLSYLFVYWSLCQPSPFPMKA 
MSN 


5939 


19840 


A 


5990 


630 


316 


RWNVNS VA \ (3AG\ IQWCDLGS TGNPRP P 
GF\K*FSCLSLLSSWDYPTHATMSSKFL 
YF* WRRGFTMLAELVSNWLVLNS *PSDP 
PASAAQSAGITGMSHRAQPIYLS 


5940 


19841 


A 


5991 


2 


182 


WQEQKAE/CLRDVWTAMCTaALtT?IAK<3 



907 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RVNIPCPMMKEWVRKMGPIYTFEYYSAV 
IMQVILKF* *HATTGMNPDDVISEVCQS 
QKDK*RILLSCNNAGNS 


5941 


19842 


A 


5992 


79 


278 


EAK*RVNGCSKPTWHMYTYVTNLHTYV/ 
I KPAHCAHVP* KLKYKIIITIIK* *RWC3 
LTILPRIiVSNSWAQVILLPWPPKILGLQ 

A 


5942 


19843 


A 


5993 


22 


274 


AGDEHRAWGYQPAM*ARLEMNTGPGDT/ 
MPfiMflSQRAGITGVSH*GCPFFSLLNKI 
WRSH*VAQAGEQRLFRE*TPGLSCYSCS 
LPSN* DYTSAPLCPP YDT 


5943 


19844 


A 


5994 


347 


195 


YMHTKHTHTHTHTHTHTHTHP \EKHLKA 
PNEV* KSKNNV* IKPKQSQSCFS 


5944 . 


19845 


A 


5995 


1 


122 


RYELEDELHLQMTTQLHALKENKYQLR* 
SSHVSIPST*QYRHVPSCPANF/SSLTM 
FPSLVSSCWAQVIHLPWPYKVLGL * ALK 
ENNYQLRTIIIDCTLNKMIF 


5945 


19846 


A 


5996 


365 


229 


RPPPCPAWF\*FLVETGFHHVGKAGLEL 
LTSSDLPAPALHQVFYA 


5946 


19847 


A 


5997 


357 


17 


ILVDKINKVWNLFQHKLWC*RGWGYK* 
DRSFHLLKTWAE*WVYEGSLY*SLSSY 
YLF/C*RQGLTiyiLPRLVLNSWHQAILLP 
WPFSLLLCIIiEVFQNRELKKKNQGKPMT 
GS 


5947 


19848 


A 


5998 


1 


207 


PILYNLFQKIEABGILPNSFYEASVTLL 
PKPDKDITR/ITTY* FILIiT*NILNKVL 
SNQIQLYIKRITY 


5948 


19849 


A 


5999 


603 


269 


EDRVLVLLPKRKSAVAWPRLTATSASW\ 
VQRSFHPSLPSRWDHRHVPPHSANFCCV 
CL*RQGIiSLLPGIiVLNSWAQVILPPWP\ 
PKVIiGITSMSHHARPLS FYYGHFKYIQK 


5949 


19850 


A 


6000 


3 


150 


dyrhaplhlatfiCk/ffcrd/rglsvip 
glvsnswpqtvlpp* ppkalglqa 


5950 


19851 


A 


6001 


365 


62 


AITAHCSLNLLGSSNPTASATRIAGTIG 
EAAMSYF*! SSTLTHFSYT/C*LHHVWL 
IFKFFFVDKVYVAQASLELLSSSDPPAL 
VSQNARFTGVSHCTEPV 


5951 


19S52 


A 


6002 


537 


236 


drvslccpgwsavarfqltatsafr\vq 
*ssclslpsscdyrsappcpflnflsfcr 
deggglpglprlvsnswaqaihppqpps 
witgvingtwpkipls 


5952 


19853 


A 


6003 


22 


368 


NFFIiQNKENIECRKrF*ERKHKLMQRTST, 
LSYFKQLPRPPTTSQQPSTWRQDPPPAK 
R/LRTYKGLDDR 


5953 


19854 


A 


6004 


327 


335 


NQS IFTFMMYPRPF/ CCKFKFKN/ TLHL 
WADKVAHTCIILSTLEGQGGRIA* GQEFE 
ATLANMVKP * LLCKVK 


5954 


19855 


A 


6005 


2 


367 


WQFLKKLNTKLLYGPVF/PYLGIYKQEK 
WKHVHAETCTQMF ITVLFI IVPK* KQPK 
CPSMEE*ID*M*FVHMLEYEAIKENKLI 
HATWMNLRNTML SEISQTQITT CMNLLH 
KMFRKVTETES 


5955 


19856 


A 


6006 


1 


176 


PGFKQFFCLS FWTRGI FI EMGFHHVG* A 
GLGLLMSGDPPASASQSAGITGWSHCA 
WTY 


5956 


19857 


A 


6007 


1 


289 


EVLTKI P IH/ FFVEIGKLIQDLKDTGPR 
IA*TMLTKKTEMGHIVHSDLVAYYIVAV 
IEMVWYYW*RDEHINQ/WEELEISEIDS 
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C=Cysteine, D=Aspartic Acid, 
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K=Lysine, L=Leucine, M=Methionine, 
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QNMFNWVFTKMQKLF 


5957 


19858 


A 


6008 


439 


223 


FFSPGSH / S IT*AEVQW/PHHSSLQPEP 
PSLKQSSHLSLPSSSDYRCAPPHPADLC 
RDWSPCCPGWSQTLGLK 


5958 


19859 


A 


6009 


1 


252 


MVS IS* PRDP PALASQSAGI TGMSHHAQ 
LFFLN/CIS/ELISSTVTPGLKESACLG 
LPKCWHYHREPQHPACTVLYLQFQSDFQ 
LK 


5959 


19860 


A 


6010 


131 


514 


pwpyslsssgllssfen/vekwvpeiih 
\hslflgttidles\epcttekln\kdk 

YNPLTPQMADMMVriDLKWEiLECSAFM 
WKRLK\iVFDEALWAALEED*RSSCRKET 
STLLQSTPLLVSYALPCTP 


5960 


19861 


A 


6011 


625 


141 


etgsrfvthagvkwiinhds lhpqp prlk 
*fsyfslllsswdyrhvpprpanlvypl 
*rggppsmlpqaslef\ldsk*ippclq 
ps/akvlglqvshrpspkvtfhqrakeg 
dwshwsrgks i pdrgnsqcmhrs ipc 
vfeeqkggqcycsieskeetsri 


5961 


19862 


A 


6012 


302 


42 


GliAMLPRLPGTPGLKQSSCLSLPSSWDY 
iaiILS*PAVY/C*FLKNlK 


5962 


19863 


A 


6013 


363 


82 


vsvslspslktkvnyavasasftiakiw 
kqlkcpsmdkqmk* iwhihtmeyysa/i 

kqkimsysttwmkledimlseisisqay 


5963 


19864 


A 


6014 


329 


99 


HIFGNTYFFQVASSFIYLFHDSVSLCRP 
GWSAVA* SWFTATSG\FRPSACLGLPKC 
/WDYEREPPRPANVLFSMEVTH 


5964 


19865 


A 


6015 


324 


238 


lamlSrlvsnswpqaillplppkvlglq 

A 


5965 


19866 


A 


6016 


2 


308 


fwkaiagiegdssdkceq/.sklktiwkk 
fisldamrnihds *ekvkms i fwkmipt 
prddf/eglktsvekrtedvvktarq*e 

i:iKVDRT*TDEELLLIiDQQRK 


5966 


19867 


A 


6017 


2 


457 


ffflrwsl/dsvaqagvqwrdlgslqap 
prgftpfsclslpsswdyrrplprpanf 
fyf**rrgftmlarmvsis*phdlpivla 
sqsagitdvshreaehviskqhivsvmw 
kplpeihipcilnlslrfnursvaglrn 
slivrmlsilthg 


5967 


19868 


A 


6018 


389 


142 


AHMLFAAQGDSSIPMLVAPLFTWKMWE 
QSKLPLaDEWSGFW*HIHVTEY/YAIKR 

REILTHATTSLKALCYVI*ALTKGRTL 


5968 


19869 


A 


6019 


384 


232 


LNLPSSWDYRRAPSHSA/NFSYF*RDGG 
LT^^L\PGWPQVIQMPHPLKAL*LQA 


5969 


19870 


A 


6020 


333 


84 


GLIDQEFATDTTKSTSLSLT*T/IPSRC 
SLCLRGSSDSPASC/SQPPASVAGITDT 
CHHAHIIFVFLVKTRVHHVGRAHTEKPH 
LH 


5970 


19871 


A 


6021 


106 


366 


dtisfffflkapiepvfiififfyfffi 
fff1*fffllflflfifyfvf/ifffil 
lclfffflvllfyllfff/slfflfcfv 
fcfffifcyfyfylffilfffffyiialf 
i ffffgs sflf ivflflfflfcfvfcff 

F 


5971 


19872 


A 


6022 


384 


!8 


HEKKPGPGTGKPPKKKGGPKFGPKGPKK 
RGEKKKKGGP/NS FFPRGGGTPGGNF*R 
GFPPEWQKKRGEKKFFQGGKRGAIsrPGI P 
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Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PLLGPKABGGFIPGKKKKKKKNSNKKSK 
QTKKALSRTSHS 


5972 


19873 


A 


6023 


3 


191 


LVKGKIRSWEETWVKTW/RFYPGVMAH/ 
TYNP1IPLGGQGRWIA*AQEFKTSLGNVV 
KPHLYKKKI 


5973 


19874 


A 


6024 


364 


142 


I IHGLI IHLHNFKINGHTLSTWYTFHIL 
CVCVYIYICLC\CIYVSVCLCVCVYTHI 
QYTY * INKHI C* RLEAK 


5974 


19875 


A 


6026 


338 


102 


WGFT I LARLVSNS * HLVI CHLDLPKFWN 
YRHEPPCLALFLIFVGT**VYIFMGYHR 
Y/ C * YRHAMCNMHI RVNMVSVGLNF 


5975 


19876 


A 


6027 


99 


77 


RE I CTPTFTKAFS I EAKGWK* PKQPLTN 
EWINTMWCIHMRGKF*P/YAKTSMKLED 
VFHE I SKSQKDKHY YI * FLS PT YVKIQA 


5976 


19877 


A 


6028 


2 


328 


tityrgakiritsdsssetmqaree*se 
if*vlk/ekkkth*pkilysgklsfksk 
geik/yf*gkqn/lkefvssrpvlqgml 
kwlqregklyrsetqtykkkekaseke 


5977 


19878 


A 


6029 




241 


GFAPLPRLECSNTNMAHHSLNLSGSSDS 
PASANFSIF\IETRPGHVAQAVL*FLSS 
SNPPISPSPSVEITGVSHHARP 


5978 


19879 


A 


6030 


108 


362 


CCYLMVTEISAHQNPAKQGLA/HTAFP* 
KTFFLKVSWTGTVAHV*KPSTLGG*GGR 
TS*SQKFKTSLGIMRPPSLQIKKKVFKI 
YP 


5979 


19880 


A 


6031 


1520 


77 


ASSSSSSSSVILSS *NIPNYNYIVIFM/ 
CL*IIF*AaSSSSIYyVPGICyiYH\IL 
CVCVCICMCIYVCICIYSIC*YIVYVSI 
YYIHY 


5980 


19881 


A 


6032 


405 


100 


EGFFLPPRGGGGKR/PPGPLTPGGPGNP 
RPKPPQKWGKPGPPPGGGPF*WVLLSPS 
LPGGQVESPGAVSGPRQVPPSGSRPFSP 
TAVGTTVTFLSKKKKKQV 


5981 


19882 


A 


6033 


42 


309 


CDKFFHKASNHIYVS TYIWVFQI CVCVC 
VCVCVMCILC*AY/ CGCLGIMIWFLFVL 
* CI /CYCCY*SSLFSVI ICI FLMCFVSY 
ERRFILFA 


5982 


19883 


A 


6034 


397 


2 


KGPPP*GGPDFF \ * PRGGPPQKNPFFPP 
GGGPPPPKNGVGGPPPQKAPFFGPKEGA 
PQKGGPPGGGPPRGEKKSLCPPPNWENT 

STGLEAPFQSVFIVELVEDMS 


5983 


19884 


A 


6035 


343 


2 


LWPPQGSFKTAAPFFFFFFLFISDFITF 
HFLYM\YVFFIPLFLL*SFFCIK* IFYY 
VTF*FLQSFSILFFSWFL\WLL*SLLY 
SF*LIRIYFRFILT*FQ*DTETLFFYYS 


5984 


19885 


A 


6036 


319 


68 


SLGEICAPK*FFPPAQKKGPFQKFPSCK 
FSPSPVFKTRPRAKF*KGPP*GKKFyYA 
NPGKI \GPPKGSFKRPPLFFFFFFFFFF 

P 


5985 


19886 


A 


6037 


419 


191 


RIHTIPITALFAVAKR/WQSKYPLVDE/ 
MDKIWYIHTMED*A&YGKKE IMSHARTW 
INLKNIMLSEISQSQKDKYFPFI 


5986 


19887 


A 


6038 


434 


401 


YGKNFSFSLKV*IFCEGLVLVFFPPKKK , 
/FFFKIsrFHRWFPLYVFF*TGRGWFFK 
PPFLEKI FFFLTRVlSrLGPPRGFF*GAPF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFS 
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T=Tlireonine, V=Valine, 
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X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














*NFYFMN* IFTVI 


5987 


198SK 


A 


6039 


12 


345 


SKSFVRRSLTVTQAGVQWCHLGSLQPPV 
LPPS FK* FS CLSLQSNWDYRCLPS CLS W 
FCIF/M*RRGFSMnARLVSNS* PQ/CDP 
PASASQSAGITGVSH/SHLV*NA*SFKT 


5988 


19889 


A 


6040 


275 


45 


KDVQHQWKIVQPLWKQYGSLKASTES PH 
DPSILLLGTYPRKMKT*IHIK\TCIRMF 
AEALFLISKK+KLPKCLLADK 


5989 


19890 


A 


6041 


355 




FGTI KNFFFYIFFFKNYF/ FFLYI IFFK 
KKVLFFF*KKKIFFLFIPNKKIFWVFFF 
FFFFFFFFFCPRFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLVTRHLF 
EMRKGN 


5990 


19891 


A 


6042 


635 


209 


PLNI KKTFFFLRQS FTLVAQAGAQWHDF 
GS PQPPPPRFK* FS CLSLPRSWDHRRPP 
PRPANLVF\DFLVETGFHRVSQDGLDLL 
T/S/GDLPALASQSAGITGVSHCARPTL 
EFLKVKEMIFCSTTVKAVLDHANFLILR 


5991 


19892 




6043 


12 


348 


ESRYSS ICLPFSSVYCRR I I SLQLYS YV 
LLVYILFFF/CFLKREFGFCPPLGVQGP 
GLGSLNPPLLG*KQFSGLIFPGIGNTGL 
APPSPPNFGFFRKKGVSPCGPNRPAIS K 
F 


5992 


19893 


A 


6044 


192 


1 


IFHRFCT/HITLP / TTALNMN* PWLGTM 
AYAYNPNTL* G *GRRI V* AQAFKS SLGN 
MTRPCLYKK 


5993 


19894 


A 


6045 


338 


42 


WKTAWW/FLKKLD/MNYC*DPAILLLGI 
YPKELKAGT*TDIRTSMFTAELFAIAiai 
WKQNVDLKHLRQNE CRQF IWAKVEGNWL 
PSWFGEWSIQHRLQAGF 


5994 


19895 


A 


6046 


300 


3 


RLDCHYLHSHTHTHTHTHMPAGIYTHH/ 
HCWNLHTHWPGG I LGVALDL * MQFLLNR 
GWSWGGEQMAWPCPQGHVMWLGDPGKMI 
LKPPIWVCQLQGSSYDG 


5995 


19896 


A 


6047 


332 


3 


SLESAGFT*yKMYFYyYYKIK\yiyKSI 
FIILFIj*IFFFFDLFFIPFFFIFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFLKSP 
BCRVIYFGFGTQNPQMFSLLTLKYVLC 


5996 


19897 


A 


6048 


2 


152 


KNLKISWT*WLMPWSAT/WGGSLKPGR 
LRLQ*AVlAPLHSSIiGDRARPYL 


5997 


19898 




6049 


1 


288 


PI 1 1 YVLLLLFFETDSHSVT\RL* CSGA 

fsthcnlclpspgdssasds/rvagvap 
aclf*pyrgfsgpatiiafraldpglplh 
pgfslqrpscsrgg 


5998 


19899 




6050 


358 


1 


FSIFSILIPDFLGGFLKFFAFFYGWDFL 
RFPFFYR* PFGPKSSFIFPHVF*R\WIW 
FLLLSSWFLKWSFDPQFFLKRFFFFPFF 

KGVWLI 


5999 


19900 


A 


6051 


308 


45 


ADLSAEaL*TRREWDDIFKVLKEK/NKG 
QPKI LYPS KLSLINE/NE I KS FPDKQKL 
REFTTKLVL*EMLKGItHMEAQGQYLPS 
*KHTKV 


6000 


19901 


A 


6052 


2 


124 


IFCVLVETGFHWVAQaGL*LIiSSGSPPA 
S /DLPNAGITCVSH*AGITCVSH 


6001 


19902 


A 


6053 


I 


330 


LC* PGWSR\FLiS * SACVGLPKCWdVrR 
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X=X]nl(;nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EPLHLVFFFFFFFFT* lYLKGPYLNPRK 
IFKNFFPLLKNPFFGVPPKIQFSPIPNL 
PSLWPNMVFPKEVLGEKIFRGFFPP 


6002 


19903 


A 


6054 


34 


293 


SPILFLLCLRHQSGCHPSWTAVAQSELT 
AASNSWAQAILPHSWDYEHRPPH/LNLF 
GWSHAALPRPDSNF*PQAILLPQPQVLG 

LQA 


6003 


19904 


A 


6055 


319 


217 


YHFTPTG^4MSKNRQKTGDNKCC*GCRE 
IGMLVCC*WQYKMVWLL/WKTVWHSLKG 
IK/L*ELPyEPAIPL*GIQRVSWKPIPH 

SSEG 


6004 


19905 


A 


6056 


341 


151 


VHAGGSPES/REFETNLGNKARLHLYKK 
* ISQMWWCIPAVPATHAGPSLHFNLYYS 
PWLFQCDY 


6005 


19906 


A 


6057 


2 


558 


FFFLRWSFTLIAQAGVQWCNLSSLQPLP 
PRFKRFSCLSLPSSWDYRHVPP VRIAWF 
FVFLVEMGFLHVGQASLELIiASGDPPSG 
DPDLR*LiAFQSLGITGVX/HHAGQIFLF 
/ CLETES PSVAQVGVQWCNLGSLQPLPP 
\GSE* FSCLGLPSSWD\YRHALTHPfiNF 
\SIFSRDGVSPSWPGWSQMPDLS 


6006 


19907 


A 


6058 


320 


21 


EICMTTNHPSVSNTIW/SKMTLASVYLT 
LLPISKTSKTSGLPASKEVNRYLGTCAI 
RYFVFYFIIMLFV*EQGLTMLPRLVSNS 
WAQVILPPWPPKVLGLQA 


6007 


19908 


A 


6059 


294 


50 


CYFSLSFGL*QFLSLNQSFVT'LSSHCDR 
DTFE*SWSVI/PGiGVYSS*E*TRGLAM 
MPRQVYNSWAQAILMPWLPKMLEVQT 


6008 


19909 


A 


6060 


616 


354 


brvspchpgwsavalsqptaaltsw\vk 
qsshlsllnswdykcmppllanfNkffc 
rne/sllmlprltsnpwflqvicpp*ppk 

VLGIilA 


6009 


19910 


A 


6061 


214 


11 


AASTRPSAWQPPLLGSEEPLCPATTPSG 
RCTQQ/LH*ERRMMTMAVLWNRKGGKVG 
KRLRMRLVAMSV 


6010 


19911 


- 


6062 


349 


54 


QSKSAFPKKRINRISDSSEGYGQSKLKT 
FWKGFTILnAIKNIOSrSSEEVKIATLMG 
V/WDDFEGFRTSVKEVASDVC/ETARQL 
EVEPESVTG*QQSHVQP 


6011 


19912 


A 


6063 


332 


3 


SQPASGQISKVTQSTLRCLKMSVQQVKS 
CYKCTATYLCAKFNSHEFRSSGVCENPV 
SYFYTHS IRSKIL/WPGMVAHTCajPSAL 
ESQGGCMT*GQHCETSMVNMVTPRVSRA 


6012 


19913 


A 


6064 


174 


416 


NENELFFCFFFXI FFFFXFFFFFFFFFF 
FFFFFFGFFXFFFFFFFFFFFFFXFFFF 


6013 


19914 


^ 


6065 


826 


571 


DGSHSliA\RLECSGVISflHaiLHLPGSS 
DSVASAS*VRGTTGTCHHARLIF/WFL 
VETRFHNLGQSGLELLVIHPPWPPKILG 
LQA 


6014 


19915 


A 


6066 


1 


284 


GTSFFF*NRVLFCCPG*RAVPQ\SWLTA 
ALTS WAQGS S*NHRLKP/HMLS FFNF CK 
QGLTMLPKGMiNSWAQAILQPQPPM*LG 
LQACTYAWSLGP 


6015 


19916 


A 


6067 


36 


384 


VSKNKPIHLFLFFFSKTKSWSVT \RLEC 
SGVTSAHCYLRLPGSREGAASAL*LSAT 
TWLCKVALPLGEDLGALCRNPWGAGVKG 
SSRESPPLGEARGGELPWGQVRIRLWGR 
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K=Lysine, L=Leucine, M=Methionine, 
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GEPAP 


6016 


19917 


A 


6068 


393 


116 


PQPIFPRY*QNKDPNL*KTF*HKYMPKC 
PAK\LIIFIEKGFSRFGQPGL*HLSSSN 
LPALASQSAGISGINPGGRARFPLFFFF 
FFSLLLLSF 


6017 


19918 


A 


6069 


53 


280 


FPRFCFC/LLESESHSVA«AIiVHWRRLG 
PG/LKRFACLKLPSNMNYRHTPTHPMTF 
GIF IRDQGMTL* PSWS *T\ PDIK 


6018 


19919 


A 


6070 


3 


377 


HEEPQGCATVPGEFFKFLWRHGLITYPR 
LVSNSWAQAALPPQ*METHIIFSIGRLH 
MVK\RQYWSGAVAHAYNPRLLGS*GGKA. 
A*AQEFETSLGYVMRPCLHK1ILKNSPGT 
VAHPCGSSC 


6019 


19920 


A 


6071 


337 


3 


RESQAFFSKKKKKGFTICMVVFEQRCERG 
KGARQQSHLRKSIYV*EIASAKALRRHM 
PGVFQKQPGG/WPGWAHT»NPRTLGGR 
CGWrT*AQELETSLGNPEKPCLYKKYKL 
V 


6020 


19921 


A 


6072 


1007 


628 


FFLLAHSPHSVAQAGVQWGDLGSLQALP 
PGFMPFSCLSLPSSWDHRRLPPRLANF\ 
FIFLVATGFHRVSQMVLIS*PCDPPASA 
SQSAGTT/GARPKVF* IFVYMYFGVRKH 
SILMSMPQHDLFGYKI 


6021 


19922 


A 


6073 


27 


364 


MDLPSVSIFDLRSLWADISRATSTLFWL 
LFSWNCFLHLLSFNLFLSLGQK*VCCR* 
HIQSCVT*QWRY\WSGLVTHACNPSTFG 
SQGGWVASV*EFETSLCNMAKPCLYKKY 
E 


6022 


19923 


A 


6074 


2 


68 


ARACSHGTVALTAS CFPLQLDYKCLQYS 
FVCFFFFETKSNFVT\RLKCNGPISVN* 
NFNLPGLTRSQA*ASREAGTTGTCYHA* 
♦ILLPSSTRL 


6023 


19924 


A 


6075 


330 


49 


icrrfalWqagvqwrdlrsLQplppvfs 
clslpssrdyrcpas/rps*fflellts 
gdlptsasqsagiigmshcaqppliyfl 

fqikvilffs 


6024 


19925 


A 


6076 


2 


267 


argvevkvgktqkflnmirrtlvhcwwk 
ckliqplf/wrilpkvkeelpyhlailp 
lgi ypkqmks isl\ky icsy i * i vdhfv 
♦ailplgiypkqmksislnihmflyidc 
rsfcigtflll 


6025 


19926 


A 


6077 


379 


236 


ppppqtretpppghvhkdpppphhpypp 
pqepqprtpvpnthpgeeqnhelfnssl 
lepww\yyssllppeqa*ephih 


6026 


19927 


A 


6078 


2 


290 


rrslpsspledgrls lapgltweppshp 
ppq*rsppppkkkkkkkksppgpkf/hk 
fktgkpekpprvl*rghwppfkff*lkp 
glpff*gtkppkv 


6027 


19928 


A 


6079 


1241 


1487 


KMYCIFFVB*KPCLSGAIFIRLLKWS*P 
TVCNS ISGGPPHG* IMPSI IL*FLFLET 
VFHSV\TRVGVQWPDHGSLQPGPPGLK* 
SSCLTASQSTQIIGVSHHAWPPSITFEI 
FKRLAYSMSADGW*KSPPMKLDLGALKK 
KLSFARPG 


6028 


19929 


A 


6080 


333 


I 


IiCHHVQLSFKFFVEMEVSLCGPGWPFK* 
FSHLGLPKCWDYRHDPPHLAFFFSLSX * 
FFC3lDRVSMIiPRLVLKSWSQAILLL*PP 
KVFFLFVCFLRRVSFHHAEPYAVSSC 
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nucleotide deletion, \=possible 
nucleotide insertion 


6029 


19930 


A 


6081 


2 


239 


ARGKYI YNRDKI SFCCPGWHAI I / RSLQ 
P*TSELKRSSHIHRPNSWDYRCMPSHLA 
WF*IFRRDGGLASLPRLGTGWLKS 


6030 


19931 


A 


6082 


323 


185 


SACLGLPKC/WDYRREPLCLVSR*VLNS 
*AQAICLPWPPKVLGLQA 


6031 


19932 


A 


6083 


296 




GERILEECIYAESHRAEITSLHCSLSVS 
K/HSVAQWVKWPDFCSMQSMPPRFK* F 
SCVSWSSWDYGLPPPCAGLHIWRRNVT 
LALQGLPTTSLALV 


6032 


19933 


A 


6084 


443 




QKPK* * KTQTWSSISLQSRKMKKKILSV 
ITFFKT*GEEEEEKEKKRK\KEE*EK*K 
EPSQKKEVEEVEVEDERRGRK*KVDMEV 
KEEEKDRGEEKBGBEQ*KESAEAAVRE* 
VDS/SELVTFAVLPGDSSEEEQGMMVPR 
A 


6033 


19934 


A 


6085 


328 





SNI L I FKAQVTKHI YSLETCSPDYPGI S 
QAIKANFSPCTVG*LPSNWAYRHIjPPCL 
DNF*IFCRG\RSILCRLISNSWLQQVKL 
QEFPQFLGCHARFQGAGFLSHCLV 


6034 


19935 


A 


6086 


299 


46 


LDDLHLLSLCLSLFRDRVSLCYLGWSSV 
A* S *LTVA\ * TPVI LSLRLK* S SHLNLP 
SSWDYRWSNS* PQGLLLPRSPKVHLLL 


6035 


19936 




6087 


266 


48 


GIE IVRQMRSRTGADKVI Y IVQ SPLLKM 
QKI /WLGTVAHTCNPSMCGGRSGRIT*G 
QEFKQSSHLSLPKCWDYRL 


6036 


19937 


A 


6088 


2 


185 


ARVTMLPRLI SNS \GFK* SSHLGLRKCW 
DYRCEPPCPAEISFISSGYLTRSGIAGS 
YCNSV 


6037 


19938 


A 


6089 


520 


297 


LGYRDFLHLSFPIS/WNYRCLPSCPA/N 

F F I LVETN* FHHVGQAGFEHLT SGDL PT 

SaSQ\ SAEITGMTYRAQPAAGS 


6038 


19939 




6090 




187 


SQHLGK:,^fJRDCL/RLIVPDQS\GQHSK 
T F S L KK » F K AWTO * HM P WP A I QE AE 
VGGSLEPRKSKLQ 


6039 


19940 


A 


6091 


188 


329 


TLGLKRFFCLSLR* FIYFFFRDRVLLYY 
PGWKAVGIHRRDYSMLYSLELLASKDS/ 
SCLSLRSGWDYCVWLIFFGCM* CKRQDL 
SIIPWQRSLIGQ 


6040 


19941 


A 


6092 


124 


342 


RPAWPTWQNP I SAKSmSriTWA* AfiRVAG 
IT\GTCQRJ\QVMFVFLAEMGFCHVGQAG 
L YPKI I I LYDKLSHYKCSHLQMRKftLET 
YSGIYSLFTIYLV 


6041 


19942 


A 


6093 


169 


2 


SH*PFTSSTIESlWT\GAVaHTCNPSSW 
GE*GGRMA*DQKIETSLVimTRPHLYPR 

A 


6042 


19943 


^ 


6094 


2 


324 


FFFLRWSL/DSVAQAGVQWRDLGSLQAP 
PRGFTPFSCLSLPSSWDYRRPLPRPftNF 
FYF**RRGFTMriftRMVSIS*PRDLPALA 
SQSAGITGMSHRaRPAVYILTSTSN 


6043 


19944 




6095 


76 


340 


LRKKFCFSVYPFIHRWTFELLPLNDFLM 
NFYFRIVFRFMKKLQR\WPGAVAHAYKP 
STLGG*GGaiS*AQEFKTSPGNIVRDFI 
STKNM 


6044 19945 

1 




6096 


349 


84 


FLLSLF*DTISLCHPGQSAWRTMTSQL 
KQSSHRSLLSSWDCRCVPPCPVUFFSFC 
RDE/SLAMLARLVSNLWDQALVAEAGVS 
VWARQE 


^045 1 19946 


A 


6097 


346 


184 


LTMLPKLVSNS \GLK*SSHFGLPKCWDY 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
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N=Asparagine, P=ProIine, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoplian, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














RHEPPCLAKISLYITKFFHILSLPI 


6046 


19947 


A 


6098 


378 


250 


FKQFSCLSLPISWDYSREPPQMANS\*F 
LVETRFHHVYSYVP 


6047 


19948 


A 


6099 


3 


225 


GFKRFS CLSLLS SWDYR/HPAPYLANF \ 
* FLVEMGFHH IGQAGLELLTSGDPPTLA 
SQSAGI TGVSHHAQPL I IF 


6048 


19949 




6100 


1129 


457 


YCVS SKYRRI ISFSLIA* LLLTKLGS / V 
QKVGVQWRDLG* LQPPPPGVFKLFSCLS 
LPSGW\DYGHVPPHPA*FCVF*WEMGVS 
PCWLRLV\LNS* PSGGSRPPSASQSAGS 
LQGLI\HRARPEH*F*TS*PLGEGKTLD 
EIYIKTKTFYQFCIVCLKIiH*RLMEFAL 
VAQAGMQWHDLGSLQPLPPQFKWFSCLS 
LPS S * DYRYWriTiYHPALEriKEAPSFPES 
PCTH 


6049 


19950 


A 


6101 


3 


296 


E I IQALLI TI VLGLYRTLLQAS * YL* S A 

ftisdgiygstflaaXtglhglhviigs 
tflticfirqlifhftskhhfgfeada* 
ywhfvded*lflng 


6050 


19951 


A 


6102 


1 


749 


RHEGGLFKTI LL I * TSLTI GSLALAGLL 
FFTG FYS ECWHI I E \ TAWI S YTNA * ALS I 
TLIATSLTSAYSTRIILLTLTGQPRFPT 
LTNFKENN\PALLNP\liy\'AAGSLFAG 

fl i tnni s pas pfqttiplylkltalav 
tflglltaldlnyltnklkl ks plctfy 
fswilgfypsithktipyVlglltsqwl 

PLLLLGLT*LEKIJjP\WTISQHQISTSI 
ITSTKK\GILKFYFLSFFFPLILTLLLI 
T 


6051 


19952 


A 


6103 


377 


1 


SKAFGPPGF*APYGLKflHCFPPGFRRGV 
WAPSGFWARPPIGYPFGALIGAPWPSP 
GGPPKPGPP\GPFGVPKLGGKRLGIGGP 
FPGS PGFLTTPGSKKKKSTKKGKPQKKI 
TPLDTSCSRTIVM 


6052 


19953 


A 


6104 


1001 


531 


FFLRRSFTLVAQAGMQWCDLGSLQPPPP 
GFKQFSCFGLLSSWDYRHVPPTLANFSV 
FLVEM\GFSMLARLVSTS * PQ/ CDPPAP 
ASQSAGITGMSHCAQLPKPIFNYL*AID 
SVTTSISLAHDWPNSSENRLLGYMGKPS 
KFKRQEQKMECYRDLAPS 


6053 


19954 


A 


6105 


376 


117 


CSDWSAVSRSQLTAAPMS* \VKQSSCHL 
SLSSCWN/YRCQPRHLANFVLFCYVLF* 
RWAIAMLPILVLNCWPQAVFLPWPPKEM 
GLQA 


6054 


19955 


A 


6106 


689 


433 


LCHPGWSAWQSWLIATSPSIiVVKPSSH 
LSLLSSWDYRCTSPCPANFFVFFY*R*G 
STM* PRLLWNSWAQAILPPRPPKVLGLQ 

A 


6055 


19956 


A 


6107 


2 


176 


AREYPANFFVCLFFLILCRDG/DFTMLL 
RLISNSWLKRSDRLSLPKW*DDEREPPH 

QAY 


6056 


19957 


A 


6108 


328 


69 


I.YIFSFG*ERVLFCHPGWSAVAQSQLTA 
ASTFRTQVLSLIGRWDNRHVPPHIADL/ 
SFYRDGVSPCCSG* SQTQAILLPWPPKV 
LGFQE 


6057 


19958 


A 


6109 


777 


329 


FLEMGSHSVAQAGVQWRDLGSLQPPPPG 
FKRFSCLSFPSSWDYRRPRPRPA/NFFV 
FLVETGF\TIIiflELVSNS* PRDPPASAS 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QSAGITGVTYPRPALTQNFSNLTFGEEN 
RKPNPEDSCVRYLLLS IRQRSPTALLHV 
ILFQQMIQLVL 


6058 


19959 . 


A 


6110 


345 


309 


ASLIFV*LVEMRFLHSGISLFNSGKNGP 
TAFSQDPPG\SSVSKKAI11IINKINK» 
IQHVTVIQIEVQWHNHVSLQPLNPELNR 
FSRLSLLSSWDHRHMPPHSA1IF*NFKN 


6059 


19960 


A 


6111 


332 


3 


RKVLVCPPGLSAVAGFQVPARSAFQVHP 
LFLEAEVAVSPGCAPAFKPGPQGPF*KK 
KEEERKKKGRRKEEE/EEEEEEEEEEEE 
EEEEEGEEGEEEEEFYFLRAGKVKSPRA 


6060 


19961 


A 


6112 


498 


191 


STDRVLLCQKARVQWCDPGSLKPPPPGP 
K*FLCLSLPKSWNYSHAPPCLAMF\*FL 
VETGFCHVGKAGLELLASSDPPALASQS 
VE I TGVSHHAWPMQEYFQ I 


6061 


19962 


A 


6113 


1381 


172 


rn i ds kai l s kknkagg i tl p s f kli y yk 
atvtkta*ywhk/nh/nvd*wnrient,e 
mrphtynpXlifdepdknkqwamdslfn 
kscwdnwpaiwkrlkmdaflmpytkins 
rwik/dlnvkpktvkilednpgstiqdi 
gtgkd fmtkt pkti ateaki hkwdli kl 
ksctaketikrvnrqltewekivakyas 
ekglkssiyt*fkqi*kk\nhpikkwak 
dmnrhfsked i cgqqs y/vkk/ cpasl i 
iremqietttryhltpvqmaiikmsenm 
kcw*gcgeknmlihcwwkck/lj<:lvqpl 
wktvw/rikteipfnpaisllgiypke* 
ks / c / c ykdt ctrmf vaa\ cp s vtdw i k 
kmmyihtreyytai krnktdwawwlmpv 
ilalceaevggslevrssrpawptw 


6062 


19963 


A 


6114 


3 


155 


HEKKISQVWWCMPWP/TILWWLR*EDS 
LSPRKLRLQ*SYDCTTVLPAWATE 


6063 


19964 


A 


6115 


47 


368 


STPTPSAFLCLAF/YRQRWLCWPDWGA 
V\ IQHTYSLPT*LS ILKQD* PLNLLGT/ 
WSSRRVAACPENQI YRRNGVS /MLPRLV 
LNSWTQVILLPWPPKV/LGLQA 


6064 


19963 


A 


6116 


1 


392 


RQRIFLLECGGAISAHCNLHLPGSSNSP 
ASVS*VAGTTGVRHHAQLIFVILVETEF 
HHVGQDGLDLL/NLMIHPPRPPKVL*LQ 
G 


6065 


19966 


A 


6117 


1 


306 


LWKTLWQPVLKLNILLPYNPAIVPLGIY 
PKKMKIYVHTKTCTQICIGALFI lAKT* 
KQPTCSSIGEW/IKKLWYIQTMEYYSTP 
ERNEPSSHENTWKlSrLKCI 


6066 


19967 


A 


6118 


2 


284 


QTGVHVIHDQ\NLL*PETPG\SRDSPGSA 
SQVAGTTGMHQHARLNFPSFSFFLSFPF 
FFSLGKTLGLGLEFFPRWLQMSGPKKIL 
RLYIiQNPKNKG 


6067 


19968 


A 


6119 


2 


315 


SRVAGITGVGHHAELIFVF/CFFFFFIjB 
RDFLFVPRVGFQGPKLG*LKFPLPGLPL 
FSGLTLPKTGD*GAPLQPRTOFWIF*KK 
G\V*YCGPGWFEILDLRGSPPF 


6068 


19969 


A 


6120 


42 


392 


LQWRNLCSLQPLPPVFKQFSCLRLLSS* 
DYR\HAPPRI,G*FCIFSRRIVFLHGWVR 
AGSRNS* PQDGSAPPQASQLAGITG\VS 
HHHTWAPHFLFFKNSGCSTQNR*QVHKV 

GDKETS 


6069 


19970 


A 


6121 


2 


202 


VHLSDQNQPKCP*MlEW£KKMWHIDTME 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














Y YAA I KKDKFMS F \ E T I TLGE E TQEWKT 
KHRLLSLIRGS 


6070 


19971 


A 


6122 


355 


161 


FIXYLFFFFFFFFFFFFFLFFLSFLFFF 
XXXXXXXXXLXVFTGMIS*LWVCLYVY[vl 
NVYM*IYMF 


6071 


19972 


A 


6123 


1 


133 


LC*PGWSR\FLTS*SACVGLPKCWDYRR 
EPLHLVFFFFFFFFTL 


6072 


19973 


A 


6124 


3 


212 


HPGLPSSWDYRCT*PCLMIF/SCCCC*R 
LGLTMLPHLASNSWTQVILLLWLPKLAQ 
NTSLINMFYIDQMLK 


6073 


19974 


A 


6125 


)55 


2 


ALLTIOUOJKALXWLGAVAHDCNLSTLEG 
*GGWIT*G*EFNISLGNIVKHHLY 


6074 


19975 


A 


6126 


345 


34 


HPSTIHRVFCSIi*S*SSPQARNIIGSCLS 
SEHFGITGVNHHTWIiLFPKKNY/ClLIK 
IETGS*YVAQAGLEIiLSSSNPPISGSES 
AGFTGMSHGIRPVSPSLKWS 


6075 


19976 


A 


6127 


2 


328 


ARAYLPLCE-E'ltFFl*VSVlYE»IhLL*G 
LI*IJSKRYLAHPK\Y*LNVSYSVSFCKT 
HFFPRHKEKai IQLGWflHACNPSTLGG 
QGGWIT*GPEFETILTSMEKPRLYY 


6076 


19977 


A 


6128 


I 


341 


GTRKFSDYRGKPPHPAKNENLLFSIFFY 
FETLSCSVLLCEVTSSQL/OSrSFHFPDL 
PSSNDPPSSASRAAETTGACHHAQMIP* 
FFVETGPDWS*TPG/LLAVLPPRSPRML 
BLQA 


6077 


19978 


A 


6129 


264 


2 


TAISENKVASRSEHRRHHML*LQVI/LI 
ALQHRSQRDIsrLSH*KN*KKKMLGEVflHfi. 
CNPSTMGGQaGWIS*GQQFQTSLSNMVK 
PPRA 


6078 


19979 


A 


6130 


1392 


1157 


LSLPKCWNyRREPPCPALCCPGWS*TPR 
LK*SSPLGLPKCWDYR/R*TTAPGLFFI 
SKS SLRS PAYS CVTVCTLQLSHH 


6079 


19980 


A 


6131 


383 


108 


IVIRLCSQPHHHVDFSVLETESCCVTQVGV 
QWHDLSSLQSPPPRFKRFSSLK*LGAIJi 
LLPSSWDYRCTPPHPMr/LCRF*HGPVC 
LYMFGEKVTP 


6080 


19981 


A 


6132 


307 


1 


LKTLPALCELESHSVSQWRNLSSLQPPP 
LRFKRLSYLSLPSS*DHSHP*AHQWS[F\ 
CI FSRDRETGFHHVGQENL IML I VQQ S S 
GLGFS lEATIiLAVMQLV 


6081 


19982 


A 


6133 


351 


66 


RWSFTMLHRIiLKS*LQ/CDPPASASQS 
AGMTGMSHHTRLHNLF*TANNLTQSHTK 
FYSFl SLTPHFAINVTNYIFLYYTS INI 


6082 


19983 


A 


6134 


2118 


1392 


FFCKGK*IFIQCPDWGG/DLCV*QEKIQ 
KKTYHTGKDKYV*LF*KVKPITKI*RQI 
HTSGNTDAQLS YKENHDQGH* QAKNFKF 
FLFFWFYLRRSL/NSVTQAGVQWRDLG 
SLQPPPPGFKQF/ SCLSLPSSWDYRRPP 
PRPANFC/IFE*RWGFTMLARMVSIS*P 
RNP PTSAS QSAGl TGVSHCTRQE FKVKT 
GYHFHLLPSGGLFLFYVFYLYACLLYVS 
VYYPLRQQE SRKl SKIEMFLTTWH 


60S3 


19984 


A 


6135 


383 


47 


HLFLNFYYYLSHYNYLYYFLLHLIPSSP 
KIFFFFT*KII FFLI KFNKVF *NLS P* K 
KIP/IFSYTF*FWPP*NFF*KPPPFFFF 
FFFFFFFFFFFF*QTLFLGGCGYNTKLE 


6084 


19985 


A 


6136 


2 


196 


TRPSILVTALYYLYIFTTTQWG* \LTHH 
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£=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














INNIAPSFTREMTLMFIHLSPILLLSLK 
PDIITGVCS 


6083 


19986 


A 


6137 


700 


433 


FLRWSSASVTRLE*SGAISAHCILCLLG 
SSYSPALAS*VAGTTGTCHHAQLIFVFL 
VETGFLHVGQDGLDLL/NLMICPPHPPK 
VLVLQA 


6086 


19987 


A 


6138 


377 


298 


QMKWKLLLLFRYRVLLC/HPWSAVA*LQ 
LTAASN\FGLK*SSCLSL/MLTMLVLNS 
WPQVILLPWPPKVQEL*AIAAYTIKMWV 


6087 


19988 


A 


6139 




199 


LYTTYFVTKTLLLTSLFL* IRTG\YPRF 
RYEQLIHLL*KNFLPLTLALLI*HVSIP 
ITISSIPPQT 


6088 


19989 


A 


6140 


38! 


28 


stpnrlrerqedppes*mtwppglprkt 
ps ftknpkinlarwggpenppnpeekpr 
kvpespkrkkp*tkirplpsspgdptkp 
pNlkkkkkkkkswpatqsaemggplka 

RSLRL 


6089 


19990 




6141 




209 


LKLIASSDLSTLGSQSAGNIVGMSHCTQ 
PKINMGILLKLVYREWINPVKTPAGLF/ 
TKKKT*ICrEIDKRILKFIWKCTEHRIAK 
QSLKRTKLEDFLLNFKTYYEATEIKTS* 
YRENINPVKTPAGLFKKKKHEKQKLTSG 


6090 


19991 


A 


6142 


3 


282 


QGIFLTKRVKNF*RCSKFKTLVKKNQGDP 
KKGKWPPGPKIGKMNFGKTPFWAKKI *K 
FNLI \PKKTPPFFFQKLKKTGVKFFWAP 

TCGPKGFLSKK 


609r 


19992 




15143 






FFF*DSI,AVSAPGVQ\WHDLCSL\RPLP 
PRFRRFWCLSLLSGWD\YRHVP\QYPAK 
ICIFSRDRVLPILAKflSRNVSF 


6092 


19993 


A 


6144 


1S5 


316 


PCLAWP* IFPCK/NG6FSMLPTLGLNSW 
AQGILLPKPLKVLGLSA 


6093 


19994 




6145 


489 


231 


SVA*AGVQWHDHS*LQPQ/PLGLKQSSC 
LSLSNSWDYRHMAPHPTNFLKKFFVETG 
VLSCCPGWSSTPGLKGSFCIOTiPKF 


6094 


19995 


A 


6146 


179 


1011 


GT I LNPQSLPFKPFFFSTLI FLKTVLLC 
CPGWVSNSSGSSCRSLPKCWDY*HEPPH 
SAL*TFPWPFP*LVKDTS IHSTQLLKAE 
IFSASLDAAPSPTPYIRTHPSSGLQRWL 
LDAAESPTPYIRMHPASRLQRWCSTCTL 
LPSPSYHTPA*JUilTTASSLISPFFPLPP 
PAHFSQCRMTFCLFVL/CLFEMESRSVA 
QGWECaGAILGSL\QPPPPGFK/RGFSC 
LSLLSRW\DY\GHAPPCPG\NFCIFSRD 

gXvslcwpgwsrtpdlwhpprppkalg 

LQA 


6095 


19996 


A 


6147 


348 


3 


GYHKT INWHCCMS VFAKLTCGFNAI P I K 
/IPAKYFVDPDKLVLKFTWKGKRPRIAG 
T ILKEKKVRRL I L PNFKTHYKAT VX KIG 
W*W*NWRHNHWNQIGSPEIAPHKYSQLI 
FDKEA 


6096 


19997 


A 


6148 


22 


228 


THRVALFVRT*]SIWKQPKCPSTGEWLNTL 
CYIHTIEY*YYSAIKRNKLSIHVKT\WI 
DLKGIVLNENIIQI 


6097 


19998 


A 


6149 


406 


279 


RS IQGSSHSHASA/ S * VAGI TGARHHTQ 
VI IVFLVETGVMALT Y 


6098 


19999 


A 


6150 


406 


10 


MRYHFTPDERLKSKGLKLEYW*GSGETR 
SLIHCWLEyKMSHPLWK*/rVWLFPIKri 
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K=Lysine, L=Leucine, M=Metliionine, 
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nucleotide deletion, \=possible 
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NKNVPYGPAI SLLDIL *EK* KHMS TKNT 
YK*MLQALFI ITKMWLLSIKRRMTKQIM 
VYSNRNYYSTIKNELPIQTRI 


6099 


20000 


A 


6152 


367 


I 


PTPWEABAGGSFKPRR/LQ/CPN*PPMC 
SCTPAWAPGSPPVSKKKKMQFHRRMGHV 
SSAWWPCGADVRNLRAGGYVRHRERFPG 
LPWAAGGRDEVIRCLTVCTACGEDHPGS 
HLAGPRGGGFP 


6100 


20001 


A 


6153 


441 


38 


I LWP GEAI TSEKYViQ* FNEMYQKftKRL / 
Q\mRKGPVLLSYNA*PQVAQPVSQKL3SIK 
LGHEVLSHRPYSLSLSPTDYHFFKCINX 
FLKEKRFYNQQDAENAFQEFTES * SA/D 
FCTTRINTFISHWEKYVDYNRSYFD 


6101 


20002 


A 


6154 


308 


121 


GESFPAEKTFPKENDYRWPSFFQWSSVL 
/CKIHGLLMLPRLV*NSWPQAILLPWPP 
KVLGLQA 


6102 


20003 


A 


6155 


1 


420 


QNLQPHWKTLRQF/LYKViCHMMLSi'AIP 
LLDIT*GRCKPCPYK/ESIC/RMFIEAL 
FI lAKKLKLSKCGSAGE* INK:V*DIYTT 
EYYSAIKSNKQLTHTTTWIKLKSITPSE 
*SQT*ETACFTMLLIWHSRKGNIIVTEN 
RSMF 


6103 


20004 


A 


6156 


77 


415 


TLGLKPTSSSHLSLSKCWDYRQEPPCPT 
SEAWFFFFFGGEFPFWSPS*LKQFFFVE 
MG \ FTMLPRLVUJSWPQflHIilLP/ASAS 
QSVGITGKSHRAPPQKLTFFFFLGGNFP 
FGPQAGCPGaESNYPEP*TPGLKAFFGL 
NLSKCLKYGDSPTAPGKLEIFFFKWGGV 
SPSCaSRVLNPQPR 


6104 


20005 


A 


6157 


436 


206 


LWYIHTMEYYSALKKNAILIHTI\STWM 
SLENMLNE IKQEISIHKRflNI*FHLYEIPR 
KESRMEVNQGLGRRGIWEPTVY 


6105 


20006 


A 


6158 


416 


50 


RFFTHPPAGEFFSPPPPKNIFFPPPP*K 
FWGGGGKRSPPQKKFPPLPPPQNGVLCF 
PPHECKK*PPPPPEKTVEPGPIjHVGQRPP 
QS\PFLCYPPLSFFLYFCYFIFFPFVFF 
HFLPFFFFFIR 


6106 


20007 


A 


6159 


443 


145 


FFFFSIRFLFVVWLYQG*EYLCFFMT* 
* IGVLLFLFEICEFFFFLLVGILFGMFL 
/YFFATASPGLFFFCFFFIFFFFFFFFF 
FFFFFFFFFFFFFFFFFFCEKS 


6107 


20008 


A 


6160 


322 


2 


TQDKNKRHRARHIYKRARKSARD*RSLV 
HR*K*KVQKRNEY*PBGTLHTCIALSQG 
NNI*DA/HSQE*WITPVI*TLWEAEAGA 
SRGQEFKTSLANIVKPDAWAKAW 


6108 


20009 


A 


6161 


421 


57 


FCEMKFTEGN]!TAVNIV*PTTKDLEY/WI 
NI /DKTVAGCEKTDSNYERSSTMGKML\ 
SNSIAMYTEIFRGRKSQLMWQISLFSFK 
KIi*QPLQYSATIALIRQQPTTSRQDPSP 
AKRLRLPEGSGDH 


6109 


20010 


A 


6162 


333 


41 


GAWACHLHVGAALNAPTPGLSRFHEEPG 
WSCRTAGGGSRLELSVFFVQMGSCI-IVGQ 
VGFFCI FCR/DRGLAMFSRLVSNS * TQG 
ICPPWPPKVLGLQA 


6110 


20011 


A 


6163 


2 


423 


KINMTKSQCFENILKIEKTLSNLTKKE \ 
REDTNY*YWE*NMDITVESTDI30J1IRN 
TT\ETLHKFDNL 


6111 


20012 




6164 


434 


5 


I CHIFFS ISKLLNILRPVSKLFFS INF I 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
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W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FGPELYYL * IHFNP * *GKCLS FLFLQNl 
CIGFFFNSFMQI^SYFLRISLFIPVSPl 
LLG\FVQIFIVSINILYLSI*LRLFFSF 
FLSLFLFFFYYYTLSFRVHVHIVQVSYl 
RIHF 


6112 


20013 


A 


6165 


1238 


1017 


ETGSHSVTQAGVQLHNSSSLQPAPPRLK 
*SSYLSLLSSWDYRRTPPQPAN\FCIFG 
RDKVS PCCPGWSRT PEFR 


6113 


20014 


A 


6166 


504 


210 


fffktrshdqgslqp*plglkgssclsr 
tlphswd/yrv/ppphpakkykwififf 
vetefhhvaoaglkvldssnspvsasqs 

VGITSISHCAQPILCQF 


6114 


20015 


A 


6167 


258 


65 


YKTKNTTSIFSLPSSF*TQILTF*NTYY 
YYY/CRDE/SLAMLPRLVSNSWPQVILL 
PWPPKWELQV 


6115 


20016 


A 


6168 


1 


156 


LARMVS IS* PHDPPAPASQSAGITGVSH 
CAQLFFFFFWRQPR FFLRAGGQGRDLGK 
VKPLLSGVRPFLS * TLPTEGRKSGPPTH 
FG*VLRE*FGGSH\PVSSLW*QSAGITG 
VSHCAQLFFFFFWRQPRFPLRAGGQGR 


6116 


20017 


A 


6169 




285 


HASAHASGNILSYVGSFISLTAAI \LI I 
IMI*EAFASKRNVLIVEEPSINLE*LYG 
CPPPYHTFEEPVYIKSRRIRRSLIDQDA 
ARKETQQGAAD 


6117 


20018 


A 


6170 


645 


15 


lpfiycs pcrnenslfwoerdsnnplg/ 
islphsdkitfnp\yyt\ikdalgltsl 
hsllkdlhsfphets*gpqayysl/qpt 
plnnpspharpewifpycayhnfsgsgp 
ltnlggvl\alll\silhpkq*fphpps 
ypnkqrh*yfgplrpitfidflgrrpsf 
isnresggqpvsypftiigqvasviiyft 
tiiiiliptislienkilkwa 


6118 


20019 


A 


6171 


409 


25 


YTTPGGKI PFKSNPGSKLFPPPPP/ PPF 
FSPFSP*KFFFSPKPIiNFFGGFFPFFPP 
PKKRFFSKNPPLVFFSPPFKEKVFPFPP 
PLNFAPPRVFLKAPPPFFFFFFFFFKGG 
EVKWLSEAIJ5CKSKDRG 


6119 


20020 


A 


6172 


73 


52 


VQLLKCRIHIF*DTCVLLCHPGWSTVARS 
♦LTATSRSSCLSPPSSWDYRRVPPCIAS 
F/SF*YIALNN*A*AIILPQPSICVLGL* 
AVAHAYSPSTLEG 


6120 


20021 


A 


6173 


325 


470 


IFTFFFLLAFI+FPCRNR/SIiALIjPRLV 
LNSWLQAVLPPWPPKVLGLQT 


6121 


20022 


A 


6174 


2 


228 


TLEDNSTIPCKTIQSfiSCTPWYVTQKAA 
CITLLSM*PKKLWYVYTKSCT®1FIVAI 
FIIAKHWKQPRCLSVSE*INiaiWYI*TM 
E*-YPVLKRNELSSHEKTW/MEM* IQIil 
ERSQS * KAWCLHKKIiHTDVYSSYIHNC 
QKLEATKMSFSKRMNK 


6122 


20023 


A 


6175 


3 


408 


RYLTLAIRDMQIITPVSYHFVPSGDYGQ 
KDNNKCW*RG*KMGSLMHC*W*CKMVQP 
LWKAAWQFLK/DVKRRAILFLGIYPREM 
KTYVHAKTCMHIFTLALFLYVHtHAUS 
VMDWMLGHPI EE ISRDSYS CVCLL 


6123 


20024 


A 


6176 


2 


237 


WVWWL I P WLRAW E AEAKGL LE LGS S RL 
* \WTMPLHYSLGNRARLCLKKKKKKKFK 
VFPPQKKKKKKKKKKKKKGGAL 


6124 


20025 


A 


6177 


387 


154 


AWQWRDLSSLQAPPPGFE *FSCLSL\W 
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C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-^possibIe 
nucleotide insertion 














DYRHKLLTSGDLPASASQSAGITGVSHC 
AQPLSLFVFPLTHQRFVAALS 


6125 


20026 


A 


6178 


1 


239 


FFFFKMECLTVSQAGVQWYNLHSLQPLP 

pgfkqfsc\lslpsswd*rvptsrpakf 
/cvif*dgvshcqpgwsawqpplh 


6126 


20027 


A 


6179 


325 


50 


rkfpltwrrvsffsllrsstdwmptphr 
hirgiicvtqstdsnigliqkhphrhtq 

KNV*PNV\WHPMAKSS*HPKLTITVYPL 
STWHTYTSL 


6127 


20028 


A 


6180 


157 


338 


SQHFGRPRQEACSSSEVRDQPGQQREFP 
HLYKN*KEITQVWWHMPPEAQDY/LEPG 
RSRLQ 


6128 


20029 


A 


6181 


3 


273 


hegqsklktfwkgfti+dvikniddl/w 
rgvkipiltgvwekliptlinkfevfkt 
sleeiteyweigrelelevepkdvtel 

LQSHEKT 


6129 


20030 


A 


6182 


2 


359 


yypaipllgihpk*knsiyqmhtcthwf 

TAVLFTIAKIWNQL* CSSMGEWIKKM/VJ 

mhthththihtmeyyvpik/neiisfaa 
twmelgviilteiiqaqkktkgifshly 

irGAKIFDYM 


6130 


20031 


A 


6183 


3 


211 


npgklsvsinqdcfpwtpcpfqmgrcpt 
llf\sfyggqaiplitpsvgtqifrlkm 

/ PE VTHHFHS FP * E PLTAAAI.NQPSLPT. 

sllvdf/rdrglfchlgwsavd*shlta 
alns\lvtqlscpsflsswdyrrmlpcl 
gnf*ifrlkmlksltifilfhesh 


6131 


20032 


A 


6184 


707 


384 


lqiffynrvkcvknkstqnqfyifsriy 

MHANRHGKYE I /WI I KKMPMREG* AVQL 
RMKWKIKKNTHIKLGVWHGYNLSTLG 
GQGGRIT*AQEFKTSLGNIAELHLYT 


6132 


20033 


A 


6185 


720 


363 


FFFWGRS L * TILF* /QTGSHSVAQSGVQ 
WHHHSLLQP*TSGLKQSSHLSPQSSWDH 
RCAPPRVANFFLDFCRDRVCVAQAGLEL 
LASSNPPTVaSQSAGITGGSHRTWPFWD 
ILGIMYWGC 


6133 


20034 




6186 


439 


66 


KPKTLGSKKVNSNPRVPPGK/ PWGVGPL 
GPGP*MGGSKP/S*NPSPPPPLFKTRFG 
FFPPG*RQGHYSGFLQPPPPGGKTFFPP 
S PPKKWGHKKKIPFQKKKKKERKKRNI I 
GSWTTSETVGNPAPW 


6134 


20035 


A 


6187 


83 


287 


NKMFKIFRFRF/CGFLGQSHSVTQGGVQ 
WHDHSSLQP*TPGLKHSSLPTCWD]!IHHK 
PPHLVLFCFYYRL 


6135 


20036 


A 


6188 


290 


609 


RRMDDPPFFKYFFERRVLVCHQGCS*V\ 
K*SSHFSFPCSWDHRCV/PMPHPAN\FN 
FFHTDKSLTMLPRLVSNSWAQVILWPLP 
P*VLGPAGMSPCPQPPSQFCIEAV 


6136 


20037 


A 


6189 


2 


371 


ryipekvpakyseltndslsilsgksgi 
lwkkrlslfs*q* sykwfslgaiycgmh 
*krflgtsil/fte*vetgfhhvdqggl 
e llts ce tias ay * s ag itg i shcae s q 
stlkkyipgsti 


6137 


20038 


A 


6190 


531 


205 


RWRFALVAQAGVQW / HDLGSLQPLPP * F 
K*FICLRLQRSWDYRC2iPLRPA\NFFVF 
LVETQFHHFGQT\NLQLLTSGDPPASAS 
QSAGITGVSH*ARPWFIIFTALLYK 


6138 


20039 


A 


6191 


52 


375 


NECETSFLFFFFEKKFWFCPPGGGEHEN 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=lInknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LK* MEP \LPPGLKE FS CLTLRRGGN* GP 
FPPLQEMFGVFRKKGVPHCNPG*FQTPD 
PRELARLTSPRGGDGGGRPPPRVK 


6139 


20040 


A 


6192 


3 


158 


FFFLLLMCGLKKETG/WLGWAtlACNPS 
TLGGRGGWITR*ELKTSLAHMAEPY 


6140 


20041 


A 


6193 




54 


FFFETESHSVAQAGVQWCDLGSLQPPPL 
GLKEFSCLSLLSSWDYHHTPPHPANLFV 
F/S FLVETGFRHVGQAGLELLTSGGPPA 
SAS QSAGI TGVNHRAWPEVFKK*RVS F C 
RPGWSAW 


6141 


20042 


A 


6194 


1 


323 


FILPSSWDCMCAPP/RPDWVLYFLVETG 
FHRVGRAGLQLLTASDLPASAS*GAGIA 
DGVSLTQCSMLPMLECRGVILARYNLHL 
PAACLGLPKCRDCSLCPAATPSRK 


6142 


20043 


A 


6195 


619 


243 


SPETGSCSGAQAGVQWHDLGSLQP*LGL 
K*SSYLSLLSKWDYRCA/PPRI»ANFLFF 
VQAGSCHLAQAGLELWSSCHLPALVSQN 
AGITGVSHHARLAVAWKRTPLYGLPSRI 
SNLCVLLKFLFFYLY 


6143 


20044 


A 


6196 


385 


19 


GGHFGAQE I S PPGVQLFFRLHFLSNWGF 
KRPPWPPG*FF/VFL*KRGFPPFGRWP 
PSLFP/SDLPPPAPPK6WVSGLRPCPRT 
FFFFFFFDKEC/RLSCPANFCIFSRDGV 
SPCWPPWSRTPDLQ 


6144 


20045 


A 


6197 


498 


437 


ENLQSSINLVDKAMAWFERIDSNFDRSF 
TTVKMPSNLTSI INACCGEIFHERKS * S 
IWQTSLLSYF*KLLQIAPPLIATTILrS 
KEP*TSR/QDRLLLAEG*D 


6145 


20046 


A 


6198 


79 


394 


SVKLLaLFFFFFSENFCFAPQVELKGPN 
LG* LNPPPPGLKNFSGLT/LPKKWELGG 
HPPPPSKFFFLFF*KKGMAPFSQGWFKT 
LGIiNQFACLTLPRGGNLRREGP 


6146 


20047 


A 


6199 


2 


313 


SASHSAGIIGESHCAWPIMPVYSLVLNF 
FFFLEKKPPFVTQLEGQGGNLG/SMEPS 
PSGLKGIFCLGPPKKWELRAPPPCPGYF 
WFF *KKGGLP IRPGRFWKSVP 


6147 


20048 


A 


6200 


62 


732 


AFQQaKKGTQ\GRLSSPflaGAI.P\PVIA 
PSGPPLPA*\GSSS\PASVaPYPPPQAT 
G/SPPTPPGELRRGGWSKGGLDHPRNTF 
YCWESSEP/PPN*GQS *APaGVNSEGRG 
Q/LPPSSrARCVP*TPSHVPPHPIiAFPP 
SLPHPGPASRAPP/TGSSDLSASAVSAP 
GREGHP*GTKMVPQQA6RQKPGHRGaS* 
GSSCSKRTLSPLSPRQSGFQSRCCFHPP 
RQNHSRC 


6148 


20049 


A 


6201 


424 


7 


KNMKRSPISCVTRDPQIKITMR*HYTGI 
KMANV/RLTGNTKC* *6YGAICGPLIQC* 
WECKIV* PLRKTAWQFSQS /LKI V*/P Y 
DPArALPGlQLC*KCTS\KKlSrCM*IFrT 
ALFIMAKKKQKPRYPSIGEWKYKLLHL 


6149 


20050 


A 


6202 


371 


144 


RWALPRRTNF\*FFVETRSLHVAQGGVK 
LLSSVSPPISPSPSAVITGVSYCAQPYG 
HPFELSFSLIEYEYIYLDFF 


6150 


20051 


A 


6203 


409 


no 


LRRSS*LCHLGWSAMVQSWPT*FKP*LL 
GLSDPLTSASRVDYRGMPPCPRHFI IFD 
M* F I * CVCVC /RTWGL I VLPRLVSWFWP 
EVLLHPGPS KVLELQV 


6151 


20052 


A 


6204 


15 


221 


KASRKMQIETTLRYYLSPMRLAKMQMLI 
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residue of 
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Predict- 

lociition 

ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=]VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NTSCW/RKSQETRTLIC*W*EGKMVQPL 
SKGIWPYLIQAEYP 


6152 


20053 


A 


6205 


23 


371 


KVACTCMFITVLFTIVKIi*NQSRCPSTD 
DWIKKM/WIiCVCIYSSVCVCLYVCIDR* 
IERHDIYVYIMEYYXALK*/DEIMFSAA 
'rRVKLDAII*S*ITQNVKCCIVSTHSGN 
LIMVST 


6153 


20054 


A 


6206 


113 


417 


SAVTLLVIYLNDLKTYVHTKTCTQMFTA 
ALFI lAKI *KQLRCPS \LIWIN* C* KEM 
SYQAMKRSYARNLWSSKKLWRNLKCLLL 
RERSOSEKSAYCVIPTI 


6154 


20055 


A 


6207 


510 


167 


GLGQAISPSLPKSQGIGAYGFGFFFFFK 
R*DLVCCPG\SLNLVASSQSSCLSLPKH 
WDHECEPPCLAFFFFLL*RQGLAMLPRI 
LPLLKALASGGGMAKKKCIRYFSMKIHL 
QAQ 


6155 


20056 




620S 




40 


S YTHTHTHTHTHTHTHI P* * LTGS / RLQ 
GICRKGSHKHPKGTiSQLTiP 


6156 


20057 


A 


6209 


1 


258 


SGFLLS*VPSVRDPTGMRTV*LTWQPLP 
APLEF\CPRLSD 


6157 


20058 


A 


6210 


229 


310 


DFSEAQKSRQCGGG I SNVNHPGWLKPV 
IQHF*DGVLLCCPGWSAW*SQLP*LPR 
LK* SSQLSLPR/SWDYRHVLQ\HPANCF 
VFFVEMSLAMLPRLVSNSWALAI LLPQP 
PKVIiDDWLEHHTWMIYITYATTTLP 


6158 


20059 


A 


6211 


496 


1 


CCQESFEILGSINSFGPGLKEGGVKNMR 
PGAGPTLNKRPFSDWPLQPGRRGGCFPF 
VSQRGSQKDRVPPKWFFFFFSF*DRVSL 
WPPGPSARKPSWLPVASTSR\V* QSSCP 
PLWFP6PPTMVSYFFFF\KRQDLPMLPR 
LV*NFWPQVILLSQPPKVIiRLLQAHA 


6159 


20060 


A 


6212 


3 


340 


VTflGIlIIFMI*EAFASNRKVLIV*EPS 
I *SLQV/ *WRGLLLLGSPPPYHTLEEPG 
YVTSKRKRKESNPPKLASSQPHGLHDFF • 
KKKKRGGPFKRILEGAPFGGGWGHREFF 
P 


6160 


20061 


A 


6213 


60 


493 


KSQQNFFCFETESRSVTQAGIQWHHFSS 

lqppppgficvisyl/slksnfsgyp*df 
fcngppcs*gp1rftyy1cv1psswdyr 
rlpphpahfcvfSiktvspgwsqtpdlr 


6161 


20062 


A 


6214 


414 




LaJGRPGLLVSPLPLGEELIPCLPR*KL 
GPPRVL* \KGGPFL*SSFAPFF*EMESH 
SVT\RLECSRAILVHCULCLPGPSDPPP 
SAS * VAGRIARAQKVEAAVSHDHnAALQ 
PEQHSETPSQQTNKOTJQTKLTQKYPF 


6162 


20063 


A 


6215 


2 


403 


slllplvqlplrlcfffflkrgfnfgap 
dkrewgefg\*lkpwlps*nnfsgqtlr 

RSGD*RKTPHGGVNLVF*EKTGFSNIjGR 
GGLKPPPSGNTPAWPPKSVGITGGTPPP 
PPVFCFLGFCENTNLRAKKKFI 


6163 


20064 


A 


6216 


2 


388 


KMEQTLWKTDWEFLTKLNILYPYSPAFA 
DLGIYPNELKT/C/CYTNTCI*KSIAPL 
FIVARTWKQPGCPLRDGWLNKLGCIQAL 

EYYSP*KRNEL*NHNMAYKKLKLIFLLR 
QEDRFSLGVQEQPKHNGEM 


6164 


20065 


A 


6217 


363 


302 


nkefkttlanmgeppfl*ecrgfs \mlar 
wlnslfhdspppasqkagipgvspppr 
plffffflletvshyvaqagqlisnydl 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, IfcPhenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=lInknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KLP SCLSLLDTWD YRH* PPYP VFKLLNS 
TYKflC 


6165 


20066 


A 


6218 


576 


6 


RAVLRGLLVLLLTKHTFFLFP PRQSLAL 
LPRVGVQW\HDL\GSL\QPL\PPG\FKQ 
FS\ CLS I PSSWDYR YPPPRPANF\ * FS V 
ETGFHHVAQAVLELLTSSDLPALASQSA 
GITDLGQSETIHGGSGLDKPPDHKVDKG 
PTEERACLLSYSAGRKYKEYHILPEQGS 
EPPHHGDILSISFQAASHIAQTPPAHT 


6166 


20067 


A 


6219 


u 


496 


GFRVDPRVRPRVRLNS*PRDPPTSGSAG 
ITGV/SSHHARP*NIFT*KLGISFLVFW 
GFLFLFFVFCFPFFWKGGFLLVAQVEGK 
/WPKFGLIEPSPSGVKEFFFFNPPEGWD 
\YGIAPPPGVIFWVLIKNGVSFFCRAWV 
QTSGLSWINRLNLP 


6167 


20068 


A 


6220 


1 


421 


qrsrltaisretsftlvaqagvqwhdlg 
slqhlppgfqgfpclsllst\tgncsvt 
qagvqwchlgsl*plppglkpsshlsps 

SSWDYRL*KPHQGSD*VCVT\GAASPEP 
NTMTAWNKPPDIRMKETLLLPPSSSEKM 
L 


6168 


20069 


A 


6221 


3 


36 


AYC I VKPS P * PLTGALS ALLMTS GLAM * 
FDFHSITLLILGLIiANTLTIYQ*WRDVT 
RESTYQGHHTPPVQKGLRYGIMLFITSE 
AFFLAGFF*AFYHSRLGPTPQLGGDWPP 
TGITPLNPL*VPLLN\TDJE*LA*GVSIT 


6169 


20070 




6222 


895 


13 


AWSWAWN*FELATQIAFSGRIPLTDSHL 
GLLKAHVEAPI \AGLIVLAAVLLKTGGL 
MLYTASHS FSNPLTK\HIAYP FPVLSL I 
EA*FITSSHLPSDKQDLKS\LMAYSS\ I 
SPHSPS*LTPILIQTP\W\SFNGAGQFS 
I IAHGTYHAYITYSA*ANFKI TERTSQC 
ACINPAAQGLSNS * LQLIGLFE * LPRQA 
VSLTMGLTPPLINPTGENCSVASNQRFS 
/WTQISLLLPYRDSTILGSQALYSLYIF 
TTTQWGSLTHHINHIKPSFTRENTLMFI 
HLSPILLLSLNPDIITGFSS 


6170 


20071 




6223 


68 


405 


HSYALVyUPVGIRVKVHLSKCILYTWHI 
SNSSQEVCEMPWLNLKRVPTTLAPliF*N 
IS*G1SQLGAVAHTCNPSTLGG*GRW/I 
QQLKTSrjUSTMAKPHLLLKKKKNFPNFPK 
RG 


6171 


20072 


A 


6224 


591 


283 


GTSTQftGVQWOSICGSLQPPPPSFKRFSC 
LNLPCSWDYRRVPPHLflNF\*FLVEiyiGF 
HHVN/QAGLELLTSSDPPALASQSAGIT 
GMSHCaHPRVaF*GEKl!IHRSL 


6172 


20073 


A 


6225 


302 


173 


GFPSFFLKC*DR/SLTMLLRLVLNSWflQ 
VILPLWPPKVLGLQA 


6173 


20074 




6226 


417 


81 


tqtrgpppppflggppsgpertvppprg 
gvpqtgsrfflfrqnpgpalfpkm/egg 
f*kkfgpplppf*ktpffftnrgnpcpf 
fvgenpqappvfggfnpfprkkkkrndf 

LS 


6174 


20075 


A 


6227 


41 


474 


TDATETFHFTSKH\HFGF*AaA*YWHFV 
DW*LFLYVSIY**GSCVPHPA 


6175 


20076 




6228 


503 


28 


AAHAPPRVRKVIGH*AFFTQRCTNSRH* 
IIKFPKDKDKARILKGAEEK/RTTYNGA 
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peptide 

sequence 


Amino acid sequence {A=A]anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, •■=Stop codon, /=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














PIWLAVNFSVETLQ/A/RREWHDMPQMP 
KEKTFYLRI L YPMKI S FKHERE IKTFPD 
KQKPRDFISSEPVLQEMLKSWPGARVHT 


6176 


20077 


A 


6229 


418 


175 


VSPLSPLKILFFPKGLKFWREC3GPKCPP 
PKKRVLPKK/PPAMF* IPPPKRKKSKLP 
PPRKIMPPQGIFKSPPPQFYFFFFFF 


6177 


20078 


A 


6230 


45 


308 


ACVRLKH*TDN*QPNIYNQP/HKSIjLP* 














FKKKPGGGPISWGGEKIIFFFFGOQKKH 

TLGIF 


6178 


20079 


A 


6231 


440 


209 


IQHFRGPSRRGLLGPEflEASPGQHRESR 
SFLKMQP\NKKFS* VWWCTPAI PASPKV 
BARE S LE PGRRRLHLTEALQ PG 


6179 


20080 


A 


6232 


419 


2 


SSPPPPGGKFF*KKPGKKKFSPPKKKRV 
FSPP\PPKKFFFSPKA*NFW6GGGQKFP 
PPKKGVFSKKPKGVFKTPPKKKKKFFFP 
PGENWGPPRVF*KGPPPFFFFFFFFFFF 
FLSFNVNILFKPVQAPCLT 


6180 


20081 


A 


6233 


448 


112 


QSARITGVSHRTRPMLRTSTPLSATDRN 
AGQKISKDIADLMNNEQDHRDIYRHSTQ 
* QQNTHS FQAAI ENKI DP I LGHKKtlLTH 
/CKQSDQTWNKLEISNKYAKPQKIKHTI 
K 


6181 


20082 


A 


6234 


2 


351 


MDQG 11/ SDFES YYLRNTFCQAI VARDG 
DSSDGSGQRKLKTFWKGFSLLDAIKHTE 
DSW/EE I KI STGVWKLIP ILVDDFEGFK 
TSVEE/VADVMDLSRELEL*VEPEDGTE 
LLQSHSQT 


6182 


20083 


A 


6235 


3 


252 


FHTYNSLLVCHSNLCYQNSLLFFIIH*/ 
CI/HFHS*AKVSIWKHTYTHTHTHTHTH 
MHTHFFQATDRKICVNVYMFSVAYIHKG 
LQ 


6183 


20084 


A 


6236 


3 


294 


IITP*LPQLSGYIEKSTPYECGFDPISP 
ARVPFSIKFFLVAITFLLLDLEIA\LLL 
PLP*ALQTTNLPLIVMASLL*III]:iaijS 
LiAYEGLQKGLD*AE 


6184 


200S5 




6237 


3S1 


121 


PVPASF*FFFFyEDR\VSLCYPGWSQTP 
GLKQSSCLGLLSHWDYKYDPPHPAFISF 
LLSSSLLWVL*RPWLLCATVPAFLDIR 
EQ 


6185 


20086 




5238 


395 


62 


PFFPIPKFFPWGPNFFF/PLGPPPFLGF 
FPGSFAPPSHFSPPD*NFPRPPKNFPPG 
GDLFPQKKGDWFFPKKEGVFFFLGGFSK 
FSGGPPFSLGPQKKKRFKI KGPKKFC F 


6186 


20087 


A 


6239 


1066 


761 


SFFLRWSFILVTQAGVQWCDLSSLQLPP 
PGFK*FSCLSLPSSWDHRRPPPRLANFC 
I F/M*RWGFaMIARPVSNS * P /CSDP PT 
PASQSAGITGLSHRAWRLVFL 


6187 


20088 


A 


6240 


390 


27 


GELPTPIYLRPPASPKGGVSGGSPPSRV 
KFFPLEVPPPPPPGC*PGPFFYIWPQGG 
ARVP* QQIRFFSLRW\PPRPTRPKNPHL 
QVGPLRI CLSPETGGGVIKVFRGSLLKK 
KKRAAASRSN 


6188 


20089 




6241 


366 


1 


kiknfnrpi/tskga*sw/rnlptkks 
pepngfpgvfcptfkkkltpvf/sklfp 
kf*kegilpnsff*sslalipnpfypsk 
kkkktkkhyhaislmmidekihyky*qt 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=lInkno\vn, *=Stop codon, /=possibIe 
nucleotide deletion, \=passible 
nucleotide insertion 














EFRSTLKVLYTITK 


6189 


20090 


A 


6242 


23 


439 


lASGRPFFFFFFFFFCLPPPPGGNPSFS 
FLKKKKKGGGGGENPPNPPGKAPNPQGV 
GVPPFPPPPVFQKNPPPPFFF*KNPPPS 
PKGPPGGPPFFFPPGGGGGPPPT/PYFL 
KKKGGWGGPPFF*KTNPPPKGEITPLVG 


6190 


20091 


A 


6243 


20 


261 


WAQEILKKFSCLSLLSSWNYRCAPPHVA 
N\ FVFFVETGSYHVAQSGLELLDSRDPM 
TGESHHAQPHFC*ILYTGESQRIL 


6191 


20092 


A 


6244 


376 


76 


LLPRLECSGVIKPFSHLSLPSSRDYRCV 
/PTMPNF* KFFVELKFNSL*R* GLTMLP 
RLVSNSWCQVIPLPWPPKVLQGCATTPG 
LTDFITTFKGYRHGDTY 


6192 


20093 


A 


6245 


222 


233 


ksfyfalifekmfcyvkksqvglvffsk 
vkvlvrpffpmvikkkqp/fnpl/rfff 

FFFQDRVLLCHPGWSAW*SKTLS 


6193 


20094 


A 


6246 


2 


358 


EIILTFQSYY*ENTFNEAITAIDSDSSD 
GSGOSKWKSFWKRFTILGAIKNICDL*D 
*V/KISTLSGVWKKVIPSLMDDFEGF\K 
TS VEEVI ADWE IAR* LELEVE PEDVIE 
LLQSHDKT 


6194 


20095 


A 


6247 


3 


322 


PDHS LLQTPTPG FRRS SHLGFS SS WD YR 
LEPPHSVCVCVCV\FCVYIYFFFFLEKT 
GFGFVPQAGLNLGVKRIFKA*PPGGVEI 
PGLTHTPGPWGGPIKGKGAMPAP 


6195 


20096 


A 


6248 


1425 


1280 


SLDFQGLALPfiWSAW*SCLTEASTSR\ 
VKRTSHPSLPNTWVYRHALPRLANSVYF 
L*RWGLTMLPRLGSNS*TQAILQPWPPK 
VLAI.*ADEEMKAKGHISTKMGGQTLNPQ 
PDCGATS VYSLP IRKFQ 


6196 


20097 


A 


6249 


189 


336 


HAPFPVSVISLSCQKRREVWLAMVAHTS 
NPSTLGG*WGWIA*GEEFETSQ 


6197 


20098 


A 


6250 




300 


FLPS FMGFWPVLGHGRftSTVTPSEPLSL 
PFLLPAEPFLFFFFSLFRKRES IFGA/P 

GGLEGGQFKFLEPSPPGIKEIFWPNPPR 

GWELKIWPPPPINFCDF*EKTGFTMVAR 
AGLKL^ TGGLEGGQFT-JFLEPSPPGTKBI 
FWPNPPRGWELKIWPPPPINFCDF 


6198 


20099 


A 


6251 


1 


198 


LTGRL/SKCEWSWRFDVQLKDLKRWQN 
HLLLSQQLACIVLTSAGIMDHKEMR*KH 
TGGKSLGFFF 


6199 


20100 


A 


6252 


11 


214 


slrlkssshlslpsswdlrcelphlan* 
kkkkkl/ckn/rslxglpklvsnswv*t 
ilpprppralglra 


6200 


20101 


A 


6253 


402 


3 


KKGFVLSPQVEDKDAISGSFNPGPPGEK 
IPPPQPPEEVGPPGGPPQTRGNLGI/LG 
N*GGGGGF* PPGAGKSPAPGPQRG\PFG 
KDKREKPYGPPPKLFFFFLRQSLCFAQA 
GVQWCDLGPLQPPPPRFKQFCL 


6201 


20102 


A 


6254 


362 


78 


VGVN/MTFFPQHFLGLSGIPRRYSDYPD 
AYTT*1ITVSS IGSFISLTAVLIMI FI I * 
EAFASKREVISVSYASTKLE*LHGCPPP 
YHTFEEPTYVKVK 


6202 


20 103 


A 


6255 


357 


2 


DLSCHSHGEATQKHSEKCKASCTHMFIT 
ALFLIAKHWNQPTC/PHTMEYNSVIK/N 
EQTIDTTTTWMDLKCIMRHEQRQS*KNS 
VLVCFHAADKDIPETGQFTKERGLMDLQ 
FHVAGEASH 
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nucleotide 
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Predict- 

location 
correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspai-tic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=lsoleucine, 
K=Lysine, Ij=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T-Threonine,V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6203 


20104 


A 


6256 


2 


305 


QWCNHSSLHSNPGEEETVSSLKLPKAWN 
PRHMLLCPTT3FILRWGFIYVAQAGFDT 
RGHASL/LA*ASPSVGITSKSQRVRPVH 
DIACFHRESTMFSAENFH 


6204 


20105 


A 


6257 


335 


3 


GPFQSPPPGVTPISCFTPPKKGGNRGPP 
PRRGKFWI FKKNGGS PG * KHRVLFPDPG 
VPPPRPLKKGGLKGGAWAKI FFFFFLHQ 
SHSFTQAGMQWCNLGSLQPPPPGFK* F 


6205 


20106 


A 


6258 


219 


2 


PTLSSQKDTKAQRLYSFPLTTTKDERLV 
TLVS /WPGWAYTFIPSTLGGQGRWVT* 
AQKFETSLENTVKPPSC 


6206 


20107 


A 


6259 


306 


117 


SFELLGSSDPLI SALSSWDYRCAPSRLA 
NF * F FVCVEMG S S Y IVQ /SWPQ*ILLSW 
PPKVLRLQT 


6207 


20108 


A 


6260 


572 


269 


ETDI LSVAQARVGGLHLGSLPAPPPGFT 
PFSCLSCSS *dyrrlpprpa/nffvfsv 

EMGF\TVLAEIVSIS*PRDPPPSGLP*G 
ALGLPGVSPLRPALCLLLK 


6208 


20109 


A 


6261 


357 


116 


FAQAGLEIiMGSSSPIASASQSVGITGMS 

hca\rlllffcggrw*gg/lk*fsclsl 
lsswnyrhepphpanlcifsfnti3y 


6209 


20110 


A 


6262 


3 


337 


HERHERHERHEQSAEITGVSHHAGGGIG 
LLNPFLFLFFEftGSCTVT*T/GV*WHDH 
GPQ* PQI PRFK* SSHLSIiiS SWDYRHVP 
P * LGLQACATMP6* FFIFCRDP ISE INK 


62)0 


20111 


A 


6263 


515 

^ 


150 


flrwsftlvaqagiqwosvsslqplppg 

fkqffclslpsswdyrcpppcptnf\ *l 

lvekgfiiiivgqaglelltsgdlpasasq 
sagitgishr7vrpgtl?f*aa!\tgggkqv 
flllrvlngl 


6211 


20112 


A 


6264 




296 


GTSb'LFSt'LP/ljlji; i'LKFLNFFQ I VSHC 
RPGWSAVA*SQLTAPSI\FGLKPSSQLS 
RT iSSWDYRCP INFF\SLPMLPRLVSNS * 
PQAILPTQPPKFLGLQA 


6212 


20113 




6266 


298 


182 


LHFFEVSKIIKLIRAEAKWCL*VLFTIA 
KIWKm**ICKyLSIDEWI!aCMWYIHTV 
EYYSAIGNKE 1/ PVI VTIWIYMENIMLS 
EIN**QKDKCMIALL*SIYIQ 


6213 


20114 


A 


6267 


402 


234 


PPPPLFKNPPPPPFKNPPPKKKKI I FP / 
SPPWLAPPBCEF*KSPPPFFFFFFFFFF 
FP 


6214 


20115 


A 


6268 


125 


17 


KGQAQWLTPVIPAIjREA/EEAGG*LEPR 
SSEPAWAI 


6215 ^ 


20116 


A 


6269 


1109 


454 


GDSAGAEGTMENFTALFGAQADPPPPPT 
ALGFGPGKPPPPPPPPAGGGPGTAPPPT 
AATAPPGADKSGAGCGPFYLMRELPGST 

eltgstnlithylfleqaynkfcgkkvke 
klsnflpdlpgmidlpgshdnsslrsli 
ekppilsssfnpitgtml/velppphwp 

VAGAVSSDAYSASQEEE*AQAQTEPYPG 
SCPPRKTI*FRSQEEEKEKRRGS 


6216 


20117 


A 


6270 


111 


86 


WGIT6V3HCAWSRKGTLTHCWWECEKV 
/QPLRia'V*QFLMLKI\PYQMI*PLLGI 
YLRKNKAYIHMKTLK*MFSLFLFEIAK/ 
TWKQPKYPSASEWIII*LDSPS LYWLGWS 


6217 • 


20118 




6271 


2 


222 


RSCESNSQTSACYKSFYLYPLKS/GGFK 
I LERM* P I YFL IP YWTI FNQRYYF YI W 
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nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 
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od 
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USSN 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino aeid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, Ij=Leucine, M=Methionine, 
N=Asparagine, P=Praline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Thrconine, V=VaIine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon,/=pos,sible 
nucleotide deletion, \=possible 
nucleotide insertion 














NLKKTRGIYTVLYKMYKSF 


6218 


20n9 


A 


6272 


787 


470 


LPFFFSFFETGSHCWRWERESLGVAQVG 
VQ*CDLRSLQPPPPRLKQSSHLSLPSSW 
D YRCAPP * LDKFFFCI FCRDGPHHVAQA 
GLKLLSSG\NLPASVSPSAGTTL 


6219 


20120 


A 


6273 


2 


322 


FCMRIAAIDSSSSHGSRESKLKTLWKRF 
SILDAIICMTCDSWEEV/KVST*TEVWKK 
LIPTLTDDFQRFKTSVEEVTA\ELLWE 
ITREIELEVELDDVTELLQPHDQT 


6220 


20121 


A 


6274 


175 


406 


KNSLGGPNLTGEGKKITFSL*NFFFSLE 
ALIFVGRLSSFVPPPKIRFFSKNSRNT,F 
/CPVGEFFKTRPPVlIFFFSLP*RKSyFF 
SFPG* IWPP*GIFLKAPPFFFFFFFLRE 
RREVSMLPRLMSNSWVQVTPPPWPPKVR 
VLLGIVDVQ 


6221 


20122 


A 


6275 


2 


181 


PRVRGRVGHRKGKKSFVHCWWQWKLL*P 
LWKTVWRVLKTLK\ I ELP YYP /aTKLL* 
SYPI.E 


6222 


20123 


A 


6276 


397 


47 


PPPSKNFFFPPQKDFFLGGLPKKSPPPK 
K\FFFKKTPKKNFPPPSFFiCKPPPPAFI 
KTPLKKKKKFPPPPQBCFGPPPKFFKBCPP 
PLFFFFFFFPITTAP*KITBCSIPYPyFF 
TKTFYL 


6223 


20124 


A 


6277 


2 


264 


ETESHTVAQSGCSGAIIAHHSLNLQGST 
NPPTTAS*VAGTTDACDHAWLMF*FL\V 
EMGSHYVAQRELELLSSSDPP\PQSPKI 

LGLQA 


6224 


20125 


A 


6278 






I RC YRXI FYERKHQVILQQISLLS YPKKL 
PQP\PPL*ATITLISQQSLISRQDPPPV 
KK**LTEG*DDC 


6225 


20126 


A 


6279 


57 


306 


INKftNRDKL/GTIDFLEFLTIKVRKMQN 
TKTKSKKRN*ESILHVE*EDTGHVRAAA 
VCRDMTYLGKKLTDEEVDK\1 IREAKyW 


6226 


20127 


A 


6280 


419 


18 


SSSSPKACMPVYEWVCTQKCVWIYEYMC 
VFVYV*LVLMCCJOJVCACW\C*SACV*M 
CVICECVC3yi*GLSVICEGVCM 


6227 


20128 


A 


6281 


459 


235 


FLRMGVSCCHPGWSAWQS*L']?AaSN6* 
AQVILPSSWDYRPVTSHPAKTHNKYMQ/ 
C6NKKMCVTHS IVIFSLLQ 


6228 


20129 


A 


6282 


505 


470 


GVP*GQEFEPSLSHMGEPCFFLKCKINP 
AWWQKLF*PRRQKFRLTQI/LALAPQPG 
QQKVNPFS/NKKKKKNPNNC* lALR 


6229 


20 130 


A 


6283 


206 


374 


GGGGGKKK1IWGGEKKRGGEKK1FFF*SP 
PRFFFSPPYKKKKPPPPPPFFFPPPNFF 
FYPPPPPPFFPPPFPPPPFFFFFFFPFF 
FFFFFFFFFFFFFIiMHFF\FF*LYVIIV 


6230 


20131 


A 


6284 


431 


269 


YIKGGPPPEIETFLIFKYYIYLIXFIFF 
FFFFFPFFFFFFFFP*FVCLKYLIF]SIHI 
QRF*TY1 SI PHFMLLM\ YTMLFTFCIC 
SLFTIIWFYIFV*L*YCCTVKALCNSL 
LVSELDKYI KNNYNCKQ 


6231 


20132 


A 


6285 


298 




YLFFFFF*VE1:jVFFLFCMFFFFFFFFFF 
FFFFFFFFFFFFFFPFFFE FKNKVS PNS 
CPSLSLFKLYFLKYNFIQLQGNLHFNNI 
I 


6232 


20133 


A 


6286 


386 


1 


ANLHI KVNFWNFTLLNFS FLP VWFGFH 
FLW*RVYLSHRGFYVFFQNFLRIFFFF 
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SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SF*VG\FFFLFFLLFFFPFFFFFFFFFF 
FFFFFFFFFLCIAIVFY*IWYISYFTKY 
lEEPRTRGRTRGRTRG 


6233 


20134 


A 


62S7 


190 


134 


LEFGRNRFQFSRMI LRGSGLQWRNYADM 
VKIAKT*YLRNTFHKVIAIIGGDSCDGS 
GQKKLKTFWKGFS\LDAINGIHNSQEEI 
KMATLTGUWKK* IPVIKDDFEGPRTS VE 
E /V/TADMVKIAKT \LELEVEP \EDVT * 
LLERILF 


6234 


20135 


A 


6288 


394 


2 


RRQASLSCGGLHPVRSSRLLCLPKQAWA 
MAGAPPPASLPPCSLISDCCASNQ*DSV 
GVG P S KPGVGYHLLVHRFLS P LEKRS IE 
VGVAEFSRCRLSPLSLTRKGN*LP\TPC 
ASRVRQCLTLLQLAHGALHP 


5235 


20135 


A 


6289 


233 


58 


DRVSLCRPRWHWSRTRLTVASTSR/LQ 
AILPPQPPE*LGPQASAWRSPRSPKLTG 
LRG 


6236 


20137 


A 


6290 


385 


48 


TLQYLCTPTSTV*KKKKPENIKYW*QCG 
ANRTLFQ*WQVCICVAQTLWKPVWHYTVL 
NLNTRIVYDPGMPLLGTNNNAYMCCIC\ 
YQKTFTRMFIAALFEITPK* KMHKYLTV 
VG 


5237 


20138 




6291 


273 


36 


RNGRWTPCGGR*IHPS*GTDPVYLPV?G 
TLSSSQASNSHPLLKaPGSRP\PWPTDL 
GYKDPRGPPSNPACRRRESLFHSD 


6238 


20139 


A 


6292 


394 


5 


AKIYFSINPLVFSKQ*FLKNVFPNILHL 
QFFSNINSLMLIKV*ALLQDFPLVHNVY 
NKICDTSKDTTTLPMSLMFVFKMKTFFS 
LKAYI/WSGTVTHTYNPSTLGGRGMWIT 
*GQEFKI SLflNiyiVIASLy 


6239 


20140 


A 


6293 


536 


190 


RRSFAIiAVQAGVQWRDLGSLQPPPPKFK 
RFSCLSLPSSWNYHHAPPRPAWFVFLVE 
TGFLHVGQAGLELLTSGDPPVLASQK\A 
GMTGVSHRVQPD*GHGGSRLSSQPFWEA 

KTGGSPEVRSSRPA 


5240 


20141 




6294 


2 


238 


HE7-iDLGTTQVLTHPSTTAMYFVHYCQPP 
* ILYGTINT*PPVVHKNPIHIKTP/SPC 
LQASTAINPQLSHINC\NPKDTPH 


5241 


20142 


A 


6295 


2 


447 


F FFLMGVSLFCPGL / WSMGAILGSLAT 
SCLLGVQVILLCLSLPSSGKLQVCAYPH 
L6*FLYF*ABDGGFSPCWPG*VSNS*PQ 
VIHLPWAPKRLGITSMSHCCPACFHCTF 
FRHFRQNPHIiNLFFFLEGLGLSPPPPSQ 
PLEGGKGPGN 


6242 


20143 


A 


6296 


2 


133 


AEERERERERERERE \RERERERERERE 
R*VERERGGVPLRTYR 


6243 


20144 


A 


6297 


3 


244 


HDYSS FLAPPftLGYMRMPLHPAYFI LF Y 
FIS/FYLFFSQDEVLLC*LGWSQTPELN 
QASCLGLQKWWDYRCEPQCLAECDS 


6244 


20145 


A 


6298 


1 


361 


GTEIRSAMLARLVjaiSGPQ/CYLPALASQ 
RAGITGVSH*GCPFFSLLKKRWRSHCVA 
QAGVQWFFRG*TPGLSCFSCSLPSN*DY 
KCAPLCP P* DTVF * VRAVS SEVPLT ITY 
RGEGAGVWA 


6245 


20146 


A 


6299 


3 


348 


HERHELEELIKNLFFFFFFGKKIFFFFP 
GGKHGGPF*FIGAFSPRPQGIFL?YPP* 
KW\DWGPWPP CRGDFWFF* KKTP FS I LA 
RWLNSPPWGILSPYPPKGVRFPGGTPR 
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nucleotide 
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SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 

location 

correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, -'sStop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PGPV 


6246 


20147 


A 


6300 


1 


436 


VDGEFS C!LSLPSGWDCRRAS PCPANF/ S 
VFLVGIGFRYVGQAGLELLTSGDLLASA 
SQSPGITGMSHCTKPRKHH*IANSSSVL 
GDFRIDYFQISRKAQWITLGYNPWAESH 
LLMGAGUIVLPAPRIjMISQVDIiLPATHP 


6247 


20148 


A 


6301 


370 


70 


PKG*STLPASASRSAGITAVIL*SHCTS 
PPM+LRFKHFTHSSTSTPF/CFFFFGKK 
ILIFVPQPGGKG?NLS*LKPLLSWLRKL 
PCLTL*RGGDYPGFPPPP\SYFCFFIKK 
GV* PCGPGGNS* PASASRSAGITAVIL 


6248 


20149 


A 


6302 




285 


KHFFLFTYLFRDRVSVCYPSWSTWK/S 
L* SWLIAASN/CLGSK*FSHLSLLCHHA 
LLVDFNFL*RRRLIRQPRLVSNSWAQAI 
LSSQPPKALELKT 


6249 


20150 


A 


6303 


194 


351 


GT FI CI KKPVQ/ WLDTVAYTCTPSTLGG 
RDGRVT * AQE FETS LGNI VEPHVYKK 


6250 


20151 


A 


6304 


354 


211 


YRHMP/S*LADFFIFCRDSLFMLPWLVL 
NL *AQAILLPQPPKVLGLQE 


6251 


20152 




6305 


1 


346 


GTSSLQLISLHSSPTASYPPKHLHTRLT 
ELLSVSLMAHVLSVSPATSS /PILQPHP 
lYPCTWIMFTHSSGSGKPSCQTHTHTYT 
HTHTRMCaHaSGHATLPP*HPLHPRHSA 
YLLW 


6252 


20153 


A 


6306 


395 


177 


WLKTLWYIHTVDYFLAMKRSDLLIHSPW 
MNLQGIMLSEKRQ\SQR*HW*FYLYNN 

VEMKKSQKWRTVLWLPE 


6253 


20154 


A 


6307 




718 


GVLPYKPGWSRTPDLV/HPPTSASQIAG 
ITGKSHRALP\KFAFLTSSQVMLTLLAH 
GTTAWFSKLS*F*EPPMTCAVFSVVTG 
SCHI]SrSFRIQTCL*yySVFTSCFyCSMl 
EKLTTGIS*QKEHELPFSQSSDIFKLAy 
LIFFW/CWEMESHSVSQAGVQWHKLSS 
LQPLPPRFK*FSCLSLPSGGDYRCPPPH 
MMrFCIFSRDSVLPC*PGW\S*TPDLRM 
DPPGLRPFQSAGDLTGMEA 


6254 


20155 


A 


6308 


377 

h 


67 


PPPPKIFFFSQTLFFFWGQCPQIAPPPK 
KFFF*KlSrPQGFFFPPPKKKKIFFFPPPK 
FLPPQKFF*KPP/LSFFFFFFFFFFFFF 
FFFFFFFPTI * IFYSMSFLP 


6255 


20156 


A 


6309 




362 


QTYAKMHVTIIFFGVKXLTFLPQRFLGL 
SGMPRRYSDYPVAyTT*NILSCVGSFIS 
LTAIiILI IFMIHEAFASIRQELIVEEPS 
INLE*LLGCPPPYHTF*EPGYLKPKRKR 
PCESSPPKLV 


6256 


20157 


A 


6310 


2 


411 


"VLPTLWEAGARGSLEARSSRLQ*AMIT\ 
HCTPPWAT 


6257 


20158 


A 


6311 


59 


334 


THACX3NMLSYAGSFISLTAVLLITFMI* 
BA\PASKRKVLIVKEPSINLE*LYGCPP 

P YHTFEKPV YI KS HQ KRKE SNP PKLVS S 
QPHGLHDL 


6258 


20159 




6312 


3 


22 


kpsp*pltgalldllmtsglsm*fnfhs 
itllilcll™tlsiyq*wrdvtresty 

QGHHTPPVQKGLRYGI ILFITSEVF\FF 

agyfXwpfthssltptpqi *GHMPPTGI 

TPLNPL*VPIiIiNTFVLLSLQVSIT*AHD 

P 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptopl)an, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possifale 
nucleotide deletion, \=possible 
nucleotide insertion 


6259 


20160 


A 


6313 


11 


182 


INKMW* I *VRG* YSVI KK\DEILSFMAT 
WMEVEDILLSKIREAQiWTHPVFSFISG 

S 


6260 


20161 


A 


6314 


3 


421 


IQASTMAGAPPQASLLPCSLILDCCASN 
S/RGSVGVGPSEPCTGYHLLVCLLLRPS 
EKCSIRVGVT*FSRCRPSP/TRKGHSPT 

PCTSRMR 


6261 


20162 


A 


6315 




111 


DKGMWFMPAKTASVLWPMDQGVI LNFK 
SFYLRK\TFSKYISAIDSDSSVGSGQSK 
FKTF*KGFTNLDAIHDSWEEIKIST*TG 
VGKKLI \ PTFRDDFKGFSTSVAEVSADV 
VEIGREL*LEVKPGDRTELL*SVLWPMD 
QGVILNFKSFYLRNNIF 


6262 


20163 


A 


6316 


440 


312 


TSSFSLSFF/CQDG/SLTMLFRLVSNS* 
PQAILPSWPP3CVLGLQL 


6263 


20164 




6317 




404 


prymivkllki/ernkileaarek*hft 
s kgkai wi i vdls s etme vrknlhnfqv 
lkek/nktwevrilysakisfqkkgkik 
ifsdegklr*sftsrptlre**ri<;fsk* 
k*ndrrrnlktfgrkkeypkelkp 


6264 


20165 


A 


6318 


394 


182 


mfiva/vfskakewrqpkclstdewv/q 
ym*yihtmkyysaiksnelvihvaa* il 
fentvlsershimhgshi 


6265 


20166 


A 


6319 


1 


378 


apsdfflfqslnkplncthfssvhnvkk 
tslt*lnsqnpqflrdepngwyhcl/qk 
yleldgayvkk 


6266 


20167 


A 


6320 


286 


314 


NRCCFSIiPTQVKCFNLVLGICCVCDWGY 
yHWLNF*NPI F* FILRQFHSVAQAGVQW 
RDLGSLQPLTPRPANFYILVEMGF/TPL 
liRRLVLNS * P SIjLKYKN 


6267 


20168 


A 


6321 


23 


397 


QKTIjNMEBCEMNI YI HKM* ITPNRSNI KR 
SSLRHITVKFSKVKDKQRI/MKLAIEKQ 
PITYKETTIRLSEDFSAETL/WARR*WD 














SLGGPILTGGGKEIF 


6268 


20159 


A 


6322 


411 


2 


TEPPPPGGNF\L*TKPGKKNFLPPKNLG 
FFSPFSPLKFFFSPK\P*ILGGGGAPFC 
PPQKKGFFPKIPRGVFFSPP/YKKKNLI 
FPPPVIIiGPPKVFFKGPPPFFFFFFFFG 
QSGQVKLKSPKCK 


6269 


20170 


A 


6323 


395 


280 


MLARLVSNS *P\CDPPG]:iASQSAGITGM 
SHHVPQVNHF 


6270 


20171 


A 


5324 


3 


359 


QHYLVFRETQIKAAVRGHYIPTGMAKII 
KTGSTKC*Q*CGaAGTLIHCWEECR\LK 
QPLWKTAW*FLI*MLCD*TVPFLQiyPT 
EMKMCVCRMTC\RVFLSVLFIIASSMKQ 
PKYHKLVN 


6271 


20172 


A 


6325 


3 


174 


FFFETGPNPVNQAGGQGHDLG*LQPPPP 
\GSRIFPA*TSRVAGITGLlilHHPHPLGS 


6272 


20173 


A 


6326 


265 


13 


TASPIRI I ITSQDVLHS*AVPSLGLKTD 
AIPVRLN\QTTFSDTRPGV*yGQCSEIC 
GANHSFMPIVLELIPLKIFEIGPVFTL 


6273 


20174 


A 


6327 


32 


259 


CYYEHLCTSLCVNWYFHFF*VYT*EWQW 
PKYLSADEWINKM*YIWTVEYYLTIKIS 
EVL/SATTWIiGLENIMLSKRS 


6274 


20175 


A 


6328 


1 


228 


PGLNHCSHLSWDHRHALQCPANFG \ YFV 
ETRFTVLPGLVSNDPPALASQSAEITGV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

conespond 

residue of 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Hlstidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














SHCAQP*VCFHYCLTFFSLQ 


6275 


20176 


A 


6329 


346 




LPKKKKKKISRGLFPFQLPKKKKDNEKS 
LTFSSETMPGRRQWSKIFKVLRGKKQPQ 
QPRIISLAKLFLKSEDFL/ETQKLREFV 
ASRPALQAMLKEVL *REGK* YRSLFKKG 
RALK 


6276 


20177 


A 


6330 


349 


77 


HTTPKKNKHN/ RKKI KLDVLRYNKVDDN 
SKMSGLH* QCPSDE CDAGVFMASHKQ / F 
DRHYCGKYCLT YCFNKPEDKAQHCGS LE 
PRSLRPAWAT 


(ail 


20178 


A 


6331 


319 


46 


RPRRPETK*ALNK*KDiyPMCSWKQHHIJ 
TVKMLILSKLINRFNAITIKISTE\FHR 
NKLILAI IWKCKGPRIAKTT+KGIKLEE 
LLYLISRPP 


6278 


20179 


A 


6332 


496 


309 


DHATAF * HQS FTLS L/RAGMQWHDLS SL 
QPPPPGFK*FSCPSLQSSWDYRCPSPRT 
G*F\CIF\VEMGFHHGGQAGLELLTSGD 
LPASASQSAGITGVSHCTWPNILNWK 


6279 


20180 


A 


6333 




173 


HHHSRL/ IFFCSVLGTSSHYVA*AGLEL 
LGSSNPPDLASQNPGITYGSHYAWPDIP 
VF 


6280 


20181 


A 


6334 


339 


153 


FSPPPPFFKGFFFPPPPPPF/SF*PLRG 
GGPPFFFGPPGVRPPPSPWSWCSPPPV 
VLASFFFS 


6281 


20182 


A 


6335 


32 


331 


QLLNHSVIVFLLSFDHSYFLRNLSLFIjA 
FKLIGVKNS *YI ILSFRYLLHP * HLUiTF 
FFF/CKRQGLPVLTRLVSNSWAQGICQP 
WPPKVRGLRAWSPGLSL 


6282 


20183 


A 


6336 


226 


177 


DGVSLLLPRL\GGAI SAHCNUDLPGSGD 
SPVSAS*VTQLQYYFCICLGLFTFTTRL 
SAYLGTGKLTTALPNTPGLLDS 


6283 


20184 


A 


6337 


302 


59 


KMLLNTIMC3SIRBIL/RKRKSPRVHQTSS 
VSYFKKLPQPPCJHSAASTLISQQPSSSK 
QDPPAAK*L*FVEGRLIAFPSNNVF 


6284 


20185 


A 


6338 


197 


28 


AEICVRFSSAVTWHLGSHSL**SKKNL/W 
LGMVDHNCNPSTLGGRGGRIA*AC3AFET 


6285 


20186 


A 


6339 


46 


317 


EVPSS ISSQISLFSGDFSSVPLEELSNL 
FFMFCPEHI IVTCBRVTSNYS /YYKWKP 
EFSYP*IF/CFKMSLTMLPGIiSIiNSWAQ 
VIIPPQPPK 


6286 


^0187 


A 


6340 




276 


QTTLRGTTVQITKNLSSETMWASRCEKK 

NL*SAWGWGMWEHNYQPILYSAKISFKN 
GVEIKTFTN/KQKIKN/FITSITLV*EI 
LKEVLQAEG 


6287 


20188 


A 


6341 


485 


2 


GVWEFLETESRSVAQAGVQWCDLGSLQA 
PP\PGFTPFSRLSLLSSWDCGSPPPHPA 
NFFFVFFVETGFRHVSQDGLDLMVS IS* 
PRE P PASAS QS AG I TGVSHRARP KHSKI 
LIAGSRRS I QMFP VKFFQKLAWENFQN 
TKDFLNPYSEILLHTSQIHSSC 


6288 


20189 


A 


6342 


2 


339 


EDLRVKLKKEGYSNI SYI WNHQDPMLN 
SNGSVTWALLQAS *YLCILQAS *ASSQ 
SWTS/VTLAAVSFQRISNQEHIPVYQQE 
ENQTDVWTLIjNGSKDDFLIYDRCGCLVY 
HL 


6289 


20190 


A 


6343 


181 


12 


RWGLPSGWDYRSMPLHPVNF* II / CMRQ 
DFTVLPRLII.NSQPPVILLRWPPKVLGL 
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Predict- 
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otide 
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correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















6290 


20191 


A 


6344 


3 


295 


YWHLS CF* FLAI TNKAAMNI RLLWTCAS 
I FPR* RSRKEMAGS *GS/SLFNFVRNCL 
TIFSHDCTTL/HS/HQQWGGV/PVLPFP 
HQHLL* S I FLVFDI LIGI 


6291 


20192 


A 


6345 


334 


8 


KPSNPGGEGPP * KKGPKVFS PWGKKENF 
SPKKKKKI*CPQKPKGPPWGAPAFKKKK 
NF*NPRGSPQL*KKRGIIGPPTLIPPSS 
QRPPPKKKKKK/GPPKQRLLAFGGIKFF 
FFFLEKNFLFFPKGKTLIiAPFFKGGLPL 
PG 


6292 


20193 


A 


6346 


68 


256 


VA7A\PVTHTCNPNTLGGRRGRIT*AQEF 
KTSLGNIVGPHLYKNLKKKKKKKGEKKP 
GGGFSPL 


6293 


20194 


A 


6347 


385 


60 


KKKKKNRTPYDAAI /PSLGTSSKEFKSG 
SSRaiCTPMFIA/AITKRWK*LKCPSMV 
MI/VKKWWFIHMMQY*SAKKKEILS*MH 
LKDVMLSERMLSQSQKKNFCMILLIWGI 


6294 


20195 


A 


6348 


234 


61 


CHRIEHIKMVML/ WLDWVHTCNPSTLG 
GQDTWIA*AQEFKMSLSNTAKPCLYKKY 
KN 


6295 


20196 




6349 


2 


358 


FLSLGIHFNYCLVTTESKLSKLRLTYFF 
FFFLETEFPFCSPA*RPW\PN1,G*LKFP 
P PGSHETPALTLPKTWNNRUAP PHPANF 
GFLRKMGV* FFGPGGFKTLDLGGPTRLW 
FQKGGNW 


6296 


20197 


A 


6350 


346 


27 


:<MV*SLWKAVWQFTjIKT,NIH* PYDSAVP 

LIG/IYPT*MTT*IYTKP*TQMLAAVSF 
IMAI-arrR.VESNIFRWPIDKLWYIHIMEY 
* *AFKRig*LLIN-TKT*MNLKC*LF 


6297 


2019S 




6351 


^6 


158 


KNSVPSQE*LYYL\RQENRLNSGGIGCR 
EPIWCHYTPAWAT 


6298 


20199 


A 


6352 


2 


146 


RSSHLSLPSSMDYERAPP/H/LANCCIF 
CRDR/SFTMLPSLVIiNS*IQTILL 


6299 


20200 


A 


6353 


74 


303 


EPSICGQIYLSFSIYKIQK* /WPGAVAQ 
ACKTSTLEG*GQWIT*AQFKTSLGNMMK 
PCVFS KKKKKKKPHPGGGGRKK 


6300 


20201 


■ A 


6354 


3 


210 


NNKKIWTLKNLKFFRDRVLLCFPDWSTA 
VHSRLTaaS\TLGLKQTSCLSLVSS*DH 
RCMPPCLALKRKKT 


6301 


20202 


A 


6355 


2 


627 


grvggflvpmrdlaeaskgqqgqvklge 
mkpsvscvcggv* c iskfklkn*l\wpg 
tvahtcspstlggqggwia*aqefets'l 
dnmvkXpcly 


6302 


20203 


A 


6356 


584 


166 


rqghnvaqagvqwrdlgslqppppgfk* 
ffhlslpsswdytrvpphlanf\ * flve 
tgfhhvgqaglelptlsdlpawasqsag 
itgvt/hqarlqll*kelinpstrigp?w 
elleawgrcpswmgddrialncfhhfsv 

S 


6303 


20204 


A 


6357 


448 


202 


FTTLVSE CI *D IHDSWKDV* I STLTGAW 
KTWLI PTLVNGFEGV FKT* VEEVTADW 
EIARESELEVEPEDVTKLLKSHDKS 


6304 


20205 


A 


6358 


3 


237 


DRSSL*P*PPGLK*SYYLSLPSGWDHICR 
A*PHPANFCLFTF/V*QRGLDLLA*HDP 
PASASQGN* ITGMSHRAGPPLTF 


6305 20206 


A 


6359 


49 


242 


SSYLSL*P*PPGLR*SSYI.SLPSSWDHK 
HA*PHPANFCLFTF/V*QRGLDLLASHD 
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nucleotide 
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ing to first 
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peptide 

sequence 


Predict- 

otide 
location 

last^amino 
residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPASASQSTS ITGMSHRVWPPLTF 


6306 


20207 


6360 


508 


223 


FVTQAGAR*LDLSSLQLPPPGFKQFSCL 
SLPSGWYYRHVP/STPIVF/CIFSRDGF 
HHVGQAGLELLTT* STCLGLRSAGITGL 
SHRAQQERGFEP 


6307 


20208 


A 


6361 


2 


595 


AIWG*GTHFLIPWVQKPIIFDHRSQPH 
NVPVITGSKDLQNVNIIPCILFGPVTSQ 
LPRIFTRIGEDYDEHVLPSITTEILKSV 
VARFDAGELI TQRELVSRQVSDDLMERA 
ATFGLILDDVSLTHLTFRKEFTEAVEAK 
QVAQQDAERA/RNSLATAGDGLMELCKL 
EAAEDI TYQLSRSWNI TNLPAGQSVLLQ 
LPQ 


6308 


20209 


A 


6362 


1 


259 


lesfyfllyyyk*ss*plrlkelilfif 
eteccpvaVqlecsgtisahchlclqgs 
gnspapayrwgitdsfeithciltcip 

SE 


6309 


20210 


A 


6363 


60 


221 


NTTKI(GMVAHTCNPSIIiGARaGRIT*V* 
EFKTSLDNIVK\PCTPPKKSLKIKLN 


6310 


20211 


A 


6364 


1 


393 


RPRRPQTQVQWQNLGSLQPPHPGFKRFS 
CLDLPSSGDYRHTPPCPANFSQFS/HHV 
GLKLFDSSDLPWPPKFVGIARHESHRKC 
T*RSRRGRRIADPLER 


6311 


20212 


A 


6365 


19 


85 


CI]MSKIMQSSRITTLCKMVQPI.*RALWQ 
CLiKLNIPHPYDSAIPLLHFYQCK*RHM 
NPHN\TFTSMFITALFITAKN*KQSKCL 
SIGQWMKK* * CIHTVQ*NSGI * *HE* IN 
SKIMQSSRITTLCKMVQPL 


6312 


20213 


A 


6366 


2 


298 


PPRLAKVQKFLLFFYLVF*DRVTLCHPG 
WTAW*SQLTVTLAHRNLRLPSGWDYRH 
VPQRLADFVFL/CR/DRGFSMLVGLVSN 
SRPQEV*ATSPSQSLLS 


6313 


20214 


A 


6367 


67 


636 


QDNKKGIRYLFVYFRQSFALVAQAGVQW 
HDLGSLQPLSP*FKQFSCLSLPSS*DYR 
HAPPRPANFL/QFLVETEFHHVQQaGLE 
LLTSGD*PASTSQGRGITGVSHCTGPRI 
RFLKTFSHLEAPGGFPEKRQPLRSERLQ 
PHAFPQTAPCAHAYIiGCVSHLEMGCRN 
IDIPELSFFLCPLQCLIYRGKSP 


6314 


20215 


A 


6368 


3 


251 


RSRHCTPAWAPLSTQDHFYVFYFFLHFL 
iaSLFIFACFVSSYFIEFSyCI,\YFFVD 
SPGFYRYTMK*NIiNSNYFKLSASILPQ 


6315 


20216 


A 


6369 


333 


193 


FKKLVmAKAKYCaCftNFFFQQVPVASKN 
/TGDLWEEPGWSQTPDLR*SALLGLPKS 
WDYRHEPPPRLANFCI /FLVETGFQHIG 
QAGLALLT/S/GDPPASflSGALE*LGGG 
SCL 


6316 


20217 


A 


6370 


36 


369 


RPTRPEMRFHPVGQAG/PRSPGLK*SAC 
LSLSKCWDYRHEPP/^PG*FSFFTYIYI 
YFFFLFFSFSFLEEMGFHHWKAGLELL 
C^NDPPASASYSDRIIGVSHQARPFFFF 

FF 


6317 


20218 


A 


6371 


2 


375 


DIQMIDRNIKN/CSTSQSIRKLQIKRIM 

* YHLTSVHMAITKKTTHKC *RCGKK\LH 
C*WQCia:iVQPL*KTIQRFVQKTENISmSri 

* PSNS KN/WKSAC * RD I CTAMC I TT * F I 
RAEHGGSXPCKPSTLG 


6318 


20219 


A 


6372 


75 


372 


NGNSQKKKKKKKKKGGPPLKKPSGGPNL 
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sequence 
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SEQID 
NO: in 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last\mino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlirconine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=TJnlfnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPGGAKKSFPLKGGQKKPPGGFLEKNPP 
LGGAPL\GPPPPPKKRPRGKTKP*GG*R 


6319 


20220 


A 


6373 


605 


350 


I STGI CLGMFIAALFTIAKL/WKAT*PS 
FNR* KKKMCYI YAS YSAIKK\DAILSFA 
ITWMNLENIMLSEISQTHKDKCCMISLl 

CAI 


6320 


20221 


A 


6374 


167 


378 


KKSQGGTNIKPPAITKYLSQMGG*KKLL 
TGFLKKKSLLGGGAGVTILLT\NRTFST 
KINDPDGGGDHILSQ 


6321 


20222 


A 


6375 


994 


747 


VSLALSPRLECNGVISAHCNLRFPGSNN 
SPASAS*VAGAHHHTQLTFVFLVETGFH 
HVGQAGLEJjL/NLVIRPPWPPKVLGLQA 


6322 


20223 


A 


6376 


1956 


1592 


hefemesysvtqagvq*hdlgllqplpp 
rfkqlsclslp/rywdyclappclanf\ 
*ylv™gfrhvgqaglkllassdapasa 
fqsagitgmshrarpksafkiqaiigeg 

GVSELHRKINI 


6323 


20224 


A 


6377 


94 


402 


KTQELQNKI SHCL I RRVDRKM/ WLCTVA 
HTYYPSNLGGQDGRIA*GLEFETSLSWI 
TRPRLYRKKKKKKNVFQAWPLWSNFLRR 
LRREIRAEKSF 


6324 


20225 




6378 


463 


146 


GKLGPPKGSLKPPPPFFFFL*R/P/HPP 
PRGETPFFFKKPKKSPRGGGASPPFPPL 
GRVRPENFPPPRGQGSP*PKRGPTPALP 
PGGQKETLFPKKKKRKKKKKYMRLI 


6325 


20226 


A 


6379 


"3 


207 


leyqvAqtmCi issriqlVaekaSvpfv 

DKrVIiS\TMGQL*EAKAEDGFFVPG*YG 
LAVSPLKFQSTRP 


6326 


20227 




6380 


328 


70 


RPRRRLGLPRSWDYRHEPLHPDQLLVE I 
MSHYVAQAGLKLL\ITGVSHHARPH/YR 
F*LGHC*SLVERPWTISYSFLCLGFPMW 

EVSD 


6327 


20228 


A 


6381 


13 


77 


VIFLLQPPK*SSCFSLPNRQDYRRVPPY 
PANF* FS /CRHSLTMLPRLLF*TGLQAC 
ATI PS 


6328 


20229 


A 


6382 


342 


77 


KGQGFYPHLFFFFFFF*KDSPSLPRVEC 
CGKISAPCmi*riFGSNDFP/ASHAIVGG 
ECLPKQRKGAFFS*GGGFFFFF*M*KHSV 
LNSSRN 


6329 


20230 


A 


6383 


1 


208 


RQRVDKIFYLFFETDSQSVT*iM3V*Wm 
PGSLQPLPPGFK*FS*PQVI\TCLDLPK 
CWYYRREPLGPVY 


6330 


20231 


A 


6384 


277 


50 


LVLKLGRCKTSSFI*NFKGPQiaKTILK 
KNKAGSLT\LPNFKTNYKATVIKTV*YW 
HKH* YRHGF ISGLS I LFH 


6331 


20232 


A 


6385 


2 


354 


AAASTNFRDR/IL/LCHPGWSAVflQS*L 
TAVSTIiN*AQGIG\SSHLSLIJSrSLDYRY 
APPLLAVLLCCP6WPQNSWPQVILLSQP 
PKALGYRCE PP CPAPLLLNFFFFFKNRL 
WPCCPGWSA 


6332 


20233 


A 


6386 


2 


188 


FPKYICY/CINMYMCFYIFINIYSLIYL 
* IHI YKYIFAYIFINTYL* lYICLYIYK 

YIFIYYR**IYICLYIYKyiFIYYR 


6333 


20234 


A 


6387 


2 


241 


WATKCR*G*RRTGTLTGG*WDCKASGQL 
*KTFQQFL1S1TY/PYGPAISLLEIDPS 
ELKTYVHTKTRT*LFIAASFIITKN 


6334 


20235 


A 


6388 


142 


429 


YCVCIYIYIFLYLFIFFCTFFTYSSLHL 
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M 
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nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 

location 

corrcspon 

last amino 
sequence 


Amino acid sequence (A=A]anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Thrconinc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LIFSFFFFLERGVCFSPRGGGGGKYLGL 
WHPPP PE * I KKiSrVY I YA\QQ YYSAIKQN 
E ILSFAATWMELEAIMLSEIRQAQNDKH 


6335 


20236 


A 


63 89 


468 


250 


VHRRTPPCPIIFFFF/CKQ*VLPMLPKL 
VSNSWPQATLLPWPSLTVEFQALTPGWK 
NFKKMFLMVMSWSI KKR 


6336 


20237 


A 


6390 


616 


1084 


LCLCQYAGISVQCPDCVFQFFHNVDSTY 
ED YLRKLKTS LS FFWRRS FAL VAQAG VQ 
WCDLGSLQALPPGFK*FSCLGLLSSWDY 
RHP PWHPAHFVLLVEMGFIHVGQAGLEL 
LTSGDP PASASQSAWITGMSHRTQQE \ Y 
FLNLKFSTSPGELLCLK 


6337 


20238 


A 


6391 


415 


200 


SPPPRFFFFCPPKKNFFFPPPRSKNFFF 
* RGPPFFFFFKKTP FF/YFFFKPPFPPR 
/RFKNPPKIFFFFNRGPKNPGGGGGKKK 
PPGPGGFSPLWGPPPPFKKPPPPPKKNP 
PLFGKKKKKKKKKKRAAPR*NPRARGVF 
FFPPPPLGFWAPG 


6338 


20239 


A 


6392 


3 


117 


KLKPSSH\DY*VGETYGKDITSRGKDKP 


6339 


20240 




6393 




204 


DRVSLCRPGWSAVA*S *LTATSMS * \VK 
QSYHLSIP*GSWDYSCVP\PPLG*LVIL 
LRQPPKVLGLQA 


6340 


20241 


A 


6394 


501 


187 


AGVKWYEHGLWQPPPPGLKRSSHLSLPS 
S*DHRHEYPCPANP*KIFF\VETRSHYV 
AQTSLEFLDSSNPPTSASQNAGI\ *GMS 
HCAQPMQTFSLVKIGTNFTjIF 


6341 


20242 


A 


6395 


3 


393 


hacgaimns kphavtqyihtp j j.m* s vli 
ttWllllfvtilndgitilltdrnrnt 
tlfypsgggdpilyqhlf*fsghagg*i 
l/llpgfgtishivtyysgkkepfgyig 
mv*amisigllgiiv*ahhi 


6342 


20243 


A 


6396 


101 


135 


FDSPTPGIVPGTWQTIRSLNVYLTRLTT 

sfslmps*flfiffchdg/sltilprlv 
snswpqailspqphkmlglqa 


6343 


20244 


A 


6397 


395 


187 


SSYLSLLGIWSYRHVPPHPAMFLFFCRD 
GVVYFIFFC\KMGAMLPRLVLNSWAQVI 
LLL*PSKVLGLQA 


6344 


20245 


A 


6398 


35 


334 


QNETLSKTKKIKtfKKKKKKKKRGGPLKK 
PLKGPKLTRACKKKPFSLKG/WRKKISINL 
GIFEKKPYFGGGKKWNKPPTKIKSFKEK 
KKF*HEKGEKQPQNPGC 


6345 


20246 


A 


6399 


464 


146 


hiqirefqrtptrmspktstlshfkikk 
qlvmnkrsfikqwis*qkpcqmarrd*h 
niftvqkgtncqirXilcpaklsfkner 

EIKPFLGKYKI.REFITa.*FIFKG 


6346 


20247 


A 


6400 


723 


532 


FFN*DKILLCHPGWSAVARSQFTAASTF 
RVQ\QSACLGLPSSWDHRHIPSCLATF* 

* krgfaplprllsnfwacjailppwppdv 
\lglqa 


6347 


20248 


A 


6401 


522 


46 


FFPPFLSFFFFSKIPKQSI.GLVFYF*NK 
VWVFCPGWRAIR* FPFPA/APPLGSKNS 
/ SCUSTLPSTWGYRISGMVPPKAH* I PPE 
I FFFFFF/CRQSIiALSPRL/WQWQVQA 
ILLPQPPE*LRLQVPGHW 


6348 


20249 


A 


6402 


103 


261 


nipftclikkkkkkKKkIcKKKkkkkkkk 
kkk/ik*kkkk1sivwggvvkkklggd 
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SEQ ID 
NO: of 
nucleotide 
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SEQ ID 
NO: of 
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sequence 
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od 


SEQID 
NO: in 
USSN 
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26 


Predicted 
beginning 
nucleotide 
location 

ingto first 

peptide 
sequence 


Predict- 
ed end 

ofide 

location 

correspon 

(ast amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6349 


20250 


A 


6403 


146 


384 


GFLPSLSGHKELSLFFLFETESHPVAQA 
\DLGSLKPLPPGFK*FSC\SLPRSWDYR 
RVPPCSVNFCVSPY*PG*SQTPDLR 


6350 


20251 


A 


6404 


3 


395 


ETESHSVAQAGVQWRNPGPLQPPPPKPK 
RFPYLSLPSSWDHRHMPPCPANSAFSVE 
TG l/ LHAGQAGLQLLTPGDP PT * T SQ SA 
RTTGASHRARPSLKYFLISFVISSLLHG 
LFRSVLGSiynfLRISQVFFCC 


6351 


20252 


A 


6405 


30 


205 


CRLTR PAEFRTRKTHT \REKPYECKQ CG 
KFFS CKSNLIVHEKmKVETMGIQ*VMW 

LFL 


6352 


20253 


A 


6406 


3 


444 


ARKITGY* ISKTIVFLYT/DELAEKEIK 
KAIPFTIVPNK*DN*ELNLTKQVKDPHK 
EDYKILMKETEQDTK/WKDI PSSWIRI I 
NI VKVAVLSKAI YRFSAI CI KIRVTFKT 
ECHWKKKIWIGICVAHACNPTTLENQGRR 
SLEARSLRLS 


6353 


20254 


A 


6407 


480 


307 


QHDSPSAKKPPQKPDKS/HPSSRS/HPF 
PPPPPC*TPPPEF*YPPPKGEWGAHPPP 

QG 


6354 


20255 


A 


6408 


923 


833 


KftMKIIFLI KTEAFFHHVTAHL I DQS I VK 
TELLYVTSFIAI FALLWWYGKEPML/WP 
GAAAHTCNPSTLGD*aGRTA*AQEFKTS 
LGS *NICrKP * CTiSAV 


6355 


20256 


A 


6409 


382 


152 


ECSGTLLftHCNLCLLCSSDFP/SLFHW 
SLSSNSSS*VAEIIGLCHHVWLIFVSAK 
LFLGRRVRNLFSPTPKNMFSK 


6356 


20257 


A 


6410 


553 


173 






AR/WPP*K*RAP*NPVKTPGGSF 


6357 


20258 


A 


6411 


38 


562 


FLRQGL/DSAAQAGVQWCnSHSSQRAPPF 
GLGRSSH\PASLaGWSAVA*SRLAATST 
FRVQVILLSQPPE*LGLQSCSITQVGVQ 
QWRDLGSLQPPIiPGFKRLSCLSLPSN*D 
YRRLPPRPDSSITFSRKFFQSPQARNLP 
PTPELKPQQPVPAPSPSELHLSWSSGNK 
CHFLWFLCP 


6358 


20259 


A 


6412 


414 


144 


1,'KJ.'ETESHSAAKLEC\ISAHCNLHFPGS 
SKS PASAS * VAGTTSVCYHftQL IFVFLA 
DTGFYHVGQDGPDLMFCPPRPPKVLGLP 
RLPLICF 


6359 


20260 


A 


6413 


3 


455 


AEAGGLLEPRSSRLQ/ CSQP* SHPCTPA 
WAT 


6360 


20261 


A 


6414 


462 


162 


TQGPRLFPHPFGAPGGPVPLGWEVRPPP 
APPGKPPFPPKPQ\NSPPWGGQPSIPPP 
LEG*ARKSP* PPKGLVPVTPKGPPPPHP 
GQENPTPFPKKKKKIOTK 


6361 


20262 


A 


6415 


499 


0 


HFTSKEIHFGFEAAXA* YWHFVDW* LFL 
YVSIY**GSH 


6362 


20263 


A 


6416 


187 


236 


MCHINGIMQFFFFFFFENEVFFCPPARI 
GN*R/WPNLG*LKPPFPGLTLFSCLTPP 
RSWNKRPPPPARVIFLFFNKKGVFPLLP 
GRV*IPHTPEPIiATTNLLSVTIDLLILN 
VSYKWDHAVFFFFFP 


6363 


20264 


A 


6417 


158 


412 


SSDPSRRSCLFSF/CFFFLEREFFFAPR 
VEWHGPNFCLLTPPPPGKKHFFAPSLRG 
GGDYGHLPHGPVIFGTLHKAGFSPCGPG 
GS * PPPPGKKHFFAPS LRGGGDYGHL PH 
GPVIFGTLRKAGFSPCGPGGS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
otide 

ding't?"' 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnyla)anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6364 


20265 


A 


6418 


457 


11 


HTGQSPYRCSECQK/AFTQKSAITNHQK 
PQQKGRKSMQ*LRI*ESFFD*SHQLRHN 
RSSMNTMIVMGPFGTLVGEKPYRCNEYF 
KPSFLKISLVYRRDLTRRIIGQAHWPMP 
VISPiWEAEAGRSLEVGSSRPAWPT 


6365 


20266 


A 


6419 


103 


376 


SALSLILGITFFF*IFLETGSHFIAQAG 
VQW/PIHTCDHSILQPQTPGLKPSSCLS 
LLSSWDYRHVPPVPRTSACFECTILRSG 
SVAKTVLWKAKAGGSLEVESLKIACATQ 
GAPCLTKKKCDPQD 


6366 


20267 


A 


6420 


463 


6 


FPPNPHKKQNQQTP\PPKKAPQQQKYTA 
DP/QDPPGNDPKAQAYKKSPAATYPAAS 
SSPSSPSS/RPPSTAAHVLRVPL*GSPP 
GLVGNPRPPPPGGAQHPQPPDTWQKRPA 
HHGLQSLSASPPGIS*EPQLAPRNSPHW 
TRDL 


6367 


20268 


A 


6421 


1 


479 


RGFAVLFSAVEFPSSSKSHLDLETSPKA 
YKHGAPCSAPGSLQASLPKGRNQRWPFS 
NFFVLF*DKISLCCQGWSAVIQS*LTVA 
STSPGSDDPPTTPPPPHflPDTWDHRHVP 
PHPAKFYI FYRDR/ S FTMLPRLVSNSWA 
QTVCLSSLFMLLPSLLWSLSP 


6368 


20269 


A 


6422 


11 


242 


FEIIFSAVEFGWSDHGSLQTQPPG\SSD 
PPTLDS*VAGTTGIiHHHflQLMF*LFVEI 
GS PGLKMS SVSAS E SEPLYPA 


6369 

6370 


20270 

20271" 


A 


6423 


488 

4_-i4 


335 

22^-- 


SCQLRPWNESIGQRSGIRMLTPVIPALR 

3a/ eeagg * le pr s s rpawa i 

r:ppasasqsagitgvseh?_rl/vslfl 
ivm*aegcylsgiwrtrlkmnppo*rke 
rqiqvkstrsdifffyflfl 


6371 


20272 


A 


6425 




1309 


?5;SESHSVT\RTECSGTIS\AHCSRHRL 
G S S D S PASASRVAG I TGVHHHAQL I FLF 
LVELEFHHVGQDGLELLT\*VIHPPWPP 
KVLGLQA 


6372 


20273 


A 


6426 


276 


437 


CFHPPEPWDQQWGPPPLGNF* IFWKKRG 
PPFVPQIILNPWPQKNFFANPPKTLG 


6373 


20274 


A 


6427 


445 


240 


RPRHJiDHL/RSGV*DQPGQHGETLRLRK 
IQKISWVWWHAGTLWPFWERDGSHRLSA 
RTQMLGSWACSPGF 


6374 


20275 


A 


6428 


U4 


450 


LLLLLLFESESCSVAQA/PSAVL/WILL 
GSL*PLLPGFKQFLSLSLLS IMEYRGLP 
PYPMSTFFKFLVEMGFHHVCQAGLELLTS 

GDLLflSASQHSGITDVSHRAQPVTIFLK 
GLS 


6375 


20276 


A 


6429 


380 


224 


LRHCLFTQAGIQRQEHGSL*P*SPGLKQ 
SSHLSLPSSWNY\GVKLPCPNKFQH 


6376 


20277 


A 


6430 


509 


228 


PHNKGRVSP PPPQNFFS PP SENLGGGWL 
P KIPP PKKRVFSKKP PRGKKK/P PP *KK 
KKNFSPPPFFGPPPKXIKKPPPPFFFFF 
FFFFFFFFFFFF 


6377 


20278 


A 


6431 


3 


117 


HSWES I* KRGACFNIFIAAQFTIADIWN 
* FMCLSGDE*IKKLGLIYT/MDCYWGVK 
KKETMPLATTWIELICVNILSEIGQA*KD 
KYHVF*M1IR 


6378 


20279 


A 


6432 


417 


271 


LSKKTKFENSPQPPLPCSFL*NDRK\WP 
RWVHTCNPSTLGGRGRWIT 


6379 


20280 


A 


6433 


385 


145 


TPG IKQS SCLHLP KCWDFS CE PPHLANL 
NF/CaCR*YIPIVPNSKV*TFSQEVFSYT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO; in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

residue of 
sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine,' 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 














WPHSSSFTGKEVIIIiFPKECPGMFK 


6380 


20281 


A 


6434 


65 


383 


LFPNLLKKKKKKKKKKKKKKKKGGPIKK 
KP * GDKI KTRQKKKNFFPKRGGKKNQPG 
KFKKKNMFWGGKKMEKP P \ KKKKTCKEK 
KKI*KKKGEKKPQIAPEKKKGS 


6381 


20282 


A 


6435 


3 


454 


E IQRTPVRYYTK* TLPRH IVTRLSKS TL 
K/EKKKKKKFLRAPTEKGQIPPKKIP/I 
KLTVNF*KKS*RREKWG\PIFNILKKKK 
F*PKIFYPAKLTFIREGKKKFFPKKQGL 
GKFVTTRAPLKKILTGVLNLKMKKGYLF 
PQNTLXYLAHNHP 


6382 


20283 


A 


6436 


424 


410 


N?FFPPQGGGGGGFFF*DKPPPGNPSSL 
FFFFLGGGKKRGPPPPPKG/QTPPPGGL 
F*KWLGGKKETPKKFSPPPPFFGGGGKI 
FF*KPPPPPGFFF*KSPPGGGGPPSVKK 
KKEKKHKKKKKKKKK*R 


6383 


20284 


A 


.6437 


3 


214 


E IDCCAAVWKQPKCPSADE *RKKMWYLY 
TMEYCSATNKNEIQSF/VMNWKEVEIIM 
LS\KKSQAEKEKHGIF 


6384 


20285 


A 


6438 


83 


420 


QSFP*ISCRKYS/WLGAAAHACDANTLG 
GRGGEAAWGQKFKTSLGNLEPGPKTADP 
RTSHGCELRRSSYTYALTHGFSSTPETA 
RPAPLLPPPPPPTQREDYKEEDLYDDLL 
NK 


6385 


20286 


A 


6439 


163 


289 


KMYCYYYY/C*DRVLLMLPRLVSNLWAQ 
EILPPWPPKVLRLQA 


6386 


20287 


A 


6440 


459 


431 


DKRIRTRITPLAIRKRQIKTTV/RKQLQ 
SHYMWVKETrNNK/CGIFIHSWWECKM 
VQLLWKTVSQVFKRLDLPYNLATLLIFI 
CQLQIKTLSSPKYLHMKVHS III KIRKK 
YIKKNLKCPSTDEWIIKI*C*SIHTMEN 
ERA*K 


6387 


20288 


A 


6441 


478 


36 


VFGCRVILVEKLFLCEPLRVTFIiRLGKQ 
LEKSL/WNFHDLELNISLHKNQPNRCVH 
Qt'JRIG*RKRS*CSE* *R/TMQFINSNLT 
L IN* IPSCHTaMD I I I I AQH/ ITHTVE Y 
YSAIKKNEIMCFPATWVELEAIILGEIT 
QKQSQIWRSPP 


6388 


20289 


A 


6442 


464 


206 


LVQRS6IHI.PYSSTVPL*GL/HQKNEAD 
VYRNTRMFTAALFLIVCJNWK* HKTYLQTG 
EKINKLPYIHTMEYFSVIKSNC5HS1T 


6389 


20290 


A 


6443 


414 


216 


CYIYTCPLTQQSRSWVAT*ASSKEEWIN 

KLWHIHTMD*Y/SSNKKNKLQINTTWTN 
LKSIMLTEISLT 


6390 


20291 


A 


6444 


7 


411 


S FYCLHS FAFSRVS YCWNIQYIAFLDWL 
LSRSNSLHVFSW\WD/CKMV« PL/YETV 
WQFLTKLNILLLHDSALrLLGIY*K/DM 
KSYVHTKMCTQIFIAALFIIAKTWKNL* 
CSSVGEWRNKLWYLQTMEYSS SVLKK\N 
ELSSHEKTWRELLRERSQSKKAIYCIFQ 
QYDTLEKAKLWRQ 


6391 


20292 


A 


6445 


257 


419 


KSLPFFF / CW*ETGSHSVTQVRVQWCNH 
lALQPQTPGLKQPSCLSLQGSWDYKFHF 

TQACPY*RP 


6392 


20293 


A 


6446 


20 


417 


HQYCFAS FFFFNNFFKTESHSVAEAGAQ 
WHDLGSLKPLPLRFK*FSCLSLLSSWDY 
RCMPPHLDNF\*FLVETGFHHVGQAGLE 
LLTSDDPPALAS*SAGITGVSHRARPVE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienylaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LFLLRRQGLRLLQNIGIGRQ 


6393 


20294 


A 


6447 


371 


3 


VIGCQFL/RKTKRYLPYDLPIVLLCIYP 
R/EKKNLCRH*TCTSMFMTALFIITQH* 
KYPKCHSIGEWLNILWHIHTMEYYAEMK 
RNKLLIHEKTYMDLKDILFSGGQRVYLK 
SSILSIKCIHRQE 


6394 


20295 


A 


6448 


46 


332 


LGEHLRQGPI ILKVCQVILTHQQYFESQ 
WSIMALSIJRKYNANCRYNLKCPMSYIKK 
EIJ!L /WPGAAAHACNLS TLGGQGRHIA* A 
QEFKTIAKPCLY 


6395 


20296 


A 


6449 


2 


381 


LCSLVFTHI CS IFLFFFFFFKDRALLP/ 
H/PCWRAGP*SYLTAASNSG\VKQSSNL 
SRGKPLCRANFGSF\CFWFWKKGGPTMW 
PRLV*N\ILLPQPPKVRGL*AAAHAGGQ 
GHDHTSLEPQTPGLKQSSNLSRGKPLCR 
ANFGSFWFLVLEKRGSHHVAQASLKQSS 
CLSLPKLGDYRHEPSHRANMFNFRSLPG 
LRKADCLYPY 


6396 


20297 


A 


6450 


2 


136 


KSVGCWRGCG* IGAL/LHCWWECEMVQL 
LQNAVWQFLKNCKIVPHy*A/ 1 PHLGIH 
PNALKTATQRNICAPMFIAALCATAKRQ 
CK/ PKCPS I DKT* YTHTMKYDSA* KKCW 
LLARVWINWSPVHCWWECEMVQLLQNAV 
WQFLKNCKIVPHY 


6397 


20298 


A 


6451 


314 


72 


MLPLTNGSCRTKVSKDIEGLSNSINPLP 
LIGIYRTPFPKTAGTPFFFFFLV/PNGT 
FTKIDHILGHKTRLNILKMI*ILKYIP 


6398 


20299 


A 


6452 


54 


350 


LRSFCSALFCFETVLLCHLGWSAVA*PR 
LPAGSSNPPPS/CLPIPSSWDHRLVPPH 
QVFLEAFFFFFFF/CRKGVFFFLPGGV* 
NSGVKGTPRPNPPKALN 


6399 


20300 


A 


6453 


368 


172 


WLTSVI\QRFGRPRREGHLRPGA*DQPG 
LR*STHLRLPKYWDYRRESQCPARPVIL 
KEflNMFHLG 


6400 


20301 


A 


6454 


1 


365 


NILKLNKKLLMNL\VKEEVAKEIKK*LV 
LNKNENTTYKDLWNEAKTVLHL*H*VKF 
IALSA*RIVSNQCSK*ASTLEMHKKKSN 
LNPRLI*QQKSITLKREKSMKPKAGSLI 
RSIKLIIRLT 


6401 


20302 


A 


6455 


364 


1 


PETRSCSVWPRLGGVQRQDHGSLPPLSP 
RLKQFSHLSLPSS*lSIHRGMPPCPAlirLLF 
KFYLFIYL/FCLFVEMGSHHVSQAGLKF 
LSSRDPSALVSQSVATTGMSRRAWQDI I 
LQQKC3AATAA 


6402 


20303 


A 


6456 


91 


300 


GHAEHFGRPRWGDCLGPEVRDQPGQHGE 
TLTLQK\KNKN*PAWWHTPAVPATQEAG 
ITGVSHRARPCPPN 


6403 


20304 


A 


6457 


2 


195 


RFSCLSLPSSWDYRHEPPRIiAIFFFF/C 
FWNFKKRGGFPLWARWVSNS*PRGIGLL 
WPPKGWGLRA 


6404 


20305 


A 


6458 


522 


202 


THNWRPSLQIFTAALFLIAKKWKQPKCP 
STDEMIHKMWYVH!MKY*SVIKENEVLI 
HSA\KWMErFENMLNEEACWERPHIV* FH 
LYEMSRVGKTGETKSRLWV 


6405 1 20306 


a 


6459 


377 


161 


EMNIKCPSTDK*VKKRY/LYIHTMEHCS 
AI.KGNEVLI IHAMTWMNTjANIMT.SEKSQ 
SQNTTYSMITWKCLE 


6406 20307 


A 


6460 


92 


484 


LLQACFLRTVLIDYKMWAEKGMVGMCVL 
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nucleotide 
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sequence 


M 
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od 
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Predicted 
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nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 
location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














MRIIMTVI 1 1 FRDRVLT ,* UPGWIIAWRSQ 
LTSA/VKRSSCLKPPKC*DYRCEILCQV 
EIAFTHHSLi'KSFLKRSMRVILLNYWKD 
PSLGFCSYDSTGMALLFLNFS 


6407 


20308 




6461 


483 


192 


MGLMLILRAIIFQSTKKKMICLLKKSLTF 
IMCI'/LSNWPCTFNMYIFVCVFAVYKLH 
WAE \ P VAHACNP ST VGGQGERTA* AQ E 
FETSLGNIIRPCLY 


6408 


20309 




6462 


451 




KQTOQLL*KQVMQLLESLNMKLLYDLAV 
UMLGAYPEEEKTIIVIITKTCTQMFTAAPF 
I\IKK*KKNFSIDMKYEERDKMWTIHVT 
QHYLI IK* S ID\ YAPTWMNLENIMLSED 
SQTQKTTHCIISFI 


6409 


20310 


A 


6463 


395 


67 


flrfwcthhwss/dilypvcsllslnrs 
hs fprvhksplyhsyalcphslaptye * 
ahkmf/rlvnflflvetgfrhaaqaspe 
lpsasnppfsasqsagttdashraqtk 


6410 


20311 


A 


6464 


485 


220 


fffetwshsvtqarvqwcnlgslXkplp 

P\GSSDPPTSAS *VAGIAGMSHAPTHE I 
PS IQARQQHLQGEKKI ELLSLPTSASQV 
LRLliA 


6411 


20312 


A 


6465 




371 


IS ippkkylrinlttyvqglyeqnyktd 

KEHQGRSS KWRDGPCS R I GKLN I RMS VL 
SNLI YS FNVILVIVPASYFVS IT / RLI L 
KFT*KCSRPRIVPLFHLIjLPLLVPFVCS 
FILCRPGASLVK 


6412 


20313 


A 


6466 


489 


175 


YKIIE1SRCIK/GLL*LWRTTHLFSHPH 
WRRGMAYEKWADFRLGTVAHAFNPSTLG 
G*GR/WNQGFKTSLANWKPCLY 


6413 


20314 


A 


6467 


2 


367 


FGTETLGQAGIKGDPGEKSHWGEGLHQL 
REALKILAERVLILETMIGLYGS * LTEL 

paaagsagapgqtghppgshhlaXepel 
gsgagpagtgtpsllrgkhgghatnyri 
vaprsrderg 


6414 


20315 


A 


6468 


472 


260 


TTLGGAGCfSGYEHIQPHVAWYLFI CKDR 
/SLILLPRLIIJ!rS*L*AILLSRPPKVLV 
LQT 


6415 


20316 




6469 


88 


433 


RGVLLHMiSCLPLCKTCFCSSFTFHHH* 
TSFSFSFFFFLLGNKVSFYCQAEGQGPN 
LG*LKPLPPGLKQFSCFT/LPEKWE*QT 
EPPGPVKRHFQLSYFPLTRG 


6416 


20317 




6470 


34 


106 


EFAIPFLTIYPIEIiNTHVHTKICI*MST 
AAI.STR*KSPTCPSM\INKMWYIHFMEY 
YSAISKGRWHaiSrVHQWIHKM*YIHTVEY 
YSAIKSNEALTYATTW/MiniiENrMLSER 
SQS*KICI 


6417 


20318 


A 


6471 


390 


92 


DFLIDAINHNLKVFFYS*FISMC/HPVN 
*SLSFSNIQKSLIKTMINV3LSFKY*YL 
IYFS*KKTF*N*RQGLTMLPRLLSNSWA 
QAILLPWPPKVIiELQV 


6418 


20319 


A 


6472 


414 


167 


HIHIKTCR.*IirSAA*FIIAKK*KPKCPT' 

ADEWINKMWyVHTIEYYSAIKK\NKLLI 

PWMNLENIMLSTGNQTQKATYCMIPFI 


6419 


20320 


A 


6473 


3 


244 


TTl LLGTTYDALFPELYTTYFVTKTLLL 
TSLFL* \IRTAYPRLRYDQLIHLL*KNF 
LPLTLALLI *HVSI PITISSI PPQT 


6420 


20321 


A 


6474 


12 


224 


lASdRPFFdYDVW/VKRLCRHSIVIINT 
* KKKKKKKKKKKKKKKKKKKKKKKKKGR 



941 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence, 
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09/515,1 
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nucleotide 
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correspond 
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residue of 
peptide 


Prcdict- 

otide 

din^g to 
last amino 

residue of 

sequence 


Amino acid sequcnce (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGALKKKPGGEPLFPH 


6421 


20322 




6475 


417 


135 


FQSFTENKMLSSSITCSSEIFGERKSQL 
M*RPSLL/SY*EKLPLLaSTTVISQRPS 
AL*VQPL/PTAKWL*LTAGSKGHYYFLN 
SKVFSN 


6422 


20323 


A 


6476 


1 


131 


RTRPPTRTRTRGRWLTPVIPALREA/EE 
AGG*LEPKSSRPAWAI 


6423 


20324 


A 


6477 


830 


513 


WNSVKESLPADTWFLPFLGPLIDIFLLL 
I FALCLVNLLVKFVSSRLQ * FHVKMMLA 
QGFQPIPSSDPEDKDILPLSLLEQVS\G 
ILPLQC*AGSMPITSAGSSYRR 


6424 


20325 


A 


647S 


399 


196 


SKQSSHLSLLSR*KYRCAPPWWAFF/SF 
FLINIFCRDE/SLTMLPRLLLNS*PQVI 
LLPWPPKVFGLQA 


6425 


20326 


A 


6479 


1 


411 


ntg impantvs i lllidqevlvtvktcy 
lrntfcgfsdgsghsvVqtfwigstild 
aiksl/r/dsredgkistligirkk*ip 
ilmedlvgfktsqe EGTVDGUJLA* ELQ 
VEGPRDETELI.* PQDQTCMDE KFLLMDE 


6426 


20327 


A 


6480 


381 


162 


AGFTHHNFFFI,F*DMVT,r,CHPGYSAWQ 
SWLTVASNSW \ VK* S SHLS FPKH*DYLQ 
AVHFRMASNSPGLYLLYQ 


6427 


2032S 


A 


6481 


379 


85 


S ILPFLFHP INFFFFFF*AW I F I IPPPH 
\RAFKVVRWGFFLADSVWFIKRLPQYY 














FFFFFFFFFFFFFFFFFFFFFFFFLSVC 

GG 


6428 


20329 


A 


54S2 


355 


117 


PKVKIFPPPPLPRKWDPRGPP*SPGTNF 
SIFFLVEKDFPHVAQAGLKLLDSSSTPS 
VAS \QSV* ITGMSPHAWLQVLLSL 


6429 


20330 


A 


6483 


402 


97 


TAFHLLDIYHKS*KIMFTNQPLYKCPLF 
I IAKN*K*LKCPRHADKQIVA/WCIHTI 
EYYSAHKRNVLLKYTRLWMNLKSIMLCK 
RSQSQDVTYYVIPFICHS 


6430 


20331 


A 


6484 


714 


365 


NGFiVLVAQAGV/QWCDLGSLQRLPPrFK 
QVSCVTLLNS*DYRSMPPHPANFCIFSR 
DGV*SYWSEWSLTPDLGLPKCCDYMREP 
PCPANAKSFTVEETHVTYQAYETMLLSD 
HDESG 


6431 


20332 


A 


6485 


408 


146 


LKTETQTETCPPMF IMIoFTLV/ RRWMQ 
PQCLSTDEWISQI*YICIMNINSDIKRN 
EALKHAVICMNLKNMVPFISPTEKGKKK 

EKKK 


6432 


20333 


A 


6486 


243 


11 


GMSQYQEYSKSNFFLSHRVLLCHPGWSA 
VAH/SRLTEALNRFK*SSCLGLPKLWDY 
RRVPLLLAQNATR*irSL*SCPA 


6433 


20334 


A 


6487 


403 


161 


DMVSPRSTPVWSQNS \StK*SAYLSLPRC 
WDYRRE PS CQASKNTS 1 GRGNRKNQWNT 
CMYLLFI LI PLKKYMCHGRVMEYFKT 


6434 


20335 


A 


6488 


292 


24 


DWRGRMAIPLKKGSKC*QGCRKTGLLRY 
*W/'MECEVAQPLQEMV*WFLQRLWIKLS 
YDPDF/PLLGIYPKE*KAGT*TDICMPM 
LVATLFSG 


6435 


20336 


A 


6489 


3 


203 


SSWDCRCIPPCLANFLICCRDR/SLTML 
LRLVLNS *TQAVLLPWPPKGMPGLLFVF 
YGGIVLSYVNIS 


6436 


20337 


A 


6490 


413 


30 


RGP/YQPGFLGFSPPNGLGFG*KVFLPL 
PPPPRKGPGKKGSPEARGFPGQAPPKKE 
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peptide 
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Predict- 
ed end 
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ding'to"" 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=C)'steine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NQRG(3GFWSRGRLAKKKGFWFG*GPAKG 
KKFFFFLGAGWGLKVLIPIKKKKGFPP 
PPLFFFFFFLQTIHPIRG 


6437 


2033 S 


A 


6491 


101 


335 


SSSFLGFPKYWDYRREPL/WPS/LWQFF 
RLNIHLLYNPSIPILAVYPREMKTCDLI 
KSCMRMFTVAVF\ * ITQTWKQPWVF 


6438 


20339 


A 


6492 


532 


127 


STVFPJUIQLPCLHEPGRQSSP\PVPTSI 
LYT\PLPFPSSRYQQNGPCLSSPKLSPS 
PELGTQSFPVHKSFSPCLPSPKPLTVPP 
ALPSTGPSQSLAQKHSLASLLHTHSHTP 
PAPSCNPESWGLDATAAISTCFYG 


6439 


20340 


A 


6493 


399 


-j 


LCPAPRGGVYRGR/ESSLSWDGLHSVQA 
SWPLCLPTPASAMVDAPPPA\SLPSRSS 
ILDCCISSEQGSVGMGPAEPGAG*NLLV 
CCLLRPLEKCSIWAGVSRFSWYSLSQLP 
LARKGKS PNPLSFPKEKMPRPVS 


6440 


20341 


A 


6494 


319 


75 


KKIWGPLKKKFGLQTRRPKKWFFKRPPP 
FFFFFFFFFFF/HVAIFLLFCRDRG\T,,A 
MLPRLILNSWP*RILQFWLPKVPGLQA 


6441 


20342 


A 


6495 


333 


190 


YSFRDGVLLCHPGWSAAMPS 


6442 


20343 


A 


6496 


1 


299 


LPRLEYSGTITASNSWAPAILPPQLIKy 
YWFFCCC*GRVLTVLPRLVIiNSRPQIVL 
PPPPPKSAE ITGVMHHVWPKKI I SNEKT 
THCSCSALLAKQQISS 


6443 


20344 


A 


6497 


51 


266 


PVNVILPSSWNIiRHSPPCPAIlFFVLFCL 
LFFGKN/RGLAMLLRLVSNSWP*VILPP 
QPPE*LGLQVQATMLSS 


6444 


20345 


A 


6498 


366 


78 


LLAHCHSSNNKSLLILDNFNIYHYGPSN 
TVTS*F\*MSSIJyiILLSSP/C*GTSWSY 
LFYALKKrFCKD/RGLAVLPSLV*NSWP 
QWLLSWPPKELGLQV 


6445 


20346 


A 


6499 


37 


396 


FPRPTRPKSSSPAKATQQNTISFKKKKK 
SKQKKGNTVIIIRVKNQRKRKQKN*KLK 
Q*N*EQ*KINKISKPLAKLI/KKKRVKT 
*IANIRNERSSITT*STDIKRLIWKYY* 

QL 


6446 


20347 


A 


6500 


706 


394 


VPLLLPRLAVQWHNLGSiePLPPGPKRF 
SCLSLLSS*DHRHTPPCPA/NSFVFLVE 
TRLHHAGQVCLKLLTSG/DTPSSASQSA 
GIKGMSHRTQPIEILFYSVFF 


6447 


20348 


A 


6501 


403 


216 


ILNVr.SFK/KMLY*HYTTVLRTVRPLSP 

R*ATVTCTYTSGWPEAAEDPQKMTFPHC 
DLFLPHPN 


6448 


20349 


A 


6502 


377 


116 


LLPY*EN^FLMYLPPICCLPFDFGNSGF 
CHQKKWFFFFFKTKSLSVA\RLE CGGAI 
PAHCKLCLLGSSDSMKNFLNEAEFPNLA 
FN6L 


6449 


20350 




6503 


3 


401 


CVQHLEASWSSRWAWAWRAPHSEQPAGP 
AAPGNE6LSIRASGCGGCTGYPSSASPP 
ALCSIS\PRP*LPSRRAGLATCSPPCLS 
LPPPPWAPVRPEPPRRVPPPAPWRPVPS 
TTQGLRSASARHRTGRQLHLQ 


6450 


20351 




6504 


406 


55 


WPKGFTPPRFPRKEGSP \PPFPKVFILE 
TRQPGPQTKPPFFPPIFKLTRPGGRPP* 
FPPPQRVEAGKPP*PQSFRVPGAIFPPP 
PPPLKGHPPRPGFQKKKKKERKKRPTRK 
RGGT 
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M 
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od 
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beginning 
nucleotide 
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ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6451 


20352 


A 


6505 


3 


281 


FFFCDRVLLCHPGWSAMVRWSRLTATS 
/ SFLGSSDSHACLS SSWDYRHLPPLLAN 
FCIFLKRQGFTMLPRPVSNP*AQAMRLP 
QPPKVLGLQA 


6452 


20353 




6506 




247 


LGSLQPPPPG\SRHSPASASQGAGIADM 
SHRraQSIVFLRTNWKPY*IENiyiMITIS 
CTTAYKYEILRHSFWKICARSLYEKL 


6453 


20354 


A 


6507 


34 


47 


FDPNLTPRHIRI ILWAVNKERMLTASRK 
KKQII*RSSNLSGDRLLNEEASWEP*CK 
KSMQETIQATEGWNDI FKVLKEKKNL\H 
SKI LCPAKLS FKYEGEVKAFPHKPKWRE 
FTVTKP I LQKC *RENQRTFDLTQI 


6454 


20355 


A 


6508 


403 


106 


HYVPPPSPPS/YFFFSPPGKRGFFPNPF 
ILVSPGFFPSPLF*KPPPEFFFFAPLKK 
IFFSPPRELKFFFFKGPPPFFFFFF 


6455 


20356 


A 


6509 


389 


230 


LSFFLFFRDKVSLCNPGWSAV/VTFELK 
RSSCPGLPK/SQSHCTKPQPGF*KYIYF 
SFFIK\KR*SLTTLPRLVSSSWDQVIHS 
ALWHPICVLGL*AVAHAYNPSTLGCQSAE 
WIT 


6456 


20357 


A 


6510 


101 


419 


ISFFFFFFFFLKKFPFFPPGWRGGGEF* 
LPQTPPFWVKKIFFPPPPKVGGFKNSPP 
PPFFFFFFKEKGFSPVCPGFFKTNPKK/ 
SPPPPPPRGVGLGGWPPPPGWDP 


6457 


20358 




6511 


41 


376 


EWWCETRGGAGQGRAHGAaGGATGRVLA 
IQARICRRPKKEKHPKKVSCSVAQAGVQW 
RDLVLLQPLPPGLRVSLCHPS/WECGGV 
VSAH* NFCLPV/IKDGVLPCCAGWS *TP 

GL 


6458 


20359 




6512 


2 


332 


WEKVKIATLTCFWK* LI /SGSSRTLMGD 
FERFKTSVEKVIAD WE lARE LE r,E VE P 
ENVTELLQSHYKTPGTIiaFSRLSPDVSL 
YSQLPETRSHEWQAASKIQPRKASTSQ 


6459 


20360 


A 


6513 


3 


294 


FFFSEMESLSVTQVGVWWHNIiGSLQPPP 
PEFKQFSOiSLPSSWDYHHAPPHLA/NC 
FCI FNRDGVLPC*LGWVSNSHLGFTGMS 
HCTQPLLMiSFmG 


6460 


20361 


A 


6514 


1 


526 


FRLGRKPR* GGVM* PWSRGKPGSVGAE 
AG/RS *SAPRRLLHHPAAGLATGLSaSG 
RRSARWKMEHASGLSPGGGLGATSRQMS 
PGTQLANPPDHGDKDCLGRISPGSGKQI 
QAAGQLPGPPTSLAPAQGRLRSLTPWGL 
QTPEHSEPEGIGHLQAATEAVLPHSTQN 
LITKRNLM 


6461 


20362 


A 


6515 


747 


451 


QAGVQW*DLGSLQPPPPGFK*FSCLSLP 
SGWDYRYRPPNPANF\*FLVEAGFHHFG 
QAGFELLTSGGPPASASQSAGITGMSHH 
TPLGVI-IALVGENIQ 


6462 


20363 


A 


6516 


444 


3 


PGGGGPPFPPPWGGKPGKIS * / PPGGGG 
SNKPKLPPCLPPGGPK/GNPPQKKKKKK 
GFGPGWAHACNP 1 1 PALW EAEAARGLP 
PDAGRYGPGGRREAVRREAGPQECQDPE 
GQV*GDPRGCGAflHAPARGHPRKA*RGA 
EGSSPGLPTK 


6463 


20364 


A 


6517 


257 


575 


as svtwagvrwchlgslqllplg fk * fs 
clsfpsgwdy/ghappgpakfciysgdg 
vslc*pgwsqtphlr 


6464 


20365 


A 


6518 


337 


389 


SQHFV* ISCLHDMPALASQSAGITGVSL 
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M 
eth 
od 
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26 


Predicted 
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nucleotide 
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correspond 

ing to first 

residue of 


Predict- 
ed end 

otide 

location 

corrcspon 

last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, F=Plienylalanlne, 
G=Glyeine, H-Histidine, I=Isoleuclne, 
K=Lysine, L=Leucine, M=IVIeth!onine, 
N=Asparagine, P=Prol!ne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonlne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, '^Stop codon, /=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














RARLIFIERNMDSCWPQAGLE/PPELK 
RSSHCGLPKPWDYRHEPQAQPVCRFIRA 
TKCAYVSTQALS PAPVQPGMTPPAKRGL 
PCQPPGSPCPSLCHHQLAGTPPyE 


6465 


20366 


A 


6519 


61 


323 


FMWDRVARKGS I EGASLELP LGGSVGGH 
GS C I SM WGALGHS S \ THV* HTHTHTHT 
HTHTQTNTHTPS SDSKKS S I LDKEEAEG 
CMTP 


6466 


20367 


A 


6520 


399 


101 


KIQIKATMKY/HNIPIRMGFFFKLSSQG 
/C**RCRK/IYTLKHCQ*KCETVWPLWK 
SVW/RVFLKLNIHPLHSPEI PLLGTYPR 
/DMKT*TCM*MFTATLFITTYS 


6467 


20368 




6521 




276 


RTRGFEGLRFFLGFFGTVLLCCPGWSAV 
E\QSWLIAASASCLW\VK*SFCLSLLRG 
WDYRCIPPCLANF* *R/CGSWAQVILPP 
QSPKALGLQT 


6468 


20369 


A 


6522 


2S0 


277 


GDDPSCSMGGGHCIjASRSE*GSVGVGPS 
KPGAG \ YNLLVCYLLRPLEKRS IRVGVS 
QFSRCHQSWLPFDRKGNSPTPCTYWVR 


6469 


20370 


A 


6523 


2 


327 


SLFSS*D/YRHI PPHPGNFR I FCR / SAL 
TMLFRLVSYSWPQVILLPQPPSVGTTST 
SHCPQPCIMFLKADLNPVRCQFRCALAP 
EPYIQDPFLRDDVAFPCLGASAPMHLA 


6470 


20371 


A 


6524 


298 


414 


PGPQSETLTKKKKGGGRLKGSNFTPAGM 
EGNIPF/CFFLSKSHSVAQ2\RVQW/P*W 
LTAGSLQAPPPRFKLPSSLL*VAIYASP 
TLLSS * DYRCTP PRLftNFC I FPRDGVS P 
CRS CKAAVFNLLDTRDWFHGRQSFHRLV 
AGDGFRM 


6471 


20372 


A 


6525 


376 


180 


KCKKYPGGVAHACS PS *DYRHVPP /RPG 
YFLHF*QKHFGQAGLDLPTSGDPPASAS 
QSAGITGVSHRTQPQLNYF 


6472 


20373 


A 


6526 


415 


86 


LYLPYDLA.I PFLGMYPRGKKKCAHTKIC 
TRIFTVVLFIIARK*KRSKNPSTREW/I 
KMWNIYTVKSHSVIKRKELLIHASMILV 
H*/WMIFENI ILSDRSQMQKITYCMVPF 

T 


6473 


20374 


A 


6527 


391 


166 


FFFFFRDWFSLSHPGWNflE\S *LTVISN 
SWSQARLIFKFLKLRQGUTLLTRLVSNS 
RLQGILSSWPAKVIiGLQA 


6474 


20375 


A 


6528 


544 


176 


TMRYLLTPVGMAI I KKaK\NT* C* *GGR 
E/IGTL/LQCR*ECKLVLPLWRTVWRFL 
KSOAIELPYapPFLFSLEYGLFSLGSTN 
CAQLRSVYRKAVQNHQGVSGECAGFQDS 
SHSAPVAIPTARQ 


6475 


20376 


A 


6529 


392 


139 


TYVKMHCK/WLGMVAHAWNPSTLGGRGG 
QIT*GQASTKLSHSWSLSRESRFWIjBCY 
SRSCSFCSCRLPADCTSLTDSKQ 


6476 


20377 


A 


6530 


362 


87 


NGPPFFFFFGEIDKLILILLWKCK/RPK 
lAKTVFKKKNKVGGLI *HYFKAYYKGTV 

IKLVQFWHKVRKTDQ*SRRESPETELHI 
YVLVNARDSA " 


6477 


20378 


A 


6531 


1 


346 


apthprlkktls klkttnlqlpyd pafs 
flsvypremkt*r\fakicv*mfmaslf 
mivttq/nvprcplmgewlnkwr\ * F I t 

*YHTGMKNT\VMHKTTWMNSQRIMLNEK 
SQSLTY 


6478 


20379 


A 


6532 


3 


294 


FPFFRDftVFLtepeWSAW*S*LTAM,S 
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nucleotide 
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sequence 


Predict- 

location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














PWLjKRHSHLSLPGS *GHRMH/PPHLGNF 
FV\FCLFL*RQCLAMLPEQVLRSWPQVI 
FLPWPPIG^VLGLQV 


6479 


20380 


A 


6533 


2 


280 


PRYLWNPTPTLGSCVWDK*LSLWASV 
LYIYIYTHTHTDIYRKIYAF/lFVCVyi 
YIFI*KHIYYKEQAHAIMDAEKFQDLPS 
ASWRPGKLVE 


6480 


20381 


A 


6534 


96 


183 


DQUSVPHRKLKEYDTYTFTVALLABHOQ 
SC/LSLGSS*D*RYTPPLQTN*K*FFL* 
ISLPMLDWLVSNSWPQAILLPLPPKSSG 

LEA 


6481 


20382 


A 


6535 


2 


241 


GFKRIDSNFESSVGKVPTN/TIFCERKS 
LSMHQI /SLLSYFKKLPQLPQCSAATTL 
\IS*QPYTLKQDPPPTRLQPAEDSDGY 


6482 


20383 


A 


6536 


485 


97 


PTTRPILLLGFYTK*LKS\NKSDSCPPM 
FIATLF/ILKCPSTDKWLKKMWYKHTTD 
YYSALKKGNHA/YM*QQMNLEGIMLSEV 
SQSHEGQLLHSTYVRYLK 


6483 


20384 


A 


6537 


411 


57 


LFPSHIKFVLSWQSCFAKTSGPQTPSFF 
FF*LFFF/RDRVSLaHLSWSAV/VSWLT 
VASTSQAQACLST.PSSMDYRCPPPFPAN 
FSRRGFTILPRLVSNSRAQEICPPWPSK 

VLGLQRRGI 


6484 


20385 


A 


6538 


438 


201 


IYPKEMKT*VHTKTCI*MLIATLFTLAR 
KWKQTKCSSTDEWINKMWHIQ\TMEYYL 
AI KRNEAQLEHGE I IMLSERGQSQ 


6485 


20386 


A 


6539 


466 


131 


S F P KKGGYRAP P PP PP PFFFB'F f FFVEA 
VLSLCCPGGSQTLGLKRSaHI,VI.PKC*D 
HRHKP P CLASN/ S FFNRHRMMTNNCLMC 
IKSWCLCIINPFYHLDSSLLWMLSLIYR 


6486 


20387 


A 


6540 


419 


50 


PLPFFKPGPRIF/CLGRPKKKI*VPT/L 
RE IKLFPLKGPPLFFFIiNKGPLCFPGRK 
GRGVLFFLATPKSWGQKIFPPSKEGASF 
PFFFPFFSW*RWDLTMLPRLVSNSWPHV 
ILPSQPPRVLGLQV 


6487 


20388 


A 


6541 




341 


S 1 1 KTAREKCLVTY *GTS IRLTGDF* KY 
I /SAKILQWRQ*NDIFKVIK/EKNCQP 
RIQYPAKLSFINERKIKSFPNKQKLREF 
ITARPDLQEMIKGILYLETKE* YLPS *K 
HMKV 














NCKQPTCPSTWEW/ rKL*YICITEYYSA 
IKRNKLQLHTTT 


6489 


20390 


A 


6543 


32 


397 


AMIVPLQASLaSTRSRKTLSLKKRKKKEG 
EKETLPGGST*KEKVRKKQNin:,LV*KTH 
IKPG\AQAHPGNPNIWGGGGRQITRSGN 
KNHPG*HGKTPFLQKKKKRGiaLLKGTNL 
TARGRQSKTF 


6490 


20391 


A 


6544 


402 


0 


LSWTQ* CKPIAPATGEGGTGGSLEPRES 
RLWCAMIAPVHA./NCTPAWVT 


649J 






6545 




365 


RKVTSLPSTSRCLSILPHFSDGPRPVKP 
LQ**LRCSICSASLPTHLHTAKAAGLLP 
\LASSTAVNQN*SRELGaCVKMEHSGIK 
YGLF* HD/RGLTMLPRLVliNSWPQAILP 
GQPPKLLGLQA 


6492 


20393 


A 


6546 


1 


331 


ISTFKSYLRNIFSKAiaaVDSNSTEGSG 
QG*LKTFWKGYTIIDAIKNiyDLWEEGK 
TSTLPGVWK\LDDFEG\FQTSVEEVT2iH 
WQIAKELELEMDPEDVTKWLQSHDTKT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 

location 
correspon 
ding to 

.residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=GIycine, H=Histidine, I=I.soleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6493 


20394 


A 


6547 


"1 


417 


TRPKINKTKNLLFERIWKIKKP*VKLTK 
NKNKNKTPHITKIRNESEAFTIEKLKSI 
LRE\YMNNCVYNLLYANRKPRQMDKFLE 
RYKQLKLTQEETETLTD\ YI TS KEIKFV 
I*NFSTK*S*GPNGFNSEFGRVLWFTPV 


6494 


20395 


A 


6548 


410 


288 


AHFLFFC INK \ SLTMLPRL V * NMWP* VI 
FPPQLPKTLGLQV 


6495 


20396 


A 


6549 




216 


I DRGFTMLPQLVSNS L \LKRS SHLSLPK 
CWDCRCEPPCPACFHFLSECNTCIKNQL 
Br'/TLFFPNT*GLTS EG 


6496 


397 


A 


6550 


479 


146 


VPGFKQFSCLSLPNS * KVQRAPPCPANG 
FAFSVETGFPHVGQAGF*FPTSGDPPAS 
ASQSAGITGLSLHAWP\ENL*KLRLHAT 
GLRLHDHVPLKVPNYLKVCSFKFE lADV 


5497 


20398 


A 


6551 


42 


484 


SMRWNSMNVTIMPANTTSIPQPTDQEVI 
SVFWS/YYYYYRYCKATAAIDSNLSGGS 
RQSKLKTFWKGLT I I YAIKN I CDS WEEV 
KIST*TI*KKVIPTFMGDFGGVKNSIK\ 
EITADVAEIAR*L*LEVEPRVGWLTPVI 
PALRKAEVG 


6498 


C 99 


A 


6552 


304 


74 


HTGKAAKKKEC/WRGTVAHAYNPNTLGG 
*NGWIT*GQ*FETSLINMVKPRFYS CI C 
ISGYTHKVIHRII ICKGKKLE 


6499 


20400 


A 


6553 


432 


62 


FMAGNSRMQQWCTDT/VSAHCSLDLLGS 
SNPPASAS*VARTTQARHHARLMLITMN 
EKPLGARYCARIFFENKLGIYPNESVPT 
KTNAKWSKPESCFSHDPDPGHWIYLMS 
SGLIVNKIGKVK 


6500 


20401 


A 


6554 


2 


174 


ycnlclpgpsdlptsas*vagttcmchy 
tmliffciXlvetgfyhvaqaaklfiys 

I 


6501 


20402 


A 


6555 


3 


268 


DAWADNIKCR*GYGAI*TLIHCWWECK\ 
W*ATWWFLIIUIMLLPYNPVIMV\PRTC 
IQIFMSTLPINTKTWKQPRCHLVGEWIN 
CGTCR 


5502 


20403 


A 


6556 


3 


402 


DAWTDNIQCR*RYGAI*TLIHCWRQC\K 

w*atwwfliiujmllpynpiimdhk\tc 
lqifmstlfidtktwqppkchlvgew/i 
klwsmpakeyhswlartel / ivp*nt*r 

RLSCT*LWEKSQYEKATYCWPTV 


6503 


20404 


A 


6557 


396 


2 


KKQNAFQSPSFIjPRL*TPKITILISFHN 

lyiythtyiyihvciftyihthiyiff/ 
rdrfllclpgwrawqsqltaastsqaq 
gipg*fffk/sifcrdw/sitmlpelvs 
nswaqai pppwppkvlwdyecgs 


6504 


20405 


A 


6558 


393 


175 


fksqglvllprlefsgmliahcslklpg 

FSDSrjSrSA\SQ*IiGL*GCTATPGPYIjLL 
lYAGELGLNKRSISSK 


6505 


20406 


A 


6559 


401 


188 


SPPEPFILGGPRVFPPPPFLKPPPRr*F 
WGPKKKKNF / CPPPGPKNFFF* KGPPLF 
FFFFFFFFFFFFHRSS 


5506 


20407 


A 


6560 


37 


343 


YEPAWLLDIYPKELKT*AHRKTCTRIL 
IASPFKTVKMWQPPD/CP*ADE/WDK*T 
ACLQTVB * CSARKRSELSTM/ lEMWRKL 
KCVALSERGQSEQAACYVIPTL 


6507 


20408 


A 


6561 


792 


141 


LFPPPP/YLSTSPNAGA*TTSPLPPQFL 
YHSPRRQQ3GTKPPFFSKK/PEPPSP/S 
PNKFYPGPWWGVSPFKFPPPPQKS *GPK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=GIydne, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














KSPLPPEAP/PAPPLTQDCPPFVSNPGG 
KKKTPP/SAKKKKDPTPS/PSPPPRH*! 
GGPPGFSPPPLFKTPPPKLILGPPKKKK 
IFPPPGGKNFFFLKGPPPFFFFFFFFFF 
FSLLFPVGVWSP.WLRSEAADLCGECYSS 


6508 


20409 


A 


6562 




356 


FFFNFIFIYLF/NFLRWSLALSPRLNGV 
QWSDLGSLQRPRPRLMPFSCLSLPSSRD 
HRRPPPRLANFLYF* *RWGFTMIiARMVS 
IS*PQ/CDPPASASQ\STGITGICHGTR 
LIPRNPMIA 


6509 


20410 


A 


6563 


416 


84 


L3LPSSW/DYRRPPPCLANFLYF**RRG 
FTMVSIS*PHDRPTaASQSAGITGVSHH 
TQPIISFLKSB'FFTLSNRFNEVCIDSEL 
LLQCKDFYLQKFNLHIAFQEYTHCHGVH 

Y 


6510 


20411 


A 


6364 


2 


51 


FWREMQIKTVLRYFSLKLVKIKK*DST 
FFWKGC/GETGNLIHKAMW/R/SI *QYL 
KKLYVHFLFVPVIPPLEIYLKNKKILST 
K*LFAVF1VTEKC*KKKECG*T 


6511 


20412 


A-"' 


6565 


2 


511 


VEW / S CAGL / LHTAV* I PGR I I TSEKCA 
QQVDETHRN/LQRLQPALINRKGPILLH 
NNTCLYTAKPMLQKLNELGCEVLPHLSY 
SPDLSPTNYFFNHLDNVLQGKHSHTQQN 
AENAF* EFVKF* SMDFYATGINKLIS /H 
WQKCIACNGSYFD 


6512 


20413 


A 


6566 


3 


248 


GDICT\SMLVPALFTVPK/TWNQPVSIS 
K*TDKMWYIYPVEYYLAIKKTKFC/SFS 
ATWMILEDIMLSKPGTER*1LHVLYLCG 
R 


6513 


20414 


A 


6567 


3 


258 


WRDHSSLHS*PFGLKWFSHLSPLSNWDY 
RDVPACPANFEKVIj*K*GLIMLPRL\LL 
SSFPEW*ILSSRLPASSFQSVQITHMNH 
CS 


6514 


20415 


A 


6568 


392 


166 


GGGGPPRGGGLEPPGPPPFFLKNKK* /R 

PPGGPGPLFPPPGGGREEKFFYLGGKNF 

H*TRVGPPPT,SGGKKKKIRG 


6515 


20416 


A 


6569 


234 


41 1 


PGAVl\iHAYNPNTLGGQGHRIS*GQEFKT 
S/LGIIIVRPHLY 


6516 


20417 




6570 


1 


175 


FLQHV*WKFQTMSDQIIGKIDDMNTLID 
I.SKNVKDi:.\LITQrEVEELEGENKIP IL 

PG 


6517 


2041 S 


A 


6571 


392 


140 


LPSSWDYRHVPPCPANFVLLVETRFLHV 
SQAGLELPTSaSQSAGITGVSHQARP/E 
* *NSIRKGATTIQRDLGPVLQVRYLVRC 


6518 


20419 


A 


6572 


2 


228 


RTCTQMSVATLFLIAKNWKQPRCPSIGE 
WrWKL*YILTMEYyijVIKK\MDFSSHKK 
TRGNPKCILLSKESQSEKA 


6519 


20420 


A 


6573 


412 


100 


AIiHSHPLGGPGQGVPKSRI PNPPCPQGG 
TPFFPKNPKLTRPGFARLPFPPLKKLKP 
EKFPUPG/MPGVPLN* IPPPVSHPGVQP 
NFFSKKKKAHGFCLVLNPSSS 


6520 


20421 


A 


6574 


406 


257 


NKREKKTRKMLDSIVIRKRHIKSQMRCY 
SAP IQMAKV\SNRQSPKFWVRCRQTISL 
*HC**ECKIVQSMW*TLQKFLPAIVFLC 
IYHTEKLRFTQQP/CT*IFLAALLMIV/ 
RTV/KRLVCPSMGEWLSKLW*TOLSFSV 


6521 


20422 


A 


6575 


370 


68 


EQGSVGMRPSEPGPGYNLLVCR*LRPLE 
KRTIWAGVS*FSRCSLSRLPLARKGKS P 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
locution 
correspond 
ing to first 
amino acid 
residue of 

seTulnce 


Predict- 

nucle- 
otide 

correspon 
ding to 
last amino 

relfdueof 

peptide 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K-^Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=UnI<nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NP\CTS*VRFSNGLS 


6522 


20423 


A 


6576 


1 


259 


IDLIDVLRPSLTVT*AWQWCNLGSL* P 
*PPG\SGDPSASASRVAGTIGQHHHRWL 
IFVIFVEIWFRHILVYFHTAIKNYLRLG 
NL 


6523 


20424 


A 


6577 


418 


218 


KGKTRLGRWPPCNPSTLG\GQGGWIT* 
GQAFRTSLANIVFPLM 


6524 


20425 


A 


6578 


1 


401 


EKAGCSGSYL*P*HFGRPRQEDHLTLGI 
*DQPGQCSENLSLQKNF/TNISWA*WPG 
IVAHACNPSTLGSQGRRIT 


6525 


20426 


A 


6579 


3 


356 


HWDYRHEPPFPV*MHFFIWLPGC2ISLLA 
SPNLTGCFFS IS ITESLLSLFIYLETRV 
*DLQCSYFWERRG\SLCCPCWSRaPGLK 
RS CCLRLPSSWDYSRLTPPHLRNYCILG 


6526 


20427 


A 


6580 


411 


158 


IGSVAQARVQWRDLSSSDPSAN*APPGL 
KQSSC\PASLPSRWDY\NKCTPPDLM-IF 
YI FFVETGFPYVARLADCFYPPPL* KMT 
GSY*VFTMCQIPH*AFYKF* FI *FRVGE 
DKNSLPAWQHRETLSLQKKYKN 


6527 


20428 


A 


6581 


399 


218 


GGAELKVPK* GGGPGS PPLP PPLQKPRP 
KGPLWPGG*GPPRPPGEAPSPLKNKN*/ 
PLPG/AGPPPVPPNWGGGFYPRG*GFG* 
LPLGP * TLARGKQRDPVFKKKKKN*NVN 
KKra.KI.*KS*QFFFFFLKTGSRCFPRAK 
VQGPNGSYPKPQPRG 


6528 


20429 


A 


6582 


392 


214 


GSSDPPNSTSRVAGTTDEVLPCSPGWS * 
NSW/VSK*SACI.GLPKCWDYTREPQHIiA 
RSNF 


6529 


20430 


A 


6583 


390 


257 


PKRPPPPAFFC*PK\SLGMLPRMISNSW 
AC2AI I*RWPPKVLEKQA 


6530 


20431 




6584 


65 


251 


HYFNSFWFFLMGSSNHPTKNPG*FFFFL 
TFYKNK/SLTILPRLVLNSWAQGILLPW 
PPKGLGL*AQGILLPWPPKGLGL 


6531 


20432 


A 


6585 


2 


143 


RHMPPCLSSF/ SYFL*RRGFAVLPRLFS 
KSWVQVILMPWPPKVLGLQA 


6532 


20433 


A 


6586 


142 


334 


GRWLSQLLILNFTTRSVLKQMSW/MaCV 
SETWPA/STLGGRGGRIT*GLKEFKTSL 
QNMARPYLYK 


6533 


20434 


A 


6587 


485 


377 


RTRLSCRFFILSLIFRFGNFYCFTLKF/ 
H**VFLLSFVFYC*6HP/FEFIFKVFFI 
IFFHFKVFICVFKYLI.FFFILIAETFYF 
FVNIFSFKKFVSVMFVLLIEMFYYGRFT 

IPLR*F 


6534 


20435 


A 


6588 


241 


251 


NSLFTKKKKKKGGALLKEP ILHAGGGEG 
I IFFKGPQKSIPGPGFKKRGREKPGPSP 

*NWFF*KGPPFFFFFFCKE/RSFTMLPR 
PASNSWAQAICLPQPPKWGL 


6535 


20436 


A 


6589 


429 


54 


YFLFLPSDPTFFPSFSSFLPSVLSFFLP 
SI/SSFLPSLLFYILSSPPLSSPPSSLF 
LSFFLSRVSFCHPG*SAVAQSRLTATS I 
S WAQAILP /CR/NRDFTMLSRLAIiNS * P 
QVIRPPRPPKVLGLQA 


6536 


20437 


A 


6590 


411 


137 


KCIGTITTHCSLHLLGPSDPPTSASRW 
GLQAHHHI QLAC\ F*LID*LID* LKVEM 
ESRHVAQAGLELLGSSDPPTLASQNAE I 
TEMSHCIF 


6537 


20438 


A 


6591 


412 


196 


QPQPPGAKKTSGGGFPIR/WGHQPPPPG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
etii 

0(1 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

sequence 


Predict- 
ed end 
nucle- 

iocation 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














AKKYlIFFFFFFW*RQaLAMLPRWPNCQ 
T*AIILPWPPKVL*LQA 


6538 


20439 


A 


6592 


28 


400 


IMPNCFPKWLYITFFFFFF*RQSL/DSV 
AQAGVQ*HSLGSTATSAFLGFK*FSCPP 
AFLE * LDYRHVPTHPTNFCVFKKD/MGF 
TMLARLVSNS*PQ/CNPPASASQSAGIT 
GMSHRAQPHLFVLKMEKPPPNFLFILLM 
NQSIHIYLLNFYRF 


6539 


20440 


A 


6593 


414 


23 


QTHLFWFPPGFSPPPFFKNRPRIFFLGA 
LKKNFFFPPPRHKNLFFLKAPPFFFFFK 
RSFFPPGWCAIGLFSLQPPPPRFK*FFC 
PSLPKKWGYRPLPPRPGKF/CFPFFFVI 
LVENGFHHVGQDGLDLLIS 


6540 


20441 


A 


6594 


91 


274 


1 1 iHWS * PLGNYHL/ WLGWAHARDLSTL 
GGRGGRIAGGQDFKTSLGSKQGSISARK 
KEKKKK 


6541 


20442 


A 


6595 


391 


231 


HLKRCSTSSVIREVQIKTTTTCHFSE*K 
R/SEINMCW*ECGQTELSALLIRC*\WD 
SEMVQPLWERKEYLKQISTESSWAPAIP 
LLD I » P * ERCASLSLRKGWGYRPEPPEL 
ASFLLGSYI 


6542 


20443 


A 


6596 


463 


268 


ILLCHLGCSAVAQD\LTLCSLKRSSHLT 
SPSSWDSRRGPSHLTKFF/C**RGLNTL 
PSLV*NFWAQAL 


6543 


20444 


A 


6597 


3 


229 


CHGSPQPCPPGLK*SSHLNLPTPK*L\D 
YRGMPPRSANFLFL*RWGRCVAQAGLEL 
PDSSSLPASASCPRQPVLYF 


6544 


20445 


A 


6598 


122 


418 


I IVNELCHIMCFFFFFFFKQKLLGTQAG 
GQGGKNWG*LKPKPPRLKGSPWIjTLPGG 
WNYGGFPP/HPS*FFFFFLGKKGPYFCP 
PAGGKFFNLKGPPFFA 


6545 


20446 


A 


6599 


1238 


930 


F3TDSCSFA*AGVQWRNLSSLQTPPP\G 
SNNSCASAS/RIAGITGTHHHARVIFAL 
LVETGFHHIGQGWSRTPGPQAIRPPQPP 
KVLGITGVTSRAEAISKHFV 


6546 


20447 


A 


6600 


398 


78 


PSSPFYLFIFRQSFPFFARVEEKGGHPG 
FPNPPPSGLKKFFYAPAPPGSKAPGQGP 
PA/RFYLFIFRQSFPF/CCPGGGKRGPS 
GFPQPPPFGVKKlFLCPSPPRK*GPRaG 
PPRPRKIFFFFFSSLFFFFFVRWGFTiWL 
PRLVSNSGAQAICPPQPPKVLGLQV 


6547 


20448 


A 


6601 


3 


473 


FPKGKKKGPPPPAGG/QLKNPSKKPNKK 
PGAPPRGPRGPKKGGGGLGSPPQSPQGG 

ekwtlkggpnlppprgglggarpppkig 
ggplfpkgggpIkggtpppplfffflrrs 
palsprlecsgai *aqtss/spparvgc 
psppaptrptgltvspgsn 


6548 


20449 


A 


6602 


398 


156 


ngrkhtdvkpeelpswvptrdltptgta 
gaf/q/rgyygyymkyvtvkkgsiagvs 
tvlaa*vlfsyclsyreikheqllrah 


6549 


20450 


A 


6603 


380 


77 


pfpkksppl*jsfppff*kkkvpppppgk 
lplsfiiffffff/rdrvllcrpgwsamv 
hghglspvnfcistrnral/smlprpis 
ntwtqairps*ppkvlglqm 


6550 


20451 


A 


6604 


352 


204 


KRGPKI*NFFFG*RQSIjHMLPRWPNSW 
AQMILPPWPP\KVQGLQA 


6551 


20452 


A 


6605 


631 


357 


FFF*DRVLLYHPGWSAAaE\SIjLTTTSA 
SWV\KQFSCLSLPSS *DyRCAPPCPASF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/S15,l 

Z6 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 


Predict- 

location 
correspon 

last^amino 

residue of 
peptide 


Amino acid sequence (A=Ala nine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, lVI=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide Insertion 














CrFSRDRGFTMLARMVSIsrS*PQVIRPPG 
PPKVLGLQA 


6552 


20453 


A 


6606 


3 


397 


VLHKLTQRCSAVPLRAPTAFLEIGTQTP 
KRVWKCR*P/RETQNNLLDKEQDFKTYQ 
KATVTKAVWCQHEDTIEQWDGIESTEKK 
FHTQRAGGRDHSHSDSVRKKRKESLFFF 
LKRGLILFPNLEGTRGGISTP 


6553 


20454 


A 


6607 


323 


35 


YLKTKIGKPGGLAALQFLGFRPSKWTQL 
GMRFMCTHTHTHTHTHVHPHTSQYLYPH 
RSISFCSYLDSI/SP*VLTESSNSNPTP 
LKCC*HFPSPCL 


6554 


20455 


A 


660S 


439 


252 


LVPPLFFPRSFLLFSPPKKRGaPKPV*F 
GPPRAFYPPPF/V*RGPPPFFFGAPKKK 
IFFSPPPG*KCFFF*SPPPG 


6555 


20456 


A 


6609 


3243 


2602 


fffgrdgvllccpgwsqtprlkqsssls 
l\pkc*ddwh/ratssglqvflsiccsi 
isiffmpsicdkstkqfsn/pqrayilv 
g*r*viylgwlaflfvflrqhl/ntva 
qagvqwcnlsll* ppparfkrfsclilp 
rswdhrrlppepanffvflvetgf\til 
arlvsns * prdppasaslsagimgvnhr 
twpilvflhpsvffkvlqlfyl 


6356 


20457 


A 


6610 


81 


395 


TWAALKLYSTCIFSSSETEPDVSQSQIT 
DTTACIMEPGLFW/YRSIIWTL*FFHFF 
WRDGGLTLLTRLVWNSWAQVISLPGPPK 
VLGLQAS ATAPNS FILIREGIS 


6557 


20458 


A 


6611 


1 


126 


GRLRQENHLNP6GRGCSEPRLHHCSPAW 
AIERDSVSKK/NKNGLKRCFHLRKiCTLK 
GEKRIW*PGGRGCSEPRLHHCSPAWAI 
ERDSVSKKKKMD 


6558 


20459 


A 


6612 


389 


67 


YLFMYLF*/DRVLLCHPGMSAVAQS*LT 
AASNSWPQVILLSNRFKSWTVLLLSSKI 
PWQGSTPCPRCYSWHPKCLQLQGLLGL 
DCSSPKYPRGPSLAAFRSLPKWHV 


6559 


20460 


A 


6613 




384 


rcfkvkdlkmrpfwniqvdpksndkcpc 
krlkrrghaslppwekrrrpgkdggrgw 
nsavtsqgt/veatrswksqgsvlp*sl 

*RERSPV\ELaHF*QRIiflMLPRLVLNSW 
PQDISPPWPPKALGLQEP 


6560 


20461 


A 


6614 


385 


174 


CLRLSALKKPKU!ISWY*K*LyiMEY*SA 
LKT\NKGILSFAITWMNIJ)DILLNEISQ 
TQKDKYHMNSLICEI 


6561 


20462 


A 


6615 


1 


370 


MNPRAHLSFIiIiRRKERRILWLKALGHIM 
/FTV*GLTGIALflNSSLYIVLHDTYYGV 
SHFHYVLSIGAGFA1IGGFIH*FPLFSG 
YTKKKKKNIRGGGTKNFLLCPALVQREK 
VNPFFALTNLWG 


6562 


20463 


A 


6616 


3 


342 


YRHEPPHSAIFAQLyKL*K*LIYALKTG 
KVYVYKLYLNKAIKSQ\SYNRASLSLT* 
* PRVKRKRS * PGTGAHTCWPSTLGSQDG 

LIA*VQEFETNLGNIAKPCLHQKKKKGA 
PV 


6563 


20464 




6617 


1179 


915 


FFFETESCLVTQAGVQWRDLGSLQSPPS 
G\SSDFQLI FVLLVHGQAGLELLIS * S F 
AliASQTARI TGVSHHAQLVWSDFI KS IN 
LFCCL 


6564 


20465 


A 


6618 


3 


390 


HASVKMELPYDAAIPLPSIYPKELKARF 
*DLCIPM/FITALFTIAKRWRQLNCPSV 



951 



w o 01/64835 



PCT/llSOl/04927 



SJEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 

amino acid 
residue of 
peptide 
sequence 


Predict- 
otide 

dingto*"*" 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=l]nknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VIRSWGREGVKESCLMGMEFHSAR*KSS 
GDTFHNTENCY*TVHLKIVSGKF**L*C 
VLHYVGFFYHNKNI KELS 


6565 


20466 


A 


6619 


392 


49 


QFLFERLFSSYSGYVSGVKMLGSTEVQL 
FPRKCEMKSCSVMPRLECSGGISAHCNL 
CLPGSRDCPASAS*VAGLIFY/IFLVEM 
GFHHDG/RELVLNS*PQVIRPHQPPKVL 
VLHV 


6566 . 


20467 


A 


6620 


193 


282 


TPGLK*SSCLSLQNSWDCIWDYRRLPLY 
PVNCFCC-'FVCFVC/IFVEPGSCIAQYC 
LELLASSNSPMSASQSTGIIGMSHGACL 
ILT 


6567 


20468 


A- 


6621 


383 


8 


LNDTVSTFFF*DRISL\,CPSWSAVARSQ 
LKARSTSWVQAILPPQPKTNLLGSSDSS 
TSA*DQS PGFK* S F CFRLPGSWDYGCVP 
PFSANF* I FCRDKLSL /WAQAI LSPWP P 
KVLGLQALWEAEEGG 


6568 


20469 


A 


6622 


395 


42 


HVSSTSFQGDELHLRIVALPI,P\PLHNI, 
PLPLSPRSL*C/HSRGWTVLPPSRLTAT 
SLPD/ SSCLSLLSAWDCRRAPPI-ILSGFC 
WSAVA*SW\SLQPPPPSRLPWPPKVRHL 
QPLPSRHPV 


6569 


20470 


A 


6623 


3 


327 


PSPPSRWNPKDVPPNQENLGIFFFFL*T 
RDIiATLPRLVPNS*TQVl/WPPKVLGLQ 
A 


6570 


20471 


A 


6624 


395 


47 


KNRALFFFPPPRKRGFFPKPLIMGTPRF 
FPPPPF*NPPPYFFPGAPKKKFFFS/SP 
PGGKIGSFKRPPPFFFFFKQNFFPVPPG 
GGPRGDFGSLQHPPPRVKGFSCLAPIIF 
FSSF 


6571 


20472 


A 


6625 


100 


267 


IiSLSKKANISKFWLGT\VAHAV\DPSTL 
GA*GRWIT*VQGFETSLGKIVRPCLYFC 


6572 


20473 


A 


6626 


409 


126 


PSSSFSSSSSSSSPSSSSSSP/PFFFFF 
FFFSFW*KWGVSLCGPGWSQTLGLKQSS 
CLSLPKCWDYENKPLKLAFLASLILFTF 
QEWDITD 


6573 


20474 


A 


6627 


506 


323 


RKAPGTLlPCWQKYDMIKAQLVWQFLRiC 
PNIYLPYNPAIPL*HTCPRKINTQYT/C 
FITKTCTRMFIAALF1IPQNWKQPT*PF 
GPFTirSHSCSPRDNWF 


6574 


20475 


A 


6628 


330 


117 


DVCPRKIYTLVLQQLCINRMFTAILFSl 
AL*FCKPMCTSILEWIKKL*SIRTTEYH 
LVMHENKPIiYSMPOM 


6575 


20476 


A 


6629 


396 


139 


ffsppekkgvfpsp*Hlgtpgfpppppf ■ 
kkrpp*fifgapkkkk/hispppaekff 
ffkrppplffffffffflfffffsafkl 

KV 


6576 


20477 


A 


6630 


79 


405 


VSGRMTALESYLLASSANRNKCSSNCKI 
KKMTARLaVIAHRCHPSTFGGiGGRIC* 
AHKFETSQGHIIRPCLYKKKKKGRPFKR 
TQFSPARWQGNGFFWGTLKNSRAGF 


6577 


20478 


A 


6631 


2 


417 


SFIFPCKFLNQFISFYKNVCHDFDCNCV 
ESKDQLG/WPDAVM-ITCNHNTLGSQGGW 
IA*AQELKTNLDNMAKPCLY 


6578 


20479 


A 


6632 


74 


417 


KrVFRYNGWAL*ASENRISLADLLLPSF 

hvllclpkag\pnfplpssikkffdlpd 
lirshkthisegvlfpfp/rrkeprter 
prri*tdrscwvsslslivldhaffvqs 
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nucleotide 
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correspond 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otlde 

correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PhenyIaIanine, 
G=Glycine, H=Hlstidine, I=lsoleuclne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glntamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QFYM 


6579 


20480 


A 


6633 


740 


776 


CS FFLKKQTRSRAAVQAGVQWCHYS SLQ 
PS PPDLKQPPASAS QTV/KD *GAYFWCC 
FFLFLKRWGL/NSWQAGVALLPDLPW 
RRLVIPLSLWAVEPALWAAAQL*LL/SP 
CPLASQSPRHQTGQSPHQKCFPDILGLH 
LALRSSFSPGLERPTLRLPPGLASFHT* 
ATWLSWPLPYK/PPKRVMMYLVKADRPE 
PFL* GTRGRKSVMQMTHAKTLQETLYSG 
PEAQAAASVTDEF 


6580 


20481 




6634 


2 


296 


FFFLRQSFALWQAGVQWHDLSSLQPPP 
PRFK*FSCLSLPSGWD*RHAPPRPAIFV 
FLVEMGF\TMLATMV\* IS * PGDLPTSA 
CQSAGITGLSYEARP 


6581 


20482 


A 


6635 


407 


212 


GRPR*TPPPLGKKKKGAPPPKTKKKKNF 
LGGRDSYSQKKPFFF*RNKTRPGGGGPP 
GIPPPLKG*AKKKGLPRGPKFPVT*/NK 
PPPPPPGGNNQNPFPKKKKRKSTR*I*K 
IRISRFILYFFFFFFLERGFGCSPRGGG 
GGAFLSYGKLWPPG 


6582 


20483 


A 


6636 


20 


223 


P CKKLPQPLQPSAT \TLI SQQPLTLRKD 
LLPGKRL*LAEGSDDSYQFLVAKYFKIK 
GLSFGRNKLANI 


6583 


20484 


A 


6637 


866 


647 


FPLDRVL/LCHLGWSaVMRSWLTAALTS 
GLKQSSCLSLPKLEDYRHEPQHQANNIS 
*SSWSDSNMQPRLKTTL 


6584 


20485 


A 


6638 


398 


26 


KGAPPPNIFFFFLFNIFFFFFFFFFFFF 
FFPINFLQCI.LYYRACiHENKVKVINNF/ 
PLNFCAYYHPL*NM* FSYIS FQNLYKFF 
LI /FGVFNLSGI YFNVL* lANF* *G*LI 
LPGTIYLKLCYFTTN 


6585 


20486 


A 


6639 


1 


341 


NWDYRHALPCSMIFFILVETGFCQLPRL 
VrjSrSCPQVSHHA\KFLFIFFFLFFLEEG 
GFVGPPRLEYREGSLQP*LPKLKGSTCIi 
TLPKG*N*RGEPLPPARSSFFFISKFFI 
ET 


6586 


20487 


A 


6640 


391 


118 


NflNFEKSFSMGKIL*NSLAWYKKILCl)/ 
RKNRSMQKTPLD**YYFKKLPQPL*PSA 
IiITTLIGRPALTSRQGLPPAKTL*FAEGF 
DPMILSLF 


6587 


20488 


A 


6641 


352 


285 


KKKKKGGGGGAYSGIPTF/ CFFFLHGVF 
LGPAGVQWQNQRSLQPPPPGVKLFFFLS 
LPSH*GPHHGPPRPGMFIYKNF/CFF/V 
RDGVLtHCPGWSQTPGLK 


6588 


20489 


A 


6642 


433 


64 


TGNFFLDSLFYENPGPLGGGGILKGFFS 
FFFFLETKFFFFPPAGGQWKYFSSL*PP 
PPKFKQFFFLSM/WPRPGKFFFFFFGI L 
VEAGFHRVAQftGLKLLSSGNLPTSASQS 
ARITAKSHRTVF 


6589 


20490 


A 


6643 


424 


11 


PPSIFLAPPPSRGGGPPPPPKKK/QSPP 
KKKKKGGGGGPKHPPPKKKGPPKKTTGF 
KKNPPQKKKKKNSPPKKKGGPPANK*KP 
P PRPQPFYFLTFYYKI FIYI FLLKFLLF 
YILFFFFLKSSKQGRPIjAI 


6590 


20491 


A 


6644 


411 




PPLRIWLASPPSKI F*PQPLHI / YFSPF 
PP*KFFFSLKPLIFLGGFFPKFPPPKKS 
FFPKNPQaVF*NPP*KKKIYFSSPP*NF 
APPKIFLKTPPPFFFFFFFFFVEMGSRY 
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Predict- 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutannic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=]VIethionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, Impossible 
nucleotide insertion 














lAQARTEEETRGRTRGRTRG 


6391 


20492 


A 


6645 




381 


AKLGTRGAAKQEKE I KVTR I * WKE I ELS 
QFTEYTQKNL/NVTDKLL*LVYMFSKIT 
GYKVNLKNINCIFHKHTNQQQKILKMPF 
IIA*K/TIK*L* INLTKDTQDLYAE I YQ 
TLKflEL 


6592 


20493 


A 


6646 


17 


448 


HFYAKLGTRASLS CSGLHPV*ASQPLYL 
PTQASAMADAP FPARLPPKS LIS Y CCTS 
NEQGSVGVGPAEPGAGYNLV*HLLRPLE 
KHSI*VAVSRFSHYSLSRLPLARKG\NP 
QTPCASWVRQCPTLLRLaLHGLHPLSNQ 
SQGD 


6593 


20494 


A 


6647 


462 


336 


SLRLPGSNDSPASAS*VA\GITGPCHHA 
RLVFVFLVEMEGLGS 


6594 


20495 


A 


6648 


447 


62 


KDDIWIAITHMNRWSISLAKRKM*IKIT 
MKYPyTSIKMA/IKQTNKN*KKSSSIKC 
W*ECRTMRTLIHCW**HITG*PLWKTVW 

KFI iklnlylpydpaStctqmfta/alf 

IIIKY*KQSKYPSSGYCVPA 


6595 


20496 


A 


6649 


3 


1492 


LAYAEAFLDDTNTKRYPDNKVKLLKEVF 
RQFVEACGQALAVNERLIKEDQLEYQEE 
MKANYREMAKELSEIMHEQI/CPPGGED 
ERLTEFPSHLQRHQWDSNKHWGSRDDQL 
VFGRVITSHGPCVGTCFVICKLRMLSKA 
NHWGDRAQ/ERTKGKGRERK*RTTLFtN 
R/RFYRSWKKVHIFF 


6596 


20497 


A 


6650 


495 




GAAEFPASQSAGIAGVSHHARLEEFFSL 
DSGRVMRGTEVLKYLGGYTITI * S I FFF 
t'SFF\WPGLGSLQPLLPGFK\LSLLSRW 
DYRCPAPHLISL 


6597 


20498 


A 


6651 


9 


222 


QTTRPERRNSGDPTHLSL*SSWDYGHVC 
HHAWL IF * K/CFSGGLALLPRLV*NSWA 
QAVLPPWPLKVLGLQV 


6598 


20499 


A 


6652 


241 


833 


KKTPIiAMWNPWPGPPQIPHPLTKLVQiyiA 
G/QENGCLSC3HDLES IRPAGLHNSARGE 
VLGLSSSLGKELVFLQEELDLSEIHIPE 
AQEVEMASGHFAFLHVPVPDGHAPYCKA 
SLSASSSLEPTPPEDTAISSLRPPSAPE 
MLTQHGaQEQVEDHPG/Q*QPSPHSQSR 
PSPQKDPQALVIASLRSRTERKQASHGG 
TRPG 


6599 


20500 


A 


6653 


1 


398 


FFFDIGSHSLTQaAVQ*HDHGHGSLHPR 
LPGFK*SSHLSFPES*DYSRHTPPRPSK 
F/EVETEFLHVPQAGLMRCNLLPAT/AT 
SKSARITSVSHHVQPEVSIKW*MNISNL 
IKCFLLFYKYKCI FINTKFVFTV 


6600 


20501 


A 


6654 


489 


282 


PAS/WDY/RHI«PPLIANACAFLC*RDFS 
MLPGLVSNWWARVICLPWPPRVLRIQAE 

ATTPG 


6601 


20502 


A 


6655 


953 


908 


3QSCSVS \RLECSGAISADCNLCLLGSS 
YSSTSAS *VAGTTGAEHHTQLIFVFLVE 
MGFHHVGQPGLELLT\*VIHLSRSPKVL 
GL*APSSARGQNSVSQ 


6602 


20503 


A 


6656 


412 


44 


R I S ERRKSLGAVHTKDKKYSS SRVRTSN 
PNCAHSGASSKQSFLPGKKRAPHPGIKS 
F\KDflLLWDGITGVHHHA*QIFVRSRE 
R* IRPVIPALWEAEAflGSRAQKIETILA 
NTVKPRPSLLRA 



954 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!aniDe 
C=C}'Steine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=]VIetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


6603 


20504 


A 


6657 




241 


FFIFLEAHSHSVIQVGMQWCDHSSL*TQ 
TPGLKES \ PASS FHRAGITGVSHHTYLF 
L*GH*SHSQGPHPHDLITSKRPCS 


6604 


20505 


A 


6658 


467 


42 


FFFFFFKQGFGLFPQLEGKEGIPGPCNP 
CPPGANDSPPSTPLGNGGPGGPPPNPGN 
FG I FG I * GGEGG F* TPGSGKPSGPGS PK 
\IPFCWVIRERPHGPPPRPFFFFLRQSL 
CFAQAGVQWCDLGPLQPPPPRFKQFCLR 
NS 


6605 


20506 


A 


6659 


2 


286 


KNLKINWT * WLMPWSAT/WGGSLKPGR 
LRLQ*AVIAPLHSSLGDRARPYLSVSQY 
LYFSLSLSLSLSLTHTHTHTHTHTHKIE 
LLGCVAEIIHFF 


6606 


20507 


A 


6660 


3 


249 


LSPRP*CGDTISAHCNLHLLGSSSSPAS 
VS QVAG I TGACHHAWL I F WS VEMGFHH 
AGQDCLHLL/NLVIHPPQPPKVLGLQA 


6607 


20508 


A 


6661 


3 


1507 


PEAPVPFLDSNQENDLLWEKKFPERTTV 
TELPQTSHVSFSEPDIPSSKSTELPVDW 
S IKTRLLFTSSQPFTWADHLKAQEEAQG 
LVQHCHATKVTLPKSIQDPKLSSELRCT 
FQQSLIYWLHPALSWLPLFPRIGADRKM 
AGKTSPWSNDATLQHVLMSDWSVSFTSL 
YNLLKTKLCPYFYVCTYQFTVLFRAAGL 
AGSDLITALISPTTRGLREAMRNEGIEF 

slplikesghkketasgtslgygeyvik 
itl/ssstdlwtheidlhrnsl*nrdsn 
csnf/lreqaisdedeeesfswleemgv 
qdki kkp-^ i - ,s i kt irkekhevqmdhrpe 
swlvkgintftllnflinskslvatsg 

PQAGLPPTLLSPVAFRGATMQMLKARSV 
NVKTQALSGYRDQFSLEITGPIMPHSLH 
SLTMLLKSSQSGSFSAVLYPHEPTAVFN 
I CLQMDKVLDMEWHKELTNCGLHPNTL 
EQLSQIPLLGKSSLRNWLRDYIYITORS 


6608 


20509 


A 


6662 


326 


^ 


IMIFLFFQTGSRFVA\RLECSGTITTHC 
S LNLPGSSNP\ PTSAS * WTHCRTRVI L 
NFSVDPGSRHWKGGLELLDAILKPWPP 
KVLRTRGSTPEFRSGGSGVRIRGFN 


6609 


20510 


A 


6663 


414 


105 


QGKVGFKVGLNPL3RLLHFFKQKLFVLS 
* KGQVKKLSLKIPFPPGGFFSPPr *NPP 
P*NKPQRFKNPFMGFT/IEKKILFLAFK 
S TKSGG* NPPS K/HKKI * FS F/ CFWETE 
SRS VTQAGVQWHYLSSLQPLPPRFK* FS 
FCKPHKRVFKPLGLVLWGGISNRRRKKT 
SGQKEHFKA 


6610 


20511 


A 


6664 


1 


407 


GCVPWPAAREAEAQ* IA*TRET/RLRW 

AGIVPLHSSVGNKSETLSQKKKGAPSKK 
SLQGVALFLSQLGKLPPFLQRKKRPQRG 
GGRGQLAPGYNPPAGKTLVGPGPHLSFC 
SKKAWKELRPWKKPRGFWFLINST 


6611 1 20512 


A 


6665 


1 


280 


FFFF*HRVSLCCPGWSTDLHLLPP*IiAR 
FKRSSHLSLPSSWDYRHTPT/HPGFFVC 
LBVFCRDRVSFVLPRLVSNFWAQAILLC 
GDRKVMGLQA 


6612 : 20513 


A 


6666 


408 


89 


WLPNSLILPN*NYFVSYLSDRWSLCRP 
GSTAVAQ*ELTA\A*TLGLKRSASFSHS 
HSCSLPSSWDYRCTSSHLANFFFKRWGF 
TILPRLILNSWPQVILLPQSQRH 
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sequence 


SEQ ID 
NO: of 

sequence 
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od 
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USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W^Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop eodon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6613 


20514 


A 


6667 


484 


208 


ITWSAAGLIHYSFLNPGEPMTTEKYAQ 
EIDDIH*NLQHLQPALVDRKGPIFLHDH 

A* phaaqpkxlqklnklgcrvlphlpys 

PDLLPLT 


6614 


20515 


A 


6668 


495 


227 


PAFVKVPPEKDQRQNSTHAGRNHLPME I 
LLL * KKKLIETESHSVT\RLECSGVITA 
HCNLKLLGSSDPPTSASDYRLKPWHLAP 
SSFSMS 


6615 


20516 


A 


6669 


2 


65 


DGVLLTLSLLARLVL\NSWPQVINLPWP 
PKVLG*QAGLGTPGLK 


6616 


20517 


A 


6670 


403 


224 


ILL*P*PPGLK*/FLPSSWDWRRASLCL 
ANFLFFVEMGYPRVARAGLKFLGSDPSA 
S QSAG VTGVS RPGR FL IRT VL 


6617 


20518 


A 


6671 


406 


162 


rs yaqagtqwld fsslqp*ppglikqss 
hfsfsqsash/rhpdnycipcrnrvyvg 
qaslellgscdlpasasqsagitgscl 


6618 


20519 


A 


6672 


453 


32 


wsatlhtswvcacatrcvgmcmtlgv 
cgcecvw*gcvctsvn*\htstgwsek 
V* LCVCGCV* lcvaecmafvsrqcvcdp 
vcclpekmrlspisva*vcapmkvspcp 
pqlselhllslniyinkcvslsqnawlf 

L 


6619 


20520 




6673 


378 


180 


kevirsykvslfspriekwikk/i/hwy 
iypmeydsaikrnevlihatt*rkleni 
tlnerifdiipfi 


6620 


20521 


A 


6674 


378 


94 


pvknrrtkelhikdvctpmlivilfmiv 
k/s*kqpkcp*idewmnkmwythmveyc 
svskkegspficyneditlneishkktn 

lYWSTYIRYLE 


6621 


20522 


A 


6675 


1 


380 


VCSL* vlcrhmscfqncsl* fsfsfwff 

WRQCliPLSPRLECGGPMSAHHNLRLPGS 
SDSCASASLVAGIROaCEHflEVrFVFW 
EMG\FHCVGQAGLELLT\*VICPPWPPK 
VLGLQVLATVPGQPHI F 


6622 


20523 




6676 


424 


135 


shrwgvkcfkynifffrd\rv*lcctgw 

s*tpelk*ssclglpkcqdyrhetlrvs 
y f *-us i qigvl, amvsklvlns*aqai 
llpwpprvlglqi 


6623 


20524 


A 


6677 


408 


91 


hthfvwppvl ip** pmr* vllltpntnf 
flrdrvspchpgwsveaqsqitaasn\y 
glkqfshpsvpkcwdyrrelpcmvslii 
fiyvhlhmyvhaekekpeypl 


6624 


20525 


A 


6678 


1231 


217 


ppyfmnpltfpqkrkfkgdktlkkwl*! 
reiilqhrgig*ktdsir*mgkteapi.k 
ekfllgtalqpsflpymlghfkee*gee 
fpgthswpfpsfpvwglglsvnhrgihg 
prvslqtwgpqqeamececgiihgtedtg 
\gavlqpg*vt*gfhtptsepqlvappc 
tcsglkaea\gflklpgkqg\segalpp 
*splpsacfpelglrhlgsspglmihpg 
wglftsplglkilgprhpagpalpqiaq 
rpslkfr*rrlgwgpellpaqtcgaqsl 
gpstapllgqgpwglg*hasrwgtlgla 

VAPaNEVPSIjSPR*PRSftGPLHEHRVPV 

rswg 


6625 


20526 


A 


6679 


1 


178 


rgwdyeeplp/rpq*cfvflvemgfhrv 
gqdglylltl*ssqsagitgvshrahlq 

PPIF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
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od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Seriiie, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V^'possible 
nucleotide insertion 


6626 


20527 


A 


6680 


3 


135 


KSEHLEREPI IKQLSTMFF/ TAKYGWHL 
RGH*HRCQKKPEPPEDR 


6627 


20528 


A 


6681 


425 


107 


KGPPTPS/PPPPSPVFPTEKILVPPPFF 
PPPPFKKPPPQNIFWGPKK/IKNFSPPR 
GEKNFFF*TPPPKYFLGAKKKKNFSPPR 
GEKNFFF 


6628 


20529 


A 


6682 


424 


284 


CFLQPGP\PGSKGASPPSFQDWWGLRQA 
LPP P /RES FFLFLFLFFPI VQTGFHHVA 
RSGLELLTS SDLPASASQS AE I TGVSH* 
A*PFP 


6629 


20530 


A 


6683 


1290 


449 


PLSAPPPGDPSP*KSSPRAPNLPAP*PP 
NLSPWPPKCPRPRR*PASS/PP*LPVPP 
KSPSPPV\PVVATEASRGHQPQIT*ELG 
SLVLRMTEKGTWG* GI PDS I MPLPRRRS 
EPRT ORE * GDTRWRWRRRRRS TGAVS TF 
SPGFACGGGSGA/PAHPGEQKLGASARP 
DE / PVCPAAW VLLP SAR WAS PAAAP PP 
LPSQPVH*RFKP\SPPY*PP\A1.PPLSP 
FEPSAPPHTGSVPY*QGVPP/SEGCPST 
SKPLISVRPPVTINGCLPQSQPPKTHTV 
SRLYEEN 


6630 


20531 


A 


6684 


425 


40 


EWASLKN*HPP/GANQAVEAVEAPTHCW 
*GCKSIEPLWKTVWQCLDKLNI CLLYDS 
TPE*MSEHVYQAACKKSFTAVLFMIPPN 
* KQPS CPS TGWLGQAWWLTLVI PAPWEA 
KVGGSLEPRSLRPAWATG 


6631 


20532 


A 


6685 


2 


107 


SWLTAASTSQWKRSSRLSLPSSGDIIRC 
MPANFFFLKDDVSLLPRLISNSWTQAIL 
LSLPTKGLEL*AHCSLWLPGSSDPPASA 
YQWGTTGACQLIFFF 


6632 


20533 


A 


6686 


12 


224 


TPPGSTHASGAPPCLANIKFNY\L* RLP 
RLFWNSWPQVILLPWPSKVLGLQWATV 
PSPFLIFDRDIDREI 


6633 


20534 




6687 


404 




TAARGIIYGGPIKKLLPRKPADVK/ SGS — 

FKTAAPFFFFLGTRTLTEFLFIIIQN+K 

QPKRPST/DSIMKLWl'ISTIEYYSAKKR 
NKLL IHTTAWMNYKG I MVS KRGQ SQKS I 
FSRTKTGFGRKHISGCFLYPLPLRL 


6634 


20535 


A 


6688 


410 


163 


PRGNTPPGFSPPEKENQKVFNPPPGNIV 
*KKaQPPPQQAQKKQPPY*aPPLSFLPP 
PCKKKKEGPPPPPPPKKNPPKGNS/SFSP 
KKKKKFFPPPPKKKGF*GGPPLKTLFFW 














GGPFFFFFFG 


6635 


20536 


A 


6689 


205 


405 


KSPPSPGGPPLIGKLEIGGAPKPKNWPP 
PLGPKGENPFLftLGGGGGAKNHPPNPQF 
ILLFLOGKNP+KSPPSPGQPP/ANRKIG 
HRRGPKTQKLAPPLGAKGGKPPPCPRQR 
GGGQKPPPQSPVYSPFFGGKKPL 


6636 


20537 




6690 


3 


199 


HGSLQH* PPGFKSSSSLSLPSSWDHGHV 
QPRPAIF/CFFLILEKRGVSPGFPGGPE 
TPGLKGLDLL 


6637 


20538 


A 


6691 


8 


350 


SVCLGLP/ECWDYRYEPPHLA*LAFLCQ 
I IFHHMDWIYHVSILWDGYLGCFYFLA 
I INNV\6*AGLBLLTSGDPPTSASQSAG 
I TGVSHCTHPYCYTFLTRE I TTPKE VK I 
GLCM 


6638 


20539 


A 


6692 


373 


197 


LPSSWDYRFMP*HMLFFK\FLFLLFCRD 
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nucleotide 
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SEQ ID 
NO: of 
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od 
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nucleotide 

location 
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ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine; 
T-Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














E/SLTMLSIiLVSNSWAQATLPPQSPKVL 
GLQV 


6639 


20540 


A 


6693 


369 


100 


PMLARMVS IS* PRDPPASASQSAGITGV 
SHRAEPIYFFK*R*SHAGYSAVPVHRHD 
PTTDKHNSFLPCFLQTGSCFWQVGMQW 
HDHISL*PRTPGFKQYSHPSLPRH*DYK 
PTPP/HPS* FWQVGMQWHDHI SL 


6640 


20541 


A 


6694 


12 


182 


FLSSWNYRCVPPHLAN*KKNFFFYTDR/ 
SL&VI,PGLVSNS*TQAVLPLQPPKVLGL 
QA 


6641 


20542 


A 


6695 


402 


248 


SSWDYRCAPPCPAWFYKIL*TGS/I.TjC* 
PRFVSNSWDQVILPPWSPKVLRGV 


6642 


20543 


A 


6696 


301 


309 


VaEF*IDRILLSPRCHPGWRAVA*SRLTA 
ISASRLICARPSIILSLLSSWDYR*VPPGP 
JJ^FLHFFCR/DHGFAMLPRLVSNSWTQV 
ICLLRPPSVGITGVGHRARSILLD 


6643 


20544 


A 


6697 


3 


387 


PCIiflWFF/ VFFVETGTHYAA* CGLRLLG 
SSALPV*TS * SAG 1 1 GMSHCTC /LQI TL 
LKTESHS\VAQAGVQWHDLGSLQPLPPG 
FM*FFCLSIQSSWDYRRTSPR PANFC I F 
SRDEVLPCWPGWS *TPDLR 


6644 


20545 




6698 




287 


LWFLYQGNAGITGISHRAQENSFVFFKD 
RVLLCCPGWSAVA\HSSLTVALTSQAQV 
IFI.PQPPG*LGNFCIFSRD/RGKAMMPR 
LISNSWAQVIRPP 


6645 


20546 


A 


6699 


412 


3 


FEMES CSVARLECSGT I lAHCNLHLL S S 
SSSPVSASRVAGTTGACCHTWLFFFFFF 

V/FLVETGFYIIVAQMGLNHWAI-CAILPQP 
P*VLGLQARITTSSPNSSFTREHTSGSG 
VGYPAKFHS I VLLLLYLLFHLLCGVF 


6646 


20547 


A 


6700 


393 


SB 


QKLYPYLTPYAKVW*QWHa>JLSVTAKKI 
*FLGENTGVN1.HSFRPIKGPLDM/TPKS 
ISTKRK*KIKWSSSKL\KNFYALKDINIK 
KMKRQPTKYEKIHLTLNNWV 


6647 


20548 


A 


6701 


494 


239 


KAGSHSWSRLBCSGMISAHCTFCLLS* 
NNPLTSASLaaGTTGVHHHAWLIFVFSV 
QMGFCHVTQAWVLI FDCGFBSPINIKYN 
LL 


6648 


20549 


A 


6702 


2 


379 


RPTWVTWQNPASTKKKKNFFFFKTKSPT 
VPQAGGQGG1TLG*RKFLPPGLKRFSG1.K 
RPEAPKGQSCSLMGGGLCPLLPP/WSGT 
PPQR 


6649 


20550 


A 


6703 


220 


47 


TGFSGSQIN*EK/ IWLGTVAHTCDPSTL 
GGAGGQVT* GQEFKTSRHE PLHLTVLCS 


6650 


20551 


A 


6704 


80 


414 


ICYYFIVFAIYLNQFFSNCYRSFTFFFC 
PFETESRSAAQAGGYKPTLTLLHPPPLG 
IKQFSCLTLPSS\WAES*KPPCPTWGNS 
VFVGIYKUKLGGGGAIRRVQTCNQTYSG 


6631 


20552 


A 


6705 


69 


416 


NVEKHHNIHNIKKTPVYFKGFLLYCCKA 
EVFyVLDWPFIFYFLFYFFETHCVAQAG 
VQWIINLRSLQPSHP\SSSDYRCaPPHLa 
NFCRDGVSPCWPSW/S*APSLKLIHPPW 

SPKVLE 


6652 


20553 


A 


6706 


373 


75 


HAQLTFKFYYFRNTFCK&VASTKSNSSD 
RSE*SKLKTFWKGLTILDAIRKISDSL\ 
EEVNMSTFRGV*KKLTPIIMDDFEGFDF 
TRESNHRGAEHGGS RL 



958 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SKQID 
NO: of 
peptide 
sequence 
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od 


SEQID 
NO: in 
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nucleotide 
location 

ing to first 

residue^'of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \-possible 
nucleotide insertion 


6653 


20554 


A 


6707 


331 


379 


K*KIWKI^*KGLTII.DAIKNICDSWEEVN 
IST/LTGVWKKL/ITLIDGSEGFKISVE 
EGNAGWEIAEELEIEV/EPEYVTKLLQ 
SHKKT 


6654 


20555 


A 


6708 


578 


152 


LQQENPGSLQPLSP\GFKRFSCLSLPSS 
WDFRHAPPCSAHFf i/FL VSRG iTHHVG* A 
GLKLLASSDSPTSASQSAGITGVSHEAQ 
PLIGFKVNSVKKSVSVPTREILKGNMSM 
YEQL 


6655 


20556 


A 


6709 


392 


146 


FFETESLSPGLECSGTVSAHCNLGSSDS 
PALPS/RLLGSSDSPAPAWLYSFKL*! 
Y/FHKTCNYVLKINILLHVAIPYSFSWS 
C 


6656 


20557 


A 


6710 


9 


393 


KSKPCN*DKSQSYRL*LFAQLFTF/SNF 
FYFLETF*ESHCVIQAG1HSGTIMAHHT 
LKLQGSSNP P ASAP P S SWDHR CGP P PLA 
TFLKIFL*RQWAHYVSQAEDNGLIMFPR 
LVFELLGLKRFLLPQPS 


6657 


20558 


A 


6711 


34 


387 


qdfgflsffllfffekglggptcracSpe 
snlme\ggppglkqfsgltlprtgnngr 
asrsrakfwn*kgkggppgpfveletft 
lrgpphlsspkgknngggpkpdffkkvg 

KKAPNP 


6658 


20559 


A 


6712 


1 


188 


IOAWWYTPWPATWEA*AGGLPAEPRRS 
RLQ*AVIWLIISCLGDRVRPCLNNKKGS 
FPPQPLW 


6659 


20560 


A 


6713 


377 


98 


SLGLKQSSHLNLPSSWDYRPVPPCPRNW 
FVPVPFFF\RE*GLAMLPRLVSNS *AQV 
ILPPQPSKVLGLQaRTTAPGFFSHLLKT 
LVPiraVNLNI 


6660 


20561 


A 


6714 


353 


229 


SSCLGLPKSMDYRHEPPHLAYFYY**RQ 
ALV/MLPRL*Y*FLDVIS1M*LPLPFHVAP 
SHLLNSVGMV 


6661 


20562 


A 


6715 


371 


64 


ENHGDCTPAWATEAKLRFKKKT /CYQMF 
ITTLFTIAKRK*LKCPSSDK* INKMWYI 
HMMEY/YAIKRNWTLVKLNQNPYAL*KV 
LLRE*KQNP*SVENIFKTHI 


6662 


20563 


A 


6716 


462 


232 


HFBrLITIKIIPFF/C*DGGLTMLPRI,VP 
NSWAHMLLPQPPK*LGLQARACKL 


6663 


20564 


A 


6717 


425 




GFGGPPFKKftRFGFLHPF*TPPVIjRPRP 
PPVFFGGFKftSPPKKKTPK/PPGGGGPP 
PYFHPFFLSPFFP I FFEKEPPPFFFWRG 
LGPPPPPFiaSIPFFFFFWVEGGKFLPQKK 
KKKFFQS ISTRSVIHK 


6664 


20565 


A 


6718 


3 


413 


GTFLTKTSLCKTLLY*VISYMCV*FIS* 
\SYLVYI lYFKDl CV/LLYMYI FYIYTC 
VHISVCMYMYVCMYVCMYKA* PANLKTF 
CTQKHLAWGWVQWLISVISALWEAEARE 
S PEVETNLAQLGPTLSLPRKKKKKKKK 


6665 


20566 


A 


6719 


408 


71 


SSWDYRC/LPPGPAUFCIFSRDGVSPYW 
PGWSRTPD*RSLPGGDVMPALVIRHVPE 
LLKKQHSSVRSSRPAWPIW 


6666 


20567 


A 


6720 


375 


57 


s vi remq i kttmkyp f i ptgl i vi kkte 
nh/rgwqrcreikaciy*wwgckmtqi.l 
qktiw*fi.kklniel/pydpaipplgm* 
lrelktclprlgwahacupstlrg 


6667 


20568 


A 


6721 


391 


164 


RISETCT*MFPAALFIV/DQNNPDKWTN 
LMWSYTHTMDDYSAIKENEVLLHSSTWM 
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sequence 


M 
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USSN 
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ing to first 
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peptide 

sequence 


Predict- 
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location 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possibIe 
nucleotide insertion 














\ENLENLMLS KRGQS PKTPFG 


6668 


20569 


A 


6722 


381 


278 


SSLQPQTPGLKQSSCLSLSNS \ PPRFID 
*SISLYRQGLTILLRLLSNSWSQAIVLP 
GPPKALGL*ASKKFLYSNI 


6669 


20570 


A 


6723 


2 


363 


QRSLEHRPCKKC/CGLW*VS*KVQYLV 
YKESSSKKNLLRRlHCNFLfSGIVKHl!'* 
KLLRTPRSFCLCGLHIP1NFTIF*IKIE 
KFKKC*PGTMAHPYNPSTLGGLGGWIT* 
GQDGLDLLTS 


6670 


20571 


A 


6724 


352 


58 


HGGAIGARFFKVFNLI *FNIiVCKDE/ SF 
SLLAEAVINSWPRVILPPWPPK\WGFQ 
AKTPLPRLKGPLESLPSLKKKKNLAYAR 
TITAQIGLLVCFIDQQ 


6671 


20572 


A 


6725 


383 


54 


KS S P FFFFEQGLLCCPGWSAWS PLQPK 
CPRLKQFSPLS/LPSN*EYSCTPTTFSL 
HVCVSIHKYI*YrDQYrFKFF/CR/DRT 
LAMLPRI,VSNSWSQAIT,PP-*PPKVI,GFQ 
V 


6672 


20573 


A 


6726 


3 


343 


LEV*ARATRQE*KINGT*IIKYKKVKQY 
SQNDMILYIQNPKDSAKKLPELVNKFSS 
FKVNRQNSWFL*NSNKKS/EKEIKKKL 
/ PFT IAS *RIKTLGINLTKEVKNLGEAQ 

WVTA 


6673 


20574 


A 


6727 


3 


615 


HTSQGNL* IQYHTPWHPPSSERVERMNQ 
TLKSHLTKLVLKTRLSWTKCLP lALLKV 
RTVPQKEVGLSPCEMLYELPYSHFTVDI 
PTFETKSQFVKSYVLGLSSTFSSLKAK/ 
ELFSTDATLGSSPGTSAFSLGDDVLIRS 
WEEGKLKPAWEGPYLVLLTTKTAVQT/D 
NKKMQTHHTPVKKASPSSKSCAIVPRP I 
PTKLKIKKKKA 


6674 


20575 


X 


6728 


2 


228 


SGAI / SVHCNLHFPG/ SSSDSSASAS * V 
AGTTGV[rrMA*LIFVFL\/ETGFHPISQA 
GLELL/NHMICPPWPPKVLGLQL 


6675 


20576 


A 


6729 


98 


359 


KVEYTVLLF*GFKGKFPTLPLyLSFSSF 

TiKNHTitSNFFLNRVLLCYPDWSWG* FT 
AHCNLNLSCLSLPSTW\DSRCVPTGAWI. 
RFI 


6676 


20577 


A 


6730 


10 


428 


RSTRTDYAALREVLPDPRRCRPRTGAWS 
NGRRDHPGAGHSGPKaEflETRBTLEKGL 
LFCCPGWSAVA*PRPTATFASRAQSLTV 
SPRLECX3GWSAH*NF\SFHFQRWGLAV 
LRTLVUHSWSLALLLLWPRKVLGLYALA 


6677 


20578 




6731 


425 


158 


SQTHDLR*STCLGPKCWDHRCAPPRPAL 
Fi:,F/FIIKCPMSIRKIEHAQ*LPGI*KKN 
FCRDDHLPLLSRLILNSMftQGIL*CRPL 
KVLGL 


6678 


20579 


A 


6732 


431 


22 


RDRVLLCHPE* SAW* S*LTTVQNSWAQ 
GILSSI*DYSCVP\LPVNF*TFFKSVYY 
YFI IFIFEI YFCNSKRLGL/LGVSHRAR 

PPRI.VIjNSWPPQVILPSQPPE*LGLEAL 
ATAPAIPRNSNL.VWHQYSFKS IPDDL 


6679 


20580 


A 


6733 


44 


414 


AIYDLPFLLIDLYPMELRAS IRAHTCT* 
RFVATLPTVTKRLNQPTCPSTDE*QNKM 
WSL/YIMNYYPAIKRNEILIHEWIMLCK 
RSQSQKaHTE*FHICEISRISKSAETED 
ELWATRWGKGRE 


6680 


20581 


A 


6734 


375 


49 


QRLTYHCRFfiyuTASNKMELS *TQMHQK 
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nucleotide 
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correspond 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=C)'steinc, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unlinown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SACSMCPGRL*GVHAMRPKVLiyiRLSRSH 

/varpiaatcaq*vcdrik/r/slliee 
qkiwkvlkaqtqsqtakfknetffrnk 

N 


66S1 


20582 


A 


6735 


381 


88 


slfprletpglrpsfffflfslshnpgw 
swv*srlttalns*arfsllswcvplh 
qahff\ffcrdr/sltmlprlvlkswaq 
aipppeplkrsglqv 


6682 


20583 


A 


6736 


384 


293 


AHLSLLSGWDCRCMPPCLDNCVCLC/RV 
FVCR*GQAMLPRLVSNSWTQVILLPWHP 
KVLGL*AQDQEPLWGRSGPGLPQAASG 


6683 


20584 


A 


6737 


24 


300 


NILSAFLCIjS*P*H*YGLVIL*NVPQFR 
FLPLFFPMQMYTHIFFKIRIRPGWSHA 
YNPSTLVGQGGR\SHEFKTSLDNIAEIPR 
LYQVQNKIR 


6684 


20585 


A 


6738 


2 


353 


ihcw*-kckitytlektvwqfpiqlnivl 
pyipa/ipllgihtremkthiytk/t/c 
mqmfgwlfi lakivfqlvwr* ikl * s i 
hkvdyhse/irnkqlyfat*:mlkctvl 
se*s*tqktt 


6685 


20586 


A 


6739 


2 


336 


ihatytnp*hrh'\vhtpthvht/hvyth 
ngtiithtarawtpapptsliiipsptesp 
palqipnlpfglsssphpagssqgepee 
ggcqreaknrvghraekwetelcagdmn 

L 


6686 


20587 


A 


6740 


3 


385 


KKQATLLWIIHLATTFSVEI *QSKIEVH 
DIFKVLKEKKK*TTFYPKITNPGKIAFI 
HEGKIKLY/IVK*QLRDFINSGLVLKAM 
LKRVL*SKRKGH*QTIRNLRVQNSLVTA 
NTQPNTEYCNTVIVVGN 


6687 


20588 


A 


6741 


406 


155 


TGSCSVTQA* /CWSAVIVHCSLDLPGSS 
NPPASASWIAGTTGMCYHAQLPSFNLCL 
QFYCMGCVFSRFYLRFTFGLQSYTRKQK 


6688 


20589 


A 


6742 


1 


215 


LPYSPDLSPTD/Y/HFSKHLDNFLKGKR 
FHNQQDAENAF*KFIESQS1DFYAVRIN 
KLISHWQKRVDCNGSYFD 


6689 


20590 


A 


6743 


376 


109 


YHMEVWMLYSLKGSaYLRLPaatoYWCP 
PP*QAKF*K\FFCRDKDLflMLPRVVLNS 
* LQAILLLRPPKAEPLQLLGQFQFREES 
GWN 


6690 


20591 


A 


6744 


382 


148 


NIPQFMFDILFQ*LITriSCPASF*SKFP 
LSSLYFIQ*GR/WLGTVJUiACNPSTLGG 
RGQR\*A*DQELETSIiftHK2yiPHL 


6691 


20592 


A 


6745 


403 


209 


HlVlNRllRKYCA*HYftlrfKENNLDKLDT 
FLE\NTTCKNLTQEETENLNSPVSVKEI 
EQGKLCSDRN 


6692 


20593 


A 


6746 


366 


40 


QGSVLSHJSVKIVFWFKTGCVHFLFPLN 
FLSLFQNYVCSHTS/yPLSLIPHESLWS 
FPPSFNHSS*Y*HFLKTN*MPDAMaHAC 
NPSTLGS*GRWIT*DBEFETSLANr 


6693 


20594 


A 


6747 


392 


57 


APKKKGGGVYPLYPKKSGRKKKPEG/PP 
PKPKKKLKTPGRKNDAKDIKPGGAKFFF 
S PKKKRGAPPGVPPRLF* KPPQATI *GG 
PKKKKKPPQEGGKKGAPPKAPPLFFFFL 
YF 


6694 


20595 


A 


6748 


376 


140 


ledni / gysnqnagtysrvs * intpkikv 
s \qskiakwdlikpkrlcaakvsmnrvn 
rqppewdrifalasadXrgflnkilg 
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w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Predict- 

corrcspon 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6695 


20596 


A 


6749 


540 


172 


FFLRQSL/ DSVAQAGVQWEHLGSLQAPP 
HRFMPFSCLTLPSS*NCRCLPPCPANVL 
YF* QRWGFTVLARMVS IS * PRDPPASAS 
QSAGITGMSHCAQWNTLSLHAFSCDKV 
PQNDLL/WPSKSFCGTLSHEKACKDNVF 
TTWAQWLMPVIPALWEAKAGGSRGQE IE 
TILAHTVKPHLC 


6696 


20597 


A 


6750 


1 


353 


WFLERESTRSENSWIVEMTT+DLEYYV 
SBCAVAGFEKIDSNI KRSFTVGKMLSTSL 
H/ CYRE I FHERKSQSMCQTS LLS YFKKL 
PQPPQPSASATLIVNIKAKISAKRIKLT 
ECSDDC 


6697 


20598 


A 


6751 


404 


231 


SPS/PPPPYFGGPPGFFPPPVF*TPPPE 
YFFGAPKKKKNFPPPGGKNFFFFKGPPP 


6698 


20599 


A 


6752 


77 


371 


ITCPVLYLTSRNTKTKKYYLDCCCFWP* 
WNNRDQTYLHLFFF /CFl KKNFFFQPKR 
GAEGGFCINETPPLGGKGNPPPPPPGRG 
EKRAPPKTLCFFFFF 


6699 


20600 


A 


6753 


354 


169 


ETESHSVTHRGVQWHNLGSLQPPPPGFK 
RFSCLSLPKK/WFSVKLN/PHT*SFCYH 
ILKALII 


6700 


20601 




6754 


■ 


- - 


CICRTAFELTGRATRSEKEERKKERMYD 
*KKERKKEKEGILLKKEGQEEEEGRKEE 
RKEGRRQTQDEFGGQNRLSTVKQGS*PK 
KRK*GIQ\QTGAPTQERDKETSQNKNEE 
KARHDS*QPQA*RTQSRLQQEGRKLWKR 
GLVSERK 


6701 


20602 


A 


6755 


366 


262 


FF*RWGLTMPRKLVLNSWAQVILLPPPL 
E*LG*QA 


6702 


20603 


A 


6756 


3 


223 


GLKQSSCFCFLSNEDCKHVPP\HRLFFL 
FVCLFCFVLFFW*RyGGLaVLPRLVLKS 
WP* VI PQPWPPQMLRLQA 


6703 


20604 


A 


6757 


418 


244 


FSLLSSWDHRYMPPRMATF/SFVLFCLR 
EGCLTRFLRPV*NSWTQAIFPQSPKLLG 

LQV 


6704 


20605 


A 


6758 


46 


256 


KLK1\TKTEAEHS S SACL* /LPGSWDHRYL 
P PRIiAl-IF * TFWRDRVS I .LPKGVSNLWAP 
TILLPWPPQVLGLQV 


6705 


20606 


A 


6759 


391 


53 


FIFFFFPHSSHGVSPCCPGWSRTP\GSS 
GSPASASQHAGITGRNHHTRLQDIY*LB 
/TGQTVTLQFSVLADTTLTKRLSWSILC 
CYNKIPETG 


6706 


20607 


A 


6760 


377 


2 


FGEPPGFSPPPVLKTRPHNLFLGPHKKN 
INFPPP/WEIJSrLVPLKGPPFFFFFLR*G 
lAMLPRLVSTSWAQVlIiPQHPKLLELQ 
VHAIMVLSSRTSHS PRSRSLSPRFSQRLL 
PVSCNPAQPSPMA 


6707 


20608 


A 


6761 


2 


259 


r,TIA*KNFIFLGL*QGICQIPKDyQTSL 
R*1KDNLNK/WDTPCSWIGGLNIVEMSV 
LSKLITOIFRAIQMKTTSGFPFFFFWKKG 

IFF 


6708 


20609 


A 


6762 


1 


169 


FFETEFRS FA*AGV* WHDLGSLQPQPSG 
\SSDSPASAALFFFFFFLKKKIFFFPPG 


6709 


20610 


A 


6763 




496 


matctpkhintqicvsfllslklsefeq 
msifkkgenelalgtgnirpydscpgnh 
p/apfatspaprhpsgpswrpptsggvp 
araecpaspnplppgfcsqspsrgrkee 

*LRGBSTSTLAIF* *APNEYHSF*LPRQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 

residue of 

sequence 


Predict- 

nuele- 
otide 
location 
correspon 
ding to 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=GIycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














P* PHGDCPARKKKKKPNDPRLQPTVT 


6710 


20611 


A 


6764 


1 


229 


VLLLCFFFFVL*DRVLVGHPGWSSRLYL 
KTKTQKKQNNIFVCRVGG\LTTLPRLVS 
NSWPQMVLSPQPLNALGLQV 


6711 


20612 


A 


6765 


580 


273 


wrqslthsvaqpgvqwhdlgslqppppg 
fkefsclslpsswdyrrapprp™f\*y 
lvqtefhhvgqaglelptssdlptsvsq 

SAG I TSVSHHAQLDMI FLR 


6712 


20613 


A 


6766 


220 


239 


aqe fkaaasttalqtetlclkkkkkkkg 
gvslggvssnfrtgggasvtspsif/cf 

FFFF * DRVS LSGVQWCSLQP * TPGLKRS 
SSLSLPSSWEYRHVPLHPA 


6713 


20614 


A 


6767 


89 


387 


FLSPSGSQKEHLQIQLRSTFSFFFFFKR 
EPRFGRVGGQCPNLG*WKPPPPG*RQFS 
GLTLT/SNWERWATPPPPTNFWIFKKKG 
NS PWWKRGAKIFGPWDP 


6714 


20615 


A 


6768 


397 


223 


IREVGWINWQHKDPCD/KWNCSVY*L* 
* W IHKPTHDKT\HRNTHTHTHTHTGMHS 

HTQK 


6715 


20616 


A 


6769 


3 


367 


SWKVCS I<RPI PS PS LFF* KHPRVLLRAD 
GT * * VS LATL*HMQLS *AGSVKEAEPRS 
NAPPHKATFLFLFFCTD/SSLTMLPRLV 
MTWPQAILPPCPPKVLGLPAEANSSSL 


6716 


20617 


A 


6770 


490 


260 


FFETESCSW*AGVQW\LELLTSSDPPA 
SASQSAWITGiaSHCAERCGTHYHLCERF 
LVSFVTALLSQLLBLGLflHSRC 


6717 


20618 


A 


6771 


418 


130 


PSPRDUSrLFPLNGPPLFFFFF*DRVLLC 

clgwsavasn\fdlk*strlglpkcldy 
rheplcpamwdtlssl*eifglfchrps 
lpaprtgpgsq 


6718 


20619 


A 


6772 


311 


314 


FAVGELFLLLHHVGHACLPFCHDCKFPE 
ASPA]MLPY/QPVEL*AT*TGIEVLTPGC 
RFQAVFGLKVGFHQGPTPVYLGI 


6719 


20620 




6773 


465 


283 


CPSLPSSWDYEHPPPRPSNF/ * YF* YHV 
GQPGLELPTSSAPPASASQSAGITGMSH 
RAWPK 


6720 


20621 




6774 


410 


91 


RDHSKLQPQTFRLKQSSCLSLPSN*DYR 
HMLPCIiAGWLaGWIiACQTDRQTDRQTDR 
QTDRQTDR* TDR * I DS RDR / S LAI LP RM 
FSDSWPQVNLLPQ\WPPKWRLWA 


6721 


20622 


A 


6775 


409 


253 


LIFAFF\CRDGGLAILLRLVSNS*PQAI 

LPPWPPKVLGLQARAPGVRPDLS 


6722 


20623 


A 


6776 


3 


278 


FFFFSLHLWAPLAFFFFGAISAHYNLPI 
PG\SSDSPASASQVAGriGACHHVQLF\ 
VFLVEMGFHHVGQAGLELLT\ *VIHLPQ 
PPKflLELQI 


6723 


20624 


A 


6777 


79 


442 


RDEDRGLERRCVSACSTPQLGNASTT*G 
CPGVSDPSLHILF/CFFKEN/SCFAPQA 
EGGGHNLG*LEPGPPGLKQFS CLTPQRI 
WNNGGAPPPPLYFFFLKKRGGSPLWPGW 
A*TS*PR3 


6724 


20625 


A 


6778 


77 


417 


HRGLHLS FFLF / S FLFFFLKEAALLWPR 
EQGRGWIWVNCTLSLRGQGISKALPS*E 
AGTKGGRPHPTLFLVF*KKTSFSQVDQG 
GIDLWAEGMPPPGNLKWFELQKETLCPG 


6725 


20626 




6779 


62 


235 


IQELRIREGKCLAYEHTARNGKSQDLHA 
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SEQID 
NO::of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=AsparticAcid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, y=Tyrcsine, 
X=Unl<novvn, *=Stop codon,/=possible 
nucleotide deletion, \=P('ssible 
nucleotide insertion 














GLSCLCCCCCCCC\CNCWP*STRKFFTK 
PE 


6726 


20627 


A 


6780 


908 


566 


SKMEFRSCCPGWSDLGSPQPL\PLGFKQ 
L\SCLSLP\RAWYYRHAPTHPANF\*FL 
VETGFFHVSQAGLELPTSGDLPTSASQS 
AG 1 1 GVRHHAHPECVFQKQLSEftKLRR I 

WNQED 


6727 


20628 


A 


6781 


11 


483 


TTALLKAVRH/G *HLSLQRLLLS F \VCL 
CPAPRGGAYRGRQAS *SCGGLHPVRASR 
LLCLPNQAWAMAGAPPPTSLPPCSLISD 
CCASNQRDSVGVGPSEPHAGYNLPVCRF 
LSPLEKRSIRVGVTRFSRCRLSPLSLTR 
KGNSLTPCASRVSSASPCS 


6728 


20629 


A 


6782 


225 


589 


SRRSEPGNFRSCLGVGVTCCMCLSLSLP 
SRSFLRPLFLSLSVCVFVCLGTRVPCAP 
EGGFLARRPFFWSASPRVSAWVLWPVGN 
RFPGGSGLGVCEGLGNVGI\WSEPQGFS 
FPPHPEQPLC*AGSRRALPNQGQR\LPG 
AHRL PAGGCPQS FKKVWTPVALCPHRE 
M* PQLDAGTEKKAGKGMGQACLSLKGWP 
SWPSHPFDTPPRMPWVAWPPRILPGLW 
PLL* PPAGRNAERKKPETDRRGRSAGSQ 
PQACADGVRVSLPKSGALVMSRTGLPVC 
PWAALSPVARSRGDQNPATSGAAWGWVL 
PAVCVSLSLSLLALSFALSFSLCPSVCS 
CALGHVCPVRRRVGFLHVGLSSGQPLPA 
SLPGCCGRLAIVFPAVPAWGSVKAMATW 
ASASEPQGFSFPPHPEQPLC 


6729 


20630 


A 


6783 


458 


170 


PCIVTASASQSAGITGVSHRTRLLVNFS 
KVGSVTISLG*EVGLEHWP/NI/aRLDK 
TGPGEKQHFILHAPQSAPSVYHCLGNTQ 
ELPPLSTAMISQPKR 


6730 


20631 


A 


6784 


1 


192 


FFFFG* ILPLSPRAGVQWHDLGSLQPPH 
PGFKQFSCLSLP\RG*nYEACATRDGLI 
FCNFRLGD 


6731 


20632 


A 


6785 


3 


246 


SLCVCQYYCRCACV*VPKQVSG*VTMGI 
/CATGFVLIYLCG*QCLCG/C/C*EDPF 
yCEQLCVHVCSCVLGRGWSVCVCISRLL 


6732 


20633 


A 


6786 


408 


128 


GPVPYKARG* SPFWPPGEKP SFPKNPKT 
PPGGGQAVLFPPPWRGRPENSPPPGGKR 
C/QPN* rFFLPPHPGHKTQGPPPKKKKK 
KISAEFLSW 


6733 


20634 


A 


6787 


I 


418 


LTILGVMKNIODSWKEVKISAFTGAWKK 
LIPTLTD/Ey*GFK2VSVEEGTAHVLETA 
RELEVEAEAVTEL/LQSHDQT 


6734 


20635 


A 


6788 


3 


237 


FFFLT*VLTIV/ PG*SABVQS *LTAaST 
S* \VK*SSHLSLPSSWDTORMPPHPflNF 
FFFFGKKSLIIjHPGRGGPSLPTP 


6735 


20636 


A 


6789 


282 


3 


PHPD]SnF\RFLV*TGCLHVGQNGLAIiLSP 
GYPPTLASPSAGITGVSHCTRPKRYF*!!! 
VLSNKSLNRFGLCTVLPTCLQAPRRTGP 
PPATVLGRGT 


6736 


20637 


A 


6790 


62 


328 


FFRDRILFCHPRWSAVrQ\SWLTEASKS 
W\IK*SSRLGLPKCWDCEHEPLCLDNTT 
Y*CRQNTLPITLIGCFRVSLPYAKyTGK 
FWGAEG 


6737 


2063S 


A 


6791 


67 


353 


TNiCHTYFLEUSIKREKPQLTVFFFF*REL 
EGLIIiftI!rGWFRLRG*SNSPA*PSKVAGI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ]VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TGPPHQAGLILVFLEKTGFNYVG/RRGP 
PPPPPKELALQA 


6738 


20639 


A 


6792 


744 


382 


MTFKKTAHIYFYLYLFIYLRQSL/DSVA 
QAGVQWHNLSSLQAPPPGFMPFSCLSLL 

sswdyrrpqprpa/nffyf* *RRGFTVI, 
ARMVSISRPCDPPASASQSAGITGMSPR 
AWPKLHIFKL 


6739 


20640 


A 


6793 


372 


129 


RVLPCHSGWTAWQS*LIAVSTSQA\KR 
S PHLS IRHVPPHPA/NFKQFFCRDGL IM 
LPRSVSNFWAQAILPPQPPKVLGLQA 


6740 


20641 


A 


6794 


395 


25 


QPGLFCFTPRKKPDSFSKVFAVLSF/FP 
PSL*FS/PLSL*VQEMVSICCPGWSAV\ 
TPGLK*SSCLSLPKGRGYRCAPPHSML 
KFFL* R* VLFMLPRLGLH\SWVQWLCQ 
WPPKGPSLRACTSKI 


6741 


20642 


A 


6795 


412 


176 


KKS * / CIiATPRSENWVL*NGPPLFFFFL 
*RRDLAMLPRLVSNSWAQVMLPPWPPK\ 
VLGLQAGATMDHSCVQYSHSFAPE 


6742 


20643 


A 


6796 


444 


95 


NFVRNCQSFLQSGCTFLHS YQQFLL/ PC 
PCQH* IMSVFQILTTVIGVYMHLV*ICS 
ALMTFDLFMCHVLlCHliHIFFGEVSVQH 
FSLLLFLFESGSIYWQBGLHLLFSSNP 
PASAS 


6743 


20644 


A 


6797 


378 


732 


FPFWLNLGSSQPPP\PAFTR\PSHLGLL 
NSRD YRHLPPRLANF\ * FLVETGFRHVG 
QAGLKLLTSGDLPALASQTTGI TGVTAS 
PSLDEGVFKALPPPVGTTLCLPPWTTGG 
PFLGPAL 


6744 


20645 


A 


6798 


^44 


1 


PLCFPPGGGGFLSSHQGFPPPPPKRKGS 
VGAKKKN I P PGPF P FFFFKKNFFF * GP Y 
PF/SEPKKARWGPPPPGPGFSPPRSLVS 
PLGEKKKKKKIPPPETQPPSFGA*WGKK 
RAPPPGPRAGGRNSFLEGGRGQGRTGGS 
VREFRAGR 


6745 


20646 


A 


6799 


462 


83 


IEPLRPALCLS*SNPflSI*KPHFLLFF/ 
CLPFLSFFFS/SFPFSFPFLFPFPFPSF 
FLIPFLS YSLCCSGWSTVA* * GLTIAS \ 
TYGLKQSSHL/SLPASWFPCLTN*KKIF 
FRDGGLTMIiIRLAYLFLNF 


6746 


20647 


A 


6800 


433 


185 


PPPGGRLLVEKTRGEGQLKPGQPGRQAG 
LGMRGNPGBCAQPEPASGAPEMQLPAHPP 

SGGHLYGCGAKNNEGGANPLLCGGTGSG 
SVAAAGEVSKSAPDSGLMGNSMLVKKE/ 

AaGaGEPQAN0ETEN*EG*PM3P3SGAL 
LLTSPAAATLPLPVPPQRRGFAPPSLFL 
APQP 


6747 


20648 


A 


6801 


395 


60 


lEFLLFLPLSVDGRLWSHQLLPVMKEAT 
/MS IP*HAFAWTC/SSFLRGGCLGA*ST 
LLYGGYMFSYLRNKQTIFQSSCCIVNGE 
QPRMENELFS*FIQHLVFSVFLIRAILE 

GI 


6748 


20649 


A 


6802 


423 




RGVPY*P(3GLQPPPV/REFFFFFIGRVS 
VLQAGAAAPTCRAQS ILP3SWDYRHLHA 
NMSS*FLYFV*RQVFSMLPRLVSNSWDQ 
AIHPSQPHPALFFTCWSVKPLATL 


6749 


20650 


A 


6803 


396 


194 


SLMKLftHV*VLHTDGGFMHPDSAISCHD 
IFDSLHLTGEGSAKICRALYELIMQLME 
EHP\GEKQTTIA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6750 


20651 


A 


6804 


373 


192 


PALNPWAQAHLPHQTPALSSNFFYF/CV 
EMGLTVLPRLV*NSWPEAILILLPWHPK 

VLGLQA 


6751 


20652 


A 


6805 


370 


242 


KKGTSKFLFFPSYSFFSKFWYQKKVKIA 
FFFF* DRVSLCHPEWSAVW* S * LSAALT 
/CPGLQGSWGYRHVLPCLANFS \FFC * D 
GVSMLSRLENWPFCLF*PVFQPGQHGNS 
ISTKK 


6752 


20653 


A 


6S06 


464 


182 


DGPHRFEPPISS*FLPPWPPQKLEIKG* 
APIiAKQIFFFFFFL*RDlVLLCC*GWGT 
VYS VYSWVS SRLTAAS \TAVLKQS SHLG 
LLSSWDYBCHHH 


6753 


20654 


A 




3 


228 


YTCAGFFRRQSLCHLGWGAVARSQLTVA 
LTL/VPPGLK*SSNLGLPKCWDYRCEPP 
HLACVMFNTVKDVCFSCILS 


6754 


20655 


A 


6808 


506 


173 


NLHLPGLKQFSCLSPPPRSRDYRRAPPC 
TANF\ * FLVEMGSRHVAQAGLELLNSGD 
SPASASQSAGTTGVSHHGQISLFKQILT 
STRISEVIFLTLLLRILHDLPCLKLFSL 


6755 


20656 


A 


6809 


676 


397 


VSSCPRDQPGHHGETLSVLKIQKLARHG 
GGCL*SQLLRSLR*QTRLKSGVGDHSEP 
RLSHCAP /AWATEQVSVSRKCKKRQLHQ 
KERIPEWVRS 


6756 


20657 


A 


6810 


367 


31 


QVFQELYSQDLFIYLETLSLCHPG*SAV 
ARS*WTAASDSW\VKQSSRLSLPSSWD 
YR* VPPYSAHFDVNSS FLTYCTS PGLTQ 
KIEDSWLGTVAHACNLSTLGGHSGQIT 


6757 


20658 


A 


6811 


469 


251 


LLSSWK/YQVPPPHLVFIKIL/CRIGRE 
TASCCr*PRLWNSWPQVILRPQPPKML 
GLLSATMPGPISVFCFCFCF 


6758 


20659 




6812 


16 


462 


EIISYCGFKLHFSNDYBaCKEEKRERERK 
KRKKENIGE/HI*DIGVGSDILDLIPKa 
WTTKLiNIDKWYHIKL/QKLLSS*RVAKT 
NKLKS PCTAEEI ITRVKRQPMEWEKTFA 
SQTSDKGLISNTYKELKQLNSTETNKQK 
NNFDLLSADKG 


6759 


20660 


A 


6813 


492 


243 


S SWDTQCTPSRPD \ I FVFLVETGFPMLA 
RMVLIS * PRDPPASASQSTLVDTSRYKV 
YQNIHQQV]!IMF*NYGWCFFDIFTASVGD 


6760 


20661 


A 


6814 


32 


308 


EYTLPHPYLITGPPPVPKPETYSHS *YF 
LSPNTQCIQ\PAPRTIiNTASSTCLSRRV 
TARPVPIYQDPSLEPLQLPPNSLTLPTP 
LRHSPCWV 


6761 


20662 


A 


6815 


97 


274 


GRFLELKVKRGSLGLGMWHAVlILSTLe 
G*GGRIP\QGFKASLGNIAKSCLYKNKK 
6AA 


6762 


20663 


A 


6816 


74 


376 


DFFIVYSVFTHLVNIKDVKDVLYKSLKY 
FFAIYRMGENIYKLYILGGINIQNISME 
YYTA/ IKKD* ILLFAATWM/RLEDI ILS 
EISHEQKS\KRCMFSLICGS 


6763 


20664 


A 


6817 


1 


1127 


mrvtaprtvllllsgalaltetwagshs 
mryfytamsrpgrgeprfiavgyvddtq 
fvrfdsdaasprimaprapwieqegpeyw 
dretrnmkXasaqtyrenlrialryynq 
seagshtlqrmygcdvgpdgrllrgyhq 

DAYDGKDYIALNEDLSSWTAADTAAQIT 
QRKWEAAREAEQLRAYLEGLC'VEWLRRY 
LENGKETLQHA\DPPKTHVTHHPISDHE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
'26 


Predicted 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 

location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, I^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ATLRCWALGFYPAEITLT\WQRDGEDQT 
QDTELVETRPAGDRTFQKWAAWVPSGE 
EQRYTCHVQHEGLPKPLTPERWEAILPS 
PTSP IVGHCAWAGAVLASWWSSGAWAT 
VDV*EGRSSGGKGG\SYSQGWRPSEPVP 
QGFWMCLSTALKKP 


6764 


20665 


A 


6818 


34 


382 


STAGAIHVCMCMCVHVSTCVCAKMCVRV 
KF*IWDIETGAaQLSCCVI.VAQTEIIYL 
QEKNSSLSERECVCAK\CVCMCMCVHVS 
TCrVCVCVCV*V*KRHNWPGMVaHVCNSS 

TLGG 


6765 


20666 


A 


6819 


1061 


305 


FFLRWSL/DSVAQAGVQWRDLGSLQAPP 
RGFTPFSCLSLPSSWDYRRPLPRPANFF 
YF * *RRGFTMLAEMVS IS * PRDLPASAS 
QSAG I TGVS HRARRRNT I L Y * HLNFVI K 
SHKPPD I VFRDL * GPRLH1.ENPVL *GAK 
SHSSCPWLNLSPPPPHAGSHCLSSSSPR 
MGARQDLLFNASSHTGLFPFLQLNOTAS 
RPTSQVPKSLCFPLPWVQPFALPWVRQD 
EGRAS WS PQD P PRVP SCAGFTHS CNLS 


6766 


20667 


A 


6820 


610 


248 


ERRSHPVA*AWKWCHLGSLQPLPP\GS 
SDPP\T*AS*VAGTTGVRHHARLIFVfF 
lETVSHHVAQPTGF * TPGAQAI RT.PQPP 
ENARITDVKPLCLAHFCYSKHFIMVPAP 
CISHQKVFH 


6767 


20668 


A 


6821 


3636 


3332 


TCLGSHATPHCGHS FSSCKILLF* FFVF 
EM*SHPVAQAGVQCRDLRSLPPPPL*PK 
RFSCLSLPSSWDSRRPPPRLANF\CIFS 
RDGILSCWPGWSRTPDT.R 


6768 


20669 


A 


6822 


563 


146 


APGLSFNCYIPPAPAFMPGELTLFEVD/ 

NRS ifptey/tihili SSED ILHS *TIP 
LLGLKTDAIPRCUIQTTLTTGRPGLYYG 
QCSEICES/N/HSFIPIVLELILLKYPK 
T*STSTLKYHCKAIQHLPFKLKIDGVYT- 

SLQ 


6769 


20670 


A 


6823 


399 


201 


VLCiLEPDDIAVmFMKERHPQSKS * TIT,Q 
KWIQPFSFIVQK 


6770 


20671 


A 


6824 


323 


323 


LPTVIVI ILSFYtlFFFFSMRLIRTLSFI 
CMYVCICIYVCIYVCIYVCVYI lYVCIY 
/IYINISF*NDRISFFFMLE*YVCVYIH 
EWIKKI 


6771 


20672 


A 


6825 


396 


122 


LPPGGHRPAVPGGREMGPSHGNPGNPGR 
F*KKGPPSPGGGPAPGGPPSPEG*ARKM 
G*PRGPSLP*TQIPPPGSPP/W6PPGVT 
PETLVGFKKRGHPPRGGGPPPGVPLPRR 
AKPEKWVNPGGQAFHEPECSPPRGPPRGA 
KPETLSPiCKKKICV 


6772 


20673 


A 


6826 


414 


64 


RGVEGGAACIFKGLSVAGKPG/YLSiyMR 
SCPRRMLLFKS *G/QVIATKHMLPTVLR 
MVGDPVAHVRFNVAKSLQKIGPI LDNST 
WQSEVKPILEKLTQAQDVDVKCFAQEAL 
TVLSLA 


6773 


20674 


A 


6827 


2 


519 


KTKTCLVEEIS*ITGRFLERINKIDKPL 
VlirL*RKK\VQVMriENEKVDVTTDRT*K 
\ 1 1 KEHYGQLYMf KFN/ IDEMDKFLKRD 
KLRKLTQEEIENLNIPVSI*KFLI*NIP 
KMKTP/GPDGISREL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6774 


20675 


A 


6828 


413 


114 


GGKRGPGPIPHPPAHGPQ*GPGHPflVQV 
PEPP6LSFPHFRGGFPSSGGCQRGEGRK 
ERRPSWTGPSWMAQEGRTPAARLEGAVL 
PG/APGPEAPGTGTTMSAPTWPEPP/G/ 
GRQ VQDVPAF PEGSAHQAR«HEE*LHWK 
KILL 


6775 


20676 


— 


6829 


479 


261 


PCFSFLSNEDCRHAJ=P/RPV*F/LVERG 
FRHVARAGLEFLT * SGLPAS AS QGAGIA 
GVSHCAWPLQVGIEASLQY 


6776 


20677 


A 


6830 


492 


313 


HP IGWPVNWGRASAVGEFRDWKLFYPEC 
EIRTMGGREHGQSPGAQRTFFQLLLSFX 
VESK 


em 


20678 


A 


6831 


378 


1 


SNG I CLQASSAS PNKIVPSTSLKVCSN \ 
VRVDGSRNPS VAFP PLTPNLFAVAPS I S 
SGMGIETIPIQGYRVDEKTKKCyiPF\V 
*flMRHSPSGVYNINVHflLVSGPPLSDDS 
GVNKPQMKHQHCSA 


6778 


20679 


A 


6832 


374 


76 


QLVEKWTGNVKSMTDRVLTMRCELRAHL 
*ALRTPGTWNLINDQIGMF\KFSGLSPK 
QVECLVNEKHIYLMPSGRINVSGLTAKT 
LGYVAASIQEAVTI IQ 


6779 


20680 


A 


6833 


369 


56 


LKYLGTSVENRYADQIUWLSPVISMiW* 
FE/CGWIARSEDRDHPG*QSEIPS/PflN 
FLFLVEMGFYHVDQflGLDLMTSGNPPAC 
NTGMSHCTRPGRTDVLAI LCSDP 


6780 


20681 


A 


6834 


457 


142 


DCIALSSRIiDLQWCDLGSLQAPPPGFKQ 
FLCLSLPSSWDYRCLPPRPANF\*FLVE 
TGFHHVGQAGLKLLTSSDLPASASQSAG 
ITGLSHRAWPDSSFEBSSSS 


6781 


20682 


A 


6835 


407 


35 


TAPAIRGVHLRGRGWTVyG&V*AVPPAS 
IPAYPGVVYQYGFYGflDI.YG\DMQHSDM 
RGLMVSQHSPMPHT\LQQS\SVTVMAGC 
TQPSHTMPLPI.PI.VY\ErALWRVYAEVA 
TSYLPPAEVMLAPCK 


6782 


20683 


A 


6836 




428 


GKTLNPPSLGKSFGLIGLPPENDWPPDV 
SVPR2\AFPLRGPRSVQSVVIIEMEQSVAQ 
LLL*MLTFTPHKRI SAFEAL\QHSYLHK 
DEGDPE 


6783 


20684 


^ 


6837 


3 


612 


FKGSGVFNRKKKIiEEHPLWQDVALGTAL 
CPQLPSLPGWGVIEWPREAACLPCMPQL 
CPHCCGPQEaQYPL*AGGQ*GQHQEKPG 
A\GPHPPSGPQVPSAPGEAMTAMEGRKL 
SPAVAQNPPQRKNTLPRGPLPRGAPTTT 
P*SLRPR*TPVGAGIAAPIRSHHPGCSG 
QEALALCPDRPHKDRGVRRGAGGPRGRR 
VQLGGIPP 


6784 


20685 


A 


6838 


3 


412 


yrwgftmltrlvlss*pr/ cdppasasq 
svgitgvnhrvrlfffffgtgphs\maq 
aegqghnlg*lqpqppgakgtlprtgnf 

KGGPHTPPNF/CIFFGKNGVSPYCPGWD 
FGAKAILPPGPPKELELEGS PPAPGKVF 


6785 


20686 


A 


6839 


419 


26 


EETSRLWS IGQCPLLIQTAQVLSFTSGH 
GRHREHRKPTDSPLPSACPAP/RLTPPP 
SMCPASHSLPSSQCI4PGPTLTPPTACAR 
PDTHSPPPSVCPAPLP/CRPVCPPPRSL 
PTPA* TKPRTQLPS SMYCGRS S 


6786 


206S7 


A 


6840 


365 


66 


LLERQIGRSVYGPS SKSS I FDVPSLPHR 
GIKLVFCLCHTSSWDTGVHYAPPA* SWH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last^amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TWHPRPRQVGQ/AGFELLTSCHPSPLT 
SQTAR lAGVSHRARPKFM 


6787 


20688 


A 


6841 


445 




GFQ*FSCFGLLGGWDTRHVPPRLADFC1 
F/M*RWGFAMSAELSSNS*SAPSASQSA 
GITGMSHCARPATLNSPICGTSLPLGSF 
FF 


6788 


20689 


A 


6842 


3 


412 


PQQSPQLTTVTAPRPRAUjMGTTMMLQ 
TRVSTSHLQGPRLTSALTYQMAQVAVAT 
PPWSVLAVWLSARVTVLP/LN/VNGI 
SWIRQLQKAAGRQPPSPHQTRNNSLE* 
VASNLSKFLILEVAELFCHVLT FFSYQ 


6789 


20690 


A 


6843 


113 


410 


NHKCNVNFYNGPRSLEKRLITLPILHYF 
LNFILCKSSYLTLLNLKLQTNLFCK/YE 
M/ESHFVTQAGVQWHDLGSLHPKQSSYL 
SLLSS *DYRGMPPHLAN 


6790 


20691 


A 


6844 


2 


357 


FCPLARILQSGRFAVLPWNSGLVRGRRQ 
EGPHGAFTREHT/SLRAPQPRSENERR/ 
RIACALRJ^QASGWRLTPRAPRSPSGC 
HCPPAEPTSR*EAWGARRKGHGCCPAPH 
QPAPQDIS 


6791 


20692 


A 


6843 


392 


279 


HFFFPCRD/KSIiftMLTRLIIJSISWA*VIL 
LPQPSKVLGLQV 


6792 


20693 


A 


6846 




275 


VNLVDKAAAGFERMDSNF\ETSSTVGKM 
LSSSIARYREIFYERKSPSMQQTLFLSH 
FRKLP*PF*PLATTTLIY*QPSTSRQDP 
LPAKRAR 


6793 


20694 


A 


6847 


1 


434 


FFFFETQEHSLPRLECSGAISAHCNFCL 
LGSSSSPASASQWGITGMHHHAWI IF/ 
VLFLVETGFHHVGHAGFELLTTCGPPTL 
ASIMCILVYLLVFQHLVSLKMLKGQNAS 
FYFSPFGQHI.SGFVKCS IWCCGSLSGLE 
GHFLPPHSPPLH 


6794 


20695 


A 


6848 


13 


148 


GSAWRHTPAG\ELLKKLRHRYGIiNPEGG 
GCTEP IWCYRTPPWTTD 


6795 


20696 


A 


6850 


204 


30 


TIFSRQVIfilQNVLSEKLRVSTVYANNG 
S\VLQGTLWASVraGKILIGTVFHKTLY 

CEL 


6796 


20697 


A 


6851 


2 


429 


EVWASGITGKAIMaiMPIVIQPKNPSSYP 
CRRQFPLQLEAKEGFQIiLTEKFQKHGLL 
I PYN/TPILPVKKSNEKYRLVQDLRI IN 
EAWPIHPMIPNSYVILPQPPDAQWFPV 
LDLKDSFFCISP/VDPSSQFLFAFE 


6797 


20698 


A 


6852 


3 


160 


SGWDHRRHTQLIFMYSFLFFCRDR/ SLP 
MLPKLVLNSWAQVILLLQPPKVLGL 


6798 


20699 


A 


6853 


2 


243 


AdMILAaATllLRW^PUlYIILIWGIN 
KPTKKLEirPYSIDHKELLDFLSRVPSDV 

QKVQYAELKLCSSHSPU^KKRSAL 


6799 


20700 




6S54 


24 


458 


SHAAKRAAGTTRRSSHTGCRPRPADPGA 
RCLPRGSPHPRMDPPESP/PSETSPGPP 
PMGPPPPSSKAPRSPPVGSGPASGVEPT 
SFPVESEAVMEDVLRPLEQALEDCRGHT 
RKQVCDDI SRRLALLQEQWAGGKLS IPC 
KEENGS 


6800 


20701 


A 


6855 


3 


183 


CCDPPVSASQRAGITSVTY/Q/WPSLGS 
LQPPSPEFKRFSCLSLLSSWDHRHPPLR 
PACKVFC 


6801 


20702 


A 


6856 


2 


242 


GKHPYAHKLA\AQREASTVSSVTQVEVD 
ENAYRCGSGMQMaKDSKSLKTHQTQPGI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I^Isoleueine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RRARDKALVLViWGKRLWCIEGSVC 


6802 


20703 


A 


6857 


54 


401 


EGERLQKFMHVLKTHTHTH/AHTHERAH 
THTHTRTHTPRSGMHRSNNRNLAKNLFK 
I FFFLSSSPPSLFFLI IFPFLSLSLSSL 
SLS LYKHTPLC ILSLRTEHLHRMGALLG 
GHILE 


6803 


20704 


A 


6858 


1374 


579 


CENPSRSTRPGQGPLLLEHQLPFH/RPA 
PSQSSPPEQPQSMEMRSVLRKRGSPRKA 
RRARLNPLVLLLDAALTGELEWQQAVK 
EMNDP SQPNEEGI TALHNAI CGANYS IV 
DFLITAGANVNSPDSHGWTPLHCAASCN 
DTVI CMALVQHGAAI FATTLS DGATAFE 
KCDP YREGYAD CAT YLADVE QSMGLMN S 
GAVYALWDYSAEFGDELSFREGESVTVL 
RRDGPEETDWWWAALHGQEGYVPRNYFG 
LFPRVKPQRSKV 


6804 


20705 


A 


6S59 


36 


269 


I ALGSMDVLT I LMFNEHSKNE I KRTI PL 
KIATR/ 1 KYLGINLT\RGKKDLDTENYT 
TLIEEIEDDTNKWKDIPCSWTGV 


6805 


20706 


A 


6860 


47 


888 ■ 


TLRARALQAfiPRTGSSCXAATWTS/SG^- 
SQHSLRALSWRRLYLSRAKLKASSRTSA 
LLSGFAMVAMVEVQLESDHEYPPGLLVA 
FSACTTVLVAVHLFALMVSTCLLPHIEA 
VSNIHNLNSVHQSPHQRLHRYATELAWGF 
STALGTFLFLAEWLVGWVKFVPIGAPL 
DTPTPMVPTSRVPGTLAPVATSLSPASN 
LPR S SAS AA us QAK PACPPHQACGGGGA 
HGPGWQAMtAS TA I MVPVGLVFVAFALH 
FYRSLVAHKTDRYKQELEELNRLQGELQ 
AV 


6806 


20707 


A 


6861 


3 


391 


NKISFFCRDWGLPMLPRLVLNSWAQVIL 
/LISTCQPPKVLGLQA 


6807 


20708 


A 


6862 


322 


175 


GCRCVPPPHASFKIFGRV/RGLDILPKL 
VSNSWPQAI LPPWPIiESLVLQA 


6808 


20709 


A 


6863 


700 


294 


yVLRQGLTLLPRIi/ECSSAI lAHCSLKL 
LGLRNPPTSASRVAGTIGACHHTRVIFI 
VFWIESCLrAQAGPKIiliGSSDPPALAS 
QSAGIAGISHHAWPPYYFYWVLSSTIPS 
VLSLHSHLILATILEGGDVTSQTDK 


6809 


20710 


A 


6864 


371 


77 


RSMFflM\NL\m)TgDSDDYHLLKVLEEG 

iqtlmgrlevgsrhtgqilkqtyskfdt 
nslnhvaij:.knygllycfrkdmdkvetf 
lrmvqcrsvkgscgf 


6810 


20711 


A 


6865 


273 


34 


DYLPTYLKLFRIiLFF/CIil lETESHYVA 
QAILEIiLSSSNSPTSASRSAGITGISHH 
TQPNFSNHTUliKPYIiRIiSYALLFA 


6811 


20712 


A 


6866 


1 


190 


VGLE/PPALKHSSCLSLPKCEDYRHEPP 
HIALFCFSF 


6812 


20713 




6867 


506 


293 


ekpsnghkkpytwisiailfi laktwkq 
s\prcpsagewinc/wytqtmeyysmlk 
hnelsncekt 


6813 


20714 


A 


6868 


2 


116 


LFPFCRDR/SLTMLPRLISNSRAQGILL 
PWPPKLLGLQT 


6814 


20715 


A 




10 


141 


GSAWIiFFIFCR/DRGLALLPQLVSNPWL 
QAILIiLWPPRVIiQLQA 


6815 


20716 


A 


6870 


2 


603 


APTFINFPAKGKPKRGDTYHLQVRGFSA 
EQIAEWIADRTDVHIRVIRPPNYAGPLM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

location 
correspon 

last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGLLLAVIGGLVYLRRSNMEFLFNKTGW 
AFAALCF\VLAMTSGQMWDHIRG\PPYG 
PKDPHKGHVNYIHGSSQAQFVAETHIVL 
LFNGGVTLGMVLL CEAATS DMD I GKRK r 
MCVAGI GL.WLFFS MMLS I FRS KYHGYP 
YSFLMS 


6816 


20717 


A 


6871 


375 


-j 


GFPGFPVPKFGIPAPVAPQNPFPPPSF/ 
PQGPPPSPGGPNPNGKNPPFPKTPFFLK 
NSPQKKKIFLKKFKRGPP\GPNQKPPFK 
DFKKKRGLKPPKKGGPLKKKKKKKKRSN 
YSLKKIYSPFGFHFF 


6817 


20718 


A 


6872 


255 


23 


GRVDPTSTSQNAGITGVSHR/GVQWRDL 
GSL/QPLSPGFKRFSCLSLSSSRDYRLV 
/P FC I FSRDR ILPFWPGWSQTPHLK 


6818 


20719 


A 


6873 




157 


YLLFTSYTHAHTHTHTHTHTQFLLLWTS 
GLS \ S CCYI FSLSGPNTKCNM I F 


6S19 


20720 


A 


6874 


351 


132 


SPRRCSAYGVA/ lYLFLYFLINLLSLYS 
IFYGFAPNSFLCEVQEPSLG\SGWEPLS 
GNHFSS IKMKIAHREVNM 


6820 


20721 


A 


6875 


99 


337 


AG S F FQKKGKKKI S R VNRQP IQLEKIFM 
NCaSDKGLVPRIYKELESAR\KYQSIPS 


6821 


20722 


A 


6876 


69 


306 


YVSHHLSFFF/CFFGGGEKALFFCPRTG 
IKWPQFGLLEPSPSGLKQFFGLNPPETL 
EYRVFFPPPGKFFFFFFFFFFFFF 


6822 


20723 


A 


6877 


335 


169 


WHDLGLLQPPPPG\SSDSCASASQVAGr 
TGMHHSFSNFIYLFCYFEKVFLTFTLD 


6823 


20724 


A 


6878 




237 


ELGYKVLPHPPYSPDLSPTDYHFLJCHLN 
NFL\ FHNQQCaENAFQVFIESQSTDLYA 
TGIKrKLISHWQKCVDSNCSYUSr 


6824 


20725 


A 


6879 


2 


199 


RGRVGRQFQKEIEELKKKLEEGEE/VAP 
SSSSSSSDPLISEPDISGSEEDDDEEGE 
VGEDGEKKKKK 


6825 


20726 


A 


6880 


-I 


206 


FPFFETBSCyVAQAGLELLGLGDPPASA 
SQWGTTGaEHHAQLI F/VFVFLVEMGF 
HHVSQDGFDLLTS 


6826 


20727 


A 


6881 


1 


123 


WYIHTMEYYSALKRRKFLSFATTWMNLE 
\EIMliSKISRRRG 


6827 


20728 


A 


6882 


15 


105 


SQLL/WROIQKNK1.NRGGGGCSFPLWCH 


6828 


20729 


A 


6883 


223 




AASTFLFPNLKNSLRG\SmTFSSVTHV 
RKTALTWLNSQDIQFFFSRRSLTLVARL 
ECWRSWPAASRRLGCL 


6829 


2073O 


A 


6884 


193 


3 


qgeOdflppclpnffnfNj'fcrdtvsml 
prlisnswpqavilpephkvlelqaeat 

tpshvfif 


6830 


20731 


A 


6885 


2 


355 


RLLTSSDLPASASQSAGVTGMSHRARPR 
NF/CILPNLSPCNTHSPLSLPQPLAPTI 
LLSLWICL/SLDIS/WDYRHMPPRLRKV 
FFFGDGV3 Q\ YRPGWSAWP FAAS TS GV 
EAILPPQPPK 


6831 


20732 


A 


6886 


1 


151 


HSVAQAGVQWWDHGSLRPGTPGLKQ/ FL 
PPLKQSSCL3LLSSWDHRCATLY 


6832 


20733 


A 


6887 


314 


203 


FLFFCGDKVLLCP \RLVSNSWTQVILQP 
WPPKVLGLQE 


6833 


20734 


A 


6888 


334 


86 


QNRS TFKHGM/MGWAWWLMFVI PATWEA 
EVAAKIAPLHYSLGDRARLSKGGMKCTS 
I FCSKSAKDCQTLP VKKKIS LP TLP S I 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methlonine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop cO(Jon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6834 


20735 


A 


6889 


345 


152 


QHARIMSVFIVETGFHHIGQSGLELMT/ 
S/GNMHASASQSARITGVSPHARPSLPY 
YKCSKKQLCNE 


6835 


20736 


A 


6890 


2 


169 


ARECSGAITAHCSLDLMGSSDTFTSASQ 
VAGTVGVTIMPRYI FK\FFVEMESHCIA 


6836 


20737 


A 


6891 


361 


184 


NQSSCFSLLSSWEHRLMPPHLANF/SFF 
DRDE / SLTVLSRLVSNSWTQAILP PRI F 
FENS 


6837 


20738 


A 


6892 


3 


330 


HE SAAI TGVSHCTQTFFFF/ CETGPRFV 
PQVKGQGHDLSSGEFGPPRLRNWPGLTL 
QNAGNTGTLPPHRTNFGDFRRGSFSPCG 
PGGFGTRDLGGFPHLTGKRDTGLDPP 


6838 


20739 


A 


6893 


2 


348 


ARAFFFWNFFCK/NRGLSMLPRLALNSG 
AQGILPPWPPKKLGLRALPTWPKTGSSG 
LEGHSSLYPHHSLI FPHHCSKFCFIKQW 
RLFI LRI TENQGLTPLNAAKPAEPVMNE 

ARTQA 


6839 


20740 


A 


6894 


340 


240 


CRD / RS LRMLLTLVLNSWIQAI LLSWPP 
KVLGLQA 


6840 


20741 


A 


6895 


155 


1 


HLCFWflGNPVAHSCHPGTl.GGRGGRlTR 
GQEFETSLLGPGTQDVLAKCSRA 


6841 


20742 


A 


6896 


454 


294 


PSSWDYRHA/ PPMHNWPDFFCRDE/ SL 
PMLPRLVWNFWAQLILLPWTPKMiQLQA 


6842 


20743 


A 


6897 


49 


333 


I YI CLS FYQS 1 1 YLYDKQHDVLETVSHS 
VSLDLPGLRFSHLSLWDHRLILPWLASS 
MSJFFX C31NRGLLVLPRLVSNSWVQAILP 
PWLPIVLGLPA 


6843 


20744 


A 


6898 


434 


324 


FFFFCRD/RSFAVLPRLVLHSWTQAIRP 
PWPLKVLGLQA 


6844 


20745 


A 


6899 


127 


365 


LPERRRYVSISGYRYHLPQSLLFFYAQH 
ATF^JNELH\LGVIAHACNPSTLGGRDGR 
MALGQEFEIGLDSIVRLHLCNKIK 


6845 


20746 


A 


6900 


208 


1 


FFFFFWILVETRFHHVA/RAGLELLSAG 
MPPTSPSQSARITGIWFFVWFSSNLSQF 
CYAHSFFTCWYSEA 


6846 


20747 


A 


6901 


1 


162 


GTSGTRLFTIGKRWQQPKCPPMAELINK 
MW\YTMEHYSVLKRKEIFCTNLTQLL 


6847 


20748 


A 


6902 


120 


313 


DKVL/WRLRQENPLNPGGGGCSEPRSHH 
CTPAWATRAKLYLKKKKNFKGTLSNWYS 
FRRVANTNLQMQSGSNYMQI 


6848 


20749 


A 


6903 


340 


234 


IFFCRER\SILPRLVSTTWPQAILLPRP 

PKVLGLQV 


6849 


20750 


A 


6904 


1 


327 


GTSGTSRILPWGPGCIRTPELKQSAHLS 
LPKCWDCWWYRHEPPVYSFANLI IKLFF 
ELHLSVILWFLLLANPQTQTESKTFSKT 
/PRLVLNSWPQAVLPWSLKALASQA 


6850 


20751 


A 


6905 


532 


359 


RDG/DLWWPRMVSMPGFKQSSHLGHPK 
CWDYRCEPLCQDFSYSIITWFPPSREL 
RR 


6851 


20752 




6906 


270 


3 


GGTTVYPRKTKT/WPGAVAHACNPNILG 
GRGGWITRSGLPVSTKNTKISKAWCCAA 
WLATPEAEQENRLKPGDGDRDHHGQHG 
ETPSLSC 


6852 


20753 


A 


6907 


1 


312 


GTSWKKMLPTLMDDFRG\KTLVEELIPD 
VVEIAREVELE\MEHEDVTELLQPHDKT 
WTDEELLLTDEQRWFLEVGSVPVEDAV 
NIVEMKTKDLEYYISLLIKY 


6853 


20754 


A 


6908 


331 


61 


FRHVGQAG/LQKLLTSSDLPAPASQSAG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQIU 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQJD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possi{}le 
nucleotide Insertion 














I TGMSHHIRPKLILRKKSMRAKLDSRQT 
RLQPPWEQGLLEEQHEVYNVYMaHVSK 

SSSRKLS 


6854 


20755 


A 


6909 


419 


199 


SAVQDGV\SWCDLGSL\QPPLSRLKQFP 
CLSFPSTWDYKCAPPHPATFCIFIRDGV 
SPCWSDWSQTPGLK 


6855 


20756 


A 




341 


69 


EAEVGGLLKARSLRPAWAIQQDP I S ILK 
KSFTLGGQGGWI TRS GDRDLRASAS QS A 
GTTGVSHRAQ\LI FAFLVERRFHHAGQG 
GLDLLTL 


6856 


20757 


A 


6911 


174 


356 


KTYLMPLRSSLRPGMVAHACNPSTLGGQ 
GGWITRSG/ESRPSPVLUNRLTRCQANT 
VKPCLY 


6857 


20758 


A 


6912 


391 


3 


SPPPPPFPPSFPPFPPQNFFFPPGASSF 
FGPPPP/YFPPPPKKSPPPNPPPPFFFP 
PPLGKQFFSPPPPFFFPPPGFPFPPPPF 
FFFFFFFFFFFFFFFFFFFFCDKKYFIC 
LMICMLFFCKYILNFCRYS 


6858 


20759 


A 


6913 


43 


331 


I RDT YS FSLSAPALAPMLGTKGVLLAVA 
SLGSRGASLCVFVCVCLCVRIH\IGVQA 
SGCVCVCAC/CVCVCWVCVCVCPHWPA 
SLLTKDLQCLPFTV 


6859 


20760 


A 


6914 


250 


11 


TPNLWGLFLVFCFEM/EVH YIAQAGLE / 
PPGLKPSSSLSLLSSWDYEHVMPHAAQI 
CISVFSLAQKGVPPSNWILITHYSIKS 


6860 


20761 


A 


6915 


332 


53 


SQMALHEGFLLLFLKEKEUK3KFSHPPP 
SAPNSSMSKWFYFSETESHSVAQAGVQ 
WCNLSSLQPPPPGFKQFSRLSLTSSWDY 
TYLPPHLANFLFLVEMGFIJK'GQAGLKL 
PTSGDLPALGSQSVGITGVSHHTR\PRF 
LFKIGPLFTiFSKSMSODEARI 


6861 


20762 


A 


6916 


2 


373 


WHTPWLATHEAEAGGSLEPRRAKLQLA 
MIVA/NCTPAWVT 


6862 


20763 


A 


6917 


] 


270 


GTRQSPRLSIiSSSCDHRRVSPHPADF/ 
SFPYFYFFCR/DRGFTMLPRQVLIWAQV 
MLLSQPPKLLGLQWGVSHQARPPHCLG 
YSWVYNKP 


6863 


20764 


A 


6918 


2 


367 


RCAPHCPDNSS FFK/ RVKTGLT I FPRQT 
SNSWPQVILLPWPP/KVLGLQA 


6864 


20765 


A 


6919 


3 


354 


HEVNIVETTAKDLEYSINLVDKTGFEKI 
DSSFKRRSTVSKMPSDSITCYREIFCER 
KSQSMLQTSLLSYFKKLPQPPQPSA/TT 
TVISQQPSTLRQGPPPAKRLRCAKGLND 
FQHFWL 


6865 


20766 


A 


6920 


366 


41 


STSGEDAVHIVEMTTKGLEP/YTAVPGF 
HPSSERRSWGNMLSKSVTGYRGIFGER 
KSQC/RQTSLLPYFW/KPQPPQPSATIT 
LI\SQQPSTSRRDPLPPKRLQLTESSND 
H 


6866 


20767 


A 


6921 


3 


368 


QLLMGLHLEPATLDLLQLKT\TIQGGSH 
SHHI TAI P \ PS \ FTRENTLMF IHLS PI L 
LLSLNPDIITGFFLLKKKNKPKNHKWRG 
PSYKHTPEGPLLLLWAQKTLPPKGGLY 
KIAPLRLFGEDS 


6867 


20768 


A 


6922 


3 


91 


FSVLPRLV\QNSWPQVICLPWHPKVLGL 
QV 


6868 


20769 


A 


6923 


334 


91 


ERCGKCflHISEIFLSSLDYIYIFVFY\F 
FFLFFFFFFFFFFFFFFFFFFFFFFFFS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO; in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

correspond 
ing to first 

residue of 

sequence 


Predict- 

otide 

location 

correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QFKFNTKSKEKMFNKTMKKVQVTF 


6869 


20770 


A 


6924 


375 


37 


TKAGVQWHDHGSLQPQPPRLKPSSCFSL 
PSSWDYRHAPP/RP /ALFLFLFGLWFG 
LVFLVEIGSCYIAQAGIiKLLGSSDPPTL 
VSQSARIIGPTHHAWPTLTTSIQCTIRG 
PG 


6870 


20771 


A 


6925 


647 


1054 


CFWKAHIQAGETNNIQEGSERFHFCPAD 
EVYVHSGPAVEDQPQRRSFAL/SAQAGV 
QWCNLSLLQPLPPRFKQFSCL\SLPSSW 
DXHRHVPACLAKFC I FSRDEVCHVGQAR 
LELLTSGDPPASASQSAGITGMNHHV 


6871 


20772 




6926 


3 


204 


DAWETTGFGMIYDSLDYAKKHEPKHR\L 
AEHGLYEKKKTSRKQRKERKHRMKKVRG 
TAKANVGAGKKK 


6872 


20773 


A 


6927 


1 


112 


PTRPRTRGVASVLYFTTILILIPT\ISL 
lENKILKWA 


6873 


20774 


A 


6928 


1811 


478 


DRASSPRSERA\RRSSSRLRQRDPSLGR 
RRLRREIRPGLPESEPRPPRPPAALTAD 
QPPPRRLSESRGGGGMSE\AGE\ATTTT 
TTTLPQAPTEAAAAAPQDPAPKSPVGSQ 
APQAAAPAPAAHVAGNPGGDAAPAATGT 
AAAASLATAAGS \ EDAEKKVLATKVLGT 
VKWFNVRNGYGFINRNDTKEDVFVHQTA 
IKKNNPRKYLRSVGDGETVEFDWEGEK ■ 
GAEAANVTGPDGVPVEGSRYAADRRRYR 
RGYYGRRRGPPHNYAGEEEEEGSGSSEG 
FDPPATDRQFSGARNQLRRPQYRPQYRQ 
RRFPPYHVGQTFDRRSRVLPHPNRIQAG 
E IGEMKDGVPEGAQLQGPVHRNPTYRPR 
YRSRGPPRPRPAPAVGEAEDKENQQATS 
GPNQPSVRRGYRRPYNYRRRPRPPNAPS 
QDGKEAKAGEAPTENPAPPTQQSSAE 


6874 


20775 


A 


6929 


324 


203 


LIFVFLVETGFHHVGQDGLDLL/NLVIR 
PPRPPKVLKLENP 


6875 


20776 


A 


6930 




364 


HEASLTQTRTTMTHCSRTTGCSTASGRT 
WTRSRHSCAWCSAALMRAAVASRCPSS/ 
IPVTPPQCLSWP/WK7PLQCPPALS 


6876 


20777 


A 


6931 


2 


165 


GRVGFCSVAQAGLQWHDLSSLKPLPP\G 
SSDCPASASRIVGITSIiAFHFLKNVF 


6877 


20778 


A 


6932 




370 


TELRSYSPSNGAISSQCNLCFLGSGYS/ 
RSSTSQVAGITGA/R/HHTQLIFVFLVE 
TEFHHV\AKAGLELLTSGDPPASASQSV 
GIA6VSHCAWPHVYILKEIMLGNLVRCL 
MSICHHHTESICLFL 


6878 


20779 


A 


6933 




441 


RRANTPNNPEAPPN/QKKKKKKKKKKKG 
KNPPGGPPPFFKTPPPFKNGGGSPPGKK 
KGGEKKRGGKIFFPPPGGKKPPPQKKKG 
GGQGKKI FFFLFQPGGGOf FFKKKKGPP 
PFPPRKKKKIFPTGGGPKKKKKKKAPPK 
KPRGFYPP 


6879 


20780 


A 


6934 


518 


340 


PKPKNFPS/ PSPPKIPMJQSKVFSISCPP- 
GGFKNPPHKRKKYBFPPPEKLGPPKEFL 

KRPPL 


6880 


20781 


A 


6935 


396 


299 


WEVC/GVCVCVCVCVCVCIHSLCKSTL 
HRTGV 


6881 


20782 


A 


6936 


486 


■272 


PTRAPTRPAPPHLADFLrFCSDR/SLIM 
LPRLVLNSQPQMILPLWHPKWGLQA 


6882 


20783 


A 


6937 


1 


264 


YSNLRDRARSCLSLPSSWDYTCSWDYKC 
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SEQED 
NO: of 
nncleotide 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning ' 
nucleotide 
location 
correspond 
ing to first 

residue of 

s'eq'uence 


Predict- 

otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienj laianine, 
G=Giycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arglnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \-possible 
nucleotide insertion 














ASLRLANF/SNFCRERGLAMLLRLVSNS 
WAQVILLPQPSKVLRLCLAVSVFFGEKQ 

GQCS 


68S3 


20784 




6938 




407 


TTGTCRSVWLTFVFLWTGFHHVGQAGIi 
ELL/RPPQPPKVLGLQA 


6884 


207S5 


A 


6939 


422 


65 


LKGPPPSFFFFFFFFFFFFFFFFSP\LP 
YTLYFDnASNACLFFKNWTWRYIKRIFF 
SPTPFIPFFHEVEWVTSLSYMVFQYCFL 
FF 


G885 


20786 


A 


6940 


462 


209 


LSEFWDYRREPPLPAKFAIFMLCRDRGIi 
DTLP SLVSS / SWHQAILAPQPP IMLGLQ 
VHSAYKGFCLPVLTFEEVIGNRPLGLLP 
P 


6886 


20787 


A 


6941 


491 


299 


CAE YS PESGPTHASAHAS / SNVQMVYSR 
ISCKEELLLGRTSPSKNYNMMTVSG 


6887 


20788 


A 


6942 


429 


256 


FSHLSLPSSWDYKHLSSCPANFCIFVQT 
GFHHVGQA\GDPPASASQSVGVTGMSHH 
S 


6888 


20789 


A 


6943 


932 


661 


FKTGSYSV\TRLEGSGEISAHCNLRFLS 
S SDFFIILS SQVARI TGSSQYARLI FVFF 
VEMGFAMLPRVGLKFLGFKPIiHPP\RTP 
RVLGLQE 


6889 


20790 


A 


6944 


381 


55 


PASLPPCSLISDCCASNQRDSVGVGPSE 
PGVGYSLWRRFLSRSEKRNIRVGVTRF 
SRCV/LSPLSIiTQKGNSLTPCASQVRQC 
LALLRLAHGACTHWPAPTVWHS1.VR 


6890 


20791 


A 


6945 


816 


513 


FFIFIYPFEMESLLPRLECSRMISAHCK 
VCLPRSSSY/PASASQVAKITGA/TRHS 
RLIFCVCVLVERRFHHLSQAGLELL/NL 
VIHPSRPPKVLRLQGVTRTA 


6891 


20792 


A 


6946 


383 


234 


LIPSLMDNSEGLKTSVEEVTADWKIVR 
ELELKVKPE/NVTDLLQS\YDKT 


6892 


20793 


A 


6947 


423 


60 


LNPPPPPF\YKPPPKKKKFFSPPPLKFG 
PPTKFFKRPPPFFFFFFFFFFFFWHGTF 
PLNSLVGPPKVEGWVSWAQLPRAVGPPL 
PNPKHGIiNKPPAGSLNKPIHFPSLNQSQ 
EATPPLIPWC 


6893 


20794 


A 


6948 


1 


327 


MRLLAAALLLLLLALYTARVDGSKCKC \ 
SRKE PK1RYSDVEKL\EMEPK\ YPHCEE 
KMVI ITTKSVSRYRGQEHCLHPKLQSTK 
RPIKWYNAWNEKElRVYEEAQQLRRRia 


6894 


20795 


A 


6949 


952 


796 


FPLHFLVEMGN FLLVGQAGLELPTSGDP 
PALASQSAQITGMSHRARPSVSS 


6895 


20796 


A 


6950 


364 


6 


ITTHKNYFLSRTATPPKEGLPQRKKIFL 
FPTPI FI FFVGFI FWSGGGRGFLVFFFV 
FFPFFFFFFFFFFFFFFFFPFFFF\RVK 
TFIGFKVLFFKmVFFFLIRAVPLDYMN 
HALTIFQM 


6896 


20797 


A 


6951 


418 


266 


I SFFFXXXXXFFFXXXFFLLFFFPFFFF 
FFFFFFFFFFFPPFFFFFFFFFN 


6897 


20798 


A 


6952 


459 


7 


PSYLYKPGPSQPNRRRHSVP/HIRSNKD 
PPPPEVMKKWGPPTPQKKKYFPPRKKKL 
GGGGGPIYPPKKKVFFKKKPGGFKKPPK 
KKKKIFFSPPEKRGPPPSILKNPPHPFI 
FFFFFFFFFFFFFFFFFFFFFFLIGEKA 
MLLDMGA 


6898 


20799 


A 


6953 


325 


143 


LPSSWDYRRVPPHSAHFKFFCRN/RGLA 
MLPRLAKPRFPSISIFLNHPQVSEILGA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

residue of 

sequence 


otide 
location 
eorrcspon 
ding to 
last amino 
acid 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamlc Acid, F=Phenylalanine, 
G=Glycine, H=Histidlne, Wsoleucine, 
K=Lysine, L=Leucine, M=Methlonine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














TRLAR 


6899 


20800 


A 


6954 


585 


306 


KTESHSAIRSQCS\GEISAHCNLRLPGS 
SNSPASASRVAGIKGTHHQVQLIFVFLV 
EMGFHHAGQDGLRSPALVICPPWPPKVL 
GLEAWPALHC 


6900 


20801 


A 


6955 


2 


155 


flvetgfhhagqaglellisgdpptlak 
s\dsqdvritglshrawpfldiy 


6901 


20S02 


A 


6956 


3 


346 


DAWPDAWGK\RNWPPHQANFyTFVE/TG 
FT /HVGQAGLELLGLS DVPTSSSLDAGI 
TGMSHCSRPRPEISSRDVHTIPGALQET 
WHTPDLASS IRSLRCRLDCPHHiCPLDK 
DGVTL 


6902 


20803 


A 


6957 


1 


224 


QCGRIRSRKDARKGNVQI IPREGRRRTS 
PRYLFPVTGQETDGWG\MGLPAWLLKTR 
STSPPRSP RLHRRRGATSW 


6903 


20804 


A 


6958 


102 


344 


GLLKPKSLKLQCTIITPVW\NHCTAAWA 

T 


6904 


20805 


A 


6959 




296 


KEEEEGEGEGGGGGG\KKREEKEEEEEE 
EEEEEEEEEEHAEFTGHVESITQCKLLL 
CHLSLRVEAGITIILLCFTVRAVTRSQRL 
LTKRRNGSVGHTSE 


6905 


20806 


A 


6960 


3 


172 


RLGLPKCWDYKREPP/RPGLELLTSSDL 
PALVSQSDGI KDVGHDTEPQDFYTLGQD 


6906 


20807 


A 


6961 




224 


LnLLPRLARSGTIIAHCSL\NRSLRLGL 
SDPPASASESTGTTGiaSHCSQPAKLLST 
CIRNSVAETKYYLMFKCK 


6907 


20808 


A 


6962 




330 


KGTLRRQVTYKGTLIR\LVADTLAETFH 
ARKEWDDIFKVLKKHCQPRILYLAKPSF 
KNEGESFSQRKCITTRVARSKJVIAKGVLH 
LEVKDISTIJKQMKIRSSLVEQIHKW 


6908 


20809 


A 


6963 


1242 


929 


ETGSCSWQAWQWCKHGYLKPQPSHAH 
/DDPPTASS lAGTTEftHYHAQLIPKKFF 
HSLCCPGWSQVSWRIIiGSSDPPASASQN 
AGITGMSHGAWPQLSIYRKGN 


6909 


20810 


A 


6964 


1 


378 


KFFLQLFLHSFIYEHFLFFSFLFFFFFL 
RPS FAFVAQflGVQWRULGSPQPLPPRFK 
QFSCi;SLLSRWDYRHaPPQSANF\EFLV 
ETGFLHVGQAGLELPTSGDPPTSASQSA 
GITQVSHRTRLAFY 


6910 


20811 


A 


6965 


393 


43 


vspfplknfyfsftpktfvogqgpsgpp 
pqkrfffhtpnspffpfppf/yksgprp 
gffftpppekgkhfpNpplrlgpppvff 
trppppffffffferqrtcs fthflwr 

PLFRFYL 


6911 


20812 




6966 


225 


2 


trvechgvffgscnpllpglkefsapap 
rgsgnsgpfpppr/rnsfffffswlle 
mgfhhgqadleiltssdpp 


6912 


20813 


A 


6967 


8 


341 


FVS VPP I LHYLFKI FFFFLRRSLCLPPR 
LECRGTISAHGSLGLPGSSDSPS/STKN 
TKLPIJ^GGTCLGPQLLGLLRHEKTWTLG 
GGSGSYPKSGKRAMALQGGVQNKTCVPH 


6913 


20814 


A 


6968 


29 


362 


DYTCKHPHSSELKKKKKKKKKKKKKKKK 
KKKKGGPPKKKP/HGGPQFPPGRQKKIF 
PFKGGFKKPPRGFFEKNPFFGGGNLGPP 
PPPKNKPPGEKKUFLGGRGQKNSFLSAW 


6914 


20815 


A 


6969 


391 • 


17 


VFFFFFLKKFFPPPDKKLFFPPFPLKIF 
F/SPPNPFFFLGGFSQISPPPKKGFFPK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tiireonine, V=VaIine, 
W=Tryptoplian, Y=Tyroslne, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IPPGV/YFSPPFKKKNFFFPPPLNFGPP 
RVFFKGPPPFFFFFFFFFFFFFFFFLGL 
RGNAGDCNGYGDVSYK 


6915 


20815 


A 


6970 


374 


268 


IMLPRLVLN3WPQVI LLPWP \PKVFRLQ 
VRAPVPG 


6916 


20817 


A 


6971 


614 


315 


F FFGDALS P GLK CGGAVJaAttLQA/ CLLG 
SHESPASASRVAGTCKRPPPRPSGFFVF 
LVEAGLHRVGQDGLNLL/NLVIRPSRPP 
IVLGLQGMSHHAELFYFL 


6917 


20818 


A 


6972 


861 


528 


FETESRSVTQAWQWCNLSSLQP/LPPG 
FKRFFCLSLPSSWDYRRTPPCPANFCIF 
SRDGVLPWVIPGWSRTSDLVIRLPRPPQC 
WDYKREPLRLAKSRHPWNHLPEQEVEHF 
K 


6918 


20819 


A 


6973 


274 


119 


PPN\FVFLVETGFLQGGVKLSTPGDPPP 
PASRRAGITGVSHPACPGKEYIFKtT 


6919 


20820 


A 


6974 


370 


194 


KITNFAKLFGYILPYSinrFWCLEKNRNR 
/ SLTLLPRLVSNSWAQAFLLLWPPKVLR 
LQA 


6920 


20821 


A 


6975 


2 


239 


ARGPCSS I / SDCCASGEQGSVGVGPAE P 
G/MGYNLLACCLLRPLEKCS IKAGVSRF 
SWYSLSRLPLT\RKGNPPTPCISQVR 


6921 


20822 


A 


6976 


361 


270 


PQWLVPVIPA/LLETEVGGSLEPRSSRP 
AWAR 


6922 


20823 


A 


6977 


363 


48 


GPPKKMCPYPN\PRPYEaiLFGKRVFAD 
VIKPKILRW\ I ILDYTQVDPKSNDECPY 

KKQKRRGHRQIKRKSQADQGGRDWSGAT 

;'JiNAIOTITSVGSDTOKHKPLAYC 


6923 


2U824 




b978 




163 


AGYNFLVCC \LPRLLiiKCS IRMGVSRFS 
RYHLSWLPFARKGNSPTSCTSWTO 


6924 


20825 


A 


6979 


408 


212 


lETGFCHVAQDGyLELLASDDPPASASQ 
SAEITDVSHRAQPHSAFLPWFSS KIFAY 
GSVWBVFPHS 


6925 


20826 




« 


395 


254 


LIFVFLVETRFHRVGQAGLELLTSSDLP 
SliASGXlTGVSHHSRPSSAF 


6926 


20827 


A 


6981 


408 




KVSPFPPKGPRGKKKNGftC3GPSPFPPPF 
LGGGTKNFPYPPIFSQGGGKK/QKPPPP 
EKKKNPPWPAFWGPPLVLKNWGGRPGGP 
LDPKVSNFMGRPPFLFGAKKKTFFSPPP 
PKKKKPQKKKGGGRSRSRTSRTRG 


6927 


20828 


A 


6982 


2 


302 


RKHLPPH\PVIFVFLEQRTLRHVGQAGL 
BLLTSSDHRASASQSAGITGVSHCSIiPA 
TSFSTPPFSYRKMLRGTQKMKVPLIGPL 
PQPIRLVAOHYFIYIT 


6928 


20829 




6983 


2 


375 


RGRVBGP VGGP VGRTFI FFQAGS DS AR I 
LVLFPYCTLPHPLPGLFFFFFKTGI KSW 
VLLNPS PRFVI FLEKKI FL PQ P P FF /H I 
LSPPHPGGAGGGNPTARKIKGLSYRDPK 
EGHKRGWAHTPTP 




20830 




6984 


1 


146 


PRPANFC/ DFLVETRFRHAGQAGLKLS T 
S SDLPTLVS QSAGTTGMSHRA 


6930 


20831 


A 


6985 


2 


95 


LTMLPRLECNDTI PAHCSLNS/ SGSSHS 
LTSTSKVAGTTGVHHHFWLMF/PLFIIQ 


6931 


20832 


A 


6986 


340 


169 


VCS SGLS S PLLEQHKTNLI FYASGDI CT 
/ANGKSGFNQPQPFLKTFCCTHRIMSCT 
YL 


6932 


20833 


A 


6987 


346 


218 


PCLTNF/SIFCRDE/SLTML/PRLVLNS 
WAQAILLPRPPRLLRLQA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, lVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJn known, *=Stopcodon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


nucleotide 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


6933 


20834 


A 


«a. 


345 


141 


NRPKCPSMIBWIKKMWHIDTMEYYAAIK 
KDKFMSF\ETITLGEVTQEWKTRHRLLS 


6934 


20833 


A 


6989 


154 


236 


ELISFSRSVERCISSFIKVLFVETDLFT 
LSLSPFFSVLFFFC/RRWGFTMLPTLVL 
NSWPQVILLPWLPKVLRLPT 


6935 


20836 


A 


6990 


2 


156 


FLVE/TGFVKVGQVGLELLTSGDLPASA 
SQSVGIIGVSHRVRPCFLFVCICLV 


6936 


20837 


A 


6991 


310 


96 


KNFFFFLVKKSP / LNVAQGGLKLLGSND 
P PAS VS QS GG I T6 VS PRARP LNFYKEAT 
DPNTSVLKGLNIKKKKN 


6937 


20838 


A 


6992 


15 


279 


NLLCFPSISPLIGIFFFFERGFHHVGQA 
GLKLLTSGD/LRASPSQEARITAMTRHP 
HPLVCGLLNSTHCDWRGRIPHWGFDLRF 
REGPS 


6938 


20839 


A 


6993 


127 


352 


KLADTPQSRFVLVCV/LIETKYSLCCPG 
GLQ LIAS SNP PTS VSQSAT I TDVUHHAK 
SQVFLNT.WPKSSKPTNTGL 


6939 


20840 


A 


6994 


323 


119 


SSSEDHKHVPPHSAMFCFF/ CLRQSLTM 
LPRLVLNSWAQVILPPSPPKPFLDALKK 
KSSKWMSSLLIT 


6940 


20841 


A 


6995 


230 


1 


FFFETESCSVTQAGVQWCDLGSLQPLPP 
GFKQFSCLS/LPNSWDYRHALPCPANFC 
IFSRDGVSPGWSGWSRIPDLG 


6941 


20842 


A 


6996 


346 


121 


RDICTPMFVTALLIAKIRSPHKCPSVDT 
WIRKMWF\ lYTMEYYKM IKNKILGMSH 
HASLPVFNHMCKLKYLIS 


6942 


20843 


A 


6997 


2 


198 


SNNTWAFREPGITGMCHHAQLIVLLYM 
QFHHVDQDGLDVE/NLVICPPWPPKKLG 
IQALNQPSPDI 


6943 


20844 


A 


6998 


183 


31 


MRHLKSSQIQQGTVGffiiCNPSTLGDHGE 
RITQGQEFKT\GNIVRPCLYKD 


6944 


20845 


A 


6999 


3 


471 


LALSCSGTILAHCNLHIiLGPSLPPTSAP 
RAAGTTGVHHHAWLIFVFFGKDGVAQPG 
LKLSFK/VIRPPQLPECVLGLQA 


6945 


20846 


A 


7000 


483 


239 


GNNNILFLFLPFIETGSHFVT\RLECQW 
VQSSAHSQPSAPEAQVLIFVFLLQTGFH 
YVG\QAGLQLLTSSDPPHSALLKVRD 


6946 


20847 


A 


7001 


132 


353 


YAKLGTRGFARGPWPGGFLTVAVWMSF 
AQAG INP FVCIFSNRELRRCFSTTL/PL 
LQKIQVTKGTLLCYMREHL 


6947 


20848 


A 


7002 


2 


399 


EDAQEVELQEGKVHSESDKAITPHSQEE 

FQKQERESAE \SELTES 


6948 


20849 


A 


7003 


2 


135 


IFQFLVEMGFHHVGQAGLELL/KLMIHP 
PRPSKVLGLQAYWHSTRP 


6949 


20830 


A 


7004 


2 


114 
1 


IFQFLVEMGFHHVGQAGLELL/WLMIHP 
PRPSKVLGLQA 


6950 


20851 


A 


7005 


455 


ALFPAWLWLPSSSKGGLKLSHGTTLTLF 
LLPSSTLKDLKPQEQWL/GTCNSQHFGR 
PRKADRLAPGVQNRPGQRGETPSRKKGG 
EGNWGGGGELH lYTTTHTHTYVHTYIHA 
NWTWWCVPWLAFQETEVGGPLEPEVAP 
AWTVRPCLKKK 


6951 


20852 


A 


7006 


371 


172 


QVILCLSLPGSWDHRHTPPHPA/NFLVE 
TRFQHVDLELLASNDPPASASQSVGITG 
VSHCARPCYDL 


6952 


20853 


A 


7007 


486 


252 


PQTVFFFFFFFFFFFQALFSTFYILHVF 
FFFFPPFFPKFQFFPFP/LSLIKQGKFF 
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SEQBD 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 

sequence 


Predict- 

otide 
location 
correspon 
ding to 

acid 

residue i>f 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, »=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopbi»n, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFCYLCNSLFIWTQYFFVNE 


6953 


20854 


A 


7008 




113 


KNFCR/DRSLDTLPRLISNSWPQVTLLP 
QPPKVLGLQA 


6954 


20855 


A 


7009 


105 


383 


SLHRPAHPGLLKFGKKAIHSFIHLCRDK 
GDLAILPRLVSNSWPQVILLPWST/NVL 


6955 


20856 


A 


7010 


609 


348 


ESHSVTQAGVQWYDLDSLQPLPPGS \SD 
S PAS ASHVAG 1 TGMRHHTQL \ I FVLLVE 
TRSHHVSQTKFVLFDTTKFWIFYSGKR 
TRIRL 


6956 


20857 


A 


7011 


382 


233 


DLE I FRGKTFLHHI FPFFFFFFC/ RRRG 
LAMLPKLVLNSWPQVILPKCWD 


6957 


20858 


A 


7012 


359 


199 


DLLPSLPPFLPSFL\FFLFETESHSSPR 
LECSGAIPAHCSLGDRARHRLKINK 


6958 


20859 


A 


7014 


3 


274 


CFFFFFETGSHSVTQAGVQWRHLDSLQP 

pppgckrfscls/lpsswdyrhv/pssp 
llvrussicllscfemtpelfdgvlaiy 


6959 


20860 


A 


7015 


372 


54 


LPSAWEAEMRGPLEPKGLRSQRAEIAP \ 
CTPAWVTEQAYVSNKTNTERPFQKIEHV 
EGLSCTTQKFSSND 


6960 


20861 


A 


7016 


548 


158 


KFDT/GATLFDGRPAVFDEA/DFPSVRV 
YLTGAEYTGEELDSDTWQAELHIEVFLP 

AQVPDSELDAWMESRIYPVMSDIPALSD 
LITSMVASGYDYRRDDDAGLWS SADLTY 
VITYEM 


6961 


20862 


A 


7018 


396 


34 


QKNEIKPTMQLHLTPARMAIIKKILKNG 
GWHGCGE/METLLHCWQECKLVQPLWKT 
MWRFLKEPKVELPFDPAIPVLGIQQSQH 
WESSNPSTGDPAIPPLGIQQSQPRRKVI 
KKRYIflTRL 


6962 


20863 




7020 


2 


241 


FPGPTAARRRQKEDRYEEYDKVLSDKLK 
EAETEAEFAERSLTRLEKSIDDLQDEL\ 
YAHKLKYQAISEELDHALNDMTSI 


6963 


20864 


A 


7021 


347 


3 


NDLKSLTHSVAQARVQWQ\NISSLQRPP 
PQFKRFSCFSLPSSWDYSLIiNCKFYHLH 
FADEEIETLTSQRAQLECKSNMYKDIHN 
TVRSYKLKYKVPQKLERSRNRTKSSTRE 
VRS 


6964 


20865 


A 


7022 


2 


160 


LREDDRRGRGFHHFGQAGLELLTSSD/L 
PTSASQSAGITGVTHRARPDWS FCQF 


6965 


20866 


A 


7023 


3 


248 


DRVSSVAQSSLELLSSSNPHLGFPKCWD 

YRPNPPLPFSSRSLLFLPSFLIFCRDRV 
SMLPKLASSFW\VRPPWPSKVLGLQA 


6966 


20867 


A 


7024 


483 


143 


PLNPLKFFFFPKAFKFWGGVGPICSPPK 
RRVLSQKSQVGFYFPPQKEKGYTFPSPG 
KFGPPKEILKRAPPFFFFFFFFLRRDK/ 
SLTMLPGLVLWPWPQAVLPLQPPKALGL 
QA 


6967 


20868 




7025 


414 


81 


PHPLKFFFFPETFYFWWGVFS PFS PPGR 
KVPPQKSPGGFLLAPPKGKSFPFPCPFK 
IGPPRGI FKRAPPYFFFFFFFNRRDK/S 
LTMLPGLVLNPWPQAVLPLQPPKALGLQ 

A 


6968 


20869 


A 


7026 


16 


218 


KFAPRCHNSARENTQNQPRKHSDTPFLK 
K/HKINMMWKLAGRGTHLWSQLLGRLRW 
EDHLSPRQSRLQ 


6969 


20870 


A 


7027 


28 


272 


EFVGVGPSEPGAGYNLL\CAFLSPSEKC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutaimic Acid, F=PIienylalanine, 
G=Glycine, H-'Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparflgine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoptian, Y=Tyrosine, 
X=lInknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SIRVGVTRFSRCHLSPLTLTKKGNSLTP 
CASRVRQCLTLLRLAHGALHPLSSTP 


6970 


20871 


A 


7028 


277 


480 


LGIFHMLSDSFLLLLLFCLFVFKIFFGI 
F CRD \ RS C YE VWAMLPRLVSNS WAQVI L 
PPWPLKVLGLQA 


6971 


20872 


A 


7029 


24 


153 


SVWWNSP/RSSGEFRKTGVHDADFEAHI 
IDMLAK AS K I K VYRGN 


6972 


20873 


A 


7030 


2 


470 


lETPRPLWWHSAYRGRQASLSCGGLHPV 
R/ASWLLCLPNQAWAMAGAPPPASLLPC 
SLISDCCASNQRDSVGVGPSEPGVGYNL 
LVRRFLSQSEKRNIRVGVTRFSRCV/LL 
PLSLTRKGNSLTPWASQVWQCLALLRFA 
HGARTHWPAPTVWHSLVR 


6973 


20874 


A 


7031 


487 


107 


SPPAPGVAPAPPPPTRVFSKKKKKFFFF 
FLKKI FFFKPGGDPGGIFFFPGGPPPGP 
KKFSPPLFPKKGGYRVFPPPPGKISIFF 
FFFFFFFFFF/CRDR/SLAMLPRLVLNF 
WAQVI LLPWSPKVLGLQA 


6974 


20875 


A 


7032 


576 


36 


GYTSQMGRPGRGAPHFPDRAAGQRRSSL 
PRWGGWAEALLTSQMGWRPGRGAPHIPD 
DGQPGRGAPHLPDEEQLGRGAPHLPDEE 
RPGRGAPHFPDGAAGQRCASHPTRGGRA 
EVLPTSQTGQPGRGAPHLPDDGRPGRGT 
PHLPDG\GPGRGAPHIPDGAARQRHLSP 
PRRGNCTTRIVDQ 


6975 


20876 




7033 


2 


129 


YGPTHASGAMLRSCAARLRTLGALCLPP 
VGRRLP\EASRDPS 


6976 


20877 


A 


7034 


1 


287 


RLSLALVARDGVQWCDLGS PQPPPTGFK 
RFSCLSLPSSWDYRHVSPRPANFVFLVD 
TGFLHFGQAGLELPTSGDPPALASQSAG 
I\TGVSHRAWPK 


6977 


20878 


A 


7035 


3 


315 


HASAHASALFYFI FPETRSRSVAQAEVQ 
WRKLGSLQPPRFKQFSCLSHPYRHAPPH 
FANLVFLVETGFA\HVGQAGLELPTSSN 
LASQSGGITGVSQRARPGLNF 


6978 


20879 


A 


7036 


430 


304 


VIFYLIFFCKDGV/LTBdLPRLVMSWLH 
AILPPQPPKVLGLQA 


6979 


20880 




7037 


3 


256 


FPFFNRQKDLFGNFQTCQKWHSPSPRPT 
/APIIQAPKGNTSPPSLPIHLTLASTCTY 
PIPQAMLENWSVPSPAQAAGEAALWHPH 
R 


6980 


20881 




7038 


13 


481 


FARLHEFGTSRVIYLLARLVSNSWPQVI 

clpwppkl/lglqa 


6981 


20882 


A 


7039 


455 


255 


SGLSLPSSWDYRRAPRHPANFR\FLVET 
GFRHVALaGLKYPPALASQNAGITGVSH 
HARPEQFILTN 


6982 


20883 


A 


7040 


490 


172 


KCWQG/C/GETGTLVHC/WIGRNCTMMR 
PQWQQAWQVFKKIiHRELPSDPAMPLLGV 
QPGGLKIDTQT/RYMYKBUiLTVKRWKQP 
E CP P VDK/ LDKLW Y I HTTE YRVT I KKK 


6983 


20884 




7041 


438 


37 


IFPNLPPKLHYPPQFSPPEKLPKPLLIK 
MAKRQSQGKVFETPKEK/SRAGPPPKFT 
TGFSAEKHGGPKPGGYI SQRVQKKKNNC 
HSRILHSAKLSFKNRSEIKTFLNKQKLK 
/DFTASRPALQKILQEAEFHRTD 


6984 


20885 


A 


7042 


292 


463 


lEAISFLPFFFGDSLTGAQWRDLGSLQP 
LPPGLKRFSCLSLPSSELP/CSGRCQS 


6985 


20886 


A 


7043 


59 


257 


LASLVSQSW[<IKKMWYIYTMKYYAATK\G 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Gliitamine, R=Arginine, S=Seriiie, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=lInknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NEIMSSAGTWMKLEAI ILSK^/TQEQKTK 
HHMLSLRSGS 


6986 


20887 


A 


7044 


279 


14 


RTADTEIGRGSSLCWRYGSLQIQVSKSH 
NFFFETGSHSATQYGVQWHNLGSPQPQT 
YGLQQSSHCGLPKCWDYRE/LPPSMECS 
GTISAHLSLKLMGSSNPLTVASQSAGIT 
GNSSSRLHDINYGLQ 


6987 


20888 


A 


7045 


578 


253 


AAASTFSCFFVFFETEPNSVA\RLECSG 
ATSAHHKLCLPGSSNSPVSAPQVAGITD 
AHNHIQLIFVLVETGFHHVGQAGLELQ/ 
NLVISLPWPPKVLGLQAWSHHAQPFV 


6988 


20889 


A 


7046 


3 


229 


FFKTGSLSVSQAGVQW/PNLSSLQLPPP 
RFKQFSCLSFPSSWDYRHPPPHPiiNFCS 
FSRDRVPPCWSGWSQTPDLR 


6989 


20890 


A 


7047 


367 


151 


LSLLSSWHYGHVPTSPANFFYF/CVDTG 
PRCVSNSWTQAVCLPQLPRVLGLQA 


6990 


20891 


A 


7048 


370 


141 


CQPPHPANFCI FVE IVFCHVGQTGFKLL 
TSNDLPAWASQSAGIWD\TGVSHHAWPS 
RFNSEVQFDYVSNKNVDFSLS 


6991 


20892 


A 


7049 


104 


361 


VT APGLEAAF KERVW PGAVART CGPS TL 
GGHGHHI VRGQK I KT X LMITLKPCLY 


6992 


20893 




7050 


2 


325 


RFSCLSLPSSWDYRHEPPRLAIFFFFFF 
GI I RKGGFTH \VGRGGFKPWTSGNMPAL 
PSQGVGFPGLFSPPRPVRGFFYPLPKWG 
GLFLQNGVKFKKGGSGDLFKMEKK 


6993 


20894 


A 


7051 


14 


208 


AHEILSEIENPLEPGWDHRCVSSCPAHF 
FVF/CYRDGGLPMLPKRVLNSWAQAILP 
WAPKVLGLQV 


6994 


20895 


A 


7052 


103 


368 


GNRVHTYMLTALFMTAKKWNKPKC/PVS 
DEQIEQIWSVHAMEYYSAIKRKrEALICV 
TTWVHIiENIMLSESSQKTMGCMLRRS I Y 
MKGPE 


6995 


20896 


A 


7053 


421 


2 


LFLPPEVARGFPKGRGRMPSPGGFFFG 
PRGGPIFPPP\SFFPTRPWSKQEVFPSP 
TPGKGFRGFFFPPPPKGKGGVFPLFFKK 
YLGGEGAPQNKGGPVFLLWPPHSGPPGP 
PKNFFFFFFFFETESCYVYPGCSAWQS 
Q 


6996 


20897 


A 


7054 


489 


40 


I PEAPGGPGFKHEDS PGANKSGPKALYP 
PKTFHPPWEGRFRSFQKGAPPPPVFFPP 
FFPLPFFFLRQSHSIPQAGVQWRDLGSL 
QPPPPPGFKQFSCVSLPCPP\HQMGFRH 
VGQAGLELLTSGHPPDSASQSTGITGVS 
HCSyEPHRTD 


6997 


20898 


A 


7055 




261 


TGSGSVTQAEVQAEVMaHCSLNLLGSSD 
LSALASRAAGAAGTCHHAWLIFVFFIEM 
GFHYVPHL/GLELWARVIHPPQPPRVLG 
LQV 


6998 


20899 


A 


7056 


1198 


1040 


FLWKIALLFYFKKLPQSLQPLVATTLSS 
QQPSTSRQDPLPAKRL\RLaEGSDDR 


6999 


20900 


A 


7057 


271 


33 


LFFFFKMESRSVARLECSGAILAQCNLA 
RLFLVE\QGFHHVGQDGLGLLIHPPWPP 
EVLGLQAQATPPSPKITYEAATVM 


7000 


20901 


A 


7058 


2 


194 


CYRRQPPHIiAiyiNFYFLLNFFCLFFC \KA 
LSLLARLLLNSQLQASLPPQPPKVLGLQ 
ACGTTPCP 


7001 


20902 


A 


7059 


3 


272 


NMWKLNNILLNNQWVNGKIKMKIFKNCK 
E\NGNTTYQNLRDAAEAFLKGKFIAVNT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

sequence 


Predict- 
ed end 

otide 
location 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaJine, 
W=Tryptophan, Y=Tyrosine, 
X=l]nknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YIKKDYRKTCLRLFYSYHFFNKEKEYTL 
KTMIKV 


7002 


20903 


A 


7060 


397 


129 


FWGPHKKNFPLPARGRKLGSFKTA/LPL 
FFFFFSFSLSLFFFSGSCSVS\RLKCSG 
TITAYCSLKFLGSSNPPASFSQAAGTTF 
TFFLFIG 


7003 


20904 


A 


7061 


3 


351 


SCFRLLCLPKEAWAMAGAPPPASLPPCS 
L I SDCCASNQCDS/VGVGPSEPGAGYNL 
WRRFLSGSEKENIRVGVTRFSRCRPSP 
LSLTRKGNSIiTPCSSQTOQCIiALLRLAH 

GARTH 


7004 


20905 


A 


7062 


60 


324 


ddfvswqtgvhwcgrgslkpqpphlss 
tsrlsspsswdygrvpfXnifcrdrvsl 
cfpgwprtpdlrgsscpspkwklhtrv 

TAPGQ 


7005 


20906 


A 


7063 


481 


160 


MPSGAAFPGLLSEP\PHRGPRSFAARSP 
TLRYPGRAPS KGSGEDPARP PATGPGPN 
L IQDGLVLMSLTENVC 


7006 


20907 


A 


7064 


65 


481 


CLCPAPRGGAYRGRQASLSCGGLHPVRA 
SRLLCLPKQAWAMAGAPPPASLPPCS LI 
SDCCASNQRDSVGVGPSEPGVGYSLWR 
RFLSRSEKRNIRVGVTRFSRCV/LSPLS 
LTRKGNCLTPCASQVRQCIiALLRLAHGA 


7007 


20908 


^ 


7065 


392 


183 


GGLKP/ I PGNPGNSLFSQKKKKQPGAGG 
SPRGSPPPGGLGGG IFFGPKGLGS INPN 
FGPAPPPGGQKKKSLF 


7008 


20909 


A 


7066 


538 


248 


FFLKQSCSVAQAGVHWGYLSSLQTLHPR 
F\KFSCHSLSSSWDYRCAPPRIiAN\FVL 
LAEMGFHFHHIAQAGLELPTSNDLPTSA 
SQNAGITGVSHHT 


7009 


20910 


A 


7067 


2 


127 


VTGQAWHACSPGTLGARAG/WIVWAQQ 
FRTSLGIMVRPCLYL 


7010 


20911 


A 


7068 


379 


217 


KKMFCKNE/SFTIFPRLVLNSWAQTILP 
PWPPKVLGLQVQTTVPGSYSPLSLYKE 


7011 


20912 


A 


7069 


386 


176 


KRDKVDKWLKKMWCIHIMEYYLAIKKET 

lpfatwvnlqXtfilseisqeqdkyhmi 
slicgikmliyhire 


7012 


20913 


A 


7070 . 


30 


408 


WSSELGSVGVGPTE PGTGGNLLVCWLLR 
LWEKCSVWAGVSRFPQ/CTDYDGFPWVG 
KGNPPTPCMQVRRHPMiIiRRTLHGIiHP 

casheknqvpqlemqkssvfcidltgsc 

RVELFPFGHLGSKNP 


7013 


20914 


A 


7071 


310 


414 


HVPWRSFAS'JVHCNLaSLQPPPPGFKfeFS 
CLSLPSSWDYECPPPRLflN\FCSGSLQS 
WLTATSASWVQAILLSQPPE 


7014 


20915 


A 


7072 


353 


243 


IFIFCHV/RGLTILSRLVWNFWDQVILL 
PEPPKVLGLQV 


7015 


20916 


A 


7073 


1 


276 


ELWDNFKEPWIYLKSLKERKECVCERGR 
RDRNKTLEDIMAKNFSNWMKI INPQIQE 
AQQTPN\ QETGRKLALRHI TIKLLKTWD 
KKKILKAV 


7016 


20917 


A 


7074 


362 


133 


GGGGGGGRYSPQKKGTLPLTPPSNWGPK 
GAPPPRGWFFFFFFFSXRDGCLAMLPRL 
VLDSWAQVMLLPQPLKVLKL 


7017 


20918 


A 


7075 


3 


177 


KWS PHHSLPNN\WLYHHMPPCPANVEFF 
CS LAVLPHMVSNS WPGAILS PWPTRMLG 
LQT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etit 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ingto first 
amino acid 
residue of 

sequence 


Predict- 

nucle- 
otide 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, I>=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L.=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnltnown, *=Stop codon, /=possible 
nucleotide deletion, \=posjil>le 
nucleotide insertion 


7018 


20919 


A 


7076 


1 


169 


PQLPSSRNYEHVPPCWANFCIFS/RRRC 
FTMIiARLFLNSWPQVTRPHWPPKVLGLQ 

A 


7019 


20920 


A 


7077 


402 


231 


PSCLSLRSSWDYRCMPPHLTNYYYYYYY 
YFVEMGSHS VTRPQDI LPPWP \PKVLGL 
QV 


7020 


20921 


A 


7078 


422 


316 


FCKDK/SLPMLPKLILNSWVQTTHLPQP 
PKVLGLQA 


7021 


20922 


A 


7079 


58 


293 


DP I CTKSKKLKKTQ \TDKEDEITTKKaK 
KRKS KI KI EEDGEEGKWWEEEVSVKK 
KKDKKKPWLGMVAHTCNPNTLGG 


7022 


20923 


A 


7080 


363 


123 


GVFFRDHARAFSLHTGVCAVHISRCF/H 
VLTHFKEFLFHWFLTIHTHTHTHTHTHT 
HTPSFPELGKAVPMHRIVIVPTNYQH 


7023 


20924 


A 


7081 


3 


181 


YFLL\FFWDRVLPCHPGWSKGWSQTPGL 
GQSAHLSLPKHWVCRREPPHPALFLLFK 
NTIS 


7024 


20925 


A 


7082 


453 


142 


FFXXFXFXXXXXFFFFFFFFFXPPFFFF 
FFFLKNSFPYFFLFFFFYPLFLPLFFFF 
FPPNFFFFFLFS*NFFFFFFPPPPLFFF 
FFFFLFFFFFFFFFFFFFSF 


7025 


2092fi 


A 


7083 


268 


92 


KNNLPSSWG \ YKCS PPHKGNRFFFYI KD 

GGGLIMLLRLISNSWAQVILLSWPSKVL 

ERQA 


7026 ' 


20927 




7084 


413 


0 


SCSVAQDGVQWDDLGSIjQRLPPKNKPFF 

clsstpaa\fpsdwdyr 


7027 


20928 




7085 


400 


125 


KTRS F S WRGGVQL / C I / GS LKPRTPGV 
KSSSPSASPVGRNTGMCPPPQQIFFFLV 
ETGSHYAAQASLKQSSCLGFLILFLSQP 
LGEKEGPTICT 


7028 


20929 


A 


708^ 


377 


38 


mpgpGfyfWgpkkKiKpfpppgvklvsf 
krappfff/cfffetgspfvaqtslell 
dshnpptsasqsagitdmshhahpmhs i 
cmiqi.wksvhrfhfisqgsrdcsssree 

T 


7029 


20930 


A 


7087 


1 


151 


S FCKD/KRIJ.IMLPRLVLNSQAQVMLLPQ 
PPKVLSHHTWPIIKPSPT4HFNLP 


7030 


20931 


A 


7088 


405 


111 


PRLVLN\SELRRSAHLSLPKCWDY6PEP 
PCPAQMHVLKSGTAAHKVHDPLASWSHL 
PTHSPFPVIiCSKTQISVAFFIQNLRflGQ 
RSGQRHF6KQCGSIH 


7031 


20932 


A 


7089 


1 


309 


KQINKSSLRISGKPPNCECVLHIiGKBVW 
KSLLEPHKLaSDP/HI.CDFIjESS\EQVK 
SVKELNCHVXTSIiYTLGALESDMaEYFX 
, NKYTLGPGMLAHAYNSSIL66 


7032 


20933 


A 


7090 


414 


240 


FSLF/ CKNIjKKGLTMLPKLVSNFWPQAI 

LLPQPPKVLGL 


7033 


20934 


A 


7091 


460 


328 


RHPPPYLGK/FFCRDR/SLTMLPRLVSN 
SWPQVILSPWPPKVLVL 


7034 


20935 




7092 


388 


199 


KQFSCLSLPSRWDY/SALPPRPAHFCIF 
NRDGFSPYWSGWSLGLPKCSDYRRRIFL 
RAKAHVRE 


7035 


20936 


A 


7093 


18 


271 


MHSFATYLIiVKLHLLVLCRLCPLSMYSG 
EKSYILLTB35PMLWF/SSGPFPSWSFSS 
ITLRIFLTCLTLKLSCCSCCCCCYCCCC 
C 


7036 


20937 


A 


7094 


415 


291 


GDLGSPQAPPPG \ S RHSPASASRIDGTT 
GARHIMNFKNLAIK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

location 

ingtoflrst 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=GlLtaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide Insertion 


7037 


20938 


A 


7095 


21 


428 


DKGLTVTQAEVQW /HDPSSRQPPS PGLK 
RFFRLSLPSNWDHRCAPPRVANYYFFYR 
WGSYHVAQAGLELLASSNTPTSASHSAG 
ITALFLRDTIiVEFFMGSFCFPQSFGAAF 
TAQIPQRGRTEELFIFLFLNDYLPS 


7038 


20939 


A 


7096 


320 


134 


LPL \ PKTTFYLFLNRDRVLLCCPGWSR \ 
I LSSKQSTLLGLPKCWDHRHEPPQQAVD 
HTLDFS 


7039 


20940 


A 


7097 


402 


221 


DYRHVPPPPANF/LVEMGFWHVSQftGLE 
LLGSSDPPNTSQSAGITETSHQAQAYTF 

IFIIY 


7040 


20941 




7098 


427 


259 


PPPRFKRFSGLSLLRRWDYRPPPSCPGN 
FF F FF\ C 1 FVQMGFHLVGQAKMLKLHD 


7041 


20942 




7099 


422 


275 


VQWYNLDSLQPASSGFKRFSHLRLLSTW 
D YRH \ PRPANFC I FSNTKDTF I 


7042 


20943 


A 


7100 


3 


404 


has gatwrnp vs trvwwh i p i vpatqeg 
emggsleprk/lkaavshctpvwvt 


7043 


20944 


A 


7101 


539 


336 


FETESRSFTEAGVQWRnLGSLQPPPPGF 
KR/DSPALASRVTGTTGflHHHAGLIFVF 
LVEAGVSPHWPGWS 


7044 


20945 


A 


7102 


408 


287 


WLILFFIFCRD/RGLTMLPTLVSNSWAQ 
AILLPQPPKVLGL 


7045 


20946 


A 


7103 


1074 


705 


SFLRWSFTI.VAQAGVRWRGLGSLQPLPP 
WFKQFSCLSLLSSWD\YGRPPPRi^ANYF 
F\VILVETGFHRYWPRiyiVSNSQPQ/CDL 
PAVGRPISrSAGl\TGVSHCAWPKMFLNWC 

TCTHS PYLQG IVSLG 


7046 


20947 


^ 


7104 


227 


55 


FSQHKINIQFYISLSPTHITMSFFLCRD 
/RGLAiYLHRL/VLNSWPHVILPPKPPKV 
LRL 


7047 


20948 


A 


7105 


479 


314 


PRLLLIFFFIERGL\>I,CSPGWSQTFGLQ 
ESSCFGLPKSWGPRKEPQPCLRLKWTLQ 


7048 


20949 


A 


7106 


37 


143 


NSFF/ CKRRGIiTMLSRLVSNSWPQVILT 
PQPPKLliEIi 


7049 


20950 


A 


7107 


441 


106 


GAPSPASLPPCSLISDCCaSNQRDSVGV 
GPSEPGVGYSLWRRFLSRSEKRNIRVG 
VTRFSRCV/LSPLSLTRKGNSLTPCaSQ 
VRQCIALLRLAHGAHTtlWPAPTVWHSLV 

R 


7050 


20951 


A 


7108 


413 


207 


CCIQAGPAFFGGPPPPRWGWGGVPPPGY 
/MPSPPVFRVGLLVFRGPPPNMFLPMGL 
QPDGPGLGDPKKFAK 


7051 


20952 


A 


7109 


92 


409 


PHLPDGAAGQRCPPPGRGGGCVEAGPQL 
PPGRGGWPGGGLPPPPSRTGWLPGGGAP 
HFSDGAVARQRVSSLLRRGGRAETLLTS 
QTGSRPGRGTPHISDDGRPGRD 


7052 


20953 


A 


7110 


408 


13 


PPWGPPGGGPPEAHGSNPPGPPWGTPIiS 
PKKPKNYSGQGGGPLNPPPPEG/SGPGI 
PFSPEGKGSLNPGGQKGKIGAPSPPPGG 
PKKKPPSQKKKKGKEKKEICVIKGESSQI 
LLFKVSKIIiSVNFLHLSSLI 


7053 


20954 


A 


7111 


290 


175 


LKKNCR/DRSHTMLPRLVLNSWPQTILP 
SQPPKVLGLQI 


7054 


20955 


A 


7112 


1136 


552 


I LFKEMGCEAGQRC S C PVI P I TLGRPRR 
ADHLRCGVRDQPGPTWLKPRLY2NTK1S 

wawwrvpa\ipa\irrpeatrftfnprr 
wrlplnlrshplhshpgrqskipikker 
kkgqdrkgkgcgqnwgqkqkerkegkke 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














MGMQRIQGKNKKEQRKERKEQRKEQNKW 
VWAELGAFPEQGKGLRRGPSEEAVGVSP 


7055 


20956 


A 


7113 


570 


199 


LCLLKQAWLMAGAP P PASLP PCGLI SDC 
CASMQQDSVGVGP S EPGWYSLWRRFL 
SRSDKEMIRVGVTRFSRCV/LSPLSLTR 
KENSLTPCASQVRQCLALLRLVHGAHTH 
WPAPTVWHSLiyiR 


7056 


20957 . 


A 


7114 


466 


284 


RTRGIGWTANRRMKATQPQLTPFHLIHK 
QEGGNVGSQKGQRD\VTNSAFHWRLHDQ 

TANCLS 


7057 


20958 


A 


7115 


474 


224 


DSNFE KS STLGKMLSNSI KC YREI FCER 
KSQLMQQTSML\FKKLPQSPQPSATTTL 
ISQLPPTSRQMPPPAK\RLQLAEGSDGC 


7058 


20959 


A 


7116 


377 ' 


111 


FXXNFXSFLFFSLSLTRKNWVAHITFFF 
FFFLPTPPFFYFIFLLIFCFFFFLFYIF 
FFFFFFSFFFFFFFFFFFFFFFFFFFFF 
FFFFL 


7059 


20960 


A 


7117 


417 


148 


VPPCLANF\NFFVETGSPYVAQAGLELL 
GSSHPPTIASQSSQITRVSHCSLLYFLM 
HSNTSISFFSFWALVSFLEMPSSCQDGI 


7060 


20961 


A 


7118 


2 


196 


RVP P P CPA \NFVFFVQTAFCHVAQAGP K 
LLASSNPPASASQSARIAVLSYCPLPSL 
RLRSHLDFF 


7061 


20962 


A 


7119 


2 


417 


EAGGSLEPRNLRLHCA/LVAPMNSRCIP 
AWAT 


7062 


20963 


A 


7120 


801 


450 


GPKFLGPPGVEIN/RRPGGGGLPRVYKK 
NWYPEDRPIVLPAFFSGGGR/GPGPHDK 
HPLPRKKQKHSPGRGGEPPFFPPPGFPP 
GGERKNPLTPQRKRVHWGQGLAPPFPPS 
LSAKPKLF 


7063 


20964 


A 


7121 


3 


199 


DAWQFALWQAQVQCCDLSSLQPSP/RL 
ANFVFLAETGFUIVGQAGLKLLTSQNLP 
ALTSQRLRLQ 


7064 


20965 


A 


7122 


407 


154 


NVAGLTGAYLDSFFGRDRV/LTMLLRLV 
LNSWAQAILPPWPPKVMGLQACTIEVSS 
LVAQIASKRLSGVQESDEAAWIRDC 


7065 


20966 


A 


7123 


33 


192 


RWDFTMLPRLVLGSKRSAHLGLPKCWDY 
KHEP/ PCCPGWSWAQRDLPTLASQSAGI 
TSMSHLHQAIPLKFLEQFIRFSILVF 


7066 


20967 




7124 


3 


399 


SIPVDWPSRAPTPRQRESAAHQQN/YPV 
GQRNXRMAGLSPRSFCWREVSFSR/SHS 
C / 1 CtJVQLGKSAVLPQQSRGPGAHEGSR 
RRG6SSPPCPPLQTQLVFSPQELSMSAP 
IHRLSVTLQSDCISTGGAPFRCRLA 


7067 


20968 


A 


7125 


1901 


1574 


FFEMESCSVAQAGVQWHDRGSLQPLPPG 
FKQFSCLSLPSSWDYRHLPPHEANFFVF 
LVETGLHHVQQ\SQLELLTIjQD/LLPAS 
as qs ag i tgvshhawakmp cflts i dt 


7068 


20969 


A 


7126 


408 


234 


LPSSWDYRSPPPQANFYIFSRD/MGFTM 
LARLLTSSDPPALASQSARIAGVSHHAQ 


7069 


20970 


A 


7127 


88 


277 


KLSTEYKKQRPEDSFFLFL/VEIGFRYV 
AQAGLELLSSSDPPTLASQSAGITGVSH 
HIQPPFKK 


7070 20971 


A 


7128 


3 


140 


QRWWCVPATRKAEAGESLETESSRLQGA 
MITP\CSPAWATEQDLAS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino add sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptopiian, Y=Tyrosine, 
X=UnIinown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7071 


20972 


A 


7129 


• 


212 


PLLFVSPPLLSVSPPLLPFP/EPAASPP 
PSTHSLLPSVQSESLGQFLHPPDPPPIiA 
FPAPLKHQDLHISHL 


7072 


20973 


A 


7130 


361 


3 


RNNISQDIAVIGNDSSNRSGQSKLKIFW 
KGFTILHAIKDFCDS/WEEVKISRLPGV 
WKKLIPTLMDDF/EGFKTSVEEVTADMV 
ETETELERLELEVKPEYGTGLLQSRDKT 
LTDEKLLLMD 


7073 


20974 




7131 


421 


201 


NFC I LVD / TG FHHVGQVSLE LLT P SNP P 
ASASQSSGITGLSDHSQRKSELVLSFNF 
FALFHILLCVMFLAIDRND 


7074 


20975 


A 


7132 


221 


529 


LIKFTIHSFYFIYLFIFFSGDGCFALSP 
RLESSGVISAHCNIRLPGSSDSPDSASS 

vagii/cmrhhtqlily/clvet/efhh 

VGQGGVDLLTSGDPPASASQSV 


7075 


20976 


A 


7133 




227 


KGIHFSSANDVKKKTVLTWLNSQDPQLF 
RDGLDGWYHRLQKCL\ELDEVYVEK 


7076 


20977 


A 


7134 


1242 


300 


dfvlpdgrvip\kgivcliniig\thyn 
pn \lwptp levydp frfnp\enikers p 
lafipfsagpkjmciguafamaemk\,'vla 
TjTllhfrilpthteprrkpelilraegg 
lwlrveplagtsrgnprassrpslsdpw 
vlrrplpppiflflllltlpllfsvmar 

EPPSPLRVEAPGPPEMRTPPAIEATPKG 
TLQPAGGRLRFLSGCVPl LHQVAGHRYG 
KDKMGI LQHPDGTVLKQLQPPPRGPRE L 
E F YNMVYARDCTDGVIjLE LRQYLP K YYG 
IWPPPAAPNDLYLKLENVTHKFNKPCIM 
DVKIGQKKL 


7077 


20978 


A 


7135 


1 


259 


GTSCILASMLTRHRLRFSF/CLFETEFC 
S VAQSGVQW YDHS SLQPQPPGLKLFSPV 
NFPS SWDYRHMPPCLASAS ILKLLFT ID 
LGFYS 


7078 


20979 


A 


7136 


1132 


900 


CSGVI SAHCKPRLPGSRHSPASASRVAG 
TTGTRHHARLNFCI FFLVEMGFHRVSQD 
GLHLL/NLWCPPRPPKVLELHA 


7079 


20980 




7137 


1 


501 


fflrwsltvtrlecsdvisahcnlhlpg 
s s dspasafreagtt/ghdaql i fvflv 
etg/fhhwkdgvdllts 


7080 


20981 


A 


7138 


2 


229 


WHSHGLLWSQPPL\YLSLSSSWDYSHVS 
PCPA6LCLFLVETGSPYVAQAGLSLRGS 
ASQSTGIIGMSQPAGFIVKQL 


7081 


20982 


A 


7139 


255 


58 


IFGKGFPPFFCQNQGFSVPPPPRAFFFF 
FF/CPAKYYTMLPRLWSNSWAQAIHPP 
MPLKVLGLQA 


7082 


20983 


A 


7140 


422 


274 


TMLPELVSNS \QLKQSSRLSLPKWIVDYR 
YKPPCLVYTPFINRPPEVPKE 


7083 


20984 


A 


7141 


417 


60 


VWRFLEELETELPFDPAIPLLGIHLKEH 
KLLYHKDTC\WNQPKWPSMVDWIKEMWY 
INTMEYYTAIKKINAIFRDPLTSPWDPF 
SWRKFCLPLEIWPASYFLPGINTRSLDE 
PLHIKGER 


7084 


20985 




7142 


391 


70 


SPPPHPPRKGDYRGGPP/RPGKKREIFF 
FFFFLVDTGFCHVAQAGLKLIiASSN/LP 
TSASQSAGITSVRHYIQPTLLKIIPQFL 
DILLFLKIFHSFLKSFNFNLGSFY 


7085 


20986 


A 


7143 


405 


304 


FCRDR/ SLLMLPRLDLNSWAQGILLPWP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last^amino 
acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7086 


20987 


A 


7144 


3 


159 


HRSEKIWYMYTMEHYRXSQKKNKIMSFV 
ATWMRLEVNI LSKLMKNEMMS\raGR 


7087 


20988 


A 


7145 


343 


115 


CE FYFLSCIYVF lYFCR/ DKGLSLLLRH 
VSNSWPQVILLPWPPKVSHQAQPEFYFR 
NKQTKI DLSFEETAADLAYYS 


7088 


20989 


A 


7146 


453 


119 


RCWKGCGLWISLRTT/WREWQLVQPFWK 
TVWRCLRKLKVEPPYEPATPLPGIHLKK 
TKAVAQRDMCTPLFIVAFQHHLLKRLFF 
SRCVFLAPLLKIA 


7089 


20990 


A 


7147 


3 


105 


TIEWGEVIS/SSrSSTMSNKEGLFSLVA 

RKLSRPL 


7090 


20991 


A 


7148 


295 


23 


FANIFSHSAETLQTRREWGPIFNIL/EK 
NFQPRISHPAKLSFISEGEIKSFPDKQM 
LRNPITTRPALQEFLKKEFMSFPGTWMK 

LETIIIS 


7091 


20992 


A 


7149 


1 


161 


RTRGFFYLDGVRSLAMLP/RLVLNSWPQ 
AILLPWLPKVLGSQTIAMVAAYTVFV 


7092 


20993 


A 


7150 


1 


219 


DYRCTPPHPAHF/SIFIFCRQR/SLTMF 
LSLVLNSWTQAILLPWPPKLLGLQACTA 
MPIPGVELLWVSGCFVKES 


7093 


20994 


A 


7151 


177 


34 


ELVSFLSFSLSLSFFPSFLPSFLPFLSF 
LPSFNPSF\LPSFLPSFL 


7094 


20995 


- 


7152 


398 


40 


MNDHVLLFHPGWNAVAQSWFTGASN\ FG 
LKQSSHLSLLCSWNYRHTPQCPASLLAY 
VCMYVCMYVCMYVCRD/RGLTMLPWLVS 
NSRPQAILPPQPSKIMGPLLGAVAHAYN 
TSPKGSQRR 


7095 


20996 


A 


7153 


41S 


179 


RNS CTQMRNEALPNSGYFPKLKYYYDF/ 
CQRQDLAILPRLVYNSWVQAILLPWPPK 
VLGLQHEPPCLAVIQFLKBNLKIV 


7096 


20997 


A 


7154 


767 


537 


QALLCLPGWS \AVAQSWLTAASNSSDPP 
ACLPSSWDYRHVPPRQAEWSFRVSVSSP 
VKRDPGSPWRRMRPCSE 


7097 


20998 


A 


7155 


421 


301 


HftQLFFCRHW/SFTMLPMLVSNSWAQAI 
LLPWPPKVLGLQV 


7098 


20999 


A 


7156 


410 


269 


AGdPQTdSFFFF/CYERGVSLLPRLVSD 
TWSHVILQPQHPKVLGLQA 


7099 


21000 




7157 


3 


237 


MQIKa./TMLYHFTPTNMTVLGFKKITSRG 

WQARGEIGTLVHCWWGCIMMQPLWETVW 
HFLKRLNKLGWVHACSPNYSGG 


7100 


21001 


A 


7158 


435 


211 


LFLVEMG\FAVLASLVLTLE\PRDSPTS 
ASQRARMTGMSHHTQPLFCLFVSKFHIA 
KKLSLIWQNLRRAGICRLS 


7101 


21002 




7159 


3 


220 


SSSASQSGGITGVSHHAHPSERQ\SCSV 
AQAWQWCNYSSLITPTPGLKRSSRLSL 
LSSWDY\GALPHPSVAS 


7102 


21003 


A 


7160 


' 


172 


frkvlgrtgsnglctqvhvefmddt3rs 
imrnvkXspveegdvltlleleletlrl 

R 


7103 


21004 




7161 


1 


180 


DAAPVLKATARPTRPDRWK/ IDIDSLDI 
HGETPSLQKNTNTFRAWWCIPWPATRK 
ADMF 


7104 


21005 
2TOO6 


A 

A^ 


7162 


3 


108 


KKKKTGRFKLV\MQYMRRFV)SrWPTFGK 
KKGPNAHS 


7105 


7163 


409 


110 


GQHGLTQSLPLSAPPRGAVPRWAPRPD/ 
SGLSLSPLLPTAFATTGASSAHRFVSIG 
PRDGNFLNIPQQSQSWFL 


7106 


21007 


A 


7164 


35 


371 


YFEfflI£LIiKPIiLASMMVFLTVVTOLLE 
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ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, iM=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LLF FFLGET E PAFVAQPGR \QG PNLGS L 
RPH FP S VKHF PGLT P PRAGTS GLDP P S P 
AIFGFFKKKRGSPCGPGRAWTPGPRGGP 
P 


7107 


21008 


A 


7165 


3 


382 


YTGPAWWSMLWLATEEAEVGGSLEPRS 
SKLQ\QPPLRHCTPAWAT 


7108 


21009 


A 


7166 


1 


305 


TAVTVDPYHSVYIEVNARPGACMNGGGL 
LNGEQNSLKRKQLVQTIiNKHHGEELaMV 
DSLTLGPKLEEA/ASPPAFAEVRLRARQ 
ACRRKGSRLCWEHLVSKV 


7109 


21010 


A 


7167 


383 


55 


RDLSMVRMKSMLAIALCFTALMGMFNS I 
FYGRWAKLPFPPLSYIQGLSHRNMMGD 
DTTDCSFIFLYIFCTMSIR\QNIQKIVG 
LAPS RAATKQAGGFLG P PP PCGKFS 


7110 


21011 


A 


7168 


465 


166 


SPQGAQIKGVGFQKLRPWGP\PGFPRGK 
PFFWFKKKLPPQGGGPLCSRFSGGLSQK 
N\SGPWGEIFPWGQNGPPPSPPGGKETF 
FSKKKKKKKAVDYLNS 


7111 


21012 


A 


7169 


2 


323 


IQGPKTLSEEVSADEVEIVRELGLQAET 
EDVTELLHFNSTACCREIFHEHKRQQMQ 
QTSLVSHF\RKLLQS PQSSAIATINQSQ 
PSTLRQDPMPATRLHLTGGLDNH 


7112 


21013 


A 


7170 


3 


509 


FGTIPLVLCADLWSLPDSGWEYLSTGG 
VETNHKDFKELRYNESLTNFSCHGKNGT 
TNGRITHGFKLQSAYESGLMPYTNYTFD 
FKGIIDYIFYSKPQLNTLGILQPLDHHW 
LVENNISGCPHPLI PSDHFSLFAQLEL / 
YTAFPAPSQRHPPSWQEWKHLQRTALI 
HL 


7113 


21014 


A 


7171 


1121 


320 


SSCFQTSSHACFVTEFVPGGDPMMQIHE 
DVFPEPQARFYVACWLGLQFLHEKKII 
YRDLKLDNLIiLDAQGFLKIADFGLCKEG 
IGFGDRTSTFCGTPEFLAPEVLTQEAYT 
RAVDWWGLGVLLYEMLVGECPFPGDTEE 
EVFDCIRLHGRPPTPAFLSVQG\VKFIQ 
KLLQKCPEKPLGAGDQNAEEIKVQPFFR 
PTNWQAIiLaRTIQPPFVPTLCGPADLRY 
FEGEFHRAA\PALTPPMHSLLTARC3QA 
AFRDFDFVSERFLEP 


7114 


21015 


A 


7172 


355 


3 


GPLLSMLGMNILFLTLlFLLLPLStoEGG 
VCGNF/ITNCCLEIDDNKKVIEDITAKI 
QKIAHVPVQIWKGWSPDTLFGGWFSSLG 
RFKTLVGIVLVILGVCLTI.PCLLPLLVK 
NIQSMSrV 


7115 


21016 


A 


7173 


184 


2 


BKKVEDDDDDDDFEMESRSVI PAGVQWR 
NL\LHPLPSQFKPPSCLSLPRSWEHRCA 
PLHQCI 


7116 


21017 


A 


7174 


1190 


407 


SLEOiRPSARGSRPMSMYETaSQQKPYLP 
MGEASRPEESRMRLQPFPAHIGRSALVT 
SSSSLPSFPSTLSWSRDESAREflSRLEK 
QWSTPESDYDNTPNDMEPDGMGSSRKGR 
QRSMVWPGDGLVPDTAEPHVAPSPTLPS 
TEDVIRKTEQITKWIQEIiLRAAQENKHD 
SYIPCSER1H7A.VTEM2^FPKKPKSDM 
VRTSLRLLTSSAYRLQSECKKTBPGDPG 
SPTDVQLVTQQWIPVCVRPFAKAAKQL 
VTITTKENHN 


7117 


21018 


A 


7175 


3 


136 


AIHLASLKVFYRYRVLSIi/ LKRLVSNSW 
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ing to first 

amino acid 

residue of 
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correspon 
last amino 
residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PQVILLPWPPKVLGLQV 


7118 


21019 


A 


7176 


3 


316 


GIVKFFFFFFLTKCRTVTQAGVEWCSLG 
SLQLPPPRFKQFTCLSPPVSWDYRHLPI 
CLANF C I F \ VE TGFHHVGQMGLELLTSG 
YPPTSAPPDAEITGVIHRTQP 


7119 


21020 


A 


7177 


529 


217 


LLPRLEYSGMVIAHCGLQLLGSSQAILG 
ACYCAW/HMFKAFFVETKSHYVTQAGLK 
LLASSNPPALASQSVGITGVSHCAQATL 
PFKTVDKGPSFNLFSFTPPLL 


7120 


21021 


A 


7178 


3 


150 


YTLCKSKKDIKSIYHIMT/HLKTHTHTH 
THTHTHTHKPSTTFSYTETFIM 


7121 


21022 


A 


7179 


540 


69 


ACRGAHVQVGDGSALTYVEDGTACGPTM 
LSLDHLCLPALAFNFSTCTDSGERRICS 
HHGVTAWSLGWREKLTRGDRPLLTSPGP 
ALPIG\CSNEGKCICQPDWTGKDCSIHN 
PLPTSPPTGETERYKGPSGTNI I IDS lA 
GAVLVAAIVLDETGWGYK 


7122 


21023 


A 


7180 


452 


279 


CIDSIYQEIYHLSISILSVYLSIYRSIY 
LSIYLPIFW\SLYLGPRFLRTRDPQLLS 
LT 


7123 


21024 


A 


7181 


501 


183 


HVRCLAFICALIGFPSGYFS IMPWT/ V 
DLVGIEHLPNAYGI I I CVNGI SALLGPP 
FAGWIYDITQKYDFSFYICGLLYMIGIL 
FLLIQPCIRIIEQSRRKYMDGAHV 


7124 


21025 


A 


7182 


446 


245 


iRUHSLALNQVSSASRYWDTGVATKIiAT 
F/SLFWKDRNLDILPSMVFNSWPQVIFP 
SWPPKVLGLQM 


7125 


21026 


A 


7183 


420 


73 


DSSDDSGQSKLKP/ FWKGFTILDAIKNI 
CDSLEDFKISLTGAWKNLIPPLMDDYEG 
FKTRVEEVTADWEIARELELEVEHGLG 
AVAHaYWPNTLRLRSLKPRSLRPGQHSK 
TLSL 


7126 


21027 


A 


7184 


567 


174 


FFLRDRVPSLTQAGVQWQDHSSLQPRPP 
RLKQSSH\PALLSSWD\YEHRPLCIiAKF 
LIFLQRRGLAI/FARGWSNSWSQ/CNL 
PASVSPSAGITGESHYSQPFLLSDSQTK 
VYKTHRSTISSTITEMGCERKD 


7127 


21028 


A 


7185 


345 
K 


59 


RWGSSCLWQVIQSLTLLPRLECSGVITA 
QCSLGLVGSSNPPTSASC\VESHCFAQD 
GLELMGSSNPPTLftGITGMSHCAQPLYD 
LFLYEYGLSVHY 


7128 


21029 


A 


7186 




209 


LSWLEIGSHSVTQAGVQWVHSSSLQP* 
MLGLKPSSRFSLPKCWDYHHEPWCLE6L 
LRLKFDPSVGGVA 


7129 


21030 


A 


7187 


3 


225 


EKTPVSDRGAEGCTESLVWRRPCFSALE 
VDETYVP*EENAETFTFHADICTLSDKV 
RHFLKPIALVILNHHIF 


7130 


21031 


A 






377 


ES*RNHNGDEDDSHVRAQGACIEAKHDK 
YMflDMDELFSQADDNRKKEDI PDYLRGK 
ISFELMRQPCITP'SGITYDRKDXEEHLH 
RVGHFDPVTRSPLTHEQLI PNLAMKEVI 
DAFISENGCVEDY 


7131 


21032 


A 


7189 


1 


359 


SRATDHVQRQGHG* G IHNYSDWENTDWF 
RETKNQSVPLSCCIETVSNGNGSLAHPY 
DLYAEGCEALVGKKLLEFMMHVI WAELA 
FAAI QLLSMLSAC IVMCRKSTDPAYELL 
ITVGTYA 


7132 


21033 


A 


7190 


496 


118 


QNQKFPFEPGTS*LGLYPKKYKLIYHKD 
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ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Metljionine, 
N=Asparagine, P=Froline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PKTCMFTAALLTIEKPWNHHKHLSMADW 
IKKVWYGTVGHACNQSTFLFWFFFEMES 
HAVAQGGVQWLNLCSLQAPAWATESDS S 
QKRPKIDPHKYIVN 


7133 


21034 


A 


7192 


1337 


342 


ISEATRTFRESIWQFQSSPREDDRNRQL 
SASGSRQSHPGPRSTPGPGQGHRSSHQG 
CAPVSSAflRPGGPSFRLSEGALSRSQFS 
VLVYIP*VAAGQFVKACLGASSPPSRL* 
GGLGTGHRTPGKEARSWPETSAPPAWHL 
LRSLCLPSRGEKLAQARSQKHLGWKWLS 
KLCPKTKYSWSGPDDSPSVLRDPEAGTT 
RWAPSGNPEEEWLQLKAKYLQNISEVL 
SVLHLLHIKALQLHSTNPFFVNARHLLV 
GQISSALTLNNLSQILFLLLRFPFLSQV 
ERHSPLQKTGTTETNLVLFQDTVPMNFP 
SSILLPSKYSCPALDKEASIIFYQ 


7134 


21035 


A 


7193 


469 


130 


SASWDYRRPPPRPSNFLY**RRGFTILA 
RMVSVS *PRDPPASASQSAGCAYS lYLQ 
ERFAELHPACLAPRYMASDPLSSLHHSV 
PPGLHLEHEHWLWHCLLPGQICRPGKLW 
Y 


7135 


21036 


A 


7194 


430 


149 


TVWQFLKQLNIELSCDPVILLLGICPRE 
LKTYVHTKTRT*AFIVALCTIVPKQKQL 
KYPSVDEWVIKMWYVHTNDYFSVIKKNK 


7136 


21037 


A 


7195 


257 


412 


TATSNKEPLKKVRCLIWRKKDCWARGFT 
ACNPSTLGC*GRWIMRSRDRDHPG 


7137 


21038 


A 


7196 


1 


124 


PLISGVArHPCNPSTLGGRGWRIA*VQEF 
ETSLGELAEPCLY 


7138 


21039 


A 


7197 


2 


120 


YGIILFITSEVFVFAGFF*AFYHSSLGK 
KISKKESVIIY 


7139 


21040 


A 


7198 


234 


416 


GVHAEVLISPGPQDRLRALWIPIIFGI 
LFAILLVLVFIKKVAKKBTNKVGHP*EP 
GTEF 


7140 


21041 


A 


7199 


372 


3 


VHYPVPQTGSPCCVPHPRGLSTIHPGLH 
SRYTGLGM\;AHACNCSTriGGQSGKIA*A 
LEMEISLGNMVRPCLYERTNKQSKPRHM 
GITFGLTPSLTLTDKYSLI I SGFYLPQV 
LISLHPLGPY 


7141 


21042 


A 


7200 


396 


21 


GLKESHKCCaiPACLQRDPLVRVtGKAG 
F*EKSLSISTELQSLPDSPDFSKITGKP 
I KLTQVEHHAGFEWWEDGAGTTPS PGLQ 
PAHLTFPLDYHLWQPFIFVLRDSDTGAL 
LFIGKILDPRGP 


7142 


21043 


A 


7201 


2 


148 


REPPHHSIiftNF*KFFVDKGFTLLPRLVS 
SSWPQWLPPEPPRELGLQA 


7143 


21044 


A 


7202 


1 


291 


AEGFRMHALVSQAACGTSGYKLSVGYMV 
QGPLTHLMLGLLISLIFLSRGSSRTWAP 
SHS*YKTSDLLPCRNRREVIEPLHYRNL 
HSHICLSVTBCTFL 


7144 


21045 


A 


7203 


262 


406 


ILAEMTSWNS IKHLILFLK* SSHLS IP 
SSMNYRCVLPHQANFLFLFL* R* SFTLM 
PRLVETVLL IQNS PTKIDSS PDDFSGKF 
YQTIiNIVFENlRRGNTSNLLHKNSRILI 
PNLTKTLHD 


7145 


21046 


A 


7204 


2 


247 


DTVIERSGQGTAALPRSTSLDRDWAITF 
EQILASLLTEPALVKYFDKPECMMARIT 
SAKNIRMMGKPLT*ASDYEISAMTG 
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M 
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od 
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26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 
peptide 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
East amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asparf ic Acid, 
E=Giutaniic Acid, F=PlienyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \-possible 
nucleotide insertion 


7146 


21047 


A 


7205 


2 


168 


FFFLVETGFTV2jARMVSVS * PRGLPA3A 
SQSAGIAGVSHRAQPALRFLMLENRMC 


7147 


21048 


A 


7206 


1 


237 


GKCAHTHTHTHTHDTHRHAHAHTHACTH 
THTYVQGCSLSQWS QDCVHMWACHI FGV 
PRE *ASRYTSSLVYKPGCTGVCG 


7148 


21049 


A 


7207 


2 


239 


K\,'DSEHKLERSSGGVLGRFKGKGSGGLN 
EDDGSESE*EYS*PMPALEPCSS 


7149 


21050 


A 


7208 


1 


405 


GSRDNHNVILICFPNAPICVSAESVKNDA 
QLDKYLECTAQE 1 1 ^NIIWEDVPDLFHVM 
RTLPSDNMPSLPPGGELASMRTVIEAVY 
YRLNPYINDDTDSTSTDDMVIFNCSSSHG 
VYLDLHRRVQVNFDETFSMHPEL 


7150 


21051 


A 


7209 


2 


761 


QDFGTRFATVLLFYGYFKCTRTLKGTCL 
YNAILYKVCSPRNDQPDVCYNPSF7LLP 
I KTGELLGFLVYAS RE KRS lAI GD * KDD 
KWPPERI I QYYGPAT*AQDS SWGYR I P I 
YMLN* I IRLQAVLE I ITNKTGRALTVLA 
LQEIQMRNAIYKSRLALDYLLAAEGGIC 
GKFNMTNCCLHIDDQGQIVEDIIKITIO. 
AHVPMQ I PDI VWKNIVKIPVLFCS VL IT 
GTCSPQSGTPCWLSRSRPSHVDPFRWS 
P 


7151 


21052 


A 


7210 


147 


390 


KTGLSDQNPRAIQLSYSYLSPLLY**SY 
KKKKKKKKKKKKKKKKNFFKKGPGN* S F 
WGGEKKSGPGAGLKNPGGGKNTFY 


7152 


21053 


A 


7211 


433 


214 


WGRGITALTIHPPWVPFQESETTPSPL*G 
ITLLDNPFSPIPKQPPPPH*SITVPYLL 
SSPQPGFPHPFQAPGLfl 


7153 


21054 


A 


7212 


532 


253 


KWINVFHKAAIMSDFSDRSRHSTLKAS 
E*GFTMIDAIKNICDL*KEAKISILTGI 
* KKS I PTLMDDF *GFKTPML* VrANVGF 
QRFSKKFYW 


7154 


21055 


A 


7213 


456 


24 


DAQAFSWYSQGYFNSSQCSFS ISLecSDTS 
PVKIKRMGGPLFFS *LIiICSTCWACSKTS 
RVPGKVLKTGLPRPR*PVYKDPLGGFYP 
RF 














ILRDLCQRVPTWSGFPSWAMELLVEKAI 
SSASSPQSPGDMiRTVFECISSGIILKG 
SPGLLDPCEKDPFDTLATMTDQHRKDIT 
SSAQFALRLLAFRQIHKVLGMDPLPQMI 
QRSNIHNNRKRRRDSDGVDGFEAEGK*D 
*KDYDTF 


"715(5 


21057 




7215 


203 


3 


VRPGWAHACNPSTLGG*GGRIA*SQEF 
ETNLGNTVRPCRNSGEVLNRKTVS IRRA 
LHVFSPLHTA 


7157 


21058 




7216 


1 


285 


HTHMCRGAV*VRGARTVCTCGSAPFLES 
PGKEPKDTLRPLFVSQAARAFLGEKAFC 
SVYRQREPPPI 


7158 


21059 


A 


7217 


1 


386 


GDSKCEWSVLYVIIATPGIGVALGILSW 
TVICMLKHQNGKPKMKSKYKNLDATDQE 
SL*LKPASRAGIKQKGLLLSSSLMHSES 
KLDSDDAIFTWPDREKGKLLHGQNGSVL 
NGQTPLKATSPREEIL 


7159 


21060 


A 


7218 


70 


379 


NILYFNI FFFFK* FLKFLWVFFFFFFF* 
WGPFFLFFRFFFCLGFFGTFQSWRWFFA 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamlc Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, F=Froline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=Tryptoplian, Y=Tyrosine, 
X=TJnknown, *=Stop codon, i'=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IiFPWGLMFLFPLLFFLVFF*LVFF 


7160 


21061 


A 


7219 


1 


208 


PRLAASVSPGDMLGMQVLGLGTVASACN 
CS TLGGRGMG I T * G * E FETSLVKKARTC 
FYKHLKRQKSANL 


7161 


21062 


A 


7220 


455 


96 


HYLYGSQGLSRFYFVTQGRVQWHDHGSL 
QPPTPGVKQaSSLSijPKCWDYKCEPPCL 
AB.PCPFFFFL*TSSVDTGLLLNQQPIL* 
IMTFPYPLLRKSLSFHHQIYEPTSICSH 
LLSFCYT 


7162 


21063 


A 


7221 


201 


319 


IYELWPGPVAHACNFSILGG*GEQIAQV 
QEFKT3LGHLG 


7163 


21064 


A 


7222 


307 


3 


NRRQAGPFPAKLGRPSPKGGFPNFFKLF 
FFKSSF*KNPKGQGWGFPPLIPGFQGPQ 
VGGSLGAPGLKPPWGTPQNPFSKKKKKK 
RSNCTDTGTTIAHHRQM 


7164 


21065 


A 


7223 


331 


141 


SETITAMVSPNL*DANDVPIQCEISPLI 
S YAGEGLERYVADKEFHAPVI IDENGVH 
EWKNGI 


7165 


21066 


A 


7224 


157 


314 


WLPGFRDGGEDGVCHVAQAGPKLRT*GD 
PSIKASQRAGITGLTHRTGPK*KF 


7166 


21067 


A 


7225 


2 


392 


RDVRSLQPLPPSLK* FSCISLLSSWDYR 
CAPPTT*LIFILLVEMGEHHIGQAGLEL 
LTSGDPPASASQSAGVTGVSHRTRPPS F 
VL.VMPIPPLLPPVPSPDGPNISSSTTSW 
RGYPEVLRWGLCB FWGCG 


7167 


21068 


A 


7226 


84 


204 


FIFFFFKR*SLTMLPRLVSNSWARVALP 
PQRPKVLGUIV 


7168 


21069 


A 


7227 


3 


307 


DRAMRGAGRAGA.VGPHRTGDWaiiAQTP 
AGRGHARHIGRGPKQGRAAGQRGHLHPG 
GLI.GRGGNSEP*RKQRTPGSMRRAAPIiL 
STGQVWEGPASVGDRVP 


7169 


21070 


A 


7228 


52 


1176 


GSQQPLLHISEAQGDTLIKKAPAEAAWP 
PLIAFSSGGRIPCSHES *SAA6YKRLDD 
TAAOaPPSPOaSPRSPATAPGiaPVWGG 
QLRPRGGRGLPATGSGEAFQALTQLSRG 
RACTGPPAPGSEDAAPSGCSAaRWSP*M 
AGSSSCaiPDSftASCICPQGPPWRW*SR 
APARNRCSASSGRSRC*QLRRRRY1PGL 
LHPVIiEIEALPDSWGCETQLGPPLASEAA 
PPRSRRREEG*RaSGRC**KPSLPRSWS 
LCRNQAEAGFVSILVWIiSGLAGAPSRPA 
AAAPQIAPNQHRSPPPRIiAPVPAASGSP 
WPWERDLGVLETLHViafFflQDPNPSACS 
GPQSTPDTYYPVQKPKTGLRLHFHSRPN 
TIVKSVGNMCL 


7170 


21071 


A 


7229 


3 


112 


ASNLGSRGCSEPRPCHCTPAI»V*E*DS I 
SKTNKQKN 


7171 


21072 


A 


7230 


103 


299 


AMSQIRNAIPFTIAT*RI>*yLEIQLTKE 
AKDiyWENYKIPLKEIRDVTNKWKNIPC 

S*IGAQLCK 


7172 


21073 


A 


7231 


2 


137 


SLTLSLSICFCL* RWGLS VL PRL I SNSW 
PQAIFPPWPPPWLELQV 


7173 


21074 


A 


7232 


3 


273 


LTHIVDGPNFLTTMYQCLMCVC*QELGY 
QPVTTEHCHWVCNDCLQRSFKAQVFSCH 
ACRHDLGQNYIMI PNEIMQTLLDLAFPG 
YSLGR 


7174 

.. 


21075 


A 


7233 


338 


136 


RPRWEDCLS PGVQNVCGQHSES PFLQE I 
*KISQTWWCKPWPSYQEVEGELLEIRG 
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nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
otl 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ingto first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














STCFGLPKCWD 


7175 


21076 


A 


7234 


437 


261 


HLGFPECWDYRHEPQHPT*FLEHFM*RR 
GLTTFP2LVWNSWPQAIFPRWSPKVLEL 
QT 


7176 


21077 


A 


7235 


41 


264 


VTVMEIKGKINLVLDIIKDIYVEGFSTY 
YF*RQGLTMLPRLDLDSWAQAILPPQPP 
E*MGLQECNHHAQLSRYF 


i\n 


21078 


A 


7236 




154 


GTRDFSVKTL*ARREWHDIFTGMKEKNF 
YPKKVYALKI PFKNEAETKKVEV 


7178 


21079 


A 


7237 


1 


179 


SCLSLSSS*DCR*PPPCPflNFVKSFFRW 
GVLTMLPRLVSNFLPQANLLAQPLSGFI 
ILL 


7179 


21080 


A 


7238 


340 


140 


KINWVWWPVPVIPVTPEA*AQKLP*PGK 
LKFPLT*IPPLPSRLGHQSQTSFPKKKK 
KFYILSSDSV 


7180 


21081 


A 


7239 


2 


175 


QFESKPSPAWATEQDPVSKKKKKKKKKK 
KKKMKKKNSSKFKMIPF'IKRQY*EGNWK. 
K 


7181 


21082 


A 


7240 


336 


192 


QSETPSQSNQKFIF*GQGIALLPRLVLir 
SHAQAILPPWPPKMLGLQA 


7182 


21083 


A 


7241 


1 


186 


DRPGQHGEMLSLLIFCFFCRDGGLTMLP 
RLVLKS *AQVILPSRPPKALGLQDTVHC 
SKFDAV 


7183 


21084 


A 


7242 


2 


111 


VKTLKKLGIDGTWLRIIHAIF*KPIAtlI 
ILNMCSFL 


7184 


21085 


A 


7243 


360 


217 


LPSRGDYKRKPPCLVNFYFL*R*GLTLL 
PRLISNFWAQMILLPQSPKC 


7185 


21086 


A 


7244 


35 


276 


VQQQRVKQ6RKYYS I VLKUKKKLCNYLR 
VGE*MG*RNIGWVQ*VAHTCNPSTIiGGR 
GGRII * PWEFETSLOflG *TCISG 


7186 


21087 


A 


7245 


1 


356 


IQHTFSLLIFGIKYSYYVLKIVSWIAAV 
AHACKLSTLGGRGGWIT*GHELETNLFF 
AVKPWVYDKNSFPPTLMIiRFAKSGHLWE 
PRGKKHGPPETPPLSRVSPPQRFFPGGT 
RPLKKE 


7187 


21088 


A 


7246 




293 


CKCGKAFHSTFYSFQNHEISHTGEMLYDI 
RNVGK* S I FF *HHCQHKTTHRAEKLCEC 
KICREAFSHFGNLKVHEAGRSGSRL*SQ 
HLGWANCLTSGI 


7188 


21089 


A 


7247 


6 


387 


ASIPCLTYPSWEFL*FI*QAGHFSFSKK 
KLFICAFFSNRDEVSLMLPRLGLNSWAQ 
WLLPWPPKVLGLQVRATTPAKLGIFWG 
FMSVSLGQGVCLCRSLLCPLPLGPGQVH 
SGCSQSFVEKMNKEL 


7189 


21090 


A 


7248 


373 


146 


HLGKKVLIFWPQYLALLNPPNGGVSRVY 
PRCRPLMVFFFFFFFC*GRGLAMLTRLV 
SNSWSQVIFQLQLPKVLGL 


7190 


21091 


A 


7249 


2 


184 


ailrgi*kkfistlmddfegiktslE;ev 
tadweiahevelevevepkdvtellqs 


7191 


21092 


A 


7250 


795 


544 


emkscsccpgvjvqwrdlgslqpppprfk 

RS P as ASLVAG I TAACHHARLI FVFL ID 
TGFHYVG * AGLKT PE a* ATAPGP I FL YF 


7192 


21093 


A 


7251 


34 


242 


KNTSLKKKFFWCQ KLRWVS LVE E FAFW P 
GAVAHACNPSTLGDRGGRVA*GQEFQTS 
LSNIGRPRLYRKI 


7193 


21094 


A 


7252 


216 


57 


NIQVLKYVLTSYLKKKKCAFRLGPVAHT 
CNPGILGAQGRKIS *AQEFETSLS 



993 



w o 01/64835 



PCT/llSOl/04927 



SEQD) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 

location 
correspon 
ding to 
last amino 
acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalaninc, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=TryptopIian, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertioo 


7194 


21095 


A 


7253 


126 


366 


INLTECQQYLDKAI FFFFLKKGLFAP* A 
EWQGGSFASLQPKPPELKBCSSHLTLLST 
WDYGGALPCEAKFGIFFFWKKGG 


7195 


21096 


A 


7254 


2 


213 


RPTWATWQNPASTKKKKNSFFFETEFRT 
VAQAEVQGGNLG*LQSLPPGLKRFSCLK 
MPEAPKGQSCSLIG 


7196 


21097 




7255 


150 


263 


EGVSLLPRLACNGAILAHCNLRLPTSSD 
LCAFS * TTTRYYKYSGYCFSHPNLMLIC 
SRFTLLSSHSSLGHLTLFLLIVCFEKES 
HSCPGWRAMVLSWLTATFASREQATFAP 

SLELSR 


7197 


21098 


A 


7256 


386 


93 


PPGGLPPQSIPSPRAQLKPKSPRGKKRG 
GP KP GGSQPGFWGFFFGPGP PFKRGGGL 
NWVLGAKN*KGVKKNQKKKKGKMPQETK 
RGNRSIDMKFGGS 


7198 


21099 




7257 


153 


369 


KENSRLQGYGPLFSHS*FTNSCPGQVQW 
LTPVILALWEAEVGGSKDQPGQHGKTPR 
LLKl QKFARCGGGRL 


7199 


21100 


A 


7258 


180 


3 


CSTLI* IRKVWLGAVAHAYNPNTLRGRG 
GRIA*GQVFKTSLGNNVKTCLFLPSPHKr 
OQ 


7200 


21101 


A 


7259 


22 


265 


QLEAIPDLKLDPLMEGKEILSRTLLGSG 
TVAPICDPSTLRGQGGWIA*VQEFKTSL 
GNMAKPHLYRKKKLLGQVTKLEYKW 


7201 


21102 


A 


7260 


326 


2 


KSAMQI CXVDPLNAFCLE ICCHANLHCR 
PSNAFCFLKVDNVRAFVSPS IPQVSLIL 
KNFSWPGWAHTCNSNTVRGQGERIA*A 
QE FETILGNTGRPCLYFLKNNRKS 


7202 


21103 


A 


7261 


T 


68 


VNLCFILWLGPGWAHACNPST*GGRGG 

T 


7203 


21104 


A 


7262 


282 


176 


GLKKTFFFLVRERVMiCYPGWSAW* SQ 
LTAN*TPGLKLFSCIiSLPCSWMHRHVAL 
HVAENIC*CHMPWPATWEAEAGEQLEP 

RSSVCSEL 












351 


NNTVCLLES I SS VNLCLPHLQHS PVKLE 
IFLFFFSSDNFPSTHHITVRPLSPS*AI 
IFPVTCTYTSRWPEATKDPQKK 




21 106 




7264 


327 


2 


KI KERINNPI PI CRDRVS I CCPGWNAW 
TLNSWAQaiRFCLSmSS*NYllCAPTHQ 
QVTTIFFCRDQIEIHMVPKYHLGNTLNY 
FFVLFCFVLFLEHPFSAHRNFRLP 














HRALPSCY*FYVTIiHBLTVACa. 


7207 




A 


7266 


3 


227 


SSQVSLPNNWDYRREPPCLRFVFLFFVE 
TRSCYVaSLELLGSGDPTVLAS *SaEVI 
DMGHCDWPNLSEILIiHVR 


7208 


21109 


A 


7267 


300 




EMLIiNTIMCNREILCKRKSPRVQQTSFV 
SYFKKLPQPPQPSATTTLISQQPSTSKQ 

YIMCFLPHTGIVHLVN 


7209 


21110 


A 


7268 


29 


344 


ASLLALKKKSGLGAViVHAHNASTSGGHG 
GRlT*AQEFATSLGNMaRTCLYKKKKKK 
KKKGKNFTGVGGPQHGAPIFGPWGGGKS 
PAPGKPGGPNPFPGKGAPTLF 


7210 


21111 


A 


7269 


3 


209 


LHPIjSPGPPAPVAaGIPGPQASR*GPCR 
PDPSPDSASVDASLPGPALSLWSWGSNR 
GEYGTTPASQIH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

Tq^te 


Prcdict- 

nucle- 
otide 
location 
correspon 
ding to 

peptide 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=GIutaniic Acid, F=PlicnyIalanine, 
G=Glycine, H=Histidinc, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7211 


21112 


A 


7270 


86 


206 


CQPGWAHAYWPSTWGGRGRQIT*GQEF 
ETSLGNTARPPL 


7212 


21113 


A 


7271 


45 


274 


GLGTVAQACDMPSTLGG*GGWIA*GQEF 
KTINNDKSNKWQYIiAI lEQIHNRMLICK 
NKKELTDICNNTDEFQKHRE 


7213 


21114 


A 


7272 




122 


VLVETGYLEVGQAGLKLLTPGDPPALAS 
QSAGITGISHRA*SYMHF* LVETGYLRV 
GQAGLKLLTPGDPPALASQSAGITGISH 
RA 


7214 


21115 


A 


7273 


3 


242 


LLKQLDYHP PHPAKFFFFLE FLVKKGFH 
LWKGGFKTQTSGGLPG*ASKGVGIPGV 
SHCPGLIGVFNLTLKVAVSLMDL 


7215 


21116 


A 


7274 


2 


169 


IAJjVAQAGGQWHDLGSLQPLSPSFPQFF 
YLSLPSS*DYRHEPPHSAWFSFFSMYR 


7216 


21II7 


A 


7275 


353 


76 


FAHLGLPQCWDYRREPLCAT*KLFLKGK 
*PSKFQVIFSSLGTGHFSNACFSCVMWT 
IVLLIINFWVTGIDGRTSGSSSVKCGNH 
SIYPMVLL 


7217 


21118 


A 


7276 


6 


87 


RRGLTLLPRLGSNSWPQAILLPWPPKVL 
KL*AQTPGLRQSSCLGLPKC 


7218 


21119 


A 


7277 


1 


203 


FQVKMLCSGWMWLTPVIQALGAVAHTGN 
PSTVGGRGGQI I*S **FYTSLDSGNPPA 
SASQSAGITLA 


7219 


21120 


A 


7278 


272 


51 


IFFPVlI*CGGHDVQHHFlSIPS*lSn!ILSSWS 
SWFMPIILPLWEA*AGK*LEPRSriRPAW 
ATKLDSNSKKKIIEIHA 


7220 


21121 


A 


7279 


1 


131 


SRDYEHGPLHPTD FVFFVEMRFHHVAQA 
GLKLLGLSDLPALTS *SHDYHHGPLHPT 
DFVFFVEMRFHHVAQAGLKLLGLSDLPA 
hTS 


7221 


21122 


A 


7280 


411 


36 


FLECVPNFFPPKKKFFFKKMTGQVFFPV 
TFL*KGPALFFFIFPFYGKKFFFLSSAK 
DFPPRVFF*KGPLFFFFFFFFFFFFFFF 
FFFFFPFPPFFFFFFFCFKHFIYKKVHF 
*SSVHPQPIIFF 


7222 


21123 


A 


7282 


395 


181 


RKHES IHHDSVISVTFHMQSLRLFFFIF 
FILSL*LFRFIFFFSFFLVFFFFFFCFF 
FFFFFFFFFFFFFFFFF 


7223 


21124 


A 


7283 


392 


2 


IKFFGYNV* FNYILKNGriF*QLLI*LIN 
LPILRQGLIFWQVCaiVLVFFFFFFIFF 














FFFFFFFPPFFFFFFFPEr.TQNMLPIVN 
FCKDTFKQQNIHLRTRGR 


7224 


21125 




7284 


113 


291 


YVCVCVCVCVCVCVMEY*SAIKRDKIVA 
FTVTWMELETIILSEVSQEWETTHHMLS 
LIN 


7225 


21126 


A 


7285 


327 


44 


LI KVSPNFSPPKKKVIiPKKPPCTIFPPS 
PF*BCTPPEPIFIKPPHKKKNFNSPPPQK 














LGFSSKLLIR 


7226 


21127 


A 


7286 


3 


215 


VILIIFMI*EAFAWJ»KVLIVEEPSINL 
E * LYDGGYYDGLVLTSGLGPY 


7227 


21128 


A 


7287 


367 


119 


LKKAPAPPFLNPP FWGKNF ILS PPEKMG 
PPHGFFKRGPPFFFFFFFFFFFFFFFLG 
FFFFFFFF FFLQYHMMYFH* ILI IWQ 


7228 


21129 


A 


7288 


73 


259 






FKKTPRGAQISPGVKKKHFPPKGGFKKK 
QAPGPF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last^amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspai-tic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Vallne, 
VV=Tryptoph an, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


7229 


21130 


A 


7289 


337 


53 


LRSFAPFHLP INNLLTDIHMVI PLPPSS 
LCSHYLFNEVTVTTLFKTTTHLQLGTMA 
HTCMPNTLGG * GGRSA *AQESE TSPGNM 
AKPRLYLLNF 


7230 


21131 


A 


7290 


1 


96 


KHHFGFEAAA* YWHFVDW* LFLYVS I Y 


7231 


21132 


A 


7291 


288 


2 


SRVEFXGPFMAHCSVTIPGSSYVFTSAS 
* VAGPHGARHHAC * FLF* FLV* MLLEMG 
SRHV\,'QAGL*EVSSSDPPAEACQSGGIP 
GVKPHAQSVSV 


7232 


21133 


A 


7292 


121 


233 


YFPKKKKKKKKKKKKKKKKKKKKRKKK* 
KGGGRSRKSYLG 


7233 


21134 


A 


7293 




346 


NPLNPLVNLTVSPKRNSSLDTRKKPCRK 
RHREKETLRKKRWQHTEARGRQREWHPE 
TE* ETGQRDRDKQTKKQKDTDI QAYTYT 
DREKEDKKRHRQ IR*AEAERIRDTERQR 
DI 


7234 


21135 


A 


7294 


46 


346 


ETGLKKWGDLSNGTVRAVLNLAQADFCV 
S IPSYHIKHGNTKKHCGGEGKKKILWT 
AKKSE PGWTHACN PNTLGG *GGRVT*G 
QWFKTGLGMIVRPCST 


7235 


21136 


A 


7295 


254 


12 


RKDAEHNGSRAE* IKRVKDLVQSKLKFF 
QEI.EKEGILPNPFYEAS ITLI PKPDKDT 
SKEDYRPISLMNIDAKILNKILAN 


7236 


21137 




7296 


258 




CIGQCVLVAVGRELLCVSRSWCGLLHVD 
EVGGEALGRLLWYPWTQRFS E S FGDLS 
TRDAVMGNPKVKaHGK*VLGAFSDGLAL 
LD 


7231 


21138 


A 


7297 


191 


263 


VCSKCEPFIYIMDYYAAIKRNKIMSFVA 
RWVQLEWSELTQEQKTENNMFS*WFG 


7238 


21139 


A 


7298 


2 


164 


SRVAGITGftHHH&QLIFVFLVETGFHHV 
GQAAFKLLTSNHHARPQ*LHFFYFFK 


7230 


21140 


A 


7299 


307 


8 


VMI ITFWLPHLSGYIEKSTPYECGFDPI 
SADRVPFSIKFFLVAITFLLCDLEKALL 
LPLP*ALQTFNIiPLIVMSSLLLIIIIiaL 
SMAYE *IiLKBLDCSE 


7240 


21141 


A 


7300 


205 


317 


KSSPACTPSTLGG*G6WFA*VQEFETSL 
DNMVKPQLY 


7241 


21142 




7301 


140 


22 


FNFLFLYP*NYFFFFFFFFFFFFFFFFL 
FFFAMNINVFY 


7242 


21143 


A 


7302 


382 


151 


S IHRSEFRPRRSHSVSRPESRGI ILVYC 
SLYFLGSGDSAPWS *VaGTTGVCPWAS 
LIFGFFFFFFNRDGVLLSCPG 


7243 


21144 


A 


7303 


318 


17 


LVPTKRPQVEaSYYPVHTYIWYCTKGKHK 
WAGIG6S CLSSQHFGRPSWQDPLSLGV* 
DQPGQHRETPSLQK* KNWWQE PIVPATQ 
EAEAGEWCEPGRQSLQ 


7244 


21145 


A 


7304 


310 


132 


QWCDLGATTPANFCXFFVRNGVSPCWSG 
WS*PPDLKLSARLHFQSVFFNPI IS IGL 
FFL 


7245 


21146 


A 


7305 


338 


168 


REPPLPAANFVFFVEMRSHHVRQAGPEP 
PSSSDLPASASQSAGITGVHCHAWP**T 

YK 


7246 


21147 


A 


7306 


276 


161 


I INARI KKWEFMS SAGTWMKLEAI ILSK 
LTQELKVK*KLRKFNTDTILLLLLIFKr 


7247 


21148 


A 


7307 


310 


61 


ITIiVYCNFFYFlNFNFLKLFDSFVI ILS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leueine, M=!VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possjble 
nucleotide insertion 














LKQIYSCTAIVYILIL*AFVQFKFLCFY 
FLYFKNI FI YKNI FYKIKYV^NMYFYF 


7248 


21149 


A 


7308 


257 


■ 


CC3RVDLH*AVFFFF}CRWSLTVLPSLDLD 
SWAQVILLP*PPE*LGQQSCTTPPSSLS 
TFLYAFLIII PVNICi^SCFVGKDRGMFFSY 


7249 


21150 


A 


7309 


2 


181 


RDGFCHTGHTGL*LLTSGDPPASASQSA 
GITGVSHHMJPCLHSLTAFFKNWMVWPG 
QQE 


7250 


21151 


A 


7310 


25 


340 


FCVFNTMDVKN 1*1 FLIFVPDVHLG*QK 
NLLLDTI*KKKSIRIQE**QYKSISKEN 
YNTVSDKSWLGTVAHRCNPSTLGGQGGR 
TA* TQE FQTSLDNTVRPCVYY 


7251 






7311 




193 


FVDFSMHILDMLSLYLSIVICYITLFLS 
I IDN I YWGPHK IIIPYFYCIKYIFYIY 
IWFCFWFFETVLLCCFSVFTFNDYCSYL 
HYFLLLGVFGLFHFF* FLYLYMVLFLVF 


7252 


21153 


A 


7312 


1 


245 


RTPGLKQSSHFSLLSSWDYRHVPLHPAN 
AFFFFKGP*SEPLTliS.SSLHWYFPGGHF 
LQGPAKIDDPNFS P I LGKLKKKSPF 


7253 


21154 




7313 


167 


2 


GKPCLLWMCRDSQKCGEKGTRLKREAL 
RMGK*GQARWLTPVIPALWELEAGGSP 


7254 


21155 


A 


7314 


334 


152 


CHVAQAGVQWYNHSSLQP*PPGLR*FSH 
LSLPSGWNHSWTPPHPANFFIMGSCHLP 


7255 


21156 




7315 


2 


164 


PHVGQDGLELLTSGDPPASAFQSAEITG 
VSHHARPTFYLS * QVGLTVTS FRNPQ 


7256 


21157 




7316 


155 




QRIKH* FHWLGAVILTCNPSTLGGQGGW 
ITGGQKFETSLAYMATPRLY 


7257 


21158 




7317 




293 


GIPSTWDYWCTSPQPHF*FFFFFF**KR 
DLAMLPGGVLNSWIPAliiQPKGPKVLGL 
QAWTILYKGYYGKEMKHKEICKGTHDVL 
SKLLTTKRFKRLL 


7258 


21159 


A 


7318 


262 


73 


SNSLVYLIKPDSVYIFLLFVETVSSHVA 
QVGLELVASSSTPI *ASQSVEITGVSHH 

ARPKKVT 


7259 


21160 




7319 


169 


265 


GGRM ALS PGRGGCS E P * GCHCTPAWETE 
QNPA 


7260 


21161 




7320 


169 


260 


KPQKIPVACDQQFTDKRNCSHGDYEC2M 
VDFFFFFFRERVSLCHPGWSAW* SGDH 
TTALQPG 














FFFFERESRSVARQPGVQWRDLSSLKSP 
PPGFKRFPCliSLPSSWDYRRAQPCPflNV 
FEFW*RWGFTTLARIVLLS*PCDPPTSA 
SQSAGITGVSHCAQP*FFFFERESRSVa 
RQPGVQWRDLSSLKSPPPGFKRFPCLSL 
PSSWDYR3AQPCPAHVFEFW 


7262 


21163 


A 


7322 


130 


26 


SPGPIiFGLIBPLPSRFKGFFWLlIi:iP*TW 
EYRIiALTAPIKFFKFLIKTGFPH*SIKIi 


7263 


21164 


A 


7323 


17 


303 


KSSGTIKeRAQSPFtQLVAIEQCQDCFH 
FRSQF*YLKKQLQPGMMMHaYIIPSySKG 
*SRRIT*AQEFKSHI,GNITKLCCLKIKN 
SLIFKITENIE 


7264 


21165 


A 


7324 


194 


8 


DWVEVLKKSTYRPDTVAHACHPSTLGG* 
GGS IT *GQK:FE IRLGIWVGPCLTCKWLTN 
QLGVGG 


7265 


21166 


A 


7325 


413 


72 


GVPIFLKTLTGKTIS3ERESKVPLEKGK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 
nucleotide 
correspond 
amino acid 

sequence 


Predict- 
ed end 

locntion 
correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, I>=Aspartic Add, 
E>=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyi-osine, 
X=Unknown, *=Stop codon, A=pos5ible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














ANFRDKRGI PPHHKRVIFAEKKL* KGRS 
P FDDGFQKKLTQPRVSAFKBHP PLQPLY 
AF I KS FREGTHHQDCP FWHLTLLSARRQ 
Q 














NVKHTHTHTHTHTHTHTHGWCKSDTGYS 
SDS*CITPRCIHDPTWPSSSQTVNLAMG 
LTLAKEVTiANMTQTEV 


7267 


21168 


A ■ 


7327 


232 


22 


SRPSGSKKGAPALQQSSCLSFLSSWDYR 
HKPPYLTNFCIFL*SQSFPMLPRLLSNS 
QPHSNLPKWELQV 


7268 


21169 


A 


732S 


232 


3 


YHRSSGSLISLIQIRLRSKI*APSIPII 
KEWLSSVAHVYNPNTLGGQGGRLT*GQE 


7269 


21170 


A 


7329 


364 


3 


PKARRATVRRTRVSKAPSWRQCLGVNTH 
VLGHRRPVKGNAKTRWSASESQAHTCNI 
TTLGGQGGLIT*DREPESSliATMQNPVS 
VSGWLQKMPS KQKLYS KFFPITVAYKAFR 
IWLLCARA 


7270 


21171 


A 


7330 


177 


2 


DPHLHKKFKARCGGMTL*SQLLRRLR*K 
DQLSLGVGGCFELLWCYCTPAWVTERDP 
SC 


7271 


21172 


A 


7331 


413 


40 


RHMYLMFYTYIYVLDI YTYTVF FM YI Y I 
P*CVCKYISCLHIMYRFISMYPCVYVYV 
CMCVCNVIHVIVENAGSCLALSALIEME 
GMARDSKMWTFNPDEKMRLSYKKQLEKL 
NFMILCPTWKEF 


7272 


2II73 




7332 


147 


49 


KNYIYFCRRGLPMLPRLGLNSRAQEILL 
PQPPM*LGLNSRAQEILLPQPPM 


7273 


21174 


A 


7333 


148 


361 


SWLGTVAHAYNPSTLGGRGRQIT*VQEF 
ETSQSNIARPLLYSVLAYAVQPPLPMVA 
MRKLWLIPTSRITAL 


7274 


21175 


A 


7334 


305 


2 


C PGGFVTRVHR* YVRLSQPDLRVFTSGA 
MPASAAQSAG ITGMSHRARPVCS 1 1 1 FL 
NIFKVTCIFVELTIPNQNKQNTFLSSTV 
ASCRILASRPTPIVWL 














FFLRQSLLYCPGWSWAQSQLIASSITQ 
A*AILPPCSWDYSSQHTWLKFFFC*DGF 


7276 


21177 


A 


7336 




206 


GTRflDHLMSGFQDQPGGHTETLSLLKIQ 
RSQAWWHVPVMLPTR* inSHESLEELMT 


7277 


21178 




7337 


65 


302 


KSSLFYFLKIETGFHYVAQTGL*LLSSD 
DPPASVFQSAGITSMSHHTQPKELLKST 


7278 


21179 


A 


7338 


208 


38 


NPVSAKNEEYPRGTFDIiGTQAEVQWHDL 
GSLQPTPPGLK*FS*LSPRV 


7279 


21180 


A 


7339 


2 


198 


ARADHPRSGVRNHPGQHGETPVSTKNTK 
N*AWWRVPLVPECWDYILFYMFTNCMDN 
KLPKWLTLC 














ARGEIMSFARTWMQLEAHILSKLTQB*K 
AKYCMFSLLSGAKHWVLMNIKVGTGDTH 
DYLGCSTQAVLKNLGTRftE 


7281 


21182 


A 


7341 


3 


300 


HEGRNYTLTGRDSCTLAASAEKaCOaCP 
LWGKCDAESSKOTCREaSDCEEEGISlC 
VDVNCKEQTMPEC*AVAL*CTVHLIYVT 
INNALCLPQLTLFL 


7282 


21183 


A 


7342 


210 


351 


RGL6TVafiACl!tLl!tTfG(3*eftWll*AHEF 
ETSLRNITKPSLRENPDML 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

26 


Predicted 
beginning 
nucleotide 
location 

ingto first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutamic Acid, F=Fhenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \'^possible 
nucleotide insertion 


7283 


21184 


A 


7343 


306 


164 


lYMLYFIYITLIYIHTHTHTHLYLDKYI 
YL* CVHCFL* *AYMI CHQR 


7284 


21185 


A 


7344 


442 




SQMSKLKTWFRAVAHAWNPSTLGGQGRR 
IT*VREFETSLGNTARPNLYWKr 


7285 


21186 




7345 


3 


248 


HELSLSLLLSPFFFKNEKGSR*VAHRGL 
KLLTSSDPPT*ASQSARITGISHCARPL 
LSHFLAYLALPCHQNRLCTGHLPST 


7286 


21187 




7346 


2 


246 


HELYQSILLSPLFFKNDIGSR*IAHAGL 
QLLT-* SDPPT*ASQSARITGISHCARPL 
LSHCLAYliALTCHQNRLCTGHLPS T 


7287 


21188 


A 


7347 


1 


130 


GTRLLKRLTQETRLNPGGRGCREPSLHY 
CTPTWARQSQTVAHM*KHLTQETRLNPG 
GRGCREPSLHYCTPTWARQSQTVAHN 


7288 


21189 


A 


7348 


516 


288 


FFETESRFITQTGVQWCDLGPLQPLPPG 
FKGFSCLSLLSS*DYRCLPPHPANFCIF 
SRDGVSPC*LGWSQTPDLK 


7289 


21190 


A 


7349 


417 


148 


STS VWQAGVQWHDHGLLLPGPPRLK* S 
SHLSLPHG*DYRIIILSCLANF*NIFVEA 
GFRHTAHGLEPLGSSDLPASASQSAGIT 
GMSHLA 


7290 


21191 


A 


7350 


143 




TQKFKTSLSSWDYRRVSAHVANFC I b'SK 
DRVSPy* PGWSRTPDPARA 


7291 


21192 


A 


7351 


189 


37 


FAIAVRRII*NLFFFFFF*RWGLTVLPK 
LVSNSWAQAILSPQPSICVLRLQA 


7292 


21193 




7352 


131 


2 


KSRLGTVAHACNPSTLGGRHGWIT*SRE 
FKTSRANMVKPRPRA 


7293 


21194 




7353 


1 


106 


GTRGQCLTMLPRLVLN YWAQM IL P PW P P 
*VLGLQV 




21193 




7354 


939 


653 


KGNFI FLRQSFTLVAQAGVQWRDLGSPQ 
PPPPGSK*FSCLSIiRSSWDYRHGPPHPA 
NFAFLVBMGFLHVGQAGLELPT* GDPPT 
SASQSAGXTGR 










270 




EARFLGLIiRDIAQLCyPGWSAAV*S*LT 
AASTSWAQVKASTHLSLPNN*D*ARATM 
PG*LKKYVFP*RQGLAILPRLRLVF 


7296 


21197 


A 


7356 


329 


206 


HHIiftNF*TFCRDRGLTMLftRIJISWPQAI 
LLPQPPKVLGLQM 


7297 


21198 




7357 


128 


253 


MRPDTVAHTa!irPSTIiGGQGGRIT*THEF 
ETSI/3DMMKPyLYK 


7298 


21199 


A 


7358 


2 


108 


AREGSGGQGCS EP * WRHCTAAWVTEQDP 
VSKTIKK 


'7299 


21200 


A 


7359 




340 


GTRERNWHLA*ALHTHTHTHTHRHTHTH 
THPSFPA]!IS*RDTFVIF*QSLLLEIMTS 
FSPSLGLKCSFSQQPSGLLSSIAGERDS 
RIHGENTVSMCLHNRLKRELRSDVISLR 


7300 


21201 


A 


7360 


385 


1 


SVGPSKPCAGYKLLVCHLLRPLEKRSIR 
VGVT*FSRCHLSPLSLTRKANSLTPCTS 
WMRRCLILLRLTLGVPHPMSCTHCPTLP 
SEMNLGPQLEMQKSPIFCVAHAGSPRLE 
LFLFGHLGSTPNLPSC 


7301 


21202 


A 


7361 


234 


29 


LSTEPKGSKALKPtIVTLLLNIAP*CNSG 
CNSNL* S * QQY*RVIQSHHLSQTLWLTP 

VIPALWEAEAGG 


7302 


21203 


A 


7362 


68 


339 


DRDIAHKPAWQLCQSRILYLAQLQFtDQ 
GELLPDFRIHYKVI *TCTAWH*RHNRI,I 
DQWNRAD I PDIiNP CQAWRLTP VI LVLWE 
ADVGGS 
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SEQI0 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A°=Alanine 
C=Cysteine, »=Aspartic Acid, 
E^lutamic Acid, F^Phenylalanine, 
G=Glycine, BNHistidine, I=IsoIeucine, 
K=Lysine, l;=Leucine, M=Metliionine, 
N-Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CF YS THAHI F KE LRQRPGAVAHAyNLS T 
LGGRGGRLT*GQKFKTSLfiNMVKPHSC 


7304 


21205 


A 


7364 


399 


62 


QPGQYGKHPVLIKHSKIKPFWGDPPWP 

NAREG * A* KMVE PGKVRVQSAQ I KALE F 
NLGPKKKVPFKKKNPWIAKGGGLRPEIP 


7305 


21206 


A 


7365 


289 


76 


VCDVLRESRAQLSRLKCSGI I *AHCNVS 
VDRFEAYGSRGYVCA 


7306 


21207 


A 


7366 


364 


182 


QWHDHCNFKLLGSSNPPTLAP*VAGTAG 
YFWF 














ELGTKREGWE KSATVAHACNPS SLGGRG 
RWIT*GPEFPTSMANMMKAHPRA 


7308 


21209 


A 


7368 


342 


184 


LSLPSSWDRRYTPPRLANFFKFFFKR*S 


7309 


21210 


A 


7369 


2 


211 


ARE PKCL* TDEW INKMQYIDAVEYHLAI 
KLNEILTHATTWMNLENMLIGRS*TQKA 
TCCVIPFMDSIYV 










190 




YKNVTVKRILTWPSTVAHACNPSTLGGQ 
GGRFG*GQEFETSLGKIARPCLY 


7311 


21212 


A 


7371 


24 


254 


KPQGIGAHASSQHYGRPRREDHLSLGV* 
DQPGQHGEIPSLQKKKKKGLPGRG6VCL 
KPWFLKIL6EKHVWVLTPKG 


7312 


21213 


A 


7372 


43 


358 


DSNLNYSLFFHGEADLGTTQVLTHPSTT 
AMYFVHYCQS P * I LYGTIHT* PPWHKN 
PIHIKTPSPCLQASTAINLQLSHINCNS 
KATPHPLGYQQTYPPLTVHST 


7313 


21214 


A 


7373 


2 


288 


IHIIIAFTSSLLGILVYRSHLIASLLCL 
HGI ILSLTI lAPLI PLNTHSLLAPIGPI 
AILVIAA*QAAVGLAi:,LVSISTTYGLNY 


7314 


21215 


A 


7374 


335 




VQWNDFGSLQNPPP6VSPFFWLSLPNNL 
GFKGPPPRPGPFLKF**KPGFSALTRMV 
SLSRPRDSAPLAPPKAGVSRLNPRARAP 
FFFFFVFCFHFFLLSGEKKSLAHKGGI 














VFS I PPI FPPKFLKKNPFSPWEVFLFIiS 
FFQKDAFFGPQV* KGNFFPFKGPPPLTQ 
NSPQGVPPKKKKKKKKKKN 














NliFYIFYYILINFNILIILIINFFYFI 
LLIILNIILILYIIIYYFIIKNIIKT 


7317 


21218 


A 


7377 


2 


156 


FHHW*AGLELLGSSNPPTWAFQSAETT 
GVSHHAQLPHFILRKQELLSLGL 


7318 


21219 


A 


7378 


319 


39 


DTKMNSQDILYSIQSNRT*W*LIKRQ*T 

PSTSKQDPPAaK*L*PAEGRLLAFPSNN 

VF 














IPTNISTNTPIJ5L*FYLRLIYSTSISLL 
PISlSrNVKIK*QFEHTKPTPFLPTLIALT 
TLLLPISPPILIIL 


7320 


21221 


A 


7380 


11 


169 


RRDLftMVPRLFSSSPGLK*SLHLGFPEH 
WDYSHBPPSWLLFFLFLITEGYFF 


7321 


21222 


A 


7381 


300 




KFFPPPGFFFKGFFSPL*SHQKKTQIFF 
GEFFWGFKKFFLFFFPPGGFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLFFFF 



1000 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=LeHcine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FF*INSLFIQNSFLT 


7322 


21223 


A 


7382 


163 


160 


PGTSNDKIQRRNSRNRISPIAER'GLSL 
LPRLEWMGSNEAHSNLELQGSSNPPASA 
SKSTMITC3VSHCA* PRLEWMGSNEAHSN 
LELQC5SSNPPASASKSTMITGVSHCA 














VCVCVCVCVCVCIKWDSYSAIKKNELVA 
FWTWKRLETIILSEVSQE*NTIYHMSS 
LICGS 


7324 


21225 




7384 




196 


KIQMWPGWflHaCYPNTLGGQDGRTA*G 
QEFQTSLGNIARPCLYQQQQQQQQPRCS 

LTYDGDKS 


7325 


21226 


A 


7385 


3 


192 


RNLSSLQPPPPGLK*WSCLSLPSSWDYR 
RPPPCPANCFMLCEFFRTNGVFPCSPGV 


7326 


21227 


A 


7386 


256 


198 


TMLSGFLRSAGLTM IS * PHDPPASAS QS 
AGI TGVSHRARPLVLFLRDS LYSGTQAG 
VPWRDLSSLQSRLAEQMRFCFP 


13X1 


21228 


A 


7387 


164 


311 


S KQQKGLSGVAHPCNPS ILGGRGRWIN* 
GQEFETSMASMVKPCLSLLKL 


7328 


21229 




7388 


3 


218 


EIMSFAATWMQLEANILSTLSHE* lAKY 
CMLSLLRGAKHWVLMNIRVGTGDTRDYT, 
GCTYPSSTEQLGYYS 


7329 


21230 


A 


7389 


521 


58 


SDCGLTPIAQTGVHWHHLSSLQP*LPWL 
G*FSHLSLPSSWDYKyVPPCPASFLYYF 
LVETGFCHVAQDGLELLDSWHPPS SASQ 
SAGITGMSGCAGLRRF*KLRSWRYLVLL 
AMQVNEQGTSQA6EAVSEADAGQGCCCG 
LCLHRILSQQYALP 


7330 


21231 


A 


7390 


169 


322 


APGNFKTN* RKNGFWAGNTLGVS VKNMK 
NGWAWWLTS VI PALWBABAGGS P 


7331 


21232 


A 


7391 


1 


353 


GTS PHYGS FHGHQKDVTGKGSQSRPRBR 
VLGSHARKNSLPVHEVKaYIiL*KYLSIL 
RATHILISPDGCWVPIFIAQSYIFLSSC 
GFFLLFFLLLLTLSFCSWHLIISFLALF 


7332 


21233 


A 


7392 


372 


3 


IQAGACCAECIPSKGIRWNPSIVEGLHS 
DMSWSLCLFLSGAISVLCNFRLSGSSDS 
PMSA*QAARTAVIRHHTRLIFCIFSRDR 
ETGFHHVGQENLLMLLLQLFSGliGFS IS 
ATLLTLMLLV 


7333 


21234 


A 


7393 


362 


176 


ARMECSGAI *AGLRGSSDSLASAS*BAG 
NIRTSL 


7334 


21235 


A 


7394 


2 


304 


ARADCCARNE*GSVGMGPSEPGVGYNLL 
VFRFLRPLEKHSIRVGETRFSRCCLSQL 
CLAHTGNSLTPCTSWVRQCLALLRLMLG 
ALHPLSCTQRPTSHS E 


7335 


21236 


A 


7395 


256 


3 


MYYAARYWNKAKITTESLRHWAGHGGSH 
PYHPSTLGG*GGQIS*AHKFETSLGNMV 


7336 


21237 


A 


7396 


346 


2 


RDHFYRESHSLITYI CLENVKSHLKCGE 
GPLNFTFHIORKNCHLGHTRSPGGVYHV 
GRTETATGPIKRRNGLGAVAHTSIIPSNL 
GGQGGWIT*GQEFKTSLSLPRWLTPVIP 
LV 


7337 


21238 


A 


7397 


2 


242 


RRYSDYPDAYTT*NI LSSVGSF I SLTAV 
ILIIFMI*EA*KKKKKKKKKKKKKGGPF 
KKTLGGPKFNRRMEGKWFSLKGGR 



1001 



wo 01/64835 
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SEQID 
NO: of 

sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, 0=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I^Leucine, M=Methionine, 
N=Asparagine, P=Praline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=l]nknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7338 


21239 


A 


7398 


1 


289 


TKLLGLYYTLLQASNDFKSPFSIFDGIY 
GSTFFVTTGFHGLHVI IGSTILTI*FIR 
QLIFHFTS KHHFGLE ADA* YWHLADARR 
HSESITHQCCSL 


7339 


21240 


A 


7399 


346 


40 


IIIFTNKIYIKINKRLKLFIIKILDLYI 
I*IFFFRK*FTFKQ*KF*IININKIYYN 
NIK* * I FI YLFFFFFFFPFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLKK 


7340 


21241 


A 


7400 


1 


334 


NSCRRQRLLDLLLLPITL*LPQLIGYIE 
KSTRYECGFHPISPARVPFSIKFFLVAI 
TFLLFDLQ lALLLPLP *RLQTTNLPL IV 
MSSLLLI I IIALNLAYE*LPKGLD*AE 


7341 


21242 


A 


7401 


2 


302 


HTINLLGERYVLATTY* *STMTLLLTGL 
TVLHTGL YSVYLMTTTQWG* LTPHI INM 
KP * FTR* DTLVFIHLSPILLLSLNPDI I 
TGVVLVLLSYELYLLT 


7342 


21243 


A 


7402 


57 


338 


TTHNVIG Y I EASTAYE SGCDP I S PARVP 
FSi:<FFLVAITFLLFDLEIALLLPLP*A 
LQTTNLPLIGMSSLLLIIILTULSLAYE* 
LOKGLD*AE 














Q3ELVDPLTTVREQCEQLENCVKAR*RL 
*LCDERVSSRSHSEEDCSEDLFDFLHaR 
DHCVFLILYINLKYMCELT 


7344 


21245 


A 


7404 


494 


154 


SLFIFYLYLIFLF**FFFHyFFYFLLSI 
FSFKLFFFFLLHFFFHMFFFLFFSPPKK 
IFFPKIPPRFFLFPLF**KFFFFLPFKF 

F 


7345 


21246 


A 


7405 


1 


400 


NPGRFLSTSNSSLYERTREIRPTSQSAF 

KKGGPLKKKPRGGQK*TGGEKKKFFPKR 
G6KKKPPGKF*KKTFFWGGiaCMGKTPQK 
KINPLGKKKIFKGKRGKTPPL 














7346 


21247 


A 


7406 


2 


257 


RASLGHPGWSAW* SWLIAALTSWLKQF 
SHLNSLKCWDYRHEPPHLA*LRFEHRHS 
GSSICTLNR*ILLSFPILSIRYKNGLIL 


7347 


21248 


A 


7407 


259 


-Ti 


YVFFFFLFLLFFIRLQSVSPFFFFFFFF 














LYLLRQVS IS lAYTLFG 


7348 


21249 


A 


7408 


3 


248 


VTFL*CDVEIAVLIPLP*ALPTTNLBLI 
DMSSLLLI I ILaLRIiAYE*LRGGLD*AE 
EHQATHSAYEEAPPL 


7349 


21250 


A 


7409 


2 


350 


RSGMPRRYSDYPDAYTT*]!llLGSVGAFI 
CLTAGILIIFMI*EAFASKRKVLIVEQP 
S INLE *LYGCPPPYHTFEEPVYI KSRHI 
RKESNPPKLVSSQPHGLHDFFKKKLGTI 
TE!CN 


7350 


21251 


A 


7410 


2 ' 


232 


LDQTYAKIHFTIVFIGVDLTLLPQHFLG 
LSGMPRRYSDYPDAYTT*NILSSVGSFI 
ALTAAILI I FMI *EAFAKKF 


7351 


21252 


A 


7411 


2 


321 


TSLLAVLLIIMTL*LPQLNRYIEKSTPY 
ECGFDPICPGRVPFSIKFFLVAITFLLF 
DLETALLLPLP*SLQTTNLPLIAMSSLL 
LI I ILDLNIiAYE*LQKGU)*AE 


7352 


21253 


A 


7412 


2 


194 


SIECFLVPITLI.LCDLEIMjLLPLP*aL 
QTSNLPLIVMSSLLLIIILaiiNLAYE*L 
QKGLD*AD 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=posslble 
nucleotide deletion, \=possible 
nucleotide insertion 


7353 


21254 


A 


7413 


1 


197 


LYTTYFVTKTLLLTSLFL*GRTAYPRLR 
YDQLIHLL*KNFLPLTLALLI*HVSIPI 

TISSIPPQT 


7354 


21255 


A 


7414 


1 


185 


STPLP*ALQTTNLPLIVMSSLLLIIILA 
LSLAYE*LQKGLD*AEKKKKGAAVLKDP 
SGGPS 


7355 


21256 


A 


7415 


347 


228 


LE*LyGCPPPYHTFEEPVYIKSRLKRKE 
SNPPKLVSSQPHGLHDFFKH I KNYRKUK 
RKEGLASVKENDRGP 


7356 


21257 


A 


7416 


1 


296 


HAYDHRS AQLNG Y I E QS TP YE CGFD P I S 
RARVPFCIKVFLVAITFLTjFDLEIALLL 
PLP*ALQTANLPLIGMSSLLLIIIIALS 
LAYE*LHKGLD*AE 


7357 


21258 


A 


7417 


3 


202 


TCSI*HVLVRMT*LIYDLEIALL*PVP* 
ALQSTNLPLISMTSLLLIIILALSMAYE 
*LQTGLD*AE 


7358 


21259 


A 


7418 


1 


321 


SNTLLALVLITI'i'S*LPQLSGCVGECTP 
YECGICRILNAEALCCMQIFLKAITFLL 
FDLEIALLLPLP*ALQTTI!ILPLIDMSSL 
LLIIILaLSLAYE*LQKGLD*AE 


7359 


21260 


A 


7419 


1 


212 


RTRGI IFRPLSQSLY*LLftMILVILT* I 
GGQPVSYPFTIIGQVAYVLYFTTILILI 
PTICLIENKILKWA 


7360 


21261 


A 


7420 


2 


257 


GRVGGRV6YSIAFFLQAITVLVLDLESA 
LLLALP* SLQTTNLSLIVMSSLLLI I IL 
ALSLAYB*LPLGLD*AEYWGEMMTTTLD 
A 


7361 


21262 


A 


7421 


367 


269 


HCDLKLLGSSDPPASAS*VAWITGTCHH 
TKLS 


7362 


21263 


A 


7422 


2 


116 


ALQTTNL&LIVMSSLLLIIILALRLAYE 
*LQKGLD*AE 


7363 


21264 


A 


7423 


2 


336 


KPECGRBGILPCCSSSAWPEGSFRPFQM 
NLGSFLSFFFLFFFFFERGLFLCPPV* S 
PYGKSNLLEPSASRFKQFFCLGPLNNWD 
YRLFPPFPGNFGVFKKNRVFPFWPSLV 


7364 


21265 


A 


7424 


3 


154 


LFL* IRTAYPRFRYDQLIHLL*ENFLPL 
TVALLI*HVSIPITISSIPPQT 


7365 


21266 


A 


7425 


3 


154 


LFL*IRTAYPRFRYDQLIIILL*ENFLPL 
TLALLI*HVSIPITISSIPPQT 


7366 


21267 


A 


7426 


1 


314 


INTLLTLLLIIITL*LPQLNGYIN*STP 
YECGFDPISPflRVPFSIKFFLIAITFLL 
FDLEIALLLPLP*ALQATNLPLIDMPSL 
LLI IILALSLAYE *LHQGLY 


7367 


21268 


A 


7427 




331 


ILIINTLLALLLIIITF*LPQLNGYIEK 
STPYECGFDPISPARVPFSIKFFLVAIT 
FLLFDLE IALLLPLP*ALQTTKLPLIVM 
TSLLLIIILALRLDYE*LQPGLD*AD 


7368 


21269 


A 


7428 


2 


373 


SDHNHI IHALVITILLGLYFTIxLQASEY 
FESAFTISDGIYGSTFFVATGFHGLHW 
IGSTFLTI CFIRQLI FHFTSKHHFGFEA 
AA* YWH I VDVAGLFLHVS I YRGGS YNHQ 
IIPSVAQESTPA 


7369 


21270 


A 


7429 


378 


63 


PSFFPPFPP*ISrFFFPPRPLFSPGGFPPF 
FPPPQKGPPPKIPPGFFFSPPFWEKFFF 
PPPPLILPPPRFFFNPPPPFFFFFFFFF 
FFFFFFFFFFSFFFWDFLGSGC 


7370 


21271 


A 


7430 




109 


QTTNLPLIVMSSLLLII ILALSLAYE* L 
QKGLD*AE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide Insertion 


7371 


21272 


A 


7431 


35S 


290 


FLLVIPNFLFHFLI YI I IS I VPNVSFFH 
IIYILSIFFPFIIFILLLSSFNFILFFF 
FITSFPFFYFFFFFFFFFFFFFFFFFFF 
FFFFSFFFFFFF*FFFTN 


7372 


21273 


A 


7432 


1 


95 


PTRPPTRPHNSLQL*TPGLKQSYPRLPS 
SWD*EHMPSHPAlSrFRKFF*RWHLYCPGW 
S*TPGLKQSYPRLPSSWD 


7373 


21274 


A 


7433 


1202 


870 


HNIFKIRPHCRLTHRGKPFYWGLLVTPV 
FSPSLRPSFLPSFLPSFLPS*LPSFLPS 
FLLFLSYSFCPLSFLCLRCIPHSAFFWG 
NPRQVQENCVIVTI YRYLSFTALHQL 


7374 


21275 


A 


7434 


163 


1 


SVYVKRIGLGRVAYTCNPSTVGGQDG*! 
T*GQEFETILAKSREPLFLACFLFLI 


7375 


21276 


A 


7435 


148 


362 


NDDNSVEFGIRLHKGLGTVAHTYKPSTL 
GGQGGHIA*AQEFMTSLGHMT*SSLYHK 
LTEYGWRDTASLLMC 


7376 


21277 


A 


7436 


61 


233 


GGDLRHWPSSDSFFLSLSFFF*NKNGGG 
IiALLPRLDIiNSWSQAILPLKPPNALGLQ 
A 


7377 


21278 


A 


7437 




242 


FVCLFLFFFFEKESTFQHDLSSLQPPLP 
RFKRFSCLSLPSTWDYRLAPQCIiANF*! 
F**KSGFTFG 


7378 


21279 


A 


7438 


119 


300 


YCVSLLIFFIFIIYIPILLIFKFLPITL 
NYISFLFFYF*H*FYF*KNFIFVFPFGY 
FPRP 


7379 


21280 


A 


7439 


1 


221 


LKLQHHDPTTISLVITRNWKQPKCPNVQ 
Q IVH* SAIKRUELL IHAKTWMNLRGIML 
SEICQPQKILYLEIPFI 


7380 


21281 


A 


7440 


35 


345 


FSFCCVCVCVCVSLPKHailEMLNQTHP 
YVPKGVAYLPEREPFIVPMDPELTA*YE 
DYRTHESAQEPSEPHQLSRRSLPH*PGH 
ATRDHSPPTDSWGKRRTSH 


7381 


21282 


A 


7441 


3 


384 


VIHL*AGAVI IGSRSKSTNALAHFLRQG 
TPTPVILVLGI lETMNLLIQPVALAERL 
TDNITAGHLLMHLIVSATLAILTIMLTS 
TLIMFTVLILLTILQIAVALMHAYVFAL 
LVCLYmDDAYDHGG 


7382 


21283 


A 


7442 


15 


277 


GLAILPRLVPNSWPKVILQVMPPKMLG* 
QA 


7383 


21284 


A 


7443 


21 


694 


NGGLNAHLASASEFDHSGVQLIEREEEI 
CIFYEKINIQEKMKLNGEIEIHLLEEKI 
QFLKMKIAEKQRQ ICVTQKLLPAKRSLD 
ADLAVLQIQFSQCTDRIKDLEKQFVKPD 
GENRARFLPGKDLTEKEMIQKLDKLELQ 
LAKKEEKLLEKDFIYEQVSRLTDRLCSK 
TSGCKQDTLLFTKKMNGYSRRIKNATEK 
MMALCA* LSMKQAPTIELQKEVQGERRP 


7384 


21285 


A 


7444 


141 


428 


DRSLLEAGDGRIAGDFFFF*TGTNJ!VAQ 
AIOTQWGIHS SLHP * PPGLKQSSYIUNLLG 
SYHHWLRPHALFFFLKHRGSLFCPGWFG 
TGFRGTTFWGAK 


7385 


21286 


A 


7445 


35 


343 


GYS IIGCPKFLDEKI FDSGHEKERKINS 
LLF*KKKKKKKKKKKKKKKKKRGGLLKK 
TS GGP KI KRGGGI KNFP I KGGVKKTSGE 
FFGKKPFFGGEKFWEILP 


7386 

- 


21287 


A 


7446 


403 


193 


E Y I KQGAE *1INQ P KLHAHRKRLTWKETP 
RCIHEGIAPAIMNSDCLVFDTSIAQLFA 
ENGNLGINVTISMC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutaminc, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7387 


21288 


A 


7447 


411 


no 


LTDS LINI VYR* KMS YIS *TNSA* VYLY 
IFCIYIYIYIYIYRRIYSIYRC*YIYYG 
PAHIVLVYSSSYILVYYESHKYGYIMGL 
NILSYIFIYSYFSACS 


7388 


21289 


A 


7448 




380 


PTRPQKLKKNCNGVSNGDLCFFGGGQFL 
IWGFFFFFCILIEGGLNSKMYELKSLIK 
TYGAKKAE 1 EHTEN* KNRETLLERAAQL 
AI KVTNPNAMLRTVEHD YAMYMHNFWLN 
KLNNCHKVAISCKQ 


7389 


21290 


A 


7449 


200 


437 


VQSNNNKNFKRRRL*PGWAHTCNPSTL 
GGQARRIS *AQELETGrJ»ILMRPCNYNK 

FIN\'LDD\'HGVTSPVSQYYKKVC 


7390 


21291 


A 


7450 


405 


3 


PPRFFGFFFPfYPLKFFFFPKGFNFFGG 
F F P i »■ FP P KKKF _B' F K^J" S PGGi'FL P P FLG 
KI FFFLPPFKFGPPRGFF+RAPLFFFFF 
FFFFFFFFFFFFFFFFFFFFFLQRINFT 
ARPGTPLSRAVPLRGALPPSA 


7391 


21292 


A 


7451 


2 


348 


PRVRPRWYKLQTKRCDLTLPS * GHSHV 
SLRNTTFDALKS+DTDSLLSPKLECHGV 
NTPNCNLRLPSLSDYPASLSREAWIAST 
RLRAWEI FVFLAQMRQDYCMNPGGGGCN 
ELK 


7392 


21293 


A 


7452 


1 


168 


LVHDGHAGLKLLTSGDRPA*AYQSAGIT 
GVRPLGPPTYYYFRTASFMTFICKFATV 


7393 


21294 


A 


7453 


184 


260 


SSRSGSLASRSSR*STLLGLPKCWDYRC 
MPPSLAHFFFFNF* I*KFCFW*RRSLGM 
LPRLWMS+MQAILPLRIRLQGIHFAGT 
QTFSL 


7394 


21295 


A 


7454 


405 


64 


FFFFFNFF*KKKI*PVFAKIFFKIFrFF 
SGLKIFWGGGFQNAPPQKKFFF*KIKSF 
FFUIFFF*KNIFFFYaGVVFFLLNFFFR 
GAPLFFFFFFFFFFFFFFFFFPFFFFFF 
FFFFFFFFFFFFF 


7395 


21296 


A 


7455 


368 


2 


SQSFGRPRWENRLGPEQHIKISRYRPAQ 

VHACHPSTLGGRGRWIT*GQEFKGSTHL 
GLPKYMDYMPEPLLLVYIQCFHFRTFCM 
tcaAIHCFKPEQLLYNKTSYFKESYKVKA 
ALVACKPTRP 


7396 


21297 


A 


7456 




T¥3 


TERDSVSJOIKYIYIHTHTHTHTYI CVHG 
VIYVYI *SLLLIYNIAYILIYHVFYILL 
IFKA 


7397 


21298 


A 


7457 


281 


127 


YNFCI FM*RCGFNNVGQSGLKVLi:iRGYS 
PASASQIAGITGVBQCAQPVAYS 


7398 


21299 


A 


7458 


2 


194 


TSVHKYF*YPFTITDRIYGSTWWATGL 
HGLHVI IGSTFLTI CLIRQLI FHFTSYH 
HFGSEAAG* YWHFVEPAHLFLYVCIY* * 
GSTMWATGLHGUIVIIGSTFLTICLIR 
QLIFHFTSYHHPGSEAAG 


7399 


21300 


A 


7459 


22 


319 


GGSVTGAYHRILDHDRNQIFLA*GGSVT 
GAYHRILDHDENQIFLALLISMLLGLYF 
TLLQASKYFECPFTICDGIYGSTFFVAT 
GFHGiaVIIGSTFLTICWIRQLI LEFTS 
HHHFGLEAAA*YWHFVDW*LFLYVS I Y 
* * GSVTGAYHRILDHDRNQ I FLALL I SM 
LLGLYFTLLQASKYFECPFTICDGIYGS 
TFFVATGFHGLHVI IGSTFLTICWIRQL 
ILHFTSRHHFGLEAAA 


7400 


21301 


A 


7460 


1- 


222 


TILLGLYYTLLQA*EYFKATFTIYDGIY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 

residue of 
peptide 


Predict- 

nncle- 
otide 
location 
corrcspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M[=Metliionine, 
N'^Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GSTFFAATGFHGLHDIIGSAFLTICFIR 
QLIFHFTSEHHFGFEAflA*YWHFVDAT* 
LCMYVS I Y* *GSTFFAATGFHGLHDI IG 
SAFLTICFIRQLIFHFTSEHHFGFEAAA 


7401 


21302 


A 


7461 


3 


246 


LNYSLFFHGEANLGTTQVLTHPSTTAMY 
FVHYCQPP*ILYGTINT*PPVVHKNPIK 
KKKKKKKKKKKKKKKKKKKRGGGF 


7402 


21303 


A 


7462 


2 


155 


SRSHJiALLLPLP *AliQTTNLPLI VMS SL 
LLIIILaLSLAYE* LQKGLD*AE 


7403 


21304 


A 


7463 


1 


136 


PTRPAPSHLLYCKNIKKLFPSGTANEAL 
IIADIFLRCKKKKFKVK**VLKASEE*N 
GCKHLHGMH*NSAVCTVFWLKQIMNSD* 
* *KVWTQ*KKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGENNK* KNI KKLF 
PSGTANEALI lADI FLRCKKKKFKVK 


7404 


21305 


A 


7464 


166 


192 


NIN*MKYMHTTOCYSDFKRKEILIHATT 
WKKLEDIMVSEISQSQKDKYCMILHIEY 
KLYIHM 


7405 


21306 




7465 


416 


55 


LFFLRLGYFFLPNIRVFFFLSTLFFFFF 
VWaFFFFGRFFFFFPPPQEVCFFFIF*G 
IFFFFLFK*KMMFFFFFFLWGPPPFFFF 
SGAPLFFFFFFFFFFFFFFFFFFFFFLR 
KSNTKKMF 


7406 


21307 


A 


7466 


295 


84 


IWaNVTLIiQLVSCLFCFLFLH*QFFCCC 
CCLFLFLFVCLFVCWFLNLKPSIKCLLP 
FSLSLGCAEIPSSF 


7407 


21308 


A 


7467 


142 


258 


PGTVAHTCNPSTAGGQGGRIP*GHEFKT 
NLANMVKLHLY 


7408 


21309 


A 


7468 


3 


192 


PSEYFESPFTISDGIYGSTFFVATGFHG 
LHVIIGSTFLTICFIRQLIFHFTSKHHF 
QFEAAA*YWHFVDW*LFLYVSIY**GS 
TFFVATGFHGLHVI IGSTFLTICFIRQL 
I FHFTSKHHFGFEAAfl. 


7409 


21310 


A 


7469 


2 


110 


GRVGKHHFGFEAAA*YWHFVDW*LFLY 
VSIY**GS 


7410 


21311 


A 


7470 


380 


8 


TPQKRKKLKKAGEGWPPPPKKTTPPPP 
PIFIYFIFFFFFIYFFFFFFFFFQYMSF 
FYSHIMIQRHFLNF*ESFYSCISILQ*r, 
FKFFSFLQGLIKL 


7411 


21312 


A 


7471 


1 


107 


PTRPHHFQFEAAA* YWHFVDW* LFLYV 
SIY**GS 


7412 


21313 


A 


7472 


2 


360 


VNTLLALLLIMITF*LPQLNGYI EKSTP 
YECGLDPISPAHVPFSIKLFLVAITFLL 
LDLEIALLLPLP*ALQTTNLPLRAMSSL 
LWIIIjALGLAYE*LQRGLD*AEWLHSL 
TQPTITOU 


7413 


21314 


A 


7473 


423 


167 


VKVHKCFLPGVAPHAFNPSTLGGRGGRI 
A*AQE FKTSLGN IVRPPSDTCNPI VLGL 
QA 


7414 


21315 


A 


7474 


1 


73 


IKPE*YFLFAYTILRSVPNKLGGV 


7415 


21316 


A 


7475 


131 


3 


I SWVGAEWLTRVIPAL* EAESGVSRGQE 
lETILAMTVKPDAW 


7416 


21317 


A 


7476 


2 


103 


PRVRSTGFSSIAHITRDVNYG* I IRYLH 
ANGAK 


7417 


21318 


A 


7477 


2 


325 


NTLLALLLIIITF*LPQLIGYIEKSTPY 
ECGFDPISPARVPFSIKFFLVAITFLLF 
DLEIALLLPLP*ALQTTNLPLIVMSSLL 
LII ILALSLAYE*LQKGLTERDT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 
correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 

otide 
location 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lVI=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valinc, 
W=Tryptoplian, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7418 


21319 


A 


7478 


423 


170 


FFSFFFFFFFFFFFFFEIDKNVYLIM*C 
TNPQEKRNLLFCFLAnWSPVCITPHTYI 
CVCVCVIYIYIYIYIYIYIYPFIFSCCD 


7419 


21320 


A 


7479 


433 


50 


RVFFFFFFTPEKKTFFPQEIYLFFSPFP 
L*NFFFSPKPLFFLGGFAPNFPPPKKVF 
FLKFPPGFKFSPPFKKKFFFFFPRFFFA 
PPRFFF*GPPSFFFFFFFFFFFFFFFFL 
TVKFTRGFRGFCGQI 


7420 


21321 


A 


7480 


401 


337 


FFFFFGKFFFPQEFTVFFPPFPFKIFFF 
SLRLLFFWGGFAQFFPPQK*VFFSKFPR 
WFFFPPLFRKKFFFFFPG* FLAPQGFFL 
KGPPPFFFFFFFFFFFFFFFRPI*VGGR 
VGNNPNPNTNYLPN 


7421 


21322 


A 


7481 


1 


84 


KNFLPLTLM:LI*HVSrEITISSIPPQT 


7422 


21323 


A 


7482 


562 


412 


LFSFHVSGTMAHTCNPSTLTG*GRIT*G 
LEFKTSLGNTVRSHRYRKKKIA 


7423 


21324 


A 


7483 


12 


233 


difhftskhv:fgfeaaa*ywhfvdvvrl 
flyasiy**gsll 


7424 


21325 


A 


7484 


385 


56 


GNFPTPKKFNFFYPFYLSIFFFFPKKLK 

FF*^IFPIFPPPKFT,FI,PPNFPVI_,FFFF 
FLFWEFFFSFLFVJFFFFPFFFFFFFFFF 
FFFFFFFF*KTTEFYFVNSNFQHTK 


7425 


21326 


A 


7485 


424 


6 


FFFFFSEGSGLIREDPIGAGALYDYGR* 
LVVVTG*TLFVGVYIVIEIARGNRLCD 


7426 


21327 




7486 


16 


129 


FXXXRXXLIXKEPSINLE*LYGCPPPYH 
TFEEPVYIKS 


7427 


21328 


A 


7487 


183 


387 


lASGLDI FLLLLLETGSHSVTQAGVQWH 
NHDSLQPRLTGLK* SSRRSPRGSWDYRV 
LLCHPQWSAEA 


7428 


21329 


A 


7488 


351 


169 


EKKIFFFSPRGYGPPPFFFLKAPPHI FF 
FFFPFFFFFPFPFFRGGGGDHKNPFFFF 
WGFLKNPPIiFFFFPRV* I* IFFFFPFFF 
FFFFFFFFLGGGGGTIKIFFFFFGGF 


7429 


21330 


A 


7489 


47 


212 


THaSALFL*IRTAYPRFHYDQLIHLI,*K 
NFLPLTIiAI.LI*HVSIPITISSIPPQT 


7430 


21331 


A 


7490 


98 


273 


TSaSCCSAHILAFKP*ASMATFITKSL* 
* CHTHTHTHTHTHTHTHYMYIYTHMHTY 
IF 


7431 


21332 




7491 


70 


405 


LYGCPPPYHTFEEPVYI*EAFASKRKVL 
IVEEPSINLE*LYGCPPPYHTFEEPVyi 
KSRQKRKESNPPKLVSSQPHGLHDKKKK 
KKKKKKKKKKKGGGPLKKPRGGPNLTGG 
GKKNFFPFRGGEKKPPGGFWKKTLFLGG 
GKLGTPPPRRF 


7432 


21333 


A 


7492 


22 


208 


RSSLPARPTRAVI L 1 1 FMI * EAFASKRL 
VLIVEEPSINLE*LYGCPPPYHTFEEPV 
YIKSYV 


7433 


21334 


A 


7493 


2 


204 


KLPETVKKKKKKKFYS FRS I VMLLL YL F 
S*SLSFFLLYSLLFFLFLLSFYIYSFLC 
FPLCKTATTEH 


7434 


21335 


A 


7494 


2 


104 


TTLLLSRT*NKLT*LTPLIPSTLLSLGG 
LPPLrG 


7435 


21336 


A 


7495 


291 


160 


PRHF* I FVEMGSHYVAQAGLKLVAS SNL 
LAWAS QSAGVTGVSHGSLL I TS CL 


7436 


21337 


A 


7496 


422 


180 


LGAPQGVF*KGPPFSPSSSSSYSSSSSS 
PS*QNIFNKIFLFVCFVLETESVTQVGV 
QWRDLGSLQPLPPGFKRFFTITTH 


7437 


21338 


A 


7497 


244 


296 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
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nucleotide 
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residue of 

peptide 

sequence 


Prcdict- 

°oeaLn 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFFFFFFFG 
aNF*FYK*LLGNRDGVCLPPSSYL 


7438 


21339 


A 


7498 




410 


PTRPDQLIHLL*KNFLPLTLALLI*HVS 
IPITISSIPPYT 


7439 


21340 


A 


7499 


51 


478 


PQKYTLIMKICIIPPGKGFRPVEMLAGG 
NENTRCLIEERSYKYHYSLIKLGLQQQC 
IFSSLLYKYNMIYCLMLCYYIGYIFIFY 
DMYYRQYSVLYYKLYKYVMLI * I YILYL 
FMCI*KYIYFIFM*YKYIFIYLYLY*YK 
Y I F Y I YLC VYKNI YTLYLCNINI YLYT Y 
lYININIY 


7440 


21341 


A 


7500 


3 


110 


FTSKHHFGFEAAA*YWHFVDEV*LFLYV 
SIY**GS 


7441 


21342 


A 


7501 


168 


327 


AFCYKATVIKTI*Y*YKHRYTEQWSRTK 
NPDINPYICGQMIIFLQLKSLHKI 


7442 


21343 


A 


7502 


400 


159 


NNFSPLRNLQGFFGPLKFPMGFQAGPPG 
* GFSFFLRSNFFLDVFRGKY*TFGF*DF 
FYCPSPNEVLGEPPLMGGNKPGFP 


7443 


21344 


A 


7503 


401 


257 


TSQKKKKKKKKTPKKKKKNREKPPKKKR 
GTRGEPFKTGGGKEKKKFFKKKKKKGKG 
PP* PQLVKRGGREKKKKNLWKKKGAF* K 
KKKKKKKNPKKKKKKPGKTPKKKKGDPG 
GTF 


7444 


21345 


A 


7504 


44 


344 


LLASLMTLALPPTINLLGELSVLVTTFS 
* SNITLLLTGLNI LVTALYSLYIFTTTQ 
WGSLTHHINNIKPSFTRHNTLMFIHLSP 
ILLLSLNPDIITGFSS 


7445 


21345 


A 


7505 


286 


252 


I *FAFHHVGQAQLEPTiTSDLPASaSQSA 
RITSVSQHAWPVHNIFYSLL 


7446 


21347 


A 


7506 


3 


213 


RELPGLKRFFLLSLLSSWD*WRMPSHPA 
TFCIFCRDEVLPCCPAGLLILLVWNLKN 
CCCYHHHRYYYCY 


7447 


21348 


A 


7507 


400 


20 


SLPQKKVSLLKLKSAVQLVMF* DRFSLC 
HPGWRA*GQSGVTTASSLRAR* SSCLTW 
EYRPSPLPLADFLFW*RRGLPI*PRWS 
NS*GQVGLLHQPSKMLKFSSLSPCARPL 
FFFFNRSTNLRVQS 


7448 


21349 


A 


7508 


349 


227 


SDFLTSMIPPASASKSFGITGVSHHTRP 
*PRFLKVNLIYSEMKVYN*ASFRHVD*G 
GI,ELLTL*FTHLGLPKLWDYRREPPHPA 
LTKIFKGKPYIQ 


7449 


21350 


A 


7509 


371 


21 


FI,VETGFHHVG*AGLEPLASSNLPASAS 
QTAGITGMSHCAQPNPGSLLNKTMIAVA 
TEQWVGYMWPSVFQHITARPEGKRLSDC 
LHSDTNVCWCVCSGLI FVCFETGCRFA 

LQPG 


7450 


21351 


A 


7510 


330 


14 


DKVCS VTQARVQWHDHGSLLFLS SGLRQ 
SSHLSLLSSWDHRHMPPGMAWFFYP* *R 
LRSHYVAQAGLKLLASSNPPTLTCQSAG 
ITGVSHNTWPKALFSVTDLKH 


7451 


21352 


A 


7511 


417 


33 


aPPCGGGPPKRIFGPSRKRGKGPPPPKR 
FF* KNKPPKGGGKKSPPPPKNFFPKKKN 
LGPKKKKSGFWGRFAPPRAPGGGVFNFP 
FKAGGEKFFSRGE INPPRGAGKKKGFFL 
GKKNTKKKTPKKKDL 


7452 


21353 


A 


7512 


81 


296 


CLREGASHSAAQAGLHWHDHSSLHP*NP 
GLRGS SRIiNLLSI WDYRCVPPSLANFE S 
CLLKI ILGSPFPKNL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSiV 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptbplian, Y=Tyrosine, 
X=UnI<nown, *=Stop codon,/=possible 
nucleotide deletion, \=pos$ible 
nucleotide insertion 


7453 


21354 


A 


7513 


412 


279 


KKYTQGLGWAHAYNPSTLGSQGRRI P * 
AQFETSLGNTGRPCLH 


7454 


21355 


A 


7514 


74 


125 


KI *DWVPKGAPLQGPVF*NSTFRPRVRH 
RGPPWPRPGPTVWKG*NKKN*PAPRGSN 
PPFFFPGNPGPFKLRGWPGSS*GSFQNC 
KKAQAG*APT*KPGPPPPQNKV 


7455 


21356 


A 


7515 


332 


20 


GERQVGSYFFMMTERLFGMTKKF*KWIV 
SMVAQHSSSISGVWVFRGQELAFPLSPD 
WQVDYESYTWRKLDPGSEETQTLVREYF 
SWEGAFQHVGKAFNQGKIFK 


7456 


21357 


A 


7516 


1 


167 


RTRGQRFTVLARIVLIS *HCDLPASASQ 
SAGITGVSHCTRPGLSFLYTPTAKHST 


7457 


21358 


A 


7517 


2 


223 


GRMEEMGSHYVA*AGLEFLGASDLPA*A 
PQRAKILLLLGGHKPGLTILPRLSSNSW 
PHVKWPRWPLKALGLRG 


7458 


21359 


A 


7518 


151 


475 


SLGKFAVSFSFSFLK*REALAVLPRLCS 
NSWP *AVLLPWLPRVGIAGMSYHTQPDY 
HFQARAVAE I LGLVSTQHS S VTGL P VEN 
SFRAGHGGPRLQSQHPGRLRWADC 


7459 


21360 


A 


7519 


3 


134 


CSFRLQAVRHICajVITLGGRGGRIA*AQ 
EFETCLDNIARPCLY 


7460 


21361 


A 


7520 


14 


219 


APS IHGFGFLYTYIHTHTHTHVYMYI *V 
YIHTHIESIKCQPLTQLQSNKKRWRIVF 
RTQFMLTCYFMR 


7461 


21362 


A 


7521 


134 


334 


RKHSC* r FFNFCFYLSLETRSHYVASTG 
LDLLASSNPPALftSQRARITGMSHHAPP 
VLTRFKKSLF 


7462 


21363 


A 


7522 


406 


145 


FLKTGFSPFGPFVF*TPPPGFPPPPPPP 
IFGVPGGGPFAPPLFFFFPQTKHFPPFL 
NPPSPPPQRGFFRGFFLFPPAFFLKTPP 
FPF 


7463 


21364 


A 


7523 


3 


382 


HPQGEGGRLTSI.E*NTSLRLKKKKKKKK 
KRGGPFKKNPWGAQI+PGKKKKNFFLKR 
GAKKNHLGNFGKKPYFWGGKKLAKPPKK 
N*TFKGKKKFIJ1GKGGKKTPKPWLLKIF 
FSGFYLKKIFPPGP 


7464 


21365 


A 


7524 


319 


106 


gtrscsvt*agvqMhdhsSlqp*tpwlk 
rsswlrnrwdykheppciinkfyffcrdg 
gllchpgwsqspelk 


7465 


2)366 


A 


7525 


433 


228 


MQPYlSIWEVNSSSQLSLSSflRQRVTHVLT 
GNFFIK*EaRWDLPVLPRLVSKSWPQVIF 
PPWPSKVLKLQV 


7466 


21367 


A 


7526 


26 


351 


CRSLVFSLPKPGLRFVSILHIQVFSHFD 
FFFLTGNTQK*RK* TPFI FPA*LFKTPA 
PKPGMVAHTCSPSYSGGLDGRIA*AQEP 
RTSLGNKTNPSLEGGRKKKKPSGS 


7467 


21368 


A 


7527 


438 


62 


QLSKIHRPPAGGRFFKSGGGFKPFCPVG 
JjPPFGKKAWQPTPKNPPKKQKPSGGGJjS 
PMGKPP-i-FPCSMPSDSPGSVSFSRGGVA 
PAPKDYCP IV* AALI PSKKKKKSKPG * V 
KKGKPNLYSIYNK 


7468 


21369 


A 


7528 


3 


132 


FYYL*RRGFPVLPRLVIiNSWLQVILLPW 
PPKVLN*AAHHPHG 


7469 


21370 


A 


7529 


2 


628 


FFFLPGWNRVLLMGRGLGEANSREGRG 
PSQPRGMHGQLEVRGGRSQGDGWHGTLS 
PPWGRIYRGAPPTFAAPQAPKPFRQLLP 
M*LPGRPSSCLLPETPALSPLPSAEWGG 
FKLLTGTSETPGPSGSPSVGSANAQAAG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

nucleotide 

ingto first 
residue of 
S'ence 


Predict- 
ed end 

location 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysfeine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=l]nknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ATTIVARGHTPCTRGPLDTHAHSSLRWH 
AHTRPPPRPLAHEHGTHTHAHPHTPGAW 
VGEGPFQGWGRA 


7470 


21371 




7530 


63 


205 


GRPTRPRTRGVKIL*QFEHTKPTPGLPT 
LIALTTLLLPISPFILIIL 


7471 


21372 




7531 


390 


56 


KTFFFFNLIAFTEAKSVLLFLLLFFETG 
SRSSAQPGVQWHHQSSLQPQPL*LKGSS 
CLSLLSN* DYRCAP PCQ* S PCLSLLKCW 
DYKREPPHPAKESHMQQSYLIYSLYIA 


7472 


21373 




7532 


543 


335 


LAAYGGTCLQS * LFSRIjRWEDHLS PGVQ 
GCNELQLQHCSPAWATE*DPVFKGEKKE 
RVLVKVFS 


7473 


21374 


A 


7533 


3 


145 


WCDHGSLEPRSPGLKQSSCLSLPSNWDH 
SSEPLHTTSMITOF* IFKFLNLK 


7474 


21375 


A 


7534 


386 


106 


SWVTDEITMEI*E CVLNDNS DTPYQNLW 
DTEKWLRAKFTALNACMKKSERAQIRH 
TCVHTHTHTHTHTHTGSKCLLRKYLLLT 
IQQQNIRIN 


7475 


21376 


A 


7535 


398 


2 


SLQPLPPRVKCFSSPSPPRRWDYRGA.PP 
RGGKFFFFGKKE.FPPVGQGGF* FLPPGV 
LAPPPPQKGRISGGSPRPRPFPFFFKIR 
ISYPIIiAKQLKLCFLKKKKKSIQAIKLY 
HHQKMPITKSSVFWESQPRA 


7476 


21377 




7536 


2 


344 


HSKCVPAEWTMAYYIRYLSISSLLHIL 
EN1VHFFFHFLY*GPNNLFFFYFAGTRL 
IQRSWK*PMII**VHVQIKREGQIFNK* 
I FCMIGRGG* ITCGQEFETSLVNMVKPC 
LY 


7477 


21378 


A 


7537 


116 


224 


LHTHTHTHTHTHTHTHTQFNKLP * LP I P 
CGHIPRRY 


7478 


21379 


A 


7538 


1 


383 


WHERTHSRI I ILFQGLETLLPLIAF* * L 
IiASLAHLVLPPTINLLGELFVLVTTFS * 
SNITLLLTGLNILVTMiYSLYIFTTTQ* 
GSLTHHINNIKPLFTRENTLMFIHLSPI 
LLLSLNPDIITGFSS 


7479 


21380 


A 


7539 


31 


332 


DNYLSSYSSSGSWMRELILASQGIRWEP 
IVDRTPSHHHTHTHTHTHTHTHTHTHTV 
LFRLGPWTHAREPRGHICGN*IENQSP* 
KNPRTRGKNLRTPHGW 


7480 


21381 




7540 


199 


15 


NNVQI K* QFEHTKPTPFLPTLIALTTLL 
LPISPFILIIL 


7481 


21382 


A 


7541 


337 


157 


AEIVPIiNSSLQNRVRLH*MESYSVAQAG 
VQWHDLGSVQPLPPGFKRFSCFSLLSSW 
DSRLYFRCHD 


7482 


21383 


A 


7542 


28 


313 


RARIGSDQCLSAGTQWENKSQISGDFM 
WNDFRS * SVTRVGVRWRNHGSL* PWIPG 

LR* S SGFGLPNCWDYRDRNEVWGGKKE V 
TCPTVHLQKSQ 


7483 


21384 


A 


7543 


493 


180 


GGFRGSWLVPRDGGQGVGFCSLPPPPSR 
VKQFFCLRFPRRWGGREGSPRPSYFFFP 
*EKPRFFFLGRWLNFLGQGVGPPWPSQ 
RVGFPGLTPGPRASFFFNLI 


7484 


21385 


A 


7544 


403 


426 


FF*KFFITFHTLFFFFFFFFFFFFFFFF 
FFFFFFFFFLAPPPGGFFFSFFYQKIFL 
KFPGQPEKFAIjPFGGGFFF*GKFPQFFF 
CFLVFFFFFFLFLENGVSLYCPGRSRTP 
GLKLSSRLLASLRAGITGVSH 


7485 


21386 


A 


7545 


3 


335 


RHYSDYPDGYST*NILSSEDSLISLTAA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predlct- 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=po55ib)e 
nucleotide insertion 














LLMVFMI * EAFAYRRTGLIA* E PSMNLE 
*LYGCPPPYHTFEEPGYIKSRLKRKBSN 
PPKLVSSQPHGLHDFFKKKKNRGGPF 


7486 


21387 


A 


7546 


424 


140 


RRVKF VGGGAPQGP PP KRGVLPKI PRE K 
VFPPPPG*NGPPGPAFKTPPKKKKN1SS 














FFFFFLNPLK 


7487 


21388 


A 


7547 


2 


348 


FCHVAQ SGLKLLAS SD PPTIjAS QS AR I T 
GMSHHGGLK*GEFLGPSSNKGALLLWEA 
TWLQSFMKSASLFVKPRPTFVYFVFaPV 
GYRD I Y FTAS S L I KVI QEGLKYTMKCAP 
LTL 


7488 


21389 


A 


7548 


2 


90 


RKQAYTHTNTHTHTHTHTH*KKTGIHAH 
TCHTHTHTHTHTLKAEKQIEGGK 


7489 


21390 


A 


7549 


256 


405 


CELYSGKEMELVFGLFIFTGVTRLETFY 
KKECS SFFHSQSRGHKLQPS ILNEWWHI 
Rl-JPLGLTHFLLL* KTGVAEYKNSLNWH 
HPFFLSYAVSFLLQGWSFAMLSKLGSNS 
*VQARCLPQSPKVLGLKESPQERTVNVS 
S IRGKKWSWYLDYLFSQGLQGLKLFIRK 
S VHHS S I PRAEGINCNHQY 


7490 


21393 


A 


7550 


2 


362 


ILIMNTLLALLLIIITF*LPQLNGYVEK 
STPYECGFDPISPARVPLCIKFFLAAIT 
FLLYELEIALLLPLP*ALQTTNLPLiaM 
SSLLLIIILALSLAYE*LQKGLD*AELK 
EEQKTLQC 


7491 


21392 


A 


7551 


1 


317 


TTFDNSALLFFWDGGEIEFHSVSQAGVQ 
WHNLCSL*PQPPGFKQKLILPQIVKENV 
SKISSQLLFSRVNINISPSEQCITPPSL 
QLARIGI FYAAKIHLTKGLRG 


7492 


21393 


A 


7552 


479 


183 


YLFHVHTTASHSNGACTGPTVGDI I VSS 
TL* SMTEQVTMPLASAVSDGTVPSVRTA 
SRGSEQAAHSMVSSEHIE ILEHAGELVI 
BSPEGQLEVQTVIV 


7493 


21394 


A 


7553 


454 


105 


KPPWGNKIRPPLFPGRKVPPPRF*NPVR 
GSPPKK*FVFPVGRVLGTGVTPLSPFLK 
TTPLLLWDPPLSQPPGGVKPENSLYPGK 
KRFR* PKFPPCPPAWGTKRKPPLQKKKK 
RQVL 


7494 


21395 


A 


7554 


439 


I 


LLKRCVRKDSPPPQNKIFFFILKKFVFF 
FAPY* VRKFYFLTAHFGKEPPQ I YI FGP 
PPVFLIFCCFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFNKQVFI 
ERLLCflEHCFRPPPEEKTKNKIEAJajPN 
SEGEER 


7495 


21396 


A 


7555 


323 


82 


FFPFFFFFFPFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFR**AYL 
VRSPLLFIFVFSYF*LI*HVyHRYSVyi 
ELDVLLTGSKIILI 


7496 


21397 




7556 


194 


412 


KMYRTSLGHLVEAKKKKKKKKKKKKKKK 
KXKKKKKKKKKKKKDSRGGW*KKFGGG 
HNTRGVKIIFFFSLGG 














7497 


21398 


A 


7557 


414 


219 


KGFKKLFPPPVF* FFLCPPPI FLKGFP * 
GRKPPPPKNPQGGPPFFFFQTPPPLFFF 
FFFFFFFF 


7498 


21399 


A 


7558 


377 


25 


THVGQVFWGKFFFSRRVFFFFYHLIQIL 
SPPPQKRPPPKGTPEG* IFPLFKEINPF 
FF*DFFFAPPPFFFFFFLFFFFFFFFFF 
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NO: of 
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sequence 
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NO: of 
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sequence 
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od 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
tiucle- 

I'ocaUon 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Plienylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFLKVKTGV 
YCVPPIAVILYR 


7499 


21400 


A 


7559 


386 


2 


FFFFSFGEIFPPPGISGFFPPFPL*NFF 
FPLRPLFFLGGGAPFFPPPKKGFFPKIP 
RGVFFPPPLREKFFFSPPP*IWAPPGFF 
LKGPPLFFFFFFFFFFFFFFYRKARTKP 
ICLWGDVSPSKHFQCI 


7500 


21401 


A 


7360 


1 


396 


KICHRILRPIKKVK*E KKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKGGGAS 
LKIPSGGPQLPRGGEKNPPPFLGGEKKP 
PGVFLKKHLFLGGGHFGPPPPKKNSPWG 
EKKNFWGGGGKKKQNI PGGKKI PSHKKK 


7501 


21402 


A 


7561 


1 


390 


NYVFKEFSDKLGAVAYTCNPSTLGG*GL 
WIT*GQEFETSLGNMVKPYLYQKYQKKK 
KKKKKKKKKGGGPFKKTLGGAQFNGGGE 

KKFFFFLGGVKKPPRGFLGKWPFLGGGK 
MGAP P P P K I laLGE KKNF 


7502 


21403 


A 


7562 


406 


182 


FVFFFFFXXXFXXXXFXFXXXF*TXXFF 














FFXXXXFFFFFFFFVCFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF 


7503 


21404 


A 


7563 


2 


113 


FHFTSKHHFGFEAAA*YWHFVDW*LFL 
YVSIY**GS 


7504 


21405 


A 


7564 


313 


97 


SPTYPCYIRNSAQSPSLIAKSFCSHPHT 
YHHDPHLEKSLTYCPQSPS*PIIPPVTC 
TYINRWPEATEEPQNK 


7505 


21406 


A 


7565 


3 


144 


DAWVFVATGFHGLHVI IGSTFLTICFIE 
QLIFHFTSKHHFQFEAftA*YWHFVDQV* 
LFLYVS I Y* *6STFLTICFIRQL1 FHFT 
SKHHPGFEAAA 


7506 


21407 


A 


7566 


1 


130 


FIRQLIFHFTSKHHFGFEAAA*YWHFVD 
W*LFLYVSIY**GS 


7507 


21408 




7567 


434 


1 


MEMYSIQDTNSLPRRER*RKTRTCI 1 FP 
HKIKTSEHSFDIKEKH*NYVTLCXVLRK 
LSTNQRQS * IPLSTNKNDTL6RARWLTP 
AIPALWEAEfl!GGS*GQEIETIIiANKVKP 
RGRV 


7508 


21409 


A 


7568 


441 


86 


NFSFREKGGEFVPPPPLKIFFFPPPPNF 
FGGGGPPSPPPQKBCVFFPKPPRSFFFPP 
PKKKKNFFPPPQKIGPPPSFF*TPPPPF 














MFVCLY 


7509 


21410 


A 


7569 




155 


PTRPDLRSLRFEAKSPNSRRTLHKPGVT 
I *EAFASKRKVLIVEEPSINLE*LYDYG 
FFYYKIKEPVYFYSNQKKE 


7510 


21411 


A 


7570 


434 


1 


S PTEAGSKNFFKKNPGRKTP PKKKKNI F 
FSPLTPKKFFFPPRS*ILGGSGGPNCPP 
PKKGVFPKNPQGVFNTPPKKKKKKFSPP 
GKNGGPPGVFLKQPPPFFFFFFFFFFFF 
FFLRQSHTLLPRLECSGA 


7511 


21412 


A 


7571 


493 


196 


SSLDCB*SLQGSI*FLTIMMLAVbFVDV 
LY*V*HVLYVLVQVAITKCHIVSGIiNNR 
HSFLTAVKARKSKIKVPADLVPGEVSLP 

SLFS 


7512 ■ 


21413 


A 


7572 


1 


239- 


•KGLTILDAIKNICDS *KGVKILI FSEVW 
KKIVPTLTHDFERFNTSVEEVTAHWET 
AEELELEWPQDVTELPQSHDQT 


7513 


21414 


A 


7573 


391 


32 


FFPQPSPPGVKEFFCLNLPNNWGFEAGP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, I>=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPPGYFVFLVNKGVFPVGPPGFELPSPS 
*VAPPASQKGGVSGVSPCPRVWVPFFKK 
KKKLRGVEQRCTMHLLVYSKNVPVKPQK 
KSSIGSL 


7514 


21415 


A 


7574 




383 


PRVRSTAFSSIAHITRDVNYG* IIRYLH 
AKGAKKKKKKKKKiCKKKKieGGALKKKPG 
GGQKKGGGEKKNFFFKGGEKKKPGGIFE 
KKTFFGGGENGAHPPQKKKTPGGKKKIL 
RGKGGKKPLYPWGEK 


7515 


21416 


A 


7575 


402 


69 


KIFFKNSPLWKISHPPTYRGFFPPFPP* 
NPFFLPGPYFFLGGCSHLGPPPKKGFFQ 
KIPPGLFLSPPFKKKIYLFPPRIFLAPP 
GVFLKAPPPPFFFFFFFFFFFFFFFF 


7515 


21417 


A 


7576 


1 


119 


LI FHFTSKHHFGFEAAA* YWHFVNW* L 
FLYVSIY**GS 


7517 


21418 


A 


7577 


219 




• NDISANTAIKKFWKLPKCPTTEGWLNT* 
LYIHLVEYYATLKVCGDLYVR 


7518 


21419 


A 


7578 


3 


287 


HASAQIiNGYl EKSTP YERGFDP IS PARV 
PFFIKFLLYAITFLLLELE lALLLPLP * 
ALQTTNLPLIVMPSLLLS I ILALRLAYE 
*LQKGLE*AE 


7519 


21420 


A 


7579 


10 


125 


ALQTTNLPLVVMSSIiLLIIILT*ALQTT 
NLPLWiVISSLLLIIILTLILAYEGLQKG 
LN*AE 


7520 


21421 


A 


7580 




299 


DAWVRLLDFDF*VELPTEARIRI I ITSQ 
DVLHS*ADPTLGLKTDAI PGRLNQTTFT 

ATRPGVYYGQCSE I CGANHS FMP lALEL 

VPLKIFEIGPVFTL 


7521 


21422 


A 


7581 


3 


116 


AFASKRTGLC-VEEPSLTLE*LGGGPPP* 
HTLEEPVNI 


7522 


21423 


A 


7582 


398 


2 


RVFPAPKNSLQIFPPLFFFWGSPWKFFF 
TPFPHFVFWFGNFKFDTPFFAFLFRLPFP 
GEKI PLVFFFFFTD*NFVENFFFFLSLL 
FFFGFFFFENM*S*LF*SPANFNV*VTH 
LFIFMVFFHPDFWSFGPILW 


7523 


21424 


A 


7583 


288 


9 


RGSKNLEKNSFPYKVNNHGNGSPLFPHL 
FFSPRGVGPFFFFFETHSRSVIQAGVQW 
HDLSSPPGLK*SSHLSIQSSWDY*RAPP 
HPMTFVYF 


7524 


21425 


A 


7584 


1 


124 


RDGGFTMLPRPILLTGLKQSSCISLPK* 
WDYRCEPLHLASC 


7525 


21426 


A 


7585 


339 


76 


THPGHNGISVSPKKKKIPRGGGPPPLFP 
PLPRVGGGDSL*PRKGGEN*TKPAPFPS 
SWGKKKELPFSKKKKKKKRKKGSISTPF 
SRIE 


7526 


21427 


A 


7586 


205 


87 


S ISLSLSLSHTHTHTHTHTHTLSHPRCV 

PQPIGLPISKWAKVKELIT*R 


7527 


21428 


A 


7587 


353 


256 


PHPSVTTLVTSQQYQDPPPAKHL*LAEG 
SDDC 


7528 


21429 




7588 


159 


195 


QICRHRWHINFLMSMGSIMITPPSFLF 
LNFL* SYVAQAGLKHLSQSDP PASASQS 
AEIRGMSHITAWPHF*YFVELVHFIYINK 
FVGIELFILTS 


7529 


21430 


A 


7589 


76 


205 


RKKPEPEKTCFDNITGNTYPCLSTYEHD 
SD*YYmRT*LCAER 


7530 


21431 


A 


7590 


218 


20 


LSLYLASHLWLKPPHPLEWAPSK3SLRL 
GTVASPCNPSTLGGRGGWVT*GQESETS 
LANMAKPCFY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 

corres"p"ond 
ing to first 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metiiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


7531 


21432 


A 


7591 


142 


12 


QIDYYYYLFI*RKGLTLMLRLVSNSWLH 
I ILLPWPPKVLGLQM 


7532 


21433 


A 


7592 


2 


224 


QIIEALLrTILLGHYFTLLQA*EYSKAP 
FTISDGIYGSTFFVTTGFHGLHGIIGST 
FLTICYNPQLIFHFTSEHKFGLQAAP*Y 
WLFVDVARLFVPASIY**GSLFYPPTSL 
RVLQGSLHHFRRDLELNIFCHHRLPRTP 
RHYWLNVPHYLLQPPTNISLYLRT 


7533 


21434 


A 


7593 


1 


323 


HAYGNTLVA*LWTMTP*LPLVDGYIKK 
FTPYBGGFBPKSAARVLFS IKVFLVAIT 
FLLSDLQSALLLPLP*AVQTTNLPLIVM 
AALLLDIILALSLAYA*LLKGLE 


7534 


21435 


A 


7594 


1 


104 


RACVCVCGVCVCVLCLLKMCVCVCVVCV 
CVFCAC* KMCVCVCWCVCv/FCAC 


7535 


21436 


A 


7595 


383 


43 


VFISETLCEHVAWSQTTIESLGLGTVAH 
TLNASTLRG*GRMIA*GQ*SKTRRGNMD 
SISSHSQTIAQSPHLLQRHQPKWCMCVQ 
AABAVLKQQVETDASFQIQKIPINSLIN 

L 


7536 


21437 


A 


7596 


2 


331 


WPPTGITPLNPLEGPLLNTGGLLA*GVg 
MT*AHHSLGENNRNQVIQALVITILLGL 
YFTLLRASEYFESPFTISDGVDESTFFV 
TTGFHGLHVIIGSTLLSIWFIRQLICNF 
TSKHHFGFKAVA*YWLNFTKYLVHPPTN 


7537 


21438 


A 


7597 


384 


23 


LFKTKQYRFLL?PFPLKIFFFPLKA*IF 
LGGFSHYFPPPKKGFPPKSPPWF*IPPP 
YRKKHFCSHTPKNLAPRWFF*KRPPHFP 
I FFF* *RWGLAMLPRLFSNS *AQAVLLL 
QSPEVLGL 


7538 


21439 


A 


7598 


401 


189 


RVLPCFAGWS*TPGLKCPTCLaL*KCWD 
YRREPPCLIRFYLFVYFLRQGLAMLPS L 
VLNSWP*VTLLP*SP 


7539 


21440 


A 


7599 


81 


368 


KCVIYPFLSITLGKAKYDFFFFLEKELa 
FFPQGEMQGSNIiG*LKPPPLR*RDFSCL 
ALPRGGNGRGAPPSPTNFGFLGGNGVYP 
SGPGGFETPDLK 


7540 


21441 


A 


7600 


264 


25 


AVEHPQLRLFCSHY*NNIVEERLSVLAR 
TCNPCPLEDKAAWIT*RHEFETSLANMV 
I FCLD IGSKSFLSMRFSRCLLWKL 


7541 


21442 


A 


7601 


1 


196 


WEPSLVGETNVNSENQKYINWPGAVAHT 
YNAGTIiGGQGGWIT*GQELEITLSKKVK 
PSTYFTNTR 


7542 


21443 


A 


7602 


315 


S2 


TCTQVFIAIiFITAKKFKLICPLTDNWIS 
RVWYVHTMKHISAIKRKAVIHATMWVKL 
ENIILSEIKQSQKATGCMISFI 


7543 


21444 


A 


7603 


334 


118 


PKIKTEGGPPPKKNAGGMYK*KKFILVL 
AAPPHPPFGVFSVFSFYSFWFFRFFFFF 
FFFFFFFFFFFFFPLR 


7544 


21445 




7604 


3 


288 


DAYTT*NILSSVGSFISLTAVILIIFMI 
*EASSSSSSPPGGLRPEAFPVRPPGAGL 
VF*VSVPPDLPQCSLAGLQVLRFMELAR 
LAPHIRWriQ 


7545 


21446 


A 


7605 


354 


33 


EGRARYTRVPQRPRFHLLGIFFAPQLGK 
KKKF S F S KKKKKKNQ LWLGAVSHAWNLS 
TKGG*GRGSRFT*A*EFRTSQKLGTKGN 
MVKPISTNTFKNERGGAELSQL 


7546 


21447 


A 


7606 


401 


161 


YLVSHLLQIILPPA1FWIFFF*DRVLL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Prediet- 

nucle- 
otide 
location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, lVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1 










CGPGWSAWWSYCSLCLLGSNSSPCQSL 
PSSWDYRHLLWRFLMPRGILCVYV 


7547 


21448 


A 


7607 


354 


50 


SGAQDPAGQHGQTPFSQKVPKKVYPAWW 
CMWPATQEA* LKKSLEPWRRRFLLREA 
FFVYLGVWLSNGEPPPLGGGGSVFLEAL 
E KSVGFS SLGFGWVFFF 


7548 


21449 


A 


7608 


370 


115 


Tl KNI CNSWEVKl PRFTGVWKKLI PATL 
MDDLEGFKTLVEEPTEDW*ITGELEVE 
PENVIEFLQSHDQTSTDEGIiLLMDEQRR 
W 


7549 


21450 


A 


7609 


120 


358 


TTAVYLGHYCQSP*ILNGTVIIT*PPWH 
KDPILFLTPFPCFHASSAMNLQLSHISC 
NSKATPHPLGYQQTYPPLTVHMT 


7550 


21451 


A 


7610 


3 


164 


HE* AS IP *NTHTHTHTHTHTHTHTHFYK 
EENLSHLWNI SCI FLGAHKGKKKMKT 


7551 


21452 


A 


7611 


85 


324 


FRFFAFFFF*KKISFCPQAGGQGGILPS 
LDPPPPGLKKFSGLTPPRSWN*GPCPPP 
GLIGVFLKKRGFPLVGREGPNLWT 


7552 


21453 


A 


7612 


268 


317 


FLENCFNPGGGGCSQPKLGPCPPAWGAK 
*ASLSKKKKNKNKN*LSyKAPSYGYKGK 


7553 


21454 


A 


7613 


101 


339 


AVPLTMVKIHALWKRVWRFHNKSKL* LP 
CDPAISLVSMYPKEMKSVCLKDVCSPRL 
ITGPLTIAGMWNPPNRSSMDYSG 


7554 


21455 




7614 


366 


208 


LELLTSGDLPASASQSAGITGISHRARP 
GTLFF*AVNGGGNQVFLFLRVriNGL 


7555 


21456 


A 


7615 


30 


328 


NYCLDFHGETELGTTHFLTHPSTRA^WF 
EPYCQAL*IL*QVINT*PTVIjHRNRSHI 
KAACLCLQARTVVlJLQISHI]SrWK5f*TTP 
HPri*YQQTYPALEYI 


7556 


21457 


A 


7616 


39 


159 


TPGLKQFFHLGLPRCWDYRHEPPRPCPN 
MYLI*ISF*CV 


7557 


21458 


A 


7617 


290 


91 


KMSFLIFFRDMaSLCCPGWT*TPGLKQS 
SCFHLPRSWDHRCTSLHLAVKTFLYYFL 
KMFYYQNVWH 


7558 


21459 


A 


7618 


37 


246 


TQQELVYIYRYIDTHTHTHTHTHTHTHS 
HTHYWIYTWKHTLKGPKLF*EKTKMGST 
PIISWKERLSLYRS 


7559 


21460 


A 


7619 


23 


190 


IPPQPCPFLLSLFCT*IDMSC2SnyiCACVC 
VCVCVYVCVICDMVLjPFCFCKLECSK 


7560 


21461 


A 


7620 


367 


2 


FFSPKALIFLGGGGPIFPPPKKSFFSKI 
PPGGFFPLPGFLKPPPPPLF*NPPLKKK 
NFFFPPPENWftPPRVFFKGPPPFFFFFF 
FFFFPFFWSSRPL IFPQFFFLNNKIGKK 
KIFRHLGAG 


7561 


21462 


A 


7621 


6 


25 


SVILQSIIIFIATTNLLGLLPHSLTPTT 
QLSINLaMAIPL*AGAEIIGFRSKIKNA 

LAHFLPQGTPTPIiIPILVIIETISLLIQ 
P lALAERLTANI TAGHLUMHL IGSTTLA 
ISTI*LLYYNP 


7562 


21463 




7622 


383 


67 


AFLNPGKKPFPPGFKRFFCPPPPKKWGK 
NRGPPPRGKIFFFFFFVKTKFFHVGPGG 
F* ILTSGDPPGQQGGLGGFFFFFQIGGT 
GDLNLSHKSPMPFTINTHNKG 


7563 


21464 


A 


7623 


372 


35 


TGGFWGVFFFFFFKQGFPFKSTGGNLGP 
KNPCPKGPRVSPPPPPF*NRGPQGGPPP 
PGKKFPFFFFFFFIETGFPHVALAGLEL 
PGSGNLPTSVSQSAGI PGl STWPNGVFQ 


7564 


21465 


A 


7624 


287 


6 


LSGVEWVDHMIWICLFPKKLPPCV* KWL 
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NO: of 
nucleotide 
sequence 
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peptide 
sequence 


M 
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od 
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09/515,1 
26 
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nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YQ PVAVAHACNPSTLGGRDRWIPGGQBF 
KTTCLSLPKRWAHRREPLYLAFCPFYNG 

IGCLLIIGF 


7565 


21466 




7625 


271 


277 


NRGKGKTGSLC-ILEKNPFFGGGPFWANP 
PP*GLK?WGGVGPICRPPPKKRVFFQNSQ 
GTSFPLPP VLKSGPGLVLKKAPQKEKKI 
TFQPPGKFGPPRGSLKRA.PPFFFFFFFS 
LRKSSVIEK 


7566 


21467 


A 


7626 




269 


LLGELSVLVC* FPPKHTTFI.LTGLS ILF 
TGLYSLYILTTTQWGSLTHHIINLKPLF 
TP KNTLMF IHLYP I LLLSLNPDI I KKKK 
KGGAF 


7567 


21468 


A 


7627 


397 


164 


FPPPKKVFFQKNPKKKFSPPRVF*TPPP 
PPFFLPPPKKKKFFFSPPP*IWPPPGIF 
*KPPPLFFFFFFFFFFFFFFK 


7568 


21469 


A 


7628 


3 


324 


DGiVINVSTLYGATGSHGLHDI MGSALLTI 
WYIRQLILHFT S KHHFGFEAAA* YWHFV 
DVD*LFLCGSIY**GSPGGPKFSRGGKG 
lAVPHWGDD 


7569 


21470 


A 


7629 




246 


SGLGTTLS*KNISVLLTGLKILVT6LYS 
VYIFTATQWGSLTHHINKIKP*LTRENT 
LMFIHLCP ILLLSLNPDISYPGFTS 


7570 


21471 


A 


7630 


112 


290 


LI SGKGQWTQMRLPVLVTLFTDS FS K* K 
NPSTLGEQCGR IT * GQELETSLGNI VRP 
HLY 


7571 


21472 




7631 


248 


2 


KVPASGKRPEPHWYPYHAVPGCGRIPAT 
AI WTQKWYQGFPPALS PRLE CSGAI SAH 
CYFGFPGPSDSPASPS *LAGTTGTRP 


7572 


21473 


A 


7632 


383 


106 


GWGQ CFKPAI PALERPNPQECFRPGVLN 
QP *PQNKSPFFFFFFKECKKKYIYICMW 
HAYNPSYSGG*GGRIT*AQKFKTSLrrNT 
VRPHLYKK 


7573 


21474 


A 


7633 


435 


133 


YACLGLPKCWDSRHEPPGLANFCIFSKN 
RGPP ISDRWINSWPRVFG* PKPPKMGG 
FQTLVQGRSLGFLLIFLKTGFPFFSQHE 
EQGWDTSSFQHRFFRGK 


7574 


21475 




7634 


• 


209 


NAYRIVILCQKEiB'PLLSGKMNFKKTRCW 
LGMVAHACSPSTLGG*GGRIA*AQEFGT 
SLGSILRPCHCKK 


7575 


21476 


A 


7635 


117 


495 


STSFIDKVQLRHSFLFFSLFFETKSCSL 
TQTGVQWHDFGSL*PPPPRFK*FSHLGP 
LSSWDYRHftPPRPASFFVFLVEMRFHHV 
GQAGLELLTSGDPPASASQSAGITGIMSH 
HALAHLYFSSGKS 


7576 


21477 


A 


7636 


1 


241 


LPGSSDSPTSASQVAGlfGManESCPF" 
FLAP * L I FNF * FFFGNS FFFVPQAGGHW 
GIGG* LNPPPPGLKRFSCLTLWGG 


7577 


21478 




7637 


3 


93 


EAEAGESLEPGDRGCSKLRSCHYTPAWQ 
QSVTLVSKKETKNTKTSEVPFGVLNLLY 
KVSINILIFKEIFYQAPIiILV*DNPDSS 
TLSSALQPGQQSKTVSNKTKQKTVS*PG 
NRV 


7578 


21479 


A 


7638 


46 


228 


GIPHIQPSKSVFMNVHGPGTVAHTRNPN 
TLE Y* GGS I T* GHEFETSLGNIVRLPPP 

SLQSI 




21480 


A 


7639 


1212 


1023 


KNHSNEWI KKM* YI CTMEY* AIRRNE IM 
AFAAT *MKLETI ILSEVTQEWKTKYCMF 
SLVSGS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

iocation 
correspon 
ding to 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Hlstidine, I=Isoleucine, 
K=Lysine, lj=Leucine, M=lVIetlilonine, 
N=Asparagine, P=Proliae, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireoninc,V-VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=XJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=po5sible 
nucleotide insertion 


7580 


21481 




7640 


479 


318 


AEFLNLGDGGCSEPR*HHCTPAWATRAK 
FHLKRKKK 


7581 


21482 


A 


7641 


470 


172 


RNIRPRENQYPVFMVAFFLIAK]SIW*QPK 
CPSSEEAMNKLRCIHTTAVLHSSEEEQP 
TDTHNNSDLRS I AP SWRS QTRAAAHRMI 
PWHSGKGKTPSTENA 


7582 


21483 




7642 


2 • 


201 


AGAPPPAWLPPCRLISDC*ASNQRDSVG 
VGPSEPYATVRKYLELLLCQiVlHHDMCTY 
RFSIRIVLNL 


7583 


21484 


A 


7643 


521 


385 


GGSPLETH*YTSQGGGIPYTDLTGHHPS 
QGRIQEAPKLTHL 


7584 


21485 


A 


7644 


331 


3 


DLVPKKGGGGKKTTPRIKKFPPKGEDFF 
FFTPIIYFWFV\n/Fj'WGGAFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFT*DYYIS 
F*EERFWKFYRCQS TLVQ IGHTSNL 


7585 


21486 


A 


7645 


88 


367 


HCSFLLLGKKKKKKKKKKKKKKKKKKKK 
KKKKTKKNEYKNLLGGapp*KPPSGALY 
KYRRKNFI PFSFERVS KYPPRDLCKKNL 
FWGGLNIPS 


7586 


21487 


A 


7646 


378 


37 


FFFPXFFFFFSPPPXFFFLFXFFXXPFF 
XXSFFSLPPLPLXXPPPPPPLFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFNI*TI 
VL 


7587 


21488 


A 


7647 


34 


293 


TGIPGVISAIFiyiI*EAFVSKRKVLIGKK 
PSINLE*LYGCPPPYHTFEEPVYIKSTQ 

KRKESNPPKLGSSQPHGLHDFFKEKKKI 


7588 


214S9 




7648 


22 


106 


ENFLPLTLTLL*KNFLPLTLTLLI*YGS 
IPITISSIPPQT 


7589 


21490 


A 


7649 




231 


FDRTISKDISK*MVLVSKKKKKKQKKKK 
KKKKKKKKKKKKGGGPFKKTPGGAPKNP 
GEEKI IFFFLGGGKKNTWGFF 


7590 


21491 


A 


7650 


79 


286 


WSRCLVSLRNPCI IKTCS*AKKKKKKK 
KKKKKKKEOOCKWGGi^FLKIPWGGPIFPG 
GGKFFFFFFGGGY 


7591 


21492 


A 


7651 


57 


375 


GRWIT*GQEFETSPQGDPISTKKKKKKR 
GALLKDSLGGPNLPGFSNLKVTSFRGGI 
LKPTWEFWEGTFILGGEKIGPN 


7592 


21493 


A 


7652 


3 


357 


LAFLIiALSKITHASIPIPVSSPSKSPRS 
KGTKKKKKKKKKKKKRGGPLKKTLGGPK 
INGGKKKKIFFFKGGEKKTPGGILEKKL 
FLGGGZMGPNPPKKIKPLGEKKISrF*GEK 
GEKKP 


7593 


21494 


A 


7653 


113 


378 


MGAFNLGPFLLMGGGNPSGPNGHWVLGA 

GSYLGPGKTPPKKSQKGGPPPDL*GRGG 
QCSPGTKPGGGGEKTGLftPPFTKSPLGF 
I.QKK 


7394 


21495 




7654 


341 


147 


NFFFSLKPPIFPGGFCPIFPPPKKSFFS 
KIPPWFFSPPF*EKIFFFPPPLNFAPP 
RVFFKGPP 


7595 


21496 


A 


7655 


395 


3 


FFFFFFFFFFFFRGKSE*FVIiFPVSPAP 

SLRG 


7596 


21497 


A 


7656 


2 


162 


ESKGYTHTHTHTHIYTHIYTHTYTPREH 
* PKKADVAMLILNKLFFKPRCVALL 


7597 


21498 




7657 


333 


28 


LEFFFNFFFSYFLFFS*RVWGPERPPPP 
LKKAV*KFFIiFDIFLFFFFFIFGIiGFFF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=<Jlutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLEHIRLLMFVTFSSHR 


7598 


21499 


A 


7658 


1 


137 


HSSLGDRVRPCLKKKKKKKEKKKKKKKK 
KKKKKKKKKKKKGGGLF*EKKKKKKKKK 
KKKKKKKKKKKGGGLF 


7599 


21500 


A 


7659 


236 


32 


IIFLIKFLYF*FIFI*F*FFLTFFYFKR 
LKILFFFFFFFFFFFFFFFFFF*KNFIK 
NIIYFFIFYLFYYLKFFIYIKTFLFLKI 


7600 


21501 


A 


7660 


301 


65 


KAPPKIFWFFFFVFFFFFVSFYIFFFFF 
FLFFFFFFFFFFFFFFFFFFFFFFFWKS 
LART* FLRGFETF*LAYILKTT 


7601 


21502 


A 


7661 


70 


273 


KHSPPHKPASD*NTELTIKKKKKKKKKK 
KKKKKKKKKKKKKKKKKGGGALI KNPWG 
GPIHRGGEKFFFSFLRRD 


7602 


21503 


A 


7662 


1 


353 


I LI INTLLALLLI I ITFGLPQLSGCLQK 
STPYEGGSDPISPARGRFSLKGPLQAMT 
FLL * E LE I ALL * PLP *ALQTTNLPL IDM 
GCLLLNI ILCLSLAYE*LP*GLDCSHYH 
SQRIQ 


7603 


21504 


A 


7663 


1 


131 


FIRQLIFHFTSKHHFGFEAAA*YWHFVD 
VA*LFLYVSIY**GS 


7604 


21505 


A 


7664 


2 


192 
~ ' — " 


PLSQSLY*LriAaDLLILT* IGGQPVSYP 
FTIIGQVASVLYFTTILILIPTISLIEN 
KILKWAY 


7605 


21506 


^ 


7665 


356 




TFFPLSSPGGSCXSSPTPARPKKNPPPWG 
GGLPLFSRRGALFPKNFFWGGYPFFFLF 
* KKPPFFPPPGPQSPVTSPKDWPPIjRM 
PPPRPHVRPLGLPKKSFSSPRWEKQVKK 
TKKRAA 


7606 


21507 


A 


7666 


312 




GFFFFFFFFFFFL*DRVIiLLLPRLECNG 
VISAH 


7607 


21508 


A 


7667 




233 


PWLSSNAEFDSANSCECLEVQRMIQDQD 
SFPTYHYFDMYVCIYV**RSLAVSPGLV 
SNSWPQAILQPQP PQSL6LQE 


7608 


21509 


A 


7668 


174 


295 


I FFLP FCGEHSLAVLPRLVLMSWTQAIL 
PPLPP*ILGLQA 


7609 


21510 


A 


7669 


183 


330 


NKFKLYWSPVAMAHACNPRTHRGGWIT* 
CEEFETTLftNMVNMVKPCLY 


7610 


21511 


A 


7670 


3 


319 


TSNTLLALLLI IITF*LPQLNGYIEKST 
PYESGFDPISPARVPFYIKFFLVAITFL 
LFDLEIALLLPLP*AVQTTNLPLIVMSS 
LLLIIILPLALAYE*LQKGLD 


7611 


21512 


A 


7671 


1 


313 


AEGERERERERERERERERERESGGGGP 
TQTDCKGGRNT*RGREIYRESE*DDRPP 
FLPTYRVNLQRPVGLRRLKGAGDKTFCL 
ILTLARDYVWPDYRMKRADHM 


7612 


21513 


A 


7672 


3 


91 


TRRERERDR*REREREREREIFREKNSQ 
S 


7613 


21514 


A 


7673 


T 


115 


DELANLF 1 1 * KAGFPVLPRLVLNSWLQV 
ILLPWPPRLN 


7614 


21515 


A 


7674 


3 


232 


TRREREHERERERERERERERERERERE 
RYRE*DRERERERERDCVWGGGVI SLSR 
AVALSGAPAVGRQTRERI SR 


7615 


21516 


A 


7675 




234 


ARGERERERERERERERERERERERERE 
REIERERERLTQRERERERGRGVWHAPL 
AIERG*KPQGGFWWKEERVKKEPLGKLL 


7616 


21517 


A 


7676 


1 


262 


ARGERERERERERERERERERERERERE 
RERERERERERERERARAPQNL * ERWAH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 

last^amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Argiriine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TPPAPGVSLSWREHTRGGPPWGKKEGGW 
GRLGHPPPS ILRAKGG 


7617 


21518 


A 


7677 


214 


2 


QQASVAQAGVKWPDLCSLQSMPPRFK*F 
S CL S L LS S WD YG P P P P CAGLH I WRRNVT 
LALQGLPTTSLALV 


7618 


21519 


A 


7678 




341 


HERHELEELIKTLFFFFFFRKKNHLCKP 
GGKKRGPIWFLKKFFSPPQKKPPPPPPF 
LRE IGAMAPQPGGGLLI *NKTPFRPRGQ 
GGPKPPTPGKPNPPPPPKGNFPL*GPNP 
QPPGSPTPPPPQKVIFPFEGHQA 


7619 


21520 


A 


7679 


1 


357 


GTR INTLLALLLI I ITF*LPQLTGYI KK 
STPYECGFDPISPARVPFSIKFFLVAIT 
FLLLDLE IALLLPLP*ALQTTNLPLIDM 
SSLLLI I ILTLSLfl.YE*LQKGLD*TDFY 

SVGGEAW 


7620 


21521 


A 


7680 


365 




PDASQ*HCTNGQSNRLLNLLIKALLTTA 
KIWIQPKCPPVDARIKKMQ* IPRMEHPS 
AIKKEILPSVATWMSLEDTVPSRISQAQ 
KDKYHTVSCMWNLRMLNAEPEGRPWPR 
GVQGWGAASA 


7621 


21522 


A 


7681 


387 


3 


FFFKKEFCPRKKAKK*NWGPGNLGPRG* 
KNFPPPPPQKRGKKGGPPQGGEILGFL* 
KKGFPPMGRGGSKLWPLGFSPLNPPKRG 
GKKGEPPPPPQRGKPGGPPPKGLFFFFF 
FFLRQSLTLSPRLLV 


7622 


21523 


A 


7682 


107 


290 


ELNKRWGLGAVAHACNPST* * ELNKRW3 
LGAVAHACNPSTLGG*GRWIT*ALEFKI 
NLDNKARSHLKIKKQHQQQKNSGLDA 


7623 


21524 


A 


7683 




348 


GTREHKPTWLWYHRER01ERDCSCVPGS 
S GI SESRVWVQVGMEVYPAI I YLCLEPK 
YLVISEFSPAFRIWFLGLSWAHTCNPS 
NLGGQGGRIT* SQEFKTNLftNIVRPHLF 
FFLF 


7624 


21525 


A 


7684 


364 


77 


GGPSIFPRLALLGGGGPGGPPLPRGNFS 
FFNFFVKKGVPFPPGVFPPLPPGVLPPP 
PPFWAPG*PRPPNFFLGftPGFFPFFFLG 
GGFFPVAPGGV 


7625 


21526 


A 


7685 


3 


24 


HEETn:iQVKRKPVE*KKIFMIPTFKrKGL 
TSEIYKKQLNRKKPNNPI*KQSSK 


7626 


21527 


A 


7686 




125 


GTRPffl1P6TYSNYPDTYTA*KIISSIGS 
FISKTAFQPGPWE 


7627 


21528 


A 


7687 


459 


298 


LSLPSSWNHMCaLPHPASFFL*RQGLAL 
LPRLVSKSWAQVMLLPWPSKVLGLQV 


7628 


21529 


A 


7688 


374 


248 


FPHHNVHEVIVRPLSPRYPIISHVTCTY 
RSRWP*ASEGSQKK 


7629 


21530 


A 


7689 


284 


153 


GTVAHTCNLSTLRGEGGWIT*GQEIDTS 
PAWPPKVLGLQA 


7630 


21531 


A 


7690 


83 


255 


K VD YVS I KS E FF * DRVSL YHP S * GTVAR 
S * LTTTPWPGI KQSTCPTVPSSWDYHHV 
LPLPTRTIFSVPTQSLATILI*ITLNLY 
INLGKIHFL 


7631 


21532 


A 


7691 


3 


193 


HERLDPADFSFNFFSVDVGLAMFPRLVL 
NS*HQVILLLWPLTVLDyEHELLRPAWG 

FYYLNFT 


7632 


21533 


A 


7692 


1 


364 


NFKSFFP*LFYLyHLLFFGFGLFLFFLI 
KLGSSLFINFSKKSVL*FTNWIYFSVLY 
FTDFCFIFIISFFGFSLIYFIIAVFPLK 
FIGFILSFI *ITVFyGQAQWLTPVI PIL 
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SEQID 
NO: of 
nncleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,y'=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WVAETOGSL 


7633 


21534 


A 


7693 


370 


151 


GKKYSVASPGKLWPPQGFFKTAPPFFFF 
FFLK*RCGFTMLPRLVLNPWPQGILPPQ 
PPKVLGLQVLRHHIWLH 


7634 


21535 


A 


7694 


3 


373 


WPGYTLNQAYAKIHFTIIFIGANLTFFP 
QHFLGLSGMPHRYSDYPDAYTT*NILSS 
AG S F I S I TAV I LI I FM I * E AFAS KHKVL 
IVEEPSQSAGITGVSHCARAEYLFIDRR 
DGLSLCWPGWT 


7635 


21536 


A 


7695 


406 


394 


KFF*KKFLFPPTPTFLLEACLFFFLPPI 
KGFLFKYIAPGIKTPPPKKEKFFSSLKV 
LFSPPYFFFIPPPPIFFFFFFFFFFFFF 
FFFFFPFFFFFFFFFFFFFFFFFFRFEG 
ECWRL*W* FYFYFFKESPHCIP 


7636 


21537 


A 


7696 


2 


334 


ILINNTFLALLLLI ITFWVPQLKGYI KK 
STPYECGFDPISPARVPFSIKFFLAAIT 
FLLFDLEIALLLPLP*ALQTTNLPLIVM 
ASLLLI IILALSLAYE*LQKGLD*AE 


7637 


21538 


A 


7697 




95 


LNLTIYIILTTTAFLLLNLNSSTTTLLL 
SRT*NKLT*LTIYIILTTTAFLLLNLNS 
STTTLLLSRT 


7638 


21539 


A 


7698 


423 


326 


RQCLTMLPRLVSNS *SQVILIiPWPLKVL 
GSQA 


7639 


21540 


A 


7699 


400 


32 


FHKEYESYFFSPNQPQFFFFFPHR*NFY 
VGVLKKQPPKKKFFFLLFTPERFFFPFL 
*KKNIFFFP*YFFFPLVIFFL*PPPLFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFSSWSFPFY 


7640 


21541 


A 


7700 


125 


407 


FINFSSTFWKPTTCNMQKHTPIARTKD 
LC IT I FFFFLEKNFLFVPQVGGQGGI LG 
*LKILriRG*KQFSCFTIiPGHWTNGGIPP 
PPKNFLKKFF 


7641 


21542 


A 


7701 


50 


225 


PWNMV™SCWLGKVAHVasrPSTLGGRGR 
RIA*AQEFETS*VTQ*DPVSVIKKRKKN 
FK 


7642 


21543 


A 


7702 


2 


146 


EIALLLPLP*ALQTTNLPLIVMSSLLLI 
I lUUiSIiAYE * LQKGLD*AE 


7643 


21544 


A 


7703 




93 


ILI IHTLLALLLLIIMTMGLPQPKGYIKK 
STP*ECGFDPIFPARVPFSIKIFLIAIT 
FLLFDLETALLLPLP*ALQTTNIiPLIVM 
ASLLLIIIIiSLSLAYE*LQKGLD*AriLL 
LIMTMGLPQPKGYIKKSTP 


7644 


21545 


A 


7704 


2 


112 


GRVGKHHFGFEflAA*YWHFVDVG*LFLY 
VSIY**GS 


7645 


21546 


A 


7705 


1 


325 


TAGQFLPKLS ILLS YNPAITFLGIYPKI 
LKTYVYMKTCTWMFIAALFIWQTWKQQ 

SKLWYIQTIKYYSVLK*NELSSYENTSK 
KLRCILLRERSQYKKPPYFLIPTM 


7646 


21547 


A 


7706 


2 


169 


SRSRAGTLAI*TIWLPCTLMIFTILIVL 
TILEIAGALIQAYVFTLWSLYLHDST 


7647 21548 


A 


7707 


263 


346 


MLINVPLGLLFVGVI LS KES PS VDQGGV 
QWINLFSMQPPPTGFK*FSCLKA*ASDN 
LSPHEQYRIiALSFLKLTL 


7648 1 21549 


A 


7708 


343 


163 


PKEF**RQGFFHVGQAGFKLLRSGNPPP 
S ASQNGKITGVNPLAWQTNNS I PPMAP 


7649 j 21550 


A 


7709 


459 


144 


FWPGASSHAFDPTTLGGRGGRIA*AHEF 
KTSLGNIVRPPSDTCNPKVIGLLA 


7650 21551 


A 


7710 


2 


83 


KFLPLTIiALLI * YVS IP IT I S S I PP QT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/SlS,l 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


7651 


21552 


A 


7711 


19 


181 


KYVHTKACT*MFISSLFIITKNWKQPIF 
SSIGE*VNKT*YIHTMld:YLLFRNKHK 


7652 


21553 


A 


7712 


392 


49 


SD I FNSDIFSHY *CSVF*S YIAYNRYLS 
FL*LFLEFIYSYP*IDHM'HCFLHEVSS 
FWLDFFLFIFSFFLFSYFFSYFYF*FYF 
*FIFNFIFNLFFILFYF*FSFFSKCLKF 
PC 


7653 


21554 


A 


7713 


334 


208 


FFF* EWVLTTLPRLVLNPQAPAILLPWP 
PKVLGFTGVSHIKI 


7654 


21555 


A 


7714 


2 


408 


WNGMEWNGLKWNRLERYGMQWMGLEWNR 
TE * TRMVWNAMEWTR I ELNGLE WNGME C 
NGMHliNGLEWNGMERMGTDPEWNGTDPE 
WNGTDPEWNGTEIWIGMEWNEVDSNGMEW 
NGIDSNEMGWTRMEWNGMESTRV 


7655 


21556 


A 


7715 


362 


223 


VWWFMPIIPTLWEAEVGG*LEPRSSKPA 
WATWQDAI FKKKFKHQHP 


7656 


21557 


A 


7716 


2 


335 


LIVIINTLLALLLIIITF*LPQLNGYIE 
KSTPYECGFDPISPARVPFSIKIFLVAI 
TFLLFDLE lALLLPLP *ALQTTNLPL lA 
MS S LLL 1 1 ILALS LAYK * LQKGLD *AE 


7657 


21558 




7717 


2 


142 


S I TL * LPQLNGY lEKTTPYECGFEEICP 
ARVPLS IKFFLEAI TFLLFDLE lALLLP 
LP*ALQTDDLPLrVMTSLLLIIILTLSL 
AYE*LQKGSD*AY>IPRPRPFVHKILLRS 
YYLLII 


7658 


21559 


A 


7718 


6 


92 


FFEAAA*YWHFV1IW*LFLYVSIY**GS 


7659 


21560 


A 


7719 


417 


164 


PLEPLISGReLEQIAPPPKKGSPPKSPR 
WFFFPPP*KKKNYFPPPPKILPPPGFF* 


7660 


21561 


A 


7720 


25 


401 


THNS *DPISIWD* IPHYA*P*TSTVICKKKK 
KKKKKKKKKKKKKKGGGLIKKFRGGPKY 
TGGGKI IFFFFMGGKKKPLGDFLKKNFF 
LGGGNLGKHPPKKLSLQKKKKNFKGRGG 
KKTPLCRRGKKFS 


7661 


21562 


A 


7721 


388 




APPFFFYFFFFIFFFYFFLGVGFSFK*N 
SKVFWISNFLKKILRVFICSLEKGINPF 
LEKCFYMFFFFVAF I IRLNFSLQflKFFL 
LNFALU>FPE IFFFFPFFFFFPQIEDFW 
HPYIEQVYQLSLFSLS 


7662 


21563 


A 


7722 


389 


71 


FFLPHQKQVFFPPPPFKIFFFFSRVFFF 
WGGWAQKPPPPKKFFFLKTPPGFFLPPP 
*KKKFFFFSPRFFLPPPGFFFKPPPPNF 
FFFFFFFFFFFFFFFFFFFFLN 


7663 


21564 


A 


7723 


77 


220 


TPRRGWAYWLMPAYPAILEA*AGGLLEP 
KSLRSTWPTMRNS 1 SPKRK 


7664 


21565 


A 


7724 


16 


258 


ISSFVFKHRSFAMFPRLVIiISrSWPRVVI.P 
PHPPE *LGLQACTTTFSEBrFFDGIFLRP 
LKEDVTARDQREILYVFALFYYGGA 


7665 


21566 


A 


7725 


353 


111 


IFGAGPRFVFKAGGQGANSRSLKPFPPG 
IIKWSFPFSPSKKWEGRG*PPRPPFFFFF 
FFIREGVSLCCQGWSQIPGLKQSS 


7666 


21567 


A 


7726 


2 


185 


MSMGHTRLSSAMTGKPPLSVEDDFEKLI 
WEISGGKLEABIDLDPGKDEDDLLLELS 

*MIDT 


7667 


21568 


A 


7727 


346 


143 


SQAFLYLESFIiLLLLLY*FLFFEMESHS 
VTQDRVQWEHLSSLQPLPPHFQQPSRLT 
LPSSWEAEVGG 


7668 


21569 


A 


7728 


411 


221 


HLRPPGSS YSPAS AS * lAGIPGARHHTQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 
locution 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leuclne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIinc, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














LI FAFLAE IL3QJLAKLYLI PRKPNI YYL 
PNCYINI 


7669 


21570 


A 


7729 


2 


303 


AAASTNMLLFFLGRHLWEWLGHMVGLI 
FTFKKLPNQAGRRGSHL*FQHFGRQRWE 
DHLRSRVQDHPEQHSKTPSSLQKNKMSI 
ESNLLNIHKLFSTGRM 


7670 


21571 


A 


7730 


67 


65 


SICEYCFVSSVFVLKRLFSWQAFSLEG 
CVSCQS*AFLQHYLFKKLYFLY*WGLTM 
LSRLVLNTWPQVIHLPWPPKVLGL*AFC 
GHEA 


7671 


21572 


A 


7731 


354 


238 


QSILDDVAMVLD*QAEVFIVKMWRLLIY 
VHEAKKIGLVK 


7672 


21573 


A 


7732 


33 


186 


RGALAVLSRLILTPGLKESSCLDLPKGW 
DYRKEPPRPGCFF* LMVLVLSF 


7673 


21574 


A 


7733 




181 


VNAGADCS S IGGVP FLQHKKCHGKDYEP 
RGITTLERSYVEETTEHLVSKSK*PLRA 
QINL 


7674 


21575 


A 


7734 


336 


80 


HKLKEPPGVFPVFPFKNLEFGPGPPKFF 
FKKAFSPILSFFFGKIFKIPRFRGENFA 
P*NFGKNS* KPRFS PPPPKKKGFFFFFF 
F 


7675 


21576 


A 


7735 


2 


163 


TPVSTGTPVPTLT*VPSPIIFFVSEKWA 
GCLHLCLNFTCTELRLLTSLLTIRS 


7676 


21577 


A 


7736 


87 


258 


KAPSVCLFSALLMLLRMSARTSVCTVRP 
LSPS*AVISPVTCTYTSRWPEATEDSQK 


7677 


21578 


A 


7737 




103 


LDSKGIYMTLQPSKLEPKLEANVEIREH 
MLENSSRF*RDLYDSAAFKBIITKARSKC 
RDKRAHVGEFF 


7678 


21579 


A 


7738 


193 


300 


GGLPPPPFFFFFFLKKRGFPWVTRGGPN 
PPPLEMGGPTPPKGWNYGGGPPP* FHPF 
GGVGPPISKGGGLGPPRVTQGKPLFFKK 
KKKKNGGGGKPPYSPFLGGVKQKKGVNP 
KGGGSKKPKFPPPPPPGGKKKKPFLKKK 
KKKK 


7679 


21580 


A 


7739 


150 


17 


GRVAQVWMLyiHaiPAL*KaEAGRSLEPR 
SWTPDWGTWHLPISTK 


7680 


21581 


A 


7740 




358 


SPPRPPPPPPPPPKEKFKKLENPPPPPK 
NPQKQNPPPKTKKKS PTTTPKKKPKIGT 
PPQTPKKINCS *KKKNNPPENKKGLNYP 
NKPNHPHAPPKEPKQREKPLVPTKKKKG 
PSPKKKK 


7681 


21582 


A 


7741 


235 


147 


F*FFLGDEISLCCPGWSLTPGLEQASCP 
GLPRRWWYRCEP PHS VKT I F 


7682 


21583 


A 


7742 


131 


12 


AASTYGQ*FKICGAILRLMPIVIEFIPL 
KILQIGPVFTL 


7683 


21584 


A 


7743 


1 


184 


NQYPWCFCNHMSMGKESKTLNRSGMVAH 
TCNPS TLGGQC3GR I T *A*GS KS S LGDKV 


7684 


21585 


A 


7744 


347 


202 


SLFVETGYHYVD*AYLELLASGSPPASA 
SQSFGISSVSLCTQPMFQFE 


7685 


21586 


A 


7745 


138 


263 


KKRKTYIHTKTCM*MFIAALFIRVRKWK 
QSKCHSADEWINKF * YI KI CLS I HQLND 
IWIASTF 


7686 


21587 


A 


7745 


358 


169 


AAAVAERTQNTEKTEDLVLGLWI KKVI Y 
WPGWAHVCNPITLGGRGGWIT*GQEIE 
TSLPTRR 


7687 


21588 


A 


7747 


78 


346 


AIQMLRMQVKIKLPKSPVEII PFKIVPQ 
FQVQKIKSQNNFLLLFQESQIFFATSTG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V--Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SVRPLSPS*AIISPVTCTYTSRWPEVTE 
KSQKK 


7688 


21589 


A 


7748 


222 


98 


KKKARHGLGAVAHACNPSTLGGQGGRI T 
*VQEANIVRTCLY 


7689 


21590 


A 


7749 


314 


52 


LNNVDLQFFLGPRKKKRNLNDSSLFVSA 
KEVHLMYCFNKLAILHYF*FSNKFIVFV 
* KFGHLLDENMGSKFDNIGMNAMANKDN 


7690 


21591 


A 


7750 


371 


216 


YHTHTHTHTHTHTQIPSLPLNLEGPVGY 
LLSCGSLHLKNLW-KCnyiRKERKK 


7691 


21592 


A 


7751 


3 


238 


FLIQTGFGHVSQAGWELLR*GNLPAIAF 
QSAGITGMSYLCVAESLYLLPPFLKILC 
SSLLTLSPYDTWKVALLCSYTV 


1692 


21593 


A 


7752 


2 


247 


NSDIFDIYIIQYIFLIYLIYIIQYIYLI 
*VFLIIFFKFTLK*EFIFISLSISFRLiA 
FIVYRDVILLFLFFFS*NFVGDSIS 


7693 


21594 


A 


7753 


359 


2 


TLKIRPljLYLPLFFFFFF*KKKXFFFyP 
GGGKGV*SGFYLMGRGTFSLFFKRFFLF 
YPPKEGGLRGSPPPPSFFFFFFKKRGFP 
FLAEGV* KKGPQGTPLPWLPKSLGLRG 


7694 


21595 


A 


7754 


3 


174 


CSVIQAGGQWCGIISSLOP*IPGINQSSY 
LSFPSSWDYRHATMPS*FLFFIEIKMLQ 


7695 


21596 


A 


7755 


292 


219 


IWPLSHVHSNPLTHSPSQVDQGLFTTLL 
PCLSGYMYVQFPLAEMPFISSLLLEMAC 
LSLCEGSEAQKPLLWEGLSYLPPLNQLP 
LCL*LCVCISGGVCRCLMCILGVCVCMH 
VCVSVCPCTHTLAQETKGC 


7696 


21597 


A 


775rt 





195 


VALMADGAIDTBSNDYGAFMPLGIERGL 
DRIWEMPELWLRPNEFDCMTD*PCIQHA 
PSVSCGIA 


7697 


21598 


A 


7757 




280 


DHIVDMITPSFTRRTIAVF*DLNLYIVI 
RGHITSILKPNKNLKLWYIYTIEYYSAL 
KGNEILIHATIWINLE3NMQDEINQTQKD 
lYCTISLI 


7698 


21599 


A 


7758 


352 


107 


FLTRASRGHNGBTIVTSWPGAVAHVCNP 
SSLGGRRRWIT*6QEFKTS]:iANMVITFF 
ISKNSNRFLSQFPISLRPTHYKVLS 


7699 


21600 




7759 


92 


4 


RPSPRRGGWTA*GQEFKISLGIIMVKPCL 


7700 


21601' 




7760 


92 




RPRRERGGWTA*GQEFKTSI.GNMVKPCL 

Y 


7701 


21602 


A 


7761 


328 


124 


PLFSFLFFPFFFFFFFFFSRRDGVLPCC 

PGWS*TPGLKLSSSLCLPKCWNYKHKPL 
HTAHHNF 


7702 


21603 


A 


7762 


1 


167 


VDSTDKRPGAVAHNYNPSTVGSQGGQIT 
*GKFNPSLANTVKLFKKKKKRKRGALF 


7703 


21604 


A 


7763 


309 


226 


RPRRRLTLS PRL * C S GAI SAHCKLRLPG 


7704 


21605 


A 


7764 


349 


7 


QVFCFLFFVCFSS I FMTFNE * QGLKVTS 
GISSNVY*FLLIWILGKITYQDIGRCPS 
E YGS PEQHNLG I VRITflS SRLGWAHTCN 
PITLGG*GGWIT*GEGFETSLAHMGKPL 
LY 


7705 


21606 


A 


7765 


43 


339 


FFFFLFFFFFFI I FLLLSFFF*DHFFVS 
LLFISNFVFLFP I FI FTVFQAHLF I CLL 
FFPICMFLNLFFPNFNYYFYPFPLFFYL 
LFFFFG*FPNHLFYL 


7706 


21607 


A 


7766 


352 


102 


HTSCLIIKSSLSKMKISCVSSSISFFF* 
FILIEIGSR*WQAGLQLLGSTDPPASA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

peptide 
sequence 


M 
etii 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














S QS AR I TGLSHHAQLS STS L WENVE W 




21608 


A 


7767 


346 


90 


TPPSSFFSFPPFPFFSFIPFRFFFFSNR 
DRVLVCYLGWS*TPELK*SSQFGLPKCW 
DFRCGPPCLTSTAFYISITVWFSPSNCY 
S 


nos 


21609 




7768 


278 


87 


AASTGVGPSEPSAGYYLLVCRLLRSLEK 
CS I RVGVT* FSRCHLS PLSLTRKGNS LT 
PCASRVR 


7709 


21610 


A 


7769 


5 


272 


AVYHPIKES CVLNI GKDSCLLY*LFEKK 
KKKKKKKKHKKKKKKKNKHYE* FNTFPG 
VARKINLFLC IQKVDAGGGTLWGGGP P I 
FFFFL 


7710 


21611 


A 


7770 


198 


33 


QRPQSRQCRAWLGMVAHAYNPGTLGG*D 
RRTT*GQEFKTSLGNIVRPYLYEKIJIF 


7711 


21612 


A 


7771 


97 


410 


A*WLMPVIPIIWEVKWRLLEPGSLRPS 
WATWRKP 


7712 


21613 


A 


7772 


5 


415 


ILCVYLHFVGTHPFVHL* YMHFLPLTLA 
LLI*YVSIPITISSIPPYT 


7713 


21614 


A 


7773 


1 


342 


VVRVTSGHSG*AAAYAT*YILSAEGSFF 
PLTEELLI * FMKREAFT*KR* VLI IHEP 
SIHLD*LYGCPTPYHTFPDPVYLNSKRR 
RRDSHPPKLASRLPHVIiEIDPFKKKKGGR 

RF 


7714 


21615 


A 


7774 


398 


122 


SPPPPFPSSPSFFLPPPSSSFFLFFLFF 

CPA*SQTPGLK*YSYHGLPKFWNYWHEP 
PCLAYFL 


7715 


21616 


A 


7775 


1 


336 


FNFLLII IIEMESRSVSAHCKLSLPQSS 
NSSASASRVAGSRGAHHHTWLFFFFFFF 
F * KKKGAP PGGQGG FLTLKiOKNP P PRAP 
QRGGINGGTPRPQNGGGKFFPWGEKKKF 


7716 


21617 


A 


7776 


1456 


1232 


FAIESHCVTCJAGVQWCNLASLQPSPTEF 
K*FSCLGLPSSWDYRCVPPHPANFY1FS 
RDRVSPCWPGWSKTPDLK 


7717 


21618 


A 


7777 


400 


2 


NTE FP YGPP IS LLGM* P *EMETYI PRKP 
CTQMFLEVLFTrAKK*KKPKRS*LING* 
ILVYPLMEYYSATKRFDVMIHTKNVnAI 
GNlILSERHO!l*KTHIVySIHRASCTAE 
VRIKVSYKRAATWIKSILIA 


7718 j 21619 


A 


7778 


360 


218 


PYLflNF*NLL*RWHLNMLPRPDSNSWPQ 
VILSLWPPRVLG*QRGGRVEQRGARGNW 
MSKCPEGGDCLCWVAGA 


7719 


21620 


A 


7779 


1 


372 


FEVES CS VAHAGVQWHDHRGFE FLGS S D 
DPSVSASCISRITQASHCTQG*L*LLTK 
VL*VSAFCLSVNTLRLTSLLKSDTFLGS 
RSKLNSLGESLQTYISPSRHFIQLLCFR 
SLFPRFLVKHKK 


7720 


21621 


A 


7780 


372 


219 


ATSLLSYFKKLPQPPQSSATTTIISQ*S 
STSRQDLPPAKRLELTEGSDDP 


7721 


21622 




7781 


358 


145 


IKNIHDSWEDVKISTLTRV*KELIPVLT 
DNS EGLKTSM* EVTAEWE lARELELEV 
EPKDVLVQFHDTINE 


7722 


21623 


A 


7782 


317 


146 


GRVDCKCPFYRFQMLLVRLGMVAHVCNP 
STLGGRGGWIP*GQEFKASVSRMPRPHL 
Y 


7723 


21624 


A 


7783 


13 


186 


DRVSITQTSWaHPPTSAS*VAWTTGMHH 
HTWLNFVFFVEMGFHHVGQVGIRLPLLS 
WK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 
location 
correspon 
ding to 

peptide 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=TJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7724 


21625 


A 


7784 


339 


221 


KWDLAMLARLVSNSWLQGSSYLSL*NCW 
DYGSEPLCPAL 


7725 


21626 


A 


7785 


355 


3 


FGNSGRVDLFFFFFFLDRVSLCCPGCSA 
VTNLEVHHSALQPQTPGLKPSSHLSLLS 
SWDYRRVPPHPS*SLF**GK*YSITQLR 
FLNLHLES6LRSLLPKPVASTVRIRTQS 

GLGLK 


7726 


21627 


A 


7786 


1 


201 


QQFSHVFRLLFQFSFSCQEPPSSHIQPG 
VCTRNTKVSQAW*CVPVIPANREAETGE 
SLEPGRRSTRP 


7727 


21628 




7787 


270 


1 


RPRRHLVPCVLMLGPKCPGLPHTLTTLP 
YTLTTPSPTRSLGPPECK*SFHLSFPSS 
WDYRRMPQELANFCRGRILLCYPDWSSS 
PGLKQS 


7728 


21629 




7788 


229 


2 


DGWVRINWAQPGTVAHAYNISNLGGRGR 
RIT*GQEFKTSLTNMEKPCLYSDHEVRS 
LRPARPTWRNP I STKNTKM 


7729 


21630 


A 


7789 


118 


263 


CSKSYYWPDAWHTCNPSTLGSQDGWIA 
*AQEFKTSLGNIARPPISVF 


7730 


21631 




7790 


246 


359 


TFIFSETRSHSVTQAGV*WHDHGSLQP* " 

PPWAQVILSC 


7731 


21632 


A 


7791 


70 


102 


AA* SRLTATSLSRVQAIPLPQPPEQLGG 

QGGRIS 


7732 


21633 


A 


7792 


7 


312 


FLDFQLRLHSNSYCEDKGVFDSEIYEIC 
IVLICKK*RKFLNQ KKKKKKKE KPGL I * 
KKKKNSQKKWFDFPWLPPNPETKNSLPL 
PKEFLWLKTKPPFPLTLT 


7733 


21634 


A 


7793 


1 


341 


IKPE*yLLFAYTILRSVPNKLGGVLALL 
LSILILAIIPILHISKRRSIIFRPLSRS 
LY*LLft2ffiLLILT* IGGQPARYPFTI IG 
QVASVLYFTTILILIPTISLSENKVLLW 
A 


7734 


21635 




7794 


1 


143 


YGSTFPVATGPHGLHVIIGSTFLTICFI 
RQL I FHFTS KHHFG PE AZA* YWHF VDW 
* LFLYVS lY* *GSTPFVATGFHGLHVI I 
GSTFLTICFIRQLIPHFTSKEIHFGFEAa. 
A 


7735 


21636 


A 


7795 


2 


257 


KWAI lEEFTKNNSLI IPTI lATITLLNL 
YFYLRLIYSTSITLLPISHNVKIK*QFE 
HTKPTPFLPTLIALTTLLLPISPFILII 
L 


7736 


21637 




7796 


22 


326 


RDASDCS FQNLPVPLWVBK*MVFLLTKK 














AKIIPGKKKIIFFFLEGQKKKLWGFFKK 
KPLFWGGKKRPNPPKKKT 


7737 


21638 




7797 


506 


281 


RERERERERERESPRPPKRQRERETEIQ 
TLSLTVSLAPPPTCVF 


7738 


21639 




7798 


435 


336 


MRSNIHFFSHTHTHTHTHTHTF* ILKQH 


7739 


21640 


A 


7799 




106 


RNPPPLFFS PPLQRGGFPP I YWGPPRFF 
PPPPL*KTPPKLKIGAPPKKKKPPPPPG 
EKMVSF*PPPPFFFFFFFLRRPFPLFSP 
GGGPWGHFRGPPPPPPGV 


7740 


21641 




7800 


2 


296 


FVPSTANWCFFLWSVF*TGGSNYFFVHR 
SYSQn.GVDLIFIRLANTISNLSFI**RG 
GLAMLPRLVLETRAQTILHSWPPKVLGL 
QVWATVPSFQFLKN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO; in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

lon-Mpon 
ding to 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
Lysine, L^Leucine, M-^Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7741 


21642 


A 


7801 


344 


105 


PGFFFFPPPGKRGFFPKPFFLGTPGFFP 
PPLF*NPAP * FFFWAPKKKI FS PPPPGK 
KIFLLKGPPLFFFFFFFFFFFFF 


7742 


21643 


A 


7802 


68 


216 


VHTIAKIWKQPQCPSMDD*IKNMRYLYT 
VE*YSVLKEREILLYWGHYAM 


7743 


21644 


A 


7803 


407 


3 


GFIH*FPLFSGYTLDQTYAKIHFTIIFI 
GVNLAFFPQHFLGLSGMPRRYSDYPDAY 
PT*NILSSVGSFISLTAVILIIFMI*EA 
FASKRKVLIVEEPS INLEWLYGCPPPYH 
TFEEAVYIKSRQKRKESNPPMY 


7744 


21643 


A 


7804 


2 


136 


THRRIIILSQGL*TLLPLIAF**LLiASL 

ANKKKKKKKHGGPFKKFPWGAKI 


7745 


21646 


A 


7805 


2 


309 


THSRIIILSQGLQTLLPLIAF**LLaSL 
ANKKKKKKKGGAPLKKIPGGAKIKRGME 
I KNFS QKGGEKKTQRGIFGKKPYLGGGQ 
NWEKLPKKFKGLKGKKNF 


7746 


21647 


A 


7806 


110 


362 


TLSDLERLLMKAVSHFLMIDLLEKC*CV 
LKNQSSKKKKKKKKKKKKKKKKKKKKKK 
KKKKKGGPLLKNSWGGPNFPGGEKIFFF 
FFRGGF 


7747 


21648 


A 


7807 


295 


108 


KQRGFVFFFFEMESRSWQARWQWCNL 
SSL *PPPLGFKQFS CLRIVYRKDSLSFN 

KPRLLP 


7748 


21649 


A 


7808 


2 


142 


GSTFFVATGFHGLHVIIGSTFLTICFIR 
QLIFHFTSKHHFGFEAAfl*YWHFVDW* 
LFLYVSIY* *GSTFFVATGFHGLHVI IG 
STFLTICFIRQLIFHFTSKHHFGFEftAA 


7749 


21650 


A 


7809 


390 


2 


PASLLHCG*ISDCCASNE*GSVGLGPSE 
PGAGYNLLVCHLLRPLEKCSIRVGVTRF 

SRCCLSPLPLDRKGNSLTPCTSQVRQCL 

ALLQTiTLGATjHPVSCTHCPTISGEMNPV 

S QLEMQKS P I FCVTHAG 


7750 


21651 


A 


7810 




157 


FLHFGQA^LELLTSGDPPASAS'SAGIT 
GVSIIRAQLVCTFITIYVFLI'CNSSY 


7751 


21652 


A 


7811 


368 


44 


QNFPPPKKRLPPQPPLFFFLVPSFKGEG 
FFFFLRGFFFFSSAQRALL*FIYFIFFF 
FFFSFFFFFLYFSISPDSKGDTHHDLLL 
GVSWWTRSLPLWIAEHKTMHKWGW 


7752 


21653 


A 


7812 


400 


80 


PQNSFSPPGIGGFFPPFPL*NFFFSPKA 
FFFLGGFSPFFPPPKKSFFFKIPPGFFF 
SPPLKKKIFFFPPPFFLAPPRFFFKAPP 
PFFFFFFFFFFFFFFFKYFRHI 


7753 


21654 


A 


7813 


119 


230 


SEEFETSLRCIVIPSL*KKKKKKKKKKK 


7754 


21655 


A 


7814 


3 


412 


HEQEHL*LLLP**PLAII*VISTLAETN 
RTPFDLAEGETELDSGFNIEYAAGPFAL 
LFIGEYTDI IIRNTLTTTIFLGTTYDAL 
SPELYTTYFVTKTLLLTSLFL*MRTGYP 
RFRYDQLIHIiL*KNFLPLSIiALLI 


7755 


21656 




7815 


1 


183 


LRLERLSEFSTRRERERERERERERERD 
TRIDIYIVSQKRKKYIV* IRENIFVYHA 
EFSKR 


7756 


21657 


A 


7816 


484 


97 


QPRTPDLK* FACLGLPKCWDYRHKPPCL 
ASDGNHS * SSPGLLVLSSASVSPSGHMS 
PSQQTSPWVSEESLLLGRVPGFLYFPPV 
DAPGPGAWIiAHWQAVLKLLGSSDPPVL 
PSQSAGFAGLSNCAWPW 


7757 


21658 


A 


7817 


2 


219 


ADELLRNS *EA*EREREREREREREREPY 



1026 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQH) 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

location 

correspon 

residue of 


Amino acid sequence (A=Alanine 
OCysteine, I)=Aspartic Acid, 
E^GIutamic Acid, ^^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireoiiine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,y=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPSHIYTDRNRH*MRHEERTLFLWGSGE 
VRDAAPWSHIPNTAL 


775S 


21659 




7818 


190 


292 


NSHI>IFPWTKVFFMS*CTYErLVLPGSEK 
KFYSHA 


7759 


21660 


A 


7819 


498 


139 


RRAATPAPAAAEPPPPLQRP*PPSEPPD 
TQRDARGRRRGEDPGNSPFHPNPDRQPS 
RALCSTPRMHRLWIGPAFFLMTSLSVSG 
AVI PRNGGPGGVSSGPCLLQLLCGQAGS 
STIRKIPS 


7760 


21661 


A 


7820 


489 


330 


ELGFLHVPQAGLELLS *SNPPASASLPT 
SWDYRHDHRTGHHPGIYDSKMCIF 


7761 


21662 




7821 


490 


260 


F FLXXNXFXF FF FXXFXXPX FXXXFX F F 
FFXXXFFFXFFFFFFXFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFLY 


7762 


21663 


A 


7822 


493 


482 


VGQAGLKRLTSGDLPASASQ3AVITGVS 
HEARPIMYFRYVQ*AKGSHV*RMY 


7763 


21664 




7823 


456 


121 


ASFFPIQYKGLiGGATPPQEGGCG*GIPI 
KWGYKRRPGGPHGGSKRPPTQ*KKSYFI 
NVLVLFYLRDKGLAIPTLVSNSWAQVIL 
P PW P PKVLGLQTRVTVPGQN I LKENM F F 


7764 


21665 




7824 


294 


457 


LCIGFVIIjISYFNIMENWYCRPGTVAHT 
CNPSTLGG*GRQITRSRDRDHPGQHG 


7765 


21666 


A 


7825 


26 


235 


SVWWNSQTGNKVKFTISW*IDKQMRCIH 
LMEYYSAIKRNEVLINATTTWINLNTIK 
LSKRSQTLKSNTL, 


7766 


21667 


A 


7826 


30 


303 


SYVSWEFPGSSGPTLPSPWHQTPRTYL 
NSVASAINLTQCPH* PEKKKKKKKKKKK 
KKKKKKKKKKKKKKGGAFKKKGP* KTPK 
KTPGGF 


7767 


21668 


A 


7827 


464 


21 


REPPHPAPADILILHFWPLEL*ELISW 
lntrc wi cddtprkltcqpmprlllqt 
EPL*VLFSNLGSFGSLSKS*MGLVNFFC 
KGTDSRSFSLFVAVFCLFL*RWGLflMLH 
RLVLNCWPQVILPPPPPKLLGLQVEATT 
HEFHGTE 


7768 


21669 


A 


7828 


372 


205 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFPFFFFFFPFFFIKKFFI 


7769 


21670 




7829 


463 




DIYPERRNSKIMSFAATWMKLEAIILK* 

LKNRKPNIVCTHL 


7770 


21671 


A 


7830 


449 


266 


WQMRFLHVGQAGL * LLPSGDPRASQVQ 
SAWITGVSHHAQRS IFFKKRWNDCDRPD 
WTECT 


7771 


21672 


A 


7831 


480 


148 


lELRLSHGAAEFHRPREWYG IFKVLKER 
DFYPRIIYPAKIILRHEGEMKTFPDKQT 
riRHFINTRPVTiQEMTiKKVHQSEIKGC* * 
ATNNHLKVKKKKNSLGW 


7772 


21673 


A 


7832 


23 


131 


QRGNSKGYHLKMIQQEEIRKI.EEEKNQL 
EGEIIHFYKMKJVASEALQTQLSTDTKKD 
KHGKKQ*FL 














KRSSCPDLPNCWDYEHGPPYLaSPVLLK 
*TLSILLLPYYHKMHTCVFMVLCTNFCG 
GVCPGMELLCLKVYGYIiSLKHTFH 


7774 


21675 


A 


7834 


355 


70 


KKTPRGFSGVFLGPFYKKGPLFFFFFFF 
FFFFFFFFFFFFFFFIiFLQFFlLCQQFIi 
S*STEKTVH3KYILSFPEHCLIFCPFrii 

LCSTFCTJGAHY 


7775 


21676 


A 


7835 


494 


282 


PENGMIQGGGACSEPR* *HCTPAWAKER 



1027 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: Of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

location 
correspond 

lmino"cid 
residue of 
peptide 
sequence 


Predict- 

location 
correspon 

residue of 
sequence 


Amino acid sequence (A^AIanine 
C=Cy stein e, JJ=Aspartic Acid, 
£=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Un known, *=Stop cDdon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DSVSNAKTI QNKKNLGN *HS I S TAKAKII 
IIIS*YKSELRKKM 


7776 


21677 




7836 


305 


185 


LVIETCLTVRPLSPS*AIISPVTCTYTF 
RWPEVTEESQKK 


rm 


21678 


A 


7837 


179 


191 


LGCRKYLFYLNGGLKCLY IKCVCVCVC I 
YIGVYIGMCVYICIYIYLMAS*NQVTSF 
LSFYVFLLNYR* IPVKVITVLFNLRMTG 
LP 


7778 


21679 


A 


7838 


466 


343 


LGRLRQENHLPPEGRGCSEPRSRPCTPA 
WETEEDAVSKINK* INK 


1119 


216S0 


A 


7839 


451 


89 


LRDTR*SGVICLPKQAWflMVGAPPPASL 
PPCSLISDCCASNQRDSVGVGPSQPGGG 
YNLLVRRFLSLSEKRSIEVGVNRFSRCH 
LSPLSLTRKGNSLTPCflSRVRQCLALLR 
LAHGARTH 


7780 


21681 


A 


7840 


442 


164 


AHTNQFSQCIKKSVTPDLMEEMYKKRHA 
AI *EHPVYKKRPKKEVKKriRWNCPKMSL 
AQKKHQVA* KKASS LRAQEQ&AES 


7781 


21682 


A 


7841 


494 


359 


ICMLPRLVI.NS*PQVILLPWPPKVLGLQ 
A 


7782 


21683 


A 


7842 


577 


224 


IFFFFNKIFLFRPGLNIWGDIWVPSTPP 
LQG*TNFPPQPSQEVGLQKAPPPPGLIIi 
VFLSRQGFPQLGRVGPHPPPPGDPPPRP 
FKKIWSHPGGGKSPPKKBCKBCKEIAEVLG 
AFRLR 


7783 


21684 


A 


7843 


404 


56 


HTLS TNVCSS YLNLDFFSLRGRGL IMLP 
RriAriKrSWAQEILLPQPPE*LQLIJlRLRQ 
ENLLSPGVQGQPGQHNETPTS 


7784 


21685 


A 


7844 


1 


323 


INTLLftLLLI IITF*LPQGIGYINKSTP 
YEGGFDPISTARVPFCIKPFLEAITPLL 
FDLEIALLLPLP+ALQTTNLPLIAMASL 
LLIIILALSLAYE*LQKBLD*AE 


7785 


21686 


A 


7845 


419 


159 


FLFFFFXFXXYVLS CGIY* YWCNLFSS * 
QYFFFYVMAFRIFLFTFLFLHFYYKMF* 
CTYFYLSCL*FEYICSHIIHYFIYFLHL 
ENF 


7786 


21687 


A 


7846 


475 


124 


FFFFFQPPQKEGGPPPPPKNFFFFFTRI 
FFFGVFFFFFPPQKNFFFF*KTKGVFFF 
PFF * * KI FFFFPGVFFFFPVFFFLSPPP 














FFLKRGQAS 


7787 


21688 


A 


7847 


6 


189 


lidtvspsvj«:agvqwcnhsspqp*pps 

LKRSSRLALPRFWDYRCAPLCKftHSFlirS 

NHEK 


7788 


21689 


A 


7848 


20 


249 


KINFIWELTCSNTVHTFYVYGFDKCIL 
PTTQLFFFL*ETGFHSVflRLECSGVIMA 
HCSLDILGSSWRPANFLNFL 


7789 


21690 




7849 


101 


398 


LFQKKKKKKKKKKKKKGGPLKKKPGGGK 
NKGGEKKKI FFLKGGEKKNPRGNFKKKT 
FFGGGKKGEKPPKKKKS PEEKKKF * RGK 
GGKKSLICWVEKNLG 


7790 


21691 




7850 


87 


226 


GSLSPIMLVLPFLIHLL*KlIFLPLTIiAL 
LI*YVSIPITISSIPPQT 


7791 


21692 


A 


7851 


2 


440 


GAI IRRLLH*FPLFSGYTLHQTYAKRHF 
TIIFIGVNLTFLPQHFLGLSGMPRRYSD 
YPDAYTT*NILSSVGSFISLTAAVLTTL 
MS*EAFASKHKVLLVEEPS IDLERLYGC 
PPPYHTFEEPVYIISRPRRKESNPAKLV 



1028 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 

dingto""" 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
Kf Lysine, L=Leueine, IVI=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T'^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 








78l2""" 


) 


80 


SSQPHG 


7792 


21693 




SSLLLIIILALSIAYE*LQKGLD*AE 


7793 


21694 


A 


7853 


3 


135 


ATFLYFS * K* SLSMLPRLVLNSWTQAI L 
PPWTLKVGLCFFDPL 


7794 


21695 


A 


7854 


38 


419 


F1»5PLHSSLGEGVRPYLKKKKKKPPWE 
GGSPPKNFKNQIFA+NQKPPFFFFIPPN 
KKWGS PPGGGAS PL*SHPSGG PGGPNNK 
VKI SKPPCPPGGTPVFTKTPTFT*PWGP 
IPEKEGRKTVCPRTH 


7795 


21696 


A 


7855 


398 


61 


FFSPRPRGGGFPPPPPKHFFFSPPTIFF 
GGC5WNFFPPPKMVFFLKISPWFFFPL 
IRKKIFNLSPGSFWSPQGFFF*GPPPKF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


7796 


21697 


A 


7856 


1 


121 


LDASGRLPPSPKPIKIKNYNRP*EPK*T 
KICSLHSLPPQS 


7797 


21698 


A 


7857 


1 


332 


TPP lARPAENSARGAEGRRR* I lALQVC 
NKKMPPTKLHCIKKK^CKKKKKKKKKKKK 
KKICKKKKKKKKIiWGGGGGKNFFKKGGKK 
KLGGGFKKRGRKKKK 


7798 


21699 


A 


7858 


3 


191 


SPRHLTRAAGIRHEGQTGLKLLTSGNLP 
ASAS* SAGITGMSHRIGPRMLDLLTLVS 
WKQHGL 


7799 


21700 


A 


7859 


503 


62 


FFFFLNCCLSGLRHLIKLFEPQFLHl*N 
GGMVLSI ILSIKSPLRSPSLSYSWL**P 
REKIKWWGGQVRWFMPINPRTLGGGGGQ 
IA*SQEFKSSLnNMWPYLSEPP 


7800 


21701 


A 


7860 


355 


86 


YIKG*QMVGNRGASQAGMTGYGMPRLIL 


7801 


21702 


A 


7861 


336 


225 


PQLftGPVKTVRKVYKKEKARVIPEEKKN 
FKAFASFRKGRANGRVFGIEAKKAKEAA 
KQDV*KKPPIPLAGGSPTREPPSQAHLF 
PQNPPRGNPP 


7802 


21703 


A 


7862 


465 


170 


GGPPPKTHPFGGERGGKiHRGKKKPPGH 
QRVKPPSPKKKKKNWGRGKNPFTPLIGG 
GQAKKPPQPGRGECP * KKVPPN1ISVI.GK 
KKNFFPKKKKKKKK 


7803 


21704 


A 


7863 


495 


275 


RSFFFFFPPPPPFFFFFFLYaRK*YYPL 
ATKKBCKBCKKKKKKKKKKK 


7804 


21705 


A 


7864 


493 


70 


PLTASISLPVNWRYIAPHRVVKIK*KSV 
Y*KYLA*CLAQSKCPKSVCSHLSSP*LH 
FVPPPNPAEAMWTARQC*CIRGLGGKGT 
LCVAHTYNPSCSGG*GRRIV*AQEFKAS 
LGNSEPLILKKERKKESQAWWLIPI IPA 
F 


7805 


21706 


A 


7S65 


67 


385 


VARTTGMQHHAWLIFFIFIIFVETGSHY 
VAQTSLEFLASSDPFTSVPQSTWITGVS 
YSWS*PGIAHTCNPSALGDRGERIT*GQ 
KFKRSLGWIVRPCLYKKNKNKKN 


7806 


21707 


A 


7866 


454 


0 


LS FFFFPLGDRVTLCCPG* S AVARS QLT 
VALWSWAQAILLPQLHKHVPPCLANF*N 
FF*RQRYTMLHRLVLWSWAQVILQPWPT 
CPTAAQA 


7807 


21708 


A 


7867 


1619 


698 


PATS SS SS S S SS SSAAA AVfl AAATAASL 
PPCRPPLRPPQPLAPHEVATLAPWRG*K 
PSFTSSGKRWLTSEPQTSGSCASCL*SM 
RASSPSSG*SKKKPPLPAEAAASVAACA 
VCWRVRAPPYVAATTAYTMAVMGWMAFP 
WEVIWTRWRMMSQAIRPLQTWTNSVTAP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 

location 

ccrrespon 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspart!c Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msnleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutarainc, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop coflon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 
















7808 


21709 


A 


7868 


52 


187 


LDFN^RRCLTKLPRLVLNSWAQVILPPW 

PPKLLGLQV 


7809 


21710 


A 


7869 


33 


373 


AGVQWRDLGLLQPPPPGFKRFSCLSLPS 
GWDYRHAPPHLAWFWLVEMGFHHVGQA 
GLKLPT*GDPPTIAYQSAGITGMSHRAR 
PG I SVFHLKLHS FDPFKI I FRPGAETMP 


7810 


21711 


A 


7870 


100 


119 


VFVEIVFPEGLLFSPCSFHLCGFRIHLK 
AEF* FFVETGSMLPRLVSNSWAQVILPP 
WPPWL+LQA 


7811 


21712 


A 


7871 


471 


446 


LLMLPRLVLNSWAQVILPPQPPKVLGL* 
AY I 


7812 


21713 


A 


7872 


449 


33 


FSFFFFFFLGKKDLFFSPRGGGGGANSA 
FFPPPNFGKKGSAPPPPPLKSV*NPaAP 

TGVFFFFCGGGA 


7813 


21714 


A 


7873 


334 


41 


PCPSLFFFFFLRQSLTLWPRLECSGVIS 
AHCNLCLPSSWDYK*HCPYLATGAPGLL 
SQS3LSYVTL* LQESRPLCI PVDFMTFL 
PSLCFQSPNSTTL 


7814 


21715 


A 


7874 


342 


716 


STG I FLFVCF*DGVSTLAQAGVQWPDLG 
SLQPSPPRLK*FSCLSLASSWDCRHVPP 
LLANYLY F* *REGFTVLAMMVSFS * PCD 
PPTSASQSAGITGVSHCAHPGMFFLNMP 
LKSCPSESGAAV 


7815 


21716 


A 


7875 


454 


215 


PPPAECH*PP* *TLVHAVEFKSHHYSLI 
SSTQGHKHCGRPQGPLPRKTRDLCSLVY 
LLTFPSLLSYDPAKSLSAHNTQE 


7816 


21717 


A 


7876 


492 


176 


PCGGRGAGGNWGWPPWHLRASAiSSRWAW 
ALQPHTRSSQPAGPTGPRQ*GAQHLGQQ 
LLCSISHQaiiAAPPQGRAQDLQPflMPEP 
SPRPPLPWAAQ 


7817 




A 


7877 


281 


439 


TWSIDL*HMAYFLFFSFLFFAFLSFSLL 
SFSLLFFAFLFL*VPSFSLFQRQSL 


7818 


21719 


A 


7878 


507 


338 


LLRRLK+EGHLSPGGRGCSKPRLHHCIP 
HWATE*EPFSKTINK*N*S*MKMKMKIN 


7819 


21720 


A 


7879 


55 


298 


PPCLAUFLKFFL+RQSLPMLSRLFSNSW 
PQAVPLPRPPKRNQLYLYFGCCSQLCWP 
QHLCSKQHTFKILFDIEKHFRLREH 


7820 


21721 


A 


7880 


145 


292 


YFGGKQIYFIFCRDRISLC*PVLLTSLV 
SNSWPEAILPSQPPKVLGLQA 


7821 


21722 


A 


7881 


3 


203 


LSLLPRLECSGTFTAH* SFKLLGSRDSP 
TSAS* lAKTTGMRHHAQIiRFFFFVPGGV 
CVCVCGCVYV 


7822 


21723 


A 


7882 


324 


211 


AWWLMPVIPTL*EAKMGGSLEAESLRPV 
WAHNETPFLF 


7823 


21724 




7883 


2 


124 


QHFGRLRQVDHLRLGVHYQRGQHGETPS 

LLK*IWVYIYMT 


7824 


21725 


A 


7884 


367 


141 


KCWDYRREPPRLAQGLEFLFNFCFEIVT 
EPPCPALAYF*NFL*RWSLATLPRLVLN 
SWPQAI LPRWPPKALGLQE 


7825 


21726 


A 


7885 


471 


215 


RGMCVSHI FLCVGAEYNISSARESLLKE 
FTECTERKHTHTRAHTHARARTHTHTHT 
HTHTHTRV*NSRSQPYCRVHACSPAYLG 
G 


7826 


21727 


A 


7886 


462 


204 


RHSS*LGLP KCWD YRHEP PHLAC I S S F * 
RKSDI SLSDELGTKCHKWWPQWKSRAG 
SVGRDPCSGHFIIAGFGQLGA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 

sequence 


Predict- 
otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=GIycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Metlilonine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unl{nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7827 


21728 


A 


7887 


409 


152 


PVFIiALERPNQQDFLRPGVLDQP*QNlIK 
ISFFFFFFKKKKKYIYICMWHAYNPSY 
SGG*GGRIT*AQKFKTSLM3TVRPHLYK 
K 


7828 


21729 


A 


7888 


1 


152 


GVHVCCDCKCICVCLCWGVHECLCVCV 
SK*VHMCHEEGTIFGNISWEGD 


7829 


21730 


A 


7889 


101 


399 


EI INIVFFFYFFLPLLFFFFWGGRFKKQ 
LNPE FLTNSWKKNHKT IGGKHPMGG* KK 
AITNPQKKGRKRSFSF*NKAQPTQGIC1IG 
QNSQGSKAVGALTAH 


7830 


21731 


A 


7890 


478 


142 


KFSKKKKRGVLNFYKKKAFFSKPPKPVF 
KKKKLLKNPFFKAKKQFFFKTPTRPLFR 
EKJ'FPGPPKKKKKRPPGLFKNN*KGPPW 
PRE KKRVKKKKKKVKSHYYMNHMP I S VH 


7831 


21732 


A 


7891 


497 


320 


PPWKFEFEFWFSP*LHKYYPPYVCVCVC 
VCVCVCVFDFRFFKPAPSPQSHFGINLS 
FLL 


7832 


21733 


A 


7892 


217 


23 


IPPFKKFF*EMGSHAVT*ARWQWHNHA 
SLQPETCGLKGFSHLTLPNSWDCRSVPS 
SSAERQSH 


7833 


21734 


A 


7893 


652 


470 


SSSSSSSSSSSSSSSSSNSSSSSQFERK 
TRSAPQVEGKGPNFCSLKAPPPGLSLFS 
CLNF *RSWDPGPPP* PHATPKTRAGGDL 
LCATHHRARHCTPGMFPLFLAL 


7834 


21735 


A 


7894 


25 


178 


GILFFWRDGGTiVIT.HRLVSNFPVSSDSP 
ISASQSWDYRCDPPCPACPLIFW*LLKH 
SF*FFWHDGGLVILHRLVSNFPVSSDSP 
ISASQSWDYRCDPPCPACPLIFW 


7835 


21736 


A 


7895 


104 


451 


ILSFMTTWMEAEDIMLSEISQTQTDRYC 
MILLVESEOWNlUlPGMMaHAYNPSTLGG 
*GERIS*GQEFETSLVNIARHYFYKENF 
KIKSKFKKIVRKSQKKNVRRWLLEARNK 
GRYG 


7836 


21737 


A 


7896 


510 


335 


SCIVSLHHSWDLWHVPPFLftNF*IFRKG 
GVLPCCPGWQIYTPQLQIPFLSITPVAF 

SG 


7837 


21738 


A 


7897 


37 


260 


AGKSQNQGGGACSEPRWRHCTPA*ATER 
ESVSKNKQTKFLCSVPYFESTHFHCTNE 
ATGAWGVEKVALGHAASW 


7838 


21739 


A 


7898 


35 


466 


THTCGGIRKKLTHKREKQK*SQQKPSAP 
KEPEVKNKQKKTKNKPKKKKKTPQIQPT 
KKPFENPNLRVFTFFVTTQNKTPTRGRK 
TFQKVKLFPPHPCLGNKPQTLLPRGGLQ 
KKKKKIPFFYSHPPPLAQKKKATPWQWG 
LGFT 


7839 


21740 


A 


7899 


494 


149 


PARLLSI.HSYEVKSVLPKEEGLLNFFVQ 
S VTAHPTSCIGLEE I * Lrj3 


7840 


21741 


A 


7900 


494 


114 


GAEGGPLSKKKKKNLLPDYHTGPMSLSL 
YLWPFLSNGHTLTLMCAHTHTHTHTHTN 
H*LSRVGLPVLKPGQYQaNGNRWSP*YN 
I KE *PRKTRVMPVTPQLRNDGPMLTFWC 
SSRNSFAMHVLGKL 


7841 


21742 


A 


7901 


19 


454 


VWCNSSVSIHTHTHTHTHTHTHTHTRSW 
VGWHMPIVLSTQEAEAGRLLGSGSLRTQ 
RAMIVPLHSSLGNAaRPCV*HTGSCCVS 
QAGVQWHNHSSLYPQTPGPKQSSSLSLL 
SR*DYRHVPPHPASCVCVRVCVCVCVCV 
CVYGDRGITPH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

otide 

location 

correspon 

last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V^Valine, 
'W=Tryptophan, V=Tyrosine, 
X=TJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7842 


21743 


A 


7902 


3 


393 


DAWDPLNPLVMLFVSPKENSSLDTRKKP 
CRESKKFNTHSRPKSSHQLRKRSSSTPT 
T*KIPNI*LNSSTPSEAKKKRKKKPKKK 
KK* KNSQKKKKPMRKTKKKKGGGLLKKI 
LGGAKFFGGRKEKI FFF 


7843 


21744 


A 


7903 


424 


139 


LFFFFFFFF* G* GFVI FCYYGQVGL * LL 
TSGDTJHISASQSAEIRGVSHHAPPLSLI 
WFISQCVGIIjNHTVHLQYIQCSFVHHF 
LINQEKIDIIK 


7844 


21745 


A 


7904 


1 


181 


SQ * LY* LLAfllvILFI LT* IGGQPGSYPFT 
IIGQGASGLYLTTILILIPTICLIESKI 
LKWA 


7845 


21746 




7905 


3 


281 


HAYAHASATFPVSTCFHGLHDIMGSAFL 
TMCFIRQLIFHFTSKHHFGFQAAA*YWH 
FVDW* LFLYVS lY* *GAVLKEPWGGQS 


7846 


2)747 


A 


7906 


1 


84 


PTRPLTMLPRLVSTPGLKQSSQLSLPKC 
*DYRCEPACTAYSF*LTMI,PRI,VSTPGL 
KQSSQLSLPKC 


7847 


21748 


A 


7907 


3 


392 


DPDNYTLANPLITLPHIKPE*YFLFAYT 
ILRSVPNKLGGRLALLLSILILAIIPIL 
HIS KQQS 1 1 FRPLS QSLY* LLAADLLFL 
T*IGGGPRSYPFILIGQVAFVLYFTTIL 
ILIPTISLIGDRILKWA 


7848 


21749 


A 


7908 


2 


231 


GCVEKGTLSHC*WECKLVQPLWKTWGF 
LKELKIDLPFDPAIPLIiGIYPEENKFTL 
GPLHSVGRSSRKLPGYSPRV 


7849 


21750 


A 


7909 


529 


297 


KIFIGAPVFCPPPHFFFLIPPF*GVEKI 
FFS FFVLLPFGFFFLFLGLFFFFFKLFF 
FFFFFFFKI PFFFS KKVGVGK 


7850 


21751 


A 


7910 


410 


61 


FNNQAMQWFPPFPLKIFFFP*RV* FLE 

GGWSNLSPPPK*GPSPKFPTRVL*GPP* 

GKN*NFGFPG*NWAPHRFF*RAAPFFFF 
C*D3VLLPRLVSNS *AOVIHPPWPPKVL 
GLQA 


7851 


21752 


A 


7911 


512 


314 


ANFC* LFIETGFPHIGQAGLELLTSNHP 
PAIiASQIAGTTGVNLRPGQDFSSYHSEN 
PLVYGSLYIA 


7852 


21753 


A 


7912 


1 


264 


SSTTTLIiLSRT*NKLT*LTPLIPSTLLS 
LRGLPPLTGFLPKWAKKKKKKKKKKKKK 














KGGP 


7853 


21754 . 


A 


7913 


416 


129 


FFFFFFFAPPPKIKTPPFFFKREVYIY* 
ALSVFQSVCKBIiYIHFRIISSQYPC*LG 
FFFFFFFFFFFFFLKGSLDLLSAYCVPG 
SVQGM.HTLSH 


7854 


21755 


A 


7914 


397 


34 


I FSNLPFPPQKKMFFPPPPLKYFFFS FT 
ALFFFLVFFPFFPPPKKIFFFNNPPKFF 
FYPS*KICKFFFFIPVYFrjapPEFFF*AP 














FSVSKQPLFCLKI 


7855 


21756 


A 


7915 


396 


86 


IFLSPRK*GFFSPPPP*KFFFSPKPLIF 
FGGFFPKFPPPKKNFFFQNPPGVFFYPP 
FKKKKFFFPPPIiNFGPPRVFF*RPPPFF 
FFFFFFFFFFFFFFYLWVC 


7856 


21757 


A 


7916 


351 


112 


SFFFFFFFFFFFFFFFFFFFFFFFPFFF 
FFFFFFPr*KYVFIYISLSLIPPCKFLK 
GRECE IHFCDIQHSYSKYLGHFKCSTND 
RQMNK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

sequence 


Predict- 

locntion 
correspon 

last^amino 
acid 

peptide 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutaniic Acid, r=PhenyIaIanine, 
G=GIycine, H=Histidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnl<nown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


7857 


21758 


A 


7917 


445 - 


2 


FFFFFFFFFFFFFFLSYF*LIHYAEGIG 
ASPCYIMLGYMFSSFP CGT I S lAP GFNF 
YRLYFISHASADAWVDPRFLDRYRRG IN 


7858 


21759 


A 


7918 


2 


407 


PPPPQVYFNrWRTIFFGEGLLHIFPPQK 
RFCFYKTYTNYIKTLFIKKKNIFSLflHI 
KMVPPRSIY*TPPPLl)'FFFFJ<'FJ<XFFi<'F 
FFFFFFFFFFFFFFFFFL 


7859 


21760 


A 


7919 


107 


U 


KMWYIYTLECYSALKKKKE ILLFDTTLI 
NLKDIMLSKISQHRKHTHT * FKT 


7860 


21761 


A 


7920 


1 


70 


FFFF*AAESCSRIKCSGTI lAQCNPELP 
GLSDPPTSAr*VAGTKGVCHHTQLRF I F 
IV*LSGRILLQDQVQWHNHSSV 


7861 


21762 


A 


7921 


123 


260 


GIQARKEPGAVAHAYKLSTLGG*GGWIT 
*GQELETSLVNMVKPCL 


7862 


21763 


A 


7922 


184 


399 


LHLLMRVLKKKKKMGGPLYRS PRGAKVN 
PALQRLDSLLIGSRI LSNLG I FGKAPHS 
WGa*AFFAATGLHGLHVIIGSTFLTIGF 
I RQLI FHFTSKHHFGFKADD * YWHFVDV 
A*LFLYDSIY* *GSRILSNLGIFGKAPH 
SWGGTKWDNPPPDMKSS 


7863 


21764 


A 


7923 


2 


407 


GRVGM I * EAFAS KR KVL IVEEPSINLE* 
LYGCPPHPLHTSNKKPPHTKKKQKKKKK 
KKKKKKKKKKKKKKKGGGLLKKLWGGPK 
NTGGKKKNFFFFWGGKKKNLGDFLKKNI, 
FWGGGKIGPTPPQKKNPLGEKKF 


7864 


21765 


A 


7924 


1 


398 


PTRPPTRPTSSRSRAPPFFFFFFFFFFF 














FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFVSAALFFGKKKNFFFFF*FFRG 














*GFYFFPPLYFFICGGGGVFFFFFFAGW 














NIFFIPPKKKGIFKTLRGKGG 


7865 


21766 




7925 


2 


335 


NLTFFPQHFLGLSGMPERYSDYPnAYTT 
♦NILSSVGSFISLTAGILI IFMI *EAFA 
SKRKVLIVEEPSINLE*LyGCPPPYHTF 
















7866 


21767 


A 


7926 


■ 


328 


RTRGRTRGESNPSLRIQNSPC3ILSHPIL 

QAKI*PGPKKKNLFFKGGLKKTPLGNFE 
KKPFFGGGKKRKKPPQKKKTLKEKKKF 














7867 


21768 


A 


7927 


27 


226 


LSR*KM*NKSHIYSFEKKKKKKKKKKKK 














KKKKKKKKKKKKKKQKKENRPKNGDSEN 














GHPKSKIFWL 


7868 


21769 


A 


7928 


398 


2 


GGGGGPGGRGRP* *NPHKKEGRDPPHPK 
KKFFPRRRKNRGGGGREKSPPQKKKAPQ 
KKPGRGFKRAPQKKKKTPPPPRKKGPPP 
KNFKKGAPPFFFFFFFFFFFFFFFFFFF 
FFLGIIiARTTTTTNDMKNHR 


7869 


21770 


A 


7929 


3 


156 


HASAHASAPVFV* SVLMTAVLUU^SLPV 
IiAAGVTILLADRILNTTFFDPA 


7870 


21771 




7930 


405 


249 


ASAHASAPLFV* SVLITAVLLVLSLPVL 
AAGITILLTDRNLNTTFFDPA 


7871 


21772 


A 


7931 


921 . 


699 


DHPGQHSEILSLQKMKTMSQCGGAHL*S 
QVLRRLRREDCLSPGSQSCSEL* SHHCT 
LACETE * DLVFQKKKKF 


7872 


21773 


A 


7932 


423 


259 


HDLGSLQPLPPGFR* SSYLCLPSSWDYR 
CE PLRPAQRRGI LMAATSRI LTRS KGW 


7873 


21774 


A 


7933 


395 


35 


PRVKRFS CFWLPSKWGPRPRAPWPGYFC 
FFC*NGGSPFLPGGSWTLAPNYSPPPAP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQBD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, 0=Aspartie Acid, 
E=Glutamie Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V^^possible 
nucleotide insertion 














QNGLVQALRPPQPPTFPFLI FPTKAPWS 
WPFFGGKKKKKLSPKGHSERHLHHKYIR 

LLSQHGGM 


7874 


21775 


A 


7934 


1 


118 


MF* IFCRDWGLALSPRQLSNSWPQVILP 
PQPPKVLGLQA 


7875 


21775 


A 


7935 


322 


383 


KKKTGLIKISTLCFVKNTFKKTKWQAPE 
RKKKVP I FLS YKRPVFKI YKKLFPFCRK 
KIKKKIPPPGFTSFSCLSLPSSWDYRHP 
PPCPVjSIFFCIYNKNGVSPG*PG 


7876 


21777 


A 


7936 


3 


131 


GFHRVGQVGL*LLTLGDPPASASQSAG1 
AGLSHRSQPLTWSL 


7877 


21778 


A 


7937 


334 


146 


HPQGLNI *NKMVSSSGAKHSSRLSLPKW 
WDCRQEPPCPVIMPLKRGINHALYSPSR 
KLLHFL 


7878 


21779 


A 


7938 


2 


371 


SEPGAGYNLLVGRF*SPSEKCSIGVGVT 
RFSRCCLSPLSLTGKGNSLTPCASRVRQ 
CliALLMLVQGVLHPLSCTHCLaLPSEMW 
PVPQMKMQKSPTFCIAnaGSCRPELFLF 
GHLGSSPQSKL 


7879 


21780 


A 


7939 


2 


95 


RIiNI,GGGGCCEP*SCHCTPAWVTKRNSl 
SKK 


7880 


21781 


A 


7940 


1 


191 


PTRPSQTPGLK* FSHLSLPKSWDYRCEP 
PCIAVILSIPQNSSMPSNIFAASYSS*F 


7881 


21782 


A 


7941 


127 


383 


SKDCVRIVLLHAQAKAGSYRTVNWCRPG 
AVAHTC3!IHSTLGSRGGR1A*GHEFKTSL 
GNIVILPLYKKNKKKKKRQfiELKEPNLT 

P 


7882 


21783 


A 


7942 


235 


3 


KNILPGYFCFFLREKPPPPGLESLFSKK 
KKKKK]:iaRHGVAPLLGRMR*EDRi:iRPGV 
QGC3SIELWWCHCTPAWATEQNP 


7883 


21784 


A ■ 


7943 


3 


225 


EHGSLYPPTPGLKQSSYLSLLSR*DyR* 
AAPCPANFFIFNFL*REDLTMLLRLVSN 
SWTQAVLLPPKVLGLLV 


7884 


21785 


A 


7944 


2 


232 


TLLPDNLSQTIYPNKKKKKKKKKKKKKK 
KRGGPFKKKKF*TRGGGKKNFFFRAPKI 
PFGGRVLKKGGGKKPGEPTN 


7885 


21786 


A 


7945 


394 


41 


WCRWLETWGAG6SGAVPPPPFFPPHSSS 
WDIALQGSGASSPFSPGAVAHACNPSTL 
GGQGGWlT*VQEFETSLftNMMKPHLFSK 
KGPVTAVAHPAKSHRAMWCTPTHMHTQH 
VIHVL 


7886 


21787 


A 


7946 


405 


100 


FFFSPRFSSSLFLFLSSFFFFFFFFFFF 
FFFFFSTGLFVYWSPTH*SLPVPRIFFF 
SYLWQTEVETMLGTE 


7887 


21788 


A 


7947 


391 


212 


KEMQIMPAMR*HLTPVRMVITKKSKAKC 
WQGCEEKGTLAHSWWKCKGTAETGKLAG 

F 


7888 


21789 


A 


7948 


106 


301 


LQLPITQLVWHSIYFMKKIRYRLGWAH 
TYNPSTLGGQGGR IA*GQVFKTSLS]S!TA 
RQKKKKKKG 


7889 


21790 


A 


7949 


342 


208 


GLGAGTHTYNPS TLDN* GGWITSGQEFD 
TSLANMVKHHWQTGQF 


7890 


21791 


A 


7950 


391 


21 


RQGFPMLPRLISNS*PKGSAHLSLPKCW 
EYRSEPPSLATWYVTLSRHFCSTLLLGQ 
GRSKEHMKWESRLEGLLAKVSSLRSSYC 
SSLRILGLE ILLSPLLFFLETGSGSVSQ 
AGVQPLCPGLK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=A]anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7891 


21792 


A 


7951 


146 


355 


ENIRQTQNEDWIKSSNETVFCKNIIIS 
LGWAHACNP SPL* GQGRRI T * AME F * T 
SLDNIARPCLYQK 


7892 


21793 


A 


7952 


289 


327 


KGIPPLPPRLKDKGPFFFFFFLKRESLF 
LP* FWKWGCRGPPLFFFFFLRDSVSLLP 
RLVSNSWP*VILLPQPPKVLRLQA 


7893 


21794 


A 


7953 


861 


586 


DRVRSVAQARLQWHDLGSLQPPPPGF*Q 
FSCLSLLSSWDyRRMHTWPFVF*WRWGF 
TCVAQAGLDLLSSDNPPALASQSAHI IG 
ESCCVLLK 


7894 


21795 


A 


7954 


402 


52 


PRKVS IS* PHDLPASASQNAGITGVSHR 
TRPS FNPSYIETAITETRKSLQPSPGMN 
*SLPSGMVGFKPHSFIHSANFCEALAVQ 
QTLGEVPGHAPLSPPRQEPRVCPKKVCL 
KTALK 


7895 


21796 


A 


7955 


259 


3 


YNTIWLSFPIIiL*TFLKHLWIjaAVAEAF 
NPSTLGGGGGRIA* GQEFKTSLGNIEHC 
RLYKNLKLFLESGLFLMRWRGAKRVRTR 
G 


7896 


21797 




7956 


171 


386 


LPIRLFSLVFTTIEKSRPGMVAHTCNLS 
TLEG*SGM1T*AQEFETSLGNMAKLSLP 
KIQESASLITRSQNK 


7897 


21798 




7957 


397 


283 


PQPLPPGLKQFSCLSLPKC*DYRRDSAR 
PAKKNPTFNK 


7898 


21799 


A 


795S 


169 


324 


GTHRVEMCSQWIiGWftHACDPS PLGGQV 
RKIT*AQEFGTSLGNIGRPCLYIK 


7899 


21800 


A 


7959 


290 


397 


DCSSCCSPLKGR*RG*AQWLTPVIPALW 
EAEAGGS 


7900 


21801 


A 


7960 


496 


298 


LSKTGSLYVAQSWYWUTGTIMVHDSL 
KLLCSSSPPVSAS*VAGTTGAHRCCEVI, 
TTFQPDVYRL 


7901 


21802 


A 


7961 


404 


174 


PGPQKGSFF*KKKKKKKEIDHILSHKflISr 
LNKC*RIKII*TMLSDQNEIKLEIIIKR 
QLLKETLFLEMKKYTIRKLS 


7902 


21803 


A 


7962 


25 


99 


PGLFLSS *HQVILLPWPPKVLGLQL 


7903 


21804 


A 


7963 


436 


106 


GGGKSPPPGNWAPPGNPGEPLFF*KKKK 
KNPGGGGGRFFPPPPGGRKKKGVYPGGG 
GPYLTNFPPPPPPWGKKKNFFFKKKKKK 
KKKKGRIAI CYI ILFITS INFKKAFI 


7904 


21805 


A 


7964 


212 


395 


ERERKERDRDGEREREREEREE6RKKHG 
GR*GGREGERHRPRGflEDPDRaGRATDK 
RTGAL 


7905 


21806 


A 


7965 


344 


345 


FSDCYKRSSCASWTWYTRLAHHQPPARP 
GAAPS PRKRWVDFAVNRNFPSRRHPCAT 
PRPAPGATGRPSRVPKSTAFTQMLTQPR 
DPHLPLPIPPLSQPPPSP*PAPLVMR 


7906 


21807 


A 


7966 


397 


298 


RGFTMLPRLVLNSRTQVI*LPWPPTVLG 
SLSIF 


7907 


21808 


A 


7967 


92 


268 


IKMSLCPNFFFFFKKQNVGPRAGAYPCN 
PTILGGQGRKIT*GQDLKTTLV1TMGKPH 
FY 


7908 ! 21809 

1 


A 


7968 


257 


107 


RTHTHTHTHTHTHTHTHTHTHTCENSKT 
TKNIANNCS*LESQRNNGKCTYAKF 


7909 [ 21810 


A 


7969 


413 


12 


PPPPPLLTPWPCFFPQIL+PVGHSAPPL 
FK*LKIPRPFFFPPPPKGIGAPPPKIFF 
PPGFSLPPFVKLPPGEKILFPPPKKKKY 
PPPPPQS*SF*PPPPPPFFFFFFSTSAK 
KSKKNFLFIFSPFFNPKYYNT 
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S£QID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 

peptide 


Predict- 

otide 
location 
correspon 
ding to 
Isist amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7910 


21811 


A 


7970 


399 


212 


TGVQWSNHSSLQSQTPGLKRLFLLILLS 
IWDYGCMPPyLSFKN*FCTKSGSCyvaE 

ADVNIL 


7911 


21812 


A 


7971 


584 


424 


RPRRENRLNPGGGGCSEPHSHYCTPAWV 
TE*NSVSNR1IKTNQNPIFFSCKQFD 


7912 


21813 


A 


7972 


103 


1 


PSMPQLSHL*NGILGRVRWLTPVIPALW 
EAETGRS 


7913 


21814 


A 


7973 


363 


28 


SSSLGGHPQLWFHTLKLRPVTVSYA*NK 
LCDKIEKKLFFTIALHNKFLRKKLTRDV 
KILFNNK*IKKEGPNEF*NILCLWIERI 
NINKVPYPSKVMYRSNTLPIEVAVFKSQ 


7914 


21815 




7974 


404 


250 


FFFFW*R*GFTMLPRLVLNSWTQAIHPP 
ALASQSAGITGMSHHAQPLSIIF 


7915 


21816 




7975 


2 


60 


FSCLGLPFCVIDYRHAPPHLAHFCRDGVL 
NY * PQVrHLPQLPKVLGL*ACTTTPG 


7916 


21817 




7976 


325 


81 


KTIHSLLFGQFFLLQPLLPSPIPHAAPP 
TPHLIFLLLLFCF* FFK*RQGKQGLTTL 
PKLISNSYSQVTLLPQPPKVLGFQA 


7917 


21818 


A 


7977 


406 


282 


QWRDLGSLQPPTPWFK*FSSLSLPNSWE 
TKAGRSREPGGGD 


7918 


21819 


A 


7978 


382 


221 


DCIISASYLQKNFFFFLGDKVLLCHPGW 
SAVA*S*LTVTSKRVHFYSSEISLY 


7919 


21820 


A 


7979 


423 


29 


FLW*RRGFTSLPRLVSNFWAQGICLPWL 
PKVLGLQA 


7920 


21821 


A 


7980 


3 


238 


SIiAFFVETGSHFVA*AGLELLSSSNSPA 
LASQSaElTGVSHHTQPE*GYSHM*PQY 

PYQLRFNSDT 1 ILSNVLFVPNF 




21822 A 


7981 


3. 


128 


WLSVSPYTFBLSLLVGYEFVIHDVI-*TL 
I F W\^/WSLFCFLWFFLVCMVFWFFC 
CCFLSVFCI 


7922 


21823 


A 


7982 




86 


NLTMLPRLVSNSWAQAICLSWPPKVLGL 
*AQAI CLSWP PKVLGL 


7923 


21824 


A 


7983 


1 


137 


RPANFCIFFVETEFCHVAQAGLELLGSG 
DPPTSASQGAGTTGMSH*D*PANFCIFF 
VETEFCHVAQAGLELLGSGDPPTSASQG 
AGTTGMSH 


7924 


21825 


A 


7984 


2 


277 


PRVRSVQRVIYQYV*NLQIHVPFDSTNL 
GIYLTDILPHVWNDNMYKVFAAVSFVIA 
KY*NPCLTRRQCI!IKLWPIHAMEYYVTIK 
KNEDGRA 


7925 


21826 


A 


7985 


211 


42 


GPQKWPRGPGENLGGFFFFFFFCRDDGL 
TMLPRLALNS * VHAILPPQPRKVPGLWV 


7926 


21827 • 


A 


7986 


409 


183 


LLRLK*SSHLSLPKWWDYRCEPPCPAIF 
FKRKQ ILKES FPGENTCCHLFNS LEVLK 
KKERTCYNFPNTLAVLKNM 


7927 


21828 


A 


7987 


2 


194 


RFFETQSRLLQPPPPQLK*SSHLSLQSS 
WDYRHVPPCLANFFLFWYFLQGQGFAFL 

LKLKFLFK 


7928 


21829 


A 


7988 


483 


209 


SSPSLGNFCIPSSGGVSPCWPGWFQTPD 
LTIJ\rF*AQVILLPQLPE*LGLQADRQVP 


7929 


21830 


A 


7989 


164 


29 


KFWFWLGTVAHACNPS ILGSQGRRIA*A 
QELEPSLGNEGRPCLYK 


7930 


21831 


A 


7990 


3 


474 


PTRAPLRAPFETIVQEEEFSKHHFGFEA 
AA* YWHFVDW*LFLYVS lY* *GSPPPP 
PKILIKKNGAETKKFFF 


7931 


21832 


A 


7991 


469 


323 


VETGLHHVAQDGLELPTSGDSPASASQN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleuclne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptopiian, Y=Tyrosin6, 
X=llnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IGITGVSHHA*PDFTFLEQF 


7932 


21833 


A 


7992 


469 


13 


KGDTVSSSDSSPSCGGLFPVGAS*LLCL 
FTQALAMVGARRPFSMLPCSLISDCCAS 
NERGSVGMGPSKPGVGYSLLVCRWLRPL 
EKRTIKVGVTRFSRCJRLSQLPLARKGNS 
LTPCVFWVRHCFGSHSVGCTHCPAPAVE 
QAPVK 


7933 


21834 


A 


7993 


15 


439 


lASGRPFSIKFFLVAlTFLLFDLEIALL 
LPLP*ALQTTNLPLIVMSSLLLIIILAL 
SLAYE*LQKGLD*TEPNKKKKNKKKKQE 
KKKKKKKTKKTKGGGLLKKILGGAPILW 
GGKNMFFFFWGGEKKTPGGFLGENLFLG 


7934 


21835 


A 


7994 


392 


186 


YFYI INNFFFFKKI FI ♦ * I 1 FFFLKKFL 
*YSPKKVFFLFF*IFFFFFPSSSLFLFF 
SSSSPFFFFFFFFFFFFFFFFFLRG 


7935 


21836 


A 


7995 


11 


386 


TWEVEVAVS*DHAIALQPGQQE*NAISK 
KKMTGRMFIEVSFIIASSWKQLKCS*TG 
E *MKSLWYIHI I EY YLTIKEWKVLEHEV 
SRLTLIDTGRERSKTKEYTLYNPIYTNL 
RIAKLSGCGGSHL 


7936 


21837 




7996 


3 


186 


DSFCF*RQSCS I TQAGVQWGDRGSL* PQ 
SPGLKRSSCFSLPKHWDHRDEPLHPAGV 
AVLN 


7937 


21838 


A 


7997 


316 


649 


RDHSFFFEMESRSVAQA3VQWHDLQSLQ 
ALSSGFMPFSCLSLPSSWDYRHPPPCPA 
NFLYFLVEMGFQHVGQNGLDLLTS *STR 
LGLPKWLGITGVSHWHEaQDGGFTIHVG 


7938 


21839 


A 


7998 


468 


336 


RR6FTMIiRRMVSIS*PSDMPASASQSAG 
TTGVNHHIEIHRSLL 


7939 


21840 


A 


7999 


464 


283 


TLCGGIRPITJIMKGNLLYSKSIGIiNVSr. 
LPKNPRRNIQNNV* PNTWALQPSHVDI 


7940 


21841 


A 


8000 




T87 


RQQRLSLQKFLLPFVQLCPAHRGGVYRG 
RQA3LSCGGLHPVRVSQPLCLPT*VSAM 

AAAPHPASLLPCSLISDCCASSEQGSVG 

VGPSKPGAGYWLLVCHLLRLLEKRSIRV 
GVSQFSRCHLSWLPLARKGNSRTPCTFR 
VRRCLAL 


7941 


21842 


A 


8001 


295 


1 


TQPWGTKRKLPLKKKKKNSGDREWRMLC 
NSVTILNATTMYTSEWLHPGTVTHTCNP 
KTSGG*GGErA*TQEFKASLGNIA*SWIi 
YKKILKISPLWWQT 


7942 


21843 




8002 


144 


317 


ELFYLKEMV*RPGPVAYACFPSTLQDRQ 
GRIT*AQEFHTDLGNIAHPCVYLRKKKK 


7943 


21844 


A 


8003 


69 


291 


HCVWTMIFIVIKYFYLNPQNNSMRYYYS 
HFMDKETDVQSD * MTC* RLGAVAHACNP 
STLGGRIA*GQEFKTSL 


7944 


21845 


A 


8004 


3 


176 


IVAHSSLTPGLKQFSCFSLPSS*VYRKM 
LPRIiaiIY*FFGTNRVSLSCQGWS *TLAS 
LLGSSSSPASACQVAKSTGTCYHAWLI I 
DFLVQTGSRCLAKAGHELCSCL 


7945 


21846 


A 


8005 


419 


178 


NPPPKKLFLSSSSSSPLPHLNWGTPGFF 
PPPPF*NPPPEFNFGAPKKKKKLSPPRA 
EKLVPFKGPPPFFFFFFF 


7946 


21847 


A 


8006 


429 


225 


QPLPPRFK* FFCLSLPRR*GHRHRPPHP 
FWFFFF*YFLRWSFVLVAQAGVQCMDLS 
lYLNEFWLLDF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7947 


21848 


A 


8007 


756 


224 


PPRHQPQHVPPPLLHFSTPSPAPPKSHA 
AAPNSNPPQAHAPPAPPAPRAHSPSPPP 
RGATP P FP PPPFSFLAAVGRAP IFPREP 
LLPSSSSPPPPP*IGEPPGFSPPPLLKT 
RPPK*N*GPPKKKNFFPPPRPKNWFL*K 
GPPPFFFFPFFFLRQSFTLV2^GVHKT 
RLHLKKKSN 


7948 


21849 


A 


8008 


170 


559 


SSHLSPQSSWDYRSGMVAHTCNPSMLGG 
QDGRLT*AQKFQTSLGN1VRCHFYFKKY 
FKNKIFFYSPFIOTNPKKKKKKGGAVLK 
NQSLRPRAGKVLVFLWGHLIQFRGPRFK 
NGGTGKPGGLPNLLAPR 


7949 


21850 


A 


8009 




662 


RTRRTRSWWYMPVLPLSHiTGEAGESLGP 
*KWMVD*APIAPLHSTLGDKMTLPDIKK 
KNGETVANTSPLSTTAKPAKLFLKAKKD 
QLMRDLFLPKTPKKPQMSTGLDARSKEW 
LKI IWRRHGIWPLKNIGPTEDVQA3AHG 
GVEENMTSDIEIPEAKHDHRPTEDVQVS 
AHGGVEENITSDIEISEAKHDHHLVEDL 
SESLSVCLEDFMTIGSQWKPICLS 


7950 


21851 


A 


8010 


377 


2 


NFFFFFPPFWGFFFFLGPGFFKVFF*IF 
PFFFPRNPPFF* FFLGVPLFFKK* IFFF 
PFSIRFCFFPKKFFFFLILFFFF*FPFF 
FFFVYVFDFFFSFILYIFFFVYFSLPFT 
VLIISLIYHPAPA 


7951 


21852 




8011 


1 


363 


PTRPLF**RGCFTMLPRLILNSWPQVIC 
PPRPPKVLGLQT 


7952 


21853 


A 


8012 


278 


39 


FXXXFFLSPPXFLI FFFFFFFFFFNFFF 
FFFFPPFFFFFFFFFFFFFFFFFFFFFF 
FH*DQ*FSHFDFTPFTFTLISNII 


7953 


21854 


A 


8013 


79 


412 


MNECILGKGGKDGQFFEQCENLGLC*FL 
KVP*DSDIWLGAVaHAYKPSTLEGQGER 
ST'AQEFKTRLGNMTRPCLYQKHTHKKK 
TKKKPHPGLGWGAWGPTYLGGVGTKIT 


7954 


21855 


A 


8014 


277 


19 


FGNIHNFFLIGSRSVTQAGGQWCNHSSL 
*PQTPVLKQSSHLSLPSSWEIEAAVSQV 
RITALQPGRQS KTLPGNSGPVLRADARR 
SI 


7955 


21856 


A 


8015 


1 


130 


ARIVSrS*PCDLPASASQSAGITGISHH 
TQLamJTLKASPTLF 


7956 


21857 


A 


8016 


4 


417 


RLMPPHLANFLFF*RH6FrYVAQVGLKL 
LGSNDPSASTSQSTGITDVSHCTQPHLL 
KSSHSTFPLKTLDSMATNCSWDKVHYLP 
TACRRLLPCPLPSACFPSLSQVPSVFTC 
CVPATLTITLLPLLRAFEHVTLCPIiF 


7957 


21858 


A 


8017 


388 


158 


CVTWASNClNWICFPLS*INSHLa!IEII 
VLI *GEI YNVCIS I YLCIYMCVCIYISV 
CVCVCIYVH*TPYLIHKCIiI 


7958 


21859 


A 


8018 


154 


382 


GIKRPGAVAmiCafPSTLQG*GKWIG*AQ 
EFKTS PGNMVKPQIYLKKKKKKGGAVLK 
DPSLEPEAGKVIVFFWGPLI 


7959 


21860 


A 


8019 


396 


237 


FFFFFFW*G*GFTMLPELVSNSWAQVIH 
LPWPPKALGLQVRTMVPGLELANF 


7960 


21861 


A 


8020 


412 


168 


GWLI,GSPLPGNVCKNKPFILF*KKK*SL 
AVLPKLVSIIFWP*MILPPWPPKI *GVQA 
RAPPPSQILKKKKSSHFYNSHVNSL 


7961 


21862 


A 


8021 


1 


124 


TRFLHVSQAGL*LLTSGDLPASASQSAD 
YIFIISPAKIIFL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
Ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PlienylaIauine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leui;ine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=UnknDwn, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


7962 


21863 


A 


8022 


3 


200 


HLSPYFYFFKDRVALCHPGWSAWQ*PI 
VTSIFWSSC*KLTSWRRWRPSWLTR*N 
PVSTKNTKN 


7963 


21864 


A 


8023 


3 


132 


FLFFFQF*FMRLGLAMLPMLGMNSWA*V 
ILLPQPPEWLGLQA 


7964 


21865 


A 


8024 


3 


187 


VGQAGL3PPTSGDLPASASQSABITRMS 
HRAQTK*DFIGSFSEEFS 


7965 


21866 


A 


8025 


405 


231 


SDKWIKKMRYIHTTEYY*ALKRKEILTH 
ATTWLNLENlr.LSElSQSPiniRYYVIPL 
T 


7966 


21867 


A 


8026 


175 


382 


GKRI FPPYPPQEGGKTGPPQTPGLFFFF 
LKKK*PGGLGGARFSPLLGGVGRKNSFT 
LKGGGAINLKGPPSLPPGGKRWAPPQKK 
KKKGQAWWLMPVI PALWEAEAGRS 


7967 


21868 


A 


8027 


83 


374 


GDQVWWLMHACNPSTLGGQCGRITWDRE 
FETSLANIFFPISTKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKAS SGGGVS * K 
KNFSPRAGKEIFF 


7968 


21869 


A 


8028 


382 


148 


SPHCSLDLVGLKQSSmSLPST*DHRCV 
PPRLAIFSFAFLFFWEATTTXTKATMLP 
RLVLNS *AQAILLPQPLICVLRL 


7969 


21870 


A 


8029 


88 


208 


SGSV*ENNPLSWAWWLQSI I PATQEAE V 
GRLLEPRSSRSA 


7970 


21871 


A 


8030 




283 


NKSRIKKRIKMTEKMKEKIEMMiCLMirQL 
PNGEHHAMIQ^MKIKIEMK6RMKIKE«DK 
ESDKDDRKDEREDRDDETDEPTPERRTS 
RDDSDDDEDRDEGKNEDKRKDDSKDDDE 
ADEDTDQDDYDPMDAHKAADEDDDEDED 

DEREDRDDETDEPTPERRTSRDDSDDDE 

DRDEGKWEDKRKDDSKDDDEADEDTDQD 

DYDPMDJ\RKAAr)EDDDEDEDEVAE 














7971 


21872 




8031 




139 


I,QTT,LPLIAF**I,LASLAKLALPPTINL 
LGELSAQFAVIGMYILY 


7972 


21873 


A 


8032 


80 


324 


yLHFYFFVR*KLMGLDTVAHTVNPSTLG 
DPERRIA*A*EFKTSLGNMVKPCIYKKK 
















7973 


21874 




8033 


299 


158 


GNQPSASflHTCNPSTLVGQGGRII*AQE 
FKflSLGNMVKHYLYQKHKN 


7974 


21875 


A 


8034 


416 


60 


AQWLTSIIPPL*EflEVGGFFEPRSLSPA 
WATE 


7975 


21876 


A 


8035 


398 


271 


FYFL*DEVSLLPRLVSNSWTPAILPPWP 
PKVPGLRHEPPRPA 


7976 


21877 


A 


8036 


385 


T51 


FLYFL*RLGLTLLPQLVSNISSSDPPT* 
ASESAGITGVSNIIAIILRICILTRLPGNS 
SH*NVRRAGLGDTQSSALIPPG 


7977 


21878 


A 


8037 


3 


406 


PASAS*VAGITGTHHHAQLIFVFLVETG 
FLFFFFWAVFLFFKTKNGPPGGQLFFFA 
APPALGGGDFGGQGATFSPKGLGVLGET 
RGGAPAPKKLGAKKEPSHLLGGGAQNLP 
KPRGQKGFGFYFYFLARDFFLG 


7978 


21879 


A 


8038 


428 


20 


IiANQ*WKPPSNPWETKISPKKKKGPPGQ 
GPPPLTPPLWEAPKARKLGS PWLPR*NP 
LFPQKPKKFFGFGGGPPYSPLPQKLSPK 
NGVTPEVGPSHKPKFPPSPPPLGQK*TP 
FPQKKKPKRLKRGMFLHGSTFQSL 


7979 


21880 




8039 


3 


194 


QNKISQVWWYTPVILATWEAEGGESVEP 
GRQRLW*AKTAPLHYSQDEKYLKASVDS 
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SEQID 
NO: of 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od" 


SEQID 
NO: in 
0SSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=:Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


sequence 














YCLISCH 


7980 


21881 


A 


8040 


415 


96 


LEPL*KYFPILDAIKNICDSWEEVKIST 
LTGV*KKLIPMLMDNFEAFKTSMEKVTA 
DWE I GELELE VE PADOTELLQSCAKLE 
GMRSCFLWMSKESGFLRWIYSW 


7981 


23882 


A 


S041 


413 


254 


PGFKQFS CLGFPKR *DYRHKPPCPAS KY 
FKTNYLRNIYRSLILKH 


7982 


21883 


A 


8042 


409 


102 


GFFPFGPFFF*PPPPGVFPPPPPPNFWF 
*GFGPPPPPLF*FFFRGGPKKIWFFPFF 
LWPPLFFFFFPFWPKKTPPKKARSLKIF 
FFLGPKKIPNPFFFFFFF 


7983 


21884 


A 


8043 


414 


153 


GRGWATNFYFFLIILK*FFFIFKKFFFF 
HLFFFFFFFFFFFFFFFFFPFFFFYKIF 
LFTLVLTCPQTSCREAEPFDHKVCSVLK 

ML 


7984 


21885 




8044 


401 


292 


NYLPPQ * KECKYYPQGPFRKP IKKSPPPC 
FFFFFFFFPSFPPFLNIPPPQKGKFPPK 
KYFFNPPLFPPLFFLKPPPLFFF*GPKK 

KNLIFHPPSKKICPFKRGPP+FFFFFFF 
F 


7985 


21886 




8045 


370 


14 


TPPPTKTPFFFLYPPPWREKEEGFF*KA 
NMSPPVWKKKIPPLPPYIYL*KGGSRYL 
AKLSLNLKGSIYLSFCFPPLPPSLPLSL 
FERWGLTILPKLVLNSWAQMILLPMLPK 
VLGLKV 


7986 


21887 


A 


8045 


395 


268 


PFFFF*ETGSCSIVQVGVQWLFTGTILV 
LISVGVLTCSISDL 


7987 


21888 


A 


8047 


379 


98 


GPAHNSPPLGGESETPLGYP6KPRFF*K 
NKKINPARGPGPWPPPPQG*AGKRPLP 
PRRGFQLTQNGPPPPPPGGKKKPPFQKK 
KKKREPAIC 


7988 


21889 


A 


8048 


369 


177 


RNTRGGPTFFFFFFFFKx-NLABLSRLEC 
HGTISAHCNFRLLGSSD3LPLTSNDIGQ 
INKSLFA 


7989 


21890 




8049 


1 


128 


ALGLVAHAYNPSTLGSQRGKIA* GREFG 
TSLGHRARPCLYKK 


7990 


21891 




8050 


166 


163 


PVHQGEQTTQDKCLR*STHLGLPKC*DY 
RREPPCQASKLVPTGILADSKHSKCQTM 
DSCESNQLYKQR 


7991 


21892 


A 


8051 


402 


245 


YFKSVCQAQWIiNTYNPSTLGG*DGWITC 
GQEFRLPWPSGITGASHHAQPHFF 


7992 


21893 


A 


8052 


404 


268 


QQLWRLRQRDCLNLGCGGCSEPRSCYCT 
PA*ATEPDPVSNKQINK 


7993 


21894 


A 


8053 


1 


154 


GFLRVGQAGLELLTLRDLPTSASHSAGI 
AGVIYHAWPGEAyLFSG*WLRFT 


7994 


21895 


A 


8054 


406 


150 


PKKKKNPCPPPEVKFDSLKRAPLFFFFL 
EESCFVTQAGLECSGAILAHCNLCLVGS 
SDS PLS L*RWVGKSHFI FTSLLS VS PRS 




21896 




8055 




125 


TAVILIIFMI*EAFASKRKVLIVEEPSI 
NLE*LYGCPPPTH 


7996 


21897 


A 


8056 


400 


260 


HMSPHLAFFFSFLFL* RLDMLPRLVSNS 
WPQAFLQLLLPQVLGLQV 


7997 


21898 


A 


8057 


396 


263 


FLQPGTVAHACNPSTFRGGLIT*AQEFK 
TSLGNKVEPOiYKKQY 


7998 


21899 




8058 


402 


211 


FS CWWCTKPKaAG*FPAPWGPPFLNWPP 
GAGTNFPGPPRSGGFGSPPPPWPGPRRP 
CPNGIFFC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possibfe 
nucleotide insertion 


7999 


21900 


A 


8059 




279 


VLLYHPGWSAWQSWLMQ*TPSNADIWR 
lAEGFKRSSHFSLLSSWDHRHVTPCLAN 
FFLIL* IWGLAMLSRLLPLRLRKPVLLD 
*APSPCGLI 


8000 


21901 




8060 


408 


315 


SLTMLPRIVFNS*AQAILLPWPPKVLGL 
QV 


8001 


21902 


A 


8061 


198 


2 


RIQHVBSNCHl,HVEGGWAQWLTPSTLGG 
RGRM I S * AQE FETRLGNTVRPC F YKNL K 
SQPGMVWI 


8002 


21903 


A 


8062 


3 


294 


GLSPTAHPITAFTSQPLPSGLSVCRLFL 
WAI1CEQCPHLPIRPCCPRIIGPECALYT 
TSCLLHQLKKREGF*KKKKKKKKKKKKK 
KKKIF*WGGGKK 


8003 


21904 




8063 


197 


184 


IHDFLFFF*GRVSLCCPGWSAAAQSRLT 
AL*PGELKETFYL* IRRSWDHKHAPPLP 
APF*LFWVTLIITPSTHEVSSAVCSLLY 
VKSEQKCQ 


8004 


21905 


A 


S064 




373 


TSGSliAKRPADCLSAQPTTPSSLCGCSP 
VRVCITVPRCTHARAHTHTHTHTHTHTH 
SNHG * APGAL Y I QELQRGKVSHV P E KE K 
SDPETA*TKGG*HVTNPHGRQRSGEQEE 
HRICVGRCCSG 


8005 


21906 




8065 


532 


106 


ENKPIHNLSINKYIGKILRDKALLYCPG 
WRAVMQS*LPVASNSWAPAILLCPGFLS 
TWDYKRVPPQLVKIFSKKKKNKPTNILR 
PKCHTRNQDIYKRGKVGFLCSYCPPALL 
LLLS LLLLLLLLFS FSTRDSASGAGCSS 
LP 


8006 


21907 


A 


8066 


411 


130 


PHAQLI FLRR *S LTVFPRWSNSMPQAI 
LPPWPPKVLGPQARAMVPSPKLVLESIL 
RIPTSVIFFPLNLCSNIKIAEIKEGCSA 
AFLFNQRFAN 


8007 


21908 


A 


8067 


3 


32 


DAWADAWGSLTDEWVEKIWYIHAMQYYS 
S FE KRE I L * YVRTCMNLEDTMVSE IRQS 
QRDDYCMIPHI*GRTRGVH 


8008 


21909 


A 


8068 


410 


116 


STFNIQTWPGTVAHTYNPNTLEGQftGRI 
T*GQELETSIiGNITRPNFIJ:.FCQACPGQ 
QLSTVRMCGSLHYFVLPLRPIiLKCIiDNR 
KSPFYHPVILPCYE 


8009 


21910 


A 


8069 


158 


289 


DLCSSLRNlVRPQLYKK*KISQVRKKKK " 
KKKKKKKKKKKKKKKKKKKK 


8010 


21911 


A 


8070 


388 


1 


APTPPCFLFKPFVLGGPPFGKAFFQKFP 
GGGFFPSISGPKPPPPRNPSLISKGKRV 
PSPQSPIMVFPGAF*RPPPFPLMVG*K1 
KIFLPPPPPFKTPPALSPFFFFEAEFRS 
CWPGWNAVMRSRLSATS 


8011 


21912 


A 


8071 


15 


336 


KUDKEYDRTGSPGTTHaSGYLI EPLVAE 
ASYELILSIAFFFFKRHGLMiLPRLEGS 
GVIVAHCTLKLLGSGGPPASAS * VAGTA 
GMWHHEGHE PGL FE KQNSG F I F 


8012 


21913 


A 


8072 


396 


274 


PHLHTWESSL*RQGLAMLSRLVS1!}SMPQ 
VILSPWPPKWVL 


8013 


21914 


A 


8073 


189 


54 


NKPLGPGMWQaCNPTPLGGQGQWIP*A 
PEFKISLGKLAKPCFY 


8014 


21915 


A 


8074 


3 


257 


HAFVIiLFETRS CLSLRLES SGAMIAYCR 
T_,EIiLGSGDPLTSASQVAGTKGM*DDAWL 
RQVSNSWPQVILRLSHPHMFLFQFSLET 


8015 


21916 


A 


8075 


1 


119 


TLPATWEAGAGGLLEPKSLRV* CVFTAT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 

ingto first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 

residue of 
peptide 

jequence^ 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=]VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y= Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=passible 
nucleotide insertion 














VNSSRTPAWAA 


8016 


21917 




8076 '~ 


232 ^ 


266 


LPYCSGQWCLGLPIIQGVQWHDHGSLQP 
RLPGLNQSSCLSLPKCWDYGREP*YLA* 
YQARYHGSIiAEQNVFNIRPGIMVHARS P 
STLGG 


8017 


21918 


A 


8077 


390 


143 


LGGFLVFCPPAKKGFFPNPINLGTPRFF 
PPPPF*KPAPEINFGGPKKKKIPSPPPG 
VKFDPLKRPPPFFFFFFFFFFIHLLV 


8018 


21919 


— 


8078 


424 


144 


LKNYIFFTFL*NRDRVSLCCPRLVLNFW 
AQEILLPWPPKVLGLQA 


8019 


21920 




8079 


182 


45 


VHKAGMVAHaYNPSTLRGRGGWIT*GQE 
FETSLASLDNMVKPRLC 


8020 


21921 




8080^ ^ 


405 


163 


YFLEEMGFHHADQAGLELLTIGDPPGGM 
SHCTWLNLNVMLIQKHPHRNIQNNV*PH 
ISGPVKLTHEGNHHNMHMSTQKHSP 


8021 


21922 


A 


8081 


39 


240 


QSKTVSKKKRKIIFCRYRVS*SPGFKQS 
SHLGLPKBWNYRREPPHPASTFSQSFPV 
KDHFCSGVSS 


8022 


21923 


A 


8082 


2 


263 


DCCVS SE*GSVGVGLSE PGAG YNLLVFR 

LLRPLEKRSIRVGGSRFSRYYLSWLPLA 
RfCGKSPTPCTSQ'j?vQCPTLLLGLHPLCD 
KPO 


8023 


21924 


A 


8083 




248 


SVGVGPSEPaA*YNLLVCCLLRPLEQHS 
IRGGVSQFSKYRQSWLPLftRKGNSPTP * 
ASCGMRGPALLWLTLLGLHPLSNKPQ 


8024 


21925 


A 


8084 


3 


255 


HLSLPSSWDYKRMPPHLMIFFFFFLGKK 
GVLP*GQTGLEIi*N*KNPPA*PPKQAGV 
KGVTHRPKP*HGGKGRFEKDTPGGPFL 


8025 


21926 


A 


8085 


409 


145 


LRGPYKKNFSFQAPGRKIGSFKKAPPFF 
FFFFF*GQSLCCPVGLSTGVIIAHCSLE 
LLTSSDPASSASRVAGTTGACHCS*VLQ 
TFTF 


8026 


21927 


A 


8086 


199 


26 


GFPPPPNKTEIF*RPPDFUSrFFFLVEMN 
LTSLSRLVSNPWAQAVLLPWPPKALGPQ 
A 


8027 


21928 




8087 


222 


3 


SVGITGVSHHVLWYFTLKGPLGRGAHSC 
I!JNSALGGRGGWIT*AQEFETSI.GSVARH 
HLYKKYKNYPVaWRL 


8028 


21929 




8088 


415 


197 


TILCFYHQNNSVFSSNPFLSL* IFIF*D 
RVSFCHPGWSAWQSWLTVFPILCATVK 
YFTCQQYI IMIFALNS 


8029 


21930 


A 


8089 


297 


71 


FIFPENKNRPEWAHaCNTLRGRGGRIT 

*GQEFEKSLSNIARPHLYKTKTQNILSK 
YIQYLGFQLTLQIDLANIY 


8030 


21931 


A 


8090 


185 


415 


PFGLKQSPHLTLLSS*DYKHTPPCRVNF 
FFFLGGFGF F PKAG FNLGN* RNFLAS P S 
KRAGMAGINNLAGPIFRFFKK 


8031 


21932 


A 


8091 


2 


125 


RKQAS KKERKE* KK* RKKERKKERKKER 
KKERKEKERECEEP 


8032 


21933 


A 


8092 


8 


412 


KGPQGTTLTCIFDLSYSSSLSHWSQSLC 
VAFPLPECIFPHKPLCPACHFFFFQSWG 
LALLPKREYRGKIGVNGNLELLG*WWPP 
SGTSKIAEPflELFHHTHARPFNWI IKTD 
FPYIAQTGF*LRPSRNPSTLEV 


8033 


21934 


A 


8093 


249 


143 


NIFAATWMRLETI ILSEVTQG*KTKHQM 
FSLICGS 


8034 


21935 


A 


8094 


3 


280 


PERWDYRHEPPGPASFT*LLLFVFNL*R 
KGLYVGQRGHQLLASSSLPAPASLSAGI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 
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od 


SEQID 
NO: in 
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09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=I.soleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V--Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IRMSHCTPAWVTVRPCY* INK* INGKDT 
IRQNLKKM 


8035 


21936 


A 


8095 


404 


16 


QMGFCLFFLPSSKEVFFPPIFFFYRESF 
FLFHFFLDPRQY* LYF 1 1 KKNI FLPS PF 
IFFLFIRVSPYIFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFWGSMEL 
LSKAHQCISSHGSG 


8036 


21937 


A 


8096 


358 


200 


KSYFIFSLIIFSHYFFLLFLKHLLATYY 
CFLIDCTLFFSFYCPFCCLFVLLSGRFP 
RSCL*TFH*RLYFSFPIFNVQKLF*LSD 
ICFCCIYLFIYLFIYFETVFCSCCLGWS 
MMA*FNGMFKDRI 


8037 


21938 


A 


8097 


2 


132 


KEEKKERKE* KKRKKERKKE KREKKEKE 
EGGHKEGKEGKNSST 


8038 


21939 


A 


8098 


48 


331 


PGTWHACLGGEDILKWQTKGRGRRAGSH 
RTS W I E P LETVFLELEAGGRLES PGKNY 
SRLGAVAHICIASTLGGRGRQIT*GQEF 
DTSLADFCST 


8039 


21940 


A 


8099 


7 


373 


NGELERGFAMLARL VLNF * PRDPPT * AS 
QSAGITGMSHHARPTIESFIHYCWECKM 
E*PLCKLILYFHLFIFKRQKLGVHWCHH 
SLLQPPTPGLQRSFRLTPPSGWDYRRGP 
PCPANI*LII 


8040 


21941 


A 


8100 


116 


432 


YLRNLFYQRQAFSPAILNI,MCSAT,*ESH 
TVAQAGVQWHNLGSLQPLPPRFKRFSCL 
SLPSSNIFAPLS YFFRNSSFTNIS PSE I 
GPQITDPLFIKKKIKVSVMVE 


8041 

"8042 


21942 

21943 


A 


8101 


361 


138 


FNLLGRKKKfTFGGLKEGBSRRPPNPGPQ 
FFFFFLFFFFFSG*RWGI,TMLPRLVLNS 

WAQVII,I.PWPPKSTjGLEV 


A 


8102 




199 


PTR?FVFI,MEMEFYHIGQTALEI,LTSGN 
L P ASA3 Q S AG I TGV S DR * LA* YLF FFHE 
LFCYFALLLI 


8043 


21944 


A 


8103 


412 


46 


kggspppppppkigrkgppnpgdpffff 
spfffrgf*gsefppk*kvllvgflkpk 
ppfplgwgnp pp p p vggfs ppqslqfff 

FKKVFGWVFFFPLFFFFFFFSPFFLFFS 
ETESHHIAQA 


8044 


21945 


A 


8104 


192 


2 


eetelsffiikyylprnqkvckrl*rrl 
gpvahactpsplggqrewitrsgvqdqp 

GQDGETP 


8045 


21946 


A 


8105 


373 


183 


GGPF* FQYILILFDNFFFFSLSFPFSQD 

ffflssffffffffffiinyfyff 


8046 


21947 


A 


8106 


373 


281 


SFKTAPPFFFFFFW*RLSLTMLPRMVSN 
SWAQWLLPQPPKVLRL*ATQRKKHFRQ 
EQI 


8047 


21948 


A 


8107 


3 


114 


FLI FVHIGPNYVAQAGLELLSSDDPPAL 
ASQCAGITGMNHHARPYYLL * TLNI PLP 
YDPE I FLLGIYPR*MKTYVHTBC3^T* FF 
ITPLFVIAKTWKQPK*PSVLGLQA 


8048 


21949 


A 


8108 


416 


243 


PGVFPPPPF*NPPPEIIFWAPKKKKYFP 
PPPPLNFVFFKGPPPFFFFFFFFFFFFF 

L 


8049 


21950 


A 


8109 


2 


211 


SVTQAGVQWHVSLLQPRLPGLEQSSCLG 
L**CWDCGHEPWHLALVTTSKHTESCSL 
LCI PTATFLGLVT 


8050 


21951 


A 


8110 


172 


28 


HWCF ITSFFGFYFL*RQGLTKS PRLVP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 

otide 

location 
correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NSWi/*VILLPWPPKVLGLQA 


8051 


21952 


""A ~ 




2 


225 


QKSAKNFYHLSIYLSIYLSIYPSIYHLS 
IYLSIIYLSSTYE*VIILEQKWNSIKKG 
CL*CTIQTNCL*QLKE*VIRLIPKL*YK 
IKLPTCWSVHLNWFTLCNISWYLKYLAM 
* *H KLVCFA* FW YMSNKLL I TI KRIGH 


8052 


21953 


A 


8112 


392 


221 


PFPKRESPPDKVKPPPPDIFGGGKTKKK 
TLAtSlM*FPPKEKEFPVFFFKKEDFF*NI 
VGNFKTPSKI*AVCF*MPVFKKKKGAPK 
KKSSFKNF 


8053 


21954 


A 


8113 


369 


112 


IFFSTIYYFXHPFFFPPPLFFTPPPLFF 
FFLKKKKFPFFFFFXXXFFFFFXXXXFF 
FFFFFFFFFFFFFFFFFFFFPLAF*HVY 
SF 


8054 


21955 


A 


8114 


418 


248 


KYYLPTPGY*NLILLNGPPFFFFFFF*D 
RVSPCCPHLP*TPRLTQSSCLSLPRSWD 
HRCMPLRLVTF*FTFKLSHQSLRTLLTY 
TLAGTTLG 


8055 


21956 


A 


8115 


229 


78 


SFKGAAPFFFFF**RQNLTMSPKLVSISrS 
RPQAVLPPWPPKVLGLHRGGWIT 


8056 


21957 


A 


8116 


356 


66 


PLFPFFYIKNFKKKFFFFFQYPFPFPPF 
PKHYSHKDIFFFFIIKFFPSSSFSFPPF 

KALSSFFFFFFFFFFFFFFFFHA*FPVF 

FLYKIPMLYPQL 


8057 


21958 


A 


8117 


212 


81 


?:RLSHCSl,NLPDSSNFPaSaS*VAGTSG 
THQLVMWHAGS PR F 


8058 


21959 


A 


8118 


367 


2 


GVFFFFFSPAKKGVFFHPFFFLGPRFFP 
PPPFFFPPLWFYFWGP *KKY* FPHPRGY 
EFFFF*GGPPHFFFFFFFFFFFFFFFFF 
FFFYAKGIGVSPCYIMLGYWFSSFPCGT 
ISPRVRPRV 


8059 


21960 


A 


8119 


38 


215 


FFKSABHHTRLIFNFFFFLHKTGSHYVA 
QAGVELLG*SSPTTSAFEIGGITGLPLS 
FKH 


8060 


21961 


A 


8120 


266 


2 


PQIFLFIFETRSHSITQVGVQWCDHSSL 
QPQNPGLKQSSLSS * DYHHTLTAPG*FC 
FL*RCXSLaMFPMLVLSSWPQAVLKIHPC 
CGMC 


8061 


21962 


A 


8121 


3 


332 


DAWAnAWGHVPPCPANFVFLV*TGFIiHI 
GQAGLELPTLSDPPASASQSVFIYFLET 
ESHSPAWETERDSVSQKKKKRGGRFKGS 
NFTSMLQRFIFFMGPPKLISRAGV 


8062 


21963 


A 


8122 


378 


56 


LFIPLPIREEVFPCIPG*NWAPPGVLYN 
GRPFFFFLLETVFIiHAGQAGLELLGSSD 
LPVSASQSVGITGMSHRVQPKLRIFVNI 
SFPFEIFCEIMLLVPLAQP 


8063 


21964 


A 


8123 


2 


316 


ISMLLALLLRIITFRLPQMGYVEEFTP 
YECGFNPISPGRAPLFIEHFLVDITFLL 
FDVQIALIiIiPIiP*ALQTGNLPIiIVMPSL 
Llil I IIiALRLDYERLQNGSD 


8064 


21965 


A 


8124 


331 


14 


ITSILL1IITF*LPQI«GYI*KSTPYEC 
GFDPISPARVPFS IKFFLVAITFLLFDL 
EIALLLPLP*ALQTTNLPLIGMSSLLLI 
I ILALSLAYE*LQKGLD*AE 


8065 


21966 


A 


8125 


2 


228 


IIITF*LPQLNGYIEKSTPYECGPDPIS 
PARVPFSIKFFLVAITFLLFDLEIALLL 
PLP * ALQTTNLPL I VMSSL 


8066 


21967 


A 


8126 


451 


200 


KA*WNRFAimVTKyTCESQEEVKI PTF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

sequence 


Predict- 

Sion 
corrcspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutainic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop eodon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGVWKKLIPAHTDDFVHFKTSVQEVATD 
VAKIASLLELEVESEDVTE*LQSQDKT 


8067 


21968 


A 


8127 


2 


396 


LIVIINTLLALLLIIITF*LPQLSGYIE 
KSTPYECGFDP I S PARVPFS I KFFLVAI 
TFLLSDLEIALLLPLP*ALQTT1ILPLIG 
MSSLLLIIILALSLAYE*LQKGLD*TES 
GWEKLVKKEKADKLKGGRP 


8068 


21969 


A 


8128 


1 


350 


GTRGLQTLLPLIAF* *LLASLAtiILALPP 
TINIiLGELSVLVTTFS*SNITLLLTGLN 
IIiVTALYSLYIFTTTQ*GSLTIIHINNIK 
PSFTRENTLMFIHLSPILLLSLNPDIIT 
GFSS 


8069 


21970 


A 


8129 


375 


1 


LTCFSEHRPPPRANl''KFSRKTiiAPFFLP 
GGFQI PGPNFFAPPI FPNGGPIPGLDPQ 
PGPIiNFF*KSFFPDPKFFFLFFPPPI*N 
SGVFLPPJsIPVFPKTPNFLFFFSPFFPGD 
GVLLCHPGRLPRA 


8070 


21971 


A 


8130 


32 


211 


G I RUG FAI1LPRLVLNSWPQAILLPWPPK 
LLG^QAGVELLASSNPPALASQTAGMTG 
VSTRPPVLLPTLGTHT.C'LDACLGSST, 


8071 


21972 


A 


8131 


396 


5 


KYRDGPPPPGK?FFFFF*DGFSLCCPGW 
SQTSCPK*SSCLPEG*DYRHEPPHMTLS 
SSY*CSQMQKCINKNCT,GSCWAHACNL 
SPLGGRRITWGKKF ETSLDN I VKNRLKK 
KKKKI YQGGGDHPCI FTN 


8072 


21973 


A ■ 


8132 


347 


34 


KAGPVIPRGPQEFFFFFFPQDGIT.T.CFP 
GWRAVAKS*VTATSNSWAQAITGMLPPC 
PANF* KI S* RQSI SMFPRLVPNSWPQVI 
LPPQPPK*LGLmrCLVPSLA 


8073 


21974 


A 


8133 


102 


270 


DYRRTPPHLANFFYFL*RQILTVLPMLV 
SNSWVQAILLLQDFFSVQLKMGFLSVP 


8074 


21975 


A 


8134 


338 


47 


FLSTQKESSPLHTHTHTHTHTHSHLLTH 
HHGRGQERGKRGR*GG*WRTG'mSPHGP 
HCVRLRKSRPSGELGAAQGGCSLIQPKI 
EDPGQSGGPRKVP 


8075 


21976 




8135 


68 


467 


SLLNRIPFCLCIiMSFDCVFLLGLLFEAG 
FCSVAQAGVWWYEHGSLQP *PRGLK*SF 
HLSIiPKFWDYENEPPHPPFSCFK2VLNFK 
DYCDSGTVPCPFMKELSWLFIHALTESF 
VLFIVMSKLITTKHLTI ISNV 


8076 I 21977 


A 


8136 


500 


335 


FHHVGQAGIQLLTSGDPPA*ASQSAKIT 
GVSHRARPQTSFLNKLPSLR 


8077 21978 




8137 


5S 


222 


IYGYEGRGSVAGSLSSL*SATTDSDIjDy 
DYLQNWGPRFKKLADLYGSKDTFDDDS 


8078 21979 


A 


8138 


345 


196 


KGDQAQWLRPGIMVflHICNLSTLEGQGGR 
IT*AQKFETSriSNTLRPPlSFK 


8079 


21980 


A 


8139 


500 


355 


FHEWGQAGIQLLTSGDPPA*ASQSAKIT 
GVSHRARPQTSFriNKUPSLR 


8080 


21981 


, A 


8140 


3 


231 


HASADAWGRNEWPCAVAHAHNPTTSGG* 
GGWIA*AQEFEFETSIiCKIVKPHLSKKK 
KKKKNFSPGWGSPPCTPLF 


8081 


21982 


A 


8141 


416 


26 


ARIFPNNSFWPGQGGFPFPPRPLEGPGG 
VI P * CQGVQARPAPPGEPRLFPKPQKI V 
GGGGEPPLFPPPERAGPQKCRPFGEGGF 
N* PKPRPCPPGWGAQGGFVSKKKKKENE 
DSKIPLIGKYPVGIRMMA 


8082 


21983 


A 


8142 


413 


35 


RGAGGGG S P PG PR YP SQQHLLEART PTP 
FARSRGSFPGVRGSGWRGGLGG*SRERP 
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NO: of 
nucleotide 
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NO: of 

sequence 


M 
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od 


SEQID 
NO: in 
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nucleotide 
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ing to first 

amino acid 

peptide 
sequence 


Predict- 

otide 
location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tfireonine, V=VaIine, 
W=Tryptoplian, y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, V'^possible 
nucleotide insertion 














LPSLPPRALPRSGASSGPLDGGAQGPGR 
RERWAFLGGI RADARG CHAVTS PE B YRL 
PGAGAARARR 


8083 


21984 


A 


8143 


166 


368 


ENI LWAAMWLD CHP S TLGG *GGR I T *G 
QEFKTNLANIERPHLFFFFFFKRDSLFA 
PRAEGGGPILF 


8084 


21985 


A 


8144 


1700 


621 


DLRDHSVLQQGPACLTRCGEPILSHSEG 
SKIGSERESGEHSRHPEAPPPSVEVGSR 
RS*GQDS*GFG PGNGQDTRG PL SHAAPN 
WTPDPAAAVAAGGTSLPPRGGGGI *RGR 
SHTARSGARVGSANAHAEGGSGFSRHSG 
SHGCLRNQSPRLGKAGAIiCGSAWVSGL* 
GSGLTSPRGRRERLRMPGVAKGRPR*GN 
PRRGRGKQHGIPASGLPGRCWLGIKMST 
LQPRTRAPEELFVPQESSGSTPEKMLVS 
FHGS SLRNEATPRYSQQEEAGNGRWQQS 
LSLERWPPWTSHPLGTPPLMPVAVARCC 
ILPGLWPLLSPPSGSASPVSQGPGCLSL 
WPNAFKKDDFPVRQGDTSWRSVS 


8085 


21986 


A 


8145 


390 


38 


NTTTFNIYFFYFFFFSLFIYFKFFFFFF 
FFFFFFFFFI FI LFI YSLLVFGVWVLGL 
GLALALLAKLFLVNSLCRRYRG * SLL YY 
AWL*FFIFPLRYYIYCajRFQFLSPILYL 
GKWFG 


8086 


21987 


A 


8146 


20 


187 


KLTTSTGRMPGRPTRPNF*VF*RDODLI 
MLPRLVSNSWPLVNLLPWPPKILGLQM 


8087 


21988 


A 


8147 


417 


64 


VRPRVRPRVRPRIRKKKKNKKKKKKKKK 
KKKKKWGGGF*DFWGGEKGPPPLQKNDF 
LLFWGFFLKSFFFFCGGLFFWGGPFFFP 
P PPNFLGVGGKKKTFRLLOGPFSAFFGG 

EKK 


8088 


21989 


A 


8148 


351 


187 


RLGTFYLSNI LRFYLS ITHGPI P YIGRL 
TSQNLPLIiLLDLT*LEKLLPKKSQTYN 


8089 


21990 


A 


8149 


465 


60 


PPHFWGNPKKKNSQFFWAKKKKKIFLGP 
* KMVAKKGWFWGRKKGFQRGGKKG* KG 
GQLFYPVLSQVLGGGQNHGPPGKAGFGE 
NFFFFGQGLGWAI S RWLRTP LFLAGGGE 
DPKKNFKP PP KI KTLGAGVKKK 


8090 


21991 


A 


8150 


1 


175 


WDYRHVPPYLMIVFFVIFRGDKVSLCCP 
V*GPTTELKRSSCLGFP*GWDYRREPPH 


8091 


21992 


A 


8151 


59 


35 


RWNSRPRRPGLKFLTSGDLPASASGSAV 
ITDVSHHRWPETFFR*RA 


8092 


21993 




8152 


269 


159 


ARVEVSKECYIYIHTHTHTHTHTHIHTHT 
HTYTTKICLPS*LFQVCKMEKFGRLWQ 


8093 


21994 


A 


8153 


1 


256 


PGPPPRFFFFFFFFFFFFFFFFP*TLKI 
RFIWRKQKrHNPKDGILHQDISQVISLQ 
ITNHFHRRYFCTNFTCIQERDINQSLFL 

F 


8094 


21995 






252 


41 


LIIPVLICIVFIVLQFKIFSVISLNKGL 
FHNHFFIF* *RWGLTMLPRLVSNSKAQV 
ILPPWPPKVLGLQV 


8095 


21996 


A 


8155 


21 


443 


HELSLIiFISTLAETNRTPFDLVDGESE 
LVSGPSIEYAaGPFALFFIGEYTNIIII 
NTLTTTI FLGTTYnALSPELYTTYFVTK 
TLLLTCLVL* IRTAYPRFRYDQLIHLL* 
KKFIiPLTLALLI * YVS IPITI SS IPPQT 


8096 j 21997 


A 


8156 


2 


205 


LLIILMKHFN1.EIQKAQ*TRINLKRSISR 
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SEQED 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last^amino 

residue of 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PIienyIalanine, 
G=Glycine, H=Histidine, T=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














HVIINLSKAKHKG*ILKAVIiEKMLIMYK 
KPLVKLQVISN 


8097 


21998 


A 


8157 


2 


205 


LLHLMKHFNLEIQKAQ*TRINLKRSISR 
HVI IWLSKAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN 


809S 


21999 


A 


8158 


2 


222 


V Y TWGTGKARVF PGEF *LP VWI PTRHL K 
FVNEPIRDANESASAETENPQSSIIHPQ 
DEQNGDIRRTDKVTIHQ 


8099 


22000 


A 


8159 


1 


297 


GFHHVGQSGLELLTSGDPLASTS*SAGI 
TGMSHHAWPLNCSYGDSFSWLKCPELTV 
ACWWEEGWRKLLMENEQGREAHQKLESV 
INFYIKDSMTKRKYK 


8100 


22001 




8160 


138 


437 


NEYDHFSI I KERRDFIVCVFFFFFSKKS 
LTVGTQGGPSYLEPPFLGNKEFSCLTFL 
GSWDNGGATTRLG*FFVFFRKKGVSPCG 
PGWS*TPDLRKTPPSA 


SlOl 


22002 


A 


8161 


330 


148 


QQSKTPFFFFFFKKKKKYIYICBIWHAY 
NPSYSGG*GGRIT*AQKFKTSI.l\nMTVRP 
HLYKK 


8102 


22003 




8162 


391 


255 


RPAPADF*FFFFYRLGLaVLPRLVWNSW 
PQATLLPQPLRVLGLQI 


8103 


22004 


A 


8163 


131 


3 


HKVSLCYPG*SALMQS*LTVASNSWAQA 
ILQP*PPE*LGLQA 


8104 


22005 


A 


8164 


97 


440 


NKGSLCPPGGRPGALSHPTARSRGSPGP 
TPQKTGHNGPGPPTPVHLEFFEKRGFTP 
VGQGGLKI.RT*RSPPPSAPQKAGITGGS 
RRPGPIQKKFLKTLLGETKWMAKLIKrY 
LK 


8105 


22006 


A 


8165 


541 


295 


FSQQKKKIFISTNLSHQESKIYICKaiE 
SHSVIRLLEGSGAIMNHCSLKLPSSSDP 
PASVS*SAGIIGVSQHLA]SrYTFKKIF 


8106 


22007 




8166 


440 


259 


PPQKFFFFFFFVKTGFRHW*AGFRLLD 
SSDPPPLAFQSVEITGMSPSARPMFGVF 
HSLR 


8107 


22008 


A 


8167 


23 


405 


FRICHLLRVHLLVYFIPSFTDNHLLSTS 
AMFLYSNDWIAQNVSGPFLGIIFRTFF 
FFFLKGEFFEaPQVGGQGGNI.G*LNPPP 
PGLKGFSGLTPPGGGNSGGGPPWKPNFW 
IFKKGGGSTLWPRII 


8108 


22009 


A 


8168 


23 


388 


PYFSCAIiPRHHPPTTNLGPHLGMGAPWG 
QPSVAHHTLLFFFFFLKRDFCFVPRVEK 
LWLEESSLNPPLLGLKEFSCLTLWRTGN 
NGGPPPPPVIFCP*iaCGGVYPWGPGGGE 

TPALKETPGL 


8109 


22010 


A 


8169 


612 


498 


RTP*DNVPQ 


8110 


22011 




8170 


2 


288 


VNLTFFPQHFLGLSGMPRRYSDYPnAYT 
T*NILSSVGSFISLTAVILI IFMI * EAF 
ASKRKVLIVEEPSIHLE*LYIKKKKKKK 
KKKKKKKKKKN 


8111 


22012 


A 


8171 


381 


101 


QKSLQQPSNIATILEEVQVIFLGSPYHL 
QRWEMRLTKWLRGLQAVAHACNPSTLGG 
MGGWIA*AQELETSGDPPPSSNRQLFTG 
EHNNDNKLI 


8112 


22013 


A 


8172 


136 


360 


TKKHFGLLK* *TKKHFGLLKKKKKKKKK 
KKKKKKKKKKKGGGAFKKKKFFPRGGGK 
I FFFFGAPKKECFGGGVLKTGGGKKPGVT 
KK 
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SEQID 
NO: of 
nudeotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last^araino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PIienyIalanine, 
G=GIycine, H=Histidine, T=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proliiie, 
Q=Glutaniine, K=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=tJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 


8113 


22014 


A 


8173 


358 


58 


EKVSVFFLKMKCSGIILLPCRLTLPGST 
DSPASVCRMPAIAGPRRtIA*LVLVETGF 
RCVGRAGLQPLTASDPPTSASPGAAIAD 
GVSFTQCSMVPRIRVQ 


8114 


22015 


A 


8174 


254 


370 


GLTVLPGLVSNSWAQVILLSWPPE*LRP 
QAYATGAGFLL 


8115 


22016 


A 


8175 


48 


269 


FYVILCLRDRVLLCCLGWS*TLGLKLSS 
YLSLLSSWDLQHVPPHSflSCLTF*VKAC 
FFLLKYLKRLKIYAKIM 


8116 


22017 


A 


8176 


1 


338 


PTRPINTL1ALLL1I1TF*LPQLNGYIE 
K3TPYECGFDPIS PARVPFS I KFFLVAI 
TFLLFDLEIALLLPLP*ALQTTNLPLIA 
MSSLLLIIILALGLAYE*LQKGID*AEF 


8117 


22018 




8177 


426 


130 


QSLDPPGNMVTLSLQN*KNPPP*GGAVF 
P PPGRLGGENP PGGS KEQKRRPAPQGG* 
P RQ P P S P KKKNAD S E FS I FKLQ PALLCK 
QYCIVLHALPNKILF 


8118 


22019 


A 


8178 


396 


244 


KIHLLYICMCIYIYVISRHYir,K*HTHA 
HT YTHTHTHTHTHT PKNTKYTLW 


8119 


22020 


A 


8179 


2 


119 


TRPGNWPGWAHFYDPSTLGA*GELIT* 
GQEFETSSSLY 


8120 


22021 


A 


8180 


2 


80 


FLPLTL&LLI*HVS IP ITI SS I PPQT 


8121 


22022 


A 


8181 


438 


333 


SPLGRLRHYlICIiNWGGKGCN*PKSCPCP 
PAWGTK 


8122 


22023 


A 


8182 


393 


19 


IFFFFFPPPSPAKFFFIFKKKRFPPLWG 
GVF*SPPPGFQKTPPPQKVGFSRVFPPP 
PPKKFFFGGGKYFFFKEPPPPPPPIL*K 
GGAKISPPPPPGKKKNPFFLKQQKKKKT 
PFIFFFPPKKIKN 


8123 


22024 


A 


8183 


24 


269 


GSQQVLGE'CEtoRVLLCHPGWSAVM*S* 
LTVASTSWA*RILHCSLLGSWEYRHTPP 
CL*NRNPQLTVKWRVKDHNNKKSHR 


8124 


22025 


A 


8184 


154 


361 


WHNHDPLQPQTHGLKRSSHLGLPS IWHY 
RHVPPRLANFLHLSFRQGLAMLPTPSVC 
HLLVFLS*HIYWTFGYLS*SVCSHLrjSrF 
L**WHMHDPLQPQTHGLKRSSHLGLPSI 
WHYRHVPPRLANFIiNLSERQGLaMLPTP 
SVCHLLVFLSETLGPK 


8125 


22026 


A 


8185 


228 


337 


RPGMVRHACIPGILGGRGGWIT*GEKFE " 
TSUUIMVK 


8126 


22027 


A 


8186 


314 


127 


DIQPKNEGEGEEEQELWLGMTAHAYTPS 
TLGD*GGRIA*VQEPETCLGSIiARPPSL 
QKNFKN 


8127 


22028 


A 


8187 


10 


420 


TQEAEVaVNWDHATALQPGRQSETPSPE 
KKKKKRGPPRWGENFFPTPGGF*KDGGN 
KKTGFPPFKRDGPPPPKNWTPWVFQKRG 
APPEPPPLG6ALGREPLGGFFGKGGVET 
P PP P I FKKENP PEKKPHRGFNFFAP 


8128 


22029 


A 


8188 


398 


1 


LKKKRPQTPFSPKIQF*KFFPPFPFKKS 
PFPKKLLKKKRAPGGPIFFFFFYPGLFK 
KPSPL*MGSPPPPLFWPKN*KKPPLLKK 
GPGF*WPFFQKKKPPFFFFFFWRQGFTV 
LPRHTPSCLVNFCIFCRDGI 


8129 


22030 


A 


8189 


30 


153 


AWyVEREDHVSLGG*GCGKPWLCHGTPA 
WVTEENPKGKIK 


8130 


22031 


A 


8190 


206 


45 


HPSQNSLSKH*ERVSVCCPGWSAVAGSQ 
LTAASTSQAQVILLPQPPE*LGLQA 


8131 


22032 


A 


8191 


3 


228 


ICVCIjCVCVYICDVCVGCVCVYIDEYIQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














I YRY IHTYIDIYTHI*IYIHTYI DTHMW 
VCVYIYIYIYRCMGVRET 


8132 


22033 


A 


8192 


41 


416 


I CRVKEI ASRPLLETHSLSLVE LS * FCL 
TAQ*RNDFLALKHLTAVLYNLILHSSGF 
WSSLKSHFIFLAl*yYL**FI*SN*RRD 
LAVLPRLVSNSWPQVILPPQPPKVLEIR 
YEPPSLALILFFN 


8J33 


22034 


A 


8193 


2 


183 


KRLS I WQKKVEGEGFQGMDVGPVS VAHA 
CVSSSLEGQGGKIA*GQEFETILSliIIVR 
PHLY 


8134 


22035 


A 


8194 




465 


AGVQWHDLGSLQSPPPGFK*FSSLSLPS 
SWDYRSAPPCPADFCIFRRDWAGWGVGC 
GFHRVARAGLELLTSGDPPASAS PGAGI 
AGMSHRTQRGQDNFASWRRGGPQMSPSP 
PSCLFLQNTFSLKPSWAPYPHASALGG 
GEYGGPPSPQPHLE 


8135 


22036 


A 


8195 


397 


149 


LICLTDLPIRFLSLLRLVFTVFNTKVKV 
KDLIKLGWAHTCNIjNTLGG*GGRIA*A 
HEFETSLGNIVRAVIPVCCFS VI LSR 


8136 


22037 


A 


8196 


355 


95 


KKIMPTLCLKIPS*YRCEEIIQLSLQSS 
CDHRHMPPHPTNPF*RQHLSMLRSLVLN 
SRPQAVLPHGPPKVLGLQAPATAPGILS 

LF 


8137 


22038 


A 


8197 


382 


112 


KTD FGKKGS PS SP PPFFFFKKSFAPSPR 
GEKKAPLKSQRGLCPPKPKDPPPPGSG* 
TQDKGPPPPGQPIFFFFLKKGVCLFSRG 
GSKTFI 


8138 


22039 


A 


8198 


32 


351 


ASRIGFILQEFLPVRTOTTNPSAGADGY 
NFLEKQKGTEKTSCSVAQAGAQWCHRNL 
L*P*TSELKQSSCLSLLSSWDYRMYFVL 
LGKQVTLLIGKRALSVYPRDWSLC 


8139 


22040 


A 


8199 




314 


INTLLALIiLIIITF*LPQLSGYIEKSTP 
YECGFDPISPARVPFSIKFFIiAAITFLL 
FDLKNALLLPLP+ALQTTNLPLIGMPSL 
LLIXIX>ALSXiAYE*LQKQLY 


8140 


22041 


A 


8200 


3 


109 


FTSKHHFGFEAaA*yWHFVDW* LFLYV 
Siy**GS 


8141 


22042 


A 


8201 


3 


216 


DAWAaVGHHHTRLIF*IFVEIGSLCVSQ 
AGLEILDSSDPPSSASQSAGIRGMSHGA 
GQEHLFHRPIQKIK 


8142 


22043 


A 


8202 


1 


267 


RIHTQEKPyKCEECGTQFKGPSTLTTHK 
FFVYCEEVaVLLKNCYSHLYPH* IIVNG 
SEADKREMCLLCIFFHPLREEQENVENK 
RKYUP 


8143 


22044 


A 


8203 


422 


92 


PLVPALPPKIGFAPTTPSSSPPPPPLFF 
FGPPVFPPPPFFNPPPFFFFSPPPKKKF 
PSPPPRGKIFPF*TPPPPFFFFFFFFFF 
FFFFFFFLFLYYLFILC 


8144 


22045 


A 


8204 


445 


260 


VCSPPLFIWVAPGFSPPPLFtOSrPPPNFF 
FGAPKKKNFFPPPPA*NPPFLKGPPLFF 
FFFFF 


8145 


22046 


A 


8205 


103 


248 


GPGWAIIACNP SSLGG* GQWIA* AHEFK 
TSLSNAVRPISAKKKKGGPF 


8146 


22047 


A 


8206 


407 


197 


QLIFKFL*EQASHYVAQAGLELLGSGNP 
PASTSQSVKVTGMSHGAWLLAPTLRSTS 
LNYNSLSVLLLPHP 


8147 


22048 


A 


8207 


1 


142 


SCSVAQARV*FRDLGSLQPPPPRFTPFS 
CTPAWETVELHLHKKKKKN 
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SEQID 
NO: of 
nudeotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nudeotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glu tamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8148 


22049 


A 


8208 


2 


370 


PEVFDYPN*FKNL*LHKTCTQ*L*AALF 
VITKTWNQGCPLTGEWINKLEYIHTMDY 
LAIKINELAWHKMM*RTLKCILLSERS* 
SRKATYYMISIIRHPRKGKTIEIVQNSS 
CQGLGERQKGL 


8149 


22050 


A 


8209 


363 


108 


KKPRVEACLSPGVGRERAKRER*KRERE 
RERERERERENEL*KLSLADLCIKNG*R 
EFFKEKRYVKRYAKILHVSGRKQKYGNV 


8150 


22051 


A 


8210 


386 


254 


LNPGGGVCSEPRLCPCTPAWVIR*DSVS 
EKGKKKRKGPENSLTATNF*ALSCNLDY 
LRCSMLKMQRQRKRKERTEVRFWQLHHN 
LTLGFV * L IVYQRE*RRTGW FVAKYMLY 
TPS * PGSFPRLRKCVEHVFGYEPACS PL 
FPLIHNQLHKAKC 


8151 


22052 


A 


8211 


3 


131 


AHLFIAWFTEYFKPTVITYCSKKKIPFK 
MLL L I D SAPS YPRALMEMYKGMNWFMH 
DNLTFILWPMDQ*VIFTS*SYYLEN*FC 
TIAAIDSDSSDGSGQSNLKI F* KGFTTL 
DAIKNICDS*EL 


8152 


22053 


A 


8212 


86 


286 


WS IWECTVISVQLFHKIJKTSFKNTS YWS 
GVMAYTNNLSTLGSQDPQIT*GQEFETS 
LANMVKSCLY 


8153 


22054 


A 


8213 


393 


344 


SFPFSLPSKWGFiCNLVPSPGYFLFFLNK 
GFRYFGQGGFKILNSNYPPPLASQKGGI 
SGISPWPRALVFF*WRNLGPN*RTIV3P 
KYGLK 


8154 


22055 


A 


8214 


3 


313 


QGLVLWHSHGLLQP*PPGLRRSSCFSLP 
CSWEHRCTPPHLANFLILCVCVCVCVET 
GSHFVAQAGLELLGSSDPPLSISQKSWD 
YKV*AI IPRAKQNYLMLFY 


8155 


22036 




821n 


165 


348 


RPCPLFPEDLLTFSREGPGTMAHACNLS 
TLGGQGGGIA*GQEFETTLGNIVEPHLY 


8156 


22057 




8215 


379 


218 


VFLVETGFLHVGQAGLKLPTSGDPPASA 
SVSA6ITGVSQKCRF*YMVFIFQKSSLL 
NALPICSLTMLLOILFSMFMTPKFFTCS 


8157 


22058 


A 


8217 


32 


276 


RYLPIIARLFTIAKr*KQPKSSLLDKWI 
KKLWCIYTMEYYSALKVILTVVTTWMNL 
VGFIPGDINHIQKAKYCMISLICXJI 


8158 


22059 


A 


8218 


171 


2 


KGVFGLGRLFFFLDRVWIiGLPEWSTWQ 
SHLSATSAS* IQAILPPQPPEELGPLCP 


8159 


22060 




8219 


530 


73 


TKPTVGSNSRSYINQTQYFPSTLTFGGL 
MRLTIYHFFFLr.VYYNPRVMVTPAYM*L 
LILFYSSSIRDLLrAKQPCLPNEGFSPH 
AC*TNATLLFPYSI*YIjGLPSFHLIFFV 
cfsfyfl*r*gltmppmlvlsswpqvil 
PPQPPKVLGLQA 


8160 


22061 


A 


8220 


350 


135 


ffffsqtgvaqnevq*rdhnslqp*ppg 
lrqsscltlpkcwdyrgkplciiaswiyf 
kdkvytstpvhtqey 


8161 


22062 


A 


8221 


185 


331 


PGMVAHAYHPNTLG6*GGKLT*AQEFET 
SLGNINSVPKKREKRRKEMM 


8162 


22063 


A 


8222 


73 


232 


YCSFKFITLTCLSWLGTVAHTCasrPSTVE 
GGGGRIT*GWEFTTSLGNIVESCLY 


8163 


22064 


A 


8223 


418 


211 


AMPSLLTYLLTFI FCRKKKI FFYWGEFT 
MLS *LVLNSWTQVILPPWPAKVLVLPRL 
ALFYSLLA7LFPSF 


8164 


22065 


A 


8224 


227 


33 


ESHSTTQAGVQWHDHGSLQPQPPGMSLY 
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SEQH) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

Site 


Predict- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, l>=Aspartic Acid, 
E=Glutaraic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metli ionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tl»reonine, V=Valine, 
W=Tryptoplian, V=Tyrosine, 
X=Unltnown, *=Stop codon,/=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














CH * CLRFWSWRPAEKPGWCS SLRFMELE 
SWSGASDAF 


8165 


22066 




8225 


244 


287 


TDSPTSAFRSAGITSISHHTLPKSTF*! 
FCRAGGLAMLPRLVUMSWPQVILPSQPP 
KALGSQA 


8166 


22067 


A 


8226 


53 


235 


knkwqgwahtcnpttlgg*gewit*gq 
efetspanmakippt,vmfafcfeetdnk: 

SHTK 


8167 


2206S 


A 


8227 


275 


T5 


GVLMPNFKFC I * QNSP FMLPGWLNS W I 
PAILPPQRPKVLGLQAWTILYKWYYGKE 
MWHKEI CEGTHDVLSKLLTTRRFKRHL 


8168 


22069 


A 


8228 


386 


211 


QLKLPPLKGNKLRPP* *KFFFPFFFQRQ 
GLTILPRLVSSSWHQMIFPPQPPEVLGL 
QA 


8169 


22070 


A 


8229 


I 


117 


VFILFYF*DGLAI/ILPRIATI'ISWAQAILL 
PSAPKVQGLQA 


8170 


22071 


A 


8230 


356 


26 


WHVHGSQQP*PPGLKQSFCFSLPSSWDE 
RCAPPHPINYFQICRDRVYKRWANMVIS 
PNPLVLISQSAGITSKIPCPGSEISTIF 
I FSHQLLLVRGVYPDMLVLQIiGKTCl 


8171 


22072 


A 


8231 


3 


188 


FPSLRPA* IFFYIYFFFLKNKVFBCHPG 
WTQLLDPNWTQAAGLKQSSHLSLLSSLN 
FRGSA 


8172 


22073 


A 


8232 


2 


253 


KIiDYINCGTISrLAQKa.FCLFGSLFFQR* 
GLTLLLRIjFMNSWPQAILLP*PLTVLGL 
QftTMLGSQVPVFDLGITLCRCRFGKLE 


8173 


22074 


A 


8233 


373 


65 


pekagsclelafglomqhppwastlpaa 
ltdfglatvpwiivsqflkshthththth 

THTHTCTRTLLVLFL* EJJIjTTTPLLRS I 
CKHRESACSLSLTWPLCC 


8174 


22073 


A 


8234 


390 


46 


rvfkpg*npplvkapapplffggv*grs 
pkkkhpqgpggggpppgfhpffbtpffp 

IFLRREPPPFFFRGGLGPPPPPFKTPFF 
FFFWVKGGKFLPQKKEOOCFFQS ISTRSV 


8175 


22076 


A 


8235 


387 


216 


QKLGVGLVPPGRVI FFFFFFFCC*GGGL 
TMLPRLVSNSWPLAILLPQPPKVLGLQV 


8176 


22077 


A 


8236 


1 


201 


RFFFTIPPQCFPQFILSF*RRGIAVI.PR 
LVLNPWPQAILPPQPPELLGLQAHPTTL 
GPVFYSYCVRA 


8177 


22078 


A 


8237 


399 


229 


SVGGGGLTMLPSWSQTPGLK*SSHLSLI 
KCWDYRPEPPHIAHFSLIiNLQLYCHTKL 

S 


8178 


22079 


A 


8238 


192 


335 


SHTLPHSSNFSYFW*RBGFAMLPRLVLN 
SWARAVCLPQSPKVLGLHA 


8179 


22080 


A 


8239 


442 


254 


GRDRVLLFCPGWSQTLGLKRSSCLGLPK 
HWDYRHEPPHPAENIKFCVWPTLYFYWP 
ALVQINLP * QLQGTECVACMC 


8180 


22081 


A 


8240 


381 


180 


VLKLFFCSHRGPTMLPRLVSNSGLK* S S 
HLGLPNCWD*RCEPWHPASLCLLTSTFW 
PPTLICNIHK 


8181 


22082 


A 


8241 


388 


260 


NAIPFYITKCCNPSTLGG*GG* IT*GQE 
FENCLANMVKPCLY 


8182 


22083 


A 


8242 


1 


145 


GFTILVRLVLNS * P LDPPALASQGAG I T 
GVSHCAQLIYVFQRSLQLLW 


8183 


22084 


A 


8243 


1 


173 


GLTVSQAGVQWCDHSSL*SQTLGLKQSS 
SLSLPSSWDYKCKPQHQADRFFLMEFFQ 
M 
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SEQ ID 
NO: of 
nudeotide 
sequraice 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8184 


22085 


A 


8244 


202 


327 


TLFITPIFF*R*GLAVLPRLVLWSWPQV 
ILLLWPPRVLGLHV 


8185 


22086 


A 


8245 


42 


289 


KEKRSI ILFIFFCRDGVLLWLPRLDLNS 
WPKTIFPPWPPK*LALQAHTTIPRGGGS 
IILICIRDVKRERKQYAKSKNPEGET 


8186 


22087 


A 


8246 


592 


485 


NIVSQAGVQWHNLGSLQPLPLGLKWFSC 
LSLLSS*DYRHVPPCPANFCISVETRFC 
HVGQAGLELIiASNNPPAPASQSTGITGV 
S P CTRPRVYVFLLW I LANGPS * RAETLV 
NLATNQEDADSSIiAPGLGSGHCF 


8187 


22088 


A 


8247 


307 


12 


MCSPVFSPKQKWPLFIMFFKKEFLFFFP 
GGRQGGHFGFLAPPLPGLNKFLVLPFPG 
NGDTGKGPPARGNFFFFFFFGI FSRDFV 
SPC*PGWSLIPDPR 


8188 


22089 


A 


8248 


392 


237 


HYGQASLAPLTSGDLPASAPQSAEVTGV 
SHRAWPTVIil *SFS *KFNWV 


8189 


22090 




8249 


153 


386 


FFPPGWPKPSLLEPPPPWLKGFPPPTLL 
GGGD*KRPPPPPGI!rFCFFK!QIGVSPLWE 
GWF *TSNLGNPLPSPPKRVGI T 


8190 


22091 


A 


8250 


387 


166 


lETILDAIKNIHDSWEEVKISTLTRVWK 
N* IPTPMDNFEGFKTSLEEVTADVMGIA 
RELEVEPEHVIELLHDIS 


8191 


22092 


A 


8251 


405 


266 


LFAIVKR*NIIPKCLLIYflRIWKMWHISl 

GWNIISLISKGMKFSYJ1FLTWMNLEVIM 
LSEISHSQKVT*YMVtiLA*GHLEKLNCL 
PIKQQTPIPSSSSP 


8192 


22093 


A 


8252 


379 


201 


PGftNyACNPSTLGGRSGWIA*AQELETS 
LSNMTKPQPCSTPQCFLVNQLLLQNLYP 
YFP 


8193 


22094 


A 


8253 


78 


311 


LTLYFHTRKKKGLK*IFYFI*SILSQKD 
LJCVGTGPGaGaHTYJSTPRTWGGWKGLQAG 
KRIT*AQEFETSLD]SITVKTCL 


8194 


22095 


A 


8254 


303 


81 


TPCAENFKLRKRFHHIISLQRKMNYLIS 
FLYF* *SHSLTILPRLVSNSWPQAILLP 
LPPKVPGLQVSATLPGL 


8195 


22096 


A 


8255 


2 


376 


MGYNLVCHLLRPLEKHSSIRVGVT* FSG 
CPPSPLPLftRKGYSLTPCaSQVRQCrAL 
LWLTLGGLHPLSCPHCPTSPSVMNPVPQ 
LEMQKSPVFCVSHAGSCra.ELFLPGHLG 
TQSSLDSFFKKVI 


8196 


22097 


A 


8256 


70 


356 


LIIAYNLTYDLSWKMFyVYLRKIYTLLH 
WVECSIYTCMV*FVYSMVSWNRIDSPGK 

KKTNIYDQLIFNKGAKSTKLRKNSLFNK 
WGQDKISTYKK 


8197 


22098 


A 


8257 


403 


85 


PFCPWFPPVFIPPPLRKSGTPPGPGPEF 
PGPGFSNGGNPRPLKKSPKFSPGWGRGP 
LFPPSRGV*AGNFLGPGRWRGPWAKFVP 
LPSCLGHKARFFFPKKKRRKIK 


8198 


22099 


A 


8258 


249 


2 


YCEGDLGSYRPGVSKCMEFSPRRDIRIM 
PWPGAVCYAYNLSTLGGRDGWIA*A*EF 
GTSLAYIVKPEILYTHTHTHTHTHTMY 


8199 


22100 


A 


8259 


2 


189 


IQCVCVTKSHFCHPGWSAVAQSWLTAAS 
TS*A*VDFSHLSLLNSWD*RRVTPRLVN 

LILFPS 


8200 


22101 


A 


8260 


453 


128 


FFFFFFLIETVSHDVP*AGLELLSSSHP 
LASAFQSAEITGVSHHAQPSSVSWLIHS 
S 


8201 


22102 


A 


8261 


420 


79 


KWPTPFFFFFFLIETVSHDVP*AGLELL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Aspar2gine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SSSHPLASAFQSAEITGVSHHAQPSSVS 
WLIHSS 


8202 


22103 


A 


8262 




262 


NSRTERLGSWQSHFCHKEMEAHFWQSG 
VQWCDLGSLRPLPPGFK*FSCPILLSGW 
DYSQCRPCRTCSLIFMHSRSPSQPSEKK 
WTC 


8203 


22104 


A 


8263 


321 


100 


SATKMYLLSRQS FI GK*KNGLGAV;VHAY 
NSS TSRGLGGQIA* A*EFETS * GNiVMRP 
HCYRKYKNLARHSTYSPR 


8204 


22105 


A 


8264 


145 


275 


KPRDYSFFF* WKRDLRMLPRLASNS WTH 
AILPPWPPKVLRLQG 


8205 


22106 


A 


8265 


73 


351 


SLCYRSFARCDRPKKGQHFSKSYLQDAQ 
WARTLLSTIAQNIISAEHHLKTTDW*GAV 
AHACNPS TPQGRGGRIA*VQEIETSQGN 
TGRPHLYKK 


8206 


22107 


A 


8266 


382 


232 


LVETGFCHVGQAGLKLLTSGDPPaSaSQ 
SAGITGVSHCAWL*T*YSj*DFQ 


8207 


22108 


A 


8267 


397 


130 


IGQAGLKLLTSGDLPTSASQTAGITGMS 
HRAMPTFSLIAR*NASFQVGHFSPKyQV 
GTLMCSLKCHQWSDTLEFILCCRI lYE 
TLGKF 


8208 


22109 


A 


8268 


309 


97 


DTWYRTRRKDHCTI TETRIQSVRPLSPR 
*AITSRVTCTYTSR*PEMrEEPQKITFR 
HCDLFLPTLTDQLTL 


8209 


22110 


A 


8269 


2 


224 


DSSGIPGSPTRPPTRPKKQKLPCDPTVS 
LLGICIYDQRQ*SVFQRDICHSHMFITA 
LFTIVlIIWKQT*HPIiMDE 


8210 


22111 


A 


8270 


374 


2 


TSWESARKPPPIHSHTLKGFFFFLFMFF 
FF*NKICKDGSLTMLPRWLNSWAEGIL 
LPRSPKWGIQELATQREAS ILLCNPTY 
ACGIFQHVWErVKLLMKQCPFCPAAPTK 
TQNFFPRV 


8211 


22112 


A 


8271 


1 


379 


PTSASQVAGTTGTHHHIVLFFFFGIFEK 
KGVFPFSPGGF*TPKPKPPPGFGPKRGG 
KKGGGPPPPARGMGSGVFYPPPQKPFFV 
LIGGQILFKGRTLDFKKRAPPFPPFSLF 
LPKHTPSFMIKWDP 


8212 


22113 


A 


8272 


80 


401 


AWGMRFCLLLYSCQYLISDIAHFSKAFF 
FMKREPPPAPQVKGQGPNLS *WNPWLQG 
*RGFSGLTPKGGGNYGPPPPPPLIFFFL 
RKNGFFPGWPGWSKTPDLRKPPC 


8213 


22114 


A 


8273 


405 


95 


PPKRFWGSSKGFPQREKKLGSPTFPKKK 
KDFIYSYLSKSLKKE*INRPSAAAHTCN 
PSTLGGQGGRIT*GQRFKISLSNIGRPH 
LYEKIKNVKKLKIKKPNKL 


8214 


22115 


A 


8274 


167 


52 


DREQRPGAMAHTCNPSTLGG*GGWITRF 
LANMVKLRLY 


8215 


22116 


A 


8275 


276 


23 


KYVSSRPGTVAHSCKPSPLDGQGEWIA* 
VRKFKTILGNMMKPHLLFYKKLTFKNVK 
CIFKKYCATESRYVTKEKQKDRKKGTKE 


8216 


22117 


A 


8276 




114 


FTMIjARMVWIS*PRDPPASASQSAGTTG 
VSHHARPGLS 


8217 


22118 


A 


8277 


2 


87 


TMLLRLFrMS*AQAVIT,PWPPAVLGLQG 


8218 


22119 


A 


8278 


357 


347 


SEGLALYQLEAFTVCDLRERGFCRGRSL 
WPLITWAWRGGVFLLVQFQEVCCRIALG 
SLSPDPVLLPESGGPTSASQSAGIIGVR 
DRAQPNCVFEIGSYSVTQAGVQ*HNHGS 
L*P*P 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ingto first 
amino acid 
residue of 
peptide 


Predict- 

location 
correspon 

last^amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
li>=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=H!Stidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoplian, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8219 


22120 


A 


8279 


371 


125 


FFPKKNKPPG*PPPKK*RKPGKFILAPS 
PPFFGGGEFYKLKRGVLKQKPGPPGVPK 
ILKTPPRLGKNKPISPPPGQKKQNFF 


8220 


22121 


A 


8280 


377 


63 


PPVFPPPWGGQGGGFP*ARGLRTPWPTR 
ENPVFF*NPKN* PGGGGPF* IPPPWPG* 
ALKFPLPPRGGVPLTKNIPPPPRFSHKR 
RVFFPKNLKKKKNPCLIESL 


8221 


22122 


A 


8281 


2 


284 


LPK*VKNFVHAKICT*MFVAALFVITSS 
WKQPRYYSSIEKWINKLWHISAMEYYSS 
LKR* ELSSHEKT* KNFKCGSLSEKSQSE 
KDTWCLIPVI 


8222 


22123 


A 


8282 


318 


40 


APRLSLWFMWFIKRLDSSPGSRITWT 
NCFENHLCVCVCDRFLLCQAREQWRVLG 
SPQSPSPKFKRFSCLSLPSSWDSRCKTP 
*PANFCGLC 


8223 


22124 


A 


8283 


30 


343 


cyqkashateksfmr*r™pcsklcccy 
kilpqppqpsavitlisqqnkkkkkkkk 
kkkkkkkkkkkkkdgggalkkkki fs pr 
ggri fffffwakknnlgggl 


8224 


22125 


A 


8284 


163 


392 


FWLYNFSVIYSISDPCWYKEQLCKSVF 
SFLSS*SLLDEPNPNSPANSQAAQLYQE 
NKREYEKRVSAIVSQSWjSDS 


8225 


22126 


A 


8285 


406 


63 


kkkli lakpgwnldslkrappffffff 
rhgvplccpgwsktpepk*ssclslwss 
wdyrreplclayspvlnithlsspnvkii 

CHDSQESYDPIIMMMVII ITEGGMICGV 

IM 


8226 


22127 


A 


8286 


1 


322 


TNTLLALLLFIITF*LPQLNGYVEKSTP 
YECGICPI SPARVPFS IKFFLGAITFLL 
FDLEIALLLPLP*ALQTTNLPLIGMSAL 

LL 1 1 1 LALRLAYE * LLKGLD * AE 


"8227" 


"22128 


A 


82S7 


395 




HGSLHPGTP3L0QFSRLSLPSSWGYRHV 
PPCLANF* IFCRGGVSLCCLGSVEGLEC 
QARLPVLVRAELGLD 


8228 


22129 


A 


8288 


3 


246 


SSTLSFP3G*DYRHVPPCPANFFFFWIL 
EKTPFCFVPQGGFNLWA*TSPLNPPPWA 
SKGAKITGGTPGMGLINIiANGRPP 


8229 


22130 


A 


8289 


138 


421 


LLSSVFFFFERDFCFVPQAGGQWGNLG* 
WNPWFRGLKNFPGLTPPRSWNNGGGPPP 
PLI FCFFLKKKRVS PRGRGRANFLDLGT 
PPPGPPKERE 


8230 


22131 


A 


8290 


141 


375 


RTFFFFFKTEGYFVAQftKGQGGDHGSLQ 
PQSPGLKGSSLUIRWDYGSMPPCPSIFF 
F*IF*KNGASLCGPNNSKTGG 


8231 


22132 


A 


8291 


407 


209 


IGGPPGFSPPPVFKNRPPNLIFGPPKKK 
KNFPPPGGKNWFF*KGPPPFFFFFFFFF 
FFFFFYYTL 


8232 


22133 


A 


8292 


86 


433 


PPXPNKLIEXGQAF*KRHFFPPPRGRVE 
FFFFGPQKKTPAPGQNPGGGGKPWGSPL 
FSPGEKNXPPTHPXGGPVLGAPPPPPGl 
VLI *KKTGLPLFVGRAGTNPNLGGPPAP 
PPKG 


8233 


22134 


A 


8293 


147 


342 


LVKFEYTVLPKNRYRLGWAHAYNSSSL 
GGQGGRIT*A*EFKTSLGNILRSHLYYK 
KRKEKKLN 


8234 


22135 


A 


8294 


6 


152 


CCGHAPPHPANFYSL*RRSLiAMLPRLVS 
NSWTQAI LLP* PLKVLALQV 


8235 


22136 


A 


8295 


18 


205 


IPTRPPTRP rPGRLNHFTFTATRPGVYY 
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SEQEO 
NO: of 
nudeotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=C)'steine, D=Aspartie Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














GQCSEI*(3ANHSFMPIVLELIPLKIFEI 
GPVPTL 


8236 


22137 


A 


8296 


1 


199 


YLYIYLSILLSIYLSIYLSIYLASYLAT 
*LAI *KEIYYEBYVLVMTEAEIFHILPS 
PSWRPRKViXlV 


8237 


22138 




8297 




318 


PTRPP'I'RJjLSRPIJCIQIALHKFSLCPEY 
LQ * ADYLF* SLWIRPDAWPHACKLS IL 
GG * GGR I A*AQKFK ISLGNIVRPCLYCI 
IKNKIKKKKPLQIIAFFLCFKD 


8238 


22139 




8298 


416 


304 


GFLHVGQAGLKLLISGHLPA*RSQNVGI 

TGMSHHARL 


8239 


22140 


A 


8299 


352 


31 


EMILKAAREKK* I PYNEPLI CLTVDFLV 
ETLQARREWYDI FKVLKKKNKTKNPFYT 
KIEYLVKLSPKYEKVTKTLLDP*Q*LRD 

FINTRHANRNTSNHKKKPIIKHK 


8240 


22141 


A 


8300 


382 




KWDF V P KL VSN S * AQAILH VGLPKCW 

DYRSEPPELB. 


8241 


22142 




8301 


2 


273 


PI "RGPSSASDALSSGGPYHPSECCFTY 
TTYKIPRQRIMDYYETNSQCSKPGIV*V 
VHTHHTGGRGSQQGLLEGSREWWWNGDP 


8242 


22143 


A 


8302 


77 


430 


GMGLSMHRTHFQYLAHIHPCVHVHTHTH 
THTHTHTHTHTLCM*HTEIHKSVRYGTE 

HAQDPFSIFSPYTPMCACTHTHTHTHTH 
THTHTHSVYVTHKVCKETCLSFFYPESG 
TISHAQEILKRCLENKTHLGVWCFTSLL 
SVANIVPFSYKRQVAAARGGLPPSLSQA 
E 


8243 


22144 




8303 


137 


396 


SSVKTMNYLKPVVHMVADRC*LFQCLPS 














GGIFFFFGAPKKISPGRV*KKKKKKKKK 
KKKKGGPLKKKKNFNPGGGGNFFFFWGP 
KKNIAGARLKKPGGGKNPRSP 


8244 


22145 


A 


8304 


329 


144 


NGPPFPFFFF*DGVSLCCPGWTAVARSR 
LTFAVFPTLNCSWGIHSELQHWFQDYL 
KLKTF 


8245 


22146 


A 


8305 


404 


157 


RTYNPSTLRGRSGQIT*GQDFDTSLGDR 
VGPHLRKRKHNYYSLVECFYRIKEKYPE 
SHKKAMKMLFQLKPDFLHIFQHIATN 


8246 


22147 


A 


83 06 


394 


294 


SLCFLTPPRDFNLGALIKNFYSPTPGWV 
SCVLSKGPPFFFFFFFKDGVLLSCPH*S 
QNPFVFALEWSGTPELK*SSHPaLPKAW 
DYTYEPPHQAQVELLRY*RSFPVLLLDG 
ICIFHSFJSPRGSLIP 


8247 


22148 


A 


8307 


280 


2 


WDSHVTFFTSVITVMLPSIQYVKQLEN 
IKSWPGIVARACNPSALGGRDG*AQEFD 
SSPGNTAKPCLPQKmCTKQKTKNKVGGT 
LETRSWRL 


8248 


22149 


A 


8308 


136 


330 


MSYQRKSVNKYKSNlsrACEEVTTWARHSG 
S CLRLYNPSTLVGSGGKIP * TQEFETSL 
SNIARPYLY 


8249 


22150 


A 


8309 


3 


118 


IFWLVQMGFFIHVGQASriEri*PQVIHPP 
QPPICVLGLPA 


8250 


22151 


A 


8310 


2 


366 


TRVDPRVRVRSTHRNLCLPGSSHASTSA 
S*VAGTTSACHHaRLIFSFSFFFLGFSE 
KTGFNFFAQGGFEPLGEREPPPFGPPGF 
WGGGGGGTFKRPRRWKFQKDIIPPLVSN 
RGDQPEPFF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
corrcspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Meth ionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possib]e 
nucleotide insertion 


8251 


22152 


A 


8311 


253 


446 


SKTFQARGGVTEMGFRHVGQTGLEILTS 
GDP PASACQflAGITGVSHHaRLRI * YFI 

RIFILMTC 


8252 


22153 


A 


8312 


430 


88 


GVGRIQISWSPWLPKPKNSPGSPPQ*VG 
APGGPPPPGPYFFFFFQKKGFPLFSGRV 
*GSGSKNFFVFPPPKNWGTSPPPPPEGR 

QV 


8253 


22154 


A 


8313 


400 


323 


HRLMPPHPIDFFLFFVETGSCYVAQGAC 
ELLGSGNLPGSAS *NARITDWHHAQP* 
KSSSGPGTQQC 


8254 


22155 


A 


8314 




421 


GGRIARTQEAKVAVSQQCTTALQPGRHS 
KTPSQKTKKKK3PGGPPGPPPGRFFFFF 
KT*KKGPFLKKPPQRGGDPRGKKKRGGG 
LWKKPGGKLGGPTKKKTRGGGAKTP P PLW 
GKGGGPKGAPRAFPFTKKKPPPSGRNQ 


8253 


22156 


A 


8315 


213 


32 


RCPGTVGHGCHPSPLGGQRGRIP*GQEF 
ETSLANMVKPYFFLNSKKKTKNHPTKKN 

SQAW 


8256 


22157 


A 


8316 


422 


153 


AATLYTHKI'PHCiyFFLDRNLLCCPewS 
QTPGLK*AFCLSLPSSWDYRHKPPRPVT 
SPIFRIKSYDVIPPPYLE*NPMMLYLPS 


8257 


22158 


A 


8317 


1 


284 


ERQDWESRLEAMECAFHLEKSVNQSLLE 
LHQLAMEKGDPQLCDFLESHFLNQQVKA 
IKKLGDYLSNLCKT*APEAGLAEYLFDK 
LTLGGSEEDT 


8258 


22159 


A 


8318 


187 


37 


TRDFFWPGWAHA*NCSTLGGQGRRIA* 
AQKFESNLGHRVSHCLYKKFKBT 


8259 


22160 


A 


8319 


2 


250 


KYEQSLQ3VWDCVKRPMLRIIGVPEEEE 
KSKSLENIFGGI lEENFPGLARDLDlQI 
REAQRTP3KSIAKESSSRHIVIRLSK* E 
VWDCVKRPNLRIIGVPEEEEKSKSLENI 
FGGI I EENFPGLARDLDIQIREAQRTPE 
KSIAKRSSSRHIVIRLSK 


8260 


22161 




S320 


371 


195 


NNFSILFFF?ETRSH*VAQAGLVLLGSS 
NPPCSASRVAGVTG'/HQRYQQDDFKNFR 
HN 


8261 


22162 


A 


8321 


3 


235 


FISVSFKYSKHIPNHMHLKFLVLFLNFF 
FRDGFWLRSG* SWAILRCGHSSLQPLT 
GLKQS SHLS FQGS WDYRHAEH 


8262 


22163 


A 


8322 


1 


395 


LEVPLL!!ITSLLLASGVSIT*AHHRLIES 
NRNQ I IQALLITI LLGLYFTLLQASEYF 
ESPFTISDGIYGSTFFVATGFHGLHGl I 

A*YWHFVDED*LFLYVSIY 


8263 


22164 


A 


8323 


322 


83 


MHQSFLVAKVKKGKHGRWFYSCLMIKSA 
PVLSVTHNTLLCWPGAWHACMPSTLGG 
QGRWMA*AQEFETSIJ3NVGKPHP 


8264 


22165 




8324 


12 


249 


GFFLPPPPGRGEKRGPPPPPLFFWFFFK 
KGEFPL3GGNFLTLEPPRLAPPKGGK*G 
GDPPPPGFFFFLKKKPKPLLKG 


8265 


22166 


A 


8325 


139 


335 


ENILWAaTWIiDCHPSTLGG*GGRIT*G 
QEFKTMLflNIERPHLFFFFFLKRKFFFA 
PRVEGRGPV 


8266 


22167 


A 


8326 




246 


FRRGLAVAPRVLL C Y PGWRAMVQS * LTA 
ASNSWAQAIHLSLPKCWNYKHTPPYPTC 
*NSACWDESSFPLLVSAMEVCGQPHS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
List amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8267 


22168 


A 


8327 


422 


107 


lYYVRQQRKKEGLFFPFFRFPLPPGGGQ 
WGHFWFLQPPPRGV*RFFFLNLPRNRPW 
DFKGLPPPPAYFAFFLKKGVFPPGQGDF 
*ILAPKGPPTSTSQKGWGFKD 


8268 


22169 


A 


8328 


1 


161 


FRGIRWVCHVLFFYLSFFLRQGLAVLPR 
LILNSWAQATLLPQLPE*LGLQTSS 


8269 


22170 


A 


8329 


422 


103 


PPYSPPFGGPGGGVPKVRESGPPGPPQG 
NPFFFKKPKNNPGGGGGPLIPPSSGG*A 
KGMGLSPRPRLPLS*NWPPPPPPGGQNK 
LPFPKKKKKERKKEKFNFEHRG 


8270 


22171 


A 


8330 


46 


422 


STPFPLASQSAGITGVSHRARRSSSFQC 
TQIPLFPETLPWLGFGSRHGRGFSSQVL 
PAQSSSFDLDLISVLGY*MLSHISLGPN 
RALGFIPPHLSKWENGSTLRSIVLSQIE 
GLSSTLSFPHPVS 


8271 


22172 


A 


8331 


429 


112 


FSHLGSSPRQGFTMLARLWNS * P CDPP 
TSAS QSAG ITGVSHHAQPHAVF Y I RL PD 
SQVLCHQMPYWPTAHYPHRWIFSLVLQK 
WLSVPPPLWPIQSQPKASILLN 


8272 


22173 


A 


8332 


291 


19 


KTPFPPKI*KFPENFLKFTPGCGGPPPP 
FLKKFVSPGNPPPKKIFFFFFPPPPPPF 
FFFFFLRRGSLISIAQDGIQWHYHGSLQ 
PQLPGLR 


8273 


22174 


A 


8333 


307 


123 


KYFRGGGGGGR* FPPLGGVKQKKRGNPG 
GGGSKKPKLGPSPPTRGKKKNFFSPKKK 
KKKKK 


8274 


22175 


A 


8334 


3 


169" 


HASADMCHQJ^LLLLLLVETGLCHDGRA 
GFGS*SQBIHPPRPPKVLGLQHFGLIY 


8275 


22176 


A 


8335 


1 


208 


LCVCVCVCVCVCVCVLFC*WRKGLaLSP 
RGECSGTNMflHCSLNFSGSSNPSVSPSH 
VAETTGVRPILG 


8276 


22177 


A 


8336 




352 


ILFYFIFYYFSFCFEMGSRSVSQAGVQC 
PDLSSLQPLPPRFKRFFCLTFPSSRDYR 
CVPPHLaKFCIIRRR*AAIGQAGIELLT 
LSDPPASAS* SAEMTGVSHCSGPSFINF 
U!IWQI 


8277 


22178 


A 


8337 


420 


272 


KflPCQFSGEKTVFPKHCWNSWPPfeKiCQ 
QQKBasrLCPCLVLYTSINSQ*TMGLNIKA 
RTVNIS*KYKEDP*TGKK*1NKSCPWTG 
KKKKFLARTQKTIAIWDTLINWTSPKLK 
MSTKQRHHLKMNR* PKWLLFFVS 


8278 


22179 


A 


8338 


427 


202 


LI lYFCHQSVQKKGVLPLLQEREGWAEA 
SLKISNISLSHTHTHTHACAHTHTHTHT 
HTSS *NGIKTWITIKSFA 


8279 


22180 


A 


8339 


2 


164 


TPGHGISLCRPGWSAVAPSRLTATSASW 
VQTILLPWPPE*LGLQESRLLRDHLC 


8280 


22181 


A 


8340 


422 


1 


PPGPPGGKTKRAPPPSFFKGPNPLLKGK 
GPGAKPPPKNGVFFFPPKKVKGPQKNFG 
GTQISQPNKRGA*kFTPFFi:r*KPPQKKK 
GPPPGEKIFKGPKF*KFPSPGFFFFFLK 

QGLTLSPRLERSGANTAHCSLKLLSSSD 


"8281 


22182 


A 


8341 


1 


456 


RTRGAVSQGHAIALQPGQQSETPSQKKK 
KKNPAPGGKGKIGGTAKKKGRNFLDANI 
LGPNSRPGGAFFGG*TAPPGKKRGSPGP 
IFGGGKGGPKKKILSF*KAGGGAKFFPS 
PVGTPFFRGF*KGKILLKKKKFSKPGGG 
NPENLSPTNFFF 


8282 


22183 


A 


8342 


407 


2 


RQFILGAPGVFPPPGF*KPPPKIIFWGP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspoiui 

residue of 
sequence 


Predict- 

otide 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaitiine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKNFFPPPRGEKFFSFKGAPPLFFFFF 
FFFFFFSPFFFGALFFFFFTILFKTKFG 
VFSCKGIPPKFFKESRPPNGAGRLPLLG 
VGPLPGVPQGGEAERKISQGSFQ 


8283 


22184 


A 


8343 


441 


1 


PPPYILGGPRVFSPPPFFKTPPQIFFLG 
PQKKKNSPPPPGKKIFFF*BAPPPFFFF 
FFFFFFFPPFFFGALFFFFFTILFKTKF 
WISWKGIQPKLFKRSHPPNGGGRLPLL 
GGGPPPGVPRGGEAERKFSRGSFQRTGG 
SAPEFRA 


8284 


22185 


A 


8344 


466 


323 


FGVAGTQNPGRNPRVPSFRGGPPPGVPQ 
GRNPPPPWFFPLNPPI'fflFLTP*VFPFLE 
PPIEWVGPGPP*PSYGFSSPVAEK 


8285 


22186 


A 


8345 


657 


347 


L F FW FFLRQS FTLVAQAGVQ * CDLGS LQ 
PPPPGFK*FSRLNLPSSLDDHCPAPCPA 
NFVLLVETGFHHVGQAGLELLTSGDP PA 
SASQSAGITGVSISAQPAQ 


82S6 


22187 


A 


8346 


3 


212 


DCSRGPPPHLANFGLLEKTGSNHVDQAG 
LKPLSLRDPPASAS*SPGITGGSHRAQF 
*FIFKRGMAPKGF 


8287 


22188 


A 


8347 


9 


255 


IPGPTHPSDFFLYEIDLPFKN*KGFPPP 
PRGGGKNPPFPPPPFFFS*PPWKKKITF 
QTP LTGVFGAPKNFNFKGGFKK IGF 


8288 


22189 


A 


8348 


79 


312 


NflAGFKVGISKA*ALETLQI ITREGHSK 
KPRYAVTFESHKMGTVLHLVEQEHTVGY 
ITASCSALKC*IESGGFLNDY1 


8289 


22190 


A 


8349 


1 


369 


FFFFLKRDRVSPLSSRVKYKWHDYSSLQ 
LQTPGLQ* SSCLRFLSSQDYEHIPPCPT 
NF*KCFVEMGSCCVTQGWSQTPGAQAVL 
PPWASQRHWGLQGVRPLCSLLLSNCPHS 
HLVWSWLFSGC 


8290 


22191 


A 


8350 


434 


324 


LPILSPVWWHMPWPGTREAKa*EQ*SL , 
EPGRRRLQ 


8291, 


22192 


A 


8351 


3 


253 


YMCSKFFFLECSGAISAHCHIiELPGSSD 
PPPPT*NF*VIGTTGTCHHEWLLFELFG 
KMKPHYIAQVGFKLLERNFLKTKKVFP 


8292 


22193 


A 


8352 


3 


1431 


StRG<3lPSLK<jVYSSLmSRj'SC!HLF* 

EKALQRAREEEEKRKEITSHFQSTLTDI 
QGQIEQQSERNMKLCQENTELAEKLKSI 
IDQYELREEHLDKl FKHRELQQKLVDAK 
LEQAQEMMKEAEERHKREKEYLLNQftl^ 
WKLQAK7LKEQETVLQAQLTLYSGRFEE 
FQSTLTKSNEVFATFKQEMDKTTKKMKK 
LEKDTATWKARFENOnCaLLDMIEEKAL 
HAKEYECFVMKIGRLENLCSiALQEERWE 
LHKKIRDAEISEKDDQSQHNSDEEPESN 
VSVDQEIDAEEVNSVQTAVKNLATAFMI 
IHHPESTPHQSKETQPEIGSSQESADAA 
LKEPEQPPLIPSRDSBSPLPPLTPQAEA 
EGGSDAEPPSKASNSPAGLGAETQCEGL 
PVGAQADQASWKPEAEASGQAPQAPTEA 
SLQKMEADVPAPACAaEEHVAAMVPACE 
PSEQPPRAAAEELPVGASAGPQPHNVA 


8293 


22194 


A 


8353 


376 


65 


IHCGDHVQEVPKLKVCALHMSSWACSQI 
PKAGDKILTFDQLTLDTLKGCGTILLSG 
PHKGQEVYWHFSKALGTQHSHTKPCVHS 
RDQKFKHIRG*WASQGYKN 


8294 


22195 


A 


8354 


459 


85 


'LEEVESLERVLEKKMIESEELKSKQTRF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 


M 
eth 
od 


SEQID 
NO: in 
USSN 

09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lVl=Methionine, 
N=Asparagine, F=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LEE IKNQDKLNKSLKEEAMLQKQSCEEL 
KSDLNTKNELLKQKTI ELTR&CQKQYEL 
E*ELAFYKIDAKFEPLNYYPSEYAEIDK 
APDESPYIGKSRY 


8295 


22196 


A 


8355 


231 


158 


PFFIFYLYQSKDIARGLERGLEPEKVIG 
ATDSCGDLMFLMKW*ESVAPMTFSGSRP 
LSKPRAISLLWYK 


8296 


22197 


A 


8356 




164 


IHRGVENQMAGEQGWDKDVSVYWVGAVA 
HACNPNTLGGQGRWIT*GQEFETWLP 


8297 


22198 


A 


8357 


23 


291 


CTHTPJiQAHAHTHHLFYYFLYI FNFFEE 
TGSH*IVQACLELPASSDPPASFSQSAG 
ITGMSHHAWPTSCILKVKMAI.CSIWCSA 
ACFLY 


8298 


22199 


A 


8358 


1 


307 


FFFFEIESCSFAQDGVQWCDLGSLQPPP 
PGFKQFSCL.SLLSSWDYRHVPPRLANFC 
IFSRDGFHHVGQAGLQLLTSWDPPT*AS 
QSAGITGVSPRAQPCPAF 


8299 


22200 


A 


8359 


75 


266 


FVKTENLFLTVLEAGKSNI KAEAG* VSG 
EEHGLCLQDGALLPLEGRMAVSSHPRGQ 
NMIMRAKC 


8300 


22201 


A 


8360 


1 


295 


NTWVFCFVLLFRDRVSLCCSVRSTVASS 
*LIVMMNLSFPSS *AYRHVSSHLDFFFF 
FKIQRILILPRLISNFWPQEILLWLPQG 
LGLQPRPQVLVGNS 


8301 


22202 


A 


8361 


376 


293 


MGFHHVSQAGLKLLTSGDLPASASQSAR 
ITGAGHHAQPDF*HSGDSWQAPPEPRDQ 

G 


8302 


22203 


A 


S3 62 


375 


~Tv 


PQEEFLRFATDVGEYRAVTELERPVSES 
WNTQKDLL * QKRGRVSTYCRIITYGVGES 
F SVQRRGEHVGGRGLGPCELE I 


8303 


22204 


A 


8363 


36 


363 


QLSENPLIGKKVKLRTVFIGKQFLFLLG 
GGVREAGAHSGRQATVRWHNLSSHAKRP 
SCLSLPSCDHRHLLPHPANFCIFMTD* I 
PPCRSGWVQTPELKQFTHLGLPKR 


8304 


22205 


A 


8364 


455 


235 


QAC5AElAPPRGIIFFFFW*RKiaFCMLPRP 
VSKFWG*WPLPWPPKRVGLDSWAPPPR 
DFIFFSFFVKKRVCILW 


8305 


22206 


A 


8365 


432 


175 


STDQVSVAMGPS* PGAGYNLLVCRWLRP 
LKKCSIRMGVS*FSRYa£WLPLDKKGN 
SLTPCMSRVRRCTVMIiRLMLRGLHPLSD 
KP 


8306 


22207 


A 


8366 


383 


83 


RRLIVFICYFSKK*GLPLLPRLVSNFWP 
QVILVPWPPKVLGL*ALKKLWLDMVTHa 
YNPSTLGGQGTRIT 


8307 


22208 


A 


8367 


27 


304 


FFGFILTLKKSSN* CTLRPCFLS IGKVR 
IHTLGNILSRRGYGERCSLPHCLWESIC 

LP VQSN*ALHKKVQMH IPFDQAIPSRGI 
LPTDTGHK 


8308 


22209 


A 


8368 


165 


204 


QNIPIAVSFFFFFLKRVSFCPPG*RAGG 
QSLFIGALTFQV*G1LIPYPPGNWELRG 
PPPlSILTNFF*FIiNLRFLLGPINLHYVHD 
KTSPLLYLFFFFF 


8309 


22210 


A 


8369 


400 


201 


KERPLPFRL IGS P FQGTWPRFFSKGEVK 
NSNFSKY*PPPFINPAPRVCFEGIEVIT 
KFFQVRPFFP 


8310 


22211 


A 


8370 


21 


408 


LI PLSTYVCVRVCECVCVCVCWI*GYA 
CNTLLIACNILQDVPFTSPTKDKIFSGD 
FLTLVGKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 
beginning 

location 
correspond 

peptide 
sequence 


Predict- 

iocation 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, P=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=^possible 
nucleotide insertion 


8311 


22212 


A 


8371 


1 


94 


NTLGHEN*LNSGFGGCSEPRSCHCTPTW 

ATE 


8312 


22213 


A 


8372 


546 


972 


PIHSLYGR* PSFCIiGIiALFKTLQVMHFli 
KDSVENLKSSNLKVGLNFRSPQIQATIS 
. PPLQPKTKTYEEDPKSKPKKEEKNMEVir 
MKEW 1 LRYAEQQNEEEKNENS KSLEEEE 
KFDPNERYLHLAAKLLDAKEQAATFKLE 
KK 


8313 


22214 


A 


8373 


502 


178 


QHFWRLRQKDSFALGG* KQPGQKGEN PF 
FQKKAKNILG*WDQPWPTTQKGGGGES 
L * PGRTKS P PCTSGWGKKQNPVLKKKKN 
PQKTNCI PLC I YHI FLINRKTNTV 


8314 


22215 


A 


8374 


362 


112 


YRMSVNPQVRAKRVSQRTNTPPNFFF*D 
EVLLCRPG*STWQS QLTVASI SWAKAI 
LLPPQPRQAKYFLRLVKCSSEKGERIE 


8315 


22216 


A 


8375 


411 


21 


VLG lYPTLVSPSPGLKMTRVKLPSGS KK 
VISSSNRPVFGWAAGG* FDKP ILKAGR 
ASS KYKAKRNCWPRVRGVAMNP VEHP FG 
GGILQRIGKPSAIRIYAPAGRKVGLIAV 
RRS GRLRGANTVQEKEN 


8316 


22217 


A 


8376 


450 


108 


AEGGRGPRK**LRSCRPVDGSAGQSPGP 
LRTESLETPGCSPQGWQELEPPPRRPQ 
WWPGPPGHRDRRQHRRPGARSRQQGPES 
SH*QRSGESRPPAVPRPPCPLPWKKGVL 


8317 


22218 


A 


8377 


25 


306 


EQQPPPVAVHQSGPPGAPPEASPSHPGS 
SEKPQPVPPPRAPLVPQPKP * LSPPHHP 
AAGLLAVHALAGGPGTVQRGLPGRAARA 
VSVRVPAAHC 


8318 


22219 


A 


8378 


376 


149 


PAHCS LLIPGFRGSSCLSLP CNWYPGRA 
PPRVGNFFFFF*EMESCyVAMLPSLVLT 
SWALAILPVLPPKLPGLQA 


8319 


22220 


A 


8379 


98 


374 


SVPPPVGTYEGDAAPTGGGGLPAIiGR*G 
AAGGHVQQGHPGPHRGHAAPKPSAQGGP 
GHSRECLEPSGGAAGGEREPPRAGGGQS 
WMRAPSMT 


8320 


22221 


A 


8380 


2 


807 


RPQCCVRAPRACAVSGVPKLWS VP* SGR 
SAPHPTRS*VRSWSRWLRTQSWQTSRTR 
RRSESKVRVSYQAWTASPPPAASASARP 
A*RTSSRSYSSSSTCCSWLWPSSWSTGP 
SQTFVRWSSTLSCLCLTRKWIAMMPQVL 
PCTPVRPSCSAVSTI TRSFLL *QALASR 
VT* IAPPRGSTTRTPSPIRL*NLP*IiSR 
GPGK*KSGSWSSSSSA*TRWRTSAPLI 
TSSSLLSRSSCKAQTG*ASCRPVRVPIP 
A6SSLGASAPGSRCHW 


8321 


22222 


A 


8381 


104 


327 


CIAFPEMLGIKYEVGKCPSGLKSLKILK 














GRICK*CVCMRKTDSTCVS 














G*LEPRSIiRHEMVCM 


8323 


22224 


A 


8383 


178 


588 


ADLLSGAPGPPGGSQDNftNSSEHSRaCA 
PAAPGP*WLQPWLC3?*WTQSPLEARGS* 
NPGVSHSSRIPGIPGAAGTSRPSGALPP 
QPAGHGGSSGELHVARGCACGRPLWSHR 
PGCGQLLRVTAADSHRCASRGAAPM 


8324 


22225 


A 


8384 


423 


78 


QGQELRELTGVTVESPEEPQRLLVS*GG 
TVPCrACSI.HRTI.*TVBHGSASSFLAQG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
Ingto first 

residue of 


Predlct- 

otide 

correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=lInknown, >'=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DMEPLGDRGLQGTEQLH*GQDRASLQDI 
CFLLPVLSLLCPRSLDLHRHRVGVTGFT 


8325 


22226 


A 


8385 


434 


176 


FADDSRFDFLLLV*C2yMLMLIREQLLEG 
HFTVNMRLMQDYPITDVCQILQKAKELQ 
DSK*PGGKRPMFGREASRPCAVAPGTHR 

NL 


8326 


22227 


A 


8386 


2 


305 


RLWEDPPGRNTPPGAR*PGPCGE6GHI I 
REEQACGWPRGHSGPWRHPQAGPLQLGQ 
WRGKGAGQGARRS PQGSLESRPPTLSAA 
PPAIHICSVGGVRLHCY 


8327 


22228 




8387 


331 


91 


EMRSHSVTQAGVQWRDYGSLQTRPPGLS 
FNVKWDWVGL*GDPSSPPMLLPSS*ARP 
GGGWGPGGEGHVSKEAAGRLQGPP 


8328 


22229 


A 


8388 


2 


250 


IHVRLWSQLFRRLRWEDLLSTGGGSCSE 
P*SCHCTVTWAFFLESCLRCAHWYVTL 
LTALLGSLPCANPRQTAVSLPQAQLV 


8329 


22230 


A 


8389 


388 


176 


YWDSQ*CMPSFAVPVESACICAIGHIJFS 
KSVNSVIAVCVDGS SHKYVFTHDGRCHR 
EAFDVYRDICDDDA 


8330 


22231 


A 


8390 


445 


330 


PPRFTPFSCLSLRSSWDYR*PPPRPDNF 
LSLYSVLHET 


8331 


22232 


A 


8391 


559 


142 


LRTLPVTLGKSLALSVPSFPYVGGNWIT 
NREPSAKGRGSGPNRGPGHAASCPARTH 
SLRHSPAGGVMLTQPRPILILPETQHPS 
ESAVARTDISKARRWLHSRSPWPH*DET 
SAGAERPGIRCYIGHPAT 


8332 


22233 


A 


8392 


104 


400 


YYEIFKKEMYSYYHILHFFFFFSFFFFE 
TKPNSVNGPEGKGPILG*LKPWLLG*KH 
FCLTLPSGGNYRHVLPPPVNGFFFFFFF 
KKKKVFPFVAPRGV 


8333 


22234 


A 


8393 


408 


169 


DQVWIKDWNIGSIiRPR*KGPQTI ILTTP 
TAVKIEGIPAWIQHSQVKPAAPETWEVR 
PSLDNPCKVTLKKMTS PAPVTLHS 


8334 


22235. 


A 


8394 


1 


269 


YCCPLFSSKALTQENSPYSSFRLVNPPG 

LSLHPEGEGGKWINERGRELGPSAGPLL 
T ,F LH F A3 AGRRQ PP DWAD SEADLQQ VRH 
KLGQP 


8335 


22236 


A 


8395 


~2 


204 


KDCKVNKEVERVLREFHQAGKPIGCDAS 
TSLPAPAPWWSWNAftLCCPGLSCVWCWQ 
GESRTCVGEHQ 


8336 


22237 


A 


8396 


3 


285 


KUTVKYDRKELRKRUSrLEEWILEQLTRL 
YDCQEEEIPELEIDVDELLDMESDDARA 
ARVKELLVDCYKPTEAFISGIiIiDKIRGM 
QKLSTPQBOC 


8337 


22238 


A 


8397 




192 


ETGFHHAGQDGLELLTSGDCPS LASQS A 
RITGVSHCTRliQLLTEQEYFKNQNHLPS 
APFYKRAF 


8338 


22239 


A 


8398 


8 


241 


LNPFLYAFLDPRFRQR.CTSMLCCGQSRS 
SGTSHSIRGEKSASYSSGHSQGPGPNMG 
KGGEQMLEKS I P YS QETLWD 


8339 


22240 


A 


8399 


2 


240 


SCLNPFLYAFLEPRFRQACTSMLCCGQS 
RSAGTLHSSRGEKSASYSSGHSQGPGPiSr 
MGKGGEQMHEKSIPYSQETLWD 


8340 


22241 


A 


8400 


10 


272 


CNWQNAYLTWRCAQDCEDYFAERLYRS 
MKGAGTDEETLIRI I VTRAEVDLLGIKP 
KEHQKYQKSLSDMVRSDTCGDFRKLLVA 
LLH 



1061 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

peptide 
sequence 


Predict- 

location 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteiiie, D=Aspartic Acid, 
E^GIutamic Acid, F=^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, Xi=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














TRGIFFFFLNTRSHSVAQAKGLGHDHSS 
LKPQPPRLKQSFNLSLLSTWNYGGLLPH 
SALWKAEAAKS 














TVEAS VS YLIi YVAMVMQLPWGKS PASGA 
ESHRQGCGGSWPGGGMEEGRGAERGGGC 
LLPQALLy 


8343 


22244 


A 


8404 


3 


235 


LTLLAHCRMMAAAHMGGPPMMPMMGPNT 
PGMMPVGPAPGMRPPLGGHMPMIPGYPM 
MRSPARLMMVPSQPRMTRPDR 


8344 


22245 


A 


8405 


3 


246 


LVFSVDVGLLAHTRQRGLKRSSGMSSLL 
GKIGAKKQKMSTLEKSKLDWESFKEEEG 
IGEELAIHNRGKEGWIKGAQPLGQ 


8345 


22246 


A 


8406 


1739 


804 


WEPDVSGSAAWLASAAFGMATFSGPAGP 
ILSLNPQEDVEFQKEVAQVRKRITQRKK 
QEQLTPG WYVRHLPNLLDETQ I FSYFS 
QFG TVTRFRLS RS KRTGNS KGYAFVE FE 
SEDVAKIVAETMNKYLFGERLLECHFMP 
PEJCVHKELFKDWNIPFKQPSYPSVKRYiSI 
RNRTLTQKLRMEERFKKKERLTjRKKLAK 
KGIDYDFPSLILQKTESISKTNRQTSTK 
GQVLRKKKKIWSGTLDTPEKTVDSQGPT 
PVCTPTFLERRKSQVAELWDDDKDDEIV 
FKQPISCVKKEIQETQTPTHSHKKERRS 


8346 


22247 


A 


8407 


2 


241 


CLRFPSSPAMGLLRSGTKLIFRRRPKQK 
EAGLSQSHDDLSNATATPSVRMKAGSFS 
RRLIItRFYFKSKPKANGNPSPQL 


8347 


22248 


A 


8408 


285 


186 


KQGLTSLPRLVSNSWAQAFLLLWPPKVL 


8348 


22249 


A 


8409 


1 


246 


RHENREELQVIADLCIKYDTLCISDEEP 
MWGVEYIWKGPCFCLTTFTLYLKGDIFP 
RPPSGCLNLWIVLNPMYPVIHLITSC 


8349 


22250 


A 


8410 


1 


102 


AGS YTLH 1 1 KGDDGTRGVTGRFTFTLHR 
IQTSEE 




22251 




8411 


877 


647 


FFFETESHSVTQAGVKWGDLGSLQSLAP 
GPKKFSCLNLPSSRDYRSPPPPLANFCT 


8351 


22252 


A 


8412 


1 


130 


RMLNDKTLRTDIGGNFPKNGWAAIATHS 
FEFAQFDNFLEEATR 


8352 


22253 


A 


8413 


97 


401 


PGRPRCPALPGLEWSNPPGPPTSGYLFP 


8353 


22254 


A 


8414 


348 


516 


VSAYGFITEGHERFSDHYYDTSWKRLIF 
YINHDFKLEREVWKRLHDEGI IRLYC2RL 


8354 


22255 




8415 


714 


410 


LGVFRSAUIGSLWLLLRSFPQKSPNPIiA 

CIASGIPOTSHVWVHDSGHANQLQNYEII 
YLLPAGYSLEEQRILDW 














PKDSY 


8356 


22257 


A 


8417 




309 


FYFEaSTALSKSNKPIHTIILISfBHVHLVGD 
DAACIAYIRLTQYMDGSGMPKTMQSEET 
RVWHRRDGKWQNVHFHRSSSPTVPIKPP 
CIPNGKBNFSGGTSLWC2NI 


8357 


22258 


A 


8418 


3 


242 


ARAIiTNAaSHVDDMPNALSAtNDLHaHK 
LRLDPVNLKLLSHCLLVTLVDHLPDEFT 
PVEHACLDNVLASVSTVIiTFKYR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJn known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8358 


22259 


A 


8419 


711 


296 


NVANSDGLIASLWKEYGKADARWVYFDP 
TIVSVEILTVALDGSLALFLIYAIVKEK 
YYRHFLQITLCVCELYGCWMTFLPEWLT 
RSPNLNTSNWLYCWLYLFFFNGVWVLIP 
GLLLWQSWLELKKMHQKETSSVKKFQ 


8339 


22260 


A 


8420 


3 


97 


F WS S LCNGL I AAQLIiFYWNAKPPHKQK 

KAQ 


8360 


22261 


A 


8421 


1 


279 


GCGIADLAMSAIFISrFQSLLTDILLLICT 
CAYIPSLAPNLLDRNKTGLMIFWKCAT 
IVERKSPYVaVCCILMAFSILLIQKLVK 

MPQCICHNI 


8361 


22262 


A 


8422 


1 


228 


RHEVFIELNHIKKCNTVRGVFVLEEFVP 
E I KE WS HKYKTPMAHE I CYS VL CLFS Y 
VAAVHSSEEDLRTPPRPUSS 


8362 










182 


S EDTGEEQ WTAEF I NRGE YE I D I AGYR 
FQAKAKLYPVASLFTQKRRKDDMELSDL 


8363 


22264 


A 


8424 




70 


VQFVFDAVTDVIIKWNLKDCGIiF 


8364 


22265 


A 


8425 




151 


ETTASSCTPASLESRRCCAPCRMPRTGF 
FGSSPLWRPSGSRSLKPGFQQ 


8365 


22266 


A 


8426 


1 


187 


RGRVGPGGERLVPGVPGfiEAQQPAGDGV 
RAGPLQARPPAPVGVSQGRCX5A2M3AAfta 
PPRPDG 


8366 


22267 




8427 


520 


423 


GSHQ 


8367 


22268 


A 


8428 


1 


627 


GTSGTRGVTGYFTFTLYLETPKPS I SS S 
NLNPREAMETVILTCDPETPDTSYQWWM 
NQQSLPMTHRFQLSETNRTLFLFGVTKY 
TAGPYECEIRNSGSASRSDPVTLNLLHG 
PDLPRIHPSYTNYRSGDNLYLSCFANSN 
PPAQYSIiJTINGKFQQSGQNLFIPQIXTK 
HSGLYVCSVRNSATGEESSTSLTVKVSA 
STRIGLLPLLNPT 


8368 


22269 


A 


8429 


3 


390 


ILGCNILRVEYSLVI CVSVPGSKKVILD 
LPLVIGSRSGLSSRTSSMASRTSSEMSW 
VDLNIPDTPEAPPCYMDVIPEDHRLESP 
TTPLLDDMDGSQDSPIFMYAPEFKFMPP 
PTYTEVDPCIUSnSHVQ 


8369 


22270 


A 


8430 


3 


208 


NGTHVIILLCLKTCSTWHVAnDMIVAS 
NLGTGVPNQTPVSSGDLLIRSNGLLIPG 
TCEITRLYTISE 


8370 


22271 


A 


8431 


3 


563 


LPTSRVDPRVRLDRMKKDQEEEEDQGPP 
CPRLSRELPEWEPEDLQDSLDRWYSTP 
FSYPELPDSCQPYGSCFYSLEEEHVGFS 
LDVDEIEKYQEGEEDQKPPCPRLKEVLM 
EAEEPEVLQDSLDRCYSTTSTYFQLHAS 
FQQYESAFYSFEEQDVSLALDVDNRFFT 
LTVIHHHLAFQMGVIFPH 


8371 


22272 


A 


S432 


2 


119 


DGDNILVTTFIYIKSVTELNGDIITNAS 
WRCSERVAPSA 


8372 


22273 


A 


8433 


192 


!285 


AGVLSIIEETDSEGLQTKWENQTYDER 
LEINDSEEVASIYTPTPRHQGLPRSAHL 
PNKAMADNSSDECEEENNKEKKKTSQLT 
PQRGFSENEDDDDDDDDSSETDSDSDDD 
DEEHGAPLEGAYDPADYEHLPVSAE IKE 
LFQVNQLRCVPTDVLDHKLKPFIPDFIP 
AVGDIDAFLKVPRPDGKPDNLGLLVLDE 
PSTKQSDPTVLSLWLTENSKQHNITQHM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
■ng to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KVKSLEDAEKNPKAIDTWIESISELHRS 
KPPATVHYTRPMPDIDTLMQEWSPEFEE 
LLGKVSLPTAEIDCSLAEYIDMICAILD 
IPVYKSRIQSLHLLFSLYSEFKNSQHFK 
ALAEGK KftFT PS SNS T SQAGDMETLTFS 


8373 


22274 


A 


8434 


3 


251 


TLQADHFNTRLSCGDAAQTLWARTGYLG 
FVRRTELTAATGERHDALYWGALDETL 
ELRGLRYHPIDIETSVSRIHRSIAEW 


"8374 


22275 


A 


8435 


1 


225 


QTFNLEGSQIYEDSIVLQSVFKSARQKI 
AIEEESEDESNEEEDEDDEEYHEWKRYD 
RLGENMCLNC IWNGGGYIV 


8375 


22275 


A 


8435 


95 


340 


CGCGIAGLAMSAIFNFQSLLTVILLLIC 
TCAYIRSLAPSLLDRNKIGLLGIFWKCA 
RIGERKS P YVAVCCI VMAFS I LFI Q 


S3 75 


22277 


A 


8437 


1 


317 


GPKPLVRTSREPGKDVTTSGYSSVSTAC 
PTSSVDGGLGALPQPTSVLSLDSDSHTQ 
PCHHQARKS CLQMRPPSPPE STVSQQQV 
KRINLCIHSEEEDMNLGLVRL 


8377 


22278 


A 


8438 


80 


213 


PHLSFNAGITTIKVNIRKANSLGGGFHC 
WTCDARRRGTLQSYLD 


8378 


22279 


A 


8439 


3 


767 


IIEDNIKQLKEMKFTYLINYIQDEINTIF 
NDYIPYVFKXLKENLCLNLHKFNEFIQN 
ELQEASQELQQIHQYIMALREEYFDPSI 
VGWTVKYYELEEKIVSLIKNLLVALKDF 
HSEYIVSASNFTSQLSSQVEQFLHRNIQ 
EYLSILTDPDGKGNEKIAELSATAQEII 
KSQAIATKKIISDYHQQPRYKLQDFSDQ 
LSDYYEKFIAESKRLIDLS IQNYHTFLI 
YITELLKKLQSTTVMNPYMKIAPGELTI 
IL 


8379 


22280 


A 


8440 


103 


354 


NGCECDFLFLFLFYFFETESRSVAQflGV 
QWHCLGSPQPSSPRFKRFSGLSTLGGRG 
GWIMMSGVQDQPGQHGETPFLMQflGLKT 


8380 


22281 


A 


8441 


3 


160 


KLYPLKIVFGMNGRVWVKGKTVQQTLIL 
ANVLEACELMTLDQRIIjMFILAES 


8381 


22282 


A 


8442 


459 


3 


CGGIilPVRASWLLCLPKCjaWAMAGAPPP 
AWLPPCSLISDCCaSNQQDSVGVGPSEP 
GAGYNLLVHRFLSLSEKRSIRVRVTRFS 
RCHLSPLSLTRKGNSLTPCTSRVRQCLA 
LLRLAHGALHPLSCAHCLaLPSEMTPVP 
QMENAEITRLL 


8382 


22283 


A 


8443 


2 


318 


RQGNMTAALQATLLNPPINTKSQAVNDR 
ARGIVLWVLISFKAiroiEKAVQSLDKNG 
VDLLMNYMYKGFESPSDMSNAMLLQWHE 
KALAAEGVTSIVRVSTALIPA 


8383 


22284 


A 


8444 


2 


109 


VTPFPMSCDLQGDCaCRNPQAQEHSRKD 
LRGYSHG 


8384 


22285 


A 


8445 




69 


LVLNSWPQVIRRPWP PKWGLQV 


8385 


22286 


A 


8446 


1 


163 


PSEKHNIWVGVTQFSRCRLSPLSLTRKG 
NSLI PCASRVRQCLTLLRLVHGACTH 


8386 


22287 


A 


8447 


1 


147 


GQAGLELLMSGDRPTSASQSAGITGVSH 
RARPSSISFILELRGSVRKKF 


8387 


22288 


A 


8448 


914 


729 


GLTMDTQKDVQPPKQQPMIYICGECHTE 
NEIKSEDPIRCRSCGYRrMYKKRTKHLV 
VFDAR 


8388 


22289 


A 


8449 


134 


392 


TCPPPPEPPSPLTCAVIiVPQTRRWRGLG 
SL FRHAC CVALP LQLLLLL FLLLLFLL P 
IREEDRS CTLftNNFARSFTLMLRYNGP P 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PlienyIalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide Insertion 














PT 


8389 


22290 


A 


8450 


37 


248 


AKITPLLQPGRQEQDSISNKKILRCCLT 
MLPRLASDSSGFKRSSHFSLPTSWDYRH 
SPPPLASVPFLLMC 


8390 


22291 




8451 


367 


165 


MFFVFLVKTGFCHAGQARLQLLASCDLP 
VLALQSAGITGVSHCARPISTLKDTIKT 
SHFVPLKISGK 


8391 


22292 




8452 


2 


282 


GKRMAGGPEMMQLRLDGNRLYITTSMFS 
AWDKHLYPDL IREGSJMiHVDVDTALGG 
LKLNPYFLVDFWKEPLGPSLAHELRYPG 

GDCSSDIWI 


8392 


22293 


A 


8453 




163 


HHDFHAQSLLANIERIAILVFAACEAAA 
VLS LLVSI SNTYGLDYUHNLKLLQC 


8393 


22294 


A 


8454 


374 


197 


TPEPKQSSHLGLSKCWDYRHEPPHQARN 
STFDSNHSSLAAEDKLLGYFGLSQWQPG 
YQP 


8394 


22293 


A 


8455 


343 


161 


MGMGPSVPGTGYOTiLVOlLLRPLEKCSI 
RVGVSQFSKYHLSRLPLPRKGNSPTPCA 

FWVR 


8395 


22296 




8456 


1 


193 


MGFLUVGQAGLKLPTSGDPPTSASQSAG 
ITGMSKRVQPYYVLSICQTLHSVCICLL 

IILLTTVL 


8396 


22297 


A 


8457 


342 


263 


WLRP WPALWEAEAGGSLELRS SRPA 


8397 


22298 




8458 


101 


303 


KAAKYIiFQRLHFCFIiNLKKDNMLLFNHR 
MENFKELFGPGTVAHTCaSTPSTLGGRGGM 
I TRS ADRDHPG 


8398 


22299 


A 


8459 


3 


140 


YLKHLCMYVCKYLROGLLMLPRLVLNSW 
AQRILLPWLPKVLGLQV 


8399 


22300 


A 


8460 


327 


248 


FPRLVLNSWPKaiLMPWLCKVLGLCja 


8400 


22301 




8461 


327 


79 


S IP F ILAFFI YLS FFFKETGSQS VTHaG 
VPGI IIAQCGLQLLGSGDPPDSASLWG 
TTGTYPTP ILSVFPHMPSFFILLCMLF 


8401 


22302 


A 


8462 


365 


273 


MGFQHVGQVGLRWPQVICLPWPLKVLGL 
QA 


8402 


22303 




8463 


375 


217 


QAGLKLLTSDDLPASASQSftGITGVSHH 
AQPRHHFFKTRFIYIFFSSPSSVLS 


8403 


22304 


A 


8464 


238 


354 


RGKCFVQAFLYKECSGGQARWLTPVIPE 
LWEAEVGRWIT 


8404 


22305 




8465 


367 


248 


FQMRIHKWLIDLHSPSEIKQITSISIEP 
GVEVEVTIADI 


8405 


22306 




8466 


341 


153 


NMILGWAQWVTPVI PAIGEflHWGGLLYP 
RSS KPAWATQQNLLFTKKKKWI YI YI Y I 
YIYERY 


8406 


22307 




8467 


368 


101 


VEMGFRHVGQAGLELLTSSDPPASVSQS 
AGSSTSLPTADLLTPGWVGVYGLTOKWP 
VTLIAMQRLPAVLEKKSVARKMSENKIV 
LKRGQ 


8407 


22308 


A 


8468 


362 


248 


HVGQAGLAIiLASGDPPTSASQSARIAGV 
SHRARPEAP L 


8408 


22309 


A 


8469 


384 


196 


SARLGLP KFWD YRREP P CPADDML I LIW 
GILETFSLIFTATMESRNYCVYFBCMRNK 
HUSKTS 


8409 


22310 


A 


8470 


386 


300 


LAMLPRLVPNSWSQWLSPLPPKMLGLQ 

V 


8410 


22311 


A 


8471 


380 


178 


FFFFFFFFFFFFFFFFFFFFFFMYI YI F 
AIVYQTLACNNKFLQMERVTRKLRQC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 

correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Ij=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unl<:nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8411 


22312 


A 


8472 


3 


154 


FLVDMGIHHVSQAGLKLLTSGDLPASAS 
QS?IFFFLFFFFEKQILAQAGG 


8412 


22313 




8473 


3 


162 


GLYHVGQAGLKPLTSGDPLASASQSVRI 
TGVSHRTRPKI I FICL IFFLFFLVA 


8413 


22314 


A 


8474 


266 


107 


LCLFLIEMRFPTWQAGVKLLSSSHLPS 
SASQSDRITGVSHCFWPEVFKRGIF 


8414 


22315 


A 


8475 


361 


,69 


RLPGPSNPPISASLAAGITGAEHHAQLI 
FVPLVEMGLCQAGKHNFEKYSRVLIYS P 
IVSFTHFI 


8413 


22316 


A 


8476 


I 


154 


KKLI PTLMDHGAGFKS S VKE ITTAWEM 
AREVELEVEPEDGNELLQSRGKT 


8416 


22317 


A 


8477 


3 


193 


SHYQSLISSNHGHKHCGRPQGPLPRKTI 
DLCSLDYQLTFPPLLTHDPVKSPSVRNT 
QELSLIP 


8417 


22318 


A 


8478 


408 


103 


TGSRLTLSSRVECSGVITAHCNLELPGS 
SDPLASASQVAETTGMHHHVWLIFLGIF 
LVKMASHYVAQDDLKLLGSTDPPVSASQ 
S VGI TG I S HHAWAT I C YK 


8418 


22319 


A 


8479 


59 


166 


WVNTCLCLHTHTHTHTHTHTHTHNYFKE 
TPIHQYI 


8419 


22320 


A 


8480 


394 


253 


FLLLLLESGFCHVAQAGLKLLSSCDLPA 
LASQSAEISGVSHCAQPLA 


8420 


22321 


A 


8481 


379 


184 


CFSLPKCWDFRGVP ISPGDFWNFI IKLG 
VCPFCQGGFELLAPNDVAASSFQNAGIS 
GINFWAWAL 


8421 


22322 


A 


8482 


367 


170 


CQLSSGLGGRSGRESERKKEREREKDRE 














3GPHPYSLI 


S422 


22323 


A 


8483 


324 


136 


QRSSCLSLPSSTOYRRMPPHLANFCIFC 
RDGVLPCCPSWSQMTTSFLSLNFLICLV 
GIILGRG 


8423 


22324 




8484 




57 


GERRYGTCIYQGRLWAFCC 


8424 


22325 




8485 


35 


339 


GWLLRFFWWQWKNGNNIjNVSLIDGWLSY 
DTLGYNGIUSTCHMYILTEGDSQKKKKKK 
KKKKKKKKKKKIDPAGNSKIYNRKIFKT 
PRGFEKWALGPLLILFF 


8425 


22326 


A 


8486 


330 


66 


RPGVSRFNPWPLIFFLRRVFLCRPGWNA 
WQSLLTKISASQVQAIFLGGGGCSEPR 
SCHCTPAWATTAIOjCFKKKNGKNFTDIM 
FSK 


8426 


22327 


A 


8487 


3 


209 


CLSLLRSWDAQAPPPHPANFFLMFWFCR 
VGVLIiCCPGLASSDPPPLVSQSARITGM 
SHCTWTIFLSFK 


8427 


22328 


A 


8488 


3 


216 


GQSITVSCTATNSDVGSSNLVSWYQSSN 
LVSWYQQHPGQAPKLIIYEDNKRLSSEK 
KKKKKKKKKKKKKK 


8428 


22329 


A 


8489 


326 


62 


PAYFSIGSFFLFLFFFKTECHSVTRLEC 
SGAISAHCKLRFPGSSNSPASGGGGRSE 
PKSHHFTLAWTTRAKLHLNKNKIKTVKE 


8429 


22330 


A 


8490 


2 


156 


S SAG IGRTGCFIATRIGCQQLKARGEVD 
ILG IVCQLRLDRWCGCKQGQQR 


8430 


22331 


A 


8491 


399 


254 


KNNKENKT I NNKI TNKLI I KLKNI S FL Y 
FFFFFFYFFFFFFFFFFFFF 


8431 


22332 


A 


8492 


2 


130 


FRRVGQAGLELLTSGDLPASASQGAGIT 
GVSHRAQPTFLKSC 


8432 


22333 


A 


8493 


3 


94 


HLPAEFTPAVHASLDKFIiASVSTVLTSK 
YH 
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SEQID 
NO: of 
nucleotide 
sequence 



SEQ ID 
NO: of 
peptide 
sequence 



nucleotide 

location 

correspond 



residue of 
sequence 



location 
correspon 
ding to 

acid 

residue of 
sequence 



Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide i 



KSGFSNVGQAGLKILTSSDLPVSASQSA 
G ITGMSHCTWPLYP YFGLGSRS FHS I EE 
HCWYQLAPFPSCENPFVIVPFFVHRFYL 



8434 
8435 



GIRRGEILEVIEFTSNEEMLCRDPKGKY 
GYVPRTALLPLETEVYDDVDFCDPLENQ 
PLPLGR 



8436 
8437 
■8438" 
8439 



LTILVLAMWKVGFFKRNRPPLEEDDEEG 



22339 
22340- 



8499 
8500- 



PSSQDYMI-IAPPIILANFLNKFFVKWCLAM 
LPRLMSNWPQTILLSPPKALGLHV 
KLMSFGPTLLSARCVSTGQIGMNELNLN 
lUftMSVAATINNElMSFARTWMKLKAIT 
LIILTLEHKTKYCMCSLISGSLRMTTQG 
ETGSHCVDQGGVQWHISAHCSLDLLGSS 
PAS AS R WG ITS A-!HPI J.WS I S SLPP 
LLLHPHNLLNLLLLLLPSLSIVLNLHIL 
PPLLPPL3LLCSI 



NNSAKKKKKKKKKKKKKKKKKKKI 
AAF 



8444 
8445 



RIFFFFETVGFVAQDGVQGHGAILQPRL 



DWHIAYVLLYGPRRVEIMEEESEQ 



TRFCCCMLFWWFCVLCVCVAVFPPWCI 
LFFIWLFFFFFFF 
CAPLPPLKFF FPPRGFIFRGGGSPIFPP 
PKJCGVLPKNPPKGFFPPPPFLKPAPGGA 
FNYPPFGGKIFFS PPEGKWAPPGGFLRG 
APPFFFFFFFFFFLFF 



SFQHVAQAGLKLLGSRDPYTLAAQSAGI 
TGVSHHflRPVLYFSSVLFSMEFLSTMSA 
VILPCICSFCVSLAVPSSCILQQPLPPT 
CTYIKTETCI 



TPPVSSLFFLPLKEIIFFCPPF2QSLPP 
PFFFFPPPLFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFKYLSLPFTL 
SKEKKK 



QSLSESPQTVYFIYLIIYAVFFTTLTII 
LIFSPSFRMELKFLLRCYFFAPFYAFLF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFSQPGAQTTAGHAGV 



MCCYIMSLLKVICTIIKEFSTKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKIKK 
KGGVPL 



QAQLGDIGTSCYTKSGMILCRNDYIRLF 
GNSGAAGACGQSI PASELVMRAQGNVYH 
LKCFTCSTCRNRLVPGDRFHYINGSLFC 
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Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 

peptide 
sequence 


Predict- 

nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=]soleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EHDRPTALINGHIjMSLQSNPLLPDQKVC 


8453 


22354 


A 


8514 


99 


262 


MNKRFLLNKKKKKKKKKKKKKKKKKKGG 
PFKKHPGGAQIFPGGEKKIFFLKGGL 


8454 


22355 


A 


8515 


85 


2 


GWGWAQWLTPVIPVLWEAEAGGSRGQE 


8455 


22356 


A 


8516 


244 


423 


VFAVFNFLGWVMDMLINLAWI ILQCAC 
ISNHHWHLEYIQFLFANYTSIKLGKKP 

SKSV 


8456 


22357 


A 


8517 


3 


114 


KINGEDKQKILDKCNEIINWLDKNQVCV 
FFFFFFPI 


8457 


22358 


A 


8518 


413 


91 


GGWGKISFRGGGFLKKKKKKLLKKKKK 
WGGGPKE KKKRGGF FF FFLLNF FFWGGV 
KKKKKKKKKKKKKKKKKKKKKKKKKERK 
GRKEGREGRRKEGEGRQE INMTA 


8458 


22359 


A 


8519 


1 


163 


SRLPGRSRRRSRGRRRRRRRRRRRRRRR 
SRVLIFPWDQHLDLNIVyilEDVDAA 


8459 


22360 


A 


8520 


191 


21 


RFTTHVDAWATVTVFLIESVFLHVGQAC 
LELLPSDDLPASASENARVTGVSHRAWL 


8460 


22361 


A 


8521 


2 


144 


AAATSQAWWCMPWPATWEAEVGfiSLKP 
GRLQLPLTMIALVHLSrail 


8461 


22362 


A 


8522 


90 




KPTPFLPTLIARTTLLLPISPLILIIL 


8462 


22363 


A 


8523 


305 


146 


VFRIHSCGIRGSVDVKITDGLLVIERlE 
NVPPGPHNKNKNPYAl FQSS S I ESQ 


8463 


22364 


A 


8524 


176 


2 


LAWPPLFLPKCWDYRHEPLQPflSLTTFV 
REWDKIGWAWWLTPVIPALWEAEVGRSQ 
GQ 


8464 


22365 


A 


8525 


2 


200 


CLIPSSWDYRHMPPRPANFCIFFFFLGD 
HVSQVGLKLLTSKDPLAWTSQRAGITGF 
SHHTKPQGFF 


8465 


22366 


A 


8526 


2 


217 


S FSLLFPPSFSLLLPPLRSLLLPPLPS F 
SLLFPPSFSLLSPPSFSLLLPPSFSILL 
HPTFSHLYPPSPSFl 


8466 


22367 


A 


8527 


2 


186 


1 RGLRPLEKRGIRVGVSRFSRYHLSRLS 
FARKGNS PTPCRSRVRRCPSLLHGLHPL 

SDKPQ 


8467 


22368 


A 


8528 


3 


153 


CLGFLSGWDCRCIPPCLMTLKKLFVEMG 
SLFWPQWILPPWPLKVLGLQV 


8468 


22369 


A 


8529 


2 


388 


RTAVHTFILVLQVCVCRWHLSLAALTPA 
HLVCSSHRILSYLSIRGSWDHRCMTTTP 

PASAYQSAEITGVSHWAWPTDSLWQKPH 
MVIWCISCFLNVLIGTM 


8469 


22370 


A 


8530 


293 


64 


YRISIKVSKWFKGWVSSPWIFFFFVFFF 
S FFFVFFCFFMFFLKQYLKFI INIl I IL 
TQALYICTTSKICYSFLYFR 


8470 


22371 


A 


8531 


2 


250 


S CLGLPS SWDYRGTLPCLANFCI FNHDG 
VSLCCQGWSQTPGSQVIDEAVYSIYAYY 
ILALLIVYVAQWEVWLKHFCCILKYT 


8471 


22372 


A 


8532 


2 


201 


FHHVGQTGLELLSSGDLPASASQCAGIT 
GVSHCTWPSILYRMNFKNLSAVFIAWVI 
EGNYTWKTCI 


8472 


22373 


A 


8533 


319 


200 


TFTFFLQRWGLTMLPRLVSNSWAQAICL 
LQPPKVLELWA 


8473 


22374 


A 


8534 


160 


297 


KTETLALVAHACNPSTLGGIGRQVSSAH 
EFKTILGMMTKPILYTJOi 


8474 


22375 


A 


8535 


333 


228 


PMRGHYIAIVKSHDFWLLFDDDIVEVSS 
FLISYF 


8475 i 22376 




8536 - " 


3 


325 


PPSSUiPCSWILDCCASNERDSVGVGPS 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EPGAGYNLLVCCFLSLLEKRSIRVGVTR 
FSRCRLSPLSLTRKGNSLTPCASQWRC 
LALLRLAHSALHPLSCSYCLALP 


8476 


22377 


A 


8537 


216 


5 


KLGFLVNLVEFQVKFFFLFFFLEIILIE 
MRSHYVAQAGLELLGSGDLPASASQSAE 
ITGMSHHAL.PWYSF 


8477 


22378 


A 


8538 


136 


22 


RPRRRLVLNPWPQAILLPWPPKVLGSQV 
WATCGRRPAS 


8478 


22379 


A 


8539 


136 


22 


RPRRELVL11PWPQAILLPWPPKVLGSQV 
WATCGRREAS 


8479 


22380 


A 


8540 




319 


YIYMSMYMI.IYVCIYVSIYICVCIYINS 
ISPNTYMIiyMLiyvyVYMCVYICVCVY 
lYTNSISFNTHITPIKHSDVHTVSTTRC 
NLGGRGFCHTWPLPHLAGW 


8480 


22381 


A 


8541 


331 


156 


SVNGLLSLPACliPVCLPACLPSFLPSFL 
PSPLSFFLLLLLLLQKPVNSLIFSQS LS 
IS 


8481 


22382 


A 


8542 


72 




KIGQARWLAPVI PJiLWEAEAGGSP 


84S2 


22383 


A 


8543 


2 


83 


QAGVQWRDLGSLQALPPGFMPFSYLGG 


8483 


22384 


A 


8544 


293 


2 


KTVWHYLVNAPEVE IQAIYSQETCKS MY 
QETCTRIFIAILFIIAPNWKQRQDTMNP 
GNKENGKVATLDSDGFLVRFQSLMRSNC 
PLYPFNKFPILLK 


8484 


22385 


A 


8545 


314 


122 


LVETGFLHVGRAGLELPTWSDPPTSASQ 
GAGIAAVSHRGQPVDFKNNISTQIQGRP 
IICNYKTF 


8485 


22386 


A 


8546 


316 


123 


LVKTGFLNVGRAGLEIPTWSDPPTWASQ 
GftGIAWSHRGQPVDFKNS ISTQIQGRP 
IICJIYKTF 


8486 


22387 


A 


8547 


3 


247 


GIMATERLRNYTGGIYAEYQDTTYINHV 
VSVAGWGISDGTEYWIVEHSWGEPWVRC" 
FHFLKVIPRKKTSWYSSRLNGSVYV 


8487 


22388 


A 


8548 


289 


25 


QRREKFFKUGPPIFFRGPFGFPQIFVSF 
PFFFPKIGVFFVFFFRFSLVLKGDFFSN 














RPCL 


8488 


22389 


A 


8549 


289 


2 


KKKIGETLCKLKLCKKFLDVTPKAQF IK 
QKKEKLINWQFIKIKNFRSPKALLRRGK 
GKPGTGRKRNANHISDKGPLCRIQKKNS 
RGRAQWLTLW 


8489 


22390 


A 


8550 


290 


187 






QDSINK 


8490 


22391 


A 


8551 


293 


183 


RDEGRGISYMLPRLVLNSWPQAILLPQP 

PKVLKLQA 


8491 


22392 


A 


8552 


364 


87 


MLPLDFFLGPSLDFCPPFCESGSGGIKV 
PESTTPRASAFLPPGAANI.HHII.QLVGA 
PRVPPGFHHVGQDGLDLVDLVIRLPRP P 
KVLGLQGG 


8492 


22393 


A 


8553 


57 


225 


KWQKWTPWAGGVAHACNLS TLGGRGGKI 
ACAEEFKTILENIVRPCLSKNKWISFS 


8493 


22394 


A 


8554 


326 


2 


KSKRGYSEPLVFS ICSARAPRSHSVSQA 
TVYGHAHASTQPFPPHLNRSSYLSLLSS 
GDYRHDPPPLAYICISVIGNDVEHLFLC 
LLS TC I S S VNESHVYS TLVP I FGP 


8494 


22395 


A 


8555 


372 


148 


YICFLRSSVRQILFTPDCSDCSDaWVPCaSf 
IRLPGSTHWCASASQGAGTTGARHHARL 
IFFVWFFCIFSIFSRENH 


8495 


22396 


A 


8556 


422 


269 


AADQERIiHTYQLNYYHFCKRWGLTTFPR 
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NO: of 
nucleotide 
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SEQID 
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sequence 


M 
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od 


SEQID 
NO: in 
USSN 
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Predicted 
beginning 

location 

Ing to first 

peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
List amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienylalanine, 
G=Glycine, H=Histidine, I=lso!eucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LVSS SWVQAI LPTWPHKVLRLQA 


8496 


22397 


A 


8557 


669 


407 


AGVKWRNLSTLQPPPRRFKRFSHLSLPS 
SWDYRCPLPCPANYCIFLVEKEFRHVGK 
AGLELLTSGDPPAWVSLIAGITGVSHRA 


8497 


22398 


A 


8558 


195 


352 


LGLQAGTTASGYSTVI LGGMLKPRSWAW 
WPWPATREAEVGGSLEPGRSKLQ 


8498 


22399 




8DD9 


345 


219 


MLARMISISLPCDPPASASQSAGITGMS 
HHGWLNLFKSIHL 


8499 


22400 




8560 


137 


2 


VKYCLPTRSLIKTVRPLSPSQATASPVT 
CTYTPRWPEVTEESQKN 


8500 


22401 


A 


8561 


324 


193 


RSSGNLPASAAQSARITGVSHRTRPKCI 
I S KE YVAENNHL KS TS 


8501 


22402 


A 


8562 


326 


121 


HNHTHTHTKNHKKHQQPQHTHTYHTQTD 
lYVLQAASQKSLILISSLAV 


8502 


22403 


A 


8563 


3 


85 


HEHTHTHTHGHTHTHRHTHTHTLCARM 


8503 


22404 




8564 


2 


153 


ARDRHI PPCLTNFS VFWRDGGLAMLPRL 
VSKLWAQAVHPPWLLKVLRLQA 


8504 


22405 




8565 


2 


177 


ARVGFHYVGQAGLGRLTSGDPPASASQS 
AGITGMSHRARPMSSLTIfllLQDTWRAY 
AG 


8505 


22406 


A 


8566 


353 


212 


AGFRHVGRAGLELVTQDGPRTSACQSSG 
ITGMSHCaHPEKSTAFILF 


8506 


22407 


A 


8567 


76 


191 


ELIFCRDGGLTVFPKLISNTWPYAVLPL 
WPPKVLGLQV 


8507 


22408 


A 


8568 


1 


142 


GTRGFLHVGQAGLELPTSGDPPALASQG 
AQMTGVSHCASQKKEFLEW 


8508 


22409 


A 


8569 


250 


95 


SASHAPLPLPPSHLPLRAAGLSPLCPPR 
LVSSYRPHVILLPLPPKVIiGLQV 


8509 


22410 


A 


8570 


3 


226 


LVYSLCEHIWRFIVISHARVCCIYVWF 
FECGYADRRVADKWLWIH 


8510 


22411 


A 


8571 


1 


216 


GTRGLGVGVSHLVSKNTHTHTHTHTHTH 
THTHTATASQLSVFKIWAFQRSLDKGLT 
YISNSYCSSKFYLYHH 


8511 


22412 


A 


8572 


356 


268 


LTMLARLVLNFRPQVIPQPWPPKMLGLQ 
A 


8512 


22413 




8573 


12 


175 


RWGFHPVGQAGLELLTSSDLPTLASQKA 
EITGMSHCARPSFFSFFLSFFFWKHL 


8513 


22414 


A 


8574 


1 


163 


GTRTGFHHVGQADLELLTSGDPPVSASQ 
SVGITGVSHRAWPVGLCIIALISNEY 














EFLVETGFRHIGQAGFELLTLGDQSTSA 
SQNTEITGVSHPAPGLAIIFKSGNQAFF 


8515 


22416 


A 


8576 


2 


179 


ARG ILCHSGTFKLDSRPSMVAHTCNPST 
LGSQGGRIACGQE IDTSLLKVARPRLYG 


8516 


22417 


A 


8577 




100 


GTRGFTMLPRWSDSWAQVILLPWPPKV 

LELQM 


8517 


22418 




8578 


3 


114 


HERHEILSIYLTIYLSITfLSIYLSnfLI 
YLPTITRI 


8518 


22419 


A 


8579 


1 


264 


GTRPCRI ELYRWESLAKAQETSGEEIS 
KFYLPNCNKNGFYHSRQCETSMDGEAGL 
CWRVYPWNGKRIPGSPEITGDPNCHiyF 
NVQN 


8519 


22420 


A 


8580' 




245 


GTSGTSGILGFYLEREDRLLQIRTDTIT 
LSHGYSTFSUSIRCDSRHHMNRMFQMLyQ 
SPFLGYIQLSHEVLAFGISYIYNTE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspari:ic Acid, 
K=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8520 


22421 


A 


S5S1 


1531 


1457 


QARNETIARRLWDVSCDLLGLPID 


8521 


22422 


A 


8582 


191 


2 


TITSYFSPTPAPGSHyATFCFYKFDySR 
YFIKVELWEAETGGSRGQEIETILMTTV 
KPRLSRA 


8522 


22423 


A 


8584 


2 


212 


ARAG lyPKEYKS FYYKDTCMRMF lAGLF 
TIPKTWTQSKCPSWIDWIKKMYHQGPIL 
SPMLECSGEVSGFK 


8523 


22424 


A 


8585 


2 


192 


ARGNSPASSSRVAGITGAHHHARLIFVF 
FLFVRFFFSFLKTGFKLVAHSNFQPLTL 
FAPLHFK 


8524 


22425 


A 


8586 


277 


66 


PLDFRLGHKRRLPFPKKKKKESKFVCVC 
VCVCVCVCFKRQCIiVTLPRLVSNSWAQA 
ILLSWPPKILALQV 


8525 


22426 


A 


8587 


395 


286 


PHLGLP KCWDYRREPPCLAEWLKKYVW 
VTYFSFRR 


8526 


22427 


A 


8588 


307 


177 


RYPSTINYS ILNRDKIMPKLDRMVYKAR 
PRVMCLPWPPKVLGL 


8527 


22428 


A 


8589 


167 


349 


RNLRELGEFFFFFFWFFFFFFFFVGKFS 

VFLLVGNTjFLFFGGGIFGLSRVMCFKEF 

RCFS 


8528 


22429 


A 


8590 


375 


LLSRFSWFCFVFRQGLTMLPRLVLNCWA 
RGILLPWSPKYLGLQA 


8529 


22430 


A 


8591 


347 




etgfphvgqaglkllassdppasasqsa 
qitgmsrraqpsakrf 


8530 


22431 




8592 


340 


114 


HRTAHCSLYLSGSSDPATSSSQVTGTTG 
KDQHTWLLFKYFCRARVSKINKZKSINK 
FYFLKTQTTPSEYGKASPG 


8531 


22432 


A 


8593 


86 


253 


LRLCCGLSCSTAKKKKKKKKKKPKKKKI 
KKKNLHPKRGGFEErLFFFPKGQKNLF 


8532 


22433 


A 


8594 


3 


135 


GFCHVGQVGLELLTSGDPPALASQNAEI 
TGVSHRAWPPVLFFP 


8533 


22434 


A 


8595 


2 


274 


PRVRTISLLGILVYRSHLISSLLCLEGI 
ILSLFIIATLITLNTHSLLflNIVPIAIL 
VFAACEARVGLftLLVSISNTYGLDYVHISr 
LNLLQC 


8534 


22435 


A 


8596 


3 


316 


SIPYSWGEKEGIPCMaPPQIQGTSRLND 
FTALSLHLNTHTHTHTHTHTHEIQHLPP 
QGITALILNSLLRHYCPSLAFPSLSPKQ 
NLTVRHCILSRDTWIiATASK 


8535 


22436 




8597 


385 


291 


KAPPLFFFFFFFFFFFFFFFFFFFFKPV 
KKILGV 


8536 


22437 


A 


8598 


275 


369 


GVQVLKLLTSGDLPALASQSAEITGVSH 


8537 


22438 


A 


8599 


98 


344 


VHYEEFMCVCVCVCVCVCVCWPRVRGVD 
MNPVEHPFGGGTHQHIGKPCTIRKNASA 
GRKVRLMRALWTVRrGGTKTVTLRKT 


8538 


22439 


A 


8600 


328 


195 


THVPPSIiAKFVFLVKMGFLHIGQAGLDL 
S TSGDP PALASHVFKR 


8539 


22440 




8601 


2 


214 


HVDWMGTFAI LS ELHWDMIflVDPENliR 
LLD SVLNCELANHFGTDFT PPVQAAYQ K 
WADVANALAHMYH 


8540 


22441 


A 


8602 


89 


181 


GLRHENRLNSGGGGCSEPKLCHCTPAWA 
TE 


8541 


22442 


A 


8603 


340 


57 


KEGHQEMEQAEDGGHGEKAPGTS PWTLG 
ELPKSAEGHTGLFVSYRGCRTQFRFKKV 
LFGGDRlIHGrAMLPRI,Vi:jSrSWPQEII.LP 
QPSKVQGLQK 
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SEQ ID 
NO: of 

sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

Ctll 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 

location 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, 0=Aspartic Acid, 
E=Glutaraic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unlinown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8542 


22443 


A 


8604 


330 


132 


TGFLHVGQSGLKLLTSGDLPTSAFQNAG 
ITGGSHCTRPFSLFHSYLVLFYGCSSSQ 
VSLSILARM 


8543 


22444 


A 


8605 


546 


279 


RPGPTVSPRLECICMIMAHCSLDFPGSG 
VSPTSASRVARTTSTHHHA.WLIFYSSVE 
TGSHHVAQAGLELLGSSNPPISASQSAG 


8544 


22445 


A 


8606 


3 


249 


NRFNLGSGGCSEPRSHHCTPSWATEQET 
CITYLEYCFPPVSHTNLVYFKPLMVTIR 
SDYCILGLHASSIGGSDSLIIDCWD 


8545 


22446 


A 


8607 


159 


324 


VIQYRTWILHLHLCEVLVFWFLPYFFLK 
KTYTGIGQMRWLTPVIPALWEAEAGGS 


8546 


22447 


A 


8608 


334 


179 


TGFPHVGQSGLKLLTSGDLPTSGSQSAG 
ITGVSHCTWLEVI YFLNIS lYPS 


8547 


22448 


A 


8609 


278 


202 


QQPSTLRQDPPPAKRLHLSEGSDGH 


8548 


22449 


A 


8610 


236 


66 


AflSTFFIiQKIKRHLLSGTVAHACMPSTL 
GGRGIRISRGREFKTSLGNMVRPRVYKK 


8549 


22450 


A 


8611 


29 


177 


GKQVMALHCFIFFHFFFRRSLAMLPRLV 

SNSWVQEILPPWPLKVLELQA 


8550 










215 


H T S R 'I'L VHLRLKNFNKYL lEWGLSMLP 
RLVLNSWAQAVLPPWPSKVLGITGMSHC 
TWLYLHLSTDLLKLC 


8551 


^2452 


A 


8613 


3 


T, 


LRHGNCLDPGEGGCSEPRSCHCTPAWVT 
E 


8552 


22453 


A 


8614 


2 


167 


FTISLLGILVDRSLTEffllHNIKPSFTRE 
NTLMFIHBSPILSTRKQLAIQHLSRLE 


8553 


22454 


A 


8615 


2 


281 


FCILVETGFHCVAHftGLQLLSSGNPPAS 
ASQSVGITGMSHCTSPNLSLLTQSKSFC 
LL IQMHNPYAFINSQLTFS I LFLLS YFI 
FYDSLFLLF 


8554 


22455 


A 


8616 


282 


182 


GQAWWLTPVIPALWEAEVGGLLEPRSSR 
PAWST 


8555 


22456 


A 


8617 


379 


207 


SNQTTKKYINFFFVEMRSHYVAQAGLKF 
LGSSNPPTSASQSASITGISHYVWPISP 


8556 


22457 


A 


8618 


705 


483 


DKSFALVARAGVQWHHLGSLQPLPPGFK 
QFSYLSLPSSWDYRHAPPRPANFCILSR 
DGVSPCWPGWSRTPDLR 


8557 


22458 


A 


8619 


359 


220 


RQSLTMLPRLVSNSWAQAILPPWLSRLL 
GLQAIiAPIPGKSYWEKTP 


8558 


22459 


A 


8620 




186 


HEVSWVKRKQDEWI EFDDDTVS I VAPED 
ILRLSAGGDCHIAYVLLYGPRRVQILEE 
ESEQ 


8559 


22460 


A 


8621 


1 


116 


GTSFCRDGGLTVLAQLGSSFWPQRILPP 
WPPKWELQA 


8560 


22461 


A 


8622 


369 


188 


MYSLEMGFIGNGHAGLELLASGDLSALA 
SESAGITCMSLRYRTRFSFKRATCGLDL 
SLQS 


8561 


22462 




8623 


202 


22 


Y ILGFYFMP VI C YQERKKDKKWWE VTTI 
MSFVGI CMELGAI I LS KLMQEQKTKYHI 
FSQV 


8562 


22463 


A 


8624 


387 


315 


SNLGGGGCSELRLCHCTPAWATE 


8563 


22464 


A 


8625 


2 


135 


TRDLCSLVYLLTFPPLLSHDPAKSPSVR 
NTQELS I KKKKKGRPF 


8564 


22465 


A 


8626 


3 


232 


HASAFEEPVYIKSRQKRKESNPPKLVSS 
QPHGLKKKKKKKKKKKKKRGRGLKKKKK 
KHNKKFKKGQKKYITFGRGAF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucTe" 
otide 
location 
correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidline, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Voline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, t=possible 
nucleotide insertion 


8565 


22466 


A 


8627 


97 


373 


QHGFLKKKKKKKKKKKKKGGPLLKKPLG 
GPNFTGDGKKKFFPKKGGIKKPPGEFLK 
KNLFLGGKKNGKNPPKKLNPLGEKKNFK 
GKKGKNPP 


8566 


22467 


A 


8628 


2 


188 


RYGCPPPYHTFEEPVYIKSRQKRKEWNP 
PKLVS S Q PHGLHDFFKKKKKKGAAVLKD 

PSGGPS 


8567 


22468 


A 


8629 


287 


68 


GEEGRTQKGEOTGGGGSERKEGEEHEAR 
RTGRGEGGGGEAKGRPRSARRRGRGSRG 
SKSQTPSRHLPSHIITN 


8568 


22469 


A 


8630 


364 


136 


VSLLLGLEYSGTITAHYSLNFLLFIVET 
GSHYVAQTGLKLLGSSNPPTSDSQTFEI 
TGVSHGAQPKVQFWSKNLDT 


8569 


22470 


A 


8631 


355 


70 


LFPPRFFPFFSPLSPLKFFFSPKGFNFF 
REFFPIFSPPKKRVLSKNSPGGFYKPPL 
RGKIFXFPPPVKFGPPRGLFKGPPPFi'F 
FFLFFFFFLVI 


8570 


22471 


A 


8632 


2 


266 


AFTISLLGILVYRSHLISSLLCLEGIIL 
SLVliATVITLNTHSLLANIVPIAILVF 
AACEAAVGIiALLGS ISNTYGLDYGHNLN 
LLQC 


8571 


22472 


A 


8633 


1 


186 


LTHH INITIKPS FTRENTLMF IHLS PI LL 














PGGGQN 


8572 


22473 


A 


8634 


2 




VLEIAVALIC3AYVFTLLVSLYLRDNT 


8573 


22474 




8635 


713 


333 


EGPPPPRSKKKGTQGGGKDPPSPPEPKT 
PGNPQGRKTGPPPPGRPYTGPRPPGSGP 
TRGGGSRSSSSNTNAPGEKIFFSKNPGR 
KIFPPRAILVFFSPFPLKNFFFSLRLLI 
FLGGCAPFFPPPK 


8574 


22475 


A 


8636 


389 




FIFSFLFLIIFFFLNINFFFYIFSSLSF 
FFFLWFNLLLKHTPPIILFFFSLSPIFF 














FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFGTKKELY 
LLRARGCKQYKNQKLIWYLTF 


8575 


22476 


A 


8637 


2 


192 


VRTLGLRTDAIPGRLYQTTFTATRPGVY 
YGQWYEICGANHSFMPIVLELLALTIFE 
IGPEFTL 


8576 


22477 


A 


8638 


1 


87 


PTRPLEIAVALIQAYVFTLLVSVYLHDN 
T 


8577 


22478 


A 


8639 


2 


132 


INLASTLIIFTIIiILLTILEIAVALIQA 
YGYTLLVSLYUTONT 


8578 


22479 


A 


8640 


2 


113 


LFFIAEYTNII I INTLTTTIFLGTTYDA 
LSPELYTTY 


8579 


22480 


A 


8641 


375 


22 


FIFISPLAYLFFLPFPFFFIPPLCFFFF 
IPLFFLFLSFLFSLIFFLFSLILIFPPS 
LFFFFPPPFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFCWIjPHRVMIAKVGISWS kki 


8580 


22481 


A 


8642 


180 


34 


WFISDSLl'iJLGMVAHTCNPSTLVGQGGWI 
TCGRSRPSVPKRTPSQTCAPA 


8581 


22482 


A 


8643 


338 


133 


NLPLGLFYSLQKKKKKPFLRPGTLGAPK 
GNFQGAPPLFGKFAFFFFFFFFPFFFFF 
FFFYGFFFFLFK 


8582 


22483 


A 


8644 


156 


287 


LTYPIKPPLLCSQKKKKKKKKKKKKKKK 
KKKKKGGAFKKNPWGAQI 


8583 


22484 


A 


8645 




325 


LHHCHTHNFVNKWCFIRKKKKKGKKKK 
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SEQID 
NO: of 
nocleatide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

peptide 
sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A'^AIanine 
C=Cysteiiie, D^Aspartic Acid, 
E^GIutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histi(]ine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJnknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














KKREKKKKKKKKKKKGGGPLKKPPGGPK 
FSGGGKKKIFPQRGGYKKPPGGFLGKTL 
FLGGEKMGENPPKKIKPLGEKKIF 


8584 


22485 


A 


8646 


370 


3 


P VEAS RLLCL PKQAWAMAGAP P PAS L P P 
CSLISDCCASNQRDSVGAGPSEPGAGYW 
LVMHCFLSPSEKHS IWVGVTRFSRCCPS 
PLSLTRKGNSLTPCASQVRQCLALLRLV 
QGAGTHRTRG 


8585 


22486 




8647 


341 


88 


KKKKIFSPPIKRGPPSVYFICPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFF FFFFFRMGCDRWHGELYNKI 
PIWHKLCGDL 


8586 


22487 


A 


8648 


1 


108 


PTRPRTRGVASVLYFTTILILIPTISLI 

ENKILKWA 


8587 


22488 




8649 


3 


96 


EICGANHSFMPIVLELIPLKIFEIGPVF 

TL 


8588 


22489 




8650 


457 


297 


SPSFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLFYFAIF 
FIYLIIKDNYMFQC 


8589 


22490 


A 


8651 


84 


352 


YEKTDVKIISILFNTDYMLEYNVLHTLG 
QIKMLLISKKKKKKKKKKKKKKKKKINK 
FFWGGLILTLLGIVLySPLEALKISPSF 


8590 


22491 


A 


8652 


2 


111 


NNIKPSFTRENTLMFIHLSPILLLSLNP 
DIITGFSS 


8591 


22492 


A 


8653 


1 


305 


RQQQQQQQLRNLRDFLLVYNRMTELCFQ 
RCVPSLHHRALDAEEEACLHS CAGKL IH 
SNHRLMAAyVQLMPALVQRRIADYEAAS 
AVPGVAAEQPGVSPSGS 


8592 


22493 


A 


8654 


129 


268 


LSEVLYLFPPKKKRLIILCLFFRNPGSS 
NLQKITKEPIIDYFDVQD 


8593 


22494 




8655 


281 


117 


CI ITLSFINTLQAY I IWLWFSEVCLYV 














DLHAHKLGVDPGNLNLLSHCLLETIjAGH 
LPGEFTPAGHAFLDKFLDFVSTVLTJfKY 


8595 


22496 


A 


8657 


3 


270 


RRRGRAHCSLDLLGSRNPPASASRVAVT 
TGTRHLAWLIWGGFLFFVLRDVLVHFH 
ATDKDIPKTGQFTKERGLMDLQFHMAGK 
MQS 




22497 




8658 


107 


11 


qfehtkptpflptl ialttlllp is PF I 

LIIL 


8597 


22498 


A 


8659 


21 


136 


VPIEHDVPVPNSRAELLKMFGIDRDAIA 
QAVRGLITKA 




22499 




8660 


1 


395 


Jj VTALi J£ S Jj X ± i'TrxgWUSiiTHH iSN LEK 
KKKGKGKKKKRGGALFKSIFGGSHFFGV 
WELLFFFLKGGIKKTILGPLGKTLFFGG 
GIjIiGAPLPRKI KGLREKKNFKGELGVKN 


8599 


22500 


A 


8661 


1 


255 


RTRGRTRGRTRGLTRGKKKKKKIMKKBCK 
KKKKKKKKGGaFLKISrPWGGPIFWGLPKF 
YFLPNKGSFFNLIGDFLKRPFFWGGAYF 

G 


8600 


22501 


A 


8662 


136 


358 


FHIVKGVSI rKFKCVILKKKKKKKKKKK 
KKKKKKKKKKGKKLKKKKGGKKKKGGGK 
NKI FFWVGFFYKRVWGYF 


8601 


22502 


A 


8663 "" 


198 


368 


LINLVQFTCILGTLLGILFIYLFLRRAI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlrreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TMLARLVLWSWPQAVLLPQPPKVLGLQA 


8602 


22503 


A 


8664 


407 


82 


KFLPHLGFPPFKRFFIKKEPFQWFFFLF 
GFFFFRRSFFLLPRVQCNGAISGPCNLC 
LNVKVRLSFKKKKKKCWDYTFEPLCPAL 
HFS FINKTEDCLFVCFTVYWPHGF 


8603 


22504 




8665 


166 


293 


KI I AKHNIARTNPYTFCIMKKKKKKKKK 
KKKKKKKKKKKKKKGGAL 


8604 


22505 




8666 


1 


164 


RAKAKRRNTTVSCRMRHLKIVYRRFRYR 
L YVG III GPGGLNETIiVFTCKKITVP 


8605 


22506 


A 


8667 


295 


83 


YIYMSHICTLSLSLSIYKRVKCFCFFKT 
WSHSVAQAGVQWRDLGSLQPPPPKWILC 
ISVFFTVLFVMKKN 


8606 


22507 


A 


8668 


2 


131 


INLPSTLIIFTILILLTILEIAVALIQA 
YVFTLLVSLYLHDNT 


8607 


22508 


A 


8669 


423 


107 


HCALPKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKK 


8608 


22509 


A 


8670 


432 


301 


SQILGRFSQKQHLSLGGGGCSHPRSCHC 
TPAWARGRLRLKKFIK 


8609 


22510 


A 


8671 


1 


106 


LKLLTSSDLPASASQVAEITGMSHCARP 

LSRIYMC 




225 1 1 


A 


8672 






CPANFVFSVEKGFLHVGQAGLET.PTSGn 
PPGCWFCFLQQQLHW 


8611 


22512 


A 


8673 


560 


135 


SSWGMDSIPTSSNMEETQQKSNLELLHI 
SLLLIESRLEPVRPLRSTFTBNLVYDTS 
DSDEYHLLKDLEEGIQMLMGRVEDGSHL 
TGQTLKQTYS KFDTNSHNHDALLKNYGL 
LPCFRKDMDKVETFLRMVQFRSVEGSCG 
F 


8612 


22513 


A 


8674 


2 


326 


SIiLMCVHRCECVCMRACLCaGVCMCIAS 
CLGLPMWWECyTWRVLVFHQFQDEELH 
DTVDLETIPLERQPRDVQHPVSTRILYL 
HVYFVAVTLTLIRILQLWTE2VFSP 


8613 


22514 


A 


8675 


171 


272 






KSGPF 


8614 


22515 


A 


8676 


60 


409 


LVWLESHEDVGPLIKDSKQEKKKKKKKK 
KKKKKKKKKKKRGGAFKKKPGGGQNLRG 
GEKKNFFFLRGEKKNPRGNFGKKTFFWG 
GENWAKPPPKKKS PEGKKKILK6KGGKK 
YLNFLGE 


8615 


22516 


A 


8677 


106 


286 


LFWNKEGNNWKKKKKiCKKKKKKKKKKKK 
kiCKKKKKWGGALLKISGGGPIFPGGGKN 
FFFFFGGGF 


8616 


22517 


A 


8678 


1 


97 


I IGQVAS VLYFTT I LI LI PT I S LI ENKI 
LKWA 


8617 


22518 


A 


8679 




141 


NEHQPGQHGEMPSLLKIQKISWAWWSMP 
VrPATQEAEMEKESQISIM 


8618 


22519 


A 




499 


140 


GVFFFSRRKRCGVSPPPPPKPVFFPLPG 
FFFWWGGCPSAPPPPGGGFPPNPPRGFF 
YPPPLKGNFFFPPPAGGGPPRGFFWAPP 
PPFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFS 


8619 


22520 


A 


8681 


60 


224 


KQTKY I KE I TKH I CTWQGEYHDHEGG FP 
RVKLIHCTPDMLTPVISPDVGNSTTV 


8620 


22521 


A 


8682 


252 


383 


SQXKKKKKKKKKKKKKKKKKKKXXGGGL 
LKNFWGGPNLTGGGKI 


8621 


22522 


A 


8683 


405 


145 


KFFFSGGCFFFWGPPGKNVPPPKKMFFF 
LIPPPKKKKPPPPKKIFFPPPGVFFPPP 
PPFFFAPPPFFFFFFFFFFFFFFFFFFF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

sequence 


Predict- 

location 
correspon 

last amino 


Amino acid sequence (A=AIanine 
C=Cysteinei D=Aspartic Acid, 
E=Glutarolc Acid, F=Phenylalanine, 
G=GIycine, H~Histidine, I=]soIeucIne, 
K=Lysine, li=Leucine, M=Methionine, 
N=A$paragine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIinc, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, V=possible 
















8622 


22523 


A 


8684 




406 


LyYFLTKSTTTYLAVPQPFPPTPSTPSR 
AKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKRGGGPKKKTRKGGGGQKKKKGR 

KKKKSGGGKKNGGKPGRGGGGGR 














NLCLPGPSDSPSSA 














FTRKNTIMCIHLSPTLLT.SUSIPDIITGli' 

GLPKCWDYRREP PHPAAMFCI FSRDGVS 
PCWPGWSCL 


"8625" 






8^87^ 


"479 


366" 


8626 


22527 


A 


8688 


35 


161 


SNPPSSAFQVAGTTGVRHHDQLTFFFFF 
FLGGTGQILCRGGD 


8627 


22528 


A 


8689 




122 


THHIHNIKPSFTRENTLMFIHLSPILLL 
SLNPDI ITGFSS 














GIILSLFIIATLIYPTPSFSCY 


8629 


22530 


A 


8691 





85 


IFLRQFLS PRLE YSGVI KAHCS LNLLGS 

AETGraWAQGoSLLNLRNLPASFQ^ 
AG I I GI SHHDLLGI Q ITTWLLQKLS YLQ 
KKICKDQETGLLHAFHLHPYPYPMPSSP 
QLLQGPDILTNG 


8630 


22531 


A 


8692 


122 


38 


GRVDHSFMPIVLELIPLKIFEIGPVFTL 














FFVLSFFFWFYIT.LFFFFFFFFYFFT.F 
SFYCFNGSNK 
















8633 


22534 


A 


8693 


230 


3 


GSGIPSHPGHPSLPYHLEREWLAKTGIR 
DTSNSVHLLELCLRSQRHGHARWLTPVI 
PALWEAEAGGSRRQSIEPI 


8634 


22535 


A 


8696 


401 


240 


PPPPPPPCGSEPRPPHPSLSPPGSREMF 


8635 


22536 


A 


8697 


103 


313 


EKKKKKKKKKKKKKKKKKKKKKKKKKKK 


8636 


22537 


A 


8698 


354 


110 


KKGTLFEVSESFYTTEGVSAKINRIRSI 
VSVNKVARCIGEIFTDAVQSAPQKEVGG 
VSDSMVHKYEYAGCVDVKAVCSHGL 


8637 


22538 


A 


8699 


2 


164 


FLVETGFLHDGQAGLKLLISGDPPASAS 


8638 


22539 


A 


8700 


191 


371 


PPDSRWAIQGGHLEIiTPTTRQDLAQAPR 
ASGS 


8639 


22540 


A 


8701 


2 


170 


ARAGLYHVGQAGLKPLTSGDPLASASQS 
VRITGVSflRTRPKIIFKLIFFLFFFFFF 


8640 


22541 


A 


8702 


31 


229 


MKKYMVHPHSGILRSHEKKQAfiLFTIAK 
MKRNKQLYSQ 


8641 


22542 


A 


8703 


3 


126 


LQELRDPTLTFRLLGSPRPVWETRPVD 
DPTAPSNLYIQE 


8642 


22543 


A 


8704 


438 


31 


SEPCAGYNLLVCRFLSPLEKHNIRVGVT 
RFSRCHLSPLSLTRKGNSLTPCTSRVRQ 
CLALLWLAHSALHPLSCMHCLALPSEMH 
PVPQMEMQKSP I FC lAHRGS CTPELFLF 
GHLGSTPLTHLLSLRINVFLRTSE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 

last^amino 
residue of 
LTn'ifnIe 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc,V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8643 


22544 


A 


8705 


429 


319 


PGAIIAHRSLNLPGSSDPPTSASQVAGN 
TGVSYCARP 


8644 


22545 




8706 


426 


245 


RAAT P S P I KLTKKKKKKKKKKKKKKKKK 
KKKKKKKKKKRGAPLKKTLGGPQINRGK 

KKIFFFF 


8645 


22545 


A 


8707 


1 


108 


KHPRGGS FGYCFALAWVAFPLALVSGI I 

YIHLRKRE 














QIEGHTISALGDGAAWPVQGLIRHFRPE 
LEERMQRFAQQHQARQAAS 


8647 


22548 


A 


8709 


2 


83 


LVLEIAVALIQAYVFTLLVSLYLiHDNT 


8648 


22549 


A 


8710 


1 


81 


RPLSKTVRFNVLKVTKAAGTKKQFQKF 


8649 


22550 


A 


8711 


156 




RPLSFHPSGKKRLFLKKKKKKLCAVflHA 
OSIPSTLGGRGGRIMRSSDEDHPG 


8650 


22551 


A 


8712 


440 


233 


TPSYFVLLVERGFSMLPRLVSLTSGDPP 
ASAFRSAfilTGMSHHAQPRTGRSKAYLP 
FLEEPDLKHIFPF 


8651 


22552 


A 


8713 


92 


302 


AiVIHPTMGPQDQQHTGGLYAHTPLHPHSH 
THANTLIHSHIYAHAHSHTIPATCPKGP 
TQPYLCLLSPHIEL 


8652 


22553 


A 


8714 


2 


166 


KKINSFFSQGLTQAKVHGGRPSSLGPQT 
PVLKRSSCLSFPSSWDYRHEPLYPAN 


8653 


22554 


A 


8715 


1 


129 


GFRHVGQADLELLTSGDLPASASQIVRI 
TGVSPCAQHEFFKSN 


8654 


22555 


A 


8716 


381 


105 


RRSLLHSVLNGAQAGVQMRDLGSLQPPP 
PSSLPWPPKVPRLQPLPGRHPVWEVRSV 
SARPP IVWDVESPSAWLPSLESEERLCL 
AAIPSGK 


8655 


22556 


A 


8717 


385 


125 


FFFFLPPAFLLSGXFWFFFFYIFSFFFF 
FFFLGSPPPPIFFFFFFFLNKIFFFFFF 














FFF 


8656 


22557 


A 


8718 


409 


264 


KTDQRWSSTSSSKIMSQSQVSKGVDFES 
SEDDDDDPFMNPSSLRRNER 


8657 


22558 


A 


8719 


■' 


269 


KKHVKRVLSHLKASCPEELLHFSSMPQL 
LELQLMGGGSHTHMDQHTHTHMHPHIYT 

HAHTHTWAKLNELPWVEMNLLS LGAFQ 
QRLP 


8658 


22559 




8720 


430 


141 


LFFFLSFFEDGVLLCHPGWSWAQSQLT 
ATSTSKKKKKKKATPPGIPKFLIGKSGK 
PPRVLLIGHWAPFKFFFFNPRIAFF 


8659 


22560 


A 


8721 


386 


290 


RWGLTMLPRLVXJiTYWLQAIIiLLWPPKVS 

GLQA 


8660 


22561 


A 


8722 




157 


HEVFLVETVFLHVGQAGLELPTSGDLPT 
SALWEAEWG6LLEDRSS6PAWET 


8661 


22562 


A 


8723 


493 


186 


FSPQGGEKRAILGLEKLCPPGVKDFSAR 
P PKEVGNEGRVPKTKLFFVFLKKKGFPL 
IGRGGFKSRPWESPPRPPQKVGVQRLNP 
PPGPFFFFDLIWPNRNGS 


8662 


22563 


— 


8724 


3 


186 


GGGTPPFPGVWGKGNGGGTPPAKTLPPF 
PSPLFFFSPCPSPGEGGSREPLLSTPLA 
CICI 


8663 


22564 


A 


8725 


370 


67 


FAPPKNFYKGAPPIFFIFLFFILFIYFF 
FFFFIFFKFLVFLLETGFHHVSQEDLDL 
VIHLPQPPKWGLQAWATTPGQFFVFLI 
ETGLHPVWQDGHNLLTL 


8664 


22565 


A 


8726 


966 


652 


LGSLKPPPTGFKRFSCLSLPSSWNYRHA 
PPCPANFVFLEETGFLHVGQVSLELLTS 
GDPPTLASQSAGITGJyiSHRDWLGQHLIV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutantic Acid, F=PIienylalanine, 
G=Glycinc, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NHCYLFYQGHIKTTRSLLPPP 


8665 


22566 




8727 


S5 


425 


HMHSFAHELHTFLHTHTNSHTNSQNQSP 
TYTSKSHTHSYMRFTDSHDSQMQQTHRH 
SliLHMTQLYTCTHTHTHTRTVIPLHLHS 
VKGWLNTKVALGGRTS HERE S HI AGRLL 
A 


8666 


22567 




8728 


393 


124 


LGSGEPQLFQSPPSGGPPGPKSHWNSWG 
NANPPVPPYIIESAPPPPHFPFGAQSKY 
SGGQAARLCPCPPFFLKKKKKKAPGSGT 
GTFSFQ 


8667 


22568 


A 


8729 


25 


217 


AVQPIRVQWHRSLQSPQEQI ILAPSLAK 
VDMEMTQLTQENADFATRDRYHHSSLVN 
REQLMPHY 


8668 


22569 


A 


8730 


1 


611 


PGIFYSALLSLDTSILMQLCFIMHRYRK 
NLTAAKKNELVQKTKSEFNFSSKTYQEF 
NHYLTSMVGCLWTSKPFGKGIYIDPEIL 
EKTGVAE YKNSLNWHHPS FLS YAVS FL 
LQTSSSSSSSSSSSSSSSSSSSSSSSSS 
SSSSSPSPEERTVNVSSIRGKKWSWYLD 
YLFSQGLQGLKLF IRSSVHHSS I PRAEG 
INCKNQY 














VEKTQAS ILGCKCSTPRAALGFRERTLP 
LRECPCCGPPEAQMELEKRALGSQAQWL 
MPVFPALWEAEVGESLEARSFETSLGNI 
A 


8670 


22571 


A 


8732 




68 


VYVCVCVCVCVCMCVCRAKVGM 


8671 


22572 




8733 


382 


25S 


NYHSS LHLETPGLKQSSCLSLPKSWDYR 
HES PCPALI FNSL 


8672 


22573 


A 


8734 


479 


326 


PPLYGFFFFFFFWGGGFFFFFFFYFFFF 
FFFFFFFFFFFFFFFFFFFQTTF 












2 


GRRPPGLYFFFFPPGKKNFQGRGGFLFF 
FPPKGFFFFFFPIGFFFFSSELGKDWPP 
PKKKVFSQRFPFFFFPPPLFLFLFFPPP 
FFFFFFFFFFFFFFFSFFFFFFFFFFFF 
FFFFFFFNKQVFISRIiLCARHCFRPPAS 


8674 


22575 


A 


8736 


7 


75 


lAVaiilQAYVFTLLVSIiYLHDNT 


8675 


22576 


A 


8737 


2 


176 


NFGIJ:jABTGFI.YV6HftGIiEVESS6DLPA 
SASQRAGITGVSTAPGFNRYFYKQTIYK 
YG 


8676 


22577 


A 


8738 


408 


111 


GSSPPPRAGGENFLKKNAGGKHFPGGEG 
GGGFYPLYPKKPFFS PK6FI FWGGGGGK 
WPPPKKGGFSKKPQKVFFTPPQKKKKFF 


8677 1 22578 


A 


8739 


5 


249 


YEGLKIOaMQASPFQRGHPVHHKKKKKK 












EGALGGKFLAQTSFK6EKQKVFIKY 


8678 22579 










FFIiRQSIiSVTQAGVQWCDLSSLQLLAFQ 
GSSNSPTSASQVAGITGVHHHIQLIFIS 
LVETGPHHVAQAGLELNLQ 


8679 


22580 










LQV 


8680 


22581 


A 


8742 


19 


112 


FGRILLVEEKKKKKKKKKKKKKKKKKKK 
GGGPL 


8681 


22582 


A 


8743 


362 


234 


KMDQKCLSEVLQRWFPCCCCCCCCCCCC 
CCFSLPFIFEPSRS 


"8682 


22583 


A 


8744 


3 


419 


LTLRRFQLNLTALTKKKKKKKKNLKKKK 
KKKKKKKKKKKKGGPLFKNSLGG P I FPG 
GGKFIFFFFWGGFIKPPGDFLKKTFFLG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleuclne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconinc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GE I FGP P PPKNLTPLGKKKI FKGVGGKJT 
PPLFLRRKKFFSGGPFKKIPSPGLGF 


8683 


22584 


A 


8745 


444 


- 


KKPSKE I LYPENSRFFSPLSPLKFFFS P 
KGPNFWGGGGPLCPPPKERFFSKNPPGG 
FFSPPLKEKI FFFP PPVNLGPPRDLFKG 
APPKFFFFFFFFLWEMFYWLSLSSGSQR 
LPQQVPTVEPSELG6MGAYFVSKRSTYL 
QRGWRPER 


8684 


22585 




8746 


104 


222 


NKTFCLLKKKKKKKKKKKKKKKKKKiCNK 
KKKKKNWGGAL 


8685 


22586 


A 


8747 


403 


201 


MEKYNWQPHSGILYSHEKEQAALFPIA 
KRWRQPSCLSIDEWHNIMWSSLTVEYYT 
AMKRNKQLYSQ 


8686 


22587 


A 


8748 


1 


90 


RTRGAVYAALERMGLDGCVEDLRSRLQR 

GP 


8687 


22588 


A 


8749 


444 


85 


SPPPPGLFFFFFFPKKKTSPPPTKKGFF 
SPPPPQKPFFFLKPFFFLGGLGPNFPPP 
ffKNFFSKNPPRFFFFPPFKKKNFFFPPP 
FFFAPPKFFFLTPPPLFFFFFFFFFFFS 
FFFFPFFL 


S688 


22589 


A 


8750 


2 


309 


EFFPPYWEFLKINACMFSPEKKKKKKKK 
ICKKKKKKKKR6PPLKKTPGGPQI PRGGK 
KKI PPLKGGQKKPPRGFLEICNPLFGGGP 
FGPPPPPKINPPEKKKNF 


8689 


22590 


A 


8751 




76 


TASVSKGGGLQGITMKDSDBEEEG 


8690 


22591 




8752 


433 


145 


SFFFFFFFFFSFFFFGKKSSFFTPL 


8691 


22592 




8753 


2 


124 


GHLLMHLIGaATLAISTINLPSTLIIFT 
ILIKKKKKGRPF 


8692 


22593 


A 


8754 


80 


201 


FINAMSKKKKKKKKKKKKKKKKKKKKKK 
KKKKGGGLLKKTPGGAQF 


8693 


22594 


A 


8755 


415 


1 


lYFPTPEKFGPPKETLKKGAPFFFFFKQ 
KFPFFGPGGKQKGGFGSLQTLPPGKKKI 
SRPNPPRKRGFKDAPPPPGKFVFLKKKG 
VFPGGQGGFKTPTPRDLAPPAPLNPGAL 
FFFFLRQSLSLSLRLECSGTILAHAS 


8694 


22595 


A 


8756 


3 


136 


F TATRPGVYYGQ C S E I CGANH S FMP I VL 
ELIPLKIFEIGPVFTL 


8695 


22596 


A 


8757 




326 


KKFSLGSQGGEAGVFPRPPLGKLQTTAL 
NSGAKGTPPFLFPEPHKRGGPPPAPGWE 
GIFCPHFFPMGQKGEGGLWHTGEGPH 


8696 22597 


A 


8758 


1 


164 


SLISSTQGHKQCRRPQGPLPRKTRDPCS 
HVYLLTF PPLL S YD PAKS PS LRNTQE 


8697 


22598 


A 


8759 


431 


79 


FFFFFIFLKKVSTQSPPPQPYFFFFLKT 
FFLFQPLLKEGGSWGGRGDKPPPLAPFK 
KWGVFFFFFSFFCCFVFFFFPLFFFFFL 
FFFFFFPFFFPFPFFFFFFFFFFFFFFF 
PFPFFFPRCVLTRL 


8698 


22599 


A 


8760 


60 


194 


ARPTCPATAVTQKKKKKKKKKKKKKKKK 
KKKKKKGGGVLKKKQGGRKY 


8699 


22600 


A 


8761 


383 


38 


CASMTFPKKKKKKKKKKKKKKKKKKKKK 
KKKK 


8700 


22601 


A 


8762 


399 


164 


HITVKSLLVPMDDPPKKKKKKKKKKKKK 
KKKKGGGFKKNLWGGQKLTGEKKKIFFF 
LKGGKKKPLGIF 


8701 


22602 


A 


8763 


353 


48 


NFFFKVFFFLKDFSHKGAPPKKGGPPKK 
TPPRKKFFFLKIKPLFFIAFFFFWSLFG 
FFLIFSLFPFFFFFFFPFFFPFFFFFFF 
FFIVFTITLMHFKIYLH 
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SEQID 
NO: of 
nacleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


otide 
location 
correspon 
ding to 

itcid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
I>=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8702 


22603 




8764 


2 


93 


FAMLPGLVSNSWPEAICLPWPPKLLGLQ 
MM 


8703 


22604 




8765 


398 


63 


PPPPVFFFFSGGFF FWGGGGQKS P PRER 
CFFFKMHPGFFYIPPFVGKKVFFVLEGV 

FFFFFFGFFFFFFFFFFFFGFPSVSHL 


8704 


22605 


A 


8766 


3 


83 


ILEIAGALIQAYVFTLLVSLYLHDNT 


8705 


22606 


A 


8767 


450 


306 


QKKKKKRENKPEGLGTVAHACNPSTLGG 
RGRWITQPGQQEGNFISKY 


8706 


22607 


A 


8768 


1 


111 


WSLALVAQAGT^GFKQFSCLGLPKSWDY 
RHEPPCPGL 


8707 


22608 


A 


8769 


385 


261 


FLAEMGFCRVGQAGFKLLNSGDP PASAS 
QSAGITGVIAPVL 


8708 


22609 


A 


8770 


3 


293 


SLGSKPLGLLSLSPVKWFRIFGKHRNKV 
WGKDGGTDRNQSSSAFKHLQHGDSDPKQ 
NKIKACSSKFYLRRCVKRSFLLIKKKKK 

KKKKKKKKKKKKKKK 


8709 


22610 


A 


8771 




281 


CVCVCVCARVYIYIVyVYICIYTYMCVY 
MYICIYIHVYIHVYTCVYVYIRIHMYIC 














ICVCIYTCIYTYIYTHTYKHSVYYLHNF 
YINPKLLQS 


8710 


22611 


A 


8772 


295 


161 


THTHTHTHTHTHTHTHTHTHTTCSLPPS 
LAQLLGNCCIKaAISILYIL 


8711 


22612 




8773 


1 


318 


FFFFVRWSFTLVAQAHVQWRDLGSLQPL 
PP6FKQFSCLSLLSSWDFRHTPPCLAWF 
VFLVETGFLHVGQAGLELPTSGDLPALA 
SRGAGITGVSHHPQPPLCFLFL 


8712 


22613 


A 


8774 


3 


70 


FMPIVLELIPLKIFEIGPVFTL 


8713 


22614 


A 


8775 


369 


191 


TYKINKIQWAWWCAPLVPAAWKAEVEGL 
LEPGSLRPASVIQQDPHLKKGTYSRVLT 
THL 


8714 


22615 




8776 


2 


124 


GFHHVGRAGLEFLTSGDPLASAFQSAGI 
TGMNHHVQPIVE 


8715 


22616 




8777 


1 


162 


LKYYTADENGKTSRLLPQRPSDECGA6V 
FMASHFDRHYCGKCCLTYCFIKPBDQ 


8716 


22617 


A 


8778 


346 


138 


YNSPPYKEKTIPLQARVNFGPPRDSLKR 
PPFFFFFFKRQSLTMLPRLVLNSWAHGI 
LLPWPPKVQVLQA 


8717 


22618 


A 


8779 


2 


117 


LNLDTTAVQVENYPRIRESYKVSFLSAL 

EEYTKKUJTQ 


8718 


22619 


A 


8780 


2 


97 


WDLTMLPRLVStJSWAQVILPSWfPPKVLG 


8719 


22620 


A 


8781 


403 


276 


1 1 STVFQRGS CP I PSRKEVCSERPRRLQ 
KDSLTGFPHTVYKH 


8720 


22621 


A 


8782 


2 


119 


QAGLELLTSSDLPASATQSAGI TGMSQR 
ARPHFQCKIWP 


8721 


22622 


A 


8783 


245 


101 


ELQFKMRSGWRRSQPISWGLTLVPRLIS 
NSWAQVILLTWPPKVLGIQA 


8722 


22623 




8784 


2 


124 


AKLEKKKKKTGRAKRRMQYNRRFVNWP 
TFGKKKGPNANS 


8723 


22624 


A 


8785 


3 


355 


VGPSEPGAGYKLLVCRLLRPLEKHGIRV 
GVTGFSRCHLSWLPLARKGNSPTPCASW 
VRHHLALLRLTVCGLHPLS CTHCLTSPS 
EMNPVPQLEMQKS PVFCVAHAGS CRLE L 
FLFGH 


8724 


22625 


A 


8786 


238 


9 


PVKLGAGPPQAPFWLHKWAPLIKPFFKG 
PIGSLQIPLFFFLRQSLTT.PPRLECSAT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WLTKSSTSWVHAILLPQPPE 


8725 


22626 


A 


8787 


336 


10 


CLSLPSSWDHRRLPPHPASFCIS 


8726 


22627 


A 


8788 




147 


IFYLLETGFHYVGQAGLELLTSGDSPAS 
ASQSAGITGVNHHALPE IHVF 


8727 


22628 


A 


8789 


3 


130 


DLEEGIQTLMGRLEDGSRRTGQILKLDH 
SSEFSKTRELYPVF 


8728 


22629 


A 


8790 


206 


3 


VPPSGPIKKGDKKKNLYLIFLI KNVPNK 
KLRKGVSPLAIPFFFFFFVFLLETGFHH 
VGQASI^LLTS 


8729 


22630 


A 


8791 


124 


390 


RAGAQSNLMEPPPSGVKalFPPNFPKKW 
GNGTPPPAPLIPOGLKKKBVSPCXSPQGS 
EPPAPGGPSPLaPPKGGNHGRDRLPPPP 


8730 


22631 


A 


8793 


2 


301 


HEERERERERERERERERERERERERER 
ERERERDALFAFFPPRGaRPPPEIEHGV 
FVGA6SVREIYPTLSIFFFSPPAHRGRE 
LRSRCVSHSLHLISRA 


8731 


22632 


^ 


8794 




285 


ARGERERERERERERERERERERERERE 
RERBRERERBRERERERERERERGAPPP 
PLFFLGRGGALEAHAISPPPFLGRPPYI 
YIFFHMRARPPHRYLCftQRETRPAL 


8732 


22633 


A 


8795 




321 








RIPRGDLRDQFPHPLftHSRDSFCGSADH 
FGRGVFNKVSWTDPPTHRVTSSLGGGV 
ERDLLTLSGGGTYAPWKKMCATGEDQ 


8733 


22634 


A 


8796 


2 


140 


HEERERERERERERERERERERERERER 














E 


8734 


22635 


A 


8797 


2 


223 


SAHEHERERERERERERERERERERVSR 
RNLSRGGGGVPPPLQNVRIHSGGPARGT 
LHHIKKKTSLTDVGLAQ 


8735 


22636 


A 


8798 


1 


176 


ARGEREREREREREREREREREREKISF 
LGGGRGTQFAIERGVIQFCGRRYTTGRV 

RN 


8736 


22637 


A 


8799 


335 


27 


PPTKFFFFFFLVEMGFCCIAQGGLELLS 
SSSLPASAYQSAKIIGVSHHAWSVF 


8737 


22638 


A 


8800 


342 


191 


STSLSLPKCWDYRREPPCPAWLSYFFKD 
PFSKCNPRLRYWGLGLQHRNFG 


8738 


22639 


A 


8801 


104 


332 


AVPLTMVKIQPLWKRVWHFHWKSKLELP 
CDPAISLLSMYPKEMKSLCQRDVCTPRL 
STGPLTIAGMWNPPKRSSMD 


8739 


22640 


A 


8802 


563 


258 


FFLNLETRSWYVAQARVQWLFTGTVTVH 
YSLDLLASSHPPISASGVAGITDACCAQ 
LTLPFLKLGLLGLGGGGVGYPSYFGLEG 
FHNDCSRVRVLCMWSYQ 


8740 


22641 


A 


8803 


2 


199 


ARGLLVRRFLSPSEKRSIRVGVTRFSRC 
RPSPLSLTRKGMSLTPCASQVRQCLALL 
RLAHGACTH 


8741 


22642 


A 


8804 


266 


108 


KLWSKCVALPPSFSSSCSSHIGCACFP 
FTFHYDCKFPQASQAMLPEQAVEP 


8742 


22643 


A 


8805 


1 


140 


KWWFKRPGVYYGQCSEICGMIHSFMPI 
VLELIPLKIFEIGPVFTL ' 


8743 


22644 


A 


8806 


2 


90 


SSAAAEENNDEKKEEAEKTEDDMEFGLF 
D 


8744 


22645 


A 


8807 


229 


34 


KKKIFFFFPGFFWPPQKFFFKAPPPFFF 
FFFFFFFFFFFFFFLDLFIYLIILSYTK 
LISIEAAAPI 


8745 


22646 


A 


8808 


43 


173 


NADSGHAQWLTPGMPALWEAEAGGSQDQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted, 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline; 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 












l34 


EIETILAGAGKPRLY 


8746 


22647 


A 


8809 


259 


lYMIFKNKFFNRDRGLPMLHRLVLNWAQ 
VILLSWPPKVLGL 


8747 


22648 


A 


8810 


391 


193 


VLPKIPGIQFPPPCRPKKRPRPRFKKPP 
PKEKKISFSNPPKIWPPQGYFKRGPPRL 
FYFFFFWIL 


8748 


22649 




8811 


304 


3 


TTPPTNIFFCFLFFIFIFLWYFIFIFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
QCF I GHDDQKMMVHTHTQTS SHLLQGGS 
NPQGKGQS PGWVLWCA 


8749 


22650 


A 


8812 


395 


268 


FLVETGFHHVGQAGLELPASSDLPALTS 
QSAGITGVSHHGWS 


8750 


22651 




8813 


782 


LSRLFYFCVLFCLYMKTTQLPYFRGLVC 
LFVLRQGLTLSSRLECSGMIAACCSLNV 
PGSTDSPTSASRWRTTGVRHHTQLI FV 
CFVEMWFHYVAQAGLEPLGSSSPPALSS 
QGAGITDVSHHTPLELCF 


8751 


22652 


A 


8814 


95 


236 


ATMPGLKNIYFLKNKDKGLTMLPSLVLK 
SWARVILLPWPPKVLGLQT 


8752 


22653 


A 


8815 


381 


58 


SCLLFPPFWGARGGGPPRAGGSNPPGPP 
GLTPFPPKKPKNYWARGGGPFIPPPREG 
WAGGFFLPRRGRVPLAPGSNMLELGPLP , 
SPPGYQKKTPFSKKKKKEKKKRKR 


8753 


22654 




8817 


441 


158 


FFFFLRTDGYLTMLPRLVSNSWPQAILL 
PQPPKMLGL 


8754 


22655 




8818 


415 


342 


RLVLDSWAQEILLPWPPKVLGLQV 


8755 


22656 


A 


8819 


4O0 


171 


NFFFPPGVKFLGGGGPQFPPPQKRGFFQ 
KTPGGFFKPPPKKKKKFFPPPPKIGPPQ 


8756 


22657 


A 


8820 


2 


268 


INIILAPTISLLGILVYESHLMSSLLCL 
EGI ILSLFI lATLITLNIHSLLANIVPI 
AILVFAACEKKKKGGAVLKDPWGGQSLR 
VIAE 


8757 


22658 


A 


8821 


2 


258 


LCLPNQAWAMAGSPPPASLLPCSLISDC 
CASNQRDSVGVGPSEPGVGYSLWRRFL 
SPSEKRS IRVGVTRFSSWVRWLRTVI PA 
T 


8758 


22659 


A 


-8822 


250 


456 


GGGDKPGLIETFPPGLKPFFLLNLLSGW 

ELGPLAPPPFKFCFFKGRGFPFLPRFVF 
VAMLLLTCCKKRD 


8759 


22660 


A 


8823 


492 


376 


QENCLNPGGRICSELRSCIiCTPAMATER 
ACLKKQNKTNQ 


8760 


22661 


A 


8824 


1 


116 


LTPL P PSAP PS VDDWLKTP PEWVCS L P F 
HPQRMIISRN 


8761 


22662 


A 


8825 


2 


191 


FIjVETGFHHFGRAGLKLLNSGGALTSAF 
QRAG I TG VI TGVSHCFEVMRVWTGTQYF 
LCYILKS 


8762 


22663 


A 


8826 


393 


75 


PRFFPPPPPKIFFFPPPPFFFLGGFPPI 
PPPPKNFFFPKPPPGFFFPPPLKKKIFF 
PPPPFFSPPPNFFFPPPPPFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFSI 


8763 


22664 


A 


8827 


81 


380 


KI ITKHWIARlTSIPYTFCIiyiNYLKKKKKK 
KKKKKKKKKKKKKKKKKKKKKGGGPFLK 
NPGGGQKNPGGKKNFFFFFGGGKKNPPG 
FFKKKPFFGGGKFGAPPPPKN 


8764 


22665 


A 


SS2S 


354 


13 


TKPKTPPIiTKGPRRGFIiAPRVPGGGGP I 
TPPPQNFLGEKGTLARKKVFPPGRPAPK 
RPPRKKKGFEMRPIiKKNGPGPWKPLGFF 
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nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
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od 


SEQID 
NO: in 
USSN 
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26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

location 
correspon 

last^amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Plienylalanine, 
G=GlycinE, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LAHP PGFGPKKKKPENQTHKKKCQRRKP 
RF 


8765 


22666 


A 


8829 




215 


KHSTTHSHCPRTIKLKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGPPFKKTLGGPPF 
SPGGKKKFFFFLGGL 


8766 


22667 


A 


8830 


I 


127 


ILFFWQRRGLALLHRLVSNSWAQVILPP 
RPPRVLGLQVIYFK 


8767 


22668 


A 


8831 


264 


55 


FFFXFFXFFFFFFFFFLFVFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFWCSCS 
YVLIFIQPFIKSFH 


8768 


22669 


A 


8832 


399 


85 


PPQKNFFFPQAPKISGGGGPQIAPPQKK 
GSFQKTPGGFFI PPQKKKBUYFPPPGKM 
GPPQGFFKRPPPLFFFFFFFFFFFFFFH 
NRHLFKVKHLQNSFHLVQLRL 


8769 


22670 


A 


8833 


47 


288 


NCLYRTKKKKKKKKKKPPYKKKTLRGPN 
FPPAGPPAPLPLSGGEKKPSRGLLRRPP 
TLGGAAKGPPPPPKLTPLRKKKIF 


8770 

"8772 


22671 

"22672 


A 


8834 


1 

n 


146 

l56 


INIILAFTISLLGILVYRSHLISSLLCL 
EGIILSLFIIATLITPSTLC 
EDPKTSPKPKIIQTRRPGLPPSVSN 
PTRPTIITPILLTLFLrTQLKILNTNYH 
LP PS PKP I KKKKKKKKKKKKGGGP 


8773 


22674 


A' 


8837 


152 


292 


AKTFNFYKVEFINVFENGLCLLllKNTL 
PKKKKKKKKKKKKKKKKKKK 


8774 


22675 


A 


8838 






VASNSIHPKAKENTAPHTHTHTHTHTHT 
HTHTHTHLFMAI FK 


8775 


22676 


A 


8839 


2 


106 


HVGQAGLELLTSGDLPTSASQSVGITGV 
SHRAWP 


8776 


22677 


A 


8840 


364 


224 


APLCGRQICDCDCTYPSPHTYIHTHTHT 
HTHTHTHTHS 


8777 


22678 


A 


8841 


266 


1 


NSLSVEFLILFPPYNYSPKQCI CSHQNC 
LNFWVI KCVNTPHPNFLNLRMPLLAKRG 
AHACDPNTFGGGGRWITRSVDRDHPGQH 
NETP 


8778 


22679 




8842 


343 


52 


KTGFRHVGQPGELIASSDLPASASQSAG 
ITGVSHRVRPGLHILDKrSSFLDTSFADI 
FSVLYLKA61ASLLH1YIHFLLPLRDLL 
LVSSLTVSFPTAV 


8779 


22680 


A 


8843 


256 


35 


LFIFSLSLQYIFCVCGCVCVFMSLCFLV 
CAWGHVQLHVCSCVRMCVCENDVCVCGW 
VQMCFCIKNFQKEVYQI 


8780 


22681 


A 





333 


11 


PPIKKKAPPPKGRAFFFFFLKKKKGGPP 
PPQKKTGGGGPKKRGGVKKPPPGFFPGF 
FGPLFFWGPPFLPPFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFNFISTKHLF 


8781 


22682 


A 


8845 


15 


288 


RCGLTVLPMLVSDCWAQVILLAQPPRVL 
GLQAGASASTPITSVWCSQGAIYPEVGT 
TGSTIIAAASSGGEFDSVLQNDICMCFL 
TQQFHF 


8782 


22683 




8846 


I 


397 


ECAHHTQL I FLFLI ESSLHHVGQAGLKL 
PISSDPPTSASQSAGTTAVSHHA 


8783 


22684 


A 


8847 


1 


179 


ARERERERERERERERERERERERERER 
EREREREREREREREREREREKKKLGGG 
GGGPQFPIGGGLTQGGGGG 


8784 


22685 


A 


8848 


2 


233 


HEERERERDRERQRERERERERDRERER 
ERETFFSSSGGGPKETTLFFETGAPKEG 
YQNMHARL I P SAVP FKKRGRG 


8785 


22686 


A 


8849 


2 


180 


HEERERERERERERERERERERERERER 
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SEQID 
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nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 

last^amino 
residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Gljcine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ERERERERERERERERDFVFYRWGEGHI 
DRQYCSFLCPGWCHCPM 


8786 


22687 


A 


8850 


2 


79 


PEERERERERERERERERERERERERER 
ERERE 


8787 


22688 


A 


8851 


1 


178 


ARGERERERERERERERERERERERERE 
RERERERBRERERERERERARLFARGEK 
KKHTGGGPPLFYYLYI 


8788 


22689 


A 


8852 


3 


76 


TRRERERERERERERERERERERERESP 
A 


8789 


22690 


A 


8853 


5 


227 


EERERERERERERERERERERERERERE 
SLTPLLTGKWNPCPPPPH 


8790 


22691 


A 


8854 


1 


189 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RHSRLFPRESVCVARAGGASSPPRG 


8791 


22692 


A 


8855 


2 


271 


HEERERERERERERERERERERERERER 
EREREEPPRGEKYCIRVPPQKILSPACD 
SLLYVTHNLCVSHSTPRGERYVALFNSR 
ETRCF 


8792 


22693 


A 


8856 


2 


309 


HEQQQQQCXJLRNLRDFLLVYNRMTELCF 
QRCVPSLHHRALDAEEEACIiHSCaGKLI 
HSNHRLKUiAYVQLMPALVQRRlANYEAA 
SAVPRVAJVEQPGVS PSGS 


8793 


22694 


A 


8857 


44 


203 


GYSTCVGMHAHTHIHTDTHTHAHTYIHQ 
CLIQIGLCTFATCTSQVNKSSMftEL 


8794 


22695 


A 


8858 


329 


182 


1 1 FVCLVERGLHYVGQAGPELKASDDLP 
AIASQSAGTSGVSHCVSSKMS 


8795 


22696 


A 


8859 


1 


175 


GLTLPPRLGCHATITAHYSLNLPGSSNP 
PTSASQVAGTTHSRPCPEITQTALQRGP 
HD 


8796 


22697 


A 


8860 


3 


350 


HEAIGLRSNIKNGLDHFLPLGTPTPLIP 
ILAMIETILLLVQPIALATOLTANITAG 
HLLMRIiNGSATLAISTISLPSTLIIFTI 
LILLTILEIAGflLIQAYVFTLLVGLYLR 
DNA 


8797 


22698 


A 


8861 


139 


3 


TFKEADIKGGQPPHLftSFCIFGRDGVSP 
CWPGWSRTPDLKRSTHA 


8798 


22699 


A 


8862 


3 


134 


SPTPPPSSKPSSIPRKSSVDLNQVSMLS 

PAALS PASS SQRHES 


8799 


22700 


A 


8863 


346 


219 


RLVPPCLDNFLNFCRYSCLTMLPGLVSN 
SWAQVIFLPQPPKD 


8800 


22701 


A 


8864 


1 


147 


GTRFFFERGIDHVGQAGIQLLTLGDPPT 
PASQRAE ITGVSHHTWPNFI F 


8801 


22702 


A 


8865 


267 


132 


KKKFWPFFFFFLRDGVSLLLRLVSNCWA 
QEILLPWPPKLLGLQV 


8S02 


22703 


A 


8866 


343 


215 


LAQGLKT P I PKRVP PHTQKKKFCSERP T 
RIQTDRSCWVSPLS 


8803 


22704 


A 


8867 


1 


242 


GTSKKIDKLDYIELKCFCTAKETISRVK 
RPVEWEKI FANSS PE KGLI SRINKEAKK 
LNSSAIAAHNCNPSTLGGQVERIT 


8804 


22705 


A 


8868 




219 


GTRRPTWATWQNPASTKKKKWS F F FETE 
SRTVPQAGGQGGNLGSLQSLPPGLKEFS 
CLSMPGAPKGQSGSLMG 


8805 


22706 


A 


8869 


1 


181 


ATGGILLUiDWSLAYESKHLLEGAKSE 
SAEELKKRAQELEGKLNFLTKIHEMLQP 
GQDQ 


8806 


22707 


A 


8870 


1 


255 


AAAPPNAPGGPPGPQPAPSAAAPPPPAH 
ALGGMDAELIDEEALTSLELELGIiHRVR 
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SEQED 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Pretlict- 

nucle- 

iocation 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pl]enylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, ti=Lcucine, M=]Vlethionine, 
N=Asparagine, P=Prol)ne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl<nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ELPELFLGQSEFDCFSDLGSAPPAGSVS 

C 


8807 


22708 


A 


8871 


383 


233 


ATTFPNPGFLFVRLFVLVQWPDHEPPGF 
KQSSHFSLPSSWDYRHPPPPPG 


880S 


22709 


A 


8872 


211 


294 


AXKKKKKKKKKKKKKKKKKKKKNI IWGG 


8809 


22710 


A 


8873 


398 


100 


QKERPFGFPFFKPGPFQKRGGKGAFPKD 
GEHYFFWVLEKSTGFPPKTKRVYYPSPV 
YPHPPAPQKEGFPGGSPPPRPIASPFGG 
KKKE KKSYIEKENLF 


8810 


22711 


A 


8874 


3 


214 


DAWVLGILVYRSHLISSLLCLEGIILSL 
F I lATLI TLNTHSLLEKKKKKKKKKKKK 
KGGAFKKNPGGAQI 


8811 


22712 




8875 


2 


254 


PRVRPEGRMHTPQSWFQANPMACMTFSK 
KKKKKKKKKKKKKKKKKKKRGGGF I KKM 


8812 


22713 


A 


8876 


2 


135 


TINLPSTLIIFTILILLTILEIAVALIQ 
AYVFTLLVSLYLHDNT 


8813 


22714 


A 


8877 


2 


296 


T.ARGAEVr,GYGSHSRGRVPGALVGQGAG 
RLFTEHPGSSPATIiAIYPTPEGTSVAVS 
ISAPPKARSRPYPPSRSCHNLSLAGSSV 
LVPPGPVHRWVGRP 


8814 


22715 


A 


8878 


164 


29 


RKIGPARWLTPVIPALWETEAGGSRGQS 
lEI HjANTVKPQACEVL 


8815 


22716 


A 


8879 


381 


219 


GEKMTEEEVHMLVAGHEDSNGCINYEGK 
R 


8816 


22717 


A 


8880 


2 


136 


ANGNSFATRLSNIPVIGNGNKPWISLPR 
GKGIRLTVAEERDKRLATKQSSG 


8817 


22718 


A 


8881 


3 


402 


SGFNIHYAAGPFALPFIAE YTNI I I INT 
LTTTIFLGTTYDALSPELYTTYFKKKKK 
KGGRPFFFFFFLKKKKIPPPKKKNFGKK 
GETIiKGRGQAP IFQPKKILSPPKKKKKK 
RGGAFKKSLGGAKKWGGGRN 


8818 


22719 


A 


8882 


1 


406 


RYSTPSEGEVGERYSTPPGETLERYSTP 
PGETLERYSTPPGETLERYSTPPGETLE 
RYSTPPGETLBRYSTPPGEALERYSIPT 
GGPNPTGTFKTYPSKKKKGGGRPKEPLG 
GPRFTGVGRVKVFSLWGGVIiNAR 


8819 


22720 




8883 


484 


307 


KKILNRRVHWLTPVI PTLWETHAGSSRG 
QEMETILGNTVKPPASASQSAEIKGMRI 


8820 


22721 


A 


8884 


1 


131 


GSLTHHIHNIKPSFTRENTLMVIHLSPI 
VLLSLNPDI ITGFSS 


8821 


22722 




8885 


406 


106 


KGGAGPKIAPPKKKZiPSPIPPPFFWPPP 
VFLKGPPPPSPFNPFPPPVFPPR6PLFF 
FFFFFFFCETWLPPRPSHLVPTLPHVP 
AWLIPIDQYLFISHSC 


8822 


22723 




8886 


425 


325 


SSTHTHTHTHTHTHTERSHLLAFSPSAG 
ELWIXlfljyi 














VCVCVCMCVYICIYPCWLPCWHMCLELY 
KVFKGIMKGV 


8824 


22725 


A 


8888 


2 


86 


YSRTRAflALIQAYVFTLLVSLYLHDSAY 


8825 


22725 


A 


8889 


400 


62 


FLYLKSFPPPPERGVFSPLPPQKFFFSP 
KALNFLGGGGPKFPPPKKRFFFKYPPRV 
FLYPPHKKKNYIFPPPDDFGPPRVFFKA 
PPPHPFPFFFFFFFFFFFFFFTRLEWVQ 


8826 


22727 


A 


8890 


3 


100 


SLEPGTSGLKGFSC1.TLPSSBDYRHEPP 
CPAH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

peptide 
sequence 


Predict- 

nucle- 
otide 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Hisf idine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, y=Tyrosine, 
X=lInknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8827 


22728 


A 


8891 


32 


142 


VRPTRPGQVASVLYFTTILILIPTISLI 
ENKILKWA 


8828 


22729 


A 


8892 


3 


382 


TPHNPLSRPSYEPKPASTPSRAPKKKKK 
KNPKKKKKKKKKKKKKKKKEGGALLKKT 
LGGPKLPGGKKKNFFFFRGGKKKPPGDF 
LKKTLFLGGGNLAPPPPKKKNPWGKKKI 
FWGEGGKKNFFFFG 


8829 


22730 




8893 


368 


199 


KKKHI SPPSRKRAPQGDFLRPTPP IKWL 
YYYLSFFFLILYIFFFFFFFFFFFIFFF 


8830 


22731 


A 


8894 


3 


136 


TINLPSTLIIFTILILQTILEIAGALIQ 
AYVFTLLVSLYLHDNT 


8831 


22732 


A 


8895 


1 


144 


lATINLPSTLIIFTILILLTILEIAGAL 
IQAYGFTLLVSLYLHDYPYN 


8832 


22733 


A 


8896 


153 


375 


PQKKKKKKKKKKKKKKKKKKKKKWGGGF 
LKNSGGAP IFGGGGKFFFFFLGGGFKTP 
RGFFGKNPPPWGGKKKKN 


8833 


22734 


A 


8897 


332 


400 


WLl'PVIPJO.WDAEaGGSFEVHSS 


8834 


22735 


A 


8898 


215 


359 


telvpnavlpcplpcssghhij:.caihz^ 
rvtimpkdiqlahrirgera 


8835 


22736 


A 


8899 


414 


73 


PHLLCFHKKaWAMftGALPPAIiPPYSLI 
SDCCASNQRDSVGVGPSEPGVGYNLWR 
RFLSLSEECRS IRVGVTRFSRCCPSPLSL 
TRKGNSLTPCTSRLRQCLPLLWIAHGAR 
TH 


8836 


22737 


A 


8900 


374 


88 


VSPPSPLKIFFSPKAFNFWGGVGPHFPP 
PQKRVLPQKPPGGFFSPPLKKKKFSFPP 
PViCLGPPKGSFKGPPPFFFFFFFFFFGL 
DFMYTGSSNVW 


8837 . 


22738 


A 


8901 


433 


56 


PSGPPPPGGFYPGKRGFFPTPLWFGGP 
KRDPPPRGKKNPSWRSGGKQTPRFPRDP 
LLKAFFPGGPPQRLGKAQGPFWFFFGPK 
KGKTQTGPPFFFKKLGDPWGPKKKPPPG 
GAPRLGPPKKKFL 


8838 


22739 


A 


8902 


3 


73 


lAVAL IQAYVFTLLVSLYLHDNT 


8839 


22740 


A 


8903 


3 


149 


PNLSYIIGKDTWVEHWPEEDECQDKENQ 
KQCQDLGAFTESMWFGCPN 


8840 


22741 


A 


8904 


351 


137 


FPPEFFA^iPKKSGILFFSKTWMEVEVLF 
LSQLMLGQKTQYCMFSIilNGRLriKAPG 
SLKGDHHPPLGLLEG 


8841 


22742 


A 


8905 


142 


5 


KIFLQKKKKKKGLGTVAHACNPSTLGGQ 
GGWIMRSRDQDHPGQQW 


8842 


22743 




8906 


367 


244 


HVPQARAKFFCI FS RDGVSP YWPGWS RT 
PDLMICLLFIVEL 


8843 


22744 


A 


8907 


146 


248 


SQI LGRLRQENCLNLGNGGCS EPRWCHF 


8844 


22745 


A 


8908 


2 


197 


AAASTHHARLFLCVCLFAYLLVEMEFHH 
VAQAGTELLDSSNPPPLASQNAGIIGMS 
HHTQPQYIC 


8845 


22746 




8909 




127 


KKTRLERAQWLTP VI PAFWNAEAGTLLE 
ARSLRRLPWPPNVL 


8846 


22747 


A 


8910 


368 


280 


LTMLLRLVLNSWAQAILLTWPPKVLGLQ 


8847 


22748 


A 


8911 


355 


211 


QQQNLHLFWPGTVAHTRNSSTLGGQGMR 
I TGAYEFETTLGWTARPHL Y 


8848 


22749 


A 


8912 


325 


190 


KKKKKKGGGGPQKKKKNPPPPPTSKKKK 
KKKKKKKKKKKKKKKKKKKKK 


8849 


22750 


A 


8913 


1 


411 


VNVPAPRGGAYRGRQASFSSGGLHPVPA 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienylaIanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Argiiiine, S=Serine, 
T-Tlireonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














YRPLCLPNQTTNSAMAGAPPLASLPPCS 
LISDCCASNKRDSMGVGPSETGTGYKLL 
VCHFLSPLEKCSIRVGVTRFSRCRLSPL 
SLTKKGNSLTPCTSRVRRCLALLRL 


8850 


22751 


A 


8914 


258 


9 


GAIPCLHCETNPSHISSTQELQTPEPQL 
PGVPLELPPPGSCFKCRKSGHWAKECPQ 
PGIPPKPCPICAGPHWKLDCPTGPRL 


8851 


22752 


A 


8915 


326 


42 


EKIHRKYFCYSFFVFLVEMGFYFVGQAS 
LKLLASSDLPAS PSQSARVKAMSHCVRP 
VLVIFKNKILQNTENSTKIYVYPPPWNQ 
TLTFCHFCFK 


8852 


22753 




8917 


3 


106 


GLEL PTSGDL PASASQNAG I TG I SHHAR 
PIMTYS 


8853 


22754 


A 


8918 


320 


171 


AASTTDGSYKCLCLPGWPSDKPNYCTP 
LISTALNLEKCPFGLPHLSGSS 


8854 


22755 


— 


8919 


_ 


201 


VKPSDRYHLMPIFTAAYPHQNSTYNVPV 
STRMVMAEEFKQGKRLSLCSDLYRKDLR 
TIVDPWSCaT 


8855 


22756 


A 


8920 


3 


182 


LLRPLEKRGIRVGVSRFSRYHLSRLPFA 
RKGNSPTPCASRVRRCPSLLHGLHPLSD 
KPQ 


8856 


22757 


A 


8921 


142 


1 


SCHPGWSAWRPQLLRRLRYENCLSLGG 
SQCSELGS CHCT PAWTTEG 


8857 


22758 


A 


8922 


1 


117 


LFFNYAWGWSTTLLSRLSLNSWVHVILT 
PWPPKELGLQA 


8858 


22759 


A 


8923 


357 


121 


LGAFSDGLAHLDNLKGTFATLSELHCDK 
LHVDLENFRLLGHVLVCVIAHHFGKEFT 
PPVQaAYQKWAGVANALflHKYH 


8859 


22760 


A 


8924 


343 


133 


LFFNFNFLETRLREWLIFVFSVETGFPI 
KRAQLKLQASSDPPARGSQSAEISGMSH 
YAWPCKSNRAFKCS 


8860 


22761 


A 


8925 


321 


44 


PDSGGSPASASQVAGTTGACanJAWLMFI 
FNMDININIPFSAKRDKLSMDKINEI IF 
SLLIHRISSFQIMYFLCSSRIESRKRWL 
KQTGRKKE 


8861 


22762 


A 


8926 


335 


106 


HFVFLIETGFHRVGHAGLELLTSTDPPA 
liAYKSSGITGVSYRSFTKLHLCACVCLC 
lYTYLGLVLSFTPGSWEKKT 


8862 


22763 


A 


8927 


2 


160 


LSRVPPRSANLSISVFAEKRCHFTMLPR 
P I SNSWAQGI LLPSQAPKVLGLQA 


8863 


22764 


- 


8928 


3 


140 


RVITEEEKNFKAFASLRMARANARLFGI 
RAKRAKEAAEQDVEKKK 


8864 


22765 


A 


8929 


126 


59 


NSGGGGCNELRSCHCTPAWATE 


8865 


22766 




8930 


3 


104 


KNMACYCRIPACIAGERRYGTCIYQGRL 
WAFCC 


8866 


22767 


A 


8931 


311 


112 


MENYLSKMQQELEKWITRELKEAAAELE 
SGS IAS PLGSTDESNLNQDLVWKASREY 
VQVLKKNYMI 


8867 


22768 


A 


8932 


345 


134 


GGGSSPWFPPFWRPRRADSLRVGVLNPP 
GPGGGAPFFLKNPLIGWGGGARLWFQLI 
LLRVEPENSFDPGR 


8868 


22769 


A 


8933 


262 


39 


RPRRPHPGNFFFFFNSGSHYVAQAGLEL 
LDSSDPTLPSQSAGTTGMSYQAWPLDHT 
FENRDCVLLHCSSPVSNT 


8869 


22770 


A 


8934 


84 


22 


TPAWVWWLTPVI PALWEAKAG 


8870 


22771 


A 


8935 


277 


111 


HRI IMHAiVGSHGVSGdFMKYDLdSLMVT 
VIEEHMP FWQFFVRLCGIVGG I FS STG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
eorrespon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 


8871 


22772 


A 


8936 


2 


106 


DSAIPRRLECLKEDVQRQQEREKELQHR 
YflEFVF 


8872 


22773 


A 


8937 




193 


LMWDFSPSGLDGAFQRGYYWYYNKYINV 
KKGSISGFTMVLAGYMLFIYCLSYKELK 
HERLCKYH 


8873 


22774 


A 


8938 


240 


69 


DSGNQLKRS FSGEARWLTPVI PALWEAE 
VGESLEVRNIIYILQNMSNIGNIHLIIK 


8874 


22775 


A 


8939 


329 


177 


RWNLAMLPRLVLNSWAQAICPSWPLKML 
GLEAEPSCPARTWFKVTS PRLH 


8875 


22776 


A 


8940 


358 


120 


HPSNFFVB'LVEMGFHHVGQPGLELFPAR 
YVPTIiAFRSGGITGISHCAWPKEWALPR 
KASPGLSGRRGSPHCATCSSKAW 


8876 


22777 




8941 


333 


104 


PHVYSNGHICLS ILTEDWSPALSVQSVC 
LSIISMLSSCKEKRRPPDNSFYVRTCNK 
NP KKTKWWYHGKQFEFVITF 


8877 


22778 


A 


8942 




241 


APLVAFTQTOLEDKGGLSKIjVEAIRINF 
NDRYDEI CHHWGGNVLGPKSGTRIAKLK 
KAKARELATNW I KCTLLS FL Y I KR 


8878 


22779 


A 


8943 


1 


240 


APLVAFTQVNLEDKGGLSKLVEAIRINY 
NDRYDEICHHWGGNVLGPKSVTRIAKLK 
KAKBJCELATNWI KCTLLS FLYI KR 


8879 


22780 




8944 


306 


150 


AASTLCPRLECSGAIMftHCSLNLLGSGD 
PPASASQVGQARGSLEPGRWRLQ 


8880 


22781 


A- 


8945 . 


284 


2 


GKSRRS PLDLCTSGFTGSTHFTLI ICW 
PRYVATCPPIQLNFVFIKTWWSHYIVQA 
DLKLMVSSNPPVSASQSAGNTGMSHHTW 
LENRLLTTKK 


8881 


22782 


A 


8946 


1 


226 


WSFALFAQAGLEFLTSGDPPASASLSAG 
ITGMSHHAQPAVSLNSIFSASTFCATSS 
YCAKMELKELKFMLIVDAA 


8882 


22783 


A 


8947 


363 


96 


NTAPGYMPLLSIPFLVLSKAVFLFLLGM 
FTAALLIIAQRWEQPKCiASTDEWINKMW 
HMHTMEHYSTCCGVHAAVNISIQISEWN 

CWEW 


8883 


22784 


A 


8948 


1 


208 


GRVGQPGLRLLTSGDLPASASQSAGVAG 
VSHRAQPTPALNCVLTCWVLQHYI KHWH 
GKRSEYFTIFNDM 


8884 


22785 




8949 


69 


298 


SRGHQRVADAALLSFPDVICLGLMLWGS 
SS IVCILHRHKRRVQHIHRTSVSPTSSP 
ESGATKTILLEQERTRSPRS 


8885 


22786 




8951 


2 


201 


LYDLLNMRS ISLPLPSLVCIHSHTHEHT 
YTH IHTHAHTYTHTHSHWVLS LKQKKMV 


8886 


22787 


A 


8952 


1 


285 


GASPPASLPPCSLISDCCASNQRDSVGV 
GPSEPGAGYNLWRRVLSPSEKRSIRVG 
VTRFSRCRPSPLSLTRKGNSLTPCASRV 
RQCLTLLWLVH 


8887 


22788 


A 


8953 


341 


256 


LGLGGGGCSEPRWHHRTPAWVTERDFIS 
L 




22789 


A 


8954 


227 


43 


ILFFIGKHTHTHTHTHTHTHTRAKCLFP 
PARFPTSPNSTTNCTFPTEPAAWPRGLV 


8889 


22790 


A 


8955 


310 


79 


RPRRPGLMERVKVFIFSILRESFYKIRI 
TNQAQCCMPIVPVTQETEAGRSLEPKSL 
RLAWAIQQNSI SKKNNiraNFF 


8890 


22791 


A 


8956 


374 


198 


FFFFLPNRHGLAMLPRMVLHWVQAVLPQ 
PPKVLGLQG 


8891 


22792 


A 


8957 


138 


10 


RPRRPRLCQAQWLMPIIPALWESVAGGL 



1088 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspun 

last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
JE=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, t,=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R— Arginine, S=^erine, 
T-Threonine,V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














LEARSLRPAWFFFNT 


8892 
8893 


22793 
22794 


A 
A 


8958 
8959 


354 
364 


216 
223 


AGPRVFPGGEGGFRPPPPGGSPPQGPPK 
GGESGGKPPPPAHWVFF 


8894 


22795 


A 


8960 


259 


11 


FSALKDPSW\iLQS LFS FIVTVLTVAHVY 
EKLSTLTSLAARRGliAMLPRLASNSRTQ 
TILLPWPSKVLRLQARATEPGLGLLL 


8895 


22796 


A 


8961 


334 


213 


AASTKNFERMEWGLTTLPMLVLNSWPQV 
ILLPWPPESMRL 


8896 


22797 


A 


8962 


260 


3 


KGGfFPIPl'ieiKPGFFFSPALIFPKVF 
I FSQNFFVFLKKIFFFFFVBTVSSHWK 
AGLELLASSNPPAFTSQSAGMTQiyiSHHA 


8897 


22798 


A 


8963 


425 


248 


ADEWIHKMRHIHMIEYYLAIKRNEMLIH 
ATAGMHLEWILSEKDHMYDSIYMKCLAW 
AlO, 


889S 


22799 


A 


8964 


1 


118 


GKLMELHGEGSSSGKATGDETGl^EEA 
DGYEPPVQESV 


8899 


22800 


A 


8965 


2 


130 


GRVGPRMAUJSWAQVIHLPWPPKVLGLQ 
VAMKRKIMILSTSY 


8900 


22801 


A 


8966 


119 


364 


LGWYLIEKIQICCSPCLGADPFQQMISP 
LRDGSLTVLPKLVLNSWPQVFLPPQTRK 
VLGLQAEVTAPGHIIFMIIMINTFK 


8901 


22802 


A 


8967 


238 


333 


TRCIiRLLPRLVLNSWTQALLLPSPPKVL 
SLQV 


8902 


22803 


A 


8968 


297 


169 


NPPPPHIYKFFFIFNIFFFFFFFFFFFF 
FFPFPPPPMSKILIP 


8903 


22804 


A 


8969 


2 


212 


rvrprdRplvrvrvgrnkdg&ettpspg 
llpahltfpiidyhlnqpfifglrdtdtg 

ALLFIGKNMDPRGP 


8904 


22805 


A 


8970 


242 


357 


TLDIIKSFKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKI I 


8905 


22806 


A 


8971 


191 


265 


I LMPXKKKKKKKKKKKKKKKKKKKR 


8906 


22807 


A 


8972 


398 


97 


MFLYNRILWFFSPHPPLRFFFSP3ATIF 
LGGGAPHFPPPKKGFFSKKPPGVFLSPP 
QKEKIFSFPPPLFLAPPKDFFKSPPPFF 
FFFFFFFFVSTGSSVS 


8907 


22808 


A 


8973 


370 


265 


YNSSLPSSWDYRHVPPCPDNFCIFSRGW 
SQTPDLR 


8908 


22809 


A 


8974 


3 


119 


GLKLLTSSDPPASASKTAGITGVSHHTQ 
TIFVFSSHKS 


8909 


22810 


A 


8975 




110 


VAQAGFKLVSSANLPTLASQSARVTGVS 
HRARSMKQ 


8910 


22811 


A 


8976 


3 


152 


WNQPKYPSTGEWMEKMWYIYTKLVQPMA 
HGPHVAQDGFEGS PTQIHKLS 


8911 


22812 


A 


8977 


375 


193 


FQDLPLGSTSNISFFKRWSPAMVPRLIS 
NSWPQEILLPQTPKVLGITGMSHCTWPG 
GHISI 


8912 


22813 


A 


8978 


2 


395 


RAALPTQASAMAGAPPPASLPPCSLSSD 
CCASNERESVGVGPSEPGAGYKLLVCYL 
LRLLEKRSIRVGVTRFSRCHLSPLSLTR 
KGNSLTPCASRVRQCIjALLWLTLSALYP 
LSCTHCQTLPSEMNLVPQL 


S913 


22814 


A 


8979 


120 


19 


ARVQWHDHGSTATLNSWPQVILLPWPPK 
VLGLQA 


8914 


22815 


A 


8980 


204 


48 


LNRD KGLTMIiARFVRNS WPQAI PL PWP P 
KVLGLQALATAPCQMSFLHAPLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, F=Proline, 
Q=Glutamlne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8915 


22816 


A 


8981 


335 


221 


FFFLFLFFCMITSHCSVNVLC3ASDSPTS 
VSHIAGTIGT 


8916 


22817 


-A- 


8982 


360 


150 


LSHMPLLFVSLYPDHRAFMQVI FFFFKT 
ESPSVTRLECSC3AIIiRHCNVCLPGSSDS 
PASASYAGYSFYL 


8917 


22818 


A 


8983 


346 


116 


GQGRGVETESCSVTQAGVQWCDHHSLLS 
LELLGSSDPSTSASPVTGTTGGHYHTSS 
HFLFSVSSSYFPTPVCLAYR 


8918 


22819 


A 


8984 


350 


263 


LPMLPRLVSNAWPQAIFLLWPPKVLELQ 
A 


8919 


22820 




8985 


3 


159 


EEYNLLVCRLLSPLEKCS IGVRVTQFSR 
CHVSPLS LTRKGNSLTPCASWVR 


8920 


22821 


A 


8986 


2 


253 


KHWGELKNDLSICGTLHSVDRYLNIKL 
TNISVIDSEKYPHMLSVKNCFIQGSWR 
YMQLPADEVHKQLLQDAftRKEPLQQKQ 


8921 


22822 


A 


8987 


2 


115 


GARIVGHLTHALKQGEYGLASICNGGGG 
ASSMLIQKL 


8922 


22823 


A 


8988 


3 


159 


GFYHIGQAGLKLLMSGDLPASASQGAGI 
TGLSHCAQAGIFFFLPYDIFTVF 


8923 


22824 


A 


8989 


1 


123 


SHYVAQAGPKLLGSSNPPASASRSAGIT 


8924 


22825 


A 


8990 


462 


303 


LARTPSRPTRPPTRPPTRPPTRSPTRPP 
FMLSVDCLLLY 


8925 


22826 


A 


8991 


382 


178 


PQKFFFFFFGGKPVKNFQKFFLRAKGKG 
GEKKPPQNKKVF 


8926 


22827 


A 


8992 


532 


432 


RDAGLTMLPRLVSNSWAQVLLLPGPPKV 
LTLQV 


8927 


22828 


A 


8993 


418 


172 


KKGKKKKKNIFLIFFiSYFFFLI FFFIFF 














FFFFFFGGAWSLFRGGDSGEVKGTDSMS 
RQCFQGF 


8928 


22829 


A 


8994 


2 


722 


AVRLNISYPPQNLTMTVFQGDGTASTTL 
RNGSALSVLEGQSLHLVCAVDSNPPARL 
SWTWGSLTLSPSQSSNLGVLELPRVHVK 
DEGEFTCRAQNPLGSQHISLSLSLQNEY 
TGECMRPISGVMLGAFGGAGATALVFLSF 
CIIFWVRSCRKKSARPAVGVGDTGMED 
ANAVRGSASQGPLIESPADDSPPHHAPP 
AIATPSPEEGEIQYASLSFHKARPQYPQ 
EQEAIGYEYSEINIPK 


8929 


22830 


A 


8995 


498 


323 


epkaelnegdiaWhikreegwfkgtlq 

rkaktgllpgs fveni 


8930 


22831 


A 


8996 


21 


511 


LI idgveayalnasgwni IVFDPKGWA 
LFRS fkavkekldtrrgsnse letavkd 
lgkavsykgmygdvaiwysgqyvengv 
kknflpdntmvlgntqarglrtygciqd 
adaqreginasarypknwvttgdpare f 
tmiqsaplmlladpdefvsvqla 


8931 


22832 


A 


8997 


178 


306 


gainainwppfnnvgmgktlgfpnl^4al 
kqipfspvgektkrr 


8932 


22833 


A 


8998 


351 


211 


S IQSLRMQPVPCY ISGRHHHYHSHHHIH 
HHHYSSYFLKYPQEYUiPL 


8933 


22834 




8999 


1 


381 


FRLFKFALMELCS lEGISDQPLFFVFGF 
FFFGKEAS FGPQGGREGGKFKLMEPLAK 
GVKGISCPKPPHPCLNPPKPKRGGESSP 
PEGPKKTGAPGPAKKGLFFLTKGKNPWP 
GPGKTRKKAVFPKNF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=ThreDnine, V=Valine, 
W=Tryptoplian, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8934 


22835 


A 


9000 


173 


399 


IKSEWAAKKKKKKKKKKKKKKKKKKKKK 
KKIFFFGGGRFFFFWGGDYFWGGGGVKK 
KG6GKKNPGPQKKNGGGKK 


8935 


22836 


A 


9001 


2 


81 


EFTPAVHASLDKFLASVSTVLTSKYR 


8936 


22837 


A 


9002 


3 


835 


DAVRANDDLKENIAIVERRNNLLQAELE 
ELRAWEQTEHSHKXAKQELXETSERVQ 
LLHSQNTSLINQKKKMDADLSQLQTEVE 
EAVQECRNAEEKAKKAI TDAAMMAEELK 
KEQDTSAHLERMKKNMEQTIKDLQHRLD 
EAEQIALKGGKKQLQKLEARVRELENEL 
EAEQKRNAKSVKGMRKSERRIKELTYQT 
EEDRKNLLRLQDLVDKLQLKVKAYKRQA 
EEAEEQANTNLSKFRKVQHELDEAEEKA 
DIAESQTOKLRAKSRDIGTKGLNEE 


8937 


22838 


A 


9003 


429 


280 


RGFRGHKFLEPPGFPGFPPIMGPPSPPF 
GSPPQKNFPLQPKPPLGFKPPP 


8938 


22839 


A 


9004 


2 


108 


LSAYQGTPLPANILDWQAIWYEIRGYVI 
IKPLVWV 


8939 


22840 


A 


9005 


3 


158 


SSTQGHKHCGRPQGPLPRKTRDLCSPVY 
LLTFSPLLSYDPVTSPSPRNTQE 


8940 


22841 


A 


9006 


3 


370 


LLLTGLNRLTTDLYSLYIFTTTQWGSLT 
HHINNIRPSFTRENTLMPIHLCP ILLLS 
LSPDIITGIFSYTRHYWSSTYLETNER 
GGSYKNRLGGPESKGLRIKDDSLCWGPL 
HDTSAYYENK 


8941 


22842 


A 


9007 


363 


76 


KLFNVGGGGLDKPPPPKKFFFFKKNGLF 
FFYPLKKKKNFFFPRELLWGPPRFFIKT 
PPPYFFFFFFFFFFFFFFFFFFFFFFFF 


8942 


22843 


A 


9008 


1 


87 


KMLKKPKFELGKUVIELHGEGSSSGKATG 
D 


8943 


22844 


A 


9009 


420 


3 


RFYFPKPRKRGGGGVFFFFPPKKKFFFL 
NPPPFFFFPPPKKKKNIFFPPRKC3GRGS 
KVQTMGPPLFFFFFFFFFFFPFLFFFFF 
FFFFFFFFFFFFFFFFFFFLRETSSNNV 
lAYQIENSLLKLDLQRQFCLPLPGRV 


8944 


22845 


A 


9010 


2 


74 


LHRLGGPEAGLGEYLFERLTLKHD 


8945 


22846 


A 


9011 


33 


132 


IDPRCRNSARAPLSLAGPHPGM6DASKH 
MGQM 


8946 


22847 


A 


9012 


83 


607 


RRSGTPCGYNVTSQNGTIYSPGFPDEYP 
ILKDCIWLITVPPGHGVYINFTLLQTEA 
VNDYIAVWDGPDQNSPQLGVFSGNTALE 
TAYSSTNQVLLKFHSDFSNGGFFVLNFH 
AFQLKKCQPPPAVPQAEMLTEDDDFEIA 
ITGEGKPTRMPQSRKKSC3H1ISVLISAK 
LKRRJFL 


8947 


22848 


A 


9013 


483 


364 


PPPPPPFLKTPAPGIFFPPPLGGIGARPP 
PPSFFFFFFFFF 


8948 

8949 ^ 


22849 


A 


9014 


74 


180 


LLSTYVGRLSARPKLKAFLASPEYVNLP 


22850 


A 


9015 


488 


317 


HVGQAGLVLLTSGDPPTSASQSTGITGV 
SHRAQPLRSFSFVLSLLQKRIVLC 


8950 


22851 


A 


9016 


149 


13 


DERGGLQAVAHTCNP S TLGGRGGW I TRL 
3DGDHPSCRILAGDASQ 


8951 


22852 


A 


9017 


20 


107 


FDSTADSDQVNPIQGLASKWDYEKNEWK 
K 


8952 


22853 


A 


9018 


448 


239 


VDHQQVIWNRERI SNSQNG IVKEI KGAD 
TFIFGHTPAVKPLKFAiSfQMYIDTGAVFC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

peptide 
sequence 


Predict- 
ed end 

Dtlde 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cy5teine, D^Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GNLTLIQVQGEGA 


8953 


22854 


A 


9019 


3 


209 


CPPLSPGCTNSARAEPSVHALPDALNNL 
RTHEGSGDGPSSSVDWNRPEDVYPQGIY 
VISAPSIYAREA 


8954 


22855 


A 


9020 


504 


20 


NYAQMSYSTARASAMESWAYFMGRRKFV 
ASRQASQMFLCWLEEAIWRWTLPSKA 
HFSFQEARSAWGNCDWIGSGRMAIDGLK 
EVQEAVML I EAGLS TYE KE CAKRGDD YQ 
EI FAQQVRETMERRAAGLKP PAWAASAF 
ESGLRQSTEEEKSDSRPAAGS 


8955 


22856 


A 


9021 




168 


LPRLEAAARIRHEERERERERERERERE 
RERERERERERERERERAPPPTLCET 


8956 


22857 




9022 


2 




LCSLVYLLTFPPLLSCDPAKSPSARNTQ 

E 


8957 


22858 


A 


9023 


508 


371 


FFCRDKYLATLPRLIiANRWPQVIFLPQP 
PKVLRLQPCTTPSLKDL 


8958 


22859 


A 


9024 


2 


270 


RTHSAGRPYQCNQCEKAERHSSSLTVHK 
RTHVGRETIRNGSLPLSMSHPYCGPLAN 


8959 


22860 


A 


9025 




185 


PPAATPQGPSPINSPPTKKKAKHBCNRTK 
KTKKKKKPPPSKKKGPQKTPKKTMGGPL 


8960 


22861 


A 


9026 


104 


319 


LVFLSKYTPPLLVSLWIHGLSLLSFLPS 
LPSFLPSLLTYLLPSFLPSFLTSVINPL 
PSLFFLVFKLSQFWT 


8961 


22862 


A 


9027 


476 


364 


SLKRLTSGDLPASASQSAVITGVSHRAR 
PIIYFKYVQ 


8962 


22863 




9028 


36 


154 


FLVGQAGLKLLTSGDLPASGSQSAEITA 
VSHRTWRTHYF 


8963 


22864 


A 


9029 


2043 


1237 


WLRTRVAPALPERLGACTQLGPVLPCSQ 
PYWCRQCPEYRRQJiAQPPHCPAPEGEP 
OAPQALGDAPSTSVSLTTAVQDYVCPLQ 
GSHffiLCTCCFQPMPDRRAEREQDPRVAP 
QQCAVCLQPFCHLYWGCTRTGCYGCLAP 
F(3:USICDKCIJX3VLiqiiINSYESDILBaSfY 
LATRGLTWKNMLTESLMALQR6VFLLSD 
YRVTGDTVLCYCCGLRSFRELTYQYRQN 
IPASELPVAVTSRPDCYWGRNCRTQVKA 
HHAMKFNHI CEQTRFKN 


8964 


22865 


A 


9030 


15 


389 


IVDHYPCGGIPTSGSVTQaGAQWHNLGP 
LQPRPLGLEGSSCLSLICBWGHRH&PPH 
LARCaiFSSEVEQLRSRLQHGAVFVKQRK 
GTPEDPRQGLRTLISSCIiLHLVALALWF 
SLPFHGPRIHMQ 


8965 


22866 


A 


9031 


3 


243 


TUISaFMFSCLSLPSNWDCaiHPPSCPA 
KFCTFVEMEFHHVGQAGLELLTSGDLPT 
LASQSAGITGVSHHAWTRCCCCF 


8966 


22867 


A 


9032 


1 


245 


PTRPAGIAICICMCMKNHRATRVGILRT 
THIWTVSSYPGPPPYGHDHEMEYCADLP 
PPYSPTPQGPAQRSPPPPYPGNARK 


8967 


22868 




9033 


46 


124 


QELMTHGAKSPDGTRVHNAFLFVTTV 


8968 


22869 


A 


9034 


235 


435 


KKKKKKKKKKKKKKKRRPFKLKGPFKPP 
LKPQGGLFEAPPFGGPPPPGFFFAGGGA 
PPRGFFKKKGK 


8969 


22870 


A 


9035 


333 


70 


CPIYQIPILKDQVNLSSISSPFSHLIKD 
LWKTLFISYITELIFVCFVEMMFHHVAQ 
AGLELLGSSNPPTSTFQSARITGISHHA 
CLNI 


8970 


22871 


A 


9036 


297 


411 


NKTMSFAATWMQLEVILSKLMQEQKTKY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

location 
correspon 

last amino 

residue of 

s^q'uetce 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leudne, IVI=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 










46T ~ 


167 


CMSSLISGS 


8971 


22872 


A 


9037 


NKNYPLLGGPPFGSPSPENLNGGFPLTP 
GGGGPINPNSPPPPPPGGRRPPFLKKKK 
KKKKEKETNTYILQDTCDRCSAVLPIMQ 
VKIVMICLIMHRLF 


8972 


22873 


A 


9038 


466 


292 


NGCLLEPASWNSLMIIFIFCREEGLTML 
PRPQVILPPLPPKVLRLLPPPLSS 


8973 


22874 


A 


9039 


34 


124 


GIRNIKFSSLSTFIAHKKYYCSSHAAEH 
VK 


8974 


22875 


A 


9040 


54 


239 


RRRRKLRRWGLAML PRLVSNS WHQAI L W 
PPKVLRLQACATTPGr.TRRWLHGQIWSP 
GAGSV 


8975 


22876 


A 


9041 


469^ ^ 


223 


PWARVCVCVCVSVCVCACVRVCARASES 
QTCAAVPSLGPAGAGRPALGCCAVCEDE 
VGSCPHARPLPRAAVDBCAAGRRGLCF 


8976 


22877 


A 


9042 


491 


380 


SPGWAHSYNPSTLDGRGSHITRVRKFM 
TCLATEGAV 


8977 


22878 


A 


9043 


144 


360 


IHSHWGYWLDVSLIAAHLAINPDCFMYV 
YIFSQCTDKLTEKEFIEGTIANKEILRL 
IQFEPQKVKEKMKNA 


8978 


22879 


A 


9044 


23 


263 


RWNSDGKGSLRPAWATWPDPVS IKNTQN 
THTTHPTPHSPHTPHGQQPFLSPTPFPW 
PVSSPSPFPPFSHTCLSCLLP PGA 


8979 


22880 


A 


9045 


461 


287 


PPSLANIYLFIYLFIGRDGVSLFPRLVS 
NCWAQAILLPWLPKVLGLQGEPPNLALK 
SL 


8980 


22881 


A 


9046 


485 


6 


NNKNSWAWWRVPVLPATREAGAGESLES 
GRRRLR 


8981 


22882 


A 


9047 


475 


307 


LFLRDGSLTVLPRLVSNSWPQAILLPWL 
LKGL6LQARATLHMACSSCINSTQNILQ 


8982 


22883 


A 


9048 


483 


338 


LRE6ERQREQEREREREKERERDRERAH 
TTAFTQLLPQYVAIILLPCY 


8983 


22884 


A 


9049 


680 


303 


NDLLFGSLYVKQFAASGTGRVTLPEPSE 
EQWCLRCFQDLFGDDHNKNGFKMFDSS 
NYHGQDLLFKDATVRAVPVGEKTTYRGW 
LGLDYVAALEGMSSQQCSQAGRTGPPCR 
LPWCYSDPPTPEP 


8984 


22885 


A 


9050 


26 


165 


AFGYHRVGQAGLELLTSGDPFASASQSA 
GITGVSHHARSQYFFVYV 


8985 


22886 


A 


9051 


461 


83 


HKINLPSHKKKKKKKKKKKKKKKKKKKK 
K 


8986 


22887 


A 


9052 


475 


209 


MAAPPKKKKKKKKKKKKKKKKKKKKI 


8987 


22888 


A 


9033 


531 


404 


SVTQAAMQWQKHNSLQGSTLEFKQSSLL 
SLPTIWDYKHADLS 


8988 


22889 


A 


9054 


66 


416 


KKKTNKIKMGGGPLPFKISPPFLKKTLS 
WKKGAPKTPPLEMAPFFFPQGGKKNSPF 
KGGKIFFFFFPKKTFIKGPPGGEHTPPL 
GGGPQIFSLPPPKGVPQKNPPPLLRGGL 
FFTI 


8989 


22890 


A 


9055 


1 


104 


ILAVNGETLEGVTHEQAVAILKHQRGTV 
TLTVLS 


8990 


22891 


A 


9056 


3 


106 


LKRWGIAMLTGLVLNSWAQVILPSWPPE 
VLGLQV 


8991 


22892 


A 


9057 




299 


PTRPINIILAFTISLLGILVYRSHLISS 
LLCLEGI ILSIjFI IATRI YPIAPPLPPL 
EPEKKKKKKKKKKKKKKKKKKHBCKWTKK 
RKIKaKHKNI.GGGGL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 

otide 

location 

correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=:Arginine, S=Serine, 
T^Threonine, V=Valinc, 
W=Tr>'ptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possiblc 
nucleotide deletion, \=possible 
nucleotide insertion 


8992 


22893 


A 


9058 


482 


146 


LTS S KKKKKKKKKKKKKKKKKKKKKK 


8993 


22894 


A 


9059 


2 


290 


YFLFAYTILRSVPNKLGGVLALLLSILK 
KKKRGGP FKRTLGGP KFNRGGQGNI FF L 
MGGGLKPNLGILGRNLSLGGGKNWPNLP 
PKIKGFRENKNF 


8994 


22895 


A 


9060 


413 


48 


PQKKGFFQKNPKGFLKPPPKKKKKKFPP 














PKFFFFFFFFFFFFFFSIENPPSSYTKW 
LRSLDVCHLLPP ITPQVPGPTPVLLFST 
LPDCHHSLL 


8995 


22896 


A 


9061 


2 


274 


FTISLLGILVYRSHLISSLLCLEGIILS 
LFI lATLITLNTHSLLflNIVPIAILVFA 
ACBAAVGLALLVKKKKKBSRPFKRILGG 
PQINPG 


8996 


22897 


A 


9062 


42 


255 


QEFRTRRERERERERERERERESSGWR 

AKFHSNFSAGAIAHTIRVMLPPSGIYSN 
EKSINKCGFALLFF 


8997 


22898 


A 


9063 


20 


413 


YALRNSARAGDPHGQLGRRDGAVGGRGW 
LMPREALHPHWaUVQAWaGSPGLFLGAP 
WPQGGVWLPWLCVSMSASSSSINQLPPQ 
KKKKKKKKKKKKKKKKFWGGGPGPQSPF 
WGGLKKFGGRFLKRGGGEN 


8998 


22899 


A 


9064 


6 


306 


ECEHSTTYSHFAGGGGPVMVGGDADARS 
KALLGVCVGSGTEAYVLVLDPHYWGTPK 
SPSELQAAGWVGWQEVSAAFDPNSFYNL 
CLTSLSSQQQQRTLD 


8999 


22900 


A 


9065 


463 


126 


AHYQLVYTCKVCGTRSSKRISKLAYHQG 

WIVTCPGCQNHHI lADNLGWFSDLNGK 
RNI EE I LTAr^GE Q VHRVAGEGALE LVLE 
A^^GAPTSTAAPEAGEDEGPPSPGKTEPS 


9000 


22901 


A 


9066 


23 


244 


SCHAGIRHEERERERERERERERERERE 
REREREIYFLALFSQKGRVHGFFKTPRG 
GQKRELLSNRVNTLCTE 


9001 


22902 


A 


9068 


396 


143 


KDIQNPOQAWWLTPVIPJVLWEAEAGGSR 
GQE lETILVNMVKPHLSVLIAKNSSSHC 
PMCWLAELQWYDLCSFQKIFFGIILIPG 


9002 


22903 


A 


9069 


828 


99 


VDREPVDREPVS/CHPDLEERLQArtPAEL 
PDEFFELTVDDVHRRLAQLKSERKRLEE 
APLVrKAFREAQ I KE KLERYPKVALRVL 
FPDRYVLQGFFRPSETVGDLRDFVRSHL 
GNPELSFYLFITPPKTVLDDHTQTLFQG 
EPLPQRSGCSILaAEEPAGVyLEPGLLEH 
AISPSAADVLVRRYMSRAAGSPSPLPAP 
DPAPKSEPJUiEEGaLVPPEPIPGTAQPV 
KRSLGKVPKWLKLPASKH 


9003 


22904 


A 


9070 


20 


427 


DGTRRERERERERERERERERLYLTEDT 
LHSLSLEOaRSGASVCVQSVYPTGASAP 
HSI INGHKRVLVLCVETEHTFPIiftEKDR 
AFLGDTSVFVQRAPVGGANTGASAGGAP 
PSRGEPKEESGESHEDMGPQLFV 


9004 


22905 


A 


9071 


297 


126 


CSIYDKRIPTFGLGEGVSERQRLTLLLP 
CLPPALTETPTSSDWQSTDATPTLTNSS 


9005 


22906 


A 


9072 


117 


286 


WLRGSEKMFQVQEINEQYLQCILIKGI 
FLVKKKKKKKKKKKKKKKKKKPRPEYIK 


9006 


22907 


A 


9073 


3 


302 


P PMETLRQWTT IVHSQEEEEEHFEVIKD 
EVKWARKH6QPGTPVAIATQLPPRTSA 
AFPAQQQPLQPLAPTTVLSVSADWNALL 
QRPNKYIWGLQYLEM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last\mino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=l]nknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9007 


22908 




9074 


437 


79 


NQKREEEEERRKKRRRRRRKRRRRKKKK 
KILRQKKKKHRKAGSREKKKILRHLPEK 
NKESKKHYELINALSRLQDTRLIYQKMH 
FYILAKWNWKIKLKIQYHLQALENKVPR 
HKWETYA 


9008 


22909 


A 


9075 


51 


185 


CRADFMFFHDKI IVYLCSLVYLLTFPPL 
LSYDPAKSPSAi?NTQE 


9009 


22910 


A 


9076 


452 


310 


QFSRCHLSSLSLDRKGNSLTPCTSWVRR 
CLALLRLTLGALHLLSDNPQ 


9010 


22911 


A 


9077 


438 


326 


LLHCWWDCKLVQPLWKSVWRFLRDLELE 

IPFDPTLHF 


9011 


22912 




9078 


37 


169 


GIPGFRHAGQRRLQLLITGDLPTSASHS 
AGITGTSDRARPPSTF 


9012 


22913 


A 


9079 


1 


408 


HKGRLSKEE lERMVQEAEKYKAEDEVQR 
ERVSAKNALESYAFNMKSAVEDEGLKGK 
ISEADKKKVLDKCQEVISWLDAIsITIiAEK 
DEFEHKRKELEQVCNPIISGLYQGAGGP 
GPGGFGAOGPKGGSGSGPTIEEVD 


9013 


22914 


A 


9080 


31 


168 


VRRNPRLEHTVTITAYCSLELLGSSNLP 
ASASQVARTTGLQLLNLQ 


9014 


22915 


A 


9081 


460 


319 


YFFVEMGSHYIAWAVLELLGSSNPPASA 
SQSAEFTDAHKPTDCRPLR 


9015 


22916 


A 


9082 


465 


326 


P CVAE FCTHTHTHTHTHTHTFKC VGSHL 
HTMND 


9016 


22917 


A 


9083 


33 


269 


PGNSRPISSVRGIHPPCLANFCFLNVFV 
EMGSHYWLAGLILGSSDLSALAVQSGR 

VTGMSHCAWPEIFPLLENWIIL 


9017 


22918 


A 


90S4 


47 


300 


DLITSLFlLKQSSLATMNIQTPFKjriYF 
DLIIKTLJLaFFIFCFIYFYFLFFSYGG 
FFSFFFFFLKGSSFLPPRLNGGAKIWW 


9018 


22919 


A 


9085 


437 


220 


NIAIKSIiSLSLSLSLCVCMCVCVSACVC 
MCLSLSLFSPSSLSHSLFPFSTPNimiS 
HLPILSTQENMLLLLQ 


9019 


22920 


A 


9086 


227 


23 


DCPKftNILGLTPRLVSNSWPQVILTPWP 
PKGQSLLCRQGEGDRIWTWHPWWLLHPP 
PQARFPPRTHFH 


9020 


22921 


A 


9087 


206 


326 


PHWAQWPTPVIPALWEAEAGKSQGQEFB 
TSMMTVKPRLY 


9021 


22922 


A 


9088 


364 


123 


SRWLPLKAPGEDTSLPLPASGGSRCSL 
AWGSIPPISASIPTWPSVLCPCEIFPLL 
FLLYGDCNLGPTLIPDDLIMRSLP 


9022 


22923 




9089 


373 


205 


MPPRLAHFCYFSRDWSPYWSGWSQTPC 
WDYRREPPFPPCVRTSYSVQYCACYAGL 


9023 


22924 


A 


9090 


283 


2 


REFSFSAWLAFKIFWKLFSPPFWRVRP 
EDWFI PGGCGPNEPNFPPCPPGVGKKGE 
PRFKKKKKKNFISSQAQWLMPVIPAFWE 
AKAGGSLEP 


9024 


22925 


A 


9091 


16 


261 


CPEDLLVGQNVKVEDSDMGDPDRRPMCM 
FACFYLLVYRRCLAMLPRLYLNSWAQVI 
LLPQPPASLELLQAHATTHPALCLSF 


9025 


22926 


A 


9092 


17 


109 


RQGLAMLPRLISNSWTQANLLPWPPKEL 


9026 


22927 


A 


9093 


282 


114 


KKGGKGALLVFIiKTGRKPFFFFLRQGIiA 
MLLRLVLNSWAQAILLPWPPKVLGLQA 


9027 


22928 


A 


9094 


205 


337 


NANLHGRMQRLTPCLPEIWEAETGGSQG 
QEIETILANMVKPCLY 


9028 


22929 


A 


9095 


62 


287 


EHSCAKLFCFAFESGPYSVAQAGVQWHD 
HSLLQSQTPGLKQSSCLSPPASAGFYKE 
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SEQID 
NO: of 
nudeotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=lsoleueine, 
K-^Lysine, L=Leucine, M=i\Ief liioniue, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonlne, V=VaIine, 
W=Tryptophan, V=Tyrosine, 
X=TJnknown, *=Stop codon,/'=possible 
nucleotide deletion, V=posslble 
nucleotide insertion 














SPSFYKDKSCHHLGKHSGA 


9029 


22930 




9096 


170 


9 


AASTGYCIGRFCTKLKIQSGGQVEWLSP 
WPKREAEVGGLLEARSSRP IWATW 


9030 


22931 


A 


9097 




104 


ELRPGGRAEPEARTOGDRDPPVLLRHYPH 

IFEECG 


9031 


22932 


A 


9098 


350 


142 


LCLS CPKGSGYHAI4APWRGNFCSFGKNG 
GFPKLGRWLKWPQMGPLPQPPKHVGFQ 
GLSPPPGPPPFLV 


9032 


22933 


A 


9099 


494 


342 


YFYFYFVEMVSHHVTQAGLELLASSWPP 
ALISQHAVITSGSHHTQPKLYF 


9033 


22934 


A 


9100 


30 


232 


GG I MLCHGRCACFP FAFCHDCKFPKVFQ 
PCFLYSLQNCSFLYKIPSLRSCRLQLAE 
LLSIIYLENPL 


9034 


22935 


A 


9101 


26 


171 


RNSRRLRHKNRLSLGGGGCSEPRSCQCT 
PAWQTARLSLKNKKQQQKKP 


9035 


22936 


A 


9102 


486 


239 


HHTCPHQVCVCVCVCVYRVCVCVCVCVY 
lYMYMCryiYIYIYICIYICRERERERE 
REMWFHHVAQAGLELLGSRDLPAPAS 


9036 


22937 


A 


9103 


487 


344 


TGFHHVGQAGIELLTSSDLPASASQSAH 
ITGVSHHAQPLFMI FCAKC 


9037 


22938 


A 


9104 


470 


221 


TPGLKRYTCLSLPKCWDYRCEPHARQGA 
FLCAFSPLSSQQPWGKRPHFHFSYETNK 

VERGKRLPLRSPSTEQGFDFRCVLFQT 


9038 


22939 




9105 


468 




YVFF CRDGGLTI SS RL'TLNS CPQQSPCL 
SLPKCWDYRHEPPCPSCCSNLLLMFHFT 
MVFLIDSECSHFTP 


9039 


22940 


A 


9106 


466 


329 


RVAGTTGTHHHAWL INI PS VETGFHHVG 
PQVIQPPRPPKALGLQA 


9040 


22941 


A 


9107 


409 


306 


AGFELIiASSDLPASASQSAEITGVSHHA 
RPSLIF 


9041 


22942 




9108 


417 




MWAQADMESS KEPGLFNWI INDSLVQ 
AYPELKERLYEEIKKAQRRGA 


9042 


22943 


A 


9109 


533 


399 


RGSLRHETRLNAEARRGSELNSRQCTPA 
WATERDAVSKKTKQKA 


9043 


22944 


A 


9110 


16 


239 


LVHFEVRWNSHEVSLTADSRKSHLIFPQ 
PNYADTLINQESYEKSEPLLITQDLLET 
KGEPRQLQVSFFPPKREE 


9044 


22945 




9111 


68 


219 


EPPHIiANFLQRLVFTMLLTLILNSWPGV 
ILPPGPPTVLGLQDKLPVLLPS 


9045 


22946 




9112 


192 


57 


DRVPLCPPDWSAVARSWLTAASVSQVKG 
ILSPQPTECAPPGNST 


9046 


22947 




9113 


21 


201 


TIVSWEFMNLGGGTCGEPRWCHCTPDW 
GTERDSVSGEKKEDICDVftRRHWVGSFYN 
GPQ 


9047 


22948 




9114 


2 


144 


TFT IDRCAKDLFVAKQVGTKLS VNEPLS 
FSVES ILKHPSSAITRVSQ 


9048 


22949 


A 


9115 


156 


316 


M ILNKKKKKKKKKKKKKKKKKKKKKKKK 
KGGGPFKKTLGGAKITPGWKNYFFFFMG 


9049 


22950 


A 


9116 


1 


84 


TKPTPFLPTLITLTTLLLPISPFILIIL 


9050 


22951 


A 


9117 


1 


176 


HLNLGGGVCSEPRSCHCTPVWATEILSQ 
KKKKKNGFKFWEPRPQKEGGGGRKKPPG 
VP 


9051 


22952 


A 


9118 




184 


AAAGYPGQQQNFHSVREMFESQRIGLNN 
SPVNGNSSCQMAFPSSQSLYRTSGAFVY 
DCSKF 


9052 


22953 


A 


9119 


69 


374 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 

otide 

location 

correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalaiiine, 
G=GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, Lr=Leucine, ]Vl=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














WGGPKIFVLRRVLSFFIRGEKKNTFGDF 
LKKT FFWGGE KLGKPPPKKKTPWGKKKI 
FKGERGKKPPFLGGKKK 


9053 


22954 


A 


9120 


405 


93 


EFSPLLLTTNQKRGFFPPSPPKIFFFPK 
RIYFLGGGGPHFPPPQKRGFPQKNQKGF 
FFSPPKEKKYWLPPPGKFCPPQGFFFPP 
PPLFFFSFFPFFFFRCQIK 


9054 


22955 


A 


9121 




380 


WDYR CPPPHPTNFC I FSGDGVS PHWPGW 
SWPQVICLPQPPKVLLTPWVRDRTRDLE 
GGDLWYGACLGACLHLGPQAAS FGHGIL 
GPASLLCC I PKHVHMDTLSPGVKLQS CD 
TGVALRVGEKREGS 


9055 


22956 


A 


9122 


2 


534 


DMIREQKIYHKYLAQRREEEKAQEKEFD 
RILEEDKAKKLAEKDKELRLEKEARRQL 
VDEVMCTRKLQVQEKLQREAKEQEERAM 
EQKHINESLKELNCEEKENFARRQRLAQ 
EYRKQLQMQIAYQQQSQEAEKEEKEREF 
EAGVAANKMCLDKVQEVLSTHQVLAQNl 
HPMPQGMPQ 


9056 


22957 


A 


9123 


389 


46 


GPNFLPPP I FGFFSPFStLKfFFSPKTL 
I FLGGFFP I FPPPKKS FFPKNS PGVFFC 
PPFKKKNFLFPSPLNFAPPRVFFKPPPP 
PFFFFFFFFFFFNFYRLYFIWVWGLAKV 
VW 


9057 


22958 


A 


9124 


422 


39 


AQQKKKKKKKKKKKKKKKKKKKKKKKKK 
K 


9058 


22959 


A 


9125 


333 


168 


KGPPClWcHVEFHKLEAGSaSVAQAeA 
QWCDGNLLQTLTLGLKHSPHFSLPSR 


9059 


22960 


A 


9126 


175 


89 


KPPPHFFFFFFFFFFFFYFFFFFFFIFC 


9060 


22961 


A 


9127 


368 


250 


CFNYLSRDRGLPLLPGLVLKSWPQAILL 
PQPPKMVGVQG 


9061 


22962 


A 


9128 


392 


157 


FFSPKHLNFGGGGGPKMPPPKKKVSPQK 
PHPGFFS PPQKKKKKFFPPPVKMGPPKS 
FLKSPPPFFFFFLISSIRGLPL 


9062 


22963 


A 


9129 


396 


13 


FPPPPPPIFFFSPRRFFFRGGGSVALSP 
PPKVFFLLNPPPVFFFSPPFKEKTSFFP 
PPFFCPPPQILFLAPPPYFFFFFFFFLF 
VFFFLFFFFLLPLLFLGKTSFLLLLFLF 
lYLETGSCSVAWARVQ 


9063 


22964 


A 


9130 


55 


180 








KKKKKEKyEKTKAY 


9064 


22965 


A 


9131 


2 


95 


SFVHMEHEKAVLLLICSFQNTVDLGIQRE 
LTV 


9065 


22966 


A 


9132 


172 


404 






PKNSWGGPKKRGGGKIFPFSLLGGKKKT 
LGVFLEKPLFLGGRKMAPPPP 


9066 


22967 


A 


9133 


1 


125 


QPVSYPFTIIGQVAYELYFTTILILIPT 
MSLIENKILKWAY 


9067 


22968 


A 


9134 


1 


120 


QPVSYPFTIIGQVASVLYFTTILrLIPT 
ISLIENKILKWA 


9068 


22969 


A 


9135 


7 


269 


lATGQPRLQKLTLDPTLIiLKRFLLTKKK 
KKKKKKKKKKKKKICKKBCKKKKWGGGGHK 
KKPPGGPKTYPVKKKIFFFFKGKrKKPP 
WGI 


9069 1 22970 


A 


9136 




64 


LVSISNTYGLDYVHNUSLLQC 


9070 


22971 


A 


9137 


400 


259 


S QPLRRLRHKNI IiNP GGGHYS ELGS CHC 
TPAWDTEQDSIHSWLLSSV 


r9071 


22972 


A 


9138 


3 


310 


ECFPNSPNMNGHCNLIiVQSTS PS I STM? 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
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26 


Predicted- 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l^Leucine, lVI=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CPKYVALEKVFLLNVLWFFYLKYRDNRP 
GSVAHTCNPSTLGGRGGWITGAQEFETS 
LGNIARPSLIKRKKYKLF 


9072 


22973 


A 


9139 


34 


142 


NCKKLNTHTHTLTHSHTHMLTHSDTFTH 
THTHTKYK 


9073 


22974 


A 


9140 


146 


2 


PPPPPPEGVSLCGPGWSAVARFRLTASS 
ASRVHAILPPQPPQQLPTRP 


9074 


22975 


A 


9141 


3 


102 


PPASASQSAGVTGMSHHAQPIEPLRCSS 
HRTT 


9075 


22976 


A 


9142 


245 


51 


NREGDLL.CCPRWSQTFGLKQSCLGFPKC 
WEYRRGPPHI,AYrMSMRICHILIQMIFYY 
NNPCLLYS 


9076 


22977 


A 


9143 


2 


108 


GAKCVSDRIKHAFLIKEQKI IVKVLKAQ 
AGSQKAK 


9077 


22978 


A 


9144 


3 


200 


MGFIflVGQAGLKLPTSGDLPASASQSAG 
ITGVSHRRWPVSGFFCSEAKECFWEEQL 
INLDKNSQK 


9078 


22979 


A 


9145 


405 


227 


WFFLGFPLKGKLGPPKMFFRAPRVYS PF 

FPPFFFWPPPPRIIFPPGFFKFFVFLAG 


9079 


22980 




9146 


387 


220 


PLFFVGCFFFFFFFFYYFFFFFFFFFFF 
FFFFFFFFFFFLGGCEHNFI IRKNIHN 


9080 


22981 


A 


9147 


539 


220 


VINFKHILIKTVIQSLFFFERGSCLPPR 
LECNSMIMftHCSLNLLGSSHPPASASQV 
AGTTGAYQHTQLIFIFFAETGSCYVAQA 
GLQLLGSSNPPPWPPEVLGLQV 


9081 


22982 


A 


9148 


3 


141 


GGGGCCEPRWSHYTPAWATSAKLCLKKK 
KKKRFLEKSCLTPLLKa 


9082 


22983 


A 


9149 


413 


240 


FIFCRDGGLTIVILPGLTSIJTWSQVILPPQ 
PPKVLGLQAYHADELPHLATWGVLYGLT 
V 


9083 


22984 


A 


9150 


395 


193 


LLFGPCLLNLITQFVSGHLLDIKLQMTL 
SEGYFPLNIQESPFYEftPLDDPSVRHRE 
GEILPISLGPG 


9084 


22985 


A 


9151 


1 


77 


LVAAIVLGGTGWGFKNIRRGRSGGA 


9085 


22986 


A 


9152 


355 


181 


IVDRPDPCRDADAQREVFTLSARYPKIW 
VP PGDPAREFPF IQ SAPVMLLAAPDELV 

SAQLA 


9086 


22987 


A 


9153 


373 


251 


FKKIFFCRDRAIiNMSPRLVSNSRTQAIL 
PPQPLKVLRLQA 


9087 


22988 


A 


9154 


204 


95 


DYHWSQPGMVAH&CNPSTLRGRGRWITR 

SGDRDHPG 


9088 


22989 


A 


9155 


44 


280 


AELFGTLMREDGLSLGGGGCKEPKSCHC 
NPAWATEPDCLRKKRKESPYLLWTPPD 
SSYSPFFFLLLPAPDNMDXFLS 


9089 


22990 


A 


9156 


2 


117 


ERLRHENCLNLKGRGCSEPGWCHCTPAW 
ETKQDSISKK 


9090 


22991 


A 


9157 


3 


207 


HASAHASADAWDYRRVPPCPAWFCILYV 
SHVGKAGLNLLTSSDLPTSASQSAGITG 
VSHHARTPTYV 


9091 


22992 


A 


9158 


117 


267 


NRVLLCCPDWPQTPGLKGSSLSLLSSWD 
YRHIPPRLAARSSYLGPGFYH 


9092 


22993 


A 


9159 


415 


167 


LFFFPIKNXFFFSPPLFFFFFFFPPPPX 
XXFFFFFFPFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFLELKQNF 


9093 


22994 


A 


9160 


217 


41 


SSQIHRDHMrRMVRQGAVaHaCNPSTFGG 
KGEQITRAQEFKTSLflNMVKPRLYYKPC 
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nucleotide 
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SEQID 
NO: of 
peptide 
sequence 
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NO: in 
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Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LY 


9094 


22995 


A 


9161 


341 


164 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFYVCF 
VSVELKKWQNYS FDVRI I LRVEP 


9095 


22996 


A 


9162 


190 


402 


IHTGKHPKCYRNSYSVLFLASKKKKKKK 
KKKKKKKGGGFKKKKNLWHGGGEI IFFF 
WGPKKIFGGGGLKKG 


9096 


22997 


A 


9163 


108 


209 


LSVKRCAKMPKKGKVAPGIKEIRTRSHQ 
DSEDI 


9097 


22998 


A 


9164 


2 


145 


KHCGRPRPQGPLPRKTRDLCSLVYLLTF 
PPLLSYDPAKSPSARNTQE 


9098 


22999 


A 


9165 


170 


402 


ISRCHFPSLILSYVKITKTIKAPKKSKK 
KKKKKKKKKKKKKKKKKWGGGPFKKKNF 
LGGGGGEIFFFLGTiPKKKRGG 


9099 


23000 


A 


9166 


399 


113 


WRLRWEDCLSLGGRGCSEPGSCYCTPMW 
VTEQDQSSGYEGFQRPSSLSKARIPSVA 
AERSMCQPACVCNLSVEHLSLiSCLIiAQ 
QGKALYKLLNK 


9100 


23001 


A 


9167 


3 

T34 


138 


S W I TGVCHYAWL I FAFLVE TGFHRVGQA 

DLELVTSGDATTLRSF 


9101 


23002 


A 


9168 


301 


EKLTQSHSVTQAGVQRL.WTiSLRPPPPR 
FIOIFSCLSLPSSWDYRNLETVLQEETH 


9102 


23003 


A 


9169 


201 


1 


GATILRFKTKKKNVFFKKKKKKKKRKKK 
GKGWSGTESHWNLSGEGSSGLHERKLQA 
PAGPQEFEKLA 


9103 


23004 


A 


9170 


2 


191 


FVPQPLPLGFKRFSCLSLLSGWDYRCTP 
PHLANFCGFFFLKFFFFFFKKKNFFSPR 
GRGGALI 


9104 


23005 


A 


9171 


2 


191 


GGTESCFVAQAAVQGCDLGSLQSPPTGF 
SETASQTVDPDKVKDPGKLRTMFVAELP 
KTIVTDS 


9105 


23005 


A 


9172 


3 


143 


SWFLYVGQAGLELLTSGDPPALASHSAG 
I TGVSHRARLQTG I FRAV 


9106 


23007 


A 


9173 


361 


174 


VLGIILSMSETEIFMTKCa:.DQSGTVflHT 
YNPSTLGGQGGRVIEAQEFETSLGNTGR 
PHLFKI 


9107 


23008 


A 


9174 


493 




LVETGFNHVGQAGLELLTSSDPPASASQ 
SAGITGVSHLALVLI SRKSC 


9108 


23009 


A 


9175 


1 


115 


FRKIWYIYTMEYCIAVKKKEMLSFTITW 
MNLEDFMLVR 


9109 


23010 


A 


9176 


286 


96 


PLFFFNRWGLAICLGWSAVAIHRRDHSA 
LQSQTPGLKRSSCLSLLSSWDYRRVPPC 
LANESHY 


9110 


23011 


A 


9177 


399 


237 


RNTKWLFFLRRSLTLSTGLECSGAI SAH 
CNLRLPGSSDSLAMGIFKDLALLDM 


9111 


23012 


A 


9178 


427 


101 


GRTGRQGRGAPHTSHDGRPGRDAPRFPD 
VMAAGKRRSSLPRWDGGRAETLLTFQTG 
QPGRGAPHI PDDGRPGGDAPHFPDGWA 
GQRLQSQHFGRPRQAAGRWRL 


9112 


23013 




9179 


193 


41 


IVFLFIYLFCFFVGTGPCYVAQAGLKLL 
VSSGPLASAPQHVGITGMSHHT 


9113 


23014 


A 


9180 


292 


119 


KITKINKQILNLSlfTLDQMDtTDICRIP 
HPTATEYRFFSRflHGTFFRSGTKNSAWH 

I 


9114 


23015 


A 


9181 


3 


279 


ALSPPYSSECCKLGSFNRELSSPSPGGQ 
KSEIKVSAGPRPLPRVQGDPSPPLPASG 
GPRCSWACGQITAVSASGVTHPSPPLLH 
VCCVCLI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan,Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


9115 


23016 


A 


9182 


402 


238 


QMQWHNLWPLQSLPPEVTPLFCrjSTLPNN 
WGYRWPPPPPTNFFFLKGVSLGRPGSV 


9116 


23017 


A 


9183 


3 


207 


EFHHVGQDGLELLTSGDPPASASESAGl 
TGVSHRAQQACRFLHFFEGTCFLSQELS 

YLKLGRDTIIC 


9117 


23018 


A 


9184 


3 


106 


YPHYVKS I VaSTFI ISLFPTTI FMOiDQ 

EVIISN 


9118 


23019 


A 


9185 


110 


248 


GVLVGSPLRTRFHSQLQAEGAEEIGWT 


9119 


23020 


A 


9186 


925 


1132 


KCPSAWAAHPTSEIRISQGGSQAAWLV 
LFFLLKNPGDVIPALWEAEAGGSQGQQF 
ETSliANMAKPRLY 


9120 


23021 


A 


9187 


399 


292 


ASQVAGTTGAHHHARLIFFVFLVETGFH 
HVSQDENF 


9121 


23022 


A 


9188 


7 


82 


PCaaPPASVSQSAGITGVSHHAWQIF 


9122 


23023 


A 


9189 


1 


117 


LPISLPSFLPSFLPSLLPSFLPSSFLPF 
FFPSSVTPYLK 


9123 


23024 


A 


9190 


238 


46 


NKAFRIKAFVSPSNKKFSSFFFFUSIEMK 
SHYVDQAGLELLS SGNPSILASQNAKIT 
GVSHHVRP 


9124 


23025 


A 


9191 


2 


282 


LIFYFVDNMLVHAGLELLSSSDLPALAP 
KVLRFTGVSHHTRPELYFQRLLAPGTLL 
INSQAAGTNFPIGLSRIKPTQSGHGMLQ 
EVWGTSGLR 


9125 


23026 


A 


9192 




136 


CLCVCVCVCVCVCVSCKQTLMCCFLENF 
DV 


9126 


23027 


A 


9193 


469 


347 


PCPANFVFLVETGFLHVGHAGLKLLISG 
DLPNSDQKFYVN 


9127 


23028 


A 


9194 


427 


238 


PPARPTAFPYIEFFVEMVFRYFGKAGLK 
VLISGDLCASASPGVGITGVSHHAQPK 


9128 


23029 


A 


9195 


34 


178 


SLVHWEFERGRAEWLTPVIPALWEAEA 
GGSRGQEMETIIiANTDFWTG 


9129 


23030 


A 


9196 


22 


338 


CGPYSMWWNSIKRERKEGKKEKKKRRRK 
EQKERERKERKERREGGREGKERGRREG 
KKRKKNKRKKVGLIGSPCIVFTWQAAYC 
EESGGVWPGGETEHTQEGLVQ 


9130 


23031 


A 


9197 


3 


91 


QGLSLGGGGCSEPGSCHCTPAWATERDS 


9131 


23032 


A 


9198 


1 


203 


RTRGNKS IDDTSNFDDFPESDILQPVPN 
TTEPDYKS KDWVFLNYTYKR FEGLTQRG 
SIPTYMKAGKL 


9132 


23033 


A 


9199 


2 


144 


PTRTMPCCCCCCCCCCCCCRCLFMTGVL 
FTCCHNQFENHSGCLPRQI 


9133 


23034 


A 


9200 


1 


99 


YAAKELVHDRYRPTIEMERRRGLWWLVP 
RLSLE 


9134 


23035 


A 


9201 


3 


149 


LNAVQRMEWQLKVQELDPAGHKSLCVNE 
VPS FYVPMVDVNDDFVLLPA 


9135 


23036 


A 


9202 


284 


21 


KGPPLFFFFFFLRQGLVLWHRLEYSGMT 
TAHCNLELLVSSDPPTLSSQNAETTGVS 
HCAQPASLILSKPYIGFLHQIQKSCLPV 
TFL 


9136 


23037 


A 


9203 • 




77 


LRLVIiHSWRQAILLLWPPKVLGLQA 


9137 


23038 


A 


9204 


529 


340 


GVLLWRQAEAGELSLGGGGCSELKVRHC 
TPAWATHAETPSQKINELIKSMAVELTD 
DFIT 


9138 


23039 


A 


9205 


413 


241 


HPPPPELKLFSCISLISRWCHRHRPPCP 
YVFVLLAETGFHVGQAGHKHTYCCVSLP 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
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lo'':>tion 


Predict- 
ed end 
nucle- 
otide 


Amino acid sequence (A=Alan in e 
C=Cysteine, D=Aspartic Acid, 
E=Glutaiiiic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unk.nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 




correspond 
ing to first 

sequence 


location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 














T 


9139 


23040 


A 


9206 


33 


152 


QLQGLESWGMERCALPGYPWFTNLCKY 


9140 


23041 


A 


9207 


70 


402 


HVDWEALGNDPRLWLGICKLKASGIHE 
WVHTRTCAHTHMHTHTHTCTHCRSPRW 
VLNLGCTFELPGALKNARAYPHPNATKP 
MTSESVGIRPTRQVFICAPRAEQHCSTA 


9141 


23042 


A 


9208 


46 


262 


GGIPPAAVRGGRSWLRDVLSRLAAAGRR 
RRGTRRGRGRAGGAGGGRDRVGSRREPA 
AARTAAEGEJ?AVRGKC 


9142 


23043 


A 


9209 


15 


206 


PIEPLSERTNSAGAPPPflSLPPCSLISD 
CCASNQRDSVGVGPSEPGDSDMQTSLWA 
KVPVSKD 


9143 


23044 


A 


9210 


475 


110 


NLLAPHLLVSliEKHSIRMGVAQFSRCHL 
SQLPLARKENSLTPCASGVRRCIiRLLRL 
MLGGLHPLSCTHCPTGPSEMNLVSQLEM 
QKSPVFCVAHAGSCRLELFLFGHLPL 


9144 


23045 


A 


9211 


420 


132 


PPPPNYFGGAGFFPPPPFFLRPPPPFFF 
LPPKKKKIFPPRGPKKIFFLKAPPPSFF 














LVRSHFILTIM 


9145 


23046 


A 


9212 


353 


87 


FFFXKQXFFFPSLFFFSPPXXFSFYPF* 
YLKYPLLPPSFFLFFFTPPPPXFFFFFF 














FFFFD 


9146 


23047 


A 


9213 


2 


276 


FIQLLKRRLCELGYEVLEHPSYSHDLLP 
TNYHFFKHLDNFLQRKFEHNQQDAESAF 
QEFAESQSMEFYATGIHTLISPWQKCVD 
CSVSYFD 


9147 


23048 


A 


9214 


402 


276 


LSCLSLPSSWDYRCPPPKPGYFFYFFLF 
LVKTGFPHIAQAG 


9148 


23049 


A 


9215 


386 


37 


GFKKRVCSPFFFFFFFFFFFFFFFGREG 
FFLQKKK7TCMNPSSNKLLTLGLQGKKE 
IQPALMKKNSISQS 


9149 


23050 


A 


9216 


1 


142 


KDFRVQVWYNHSSLQPQPPGFKQSSRLS 
LPKYWDYRHEYLHCIQLRP 


9150 


23051 


A 


9217 


402 


181 


XXXXXXXXXKFFFFXFFFFFFFFFFFXM 
FQQEKLLFIFYVWIWTF 


9151 


23052 


A 


9218 


2 


87 


SGSLIFGLIYICM C VCVCVCVCVCVYPC 


9152 


23053 




9219 


420 


271 


YYCIFRETGFHHVGQAGLELLTSGEPPA 
SASQSAGITATAPSPFLTFLLV 


9153 


23054 


A 


9220 


139 


413 


SGYHSKTLFQKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKRGGGAFKKK 
I FFRGGGGNFFFFLGPQKNIGGGGLKKR 
GGGKKPPPPQIFFFVG 


9154 


23055 


A 


9221 


134 


236 


LHRWGPAMLPRIILNTWPQVILLPRPPK 
VLELQA 


9155 


23056 


A 


9222 


364 


280 


STHLS LPKYWDYRHEPPCLAKKDAQSFS 


9155 


23057 


A 


9223 


1 


147 


TLLPMLESSGTIMAHCSLMYLGSSDPST 
SASQSARITSVSHCTQPPLPF 


9157 


23058 


A 


9224 


1 


208 


AGAPPPASLPPCSLISDCCASNQRDSVG 
VGPSEPGAGYHLWRRLSQSEKRSIRVG 
VTRFSSLLLSSHA 


9158 


23059 


A 


9225 


1 


145 


FQHVGQAGPKLLTSGDLPASASQSAGIT 
VSNRVRPKTKNFYTGNTRPN 


9159 


23060 


A 


9226 


396 


210 


KCWDYRHEPPYPALTNLLFIEMGSYGVA 
RAGLKIiGSSNPPISASQSAGVPGLSHH 
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od 
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Predicted 
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correspond 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cyste!ne, I)=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
IV=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















9160 


23061 


A 


9227 


2 


154 


FHHFGQAGLELLTS SDLPTSASQSAE IT 
C3MSHHTQPRMVFFSMFSSTAQC 


9161 


23062 




9228 


1 


238 


FFLLALC3IFILHAIYLSIYLSIHPPSMH 
PSLS I YLSIYLS I YLS I YLS ISLSLS LS 
I YLS lYLS I YLS lYLSDFLSSAR 


9162 


23063 


A 


9229 


3 


65 


RC5DLITEPFLPKLLTKVKGS 


9163 


23064 


A 


9230 


25 


1S8 


YFRTAWNQAWWYVP IVSATWEAEAGGLL 
EPTSSRLRCAVIAPVNSYCTSTWAVS 


9164 


23065 


A 


9231 




232 


YDLKFNNHWFYFLFETEFH3L3KVECN 
GAISAHHNLCLCDGARLHHRRGGKKTKK 
QKTLPS 


9165 


23066 




9232 


3 


195 


FFVGLGSPYWQTGLELLCSGNPPASAS 
LNARIT6MSHQVQPNTGRLS FYLGSS I Y 
GSIHftMI 


9166 


23067 


A 


9233 


380 
Tl 


244 


SVPLAKMGFHHVSQASPELLPPVIHSSQ 
PPKAIGLQAEATAPSLH 


9167 


23068 


A 


9234 


101 


LLLINTLMFIHLSPILLLSLNPDI ITGF 


9168 


23069 


A 


9235 


411 


226 


FFFFFATNSQGGPTSFWGPENNAQVAGP 
LRKKIFTPVGSPKKIFPSWETPVF 


9169 


23070 




9236 


404 


262 


QGLflMLPRLVSNSWTQAILLPWPPNQSA 
GGSGVSHHTQPIFISENIH 


9170 


23071 


A 


9237 


369 


152 


HAFFVS IWRSFHVSNSYMTVTVQMXHNL 
SIPLPDPHFFFRGGGLAMLPRLVSNPWP 
QAILPPQPPKVLGLQA 


9171 


23072 


A 


9238 


389 


190 


RPKKKKKFFPPPGGKGVFFKQAPPPYFL 
NFIFFFFFFFFFFFFFFFLIVTHNFLIL 
FLNCDKLYIT 


9172 


23073 


A 


9239 


330 


3 


GGME PPFFPPTGAQWGYFGS LQNS PPEF 
KQFLCLRLPNVLEYXSIPPCPTNFCSFy 
KNGIFPCWPGWPFKFQKKKKISWVWWCA 
LABAGGSLEPRWRWRLQSAKMAPL 


9173 


23074 




9240 


122 


248 


KKKKKRGGLTPVIPPLWEAKGGGSRGQK 
IQTILAGGGKAPLY 


9174 


23075 


A 


9241 


2 


286 


FFVQTEFHHVAQAALQLVS S RD PPTS AS 
QSVDITGVSHCAQPVIVSCDDQVTLVQR 
RQVTVTSLIAFCQVAYPKGAKSQWPQSF 
FFSSPFLFSL 


9175 


23076 


A 


9242 


2 


95 


HMKKCSSSLAIREMLIKTTMRFFFFKLI 
TCY 


9176 


23077 


A 


9243 


377 


293 


PLSPRLVLS3WAQMILLPLPPKVLGSQA 


9177 


23078 


A 


9244 


422 


147 


KKGGTQGFSPPRVFKRGPRNFFFGPPKK 
KKNPPPPRGKIFFFLKGPPSFFFFFFFF 
YFFFFFFFFFFTHKYFLLGDLLLIYHLS 

KKNYSFF 


9178 


23079 


A 


9245 


402 


208 


TIGRCHHTQLNLKFFVEMGSHYWQAGL 
KLLD STNP PS SAFQS AGI SGVWPYAWTE 
IMFLYSIPI 


9179 


23080 


A 


9246 


314 


168 


EPYHT.HFLPPHHYYYCRWGL IVLPMLVS 
NSWAQVILLPWPPKVQ6LQV 


9180 


23081 


A 


9247 


3 


320 


WQAGLKLLSSCDQPASASQSAVITGVN 
HHARPLI FLYEMDYGSRTVL I IRKWDDN 
EQSNRTIFLKKDELTGNRSACLKYTPMD 
INVNSGLGWRYRFGMHHQCRW 


9181 


23082 


A 


9248 


22 


303 


KGSYFSFTYEAQTESCSIARLECRGAIS 
AHCHLRLLDSSNS PASASRVAETTDHKT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515.1 
26 


Predicted 
beginning 
nucleotide 
location 

ingto first 

residue of 

peptide 

sequence 


Predict- 
otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LVPERSCFTPKERNVIQKGQEESEEMSN 
HLFTWLPILH 


9182 


23083 


A 


9249 


13 


174 


LGLKVRHRLWLI FVLSVETEFHHLGQA6 
LELLTSGNPSAGrTGVSHCARPTSHS 


9183 


23084 


A 


9250 


3 


141 


HVGQTVLRLLTSGDPPTSAS ESAGITGV 
SHRPRQIFISLFTICIR 


9184 


23085 


A 


9251 


56 


262 


LRIPLTFGGWKGEEEDRAVMHLRLCGD 
DS PMSESLHFIKLPRTVLVDEDARPEKK 
KKKKKKKKKKKKK 


9185 


23086 


A 


9252 


401 


275 


PAD LFI F I WEDGGLTMS PRLVMAIS WS QA 
TLQPWTPKVLGT.QV 


9186 


23087 


A 


9253 


411 


258 


PKVFKYRCDPPRliAWFLFFVQSLLPRLG 
LNSWAQAFLPPWPPRVLGLQA 


9187 


23088 


A 


9255 


391 


167 


GLKILTSGDLPASASQSAGNTGMSHCAR 
PLMSFKIRYFLFTIKMKLKLLWPVWSA 
GVHPPHFSCPIVITNKPRI 


9188 


23089 


A 


9256 


9 


255 


FFCRYGGIAVLPRLVgNSWfQRlfcSPWP 
LKVLSLQACTHHFFLMVNSNAARIKTIE 
MLLSEDVNFFVALLCPSFTISHFHL 


9189 


23090 


A 


9257 


487 


42 


RYFI FLFYFI FKTETGSYYVAQflRLRLL 
GSSNPPALAFQSARITAVSHRAQLTCIK 
YTFLFTHIP 


9190 


23091 


A 


9258 


' 


243 


ILSSQQSFISLTAVILIIfWIGEAFASK 
RKVLIVEEPS INLEWLYGCPPPYHTFEE 
PQPKKSEIYSKCFFFSVFVHFGGWG 


9191 


23092 


A 


9259 


411 


2 


FFFFSPPVSLSLCVLESSPPPFFFFSPP 
PKKBFFSPPFFFFSPRVFPPPVFFFPPP 
PFFFFPPLKKNFSPPPPGKIFFFFFRPP 
PPFFFFFFLFFFFFRPFFFFSFFFIiRWE 
EVLYFIMOIHISDAYPPNVDMNKE 


9192 


23093 


A 


9260 


417 


219 


SVEMGFLHVGQASIKLPTSGDPSASGSQ 
SASITGLSHCARPHSQIB IVCCYSVPVQ 
CLHSLAARY 


9193 


23094 


A 


9261 


408 


172 


CPMHFRKW6HRGCPPPRVFFFVGRDGVY 
VAQflGLELLDLSNSPASASQSNGTTGMS 
HLGQPLLFIUSrDVIAVIPLQGS F 


9194 


23095 


A 


9262 


401 


132 


HQDNKPAPPGFFFFSPPPKKGFFSKPFY 
PVSPRFFPFPPFLNPAPLFYFWQPKKKI 
FFFPPPAIKFFFFKRAPPLFFFFFFFFF 


9195 


23096 


A 


9263 


393 


220 


IiFFUilKIPFFPPGGSKGGDFGPHTNPPP 
GVKPIFRPHPPEEGGPKEPPPRPREFFI. 

GF 


9196 


23097 


A 


9264 


206 


438 


FPILKTHFRLGTVAHTYNPTTLGGPGGR 
NTCSKFKTSLGTWGDPFFHKSENCLGWG 
VPNCGPCFWGAQQKSAWSEGL 


9197 


23098 


A 


9265 


333 


48 


ALLAGPTPFFFLGKKKTRFGNHALFFFF 
FFFLNEVLLCNPGWIMWAIHRCDHGTL 
QPPTPGLKRSSCLSLSS3WNYKLKPLHP 
AGVHFLDFVS 


9198 


23099 


A 


9266 


3 


275 


TGFHHIGQAGLKLLTLGDPPLSASHSAG 
ITGVSHHflWPMHLESYWRVQYPFMPQVL 
YFWAPSGAQHCAQI I 3NAI PTLLSRPQE 
EPASEA 


9199 


23100 


A 


9267 


49 


226 


HGPEPCMDADAQREGIKVSARYPKNWES 
TVDPAREFTMIYSAPLMLLADPDEFASV 
QLA 


9200 


23101 


A 


9268 


159 


236 


PRLVLISWPOAILPPWPIKWGLQA 


9201 


23102 


A 


9269 




208 


IFNYCVFSSGVALAYGIYKQDLPAIiKEK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PRGWFVDMGHSAYQVSACAFNRGQLKV 
SIYFFSRRLLWW 


9202 


23103 


A 


9270 




209 


IFNFCVFSSWAIiAYGIYKQDLGALEEK 
PENWFVDMGHSAYQVSVCAFNRGKLKV 
SIYFFSRRLLWW 


9203 


23104 


A 


9271 


379 


263 


ESCSVGRLECSGAVPAHCNLCLCLPDSS 
GSPASSSRLC 


9204 


23105 


A 


9272 


312 


3 


FHREGLAALQPWWPNKVSSKKKRKKERE 
RKKERKKERKKERKKEDGATAD6ISEAE 
VIRVHPYPVGFGVLSEGDIWTEDMRAGR 
RPHAGESGDTVMLPQAQE 


9205 


23106 


A 


9273 


405 


246 


PTKKNKYLHAPPLFHFFFIIPNPNFFFF 
FFFFFFNFFFFFKTKGCEFLYFSY 


9206 


23107 


A 


9274 


2 


227 


LVETGFRHIAQAGLQLLSSSYPPTSASQ 
SVGITGVSHQPQPSFHFERKSLQLKTLL 
LCVKVGVRACFALFFQGSS 


9207 


23108 


A 


9275 


70 


304 


NLFLKASTYfSEHLYHLtipTPIVYIRKR 
LDAMAHACNPGTLGDRGGWIARTPDLKC 
SAYRGLSKCWDYRHEPPRPVIV 


9208 


23109 


A 


9276 


2 


260 


TVFYHVGQAALELLTSGGPPAIiASESAG 
ITGVSHHAQPFLLILVISLFPOJQSFHI 
SPYRFSFTQLKHGFNTVLYPLVINISEI 
FF 


9209 


23110 


A 


9277 


406 


330 


PHLVLNSWTQTILLPWPPKVLGLDV 


9210 


23111 


A 


9278 




160 


ELLTSSDPPASASQSARIT6MSHHTRLK 
YIVYRVSRKSKNIPLKHIGIQSFP 


9211 


23112 


A 


9279 




91 


GLTVFPRLVWNSWAQMILPPWPPKML6L 
QV 


9212 


23113 




9280 


3 


107 


LGETGFHHVGEAGLELLNKUMHPPWAPR 
VLGLQA 


9213 


23114 


A 


9281 


336 


234 


AHYKLLLKIKLPGQAQWLTPVIPTLWEA 
EAGRS 


9214 


23115 


A 


9282 




241 


DRPEPCMDAtlAQREGINASARYPKNWVT 
TGDPAEEFTMIHTAPLMLLADPDE lESV 
QLAQSWPFGAIASLGRSPCRKRN 


9215 


23116 


A 


9283 


404 


277 


IFFVETRSHHIAQAGLELLGSSDPPYFF 
QVFTEMSLSQRALF 


9216 


23117 


A 


9284 


399 


231 


HYVAQASLKLLYSGDPPASAFQSAE lAG 
VSHCTWPTGSLNKIQNNIHSRISSM 


9217 


23118 


A 


9286 


394 


231 


AFFFFFFFGERGFHHVARTVLKLLDSSW 
PPAWSQSAGITDVSHDFLISLYRIY 


9218 


23119 


A 


9287 


420 


311 


LNFCKDGLAMLPRLVLNSWAQAVLSPQP 
PKVMGLHA 


9219 


23120 


A 


9288 


398 


173 


LLSFFYLFFYFFFFILFFLFFFFFFFFF 
LMFKALLFSCHYFPAMVTFKFSLSISGV 
VRT SGYIMLFSYL CLLKTM 


9220 


23121 


A 


9289 


270 


23 


TPEQCRGIKLFIYSLILYVRDGVLLCCS 
GWPWAIHKCDHSTLQHQI PSLKQASCL 
S LLSSWE YRYAPLHLASANFLKVTW 


9221 


23122 


A 


9290 


273 


46 


HLLNMPLFTEIWTERLFLFYILCFVFLV 
ETGFFYITQAGLKLLTSSDPPAYASQSA 
EIIGTSHOTRPICYVNVTNI 


9222 


23123 


A 


9291 


424 


230 


CLPFTHSLSLSLSLSLSHTHTHTHTHTH 
TIYMNK 


9223 


23124 


A 


9292 


2 


146 


NLFLVETRSHHVARASFKLLSSSNPPTS 
ASQSSEITGASHRTWPEVFF 


9224 


23125 


A 


9293 


393 


202 


GRERERKREREREKERKKERKKERKPER 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

otide 
location 
corrcspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=JVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Ttireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














PEGELTYCVREG I TAAKLGKRRHKASKT 
LISHKNKN 










252 


193 


GFCCCCCCCYCCCSDKHHPG 


9226 


23127 


A 


9295 


409 


269 


SQYFGRLRRVDCLSSGVQDQPGQQGETP 
SLLKYKIYIHISWAGWHV 


9227 


23128 




9296 


2 


213 


FCCGLLIHLIYCKTEYSGQSTVAIHRC3JJ 
HSVLQPQTLGPKQFSHLSLLSSWDYRYS 
PLCPAYIFMLSIGY 


9228 


23129 


A 


9297 


169 


408 


RFARHMWSPHEERLPPPAGIRVRPPPQP 
AASLGPPRMAPS I SRRFQMTPYYKHNDF 
AGQAQWLMPWPALWEABAGGSSE 


9229 


23130 


A 


9298 


199 


51 


LHPPPPSLSLSLSHTHTHTHSHTHTHTH 
TLWLGRPGPAQDWLVMGNLT 


9230 


23131 


A 


9299 


1 


176 


MGSHSLAQARMQWCNQSSLQPQPP6LKQ 
PSPFSLPSSWDYRHVPPCLASIFFLKLN 
YH 


9231 


23132 


A 


93 00 


412 


209 


MGFHHVGQSGLKLLTSGDPPTSASQSAG 
ITGVSHENWPHCGFNLVFLMPDDVKHLF 
MCILAICVSSL 


9232 


23133 


A 


9301 


195 


347 


KIKGFRPGATAHAGWPSALGHQGGRTAI 
AWEFTISMSNMGRPCriSRKyKN 


9233 


23134 


A 


9302 


3Sfi 


190 


GFLFFFPPPKKEVFPHPFILQFPGFFPP 

PPFFKRAPBFFFWGPKKKKNSPPPRALN 

FFFFKGAPP 


9234 


23135 


A 


9303 


27 


173 


SFQLWGHLKVPKIRPKRAUHLRSGVQDQ 
PDQHGKTLSLLKKYKISRAW 


9235 


23136 




9304 


374 


121 


QWCDHGSLQPQTPGLKQSSCFSLPKRWD 
YRREPPHPTSLSPYIVAWPCTLCPPPLS 
ALWKHLIHQKCDQVMVLQVPQRTHNVTL 
T 


9236 


23137 


A 


9305 


70 


196 


PQVIHPGQAGLELLTSGDLPASASKSAG 
ITGVSHHAQPISHF 


9237 


23138 


A 


9306 


378 


256 


QKWVSICCPGWSQTFGLQESSCLGEPKS 
WDYRRQPQPCLTL 


9238 


23139 


A 


9307 


206 


3 


PCHLGWSAWQSWLTAflSNSWPHAILPP 
QPPKVLGLOftSLKFLKAGITQVTFSDYN 
KILNKEIKRGR 


9239 


23140 


A 


9308 


338 


161 


HFKKKIYIYIYVyiHTYIYIYTHIHIYT 
HTYTYIYRYIHIYIHTYIYMYVYIYIPS 
LPF 


9240 


23141 




9309 


503 


288 


FFLNLDKSQEWWLMPVIPTLREAEAGGS 
LEVRSSRPASSDPPTSASQSAGIIGMRH 
HARPWLRENGRIQW 


9241 


23142 




9310 


418 


309 


FFCKWGI SI CCLPKLVSNSWPQAILQPQ 
PPKVQGTL 


9242 


23143 


A 


9311 


442 


275 


VSPPPLGGGFLSPPFKPPPFFFNLKGGP 
PFPTIPFLI IKGKKFCPPPPFFFPGLF 


9243 


23144 


A 


9312 


2 


95 


HENRLSLGGGGCSELRLCHCTPAWATER 


9244 


23145 


A 


9313 


318 


154 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFI 
FILFIYSLLYWCVGVLGS 


9245 


23146 


A 


9314 


388 


199 


EQPKCVHRI KQIWHICALEYYAAFKKKK 
VLPYAVIQIKLEDTMLSEINQSWGKYCM 
rPLIGQI 


9246 


23147 


A 


9315 


2 


97 


STPGLKRS SHI SL PKYWDYREGPPSPAS 
ILF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

nucle- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalaiiine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


9247 


23148 


A 


9316 


3 


235 


TMWKAEGIPIRTTLDNSTTVQYACLLH 
HLTMKAKSTVRD-DPQNDLTFLRIRSKK 
HEIMVAPDKEYLLIVIQNPCE 


9248 


23149 


A 


9317 


253 


71 


LFFFFLFFVDTKFHHLAQDC3FQLLSSSN 
SPTSASQSGGITGRSHQAWPYVHLLTLQ 
KUMY 


9249 


23150 


- 


9318 


425 


95 


CLLFCPKKKKKKKKKKKKKKKKKKKKKK 
KKKGGPIKKKKKQP 


9250 


23151 


A 


9319 


25 


214 


VRIDDRPEPCMDADAQREGINSSARYPK 
NWVTTGDPARECGMIHSAPLMLIADPDE 

FVSVQLA 


9251 


23152 


A 


9320 


50 


231 


GHRPEPGRDADAQREGIIASARYPKNWV 
TTGDPARECTRIQSAPLMLLADPDEFVS 
VQLA 


9252 


23153 


A 


9321 


401 


226 


QWLFTNMIIGHCSLELLGPSNPPISASR 
LAATTGTRHHAQPSKFRNLKGWI.CAKRA 
RT 


9253 


23154 


A 


9322 


410 


159 


LGSLQLPPPGSKRFSRLSLPSSWDYRGC 
SELRLRIICTPAWVTEQDSVPPKKKKQIF 
RPFPFSYGESLGVGLRHLCFSKPSRRF 


9254 


23155 


A 


9323 


310 


25 


KWGGLFFFSPPKKRGFFKQVFFCVPPVF 
FFPPFFKTGPGDIFLGPLKKKIIFPPPG 
GKKFFFKRGPPPFFFFFFFFFFFFFFFF 
FSEVECVCLCR 


9255 


23156 


A 


9324 


406 


289 


PGFYHAGQAGLKLLTSGNPSASASQSAR 
ITSISQRTRPH 


9256 ■ 


"23157 


A 


"9325 


1 


116 


LHSVLLPFFYFCVCVCVCVCVCVCVKHK 

Y'vTvAGGAPGR 


9257 




1 




; 




FIFLFTVGGLTGIVLtANSSLDIVLIIDTY 
YWAHFHYVLSIGAVFAIIGGFIH 


9258 


23159 


-X- 


9327 




107 


RGCSEPRLRHCTPAWVTEGDCVSKKKKK 
VPEAWG 


9259 


23160 


A 


9328 


257 


2 


GFFSNISASVIIPPPISRKNLFFPFRVY 
FGPPPKTFKGPPPFFFFFFLS PRLECSG 
VIMAQCSLTLPGSSDLPLSASCVAMTVS 
I 


9260 


23161 


A 


9329 


66 


384 


QNLCGWMKSMaWDNPWRKADWGQIRES 














KRGGPFLKISWGGPILGGRGKI FFFFFG 
GGILTPRGVFWKSPFFWGGKKK 


9261 


23162 


A 


9330 


1 




IIGQVASVLYFTTILILIPTISLIENKI 
LKWA 


9262 


23163 


A 


9331 


33 


412 


NLFYVMAEILHTHIQLCCQVKLHLSVPR 
PSLPLLCCWNFSLVLHEHSQPASEKKKK 
KKKKNKKKKKKKKKKRGGVFKKKPGGAP 
FCGGGENFFFFFFGGGLKKPRGVFGKNT 
FFWGGEKRPPPPPY 


9263 


23164 


A 


9332 


401 


311 


PTRPVDLLVDDSVKESLIGYGPTEDSKE 
FS 


9264 


23165 


A 


9333 


5730 


452 


TRSAPRAASWLEDPREVRSVCLSATFFT 
LSLQKPRLLLFSPSWHLGVPLSVGVQL 
QDVPRGQWKGSVFLRNPSRMnTVPCSPK 
VDFTLSSERDFALLSLQVPLKDAKSCGL 
HQLLRGPEVQLVAHSPWLKDSLSRTTNI 
QGINLLFSSRRGHLFLQTDQP lYNPGQR 
VRYRVFALDQKMRPSTDTITVMVENSHG 
LRVRKKEVYMPSSIFQDDFVIPDISEPG 
TWKISARFSDGLESNSSTQFEVKKYVLP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

ding'to'"'" 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glufamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NFEVKI TPGKPYILTVPGHLDEMQLD IQ 
ARYIYGKPVQGVAYVRFGLLDEDGKKTF 
FRGLESQTKLVNGQSHISLSKAEFQDAL 
EKLNMG I TDLQGLRLYVAAAI I EYPGGE 
MEEAELTSWYFVSSPFSLDLSKTKRHLV 
PGAPFLLQALVEEMSGSPASGIPVKVSA 
TVSS PGSVPEVQDI QQNTDGSGQVS I P I 
IIPQTISELQLSVSAGSPHPAIAELTVA 
APPSGGPGFLSIERPDSRPPRVGDTLNL 
NLRAVGSGATFSHYYYMILSRGQIVFMN 
REPKRTLTSVSVFVDHHLAPSFYFVAFY 
YHGDHP VANS LRVD VQAGACEGKLELSV 
DGAKQYRKGESVKLHLETDSLALVALGA 
LDTAT.YAAGSKSHKPT.NMGKVFEAMNSY 
DLGCGPGGGDSALQVFQAAGLAFSDGDQ 
WTLSRKRLSCPKEKTTRKKRNVNFQKAI 
NEKLGQYAS PTAKRCCQDGVTRLP^MRS 
CEQRAARVQQLDCREPFLSCCQFAESLR 
KKSRDKGQAGLQRALE ILQEEDLIDEDD 
I PVRS FFPENWLWRVETVDRFQ ILTLWL 
PDSLTTMEIHGLSLSKTKGLCVATPVQTi 
RVFREFHLHLRLPMSVRRFEQLELRPVL 
YNYLDKNLTVSVHVSPVEGLCLAGGGGL 
AQQVLVPAGSARPVAFSWPTAAAAVSL 
KWARGSFEPPVGDAVSKVLQI EKEGAI 
HREELVYELNPLDHRGRTLEIPGNSDPN 
MI PDGDFNSYVRVTASDPLDTLGSEGAL 
SPGGVASLLRLPRGCGEQTMIYLAPTLA 
ASRYLDKTEQWSTLPPETKDHAVDLIQK 
GYMRIQQFRKADGSYAAWLSRDSSTWLT 
AFVLKVLSLAQEQVGGSPEKLQETSNWL 
LSQQQADGSFQDPCPVLDRSMQGGLVGN 
DETVALTAFVTIALHHGLAVFQDEGAEP 
LKQRVEAS I SKMJS FLGEKASAGLLGAH 
AAAITAYALTLTKAPVDLLGVAHNNLMA 
MftQETGDNLYWGSVTGSQSNAVSPTPAP 
RNPSDPMPQAPALWIETTAYALLHLLLH 
EGKAEMADQAAAWLTRQGSFQGGFRSTQ 
DTVIALDALSAYWIASHTTEERGLNVTL 
SSTGRNGFKSHALQLNNRQIRGLEEELQ 
FSLGSKINVKVGGNSKGTLKVLRTYNVL 
DMKNTTCQDLQIEVTVKGHVEYTMEANE 
DYEDYEYDELPAKDDPDAPLQPVTPLQL 
FEGRRNRRRREAPKWEEQESRVHYTVC 
IWRNGKVGLSGMAIADVTLLSGFHALRA 
DLEKLTSLSDRYVSHFETEGPHVLLYFD 
SVPTSRECVGFEAVQEVPVGLVQPASAT 
LYDYYNPERRCSVFYGAPSKSRLLATLC 
SAEVCQCAEGKCPRQRRALERGLQDEDG 
YRMKFACYYPRVEYGFQVKVLREDSRAA 
FRL FETKITQVLHFTKDVKAABNQMRNF 
LVRAS CRLRLEPGKEYLIMGMGLDGATY 
DLEGHPQYLLDSNSWIEEMPSERLCRST 
RQHAACAQLNDFLQEYGTQGCQV 


9265 


23166 


A 


9334 


46 


324 


S KYLPPVITFQHE IWVGTNIQTI SDWVL 
ICCPGWSAVTQSRLAAASTSWAQAMSSS 
SCLGLSSSCYYHHTPPCLANFVAFVETK 
LYTMLPGLS 



1107 



wo 01/64835 



PCTAJSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gliitamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Tlirconine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


9266 


23167 


A 


9335 


338 


178 


EVDIAVSHRRKQAFNLCLLKCKPHNNCL 
NLEGGGSSELKLCYCIPAWATDQSL 


9267 


23168 


A 


9336 


396 


254 


KLSTDCTSKEI INRHPTEWEKI FANYAP 
NKGPTSTIHSSCLMFPYLF 


9268 


23169 


A 


9337 


454 


359 


SSWDYRHAPPCPANFVFLVETIFIQGEI 
GLL 


9269 


23170 


A 


9338 


540 


328 


IKKRGAPRRGVLFFFTRKKKFFSYYKKS 
TKERVSFLFVAPGCIQGGKKKKPPTKKK 
KKKKKKKKKKKKKKK 


9270 


23171 


A 


9339 


21 


209 
2 


AEEFPDFKLYYKANQNTYRGQWNRTETS 
EITPD I YliGIQLVSNSWPQVI LLPWP PK 
VLGLQA 


9271 


23172 


A 


9340 


345 


KIFFFLIYILFFFNFFFFFFFFFFFFFF 
FFFFFLENRWFLMENFIKGRGHVKGPLG 
QARDHDISGRRWWRREQCRASQLPGVRR 
SLLQVQAGGPVHIRISGLQDARQGPRVA 
GA 


9272 


23173 


A 


9341 


15 


252 


LERVIRQSAAITDSFFNFILLILFIYLF 
FETGSPCVSQAGLKLLGSRDPPVSASQS 
ARI IGVSHRAQPALTDFPSISF 


9273 


23174 


A 


9342 


484 


276 


RHAPPCQANFFVQTRFCHVAQSGLKLIR 
SRYLFASAFQSVRITCMSHCARPLYFLM 
FKELQYLKSPTKS ' 


9274 


23175 


A 


9343 


464 


326 


LVETGPHHVGQAGPELLAPCDPPflPASQ 
SAETTGMSHRSRPPANFV 


9275 


23176 


A 


9344 


486 


280 


SFSHSHSCSLPSSWDYRCTSSHLAHFFF 
KRWGFTILPRLILNSWPQVILLPQSHRH 
EPLCLETLKTLNV 


9276 


23177 


A 


9345 


276 


179 


DRDFTMLPRLVLNSWPKTILLPLPPKIL 
RLQS 


9277 


23178 


A 


9346 


45 


143 


RWNSRQENCLNLGGGDCSEPSLCHCPPA 
MATE 


9278 


23179 


A 


9347 


229 


73 


PPPHMIFFFFFFLFFFYFFFFFFFFFFF 
FFFFFFFFSRLQVSVYCSSVLNKNVD 


9279 


23180 


A 


9348 


391 


139 


IPSLKRSSHLSLAKCWDYRHKPPCQVTF 
SFLNVEIYATKFFLSTPLAAWSKYWYLV 
FLLSFSSKYYLTPISFLFLTHNVSSS I F 


9280 


23181 


A 


9349 


53 


191 


lYVTYIWYSEVSHAYNSSTLGGQGGRIT 
GAQEFKTSLVNIAKPLLY 


9281 


23182 


A 


9350 


373 


235 


EEGELLEPGVQGCSELSWCHCTPAWGTE 
QDHCVSYNNSNTHNNVNY 


9282 


23183 


A 


9351 


476 


82 


YLATYLPTYLSIYLPTYLPIYLPIYLYL 
GSHYIAQAGLKFLGWVGHNGS CL 


9283 


23184 


A 


9352 


286 


142 


ELQFKMRFGWRHSQTISWGLSLLPRLIS 
NSWAQVILLTWPPKVLGIQA 


9284 


23185 




9353 


3 


102 


FFETESHSVARLEYSGAISAHCTPVIKL 
YLYT 


9285 


23186 




9354 


3 


270 


TAGAPPRPANFCIFSRDGVSPCWPGWSR 
TPGLRCDPPASASQLLGRLRHENCLNLG 
GEGCT6AILAHCTSTLRAQAILQPQPSC 
PAPQ 


9286 


23187 


A 


9355 


1 


112 


GEGSAHCNLCLPGSSNSPASTSQVAGIK 
GVSHRARPN 


9287 


23188 


A 


9356 


1 


63 


VALIQAYVFTLLVSLYLHDNP 


9288 


23189 


A 


9357 


244 


140 


QRRHTHTHSHTYTHTHTHIPNKKIPILT 
ILKCIA 


9289 


23190 


A 


9358 


464 


181 


LGAGGMT6ILFSL6ASMVLGGVAQMIAP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lVI=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tlireoninc, V=Valine, 
\V=Tryptophan, Y=Tyrosinc, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KARTRRIQTTDNGKQNTYFSSLDMMVAQ 
GNVLPVLYGEMRVGSRWSQE IS TADEG 
DGGQVWIGR 


9290 


23191 


A 


9359 


11 


469 


IDLRCRFRHEERERERERERERERERER 
EREREREREREREREGRAPPPPLFFGGG 
GPPAEAHALSPPHLGRPPLYTYFFSWR 
APPTERSARTQREARRALKSVCWRERE 
THSLPRARCVCSHPTGCGGAPGARVSLS 
QSAHCFLCVCGAG 


9291 


23192 


A 


9360 


22 


172 


GLSHEERERERERERERERERERERDTV 
CPLFCPTGGFILREARVKSGCLCF 


9292 


23193 


A 


9361 


15 


345 


PPNRPPAEFRTRHLISSLLCLEGIILSL 
F 1 1 ATL I TLNTHS LLAN I VP lAI LVFAA 
CEAAVGLALLVS I SNTYGLDYVHNQKKK 
KKTWGGGRAPNSPYGGRNNNPRPAV 


9293 


23194 


A 


9362 


61 


298 


SCDSPLQECCIHTHTHTHTHTHTHTHTH 
LFSDIKTHTWP.EWALSLFLLTEEEGGGT 
LFQGTT.IFGHTLFWDKPLWQKEF 


9294 


23195 


A 


9363 


430 


2 


GGAGPPPPCWTQGPGVPPPTLSPRVNGP 
SLiCSDFPSPKFYKEKRKKKGNKTPKRKG 
KTLLPTETIVLHVSPGCESKPGAGSADR 
EAPGVRDGGSDASSSGTRASELGDGPVR 
VNFVLGDPGEAGRGFSMLGHPPGTSGPR 
TS 


9295 


23196 


A 


9364 


432 


291 


GITSI.RHHSQLIFLFLVETGFHHVGQAG 
LELLIHQHRPSKVLRLQV 


9296 


23197 


A 


9365 


343 


507 • 


GQGLTLVPQVGVQWQNLSSLQLLPPGLK 
QSTCLSLPKCWDYKCGLPPPASFCFVL 


9297 


23198 


A 


9366 


417 


128 


MGWWPGRGAPPFLDEMAAGQRRSSLSRV 
GSQAEGLLTSHTMGGPAETLLTSQTGWR 
LGRGCNL6SLE3Q6RRI.GQGGCNEPSSR 
HCTPAWAPLSTE 


9298 


23199 


A 


9367 


500 


285 


SLPRSWDYRRAPPRPANFLFLVEMGFHH 
VGKAVLELLTSGVELLTSGDLPASASQS 
AGITGVSHRAWQIRE 


9299 


23200 


A 

— 


9368 


1 


229 


IVDLTTWGI PTHTIHKYTHHIHKYIHTH 
AYTHHTHRHKHIYTYIHTDMVWLSPHPN 
LILNFSSHNPHVSWEGPNGR 


9300 


23201 




9369 


451 


88 


KQTNIYTCIYMYVYIYTCIYTYVYIYTR 
TYTCIYVCVYTCIYVCVYTCIYVCVYTC 
rYVCVYTCIYVCVYlRVYTCVClYVYIR 
VCVYTCIYVYVYIRVYTYICIYTCISTY 
IQCIYTHIYKVTKCSG 


9301 


23202 


A 


9370 


1 


208 


ADETIEQRGGIPKFVHDNYVIRRTAVDR 
G I PLLTNFQETKLFAEAVQKSRKVDSKS 

LFHYRQYSAGKAA 


9302 


23203 


A 


9371 


354 


202 


PAAQREGITASAHYSKNWVTPGDPAREF 
TMIQSAPIiMLLADPDEFVSVQIA 


9303 


23204 


A 


9372 


1 


336 


KSLIKYKGYQVAPAELESILLQHPNIFD 
AGVAGLPDDDAGELPAAWVLEHGKTMT 
EKEIVDYVASQVTTAKKLRGGWFVDEV 
PKGLTGKLDARKIREILIKAKKGGKSKL 


9304 23205 


A 


9373 


2 


187 


NILVTALYSLYlFTTTQWGSLTHHimrr 
KPSFTRENTLMFIHLSPrLLLSLNPDII 
TGFSS 


9305 \ 23206 


A 


9374 


96 


240 


ACWLMPVI PALSEAEAGRLVQPHRLRAA 
WATWQDL I STKKKKKGGP F 


9306 23207 


A 


9375 


3 


293 


ST ISAFTTSTTSATTTS TTSATTTSTI S 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L,=Leucine, IVNMethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














APTTSTTLSPTTSTTSTTITSTTSAPIS 
SNKIiHKKKKKKKKQKKKKKRKNDDTKQD 
TITKIHSGGGSL 














SL 


9308 


23209 


A 


9377 


336 


199 


EGI TASARY SKNVIVTTGDPARE FPMIQS 


93 09 


23210 


A 


9378 


321 


128 


EPGFTMWASLAPTSSLQVIHRLGLPKCW 
DHRREPPHPAYHSLYESGPSVQNGEEKG 


9310 


23211 


A 


9379 


440 


286 


SSDPSASASKSAGFTGMSHHTQLLTSYS 


9311 


23212 


A 


9380 


392 


154 


KGSSPLSPPISIiAGGKPPPPRKFFLNFF 
FFFNRDRVSPCLPRLVSNSWPQAILPPQ 
PHKALGLQAYATTASFIFLIVTV 


9312 


23213 


A 


9381 


403 


178 


GRRHEPCRDADAHLEGI IFSAFYPKNWE 
TTGDGAREWTMSNSAPLMLLADPDEFVS 


9313 


232! 4 


A 


9382 


303 


206 


DPVSTKICSLQPPPPGFfCRFSCLSLPSS 


9314 


23215 


A 


9383 


215 


116 


SWNFWCCCCCCCCCCLRWSLTLSPRLH 
SCGTM 








9384 


2 


285 


ARHLLMQLSGGATLAISTIDLPSTLIIF 
TLLILLTILEIAVALIQAYDITLLGSLY 
LGDNTYKKKKRGAAFEETKFTSAGMQSK 


9316 


23217 


A 


9385 


12 


123 


GYEHFLLPTLHTQTHAHTHTHTHTHTHI 


9317 


23218 


A 


9386 


3 


230 


KCWGYRCEPLRIiAPLRCFFTPGGISLTS 
GPALLSNAS ALAE VEGQQLCP PTP FLP T 
LIALTTLLLPISPFILIIL 


9318 


23219 


A 


9387 


590 


396 


ARVQWPDLGSLQPPPPGFKRFSCLSLLN 


9319 


23220 


A 


9388 


224 


79 


GLGPVGHACPTLWEGEAGGSRGQEIEAL 


~^20 


"23221 


— — 


"9389 


"249 


1^ 


LVYLITVKKILFPVHIRTFMKIDHMIGH 
KTCIJSIKLQMTEILQSVFSDYNRIKLEAS 


9321 


23222 


A 


9390 


400 


254 


LRQPGSTNSPVSflSRVAGITGTHHHTQI 
I FVFLVEMWHACNPSySGG 


9322 


23223 


A 


9391 


3 


64 


PPQKSDPPYKLVPVHLDSIL 


9323 


23224 


A 


9392 


266 


106 


LTANFLFLVETGFPHVTRVGLELLASSD 


9324 


23225 


A 


9393 


430 


348 


FGKPQPPPKGKSPPPRGrTFPPQKAPKG 


9325 


23226 


A 


9394 


481 


311 


LSLSEKRSIRVGVTRFSRCCPSPLSLTG 


9326 


23227 


A 


9395 


3 


136 


VFPRLVLNSWAQMILLCWPPKVLGLQAC 


9327 


23228 


A 


9396 


143 


347 


NFVFLVETRFHHVGQAGLTLLTSSDPAS 
QSAGITRVRPEILTLKRYHFPSGILGTE 
GMVIEMAYLLKV 


9328 


23229 


A 


9397 


2 


231 


LELLTSSDLPASASQSAGITRGSHHARP 
SFTFRCVIHLEFNFMRGAREERLHFYCR 
WVIRRLTTFGRGYPSPAASD 


9329 


23230 


A 


9398 


383 


200 


QHYLNRRLSLAQWWMPWPATQEAEAGG 
LLEPRSSSLQCSMMLPVNSHCTIiAWATH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L>=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonlne, V^Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














REPHL 


9330 


23231 


A 


9399 


3 


416 


GFLHVGQAGLKLLTSGDLPTSASQSAGI 
IGVSHRAGCLKSIYLTFPTRKFFVQMSI 
CDFKVEIEGKIGFISVPISLQSILHIVN 
NFFFFLETGSFPPGWRAWEKFGLIAPLA 
SGVKGIFLPLPRENliGWRPSPIPPA 


9331 


23232 


A 


9400 


400 




FGYPQVFPLPPFLKPGPGIFFLGPLKKN 
FPFPPPGLKIFSFKRAPPFFE-FVRKGLI 
MLLRLVSNSWAQVI IPPPPPEVLGLQA 


9332 


23233 


A 


9401 


324 


154 


FFFFFFKXXXXXLLFFFFFFFLFYIFFF 
FFFXFXFXFFFFFFFFFFFFFFFFFFFF 
F 


9333 


23234 


A 


9402 


413 


312 


PKLVLNSWLQALLLPKPPKVLGLQAHAT 
ITQPK 


9334 


23235 


A 


9403 


363 


3 


FKKKNFCLGGAGWFFFFYDPPPKKNIF 
YFPSKKIYPLSPASSSFFFFLCFPFFIF 
FFFFFFFFFFLENRWFLMENFIKGRGHV 
KGPLGQARDHDISGRRMWRREQCRASQL 

PGVRPRV 


9335 


23236 




9404 


401 


82 


SELPHDPINPTTGYiyPKKMKSVCFTDI 
CTPMFTTALFSIAKIWNQPKCLSKRITI 
FMFLVTYSQLFCTHYPILPYPFFPAPLS 
SSHNYHYTLYLYENILLDSTYE' 


9336 


23237 


A 


9405 


382 


219 


WAEWHGFLVPTSPGGEAGGSLEPGKRRV 
QLTQISPLPSSLGGKVGLPLKKKKNN 


9337 


23238 


A 


9406 


3 


195 


YLYSLVETGFHHVGQAGLELLISRDPSA 
LASESAEITGMSHSIQPKSAFYIQIPET 
FWQPAFK 


9338 


23239 




9407 


371 


1 


FRRLRLKNRLHLGNGGCSEPRLCHCIPA 
WETRAKLGRKKKEERKREKGTKMYNKMY 
KMYLELNLTCPEPASPKQMTRQliEVLSS 
NPSPGTFQFICKKQFKFSCCFALRVSIY 
LLMSVFEYCSC 


9339 


23240 




9408 


326 


229 


RQGLTMLPRLVSNSWAQVILPPWPPKVL 
GLQA 


9340 


23241 




9409 




231 


IjESASFPPGKVAHTCNPNTLRGCAGWIA 
WAQKFKTSLGISIMAKPCLYKIK 














HPGWEWWLVSVIPTLREAKVGRSLEPRS 
LRLAWDTWe 




23243 




9411 




192 


VFLVEMGFLHVGQAGLELPISGDPPASQ 
SAGITOVSHHAQALTFLNI P INP3HPPK 


9343 


23244 


A 


9412 


3 


83 


KMCLNLGGGGCSELRLCHCTLTWATE 


9344 


23245 


A 


9413 


371 


301 


EDHPRMVILRLLRYIVELVWRMH 


9345 


23246 




9414 


486 


337 


AGLELLGSSNLPALT3HIAEITGMCHHT 
WLAYIFVCAFGWVLLLDLCLAF 


9346 


23247 


A 


9415 


378 


254 


IVAHCSLDLLGSSDPLTSASRVAGPTGM 
CHHWLIWFFVC 












309 


WRYSSNEFLLFLLFYLFKSNFTVFCCC 
YCCCCCCWRQSLTLPPiCLECGV 


9348 


23249 


A 


9417 


162 


339 


RSQVSWLMPVIPTFWEAJWGRYQARSLI 
PA 


9349 


23250 


A 


9418 




147 


GGGGCSEPRSCHCTPAWATRVKLGLFKK 
KKKINFLFVPPFI FLKSKTGV 


9350 


23251 


A 


9419 


283 


126 


KMMYKGQAQWLIP 1 1 PAFWEAKERGLLE 
VRSLRPAWATEQDSTATKNKNKirF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQU) 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
otide 

correspon 
ding to 
Inst amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glyeine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9351 


23252 


A 


9420 


133 




ELMFSWAWWIMPVI PALWEAEAGGSRGQ 
EFKTSLPNMVKPRLN 


9352 


23253 




9421 


3 


133 


LRGLYSGAGI IRFEAKKWNALALWAWDI 
AGDNCAICRNHIMDL 


9353 


23254 


A 


9422 


409 


291 


ILILGGGSCSEIiRSCHCTPAWATRAKHR 
LKNKSPINVy 


9354 


23255 




9423 


1 


210 


RF FHLGLS KHWDYRCES PHAGTHNEVL I 
YATIWMNSENIVPNEIIQTQRTNIVSFY 
LYQVGRIGKFIDRR 


9355 


23256 


A 


9424 


479 


306 


GVPTNQPWSNRVKTPSPTKNIKISWPWW 
CMPCNPSCSEGQGRRIALNPGRWRLQ 


9356 


23257 


A 


9425 




209 


GF FL FQGGGHHAGQAGLELLTSGDLPAS 
ASQSAGITGAGPNQGIFVALESPMESAA 
LLHLHWLSSPCS 


9357 


23258 


A 


9426 


628 


433 


RQGVAHLPGMECSGTITTHCSLELLGSS 
DPPASASQSSRLAWSHCAQPSLLFSVL 
CLKPIFCCA 


9358 


23259 


A 


9427 


417 


125 


FLRQSLTSSPHLEYHGAIPAHCNLCLPG 
SSDSSILASQSIRITGVNHCTWPDAAVL 
RQNFFFSEKPQFYSEGPQQTHI IKSNLL 
YLKSTKCTCYPYL 


9359 


23260 


A 


9428 


366 


256 


SFFYRDRSLRTLPRLGSNAWSQAILPPG 
PPKVLRIiLA 


9360 


23261 


A 


9429 


3 


140 


GFCHVGQAALKLLI SSDLPTSPSQSAGI 
TGVSHCAHLTRLFSSHE 


9361 


23262 


A 


9430 


368 


3 


VRASQLLCLPTQASAMAGTPPPASLPPC 
SSISDCCASSEQGSMGVGPSEPGAGYNL 
WCSSLRTLEKCSIRVGVPQFSRCHLSQ 
LPLARKGNSPTPCASWVHRCLTLLQLML 
RGLYPLSCT 


9362 


23263 


A 


9431 


3 


13] 


LTMLPRLVSNSWAQGSSHL3PPKCWDYR 
CEPLHSGLKDIFLE 


9363 


23264 


A 


9432 


384 


274 


LSFAMLPHLVLNSWAQESAHLGLPKCWD 
YRHEPLGLA 


9364 


23265 


A 


9433 


431 


273 


PPPLGYFFIVKMEFPPWQDGFKLLNSS 

DCSTSASPSGRISGVNHSTRPRIF 


9365 


23266 


A 


9434 


211 


473 


CVCTFFSFLSMHI FFLFLVGEAESHCVT 
EAGVQWHDLGSWQPPPPRLKRFSCLSLP 
SSWDNRRVPPYPANFCIFSRTRVWSCWP 
GWS 


9366 


23267 


A 


9435 


391 


188 


LPGSSDSPASASRVAGTTGGCRHTVLLG 
FEFKDNIHWIKLVEKAHTQYGNIYICVI 
KYMPLFYPYHH 


9367 


23268 


A 


9436 


331 


115 


SRDRGFAMLPRLVLNSWTQAICSLLDSQ 
SAHLS LPKCWNYRREPLCPASGE ILS F I 
KP S SS KS FHNS TKKRQ 


9368 


23269 


A 


9437 


3 


109 


lYGSYSLPACFFPSFLPSFLPSFLPSFP 
PFFLSFF 


9369 


23270 


A 


9438 


2 


167 


ECNGAISGHCNLCLSGSSDSPAPAWATH 
VKLRL IKKKKKKKNQ I LFFYPPRKTLF 


9370 


23271 


A 


9439 


648 


310 


VCP C I LfiHPAFLFAKSGNP LE S S LLDHS 
SPTFKLTDRENLRHVKCTSHSQSLPTSG 
FQPGLPDSTLCAWYPTRSPKLSPAGWVG 
GYRAPVGRRGPGSRNSWRLHWTP ITSVG 
D 


9371 


23272 


A 


9440 


1 


170 


YE I WFLDVGQAGLELLTSGDLSASGSQS 
AGTTGMSHHARPLKLFTI SVS SNKKVKC 


9372 


23273 




9441 


133 


244 


LFGYQTRGLAMLLKLDSNSWPQAVLPPQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop eodon, y=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPKVLGLLA 


9373 


23274 


A 


9442 


365 


176 


RGSiXIDMVPSESCTGYNLLMCPLLRPLEK 
RSIRVGVTRFSRCHLSQLPLARKGNSLT 
PCASQTO 


9374 


23275 


A 


9443 


368 


231 


VEAGFCHIAQAGLTFLSSTDPSALASRS 
ARITGVSHHARPIFLKE 


9375 


23276 


A 


9444 




194 


KMLEAKKKIEQQGGFTFENKGVLSAFNF 

GTVPsrar 


9376 


23277 


A 


9445 


3 


349 


IRSGSYACYFCYPRECYALFHCTSMIVS 
YGPYYFMNIDQRHESRLSTIYIFKFEIG 
SKHVAQAGLELLCSSSPPAFTSQSAGIT 
SVSHHTRPQKIFHVIYSCDLHFHPGIEG 
PQR 


9377 


23278 


A 


9446 


3 


141 


HftECSSMMIAHCELEIiLGSTDPPASASQ 
SAETTGI S YHAQPYSTF 


9378 


23279 


A 


9447 


2 


282 


ICPTRSRFPPRVIiKNGRKKDRLECSGAI 
SANCSLCLPGSSDS PASASRVAGSTVAE 
EDSTHRAVRHQGSSRSAPSGRGSWAGKQ 
ERADRAQQS 


9379 


23280 


A 


9448 


307 


430 


GliAWWLTPGIPIIiWEAEWGGSLEVEFET 
SLASMEKPSLYYK 


9380 


23281 


A 


9449 


2 


253 


IQCVCVCVCVCVCVCLSVCLHFIAILGC 
SPWGSHLPCSVSSPSGRGPGGEDLSPLP 
PTMSVSPEADFPPLAAAAPGNIMTAIT 


9381 


23282 


A 


9450 


1 


166 


NTGGGGEEIHVFSSCFAEDRLQKGKDRP 
GAVAHACNRSTLGGRGGWITRSQIFFC 


9382 


23283 


A 


9451 


1 


123 


NTRGLVMLPHLVSNSWAQVILQPLPPKV 
LGLQASVTVHINN 


9383 


23284 


A 


9452 


396 


291 


SLCIKRVCACACVCVCVCVSGAKLRKLV 

LDFIffiS 


9384 


23285 


A 


9453 


1 


147 


FAIRARFSNTYTKIGTIHTYIYIYTYIY 
TYIYTHTHIYILSQVLMEYEF 


9385 


23286 


A 


9454 


394 


235 


TRSCYIAQS WQWLFTGTLI IHYNLQLL 
SSRDTPVSASQYLGLQAHTTTPDSC 


9386 


23287 


A 


9455 


1 


85 


TNPVATLLEWQDALCF I PVGSAAPQGS P 


9387 


23288 


A 


9456 


339 


165 


KTHWWGKKSFLKNPFFFFFFFETKSRSV 
TRLECSGVILAPKQKTiaTOliiriKTTINKT 

I 


9388 


23289 


A 


9457 


3 


209 


LLI LFS I KMRNRPDSPALLSTQLPPI F I 
YIHTHIYLYIYTHIYIFIYIYLYIYLFA 
YIHIYVSLFPLK 


9389 


23290 


A 


9458 


374 


156 


QAGLELLTSGDPPISASQSARITGLSQH 
AQLSELTLTVFTTRHCTCLCAHFLYYTV 
FliEGRNDDEFISIPST 


9390 


23291 


A 


9459 


287 


3 


EAGLLWRKATHKYAHTHTHTHTHTHTHI 
RGLQNMINCKYITYQEIMTLAQAKNKGV 
IPDSFPFTSSPPLTYPPALSFHFQKHIP 
NKTTTYHGHC 


9391 


23292 


A 


9460 


3 


72 


ktelkmKftnilessflmnnks 


9392 


23293 


A 


9461 


327 


92 


ISLFLVWGALFFPKKIIYPPKKKSPLFF 
PPPPPQENFNPPSFLAPPVFPPQENPKP 
AGGRKIFCPPPPVFFFYFFFFF 


9393 


23294 


A 


9462 




173 


ACRGDSGGPLVTRDLKDTWYLIGIVSWG 
DNCGQKDKP6VYTQVTYYEMWIASKTGI 


9394 


23205 


A 


9463 


258 


39 


LKVILPSDPSSIFVCLFLENESRSVTQA 
GVQWHDLGFKRFSCLSLLSSWDYRRVPP 
P PAKFC I FRKHGVS P C 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
.e,ue„ce 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Phenylalanine, 
G=Glycine, H=Histldlne, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopiian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9395 


23296 


A 


9464 


104 


336 


NHVS QCE E I FLFDE YYI FSRAVASMFWV 
DAELGSDIYLDGKLKNSFLLSFLGFFFF 
FFLGKKVFFF3QNGGLGRPGQ 


9396 


23297 


A 


9465 


410 


0 


liNLGGGGCSEPELCPCTPAWATE 


9397 


23298 


A 


9466 


254 


61 


KKKKDAFFTIPLYPSSQNLFAFTQMDPD 
TNISQQLTWTVLPQGFRHSPPTFLDKPF 

KQTSINFA 


9398 


23299 


A 


9467 


403 


255 


PAPQLGGPPMMPMMAPPSSWMMPMGCAP 
AMRLPIGGDI IWLCVPTQIS P 


9399 


23300 


A 


9468 


1 


1428 


PARFKDLNLAGTAEVGLAGYFMDHTVAF 
EDLPVRMVCS STCYRAETNTGQE PRGLY 
RVHHFTKVEMFGVTGPGLEQSSQLLEEF 
LSLQMEILTELGLHFRVLDMPTQELGLP 
AYRKFDIEAWMPGEGEFGEVTSASNCTD 
FQSRRLHIMFQTEAGELQFAHTVNATAC 
AVPRLLIALLESNQQKDGSVLVPPALQS 
YLGTDRITAPTHVPLQYIGPNQPRKPGL 
PRQPAGPRVGYKALQPWPAQRLGPVGRP 
PCRPEGQPLSCCCCGCSCWGRLTRCLAE 
WTEVLTVGQVQRGPCTALLHKELCGTE 
PHGCASTEEKGLLLGDFKKQEAGKMRSS 
QEVRDEEEEEVAERTHKSEVQEQAIRMQ 
GHRQLHQEEDEEEEKEERKRGPMETFED 
LWQRHLENGGDLQKRVAEKASDKETAQF 
QAEEKGVRVLGGDRSLWQGAERGGGERR 
EDLPHHHHHHHQPEAEPRQEKEEASERE 


9400 


23301 


A 


9469 


265 


170 


GTESHSVTQAGGQWLNLSSLQPLPPGFK 
QFS 


9401 


23302 


A 


9470 


289 


110 


RDLELASFLSFFPSFLPSFFPSFLPSFL 
PSFLSFSFLISFFPLPSFFSFISFSLSL 
FFE 


9402 


23303 




9471 


67 


210 


TVTTGQALTHAYNPSTIjGGRGGQITRGQ 
DFETTLVPGDLNFSRVLASS 


9403 


23304 


A 


9472 


2 


186 


TSLWRQAjaaTVEKKTDKYTEVLKTHGLL 
VCTQKS CB FLKNKNKAWLGKQAJIAPARK 
VEIRC 


9404 


23305 




9473 


417 


265 


FLSSFLFFFFFFFFFFFFFFFFFTV 


9405 


23306 


A 


9474 


430 


120 




9406 


23307 


A 


9475 


1 


150 


PTRPWQAGFKVLSSGNPPALVSQCFRI 
TGVSHRARPHLLWLRAPWLPVL 


9407 


23308 




9476 


404 


109 


PPPLFFNFGPFFLVEARLHHVGQAGLKL 
PTTSDQPDSASQSAGFTGVSTRAQPTLI 
TLY 


9408 


23309 


A 


9477 


391 


235 


PPFFFFCRDRGLTMLPRLISNSWAQVIL 
LPWSPKMLGLQAEATVSGLNKRYS 


9409 


23310 


A 


9478 


438 


277 


KRAPFFFFCRDRGLTMLPRLISNSWAQV 
I LLPWS PKMLGLQARATVSGLNURYS 


9410 


23311 


A 


9479 


399 


279 


PPPPVFKNPPPKKNKGAPKKKNISPPPG 
RKKYFFFKGPP 


9411 


23312 




9480 


247 


26 


LKAGDDSRTVLGKVHQSPCFSLLLFFIQ 
VSCRDEGLTVLPRLVLNSRPQAILLPQL 
PKLLDHRGBPPHLAWHG 


9412 


23313 


A 


9481 


2 


176 


HAPPHPANFLFFEEKESCYVTQVSIKLL 
DSSNPPALTSQSAGITGEWQHGHPMNCL 
SG 


9413 


233 1 4 


A 


9482 


2 


178 


HAPPHPANFLFFVEKESCYVTQVSIKLL 
DSSNPPALTSQSAGITGVSHHGQPMNCL 
SG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 

amino^dd 
residue of 

sequence 


Predict- 
otide 

correspon 
ding to 
last^aniino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVl=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9414 


23315 


A 


9483 


2 


151 


GRVGEGPELVSLLVLDQHPVGTSDHRDA 
TQLGFQPLMSDLGSTGSTEGA 


9415 


23316 


A 


9484 


420 


325 


FFVFLLETGFHHVS QDGLDLVI CPPRPP 


9416 


23317 


A 


9485 


400 


104 


LFFFLFFFFFFFFFFFFWNPPKTFFKKV 
FGLNFPTLLKLKFNPPPPGPPKKKKKEL 


9417 


23318 


A 


9486 


117 


280 


SSTNQANNCFFFFKKRGLAI LPRGALNP 
WAKGILLPWPPKGLKLRGGAI GPGQK 


9418 


23319 


A 


9487 


76 


353 


VLWNTLEFCLIYLDLLFLLFFNKEALIR 
KKKKKKKKKKKRGGNKKEKKKKTPWQEK 

AGL FLGDTI lYSRGAVLKKSGGETPRHT 

PSGGLMPR 




23320 


A 


9488 




122 


HAS3DJIYFLHCSHSDLGYVL1AGTASLQ 
DFSFFQKSPPP 


9420 


23321 




9489 


1 


155 


PTRPTVCSGAMLAHCNLCLPGSSESIiAS 
ASQVAGTTACTSTPDPFFFFFSL 


9421 


23322 


A 


9490 


3 


212 


ETGFHHVGLSGFKLLTSGDLPTSflSQSA 
GITGMCHCVWPQFAYFCLLLVYLDHLKF 
TIIMLVLCLLFYD 


9422 


23323 


A 


9491 


2 


163 


ETLFEHVGQVSFEPLTSGDLPASASQSA 
GITGVSAMPSWMIKFFIAFYSCINC 


9423 


23324 


A 


9492 


474 


296 


SQPAGTADCLLPLPPGSCFPTIPCPAAI. 
VPLPSFPSARLWPPQSLHQLLPLPGTCS 
SCT 


9424 


23325 


A 


9493 


7 


151 


LCAVEFVYMKPPAHEEAKAPSR6FWRY 
APWTASSSEKGRDDGMGPRA 


9425 


23326 


A 


9494 


52 


363 


TTAPSLSVFLFTFTGELFVYWWVGVTLE 
TSFISFFLFFFSFLEKDFSFSPAGRPWP 
NFGLIQPLASGVGGISRPNPPSRWELRA 
LPPPPGGLGFFFWKKGGPPG 


9426 


23327 


A 


9495 


383 


118 


WKPRPWVPPGAGVQTPPGPQ6ETPLFLK 
KKKKRGRG6PLFGPPSFGGGGGEIPLTP 
APKGQEIYFPPPPPPPGGKKKSLFQKKK 
KRIR 


9427 


23328 


A 


9496 


554 


419 


FFFETKPCSVSQAGVLWRDFGSLQPPPS 
QVQRDSPCPQASQVAWD 


9428 


23329 


A 


9497 


428 


281 


GSYVQLTFKYFSIYIYVHTHLHVYLYTH 
I YV YVHI C I YKEGY I TGSVLA 


9429 


23330 


A 


9498 


144 


16 


KVKKKVSQCPGIVAHNCNPSTLGGRGGR 
lAWGQEFETLSLQKN 


9430 


23331 


A 


9499 


2 


155 


NLALLGS SDP PVS AS PVSG I RG VHHHAQ 
LIFFFFFGKRAWFNPPGGRKGEE 


9431 


23332 


A 


9500 


2 


184 


FIYGRDSLTMLPRLVLNSWPQAILPPRP 
PKVLGFRCEPPHLASLILFYRDILFTLG 
LKSV 


9432 


23333 


A 


9501 


465 


153 


PHKVFLNKGPFETPQGELSRGEWKTRPP 
PPRERSRPFHRKGGFGSPPKLRQGPPFV 
FFSWPPWCPPPKKGARPQQKGFFYFPPP 
FFNPPFFFFFCLFFIFEIV 


9433 


23334 


A 


9502 


429 




CRTLLGADCVCVCVCVCVCVCVKaFPPI 
SYSCHLVPSDLPLLGVQFKCWVHFGLSF 
LTCPFPSPPWPGGAQPGSCYWGLGCLCL 
FACKLGAELTEQTLGIPASGPGTWATPL 
GSCASVLRRLPGGSERDTDLKVSVLSG 


9434 


23335 


A 


9503 


436 


202 


LPGGYIFQGEGVEKTKFSPLLHENNFFS 
NFFPPKKNNGGGPRIPPPPPPPLFKKGV 
FFIPKFPAPPRPKKKFPKKKKK 


9435 


23336 


A 


9504 


3 


112 


SFDLLSSSDPPALASQSAGITGVSHSHG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQBD 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first^ 

residue of 

sequence 


Predict- 
ed end 

otide 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














QKVYICNM 


9436 


23337 


A 


9505 


426 


208 


VGHNLLVPRFLSPSEKESIQVGVTRSSR 
CRLSPLSLTRKGNSLTPFASRVRQCLTL 
LRLARAPTDLRPLSGTP 


9437 


23338 


A 


9506 


441 


189 


CPANFFVPLREMGPHHVGQAGLLTSSDP 
PASASQSAGITGLSHCAEPTVGSDVLSL 
PSGQCTSALTGFMLGSTTVHSNAAAEA 


9438 


23339 


A 


9507 


117 


278 


NKALLCHPGWNHIKKVNGRLGMETHTCN 
PTTLGGQGQEFENSLGNIGRPHLYK 


9439 


23340 


A 


9508 


3 


126 


FFVEMGFRCVGRAGLRLLTSGGLPVLAS 
QGAGITGVSHCI 


9440 


23341 


A 


9509 


421 


134 


FLVERRSHHAAQASLEILSSSDSPTSAS 
QCAGITGMSHCSUlPKyTSFFRLHEHMPM 
ATPTKLPPTLASASPQVKVCLYPLLPST 
SKGEIFTLQLY 


9441 


23342 


A" 


9510 




200 


PQPPTFYPSFLPSTPEPPAFYPRLPPST 
PASRLLPQPPAFYPSACHLLPQPPTFYP 
SLGNRARLHL 


9442 


23343 


A 


9511 


414 


204 


ERDRKKERKKEGRKERERERKKEKKRKK 
KKEHRKEGRKEGRKEASKGQRWVIMVD 
T 


9443 


23344 


A 


9512 


1 


211 


FHHVSWAGLGLLTSGDLPASASQSflRIT 
GMSHHARHRHGFDNLSLCftLAPSPTPLW 
RRHRRRQHFTVPEK 


9444 


23345 


A 


9513 


390 


265 


VFLVEMGFHHVGQAGLELKTSGDLPPLE 
SQSAGITGVSQQFS 


9445 


23346 


A 


9514 


438 


180 


IGFHHVGHftGLDFLPLGDLPSLASQNAR 
ITGVSHRAWPAMAFVNCHGAGGSVaMRT 
TRGHSC31HLGIGGFWPASLLQFVLSAGS 
L 


9446 


23347 


A 


9515 


126 


11 


RJSTLDFFLSLGIiAMLSRLVNSWPQVIFQA 
WPPKftiRFQV 


9447 


23348 


A 


9516 


2 


249 


FLHVGQAGFVLPTSDDFRPTltrPQSSGI 
TGVSHRTRPLRCYFLRLNNIPLYVYAYY 
ILPIHSVISGYVGGCTAHYCHPRKW 


9448 


23349 


A 


9517 


403 


57 


PEGALVPQBCKKKKKKKKKKKKKKKKKKK 


9449 


23350 


A 


9518 


406 


140 


PHYNHTFVYNGVRLEDLQHMCLELTVWD 
REPLASNDFLVGVRLGVGTGEIPSPILT 
CLKASCLFYSYYHSRPPTQSSSVMTWF 
EPFS 


9450 


23351 


A 


9519 


3 


75 


GIEWMVNCWENVHRPPRQRDIT 


9451 


23352 


A 


9520 


183 


6 


LLTHLILTSALIGEIMVAPILQKNSGSE 
RECRLPRLVLNSWPQGILLPWPPKVLGS 
QV 


9452 


23353 


A 


9521 


89 


339 


LQCLQKSGKLEPGMTYTKLIDADVNVGN 
ITSVQFIWKKHLFEDSQNKLGAEMVINT 
SGKYGYKSTFCSQDIMGPNILQNLKPC 


9453 


23354 


A 


9522 


3 


329 


PSLMFPWLKLSFPGRPRGRQTYSRYQTL 
ELKKEFLFNPYWTRKRRMEVFHAt,GLTE 
RQGKIWFQNREMKGKKENTRDNLPEAQN 
EEKVEEKGIEEKEKEEEEKEENRD 


9454 


23355 


A 


9523 


1 


156 


AKTSMNVNE I FMAIAKIOiPKNEPQNPGA 
NSARGRGVDLTEPTQPTRNQCCSN 


9455 


23356 


A 


9524 


1 


272 


AKNSARALLGILGYRSHLISSLLCLEGI 
ILSLVIIATLITLNTHSLUVNIGPIAIL 
VFAACEAAVGIiftLLVSISNTYGLDYVHN 
LWLLQC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=FhenyIalanine, 
G=GIycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=MetIiionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9456 


23357 


A 


9525 


1 


,62 


GNDAWVCTNDMFRLALHNFG 


9457 


23358 


A 


9526 


1 


111 


GD I IHNENMRQGRDKYKTLRQIRQGNTK 
QRIDEFEAL 


9458 


23359 


A 


9527 


427 


340 


IPIGMIMSCDSHSHHELVMSAQLNAIFH 
D 


9459 


23360 


A 


9528 


439 


27 


PRRTDKWLTVEMIQDGHQVSLLSGELTV 
DQRASI IQRFRDGKEKVLIPTNVCARGI 
D VKQ VT I WT FDLPVKQGEE PD YE T YLR 
RIGRTGRFGKKGIiAFNMIEVDELPSPMK 
IQALFDGRNKQLNAEVPCIQAAMED 


9460 


23361 


A 


9529 


1 


236 


REVLITTNVCARGIDVKQVTIWNFDLP 
VKQGEEPDYETYLHSIGRTGRFGKKGLA 
FNMIEVDELPSLMKIQDHFRKS 


9461 


23362 


A 


9530 


437 


240 


QQVSSARPEPEAGEVSPPVGAGVNSNSW 
AFKYGPGNPKQSGPEPKKQTQVSFLLRR 
KGEASQPRQ 


9462 


23363 


A 


9531 


413 


260 


LFDQWKVASPSGLRTSTVRYGDPSKLK 
LEESLRQQAEKEGSALSVRISNV 


9463 


23364 


A 


9532 


431 


260 


ASSGQMFSVKQQAPPVRDLPPFGISLFG 
GQIYDNISPRPPQGIVAPPGGRGNIPNL 
G 


9464 


23365 


A " 


9533 


2 


134 


IHaVCVHMCVCACVCVYACVCVaGYDGR 
IFLALSSTVPSMVNIM 


9465 


23366 


A 


9534 


419 


53 


LLLLTNGDRQTQREKIEACACQSYFDAV 
WGGEQREEKPAPS IFYYCCSJLLGVQPG 
DCVMVGDTLETDIQGGIJSaGLKaTVWIN 
KNGIVPLKSSPVPHYMVSSVLELPALLQ 
SIDCKVRMST 


9466 


23367 


A 


9535 


439 


109 


NCIVIIDSTPYRQWYESHYALPLGRKKG 
AKVTPEEEEILYKNRSKKIQKKYDERKK 
NAKISSLVEEQFQQGKVLACIASRPGQC 
APADAYVIEGKELEFSLRKIKRRKGK 


9467 


23368 


A 


9536 


1 


185 


NTTSSHHFTIPCLVLYSCTSTHTHIHTH 
THTHTHTHVRVFSYIPSQDVKEGCIPAP 
SVFSW 


9468 


23369 


— 


9537 


168 


280 


NPNS IRLTSCCCCCCCCCWKSPYNVYNR 
LLRTGFTFP 


9469 


23370 


A 


9538 


142 


324 


KIVKMGFPHAGQAGLEIiTSGDTPGSTS 
QSAGITGVSHRSRARSIFLYPYLPSVYL 
PWRRV 


9470 


23371 


A 


9539 


3 


139 


YIGGMGQAWWLMPGKPRQNDYLSPGVQD 
QPGQHSETPSLQKGFKN 


9471 


23372 




9540 


383 


318 


MSTQHITIQSYPIGEDEESESD 


9472 


23373 


A 


9541 


372 


207 


YGLNMCRQCFRQYAKDIGFIKVGVCRRT 
PCCLGREGCGGYFFRERQTDRSSLEW 


9473 


23374 




9542 


2 


360 


ARGDDDHRSSDRKDGVIEEQIERLQEKV 
ESAHSEQKNLFLVIFQRVIMILTEHLVR 
CETDGTSVLTPWYKNCIERLQQI FLQHH 
QIIQQYMVTLENLLFTABLDPHIIiAVFQ 

QFCALQA 


9474 


23375 


A 


9543 


379 


75 


TNLQQ FAMSHGSAQVKGHGKKVADALTN 
AVAHVDDMPNTLSALSDLHAHKLRVDPV 
NFKLISHCLLVTLAAHLPAEFTPAVHAS 
LDE FLASVSTVLTS KYR 


9475 


23376 


A 


9544 


380 


56 


RDLSMVRMKSMFAIGFCFTALMGMFNSI 
FDGRWAKLPFTPLSYIQGLSHRNLLGD 
DTTDCSFIFLYILCTMSIRQNIQKILGL 
APSHAATKQAGGFLGPPPPSGKFS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ingto first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I^Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9476 


23377 


A 


9545 


304 


92 


S KSRVTNNTLRSTRKDTM ITLNTHSLLA 
NIVPIAIIVFTACEAAVGLALLVSISNP 
YGLDYVHNLNLLQC 


9477 


23378 


A 


9546 


394 


108 


KEIDFSLVDVTADDAGNYSCMYYQTKSP 
FWASEPSDQLEILVTVPPGTTSSNYSLG 
NFVRLGIAAVI WI MGAFLVEAWYSRNV 
SPGESEAFKPE 


9478 


23379 


A 


9547 


1 


335 


GNAVGLNLHRINFDKYHPGYFEKDGMKH 
YHLHRNQSFCPTVNLDKLRTLFSEQTRL 
NAAKNKTGTAPI IDAARSVYYKVLGKGK 
LPKQPVIVKAKFFGSLIFCFRIWILID 


9479 


23380 


A 


9548 


326 


188 


NFLRMGTVAHASNPSILGGLGGRIAGAQ 
EFETSLGNITRPCLYFKK 


9480 


23381 


A 


9549 


381 


152 


VTLISLSLLLQWIGVUliWIARIjmNE 

VEKQWRIiNQIiNNAATILVFFTWINVFI 

TAFGAHKTGFLAARfiJSRNPL 

IHRSKNKWKFHLKTCilmLNGRDYIFSK 

AIGDAEW 


9481 


23382 


A 


9550 


170 


65 


9482 


23383 


A 


9551 


544 


361 


RRLHQENRLNPEGGGCSKPRSCHCrPAW 
ATEQDSSSKEKKERKNLKGNTVKSVPLP 
LTSHA 


9483 


23384 


A 


9552 


369 


181 


GVLGATSGDMQIHTFGSMIIGSITGMVS 
ELGYKFLTVSTADVMIQTLNLHLFSHMS 
HQQKKSY 


9484 


23385 


A 


9553 




516 


SLVHWEFGQAIARRLKPFGVQRFLYTG 
RQPHPEEftAEFCJABFVSTPELaAQSDFI 
WACSLTPATEGLCHKDFFQKMKETAVF 
INISRGDWNQDDLYQALASGKIAAAGL 
DVTSPEPLPTNHPLLTLKlSrCVILPHIGS 
ATHRTRNTMSLIAANNLLAGLRGEPMPS 
ELKL 


9485 


23386 


A 


9554 


386 


56 


LFPACVLRWQFASDGGDIGFGVFLKTKM 
GEQQSAREMTEVLPSQRXKaHMVPEDGS 
LTCLQAGVYVLRFDNTYSRMHAKKLSYT 
VEVLLPDKASEETLQSLKAMRPSPTQ 


9486 


23387 


A 


9555 


375 


150 


I WGKTS FHVS I PLSLFFPGDVLYDLLQ 
HILKQSKPRILFSPFFHPCaJSIHTQPEV 
I LHQTHEE GTGRGFSFIiA 


9487 


23388 


A 


9556 


3 


354 


CLPTQASAMVDTPPPALLPPCSLISDCC 
ASNEQGSVGIGPSEPGAGYNFLVCRLLR 
PLEKHSIRVGVTQFSRCCLSPPSLTRKG 
NSLTPCTSRVRRCLALLQLTLSELPHCP 


9488 


23389 


A 


9557 


463 


40 


PIKVEDPEPVKKPPKEQRSIKEMPFITC 
DEFNGVPS YMKSRLT YNQINDVI ICE INK 
AVISKYKILHQPKKSMNSVTENLYHRFI 
DEETKDTKGRYFIVEADIKEFTTLKADK 
KFHVLLNILRHCRRLSEVRGGGLTRYVI 
T 


9489 


23390 


A 


9558 


338 


104 


NFKKKKKRRTKKKEIRTEEKGRGAHKKT 
FRGPKKKETRGSNRSPEGKKNHGAKKEG 
ERRPRKKKKKKKKLARYDFTCL 


9490 


23391 


A 


9559 


1 


152 


NTCVCVCVYTGFCHVAQVAHELSSSNLP 
SSaSQSAGITGISHHAWPGFGF 


9491 


23392 


A 


9560 


517 


238 


SGPGESVPHPWSAVRTRPAKKPGSQCLL 
QSAFPLWCSSRRTTWAEERKLNTETFGV 
SGRFLRGRSSRGGFRGGRGNGTPRENPT 
SHRAGTGRV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 

location 
correspon 

last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leiicine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9492 


23393 


A 


9561 


709 


280 


NTS PALTAPCGRPRGIRPMQCWARVPAP 
SSACPAWQLPSLCLRSTVPAPHPSPAAH 
QPCELSQHHRGGLEPALPGTRQRLCSRT 
WHRCVRFLPGAQEAGELGPAACATGRLP 
VSH3TDMEQAPGLTSLHQEPHSTARGGT 
ATH 


9493 


23394 


A 


9562 


3 


130 


YRGFCHVGQAGLELLTSGDRITSASQSA 
EITGVSHRAQPQMS 


9494 


23395 




9563 


409 


160 


NPGVKRSPCLSLSSSGGYWSMPPCLAYY 
KKKKRRFLKDGGLTMLPRLASNSCAQQF 
SHFGLPQSWDYRHEPLHIAQFFLFSLL 


9495 


23396 


A 


9564 


1 


241 


GICLGSNIGSGWMGTSSLCNPySNTLNQ 
NLLSTTKPFPVPSVGTNCGIEPWDSGVT 
SGLGBMLFLYYYYLLLFSQMTPQS 


94% 


23 3 97 


A 


9565 


2 


215 


IHWMNGWMDGWMDGWM6SCVRAVTGPGT 
SSPHTSTCGHAVCVCVCVCVYTCAHVFL 
ALFGVEKtniyFLWKA 


9497 


23 3 98 


A 


9566 


244 


117 


DHQHLNWSVFISFSFKVIKARKNKKGIL 
NPDSSMETSPDFFF 


9498 


23399 


^ 


9567 


414 


77 


FGGVPPGTLVPPTWLPPVRVPPPPAFCL 
GELSPPPFFWRKGIiAPPIJJSrFFFSPGQR 
GFPSPGGKGPRPLPRPAGFPGFVAKPPF 
RPPGFSSTPSDCGGLPPFFPFFFKKKVF 


9499 


23400 


A 


9568 


418 


198 


PPVSLTEVIVSKSHIOffiLVFWNMPGTLE 
HPEGDNNYMEFIEVLTEMiERVLLVRGG 
GSEVITIYSYPTLNARA 


9500 


23401 


A 


9569 


549 


415 


RQENELNPGGRGCSEPRSCCCIAAWATE 
GDCLKKINNNKPIQTEQK 


9501 


23402 


A 


9570 


546 


1 


KLECRGAITAHCSUSILPGPGDPPTSATQ 
AAGTTGTCH 


9502 


23403 


A 


9571 


415 


215 


SRLRKKI Y I YI YMCVCVYVY I CVYI C I Y 
VCVYICIYVCICVYICIYVYICVYMYIY 
MHEFITFFSI 


9503 


23404 


A 


9572 


374 


138 


LAASAAIQELFKRISEQFTPMFRRKAFL 
HWYTGEGMDEMEFAEAESNMMGLVSEYQ 
QYQDATAEEEGEFEEEAEEEVA 


9504 


23405 


A 


9573 


370 


233 


GSPPVRSVHQSGVSLSGTQVDEGVRSGS 


9505 


23406 


A 


9574 


3 


399 


YSS PGP I ALftLRDP CAHRRPS PNSS LS P 
AQGHPARAQQGPQPQQGPPSHLGSDLRP 
HVAHTRGALHGPAPAPASPAPGGSGSGV 
AASMCSAPRPGRAGGTLGSVLTAHHPAS 
AGCAHRDAYVCKAPGRAVF 


9506 


23407 


A 


9575 


314 


48 


SILPHSDPERCFSIQPEEGTIHTAAPLD 
RERRAWHNLTVLATELDSSAQASRVQVA 
I QTLDENDNAPQLAE PYDTFVCDSAAPG 
QVSN 


9507 


23408 


A 


9576 


505 


264 


I FWEMEFHHFGQAGLRLLTSGDMPASA 
SQSAGVTDMSHCTRPDTPFNFKELPGAV 
VTFLLCGMQVGRDELEDDWMGAWS 


9508 


23409 


A 


9577 


508 


314 


PATWKGCLSNF SS YLTKPADF I EHQVLS 
WEQVPDGFFIFNBRFKSFTWVLNNVAE 
FVCKYKLL 


9509 


23410 


A 


9578 


512 


1 


RMREAIFSPGRLPRGAAPLRGPAGTLVP 
WMP TP GE CWLSLSVS AAGEKP YKCS VCE 
SAFNRKDKLKRHMLIHEPFKKYKCPFST 
HTGCS KE FNRPDKLKAHILSHSGMKLHK 
CALCSKSFSRHAHLAEHQHAHTGNYKFR 



1119 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Add, P=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CAGCAKGPSHHKYLKDHRCRLGPQKDKD 
PV 


9510 


23411 


A 


9579 


3 


487 


WDLIYVGRKHMQVEHPEKAVPRVRNLVE 
ADYSYWTLAYVISLQGARKLLAAEPLSK 
MLPVDEFLPVMFDKHPVSEYKAHFSLRN 
LHAFSVEPLLIYPTHYTGDDGYVSDTET 
SWWNNEHVKTDWDRAKSQKMHKQQaLS 
REAKNSDVLQSPLDSAARDEL 


9511 


23412 


A 


9580 


1 


840 


SKYLYDEGEEEEVFNDEWGAAG6VAVPT 
LNALERGFLSAMDWHLYTDPREIPEVLS 
WLESCVAEQQGRWRGWYTYTDLCVLLEQ 
PTWQLALGSLCQRLVKLSCLLAVAYVSS 
VALAVASVAVIHQSLGLSCTPTPGPPDL 
GLTSRCT.T,EPCTPSVPQCT,PSI,ANVSSC 
LEGSMGLRSLWGSLLASLTPPPLPPPDP 
PAPPTLLHNCHLCQKLQRDSPTCHACLH 
PNETVPTALSSPWYHTYGLAPPWPWSPV 
PLSLPQPQQCSLFSVMEIARLKSFVFPG 


9512 


23413 


A 


9581 


2 


158 


IHAGF I S VS VLDN 1 1 Y I CMCVCVCVC VC 
AHMCACISEHITHAPRLPEVFYTP 


9513 


23414 


A 


9582 


3 


144 


YTAGVISVSVLDNIIYICMCVCVCVCVC 
AHMSACFSEHITHAPMTiP 


9514 


23415 


A 


9583 


503 


3 


GKCVCLSAVWGSLLSAFRGSLSVCCLGQ 
FVCLSAVRGSFSVCCPGQRVCLSAVRGS 
VSVCLSAVQGNVSVCLSAVRGSLFVCLL 
SGAVCLSFCLLSGAVCHLCRAPTPCLS R 
SLAHRRCSWrVCEMTQLISVSYFFCWGW 
WRHRNNIHLQKCRLRILMAYTGQGPP 


9515 


23416 


A 


9584 


2 


1358 


AGLDS LHKFQVKI EKEKVYVRAS KQALQ 
LQRRTKVMAKC ISP SAGYS SS TNVLI VG 
AETLRQEGFSDRIVLCTLDRHLSYDRPK 
LSKSLDTQPEQLALRPKEFFRAYGIEVL 
TEAQWTVDVRTKKWFKDGFKLEYSKL 
LLAPGSSPKTLSCKGKEVENVFTIRTP3 
DANRWRLARGRNWWGAGFLGMEVAA 
YLTEKAHSVSWELEETPFRRFLGERVG 
RALMKMFENKEVKFYMQTEVSELRGQEG 
KLKEWLKSSKWRADVCWGIGAVPAT 
GFLRQSGIGLDSRGFIPVNKMMQTNVPG 
VFAAGnAVTFPLAWEHWRKVNI PHWQMA 
HAQGRVflAQNMLAQEARMSTVPYIiWTAM 
FGKSLRYAGYGEGFDDVIIQGDLEELKF 
VAFYTKGDEVIAVASMNYDPIVSKVABV 
LASGRAIRKREVELFVLHSKTGDMSWIiT 
GKGS 


9516 


23417 


A 


9585 


360 


239 


QCRKGLBKLVSFGGLGIWSLIDVLLIGV 
GYUGPVHGSSYV 


9517 


23418 


A 


9586 


160 


1156 


MPRLTFAPKGWPHPPTSLHPGQVTDQTT 
WWLFQELPTPSENSMPPGLSTPTASQEG 
AGPVPDPSQPTREQIRLSSPERQRLSSL 
NLTPDPEMEPPPKPPRSC3ALARHALES 
3FVGWGLPVQSPQALVAMEKEEKESPFS 
SEEEEEDVPLDSDVEQALQTFAKTSGTM 
NNYPTWRRTLLRRAKEEEMKRFCKAQTI 
QRHLNE lEAALRELEAEGVKLELALRRQ 
S S S PEQQKEa.WVGQLLQLVDKKNS LVAE 
EAELMITVQELHLEEKQWQLDQELRGYM 
NEEENLKTAADRQflEDQVLRKLVDLVNQ 
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SEQID 
NO: of 
nudeotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

location 
correspond 
ing to first 

peptide 
sequence 


Predict- 
ed end 

otide 

location 

correspon 

sequence 


Amino acid sequence (A=AIanine 
C^Cysteinc, D^Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














RDALIRFQEERRLSELALGTGAQG 


9518 


23419 


A 


9587 


419 


130 


RRDPHLPRIMGRTRLNRKNAETRINAQL 
PLTDKGRMARHVLDNSGEWSVTKRQVIL 
LHTELERSLEYLPLRCGVLTGLAAIASL 
LYMLTHYLLPYA 


9519 


23420 


A 


9588 


2 


399 


FS CE FNMEANQCPLWEPS YPDLVINVG 
EVTLGEENRKKLQKIQRDQEKERVMRAA 
CALLNSGC3GVIRI/1AKKVEHPVEMGLDLE 
QSLRELIQSSDLQAFFETKQQGRCFYI F 
VKSMSSGPFPEDRSVKPRHV 


9520 


23421 


A 


9589 


19 


205 


LYCLVEWHDLCSLQPLSPGFRQFSCLNL 
LS S WDYRHMPPRPT IKLSS YSS IKL I YY 

LFMLVL 


9521 


23422 


A 


9590 


500 


407 


KLTNEVRSVKRSQLFPESFMCKEKLGNT 
Nil 


9522 


23423 


A 


9591 


727 


236 


RRSRGLEGREALALCPGDGSHLLCRRTD 
SS FS FMAFFFTFMAQLVISI IQAVG IPG 
WGVCGWIATISFFGTNIGSAWMLIPTV 
MFTVMAVFSF lALSMVHKFYRGSGGS FS 
KAQEEWTTQAWKNPHVQQAAQNAAMGAA 
QGAMNQPQTQYSATPNYTYSNEM 


9523 


23424 


A 


9592 


381 


278 


HPSVYKVaSGLKEGLSLFGIIiNRCHCKW 
GEKLLR 


9524 


23425 


A 


9593 


412 


156 


SQRdLsLcSCHfitiLANaVAiiicncfGlPSA 

LLGPPLPGWIYDIAQKYDFSFYICGLLY 
MIGILFLLIQPCIRI IEQSRRKXMX3AH 
V 


9525 


23426 


A 


9594 


263 


50 


KHAAPPASLSLSIiLHHGQKRACFPFAF 
CRDCQLLEGSPAMLPVQPAKLLVLIiEQV 
CLLCALLIPPSGSRI 


9526 


23427 


A 


9595 


432 


309 


GTFSERGPPLEPRSQTVTVDFCQEMTDK 
CTTDEQPRKDYT 


9527 


23428 


A 


9596 


457 


29 


TLIiPGWITAQASEGEIOaSLPSSLPPLL 
YLPSWRCSPSGALTHPAACPTGQRSSSA 
LLPSLPQaPPGFCGMTGIiRPLPSPSVGC 
C3APSPQLLCPTQLPPAPESSEGGC3ES 
RCVAKVKYTRDLGDFLEKCTNGEASALE 
YP 


9528 


23429 


A 


9597 


864 


407 


PSRRLSPTGRTTHGSSAACAPPPPRLER 
RSRTSRPPWGPPHSASARTQRACTTWYS 
RSWACS CSWI TSRPSWMTTKRTCMIFST 
I PTTPRTALWRGLSLWKDAWLLTRLKLA 
PSSPTSMPPTTTCTACSSTWMTCGSPAE 
WASTPMPRSSTT 


9529 


23430 


A 


9598 


421 


145 


LTQHNGDAAASLTVAEQYVSAFSKLAKD 
PNT ILLPSNPGDVTSMVAQAMGVYGALT 
KAPVPGTPDSLSSGSSRDVQGTDASLDE 
ELDRVKMS 


9530 


23431 


A 


9599 


1013 


604 


PGRPTRPDICLLLERMEEQVKNVMKTFR 
EELYNIEKAFEVERQELLASNKKKWEQA 
LQAHHAKELEYLNNRMKKVEDYEKQLNR 
QRIWDCEEYNMIKIKLEQDVQILEQQLQ 
QRKAIYQLNQEKLEYNLAGAEEER 


9531 


23432 


A 


9600 


409 


191 


HHVGQAGLELMTSSDLPSLASQGAGITG 
VSHCAHTVYGLS IVFFSSHRGFASVRTP 
PSNENF PDRP I S I AHP 


9532 


23433 


A 


9601 


367 




PPR YS PPLGGVRGGVP PGPGVLAP PF PK 
GKPPFFFKNPNLPRPGGGAP YS PFFGGV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutaniic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Aspa)-agine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide Insertion 














GRGISFNPGGQGSPGQKKPPGPPRRGTK 
GFFSKKKKKESTKLPVTSHTTHCWFLPE 
ALDLELDPLY 


-9533 


23434 


A 


9602 


1722 


394 


GWNGSWNDNLVDTS PLKRDPLQDI CRR Y 
MEDLKKI CFYRELNSKTTLKFVHTS FHG 
VGHDYVQLAFKVFGFKPPIPVPEQKDPD 
PDFSTVKCPNPEEGESVLELSLRLAEKE 
NARWLATDPDADRLAAZVELQENG CWKV 
FTGNELAALFGWWMFDCWKKl'IKSRNADV 
KNVYMLATTVSSKILKAIALKEGFHFEE 
TLPGFKWIGSRT IDLLENGKEVLFAFEE 
S IGFLCGTSVLDKDGVSAAVWAEMASY 
LETMNI TLKQQLVKVYEKYGYH IS KTSY 
FLCYEPPTIKSIFERLRNFDSPKEYPKF 
CGTFAILHVRDVTTGYDSSQPNKKSVLP 
VSKNSQMITFTFQNGCVATLRTSGTEPK 
I KYYAEMCAS PDQS DTALLE EE LKKL I D 
ALIENFLQPSKNGTGSGRSCLGVPPNTV 
MTLCGAYGNRATRRNCHTLEPCG 


9534 


23433 


A 


9603 


14 


336 


DFVERTQYTHTQTHTHTHSHAVRLSPRP 
VQSRPEASQTGPTRTKPALGPAtlPSPRG 
AP CPDDPGSLRMLGYGSRASHNSRRGRW 
GHGRCKDQRAMGPHSHMWEAALPPGALS 


9535 


23436 


A 


9604 


477 


264 


LVETGFLCVGQAGLELLTSGDPPTSASQ 
SAGITGVSHRTWAALSTFTVLCNYHYLF 
LEVFHHPKLKLSPH 


9536 


23437 


A 


9605 


1 


378 


EGINFSDNLRRCWMVGMPFPNIRSAEL 
QEKMAYLDQTLPEAPGQaPPGKaLVENL 
CMKAVNQS I GRAI RHQKDFAS I VLLDQR 
YARPPVIiAKLPAWIRRRVEVKATFGPAI 

AAVQKFHREKSASS 


9537 




A 


9606 




254 


MVS LTQE L CPVAMRVAEGHNKMLSNVAE 
RVTVPRNFIRGALLEQAGQDIQNKLE 


9538 


23439 


A 


9607 


404 


272 


PPKFLLESGPIRWDWWSLGALMYDMLT 
GAVGAQLKARCI IGLC 


9539 


23440 


A 


9608 


409 


205 


HMNGSLGSGDIDGLPJOISLNNISGISNP 
PGTPRDDGELGGNFLHSFQNDlSrYSPSSL 
PCSWSLFYSLPF 


9540 


23441 


A 


9609 


449 


263 


VWICPCDPPASASQSAGITGLSHHLPKC 
WDYRLEPPRPMCFCFYSFTMRKRIILLS 
LVKYSR 


9541 


23442 


A 


9610 


366 


75 


TPGLKRS3CLSLMSRWGYRIEKKRTIIR 
SWEAIKEQDGREVDWEYFYGLLFTSEN 
LNLVHIVCHKKTTHKLTCDSSSIYYPQT 
RLKRKQPVRKRQ 


9542 


23443 


A 


9611 


1 


801 


PGYCGSWVFTCGALRQLSGGRDLSRGAR 
MGNSALRflHVETAQKTGVFQLKDRGLTE 
FPaDLQKLTSMLRTIDLSMNKIESLPPL 
LIGKFTLLKSLSLNNMKLTVLPDEICITL 
KKLETLSLNNNHLRELPSTFGQLSALKT 
LSLSGNQLGALPPQLCSLRHLDVMDLSK 
NQIRSIPDSVGELQVIELNLNQNQISQI 
SVKIS CCPRLKILRLEENCLELSMLPQS 
ILSDSQICLLAVEGNLB'EIKKLRELEGY 
DKYMERFTATKKKFA 


9543 


23444 


A 


9612 


179 


329 


KGLAFCPPPGRGGGPPPPLWPPPPSYKK 
IFGLKTPKSGALSPPPPPPGNF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9544 


23445 


A 


9613 


410 


298 


PRFTPFSCLSLRSSWDYRCPPPRPANFL 
SLSSVLHGT 


9545 


23446 


A 


9614 




130 


NPGSHCVAQAGLQLLSSGNPSASASQSA 
RITGVSHCAWPNTT I 


9546 


23447 


A 


9615 


1 


146 


NTWRLHMVPRLVSNFWAPGILLSWPPKI 
LGLQKRATTMPGPE I YTS IN 


9547 


23448 


A 


9616 


552 


26 


RLSIRNLLTHAFFAEDTGLRVEIiAEEDD 
CSNSSIAIRLWVEDPKKLKGKRKDNEAI 
EFSFNLETDTPEEVAYEMVKSGFFHESD 
CKAVAKSIRDRVTPIKKTREKKPAGCLK 
EREDSQCKSMGNVFPQPQNTALPLAPAQ 
QTOAECEATSrVDQHVRQCJLIQREPQQHC 
SSVTGDL 


9548 


23449 


A 


9617 


420 


48 


FSQILRASYEFDSPFWDDISESAKDFIR 

HLLERDPQKRFTCQQALRHLWI SGDTAF 

DRDILGSVS 3Q IRIOTFARTHWKRAFNAT 
SFLRHIRKLGQIPEGEGASEQGMARHSH 
SGLR/iGQPPKW 


9549 


23450 




9618 


lyo 


285 


RQGLTMLPWLVSNSWPQWLLPWPLKVL 
GLQV 


9550 


23451 


A 


9619 


392 


308 


SPIKPKIPLSAPRKMTNSVKYRLKFRFG 


9551 


23452 


A 


9620 


166 


23 


FCNPIPSSLPSSLPSFISPFLPCFLTPF 
LPCFLPPFLPFLSFMTLTF 


9552 


23453 


A 


9621 


49 


374 


DRRGIRIMAAALFVPLGFSLLGTHGPSG 
AAGTVFTTA*YLGSKILLTCSLNDSATQ 
VTGHRWLKQRWLKEDS \LPGRKPEFNV 
GPHPPNGDKSSFVFPPPSPCPRLTF 


9553 


23454 


A 


9622 


85 


472 


shvfpplrltltlhSMAvpetrpnhtiy 
innlndkikkdelkkslyaifsqfghnl 
dilvsrilnmmgqafvifkevssatnsl 
rsmqgfpfydkpmriqyaktysyiiakn 
kgtflsc/drklik*tpq 


9554 


23455 


A 


9623 


170 


2 


IFFFFCFERGSCSVA*AGVQWRSYSLL* 
S*TPG\SSNTASATSVAGTTGL\HHHAE 


9555 


23456 




9624 


2 


469 


RRLCSDRGLQRSLGSMSAAVTAGKLARA 
PADPGKAGVPGVAAPGAPAAAPPAKEIP 
EVLEDPRSRRRYVRGRFLGKGGFAKCFE 
ILDADTKEEFAGKIVPKSLLLKPYHR*K 
MFMEIFIHRSLGHQHVL*FHGYF*EHDF 
MSEELEL\CRPRSFLF 


9556 


23457 


A 


9625 


661 


1393 


ASPPPSQTTPAAPGCSAGEVSLAVGLCL 
QERRLVDVAGFSIFIPSRYLDHPQPSKA 
EQDASIPPGTHEALLQTALSPPPPPTRP 
VSPPQKAKEAPNTQAQPISDDEAS/RWG 
RNPG*CS*SGSPHQWPLPHLP*GRL*A* 
PSPCCPG*VG/PYHSPQARRRPITTSIS 
QRSWKFWPKPTVFRETSGGPWAMPRPSM 
PSRAS I SLS PRTRSRKQPRPLTLGCCVW 
HWHTDGERATCGDVRRAHHS PR 


9557 


2345S 


A 


9626 


I 


286 


ptmaklialtlmgmelalfwnhqasyqt 
rlnalrevqpa*lpncnlvkgf* tgsed 
f\eilpngltfissglenpwi*rldp\n 
spgkmllmdlne 


9558 


23459 


A 


9627 


173 


491 


EGPLPLESSSNWGQADLDKKSRELLWKT 
VYLL*LNIQLPYNPVirPILGINPRETHT 
C \ HQEMCTR I F I ATL FTKAKA* K* PRC P 
*AREW/IK*/IWNIHIIEHY 


9339 


23460 


A 


9528 


351 


1 


VliGDAIQSRRGSSRKAGSYLLSRSP/CS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

correspon 
ding to 

peptide 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glufamine, R=Arginine, S=Serine, 
T-Tlireonine,V-VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ESGRGCSCWAWNLGTMSSVASGHGAWVPR 
ELTS*AWPEPGPAGQREGRPPPTPCPPT 
CAHGPPAQP*PRGKAAYPG/CLGLQWPD 
*SAFNP 


9560 


23461 


A 


9629 


599 


11 


KMWR* *H*AKSQIRNAIPFTIAHK/RDL 
KSLKIQLTKEAKDIYNENYKIPL/NEIR 
DVTNKWKIJI P\ CS * I GYLYSTYKYKC I C 
LVNIYL 


9561 


23462 


A 


9630 


226 




KCDNLGYSTFFFFFFPFFFFEIGSYEVA 
Q VGV* WRN YSNS / DASAFQVAGTTGVCH 
HAQLI FNLFVEVGSHYVAQA 


9562 


23463 


A 


9631 


241 




VSGCPLPRG*RPPPQPQSPPPSTPGTSD 
HPCTPCR/PALRR*SP**LPCLDLPPAS 
GPPSTSQNSTaTMQPT*PKLNSSSS 


9563 


23464 


A 


9632 


201 


1027 


mtltpapgqgqqssqclwrrpl/plppg 
pdseepgsgaag/pgssl* spdlppaap 
pkp*tss*emi!itvgstqegsglaqrttq 
hcpvppst/rvdilgtclqpfpalclts 
apwlvrmppsqrssreip*rleaf/slr 
tpipahsaqpshqekep/pspr*dhgd* 
ppp*ppkpthqtlagqaprsgqglarpt 
lpyppspijivtpricrvhv*ghlrrprg 
pppssarrptwdwrvglpwedlewlcr 
gpawllartsapfsslpthactgkkttm 

PF 


9564 


23465 


A 


9633 


103 


379 


YHC IT I I VLKCI FFFFPLKPCLI LS PRL 
EARGIILGH*I!ILCLT\SQLLRRLIRKSP 
LTLEGGVCSDPRSPHCSPAWPTKQNFFS 
KGKKKILD 


9565 


23466 


A 


9634 


155 


405 


VPAFRSvmiSISVSFPFFfiTEP(3FVTQ 
AE|yiQGLDIGNGQPPCPKIiSSFA*ASQVM 
GTTGPRH/HC*7VLIFLVKTGFPEIVAKrW 

F 


9566 


23467 


A 


9635 




423 


GAMRGDRCEGRGGRFGSRGGPRGRFRPF 
VPHIPLDFYLCEMAFERVKPAPDETSFS 
EALLNRNQDLDPNSGEQAS ILSLVTKMT 
YE IDNLTVAPGTLDVLI EERQQGGPYI K 
GTMTT * RH/ S VADLT VI LQ I LTTWE SVA 


9567 


23468 


A 


9637 


2 


492 


GSRAIGFGIiAHELERHLIKIWLASIVR* 
DIA*LMNWSG\SHKDLaGKYRPlLEKAI 
QLSGSEQLQAFKAFEESMVWENVSLVI S 
RQMLTDFCTHLPNLPDSTVKE lYHFTLE 
KIQPIVISFEEQVASIRQHFASIYEKEE 
DWRNAAQVLVGIPL*TGQKQYNVDYKI.E 
TYLKISELYLEDDDPVPAY 


9568 


23469 


A 


9638 


1 


417 


WQGLGLVMGSPSRRLQTKPVITCFKSV 
LLIYTFIFWITGVILLADGIWGKVSLEN 
YFSLLKDKATIVPFI\LIASGSAIILF6 
ILLCIFSV*HYPV*LIjLVTFL1*LLVFL 

NKWWISPHFLILTFNYLLFDIIIDNSS 


9569 


23470 


A 


9639 


125 


451 


lYRKPLKSIRRVGGLHLNKIFFFFLKQI 
WGWARGLMPC*HFGRLKKGDCLTLGIGD 
/SRG/RYDHDPALQPRQQSKTLSLKINK 
IGWAQWLMPGIPAPWEG*AGRSFGAG1 


9570 


23471 


A 


9640 


670 


927 


GHVLLPRRAGCAQPGTGPARAAPAAVQR 
LPGGGVQPSGPGGRAGQGQGLRGT*AGG 
NPGTGF/GGPHASALLPPSEAPGSLLPS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QPR 


9571 


23472 


A 


9641 


115 


460 


ISWIFLIICVGF FGGGGGARVPLCHL S P 
RLECSSAIRARCGFDFPGSSDP*ASAS* 
VAGTT/G/SCHHVQLIF 


9572 


23473 


A 


9642 




431 


AMRAEVLVYCCRGALASGCVQAFRARY \ 
WVLI I VDVPDNEQANAS I IVKLTDSFTE 
QADQLTAEVGKLLGE * KVDAILCVAGGW 
AWGNAKFKSLFKMCDLMRKQSIWTWTIF 
SHLATMHLKEGGLLTLAVAMAVLDGTPG 
MIGD 


9573 


23474 


A 


9643 


154 


408 


IIMNAQSVEEDSILIIPTPDEEEKILRV 
KL*EDSDGKYGSKIPWNHLPEPENFRLL 
FKHVGYLD* SGLY\EILTKF* I LCVLCY 
Y 


9574 


23475 


A 


9644 


3 


403 


YSQFFGGPKLEDPIRPGVFSQPGEHRGH 
PFQ I F* KVGLGNGAEPGFPVLGKVKCKD 
NLKLGGQGCSRVEMHPRTSAWAIERDPV 
LKKKKRQR\EHQCSNEQKHTN 


9575 


23476 


A 


9646 


3 


399 


ARAMVLS PAD KTWGKAAWGKVGAHAGE Y 
GAEALERMFLCFLTTKTYFPHFDLSHGS 
AQVKGHGKKGADSLTNAVAHVDDMPSAL 
VALSDLHAHKLEVGPVNCKLLRH*LLVT 
LGG\HFPAWLTPGGIAFLEKF 


9576 


23477 


A 


9647 


22 


407 


APSAWGMCHFTEEDKATITSLWGKV\NV 
EDAGGETLGRLLWYPWTQRFSDSFGNL 
SCRSAIMGIPKVKAHGKKALTSLGDAI* 
HLDDL*GTFAQLSELHCDKLHVDPENFK 
LLGNVLVTVLAIHFGKE 


9577 


23478 


A 


9648 


243 


2 


QIATTTLGPGSEGVGKRDLGSSSRHHSQ 
VESVKH\PRDLAQDA*GHGSAPRPLCPP 
PAQSWHCLRGPCLAPHHPTLCEPSR 


9578 


23479 


A 


9649 


283 


196 


SYFI*IITTTHSLI\ISIIPLLFFNQIN 
NNLFSCSPTFSSDPLTTPLLILTT*LLP 
LTIMASQRHLSSEPLSRKKLYLSILISL 
QISLIITFTATELIIFYIFFETTLIPTL 
AIITE*GKPTPLIQ 


9579 


23480 


A 


9650 


10 


354 


QLRVI TRLT Wmr* TSA* SRGVSYP* AH 
\SSnaLHL*RELTEQRFIVSVNCaiAHC 
RHQSASEDHVKEKNEVTEFAQTWDADES 
AENCDKSLHTLFGDKICTVATLRETYGE 
MAD 


9580 


23481 


A 


9651 


2 


404 


AFGTTKWVTYISVLFLCSSAYSRGVLRL 
DAHKSEAAHRFKDL6EENFKALALIAFA 
QYLQQCP F\ EDHVDLVHEVTEFAKTCVA 
DESAENWDKI \nHTLFGY* LCAVACFRE 
T\YGETAAC*AKHEPVEDE\CFLR 


9581 


23482 


A 


9652 


83 


396 


NIPGVGGGGVPIPLVWKYNISHTQR*RE 
PYIQSLRSGNWSERMSHMVail*RQRI-IHP 

yyi \llqaqlsma\ sdpmivmmdsvita 
gvlva/shrarrissclghri 


9582 


23483 


A 


9653 


67 


954 


REGNHMTERNCKRPPQDTGPTQ/RPGPP 
PPEVPWQDASSAPTPAAaPVGKLLPS*G 
PPGSaSCQGR*TLGSVHWL*GSPLPSPS 
AGGRALPYGPAQGQSQOnfENTGRGTPL 
PGSRET*GHELTTGVEHQRHKAPREPQA 
GQPRLCPWVQSLPPT\ARSPSWSQGERP 


9583 


23484 


A 


9654 


1 


252 


RPPQLLASDSLPPSRPPSFPPSLLLSLR 
PPLSRSLCLSHYLCPPLSLIQQIiTSFPD 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

location 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienylaIanine, 
G=Glycine, H=Histidine, I=:Isoleucine, 
K=Lysine, Ij=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonlne, V=Valiiie, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














TFSPPKSTTSGPPQPEDSPSS*PPTPSP 
P\PPSLLSSLPPSLSSAPPF/YVHSVSP 
TISAPLYP*YNS*PHFPIPFPPRKVQHL 
ARPSPKTAEPPWTIRRI 


9584 


23485 


A 


9655 


1 


394 


SLRIYCFVIEVICFLYLL/FIMYSIS*G 
I*LLMNFTILCIGY*LLQYFFFVWKSF 
YFNFIRIPFILFLYLICLLVF*LIFFDL 
SYF/MCLRLI IFNLFY*LFLLYS ICLWN 
MVLLLI SKLF*F*FCGRFALIQ 


9585 


23486 


A 


9656 


122 


371 


SP YLLLLQGGVSWI VWLLM VESTKSSFL 
FPETGSHCAVQGGVQWCGRGSLQS*TPG 
/SQS *VAGITGMCHHAS*LEKFFFVEMG 


9586 


23487 


A 


9657 


425 


3 


KTSMMNS I VPHI S I PTLNVWGLNVPLKR 
YRIAE*IKIHQPSICCLQETHPTHKDSH 
KLKVKGWE * I FHfiNG\QGVAlLI SDK/T 
DFKATTVK/RDKEGHYI I IKGLVQQENV 
TVIJJiCaPWTGaLKFIK\QLLLDLRlJEI 

mm 


9587 


23488 


A 


9658 


179 


373 


VHSWLLWGSQSFVLFRPLTD*MRLTH/ 1 
MKGNLLESKSTSLSVKLIQKHPHRNl QN 
V/iiQNIWVSWNS 


9588 


23489 


A 


9659 


103 


373 


LKHVLRLCLQSERKLHYLFNIDQWGKFY 
TWDDLYECISSKFTTHFFFSFL/FFSF* 
SQGLALSPRUDCSDAIIAHCILEFLGSS 
HAPTSAS 


9589 


23490 


A 


9660 


216 


424 


TDINVTTKTIKPLDENIGITLYDLGLGS 
GFLDMT*T/AQVANEKIG*WDFIRNLKL 
TC IGHYHEVKKTNPW 


9590 


23491 


A 


9661 


276 


2 


PRIGKLPTNMIEPLIIKEMLINLAMRFH 
LSPVRMAI / IKIKKSNHCW*GCREKGI \ 
IYHYWWECKLVQPL/WKSSLK/FIK*LP 
IDLSFVPAIPLL 


9591 


23492 


A 


9662 


239 


356 


ICIKRRK/WSGTVAHTCNPSSLGGQGGR 
IA*VQEFQPGQQ 


9592 


23493 


A 


9663 


1 


340 


RHELMTPHAFGAMKRVTFISLLFLFNLA 
YSRGVISRDaHRSEDAHPIKDLV*ETFK 
ALVMIAFAQHLQQRPFEDHV*LLNE\VT 
EFAKTCVAEESAENCDKSLHTLFGDK* C 

TV 


9593 


23494 




9664 


759 


1039 


KRI PFGRPRRVDHLRSGVRDQPGLHDET 
PS\LK1QKLRGQGGICL*SQLIiRRLR*E 
NHlJSrPGSRGCSEPRSCHCTPAWATE*DS 
ISKKKNTKIS 


9594 


23495 


A 


9665 




355 


ITDLYSMFHFG*VDLATITSLWGKGNEQ 
KAGR*TLERLLDVYPWTHRFFDSFGNLT 
SASAIMGNPIVKAHGKKVLTSLGDAIMH 
LDD/LKGTFAQLSELHCDTLHVDPENFH 
ENPCDGS 


9595 


23496 


A 


9666 


364 


3 


APNIPPSVTRIPALLAFRSQCLCGSCPL 
PLATEPVGLLLSMSFQAAPCFTWWVTEA 
WLTGISWPPDFLISLLCI/HIKYNAVIF 
RKYRPRVGAVAHICNPSTLGG*GGWIA* 
AQEFETSLGN 


9596 


23497 


A 


9667 


1 


357 


P I SNAMRHLGQEETATITSLWGKVNVED 
AG*ETLaRLLVIYPWTHRLFDSFGHLSS 
ASAIMGHP*V\KAHVKKALTSLGDAIKH 
LDDLKGTFAQLIELHCDKLFVDPENFKL 
LGJStVLAT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

correspond 
ing to first 
amino acid 
residue of 

sequence 

( 


Predict- 
ed end 

location 
correspon 

last^amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIefliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine,V=Valine, 
W=Tryptophan, Y=Tyrosfne, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9597 


23498 




9668 


24 


369 


aprpdamghfte ed kat its lwgkgnve 
daggetlarllgvypwtqrlfdsfgnls 
sasaimgnpkvkahgkkvltslgdai*h 
lddlkgtfaq\lselh*dklhvdpenfk 

LLGK 


9598 


23499 




9669 


12 


365 


LLLMDGERK* FLETDSAPCEDAMNTVEM 
TTKNLE* S INLVDKAVA/SSFERIGFNF 
EKSSTLGKMPSNSIACCREIFHERKS/S 
MWQTSFLSYFKKLRQA/PPAATTLNDHQ 
PSALRQDPSP 


9599 


23500 


A 


9670 


346 


2 


YPYDPWSLINKMFFLFKEFQNFFLKEGT 
LTFPFKGFRGDLSLFFWPPYKLFNLKSF 
L/CKFLE I * RYFFG*I IFFFPFFFFFFL 
RQSLTLSPRLECNGTISflHCNLCIiPD*S 

NSP 


9600 


23501 


A 


9671 


121 


364 


HKKKTAGRVQWPLLVIPflLWEAKEADCL 
S SGVQGQPGQHSKTPSLP* PS KSWDYRC 
TPPHLASFCNFFSFFFGRDGVLLCCPGW 
P*TPELKQSA/CLGLPKCWDY/RARATA 
PaLLFF\FMLKYVLVKPAVNIflNJ«J* KE 
LSEPDVKNEICFV 


9601 


23502 


A 


9672 


437 


223 


EME/FSLLLPGLECNGTCLflH*NIjRLLG 
SSDFPASASPVAGITGVRLHAQLELYFL 
NLLGFFFI ISLWYLS 


9602 


23503 


A 


9673 


100 


338 


KSRPSAVANLTPVIPALWEAERGGSPEL 
KQFSLSLLSSRnHRFATPC/RG* FFIFF 
VEMEFHCVAQCGFELUDSSSPPTSA 


9603 


23504 


A 


9674 


161 




SFLWKLCLRGAPSCMRCQLA\LL*DVSQ 
LGY/SGVRDPLEEAVCPFSDLKLRAGR 


9604 


23505 


A 


9675 


304 


■ 


VPPLASQLGDKRRIFFQKKKKKKKNKDK 
IHIIISirLKKEDKI*YSLIIK\TL*KL 
GME*TYLNI IKVI YDRPTAS I ILSGEKL 
KSFPLKSGR*QECPLL 


9605 


23506 


A 


9676 


111 


346 


SKEGSKEGSLKIWGFLVLAFPFPFPFPF 
SFFFF\*RQAGRPRLECRSMITAHCNLE 
LLGSSDPPASAS\QVAG'ITGPTSP 


9606 


23507 


A 


9677 


3 


349 


ARAGRIIKDLWLGFFFFFFFFYPKKTPL 
FTRDKGGGKLQTAKKNNPHFVKREKKKP 
PPGGTRGGKNPPGNP FWGFGPQR FFFGG 
KTFGGGAHPPDPTLG/PSPQS *KRPFGK 
RGPPP 


9607 


23508 


A 


9678 


38 


368 


QASLLKESESERKSPTSLTLSQKLGMIR 
LCEEGMSKAKAD* RLGLLH/QVSQLVNT 
KEKLFKE IKSAISVNTLMIRK/RNSLTA 
DMDKVWAWIEDLSSQNNPLSQSLIHSK 

A 


9608 


23509 


A 


9679 


187 


345 


PFI LFFETES YSVA*AEMQWCDLGSLQT 
PPPG\SSNPPVSALWEARAGGSRSQ 


9609 


23510 


A 


9680 


192 


468 


ERTSLFTIHLLKIiLTITNSAaMSNHV*V 

• Fv* T* IFLSLGQMPKSVTAGSNGQSLFR 
FFK\NCQTLFH6GCAMLH/ SHTQUiTAI P 
LFRNLWQHLS 


9610 


23511 


A 


9681 


141 


367 


GSWAASPCSCTMDDFAKESFTWDYVLL 
ENCPNMGDYWAPQFMTDNYVRVTQLNW 
DGVGTQ * KD YI /SS EENL 


9611 


23512 


A 


9682 


193 


3 


IFGERR*ILKIYKLTGHVGLRL*SQILR 
HI /RLNLGGGGCSEPRSCHCTPWATER 
DSISEKKEK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

peptide 


Predict- 
ed end 
nucle- 

correspon 

acid 

residue of 
sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


9612 


23513 


A 


9683 


1900 


1988 


NLVHFEEHFPSLFTPSPSLCHIF/CSRD 
RVSPY*PGWSRSPDL/RYLP/CLDLPKC ' 
WDYRRES PCPAS SRFLMKFLYF *NTTLW 
Q*KTIPLLF*SGDC/S*FF*YNFLaHTS 
STMWSGISRNSRPVFKLADKAPSHSPL 
SILAFGVEFF/NLGSLQPQPLKLK*ASH 
LNCLSSWDNRHaPPCPDNFFFNIL*RWG 
FPILSRMVLNSW/PLCSLSTSASQSSG/ 
ITGVSHCAGPILVF 


9613 


23514 


A 


9684 


69 


358 


ILIVKVFFPHTQKPNFVLRLSLQSIQQM 
LLPERKWTHAQKTCVSSENTERCHEKQV 
ITSHWLGMVAHSCNPSTLGG6G6WIT*G 
QVFET/SLANTVKP 


9614 


23515 


A 


9685 


66 


344 


I GE KLLLTDKHRlIWVtiETE *APGKDSVN 
IDEMT/TR/DLEYYIN\TDKAVTGFERT 
DFNFKSSIRGQSL*YSII/CAKGSQQKQ 
QISLVPYFKKLPQ 


9615 


23516 


A 


9686 


269 




PKIQNGHCKTLPKNPSQSDTKTFPLFTS 
GSSPQPHKLSFCFSAGPTLPSKTQLKIH 
L\WQGTVPHAYNPSTLGSRGGRiT*GQE 
FKTIQ 


9616 


23517 


A 


9687 


229 


361 


DRYYYKSHFKPGAEAHAC* LVLAS FFMC 
YRAIMINK*HLYHRKL/VYQVIRH*KYR 
D/ I *THVTLTTTL*DRYYYKSHFKPGAE 
AHACNPSNLGG*GG* II*GQEFKTSLAK 


9617 


23518 


A 


9688 


318 


46 


KliYALNDMASNTCKEYDNLIQYYRYKIY 
KRLIQHD*VEFIPEiyiE/NWFNI * /RKS I 
NVIHYIHSLWEEKNYMI ISLDVAEVSDK 
IKLTFLIF 


9618 


23519 


A 


9689 


168 


3 


GNSNTMIFFTILYLFFLYFF/LFFIFDM 
KSHSVARLECSGVILAHC3SrV*LPGSSNS 
P 


9619 


23520 


A 


9690 


192 


3 


WHKASLSNPQPAGCMQPHMALHEAQHTF 
VNFLKTLWAQ\PVAHAYNPNTLRGRGGR 
IT*GQEFK 


9620 


23521 


A 


9691 


388 


1 


CRSAevC!WRSTPDPVCl.GITSSGCRTAE 
IVACSFLWKLHSRGAPARCQPELSCMRC 
\R* TLL* GVSQSGGTGVHDPLKEAVCPL 
AELENGARRSTALFRASRQDSLSLLKLR 
PQLPLLPGALSQIEGI 


9621 


23522 


A 


9692 


226 


3 


WFLLPQVKLAIMTPRLYFPNLPFFFFF * 
DRV/CTVAQAGVQWRDLSSLLESLLGSS 
SSPTSASRLTGTIILQHHAE 


9622 


23523 


A 


9693 


305 


I 


NHAITTVN* FGLIRHLVTKAAFNSGEVD 
I V/ S I KDPL lALNYTVYMFHHDSTHGKF 
RGTVK 


9623 


23524 




9694 


320 


I 


PRDPPAS ASQSAG ITGI SHQAWPDM IL Y 
lENPKDSSKNPLGLINKYSKVAGYKINT 
QKSAAFL*TNNYI,KN*P/MRTIPFTIAT 
KKKYLETYLTMEVKDLYTENYKM 


9624 


23525 


A 


9695 


263 


1 


VR I LL FS ES IiALS PRLE CS 6V I SAHCTL 
CLPGSRGALSLLSRVDY/Y*VFLVKAGF 
RHV/GQAGLEFLSSGDPTVPSLPDFWDY 
RCKPR 


9625 


23526 


A 


9696 


262 


359 


I RK IHKI WS G WAHTCNPS TLGGR* TKA 
HEGSLYCLRVGKIVSNKVGTRS* FFRTQ 
K*V/HLFFS*VYHINGSH*SRTSL* IRK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 

residue of 
sequence 


Predict- 
ed end 

location 
ding to'' 

residue of 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, I^lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














IHKIWSGWAHTCNPSTLGC3RVGWIT*G 
Q 


9626 


23527 


A 


9697 


166 


473 


TQDTSQPWKNIWISFGSLKVFENVDIHK 
LYLELDNRHRSWMTDILLFFYF*DGSFT 
VT\RLECNGAILAHCNLHLPGSSNSPAS 
AS*VAEVGGTLEPRRLRLQ 


9627 


2352S 


A 


9698 


456 


59 


LFLWKLHLRGAPGYVRCRliA\LLGGASQ 
LSYSGVEDPLEEAVCPFSDLKLHAGRTT 
TLFKAVRYGHLCLQKFLLPFTOLCPAPT 
GGVFTVRQASMSCDGLLIV* \VSERH*L 
PKPQQFWELVPKFVRPGLKSL 


9628 


23529 


A 


9699 




375 


VFCSSVTIVSEIAFLMWLSAQLLLVYRN 
ASDFCMSSLYPETSLTKVTQGQAL/H*N 
G*LT**EQYWVKTDQE®VI/CQQSTGSS 

GQSSQAREIKGPGAVAHTCNPSTLGGRG 

RWIT+GQEFETSLA 


9629 


23530 




9700 


239 


361 


R:TOTFFFFETE3NSVA\RLQCTGVTLAH 
*NLCLPGSSNSPA 


9630 


23531 




9701 


169 


1 


I FYWKAI FFETESHS VAYAGVQ * SNLGS 
LQLPSS\GSSDSPISASQEA6TTnaHH 


9631 


23532 


A 


9702 


37 


348 


HAKNFDVSLLPYTKTHEHFVKEDIWIAN 
KHVKRCSATLVTKE I PPQS /TLKWLKH* 
PPC*QGNGRTGSPIP/C/WWECKIVQLW 
KIV*PFPKKLS lYL/Py/DPPILLLGI 


9632 


23533 


A 


9703 


16 


339 


ARLNTFAMAaKC*SERKS/R3Sr/SLTIiNH 
KLQMIKLIEEGMLKAVHGQKLRLLYQ/T 
SQFANAKDELLMD 1 1 SATPATV* MI G* Q 
NSLIADLEKVRW*LDDQ/TSHISriHLSH 


9633 


23534 


A 


9704 


276 


2 


GISRGGLSKPPFFP*DPPLDYLGQSPFC 
KVKRNS/RFRTTKSPLDFPIPPLFFFFF 
EKVSQAISAHCKLHLPG*CHSPASASRV 
AGTTGTRH 


9634 


23535 


A 


9705 


352 


2 


CNSRSDFFQIMMNCFtRWAVSPRVECS/ 
GVITAHCNLNLPDSVDPPTSTS*VAGTT 
GHTP\HTWLII I I I I VAGI FLFLFFLFS 
FFGRDGVSPCCPSWSQTSEFRQSACLSL 
PKCiTOY 


9635 


23536 


A 


9706 


6 


351 


IWNSRPRRPLGGQGGRIQDWPKLHGKTR 
SL/LKKLQKLARRGRASLWSQVLKRLRL 
TWVDCLSLGG*GCSQP*SH/IHCSPAW 


9636 


23537 


A 


9707 


76 


359 


CFLFVeLDFVIFLA'!fFCN*YLFPFLISH 
TFFFFLKKGLIFAPGVNLRDQNKFYFTL 
FDVFFNIYYC*IFFIF/CLVLLFFIPPF 

FFYCFWRFIVL 


9637 


23538 


A 


9708 


342 


3 


GEELLLMDEQSKWFLE\TPGENATNIVD 
LTTKDLEYSISWDKARaGFERIGSN*N 
STLELKTVN/SVPHAK* IFCKRLQSVWQ 
TSLLPILRN\LP* SPHPLVSAMLI IQQS 
STLK 


9638 


23539 


A 


9709 


211 


350 


RQDI ALLPRLE *SGT ITTH/ CDPQLSGS 
SEPPTSASLVAGTTGACHYV 


9639 


23540 


A 


9710 


253 


3 


VKCFSREHMLIRRSAIGGKKHIEKGTLI 
MSFWWYVSQSEH/NLCNHYAY\SHIKK 
VWLGTVADVYNPS TLGGQGGR IT * GQE F 
E 


9640 


23541 


A 


9711 


80 


324 


LITLTMASMCSNERKSHTSLTSNQKLEM 
S KPTEEGMLKAKTS * KLGLFHQT /SQLV 
KTKKKFLEEIKSTTPLNaPLiyiRK*NSL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9641 


23542 


A 


9712 


246 


3 


QLAKHGLLFFLFFEMGSH *LRLECSL\T 
ALLSKISAHCNLNLPGP\SDPPTSASQV 
AGTTGACLLAQLIFKNFWETGSHYVA 


9642 


23543 


A 


9713 


2 


359 


AAASTTIKEQDMLKFYFLFYFFETESHP 
VTQVGVQWCNLCSLQPPPP\GPSDSPAW 
VTEQDS VS KE * KRKKL IKINVYLQVTTH 
P*HSPQHITOVLSVIiN*LNEKINRRVTK 
SSSKEKDR 


9643 


23544 


A 


9714 


278 


53 


KLFRGLNNHRKFQTRCGGLFFFLFIYF* 
DGA/YALSLLECSGTIMAHHSLDILGSS 
HP P I SAS LAfl\GWRTCS PS C 


9644 


23545 


A 


9715 


166 


402 


TL S L KKNENISRAWWHVPWSGVWKAEA 
R*SLVPRSLRL\HCTPAWA 


9645 


23546 


A 


9716 


168 




VD CI IMRRS lAKSPRRKCKGTISARCKL 
RLSG\SGHSPASAS*EaGVTGVGHDPR 


9646 


23547 


A 


9717 


1 


553 


TS IRLPFIiSIFPPSRSENRILPKIYCai 
GVEAKGRGGGWGTCRGHRNTMTWPGPQL 
LLSWGRGMKRQDQGWSWGGRGEGDTGCI 

PPPPGSTSRPWTPRSPWPLRLHLFSASG 

PPG3SSCSP*LFPLTDSQVGRCSGPDPN 
PPYHLMKVQPLPALLT.PLLKPHRFPSLG 
Q\PDPPPSSPPSP 


9647 


23548 




9718 


13 


364 


PPNR*RNPLIELTNHSLIYLPTPS\GIS 
A**NFGSLLGACLILQITTGLFLAMHYS 
PIRLQLAFSS lAHITRDVNYG* IIRYUI 
ANGASIFFICLFLHIGEGLYYGSFLYSQ 
T*NIGIILLIiAT 


9648 


23549 


A 


9719 


82 


351 


GEALEPELPMPALSH/HAATIiAPLPTT 
PFVGTK7TIVGQAQW/HQMi*SQHFERP 
RRVDYIBLFSRNGQNPISTKNTRISWAW 
WRVPVIPA 


9649 


23550 


A 


9720 


179 


365 


FSLFLASLKAQIIDSKPFFSWTLEWI*K 
YEVnSrFFLYI \ IKLHFMNLTSLCFYCHS 
VCTILL 


9630 


23551 


A 


9721 


2 


320 


PIPPGNESSPPTAQEDMOAAHKH/IRRY 
STSLAIRET/QL/KTTTREPYAPITMAT 
TETSaNTTCWREYGESGSYCWVGCKTLR 
PLCKTVWQFLKKPSM*LAYDPAITLF 


9651 


23552 


A 


9722 


■ 


344 


PLPTQNWPQGQSFLHYPVLLPAVSVLWS 
SFFFFFWKQILLLPPRLEG/NGQNSG*W 
KFPLPGPSLFSCPS FQTSGNYGPPQQAR 
AIFWKFKIKTGFHGVTREGLNFLTSGSA 
PLGS 


9652 


23553 


A 


9723 


156 


3 


VTEAYFILFYFETESHSV*WHDLGSLKS 
PPPG\SSNSPASASQVAGTTGALH 


9653 


23554 


A 


9724 


76 


337 


YLS lYLS lYLAIYVS I I YLP lYHPSIS I 
YVSS lYS INHLPLYYFSTHLSSTHSS I Y 
LLSISILSINYLLLSISLSIYLSI*LSI 
YDSSIHLSIHLSHLYIYLI*YLSIYLSI 
YLAIYVSI\TIYLSIIHLSLSMYHLFIL 
SIIYLCIIFLPIY/LSTHSSIYLLSISI 
LSINY/HTSIYLSIYLFVY/HLYIYHPT 


9654 


23555 


A 


9725 


249 


436 


BLAG* IQDHP/GQHHKTLSLQQIKNLAR 
RGGRHL* SELFRLRQEDHLTPGVRGCSE 
L.*SHHSTP 


9655 


23556 


A 


9726 


177 


3 


HLN**FSNLIFFETESPSVT\RGIISAH 
RNPRLPGSSDSPTSASRVAGTTDT 


9656 


23557 


A 


9727 


272 


233 


RKNQRIYQIARKRLWEMARISPLRSMI I 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

peptide 


Piedict- 
nucle- 

correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T^Tltreonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LNVSGLNFPLKRCRLAEWTEKKK\DPII 
CCL/QKTHFTG/KDIYRLKIKGWKKIFH 

ANGSQ*Q 


9657 


23538 


A 


9728 


368 


380 


RKGQRI SE lAIKRIiNKMARI SPLIS 1 1 1 
LNVSGLNFPLKRYRRAEWTEKKK\DPII 
CCL/QKTHFTG/KDIYRLKIKGWKKIFH 
TNGSQ*RR 


9658 


23559 


A 


9729 


294 


2 


KGNLSPKKKKNLKNFKVTPQNFGLKDQK 
TQGKPTLKVSQNQKKPKFK*NLKGKRPL 
KLFPKKKKLNPSRSWFYEKKKK\INJCLU 
GTLVRQINKKEKNQ 


9659 


23560 


A 


9730 


7 


284 


SQGLGHLHRENYL* FETESHSVARTGVQ 
WRNLSSLQSPPPG\SSNSPASASQVPVI 
T 


9660 


23561 


A 


9731 




322 


RSLKKKKREEKKGKNQPGQHGKTLFLPK 

TPKI*KIKKIYPG/RGGTHr,NSQLLRNL 
RKENHLNSGGKGCNDPKLGHCIPAQMTK 
TPFKKKGGAKQNGKKLLQLIYGIK 


9661 


23562 


A 


9732 


168 


2 


GFYRKSLSCLSVWICFKDFFFFFEMVSR 
SVA*AGVQVSIHDLGSMRPPPPG\SGDCPA 


9662 


23563 


A 


9733 


43 


317 


LKLQPRDTHCIWLSTTSPFIYFFFETES 
CSVAQA*MQWHDLGSLQPPSPG\SGDSP 
ASASCTGMHHHTWLMKCINSFDVPDMPL 
GLGLNSHI 


9663 


23564 




9734 


8 


296 


NVVSACSLNGLVKGHLSCLTUSTQIL+MI 
TLTTERFLKAERGQKLGLLHQTA/QWII 
/AKEIFLNEVSSATPVYEGIIRKLNSLI 
ATMEKACWWIEDQT 


9664 


23565 


A 


9735 


3 


300 


TRPSKTSPAALSKCSSHRKT/R/TCLTUJ 
QKLEMIKLSKEGILKAERGQKLGLLHQT 
A/QWN/AKEKFLKEVRSATPVYEGI IR 
KQNSLIANMEKA+WWIEDQ 


9665 


23566 


A 


9736 


352 


1 


PPFYLLNPPTFFKGPPLIHFFLESFLEN 
FGFQGAFFKLQNFFAFLFSLIiNGPFLEN 
PFYFKFPIErC\KFLGQHTFYCPYLGFF* 
RS FFFFFFFFETESRPVA\RLECSGAIS 
AHCKLG 


9666 


23567 


A 


9737 


130 


3 


KIFFFFFFFPLRQSLSYaOAGVQWC/N* 
LGSLQPPPAGLKRFLV 


9667 


23568 


A 


9738 


167 


3 


ISRSVFCFFFETESCSVTQV*EQWRNIG 
SHRPQPP\GSSNYCASACRVARFTGLV 


9668 


23569 


A 


9739 


140 


419 


iisivkylsrhffkgniqspnkhrkairei 
qis\*tdnnkcw*gcgevqyfihccmce 
mvhslwkivw*lltgl/hielsddpail 
lggippeemkt 


9669 


23570 


A 


9740 


93 


371 


KLCNGYYQESKREPTERN*1FAHHKTKG 
CISKTYKELQLNNKNS/NPTQKWAKDLD 
ISPQTMKQTAKNQVKRCSTSLVIREMQI 
YTKSIPLHIH 


9670 


23571 


A 


9741 


1 


288 


GTRDHLRSGVRDQ\PGQHGETRSLLKIQ 
N*GCGHGGSCLSSQ \LTWRLRQVNCLS 


9671 


23572 


A 


9742 


296 


563 


FIKHLNSYMLGQNIiATKLSEICKLPFYI 
E K * IQAG YGGTCL * S QLLRHLHHDITCLN 
LGGGGCSEPKI\HCCTPAWATDGGSVSK 
KKKSRR 


9672 


23573 


A 


9743 


32 


400 


DALVPHWSSYSPICLDLNVLGIYLFSDR 
VSLLVPRLECSDAIMAQHSLDLPRLR*S 
SHLSLPRNWLMFVILVETGFCHVA*DGR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

Tocation 
correspon 

last^ainino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible • 
nucleotide insertion 














KLLGSSNPPALA/FLKC/WHITMRKSQL* 
PLLLGLLSADHSD 


9673 


23574 


A 


9744 


60 


356 


HSEVDSIIIPILQGRKQISEATRPQRMCE 
REGGLFISETLLFLCPAPSCS/HFLTPA 
Q * S YF * KNVSWLGAVGDTCNPS TLGGRG 
GWIVRGQEIKTILAWMV 


9674 


23575 


A 


9745 


2 


446 


PMVNKHVKRCSASLVIRAEQIKTIMRSH 
CTPNS *M*KTDNTKSW/*GCGaTGTPIH 
C/WMKYKMVQPLWKKAQQFL\KHIHLPY 
DPINLLLGIS*EK*IIHLC/YKKTYVRIF 
lAALF 


9675 


23576 


A 


9746 


257 




VWLRLSFPSFLLPILSPSSQGDLVCCFL 
MVF*RDGVSLPIPiM3VQWLDRSSL*P/P 
SPGSSSPPASAS*VAGTTGlSaiLHREPYA 
SC 


9676 


23577 


A 


9747 


148 


3 


I KLCEQPHK*TIHVLKHK/ L/ WPGMVAH 
AYNPSTLGGHGEWIT*GQEFDSC 


9677 


23578 


A 


9748 


769 


1 


YPQLFKIAKKKKKSHVPIKS IMVSQYGH 
*ENKKWVLIIHKQATLQ/EGKGNFGDTFK 
VTLKDKIAVAVKTRQERLPQELKLKFLL 
EAKILKQYNHPSHVKS IGVHTQRRPl YV 
IKELVPGGDFLSFQRKKNELKL\VKFSL 
DADSGWCISSKKCIHRDLAVRNCL/VGE 
NHVLKISDFGMSRQEDGGVYSSSDLKQI 
PIKWTAPEALNYGRYHSESDARSFGILL 
WETFGIiG\VCP*PGMT]SrQQPQEQVERRY 
WMSVSWQC 


9678 


23579 


A 


9749 


607 


986 


SFSSSSSPEFISYGCDLLPMRDPHVFIT 
HPAFNLPLFIALGMFFSLFSPSPLFIYF 
LPPPPHPPC*HSPP\PPPPLTNSLPVFL 
PPPPGVPSSPPNTSPPHLPVPHPPPSPP 
TSTHLSSPPPPSPTH 


9679 


23580 


A 


9750 


313 


339 


AR*GLSCSG*S*TPGFMQSSCLNLPKCW 
DYRC/RATVPSLCFVF 


9680 


23581 


- 


9751 


213 


453 


NLAVNVDEINLFFFPLFLIVTLASKFCS 
/HLLVGR*FIQSCHYFLFFIFDTESRSV 
TXRAEMSATVLAHYMLCLPGSPDSSA 


9681 


23582 


A 


9752 


368 


3 


PLSSPRLPFPLFDPSYIELISISSIRFL 
SPPCYFLCVFLS ISWGFSLVSQDGSDLL 
SS*SAFLGLPKCWDYHL*PPHPAWRELS 
FFFS/SFFYriR*GSLSPSLECSGMIIGH 
CSLDILVSSAEA 


9682 


23583 


A 


9753 


2 


335 


ARAGFSGGDWNVGMQSTASGLSLFHCFI 
S *GELPI ILYILLKCYITL* IN* LMFGS 
FYFRDLQHFRLP/TS IVGKKRPGTVAHA 
YNPNTLRGRGGWITQGQEFETTLMIMVK 


9683 


23584 




9754 


467 


120 


SQLFGRPRQTDHLKSGV*DQPG\QHCET 
PSPLKIQKLAGRGWV/CAP IGPRLLGRP 
R*ENGIJ!ISGGEVCSEPRSRHCTPAWVRE 
RDCa:,KNQNKTKSG VL FHSPHQCFLISNT 
SRNLL 


9684 


23585 


A 


9755 


137 


368 


DGVYLWTHRPYCGLGSLNFGSVI IVLP * 
VKAYGWMVLTSLGDAIQPLADPECSF\G 
QLRELRCDMLHVDPEDFRLLGK 


9685 


23586 


A 


9756 


172 


267 


S KSGNKPNDYHLMNGSVRWCIRTV* QLL 
KRLTTELPYGPAVAFLGVYAREMJCAYVH 

mktyslqkmct*if i aalfg i vkkwk* a 
k*lspdewisk/m/wcirtveyyltire 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

location 

ing to first 
residue of 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=po5sible 
nucleotide insertion 


9686 


23587 




9757 


56 


318 


QFWGFSFFFFFFCFIDTETCSVSQAGVQ 
WHNQSWLTA\P*SDPPAS\AS*VAETIG 
MCHHDQLI FFFFKQALCPRGWSJySAQSQ 


9687 


23588 


A 


9758 


179 


1 


KDTDRVSRGNTQDTLKPPPPG\SSDSPT 
SAS * VARIIGVHHHTWLIFCILVEMRFH 
PRA 


9688 


23589 


A 


9759 


219 


208 


NFCNPQMPLYQKILFWVRVSLLSPRLEC 
SGTIITYCSL/RTPKLK*SS*LSLGVQG 
CSKL 


9689 


23590 


A 


9760 


133 


371 


AMAYQLYENTTLGNSLQESLDELIQ\LR 
NRVNFRGSUSTTYHFCDNVRTFVUSDVEF 
REVTELI *VDKVKI VSr.\DGKQTGF 


9690 


23591 


A 


9761 


813 


3 


CGEGWAAGQDQGSRMGSCLPVPRAGRQL 
GSLFPAFQVYCS SCGTQRSSQSQWHKTE 
RSLPGNNPAPQDS/ASAPKYQTPGPVLG 
VQSPLNQQSCHTALSPEASSSAFPHTTP 
ATLLHQAETLSLTQGSRPTTWPVLLKMP 
A*RPRESRH\GR*EWTKHSQftRSQEAIN 
ARPRKALGIiSDPAPSLHPGSLEDSPKFS 
LGAPKAPPLPASSPTAQETQQSHISQQP 
TATGFLGflKLCPSPQHICQRGGFPGVTP 
LLLLGEWQVGEHSFPDVLV 


9691 


23592 


A 


9762 


96 


377 


RYHTNMAAQIPESDQIKQTGFHHVGC5AG 
LELLTSXFKEFLGTYNKLTET+FIiDCAK 
DFTTREHKSEETTLSEHCLPKYSTMT\H 
RISLT\FTDNHI 


9692 


23593 


A 


9763 


164 


399 


TDEElJiLRDEQ/RKHFLKIESTAGADAV 
NIVEMTTGDLE* CINLVDKVAARLERTE 
TNFE/RCSTVRQ/MLSKSIACCTE 


9693 


23594 


A 


9764 


225 


25 


KCKITMNST* IE/TVFKDYYEHLYVCKL 
KNLEKTDKVLTIHNFLRLNQ *EPEVLTR 
TIKMYKIYMDIE 


9694 


23595 


A 


9765 


128 


218 


FFKDLPDCFPKWSHCFTLLIAVYEGFTF 
ST*LVIREMQIKTP\l*yHFSPNRMVIM 
KKSNINKCWQGRGESKALIHC* *ECKTV 
* PLWKTVWQI LKKLI IG 


9695 


23596 


A 


9766 


194 


551 


KNFFFLEMEF/SVLLPRLSCNGVISAHR 
RLRLPLSSYSPASSSQVaSDYRACTTTA 
G*ILYF**RQGFHHVGQaGLELPTSGDP 
PASASQSAGITGVSHCPQLKKSILHETP 
KGLTGVTS 


9696 


23597 


A 


9767 


238 


3 


S FLWKLCPRGAPACLRCLSA\LLGGVS * 
SGYMGVRDPLEEAVCPFSELEHHAERTT 
ALFEAVPQGCLSLQKLSAAFCSC 


9697 


23598 


A 


9768 


343 


1 


RGAHH3RGDCVRFTGFCAPIPVLNHPKP 
LFPFFV*KSPIERflSLLGPPLKRSKNRF 
PFLR/VPRFFKNHPEFFFKKIFLFPPFF 
FFFEMESHSLAXRlESHGAILSHCailLSL 


9698 


23599 


A 


9769 


191 


2 


FRINH*TSH/LRGQKKKKKTNSKAIGRK 
QITKIRA*LNETQPPESIQRINET*S*F 
SGKIKLI 


9699 


23600 


A 


9770 


165 


2 


GKPPNCFYILNFKKKKGQDVFNRT/WLG 
WAHACNASTLGSQGGWIT*AQECKTS 


9700 


23601 


A 


9771 


97 


912 


VILSTGCSSGPLPGWQSVLHHSLTRCPF 
SFFLSSPTPEME/FSLLLTRLECNGEIS 
AHCaJLLLLGSSNSPASASGVAGNTGI / C 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=^Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 














GLHAC*AS*FLYFHSVETGFHHVGSGLG 
LELLTSDD\RPASASQGAGITGVTPRV* 
PLNSFSRHGLSCFFKETRVLSCCPSWT* 
TPGLK*SSCLNS*E/C*GYRHTPaHPAQ 

dmvfifity/cpffvfpll/vyiprflf 
lrwsftlivqagvqwrwlgslqpppg\h 
kqsetpsqi*tnky*lln*litfffsik 


9701 


23602 


A 


9772 


3 


363 


HELRSSANEKVFASFIGPTILGLPGVLL 
I ILYPPLL IRTS I YLISN\RL\VTTQQ* 
LIKLTSKLMITIHNSIGRS*SLILGSLI 
IIIATTNLLGLLPYSFTPTTQLSINLAM 
AIPL*AGAV 


9702 


23603 - 


A 


9773 


38 


385 


YLI ILDSPEKGLICGCLCNCMHSSLLDH 
LNLT*SS*MCLYANIHIYVAISCIYFI* ' 
F*SI/YIFII* YFHYS lAYVYI Y/ lYIY 
MYTHAHIHMCI 


9703 


23604 


A 


9774 


13 


1330 


DRVSLLLLRLECNGS I SAHRNLGLLGS S 
DYPASASQVAGTIGVHHHTRPTFALFLV 
E TG FHHVGQAGLE LPTSGDPPPL\ASQS 
AGITGVSHCTWPHLSTTGKILSSGTICP 
GIWGQRWWAFLIL\SLPWL*LWHPCLSS 
IMCLSLFFFLCQ/RCKPLISD 


9704 


23605 


A 


9775 


376 


1 


KYFELMYIPVIAICLFNSEVFLNK*QFF 
NLTSSSGIQNLNRLIISNEVESIIKSLP 
TVKNLGPDGIAATF\TYKEQLTLILKLF 
QKHEEARILSNLTEETSITLILRKQQPK 
KVPNEHTVKILV 


9703 


23606 


A 


9776 


192 


2 


NVYQPPPVFLYPPGVAGGV*PF*SPPVF 
RVK/LGF*GSLFFFFFFFETKSRSVTRL 
ECSGVILA 


9706 


23607 


A 


9777 


77 


330 


WLRKFGSHNSGKLTLYLNVYYLQTSFFF 
FFFKIGV*LSPTliECRGPIWVCI*NLCFP 
G*RDFPTLTS*VL\GPRVILEFGFLEKT 
GFSHVPQL 


9707 ■ 


23608 


A 


9778 


133 


325 


PRWVHFPYLGDCTPRVSEMLGLQGWPS 
SSPFFVCFET*SCSVAQaGVQ/WWCDLG 
SLQPLPLG 


9708 


23609 


A 


9779 


143 


362 


RLAPGSCSTCFPLT*TVi;iYYLI*FNIFF 
VFEMESLSVS\HLECSGftISAHCNLCTP 
AWTTERVUAftaNSRSMN 


9709 


23610 


A 


9780 


260 


1 


QKQNQEKKRKKIFR]!JIKI-IQRDYNEQ*yG 
/NKIJ3NPEDMNKF*ETRyT* /PNLHQEE 
IGMIJSRPVTNKVL*LVIKIPLTKKSPEP 
DGFV 


9710 


23611 


A 


9781 


230 


-j 


SSDSKTGSSWLVRCHFFFFLETGSGSV 
TQAEVQWHDHSSLQPPPPQPPK*/PGTT 
GA*STIAS*NAEITGVSHHAR 


9711 


23612 


A 


9782 


20 


322 


SQHFGNPKGNPKIFRNLEINFFRIENAD 
HMIISTDAQKVFDKIQHLFII*TS*QT* 
TERIFIJSrLIKAS*KKPTMIIIIiNG*RLN 
I\FPKTGETKHIC 


9712 


23613 


A 


9783 


158 


379 


LVKIICJSfSFQYISFFFFFFLETKFCFFP 
QVEVQGGNFG*LNLPLPGLNHFS\ASAS 
REPEIPGPPHHPG* ILVF 


9713 


23614 


A 


9785 


54 


312 


KTOKSTKrGRNQSKMAKNSKNQNASSPP 
KDHNSSPARKQNWTENELDELTEV6FRR 
WV1TNSSKLKEHV*PN/GKETKNLEKRL 
DQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 

last amino 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=]VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9714 


23615 


A 


9786 


13 


613 


pgagwarqhhgv*dqpgqhg\krpsllk 
leklaghggrcllspaiggglkqgesle 
tggrdcsephlchc\mpawgnrnktlsq 

/QNKTR 


9715 


23616 


A 


9787 


794 


164 


FSFLFCFFFLRWSLTLSPSLECSCVTSP 
HCNLQLPGLSHSPATVS*VAGTIGTCHY 
TKLS*FF\FDDSL*APFLGWDTETYYP 
DPP SS KGLLPHSGQHGQQHP PAI SRAWL 
SC*VLPHLRS/STSLGQP/RIW*LSKVE 
V*RGFTFSPTQNTPTN1INCS lAPGRVD* 
SLLID*LQWCDHSSLQPQTPVLK\HPPT 
LASQSAGITGISHHTRPD 


9716 


23617 


A 


9788 


256 


380 


YIFRQAKTQGLPPCALFREAVGENTP*V 
FAC/PKNVLFYVHP *LFG* I * lYQLKIT 
F*NYEGMUT/FVFRFLFETRSHLSPRLE 
CSGT I TTHCSLELVGSTNS S I SAS * VDG 
NTGVRHNASC 


9717 


23618 


A 


9789 


22 


226 


TKINSKGNKELNVRAKTIKLLEENIDRN 
LCDLGLGNCFLD*YCIL*TKINSKGNKE 
LNVRAKTIKLLEENIDRNLOJLGLGNCF 
LDMTPNAQTTKGR\IDKLDFIKI*NFWL 
DT 


9718 


23619 


A 


9790 


288 


2 


QVWQE*AETESAIHCWWERKAIQLL*KM 
RWQFLBMTNV\ELPYGSAI PLVGIHGRE 
LER*ST*NIMFTAPLFVIGK/T*NIiPKW 
PSPYEQITKMHSC 


9719 


23620 


A 


9791 


142 


361 


PPGEGKRVPFTTEPSPRIPSKAKSVSQG 
DTCTPMFTAALFTIARMWNRSKCPSFEE 
* IKKMRCITT/MECYSA 


9720 


23621 


A 


9792 


305 


346 


TYEYTDYGGLIL**LITFYDHAVL^IFL 
ICFLFL/YALFLTLTTELTKTTGGDAQE 
IETV*TILPGIILGLIALPSLRRLYITD 
DAPDASLTIKS IGHQWY*TYEYTDYGGL 
ILNS 


9721 


23622 


A 


9793 


2 


317 


SRDRPRVRDR*LFSTNHKDIGTRYLLFF 
A*AGVLGTALSLLIRAELGQPGNLLGND 
HI YYVIVTAIiAFVI I FF IDLP IL I \ SS F 
G\SVYVLLILGVPCTAVFLHSS 


9722 


23623 


A 


9794 


384 


1182 


RIGKIKG/LCFLFV*WFLKFFFKMEFLL 
PRLECajGKI \HCNLLLMGSSNSPTSASQ 
VAGITGMC*F\VFLIEMRFFHVGQAGLK 
LLTLGD/PPRPPKVLGI 


9723 


23624 


A 


9795 


205 


1 


GLQIKCTMRYHPAPTKMAITYFPLKKGK 
G\NNKCW*GS/GTEIGILIHCWWECGMV 
QLLWKTADAWADAW 


9724 


23625 


A 


9796 


307 


348 


QSARL*EAVCPFSDLQLRTGRTTALFKA 
VRQGHLSLQRLLLSF\VCLYPAPRGGAY 
RGRQASLSCGGLHPVRASRLLCLPKQAW 
AMVGAPTPASLPPCSSISHCCaSNQRDS 
VGYDPSSP 


T725 


23626 


A 


9797 


193 


381 


1LLI*IHILLSMISSPFFFETGSHSS/V 
SRLECSGTASAHCSLDLPGSGGSPTSAF 
*VAGTTGA 


9726 


23627 


A 


9799 


153 


359 


FGTYMCFADSYQLSYLFWGTFSWLEEE 
GV*WCICGSLQPQPPRVK*/FLPSLSLL 
SSWDYRCAPSC 


9727 


23628 


A 


9800 


2 


396 


ARAARAARELEELIKIFFFFFFFWGQIL 
ALMPKGGGQGGILTYPNPPLPG*NNFPG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

correspond 
ing to first 
amino acid 

sequence 


Predict- 

location 

ding to'' 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspart!c Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleuc!ne, 
K=Lysine, L=Leucine, M=Methionin6, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ltpprtginglgppgrvnfgifkkkggf 
ppgaegv*npgprgtpppgfpkgr\dka 
phpgpl* tfekvprppnfpq 


9728 


23629 


A 


9801 


256 


347 


LYI LLLNSLK* LVKKLR*NVVIiRSLI IR 
YKVTTKRF* * FSKPAQSEELQLLTSAPG 
LNF*RYIISFFFWIQSFALVA*VGVQWC 
DQGS PQPLPPGFKRFS /CRSWDYRHEPP 
R 


9729 


23630 


A 


9802 


109 


401 


HLRRPCPTPPFALRTSGDQPLSPEARPD 
SGLLHSPLTSPRGLNGQCPPRGSTLNFN 
P*AS IPASPFPSGQPQASQEPPPVASEH 
PPDT *GDPAQPP P\RPQDFWGPATVPRS 
QAGQWPSPLPSDFSKGAQWPVPPTR/PP 
PSTSTQSRWT 


9730 


23631 


A 


9803 


371 


2 


PGFPLFSFPEGNGPSKRQTGD* IRCLF* 
DGKVWECSPPKKKKKTKKRKKAVIP/CV 
PVQTKCI WEGGEETLVGDV* V* P* \GS 
FKHWflMFPEK/DCLCTLYEASFKTKES 

RRVDGFVCVRVGT 


9731 


23632 


A 


9804 


-188 ' 


1 


FT^FFFFSFT*SQSVTHAGVQGT,EQST,PP 
G\3GIJSRASASQVAGITSRRKESWIIFV 
FFVETG 


9732 


23633 


A 


9805 


112 


383 


VFINIRVFRLFTFSFFFKRGFNLSPRVK 
CNG*TKGHCIPDLPGVR*SFPPSLLTNW 
DYRCAPPRPNKFFF/SFETGSHSVTQA 


9733 


23634 


A 


9806 


599 


237 


FRDRSLTLLPRLHCHCVITAHCSLKLLG 
SSELPQASPLSSWDYRHSPPCLANFLFF 
VETRSHY/ASRNSLGSSNPT*A/FPKCW 
DWQV*ATAPSIAYWFSEAEKLILRL]:iftL 
KVSDENICTNF 


9734 


23635 


A 


9807 


168 


2 


PQQDVFFFFLETGSYSVTKAEVQWCDHG 
SVQL * PPG\ STDP PTSASQEATGTHHRA 


9735 


23636 






225 


3 


GPLGYLPPYFPQPLISRG*NPSDFLKIS 
RVFYGT/HYKFEFLFLFFFETESRSVA\ 
RLECSGVISAHCKLCLPGS 


9736 


23637 


A 


9809 


541 


3 


RQLTGINSGRRQFQPQMSNYVIPYMMDM 
I ILKTKKSNNKYWQGCEKTELLIHCLWE 
CKMVQLVWKTV*QFLNRSDIEFPHDPGI 
PPL/GYKRKKMKTCLPKKLCVLVEP/RM 
FTAALFEVAK**KQ/PQTPITR*INTMW 
RIYTMKYYSAIPRKTPLIHP lA* LDTES 
II*TEKIQSQETTYY 


9737 


23638 


A 


9810 


213 


1 


FLCVDFVPCNFAELKLALVGLFVCLLFG 
SVTQAGVQWCNL\GSL*PPPPGLSDLPT 
SAF*VAGTTGVSHH 


9738 


23639 


A 


9811 


54 


388 


PARPLPRQ/WDERPNQQPTKKKKKRGRN 
LPTKKKTQNYSHQNPVAQENPGGKTNQK 
KPKTNPKKKKGGAL* KKTQRGPKQTGGR 
K*KISPNKGGK*NPCGKLLEK]SIPFSGGE 

K 


9739 


23640 


A 


9812 


429 


24 


RSKGAEFPICPCRSSKFSVLQQRQVRIV 
FQHPELGESVAR/CRYSQLLGRLRQ* SH 
LNWGGRGSSEPRSCHCIPAWATRASSI F 
CNFQASSVEVRRSARKKLFSD ILKRHNT 
INWRVSGLLLVDSYFGRLATPVRTQ 


9740 


23641 


A 


9813 


149 


350 


CHFPCFHTTL/CE*EPGEDVTGPIGTKF 
FPAPGRTDGHTHTHTHTHTHTHTHSDPH 
HLCVWNTLIHFV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 

amino acid 
residue of 
peptide 


Predict- 
ed end 

location 

ding to'' 
last amino 

residue of 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=L'nl<Lnown, *=Stop codon,/=possible 
nucleotide deletion, V=po$sible 
nucleotide insertion 


9741 


23642 


A 


9814 


66 


401 


PBNKKTVYRKPIATIIFNCERLNALPLK 
LETRP*YLLY*LFNIMLEVLASWROEK 
EINLSIFT*DIVI K/MKI PKNLENKLLK 
LLSPFSKITGYKISIQKLTSSFLYYGFT 


9742 


23643 


A 


9815 


82 


1 


appparfffffff*ekfssfppgevqrg 
vfcpcnt/cfp*g*kiffl\kplkpcvgf 
wppppgpknffyf*kkggfpfwargffi 
ptpfvpplhpfkkwgvkeg*appparff 

FFFFFL*GRVSLCHPGWirA 


9743 


23644 


A 


9816 


256 




QHFP*LPIPHIPIQT*QHFPQTRSPFQG 
PPLCQKPFSSPWGSKWFILLSFFFFF* 
DRVSLC3n>GWNAVM*SQLTAASNTW\VK 
* S SHLSLLSS *DYRCVLS 


9744 


23645 


A 


9817 


53 


411 


TIYCSNYVYNFVSQHFRSLYS*KNCGLC 
L*FHLFVLQS *NFMQFHM*FCLKI *VDP 
IITFEV*FTyi*FF*TYSSVSFTDAYNF 
VKPHR/VYRGPGAVAHIOJPSTLGGQGG 
WIT*GQEFK 


9745 


23646 


A 


9818 


100 


417 


DEHSEKQDEPLQRPRSKLLPFPGTPPSS 
LLLLNHIGQESGLTHVRIiASCLRMRGFC 
FDMKBRLCLLPMLBCRGMISDHCSLI/R 
PG*DKPPASAYQVGETTGTYHRA 


9746 


23647 


A 


9819 


136 


2 


lYMGMQPGAVAHACNPgTLGGQdG/WWI 
T*GQEFKTRLANMVKPC 


9747 


23648 




9820 


257 


426 


IVPGLFLGTGSCSFTHAGVQ*gNCSSVQ 
S*TPGP\SDPPASASRVAGTTGAHHKAW 
L 


9748 


23649 


A 


9821 


151 


416 


LHKLCLMVIVIESGTAKTPSLSPIFNSVGL 
ILLLLBYSIDSLC* SVLCCYKEIPBAG* 
FIKKEGL/WLGWAHACHLSTLGGRGGW 
ITRSGN 


9749 


23650 


— 


9822 


50 


513 


RGDPRVRPRVRIIKLSEEGMWKAEIGRK 
LDLLHHTISQWNSKEKFLKEMBCSATSV 
IITRMIRE*NSLIADMEKVLW*DQTSHN 
IFLVQNIIKSKGIGLFKLMRAKRDBBS/ 
AEGKLDASRGRFVRLK/EKRSHLHNMKV 
QDEASVDIETTASYLKV 


9750 


23651 




9823 


48 


406 


LI IWYTLTTRWIGHSAVSYPLLRPPYS 
LRHNIEIRPINNPTMASKCSSERKSLTS 
LTFHLKIBCMIKLSEEGVSKaKTG*KI.GL 
L/R /QTVSQWNAMKKYLKE IKSATPVN 
T*MIRKQNNL 


9751 


23652 


A 


9824 


143 


444 


WNKKMNKNEIiADMI LWGCGDKTI FMRKL 
TRDETKYGI PQLALRGHSHFVRDGVI \N 
LDGQVDLS \G * WDGTLHLWDLTSGSTTR 
RFWK/TKDVLSADFS 


9752 


23653 




9825 


297 


2 


HRSPTKETKIIASIKKKTMITKDLNRQF 
SKQDKHLTSEYMKRCPTS /LREM*AKTI 
MR YHLTP I RRAP I KNKNK/NKS KCW * GC 


9753 


23654 


A 


9826 


175 


400 


GGKGGKNFSLKGGEKKTNLGIFGKKPIF 
GGGTNGANPPPKIKGSKEKKNF*VFFPP 
FPFKNFFFP*NL*FLGGGWPHLSPPQK* 
V/CFPKIPKLVFFSPPLREKFFPPLPP* 
NLGPPGF/SFWGPPLFFFFFFFF*RDKF 
SFYHPGWKAVIAAHFSLELLGS SNPPTS 


9754 


23655 




9827 


64 


622 


MFSFFLFDLIIiNHLFFCFV*MFSFFLFD 



1137 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

ing to first 
sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LILNHLFFCFVIRQQFSFLFF/ ISFFFS 
FFHCWWTPELCDFSVCIHSDNSFRFLH 
LF/NF*HMTLCISELR*TLKQSCYSYTK 
IC**FYSVQLSFLTILY**CKS*QALSI 
ALQNEFLSV\QYHKNMNYITLTIBFSRN 
KFLSVTFLLFSIFSHNFI3ISASISIFIL 
HVYIKNNPIGFFK 


9755 


23656 


A 


9828 


298 


1 


MRNCLTDDERKWFPEMASTPGEDAVNTV 
EMTAKDLE/Y**YINLVDKAASDFEMID 
SNFEHCSTV/WKWLSNSTACYRENFHER 
*GQSVQQTFVRVGERV 


9756 


23657 


A 


9829 


361 


1 


FLTGTQWGK/DSPFNTWC* DH/ WNIHRN 
LDLYLIPYI KINLKQLTGPNLRAKT I KL 
PEQNiaENLCDL*LSRERYSTKSITRIR 
KL/DTLGFIKIKNKCISKDTIRKR*ATD 
WBKIFANHVLKG 


9757 


23658 


A 


9830 


402 


2 


RFHHEFSFVLFGPFAKVAFPIGAKNFIS 
NHYWAGVFPFWEKKNKIDSFLPPFSKNP 
ILGELKA*F*IIFLF*GFQIFF*SLFFCP 
FGGK* IPGVFSFFFFFLVFFEIGSHSVP 
\RLKCSGTITAHCSLDLIiGSSN 


9758 


23659 


A 


9831 


2749 


3215 


FCQ* IKMYVMCICI YNFRRVYLCIYTYL 
HTHTFTHTHKHTPTQIPEKDS/QCSLSD 


9759 


23660 


A 


9832 


3 


386 


KLRLGQLNTSAMAAKC*SERKS/R/THL 
TLNQKLEMIKLSEEGMLKAVIGQKLGLL 
YQ/TSQVVNAKEELLKDIKSATPVTA*M 
IGKQNSLIVDLEKV* W* IEDQ/TSHNI 
SLSQSLIQN 


9760 


23661 


A 


9833 




370 


RRCRWPPDPSRTVGRQIGKLVTHRPTVF 
QERGCPFPLTRQAGSHHGGGAFAQVISP 
TKS ISPCGRGGSRL * SQHFGRPMQVDHL 
R/LGVQDQ/RWPICGQYGETPA\LLKLP 
KISWAWWLAPVIPA 


9761 


23662 


A 


9834 


123 


409 


GQENRBTWKIVHLV*VLYT*HIKRIiNCF 
SHFI*CYQPTASQflHVHHDSNDSSTHV\ 
N*NSRWPGTVAHAC3SIPSILEG*GGWIT* 
GQEFETSLAMM 


9762 


23663 




9835 


223 




PKPQKMGTF*PPMEGS*MGPPHWEKMGP 
FLKKKEGHPFFFFFFEMESRSVT\RLEC 
SGA/TISAHOSLCLPGSSNS 


9763 


23664 


A 


9836 


377 


515 


FILFLRQVLTLLPRLEYSGAIVAHCSL\ 
AF*G*SDPPASASRV 


9764 


23665 




9837 


274 


1 


AGEWHDQI CVFKI PLMKNGLHGARVGQE 
SWKERNQQV*DRCQKQNLSDS*IQT*R/ 
M/WPGTVAHAYN/PSTLGGQGEWIT *GQ 
EFETSLflHMVK 


9765 


23666 




9838 


553 


114 


GAGVKTHPGHKGKTLFFFKFFFQKLPGV 
GGKTLPS PLSRGLGRENSFHPGGKGSNK 
QSSPPSFPG/WGKKGGGLPFQKKKKKEK 
RKKGKKNGVAPSEPPLKYSN/WQATWG* 
RKLLNDTiyiRVQPTKSRMQKILQDKGPVN 
GIFTKTGRL 


9766 


23667 


A 


9839 


36 


434 


LPFPQCVTEFIIVLISWCYIREDACKML 
KHVAITIINVCIYICVCVCPSTLYVYMY 
MLP/HLSDTLLTLN*VTLVEMLSSFQCL 
MFWYHGH I * K*HE IJ3 WAHACNP STMGG 
QGGWIT* CQEYKTSLANMVKPR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
25 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Ttireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 


9161 


23668 


A 


9840 


268 


1 


RFPPFFCFLTILFWAPGWFSNFLGVRA 
PQPP* IFFPI*KKPSPLIFFFFFFEKES 
RSVS \RLEYSGT I SAHCKLCVSGSQHS P 
AS AS 


9768 


23669 


A 


9841 


463 


86 


LYMHE FI SGLFLLFCWS I /WSVLCQ YKN 
CFDDKVYGLDYGNTS QVFT *NS KL I KL Y 
TLTMYS FLCINQTT I KPSEM IQATKflNI 
DK*NYMKLENFCTAKETVNKMKRQLKKW 
EKVFSIHIPQEVNI 


9769 


23670 


A 


9842 


37 


400 


VHSPPFKFWSLYVSVLKFPFCSLYFKFL 
ALYS I FLQMCQHLLV*AFFF * KRGLNS V 
T\RLECRGAISENCNLCL/SGSSDNWAS 
LPRKAEAGNSFT/CR*SQVAGNPIPS IS 
TRLNQRKTPFES 


9770 


23671 


A 


9843 


42 


407 


NCLSFLITFFFFFLFEISIKVSFCPQG*G/ 
RGGPFWPHGTLPPRG*GNNPP*LPGRGE 
*RGAPPPPGYFWLFGKKRGFPL\GQGGS 
KPPALKEPPPLGPPKGG\NYKRNPPPPP 
GNFF*LPPYQW 


9771 


23672 


A 


9844 


113 


379 


LGPWATSIjRGRFLGYQLQQSKKH*NYP 
TVR*GCGEMRTSVHCW*ECKTA*PL*KT 
VWQFLKKLNTGLPFDAETPL/LAETPKE 

LKA 


9772 


23673 


- 


9845 


284 


528 


plpktmeimldkkqiqtiflfkfkmghk 
iaettrniknafgpgtane\kctaqwwf 

KKFCKA*E\SLEDKEA\*GHPSEVDIN 


9773 


23674 


A 


9846 




1295 


leqgcnlflhkhs *lgevfffvcfrrs f 

TLVAQAGVKWRDLGSL/HKLPSSLSLPS 

swdyrpplprla/nffvflvemgf\tvl 
armvs is * prdppts asqsagi i gvshh 
agpvagilmfcrwnisnkpkgavfkkk 


9774 


23675 


A 


9847 




387 


AYRMKIIDRISLSLYTAALWVTLFLFLF 
LFFFFKTEFCFAPQAGGQ/WGQFKLMDP 
NPPPPG* K\DFLVSHPRDLGIKGAPPQC 


9775 


23676 


A 


9848 


298 


438 


KIPRGAPNFPGGGKIFSPPLGGPIKTPR 
GALEKKTLF*VRGPWPGFVKKESPRPKK 
LG*QIFWPPGKKPNPPMGG/SU3PPPPL 

PGGPKFPRGGKNFFPPFRGAYKNTPGGS 
* EKNPFLGGGKKRKQP 


9776 


23677 ■ 


A 


9849 


251 


498 


AIIKKMESKKFW*RHGEIGTIiIHCYWEF 
KMV*PFWICrV*QFLK/D/LNMGLPFDSA 
FQLPDICLGELKTYVHTENCTQMFMAAL 
F 


nil 


23678 


A 


9850 


478 


37 


E SRNKHSHLWS INFQKGLR* LNRGKQPL 
NK* CRHNSI STCKRMKLDPYLTPYI KI \ 
KSKWTKDLNVRAKTIKSLEENIEVNLHD 
LG*GNDFLDMTPKAQTTK*NID*LDIIK 
I*NFCGSKDTINKVRRQSTE*EKIFANH 
VADKLK 


9778 


23679 


A 


9851 


2 


378 


RLEGLFLCALFCS lAICMFFFFFFFF* K 
KRGPFGFQGGDPGGNFRQLDPLPPGIKQ 
FWGP\PPRKAGKTGGGPPPGANLGFWGK 
KEVPHGGQKGFKP/RNPRGSPRPGPPKG 
GVAFVFRPKPPGLEQ 


9779 


23680 


A 


9852 


229 


3 


FGPYKIFFKKKGAPCPLENPPVF\HQiCL 
GLGFFFFSFFFF*DGVSLFLLRLKCSAA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arg!nine, S=Serine, 
T-Threoninc, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














I LAHYKFLLPG* SDSAASAS 


'9780 


23681 


A 


9853 


^08 


420' 


GPRVCVRLSLRDDSDLLATEDSLAHTEG 
DPVTASSHSQAPSLSVHPGAALEGVCWG 
TGAIAGTPEPQKPPQLTPGGRPA\PIAP 
GLTWEPPSHPPPQ*RSPPPPP 


9781 


23682 


A 


9854 


2 


256 


FFIFPLRYYIYCARFQFLSPILYLGK*K 
MD* *RRVFQEKWEQAYFFVEV\SPMCLI 
CNQTLSVSKEYII*RCHYETNHGENFD*F 
TE KMHDENLLHLTTRQP 


9782 


23683 


A 


9855 


323 


3 


I IQMANKHRKRH\QSQ* * VRVMQKKI IM 
I HR Y I PQWLKTS KS DM I KCC SG YGTTE I 
FTHYLWECKLLHLLWEMVWL\ILLKFKT 
DTPHDPAI PLLGTYPAECMHMYPRA 


9783 


23684 


A 


9856 


28 


385 


DGVTQAGTQWGS KFTAA* TA* YQGS I PN 
AWSFFSPPPVLKTPPPPQKK/YPPPKKK 
KIPPPKKK 


9784 


23685 


A 


9857 


436 


518 


AANRLENI YRHLI YDKD/EHYRIVGKDD 
LFNKWCWVNWI /ASNKEKNLDSYLIPHT 
K/LNSR*IIQrJ3VKD* IIKLLEDNLYLH 
DLGDRQKFLGRI/PVFTIKKKIGKFDPM 
LKLSTFVHQKTPC*RLNSA 


9785 


23686 


A 


9858 


153 


492 


RLAGSDPGVADVSVQLQRQEKSPCPSWK 
AVSQAESSSSSAGVSLVLFRLLADWMRP 

1 PI keenkl\sqstdlnfnfiqkhchrn 

TQNSV*PGGVGPPGTCGPFRRQSSP 


9786 


23687 


A 


9859 


2 


419 


TTGKLQVSHKKSTYSHFSKKPTHTTNKD 
MKRYSPSL/AIREMQSKTTG\MRYHFTP 
KYG\]SINKCWLGCGETETLIYGW*ECKMV 
QPLWKA.VWHFLK*IiNIES/PIY**NYS* 
YISYSWEKTCITIYDPVILLLGIYPREV 


9787 


23688 


A 


9860 


88 


419 


TFFFFFGFWFLKTKPLFVPQLFEGPGPI 
LG*WNPPPRD*KNFS/GPNPPGGGE*RA 
QPPPPGYFLFFKKK\GVSPWGGGGSKPP 
TPGNPPPGPPKIRVIRGGPPPPGGRKHF 


9788 


23689 


A 


9861 


301 


401 


KRA*GGQEPIKVCPFFY]!IRKVGNGSPPK 
JCFKKGRKAAFFGLRANKEGLFVKRGKISI 
WVGKVGEPIT/DPFKEFGGR\LPEKDGP 
LV . 


9789 


23690 


A 


9862 


220 


415 


KIMGGAQIFRGGGGFFFFLEGWEKNNFG 
VSFRRKFFFGGGVFIiPPPP*HKKNYFSS 
QRQYISLGGGGRKTPPPKKNFLLKDTPK 
LFFSHPSKKKKNPPPPRKIWAPPMIF*! 
PPPPIFFFFFFFFFFFFFFFFFKISr*P/S 
FFFNFKKPIFKTFLSPPFKVFPPPKKKK 
ICKNPPISYRRWPLAt 


9790 


23691 


A 


9863 


116 


366 


gqefetspamfcfetescsia*awqwh 
dlsslqplppgXsgdspasasraagell 
eprirrlqsveitplhsslgkrvrlhl 


9791 


23692 


A 


9864 


619 


295 


fffemefslllpelecngaisahrnlrl 
pgssdspasaggll*sqvagitrlrhhd 
*lily/flvemrfhrvg*aglelltsgd 
ppssasqgagitgmshsaghygkif 


9792 


23693 


A 


9865 


12 


432 


iadrrlfftnhkdigtlyllfga*agvl 
stalslliraelgqpgnllgndhiyngi 
gtahafgi ilvivi pi iiggfgn*lvtl 
iigapdmafprimnisfgllptslllll 
tsailearagtc*tvypcaja\ggyshlg 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenj'lalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














A 


9793 


23694 


A 


9866 


30 


380 


LFSTNRRGIGPLYLLFGA*SGVLGPALT 
LLI*AELGQPGYLLGDDHTYNDIDTAHA 
FVIILFIVIPIIIGGFGD*LVPLIIGAP 
DMAFPRIN/NISF*LLPASLLLLLRSGI 
REGRSK 


9794 


23695 


A 


9867 


256 


395 


EVLRKSEKSFETVCIINNKC/WPGMVAH 
ACNPMTLGGRGGRIT * GQEF 


9795 


23696 


A 


9868 


458 


2 


PKKRFFSQKPPRGFYSAPLKGKNYIFPP 














RFFFFFIiEKG*KF*NFFFT*NMVFFYIS 
/CSKKFPFQFVLTPIPF*KVPKPVWFFW 
DFPI FFKKIFFFFFFCDRVSLCHPGWSA 
VARSRLTASPRV 


9796 


23697 


A 


9869 


144 


425 


IPLCSRIYSFGPC*FSLISSSSCLSPFL 
SIPLCVLS**T**YLFFYISMREIYKTR 
TKGIYPGAQENSLYTH\FSIEVQSTIKA 
ERFWPGAVREACNPSTLGGRGGQITWGQ 
EFE 


9797 


23698 


A 


9870 


138 


401 


DLRLKIiNLSKNIIRLQGTVESSVLTIKE 
MQ IKAKVRYNLI PVKI KR/SGNDRWW*G 
CGERGTLIRC**ECKLVQPLWKTVWSFL 
KKKKK 


9798 


23699 


A 


9871 


179 


441 


PSGKGGIKGPPPPPKKIFGNFFFFLKKK 
GVPPGPPGGPKPRPLETPPPNPPKGGEY 
GGGP\WPPPP IP *FFKGPFKkKGPPWTN 
RGNP 


9799 


23700 


A 


9872 


57 


435 


FTQMRiaSrLKNNSGMMKK*GS ITPLKDHS 
NFPAVDPNQNEMFEIPGKEFKK/SDY*V 
TQGDTRERCKPT*VLKTIIiDMDEKCSKE 
MDILMKKQSELLEKKDTFRELQNAMESF 
NMRLDQVEERITELE 


9800 


23701 


A 


9873 


228 


443 


FSSKFILISVFLFLFLETGSCSVAQAGV 
QL*DYSSLQPQSPG\SGNPPISASQIAT 
TAGACLHALLIFVFFV 














9801 


23702 


A 


9874 


349 


2 


KKPKKKNFLPPKMFGFFSPFSP*KFFFF 
LKGENFFRGFFPNFPPPKKKFFSKNSQL 
VFFLPPLKKKIF/CFPTPVKFGP/SQRF 
FLKGPPLFFFFFFF?LDRVWLCYPGWSA 


9802 


23703 


A 


9875 


405 


2 


KKASRMENVRKKKKLEMIKLSEEGMLKA 
NIGQKLSLLAKQQWNAKENLLKE IRSA 
IPLNTQMIRMQNSLIADMENIL/WWI/ 
EDPTNYNIPLSQSLIQSNALT\FNSMKT 
ERGEEAA*EKFDAISG*FMTFKERS 


9803 


23704 


A 


9876 


355 


394 


THPYYSHQEYQSP*P/LTGALSALLKTS 
GLAM*GHFHSITLLILGLLTNTLTIYQ* 
WRDVTR*SAYQGHHTPPVQKGLRYGI IL 
FITSEVFFFAGLF*AFYHSSLAPTPQLG 
GHWPPTGITPLNPLEDPLLNTSVLLASG 
VSIT 


9804 


23705 


A 


9877 




85 


TI YIHI CVRI YACE CVCVCAYIHI CGVP 
AHLTEQNL\KPARTQSPPAPQKNPELPV 
DLVS I SLDREAQDVELNHYRLGNTCRTD 
VLPKVHTLCLRL/NFFPCLPILEELQEL 
P*LENIPLFPQPIDYIYTYMEAYICV 


9805 


23706 


A 


9878 


2 


389 


GRGQFPPEFFYFFWGGSLVPSTPNKKKP 
LGEVLGGGWSNRAPKGWGGGFPPPILAF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KPKGPFGPKFVLKFSNVPPLLFFPKMRG 
PLPKGGGFLPFFFFFETKSRSVTQPGVQ 
WCNFGSL*PPP\TGSSNS 


9806 


23707 


A 


9879 


351 


' 


FLGPSKNFNPFPPQKWPFAFFPKSHRAR 
VNEIFS/RLSVINF*GLTKPQRLFFSLL 
NFSS \ PI PPFQFPSRLIKEIRFSTFLTP 
ENSKIKKPLFFF*FFFF*DRVTLCCPGW 
SAWQSR 


9807 


23708 


A 


9880 


123 


387 


SFLWKLRPSGAPTCMRCLLA\LLGGVSQ 
LGYTGVRDPLEEAVCPFSELQRHAGKTT 
ALFRAVRQGCFFFAYF*IELFPCY/CSG 
RVGG 


9808 


23709 


A 


9881 


341 


659 


S FLGRVQWLHACKSQHFGRLRQVDHLRS 
GV*DLPGQHDETLSLLKMPK/QKLARHG 
GMCL* SQLLRRLEQENHLNLGGRGCNEL 
RSQHCTPAWATEQDS ISKTKTKK 


9809 


23710 


A 


9882 


216 


1 


PKFFWVLRD6QIFKPGgGPQ6FF/GGFG 
NPQGKKKPGFWGLG* KKKGFFFFFFFFF 
LRDGISFCCPGWSIQW 


9810 


23711 


A 


9883 


2 


244 


GRVGAVGRREGENFIKCDELPPPPPEGK 
QQNLCKFLTGFSLPCQDKKP\CFFV*KK 
KKKKKKKKKKKKKKKKKKEKGGGGA 


9811 


23712 


A 


9884 


310 


2 


IFLQTLFPNTFENSLSEYCIIFSEYSFI 
**KQ\TNFILSSILLYyi*PSGSYISNL 
*Y*ELFSQ1ILSYQHFLWI*KKWGLGAVA 
HTYHPSTLGGQGRWIAWSPG 


9812 


23713 


A 


9885 


242 


358 


KTVWLYMNI *MftIinSIHME*CSLPLVIREM 
* IKTPVTVH\TVGMAAI*NKK*NITR/C 
W*ACRETAVLVYCWGEFYTQPL*KTVWL 
YMNIE/I PCDPAIPLLGMYLKE*RPGTV 
AHTCNPR 


9813 


23714 


A 


9886 


360 


2 


LLKLGFPFWAKKGKKKRLWVEGPLGKPP 
GGFCQKLFFSPGGGPKGFSPSIIILPKG 
* PFVWGPFREVF* KKRFKLKKKFRPTFS 
/Y/LFFFPETESHSVA\RMECSGAIlljftH 
CNLCRLGSSD 


9814 


23713 


A 


9887 


236 


^47 


IKGFFPF*TESHSVT\KliiECTST7SAHC' 
NLCLPGSSNSP 


9815 


23716 


A 


9888 


310 


3 


NFFFFFETGSCSVTQARVQWSDQ/GSLQ 
PRSPGL\SDPPTSAS*VAGNLKFLSGLC 
SSL*SLEIS*DTO;SQVQITDDLCYTW 
LCLFFEMESHSWQAGVQWH 


9816 


23717 


A 


9889 


119 


391 


APAYWNPHARDSPFFFF*TNFPFFPQVG 
GQWPNSGSPQAPPPG/SPPP/YSASASK 
ISGAPGAPPPPGPNFFFSFFSSPTFFSR 

DGVSPYEP 


9817 


23718 


A 


9890 


195 


3 


DGVSLLLPRLECNGTILTHCCLSPV/SW 
DYRRLP*RL/V*FFLFLVEMGFHHVAQA 
GLELLTSGDHP 


9818 


23719 


A 


9891 


3 


378 


RDGERNMLALGTASAKALKYSLZvLSLPR 
LECSGTI ISHSSLNPP/GLKWSSCLSST 
SSWDFRPAPPHLAKKTKQQ/HATFF*EG 
ES PYVAQAGFKLIASSSPPAFSLPKCWD 
YKL\DRHAW 


9819 


23720 




9892 


231 


1 


PKPFFPKFFSP*N*KPFFQISPAKIQKI 
RVYQS\HKKFFSFFFFFHGVSLLLPRLE 
CNGTISAQCNLCLSGSSDSP 


9820 


23721 


A 


9893 


2 


347 


APARQEWWCVYVCVCMCVCIWLHRHC 
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SEQI0 
NO: of 

sequence 


SEQID 
NO: of 

sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 

Tocatlon 
eorrespon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














E VLCF YKMLGKKR S I YAY* LVY * Y I Y IH 
MCVYIYI/YIYV/YMYIYTYVCIYTH 


9821 


23722 


A 


9894 


28 


382 


TAHLEAGATGYRTFLLPAIGLIiCIiPWVL 
DDGAGQTAQ*SMLLDHiiMLIjAHRAHELF 
I /DTYYEFEETYIPKDQKYSFLHDSQTS 
FCFSDS lATPSNMEETQQKSNLELLRIS 

LLLIESW 


9822 23723 


A 


9895 


93 


386 


FWKIISHYYFKYFSCSILISPSDLPITC 
ICMFHLLKLCQSSEFLHUiLFVWLVFET 
QSHSIKQPECS * * liSAA/CHLPGSSDPP 


9823 


23724 


A 


9896 


174 




TLDWGPRKKNPFFFFFKTRSRSVTEAGV 
Q*HNHGPLQPQPAG\SSDSPTSASQADA 
W 


9824 


23725 


A 


9897 


251 


386 


LQRLECSGTI/SAHCNLCLLGSSNPLAS 
AS* lAGTTGTLTGDVDST 


9825 


23726 


A 




127 


393 


GHMGTLTGSPATLSILAIiFFWGGRISTQ 
LDGINKS XFSLFLYFLRHCSLSLRLE* S 
GAI ITYCELEIiLGSGDTPASASQETGTA 
GTCHT 


9826 


23727 


A 




3 


334 


KLRLGQIiNTSAMAAKC*SERKS/R/THL 
TLNQKLEMIKLREEGMLKaVIGQKLGLL 
YQ/TSQFVNAKEELLKDIKSATPVTA*M 
IGKQNSLNCDLEKSCWIjIQDQTG/HWI 


9827 


23728 


A 


9900 


356 


1 


GRKPFStniPKATLLPMGPVLGGRRALMG 
PDSRPGPVPSCSLVLLTPLAPLPLTARE 
SLa>CPPS*TPQPSVKP/H/GKIjARRSP 
CWSGRQSLP*AEIVPLHAPPALGDRDE 
TPSQKKKK 


9828 


23729 


A 


9901 


118 


343 


I PYAKEKKKQENVLRFIHVNLCISNWFW 
FETGSHFVTQG'*VHWCISrLSSLQP/PIirLQ 
GSGDPPTSASRARGATGVQH 


9829 


23730 


A 


9902 


321 


50 


3LRTRV*RPPSAPG*NPFCLKPQKIFQG 

ggXesplsqilkrvkqensyhlggkgfn 
*pklppcpltwatkktsppkkkkkknyr 

TWETSVS 


9830 


23731 


A 


9903 


317 


8 


NeyDPNSGIVIPLLGILPKSTK\T*VRT 
KTCT*MLIVALCI ITKKWK*S KCLSTDK 
QVNKI * YIHIMEYYS PIKGRKY*HTLQH 
/WINDSrLENIS *KRLDIKKPHI I 


9831 


23732 


A 


9904 


287 


3 


RGAUSTGRGGPRGVKSPGFAPFKIjQLKPP 
GTFI)NPK\QFPFFPNPPF*KKGAFFHFF 

LPLF*SPRWTEFYSVT\KLECGGAISAH 
CNLCLSCSSDSRA 


9832 


23733 


A 


9905 


2 


406 


PRVRTSSRSRATALFFFFFFFVFFWGKG 
EIWAPPLKNC/IPPEKFCYFGGGAPGAG 
LPPPPGVIFFSLCRWKKKKKKGAPPGG 
VLGKKKKTPFTPKGGTFLTLGGF*KKSL 
FGKKTLLWVGGPLLLKNFF*EKFR 


9833 


23734 


A 


9906 


1 


296 


IWVGATECI FKNNPTIWGHQETHFTCKD 
T YRLKV/ KGWKKI FHTFHTNGNQK* AGI 
AI VI SDKTDFKS KTI KR*KGHY/ IMTNG 
SIQQGDILDWYKSNCSF 


9834 


23735 


A 


9907 


474 


41 


FiyiEYLTLSFIHVLLQ*FIHCFVHLFIHS 
FFYALMKSLIQ*FIHLYSCF/ICIDLLI 
SS CI YS FI YALVKSLAWSLTHS FIHTFL 
Q* FVHLFTPLLICS FTHSS IH* PSHSLT 
FIQLHLFFHALVSGFIWSFIYLFCKINV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 

amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
acid 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FLIDK 


9835 


23736 


A 


9908 


286 


84 


LKPPPPFPFFFQANFLCFIRD/GFHHVG 
QDGLPLLTS*SASHGLPKCWDSRCEALR 
PASPLISGPHQTI 


9836 


23737 


A 


9909 


316 


63 


DQLLVRSIGFEALMSPIFFFQMEF/SLS 
LPKLECNGAISAHRNFCLLGSSDPPASA 
AV* /PiXIftNLLKLCASVSYCIELVLNELH 
WIK 


9837 


23738 


A 


9910 


370 


461 


F*F*FLFSETESHSVA\QLECSAMISGR 
CNLRLLGSGDSPATREAGAQESLEPRRQ 
SEPGLCHCTAAW 


9838 


23739 


A 


9911 


294 


28 


pgknstvqkepgkwfcemkftpgenavn 
iv*pttkdlet/wini/dktvagcertd 
snfersstmgkml\snsiawyteifrgr 

KSQLMGI 


9839 


23740 


A 


9912 


307 


463 


CKL*TLSNNDVIVGSLIVT/TCTTLVGV 
LIMGD/RLCMCGEKEYLGNLGTFLSIL 


9840 


23741 


A 


9913 


377 


28 


REMLTVKPIiAPSFFRWSRAYRAIFFITP 
TWVSKSGKVTGPLFFLHIFFPGP/CPGL 
PIFFFIKTKGTRFFFGWFFFFERVSLCC 
PG*MECISrGAISAHHKLRLPGSRYSPASA 
SBFHR 


9841 


23742 


A 


9914 


384 


725 


hltairmtvikdtenskcergc\ktlmh 

SCWEYERVQPLWKTRW*F/LQQVE/LPS 
PCDPSIP/LLRMYPK*IJJTD/CKR1«IC/C 
IPTS lAALFPVTK/SWKQ 


9842 


23743 


A 


9915 


187 


3 


QPHKHLGLDNIYIPFEIGFCSVAQTGVQ 
*HDHGSLQPHPP\GSRDPPTSGSRVAGR 
PRQENG 


9843 


23744 


A 


9916 


15 


167 


DGVSLSLPRLEOSrGAILAHCNLR/LPGL 
K*FSCLRLPSSWDYRLPPPHPAHV 


9844 


23745 


A 


9917 


345 


477 


YCSRTSLPL/TLKKGSA*PGAVAHAYNP 
3 ILGGRGRWIT*GQEFK 


9845 


23746 


A 


9918 


460 


42 


HQRPKVDKTTKMGEKRSRKTGNSKWQSA 

s/appkerssspameqswtendfdelrq 
egfr*snyselkeevrthgkevknlekk 

LDEWLTRl/SNAEKSLRDLiyiELTPKA*Q 
LPPERTNLSS*FNQIjEERVSV!yiEDEINK 
MK 


9846 


23747 


A 


9919 


282 


21 


AAGYLQPSEFWLHSLGSRRGLPISVFPV 
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SEQID . 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl<nown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RGRQAWLSCDGYHPVRPSRLLCLPTQA 


9851 


23752 


A 


9924 


254 


3 


PGGFYFPGFLPPPLGFPKGFL*RLWPPC 
PATFFFLYQISKFPFFFFGKYQKNFFFF 
FFFFPESKSRSVT\RLECNeAISAHYNL 


9852 


23753 


A 


9925 


309 


1 


YTPTIMLLGIYSNELKT1IVHMKT*I*MF 
TATLFLIAETWKQPRCSSLS EWI /QQLW 
YIQTI/EYYSWERNIK/LMKTW/MKLK 
YILQSEKTQSEKTT YI * FQLYDI 


9853 


23754 


A 


9926 


237 


2 


RWWFNFTQLILKCI FLRTKFFS *VSK*N 
FKYFLGFWFFFFLRQGLTPVT\RLECSG 

TISAQCNLRLPGSTDFSASASQE 


9854 


23755 


A 


9927 


244 


1 


KPQCLW:iK*D*WHNTQKLL:VEKGKPL,IH 
YWWERKLV*PLWKTVKRLMR/NLKIELP 
YDPVIPLWGNYSKEMK^LCQREDSRA 


9855 


23756 


A 


9928 


2 


644 


IVQVDQKIRLNIHCLQETHFKCKYTYRL 
KVNR*R*VyHVNINQEKALVAIDaAYFR 
ARKITRDKERHY/ IMIMGSMLQKD 


9856 


23757 


A 


9929 


1 


364 


GTSGTRPINNTTVFSKCSSERKT/R/TH 
LTLNQKLEMIKLSGGGISKTDMG*NLGL 
LHQTVSQWNTKEMFLKEIKSATPLNT* 
MIRHQNSLIAAIGKVLWWIDQ/TQHNT 
LLSQRLIWNKflLT 


9857 


23758 


A 


9930 


88 


374 


ILKACGPCPGNPHPWGPL*GQGGLGLNP 
SKSPPHPHPRGDPTWGGTPIGDYGPKPP 
KLPHGflGLPKKPCB SGGONN/PKISGP 


9858 


23759 


A 


9931 


384 


2 


LCSPRFFYRKIWGGGQGSRLLPPPFWKP 
GGKDCFSPGVLDPPGEQRGAPCFP*KKI 
RWG*GRAPWPPPQEG*/VGRAPLTPKI 
PAPVSFGCSPALPGEGQSETPFFLKKKK 
KEKSEAMIPPPPGPRA 


9859 


23760 


A 


9932 


32 


359 


IPFFLFFF/C*KGGPPPVPQAGGQGPKT 
PPWPPIWAKKTKPLSKKKKKKKKGAGPP 
PPPKGGPPPLKKGEGPLLKRKFLKKNGG 
EISIFKKKICIWAGG6GNPREPPPFGGAR 


9860 


23761 


A 


9933 


138 


1 


TGFFCCC*SFALVAKGGVQWCBLGSLQP 
PPPG\SSDSPTYASCSC 


9861 


23762 


A 


9934 


103 


355 


IiAGC2ACCS*STSANRCVIIHA\PGYANGR . 
CDTITCILVFIRNSWIiITPIALITMFCY 
HVG*VRPQEQAPGNRVSLSDLLL\PPVT 
C 


9862 


23763 


A 


9935 


29 


363 


AWSHHAFLLFFFKRELAFVPQRGERENIT 

fvslkppppg*spfsapnppkggnkgpp 
pppqliffyf*/gekgfspynpegpkpp 
tlgkpglf*ntkkicpqprglkkggqhi 


9863 


23764 


A 


9936 


173 


363 


PKKRPIISLKEAFNSNPGD*QKFLKRGL 
PPMGSQFKRGNYPYDKNLEKLCPGGGWK 
GSG/YHYGL 


9864 


23765 


A 


9937 


360 


3 


NQKMRKNQSSKTGI CKNQSACPPPKERS 
SSLAMEQSWMENVFDELREEGSR* *NYS 
ELKEEVRTHGKEVKNLEK/NLDEWLSRI 
S/N*EKSLKDLMELKPKaQELHGECPSL 
SSQFNQLARA 


9865 


23766 


A 


9938 


194 


370 


VSNFFFFFETEFCSFAQVGAQGQNFR*L 
KAP P P \NCCLF PT * AS RWGTTGTCHHS 
WLI 


9866 


23767 


A 


9939 


426 




SQEFKVAVSYDLTMHSNLSNSETLSQKE 
ierKQTKTKQKKTY*EGRKHIKRCPTPLI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
etii 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IRE IQIQTTVRYHFTHIRI /VLLPQKQN 
KCWCRVEKLKPLVRC*WECETVQPLWIT 
VWW/FIiKILNIKLPFDLATPWCiyQME 
FSC 


9867 


23768 


A 


9940 


255 


16 


RVIYNIYIMYILCIYTPSRLLVEIGFHH 
VGQDGUDLLTS*S IRLSLPK\CWDYRYE 
PPCVHIYIYLHIYLIYMSDGVKY 


9868 




A 


9941 


206 




FLQGHTAIKNGRG/WPGAYAHTCNPSTL 
GG*GGRI I * GQEFKTSLTNMVEP 


9869 


23770 


A 


9942 


3 


402 


HEEUJPRSATSSPALVIiHVTIAHVTIAF 
AEGCFQHDFCFSQPENRHIPLVFTTASK 
VRNIFYLLCRGRGILHFFLVCLKIDPAI 
PLMGIYPKXKSLYHKDTYIHMFIAAEFT 
VAKIRYQSKCPSTDD*IKKICV 














CIKILLKFLCIiFCILPPCIJJFFLRQSCS 
VARAGVQWX SLQP * P PRLKHTPTSAS * S 
AETRGTHHffiVHLIFSYNFL*R*VSHVA 


9871 


23772 


A 


9944 


321 




VPSWAG\PVAHACNPSTLGGR6GRIT*G 
QEFE 


9872 


23773 


A 


9945 


142 


480 


QEAKSFDFSDSTTMEVMLDKKQIRVIFL 

fefkXmgrkaaettcnienafapgaand 
*qmrwwlekfckgeesledeecsgrpse 

VDDDQLRAI lEADPLTTIQE VAKELS I D 
H 


9873 


23774 


A 


9946 


191 


3 


FFLPRGGGDSVFKKPPKECKNGFPKIFPF " 
CPaFPLAT/lNFPPQIFFFFFFF*DGVS 

LCHPGWNAV 


9874 


23775 


A 


9947 


451 


461 


K* INK*ICVKGW/RKEHHANINQKKVDVT 
IFISDNMNFRKKNKITGDREGRYIIIKQ 
P I HQENI AI liNVYVPNNRVAKYVK* KL I 
ELKKEIHKFTILVGDFNTILSTIDRTTR 
*KVS 


9875 


23776 


A 


9948 


186 


405 


NHTLAISLF*PYKP*AILKLGIYRKERK 
LVC*RDICALMFITELVTAAKLWNQPKC 
PSTDK/WKMWYICTMEYYS 


9876 


23777 


A 


9949 


18 


412 


PEFRITTILPEKKWLL*LTN*VSKFSGY 
NINI*TSVAFF\ySrmKISETGIKIVPF 
TILSNRMKYLGM/TLTKNMKDLHTENYK 
MLRKENEEYQMGNMGSWILKINIVKISI 
LHKVIYRVKFLSNFNAILLKK 


9877 


23778 


A 


9950 


1 


396 


LAFDGRAERLKTGFHRVTQDGLELLTS * 
SVRLGLLKCWDYWREP/PVPGLWD 


9878 


23779 


A 


9951 


307 


405 


GI RVRR* PQGLRPPNPDALVFKKMVWPP 
PTN\PRGWGGKGWEPPQGPQDPPPQAAP 
GFLPETWRGPWNSLPTRGAWPPNSNLPP 
WPPAPSRHPAQRAPGPSTAAPPPGGGGE 
GWDPPGRV 


9879 


23780 


A 


9952 


2 


402 


YGRPKEGGSLRFWLTTPGPKGETPFFL 
KTQKITGGGGGGP * FPLPRRVRQENSFY 
SRGGGFN* PKHAPCALTWGKKQKFFLKK 
KKNTPQKPKKLPGCRIQY*TQK/SSSLF 
MGGTPGKEQ 


9880 


23781 


A 


9953 


249 


1070 


S S F I S RF YF I CI YLFETRS HS VTKARVQ 
CCDYN*LQPQPPGAQEILPLQPP*SAG\ 
ITGVAPNTPQL I FYFFVE/ SGGSTHVAP 
RIiALNSWAQNIAL/RLPKVLYFTF 


9881 


23782 


A 


9954 


75 


396 


GFKGRKRGL PLPLWQ KLKTLNLS EKGLS 
KftNTGQKLGLL/RPVSQWNPNGKFLRE 
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SEQID 
NO: of 
nucleotide 
seqnence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

lieginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


nude- 
location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucinc, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V^Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=€nknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IKSAF/PIEK*VI*KQNSLIPDMEEVLG 
V/WKTKKYSHNI PLGLSLTQKHVLTLF 


9882 


23783 


A 


9955 


181 


458 


RLWNDGLYNIFDLKLQ/S IHV* S LLLL F 
ILEPESHSVS\RLECSGAVTTCCSLNIP 
GWSDSPPQAFQVAGTTGVCPNAWVPSAF 
L 


9883 


23784 


A 


9956 


67 


430 


lAPRLIYSPIKFQWSFMSVRQIQVQSLA 
IGNFHNPRLPILICHCYAFHRILLFQMV 
RNFILYSISISANKQESKCILKYILLYI 
NLRNWLGTVAHAYN/PSTLGV*GGRIS * 
AHEFETSQGN 


9884 


23785 


A 


9957 


77 


422 


LPLALWASCRPQLCLDTDFIHGPPTSGA 
ASQFLCFQTMQPPTCSSSRLVNL*PALV 
LDHTTSKWCFRSP /SFFFLLPLPLPPA 
PLP\FFFFFFFDRVLLCHPGWSAVARFQ 
LTAAS 


9885 


23786 


A 


9958 


3 


422 


LHSSLVTKQDSISKKKfCKKKIFLGGTKK 
GGPSLGGFFFGPPGFPKIVKKKRGALIG 
FPFFCPKKKPKKPLFFPKPLGKPPRKKG 
F*IGEPWKT/EGPGFPKKLNFVFFSPWG 
GNFLFPPQKPGGKVGKILLQPKKKPPGK 


9886 


23787 


A 


9959 


370 


2 


PFSSWVANFREGDGEKCSVLQGIRFCMF 
CRYINVTPLVHFPFPISKGTSFFPLYFF 
SPEVM/HFP *LPY* FGWALLSRF* /MSL 
L*LFFSKFPFFPSFFFF*RQFHSSPRLK 
CNGAISAHCNLCLP 


9887 


23788 


A 


9960 


256 


2 


AWEEETAPLHSHCLGNQARPNNHRHAPX 
P*LIVFVFSVETGFHHVAQAVLKLLASS 
DPPTRFNLLSSMDYRGLSPQPMSRKSAQ 
EG 


9888 


23789 


A 


9961 


269 


3 


HRQACGQVRSPIQCWWACKIAQPSLQIV 
RSSLKG*I*SPYDPVIPLLVIYPRELKT 
\SLHTKTCT*RFTAALFIIARRWKQLKC 
PSIDE 


9889 


23790 


A 


9962 


268 


324 


SGTLSH/WIFS*VGIEITMWLKVPDQTK 
KISLRSAEAAIKYFLTQATASIIIRRAI 
LFNNRLS EQSS ITNTTNQYS SLI I IMAI 
AIKVGMAPFHF*VPEVTQGSPVTSGLLV 
L 


9890 


23791 


A 


9963 


341 


2 


ITFLPTKIIRQSNFKTGKRKTDSNDLNR 
HFKREDTQMANKYM * F I * ICMQ I \ KTTMR 
YYFISKRTTISRKTSNIQC*DTWSPGTF 
INCLLECKMVRLLWICMTWQFLTEVHIHL 
VY 


9891 


23792 


A 


9964 


250 


2 


YHPVNLYFKYISPIDV*FGCFIYIFDYP 
LCSCFSLHPLVYLQ*L/LFFFFEAESHC 
VARLE CSGWSAHCNLRLPGSSDSPAS 


9892 


23793 


A 


9965 


3 


334 


RTLRHHIWLIDPSFE*HLSCFHLLAIVN 
SAAINIPC/HSYV*THF*YS*MYRGRTA 
GSYAT/SMLNLLRNCSTLLHENCTF*PF 
YH/WE * VPI SPQSCQKLDFFI FLIIAI S 
EG 


9893 


23794 




9966 


179 


352 


NQE*TEILNRPMSNQIESVIKYLPTRKS 
HRPHRFTAELYQTY\ I LLKLFQKVEGEG 
LL 


9894 


23795 




9967 


2 


350 


THPSINSFIHVFIHASMNSLIHLPTHSF 
SIYLFTKSFIHLFIHFFSFLFFLRPSFT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P*=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoptian, Y'-Tyrosire, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PVAIYSFIHPFTYSSIH*FIYPCVYSCI 
DEFTHSFTHAFIQHLFIH*II/IFIYSP 
ISFLFF 


9895 


23796 


A 


9968 


228 


558 


CHQLRQELAIFTSFVILQLFSGHLDVYM 
QAWAQRPDKYEYDNK* FIEI KKI IQFTL 
ISKRMK/YVGINLTR*VmLH]3EIIYKTL 
MKEIEEDTSEWKDISCSWNGRINSVKT 


9896 


23797 


A 


9969 


196 


2 


DRKVFFRPLFI I FGLKKKIFPLFWFFPS 
/ IRGRG* I FFFFFFFFETESPSPRLECS 
ETISCHCTP 














TQCDMCVFIYTHIYISHICYMV^IYIHT 
RTHTGI LFTPKRE KNSNTCTCATKDNP / 
CMTLC * /HKRRQTHTIITHTHTHTHTHT 


9898 


23799 


A 


9971 


163 


380 


NHRRyKKNGU/HMGNK!l!KTVTKMINSNP 
PITHNNMIVNGLKTPIKRQKLSDSIKKQ 
YPSIYCL*KT/YLKYKD 


9899 


23800 


A 


9972 


328 


1 


AKCGFLKKLNIELFNSIPGHSCKT*TaM 
FAAALYIISKKYKQPSCPSNDE*iyiNEIW 

hiltigysavkkiswg\rwmnleniarl 
sgrqrghilqdsvdmkspe * ai pq 


9900 


23801 


A 


9973 


200 


3 


MASKCSSERKSHR/ S ITHNQKLEM I KLS 
EEGMSKAKIG*KLGFLCQ\VIHWTAKG 
KFLKEIKSATP 


9901 


23802 


A 


9974 


HI 


517 


KLEQPLIKRTIANKPIKKINTTILIITI 
NVNCSLNTLTK/RQRWSD* IFKWQDPTMI 
/C * KKN/HCKYKDTNTLKIKG*KNIYDA 
NMLTQI KKKAGVAIL I /DKIDFRA/TD I 
TRAKEGHFIMIKGSVHQEYVTILNVSAH 


9902 


23803 




9975 


342 


3 


FLATKKIFFFFPVFPPQGPPPVLVAPFF 
WAFGGGPPKWAPKKKFFGPRGPPFENFF 
F*GFFFFKTGGFFFLRFSKKVFFFFFFF 
SETESCSVA\RLECSGTISAYCNLCLPG 

SR 


9903 


23804 


A 


9976 


13 

^4 1 


326 


ILDHSNRPSSSTLKTPNAGKDVEQKQFL 
/LCCW*ECKIVNYLGR*L/WQFLTKLNL 
L*PYDPEILLLGIYQKELKTCIHTVACT 
*MFIEALFVIAKT*KQPRYSSVG 


9904 


23805 


A 


9977 




396 


LRPRTQPLPCSPAPLRQCHTQSLLPPFL 
LRLIFYFFCFLKTRAHSVAQAGAQ*RDH 
GSLQT*TPTLACTPASA/VARTTGTHHH 
AWLIVFFL/ lEKVSHSLVCDKT 


9905 


23806 


A 


9978 


287 


2 


QPHNS S YTQ I FTPKI I IKTPPPNLKQNH 
TTPIKKKTQ*FP/PSTNP/SYPSTHPPT 
HLFIYPLIHISPPPSPTHPRIHPPTYLS 
IHPSTHPSIHPPSR 


9906 


23807 


A 


9979 


97 


332 


GNMDLFLYFFFFLETGSHSVAQARVQWC 
DHSSL*PQPPGSSGPPISTC/RLGLQAC 
MCHHAWIjIFNFFCKGRLSLFGLLR 


9907 
"9908 


23808 


A 


9980 
9981 


168 


2 


STWIiRWDYWKTFFKTSKS/WPGTVALTY 
NPYTLGGQGRHIT*GQEFKTSLANTVKS 


23809 


A 


267 




LKRERR * RRAN/ S KTSRRKE 1 1 KI RRE I 
TATENRKP I E K/ SNKTNG *FFEKTDK/I 
DNPLERLRKKERD*ITKVRNGREDIL*N 
NDLIKVKKD 


9909 


23810 


A 


9982 


230 


412 


CHIYLC/D*PIDGIV*CHIYIYFGHKH'r" 
HTYTHTHTHT1ITHPTC*VSKILLKKKRR 
TCSVCA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Prcdict- 

otide 
location 
correspon 
ding to 

residue of 

peptide 

sequence- 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£=Glutan]ic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l>^Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y-^Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9910 


23811 


A 


9983 


229 


415 


SYVFCHIRFCPLDSLNKEIVA/SQVQWC 
VSWSTT*VaEAGGSI.EPRSLRPDWASQ 
RDPISKN 


9911 


23812 


A 


9984 


2 


288 


RGYDSTSSAYTVGHFT*EDKATITRLSG 
KENVKHaGGETLRRLLDVYP*TQRIIDS 
VGNIiNSA\SAMM\GPPRLPKVIiGLQAWT 
■rMPGP*KYIHIFF 


9912 


23813 


A 


9985 


246 


3 


FLLDYQYCaLCCHCFLENFANaSHENIL 
LGGVKHTKFPKYS+KFSVL/WLDAVftHS 
CNSSTLSGLGWITRGQEFETSIiANMVK 


9913 


23814 


A 


9986 


342 


1 


FLNFGLTGFFPKGPPKPLGLRDGALPLD 
PPQFLGVFKRGAFFIiGTQ*FHF\LEGFK 
FWFWEPKAFFPFF/CCETISAHCNLCLP 
GSSDSAASASWIAGITGAHHYARLSRRL 
RQQN 


9914 


23815 


A 


9987 


330 


1 


NRPLHNLVMASKCSSERKGYTSFVLSRK 
/LRKMI KLSEESMLTVTTG* KLGLLCQT 
/VQTOTKEKFLMEI KSATPGNKGTVTKS 
NTLTADREKV*VIWIKNKTNHKIPLSQ 


9915 


23816 


A 


9988 


209 


2 


IKGSHFPVLI*YRKIIIIFGGVKGKGGF 
YS Y\ LFFFFEKGYHSVTQLKCSGMILAP 
CNLCLPGS*DSPTS 














QHRWMNTEMIiNYLC * 1 1 YVHF/ HTHTHT 
HTHTHTHTHTHTHT 


9917 




A 








IQ IRVFENSHLWPSYVHFPHGHSYLFCS 
L* *AFFRH\G*FVLSVHFLKIFLNLKKI 
FFETG\SHSVAQACSGMISAHCNLCLLG 
SSDR 


9918 


23819 


A 


9991 


3 


368 


SLDPRRSRLHLAKIVPPRSSLGDHARPC 
LLKKLELFFETGFCCVAQPGVRWCSHGP 
LQPRIPG\SSNPLTSAT*VAG\IAGMCY 

HTQLQLAF 


9919 


23820 


A 


9992 


294 


391 


SGTVAHACNSSTLGGRGGWIT*GQ/ESQ 

TSLTNM 


9920 












TDKELLLRDEQRKCPVEMEPSPGKHAMS 
NVDMTKDLEYSINLVDKAAVELERADSN 
FERSSTLGKMLSNSISCYKEIFLERKSQ 
LM*QIPLLLM/FKKLPQPP*PSA 


9921 


23822 


A 


9994 


193 


395 


IFWITCCSSLSVLAASPCTFMLWRQLLS 
LNLMNQ*DHSLSQSLIQSKSATLFNSVT 
AERGDESAEEKLEASRSWFMRFKERSCL 
HNI KVQGEAARTDR/EAAAS YPENLR/V 
KDEGGYTK* RI FWDE lAFCWKTMPSKT 
FIAREKSIPGFKVSKDRMTARCGRSCL 


9922 












SFLWKLCPRGAAAYVRCVSA\LLEDVSQ 
LGYTGVRDPIEEAVSPFSELKRCRERTT 
AVFTAVRQGHLSLQK\FLLPFVQLCPAH 
RGGV*RQ*ALLCCGGLSPVQAPLPLCLP 
TQ 


9923 


23824 


A 


9996 


376 


1 


CGRTDAPASTSPTAMTVRFLRPPQPCVL 
DGLHNCPARAPQEQNSLGEVDKRGPREQ 
TR\PATAAPPRPLGSPSWICPGDW**CR 
NSLQGFSLAASVARGRDWALAGLPATVS 
ARFQEQQKMNTV 


9924 


23825 




9997 


495 


982 


VQKFLRPNLAPKKHQRKLAPNSLQGRLR 
SLPSPTVWCTMAPPTGVLSSLLLLVTIA 
VCLWMHSWQKNHW * AS FKKTHD * TGLA 
EP\SCARKQCSEGRTYSNAVISPNLETT 
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SEQID 
NO: of 

sequence 


SEQID 
NO: of 

sequence 


M 
eth 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

location 

ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, 0=Aspartic Acid, 

G=GIycine, H=Histidine, I=IsoIeucine, 

N=Asparagine, P=Proline, 
Q=Gluta™ine, R=Arginine, S=Serine, 
T=Threonine, V=VaIinei 
W=Tryptophan, Y-Tyrosine, 

X=XJnknown, *~Stop codon, /=^possibIe 
nucleotide deletion, Impossible 
nucleotide insertion 














/WFEGRCYLVS CPHKENCE PKKMG 














IASKCSTERV/SRSHTLKQKLERFKL*E 
K»1/SKKrEIGQK*AIiLYQ/VSQVWSIAKE 
KFLKEIKVD\LHaQMIRK*NSLTADVKK 
VLWKIDQTL/PLGHSVIQSKftLTLFSS 
MKAQ*GEEAA*EKS 


9926 


23827 


A 


9999 


30 


393 


DTTILKLRQLIIGAVATKHWSERKSLTS 
LNLNQKLGTIKLTEEGMLKAKKGQKRGL 
SCQTVGQVMN/SKEKFLKEIKHAIPGNT 
*MISK*NSL/IIADMEKVLVG/WSKVIF 
*DQTSHNIP*SQN 


9927 


23828 


A 


10000 


133 


1420 


EQRRLPVTPGVASLQTLCCFLPRRGCSH 
G*ESAATHPPGPELVLLLLQGH*AGYLG 
/DFG/PRTATGSPG\PAGVQPGPCLVPQ 
LSRAAAAG*PDPT**GAIFSYHCKALRA 
TAaHHCPPGC*WPG\PAGSPPGTSRPLS 
HPSSPSPPWTGVSSFLLNLVI.CP*ASDA 
QGHACYPGPVPALAGQGHDSAGGRLGDP 
GHHWQQAALHG/DQGPPGHQAHAGTQPS , 
AQGLGC*ATHPGEGG*VGYQGLPAG*GR ' 
SHPRFPIVTRPMGRSCPYATVRPGPGSQ 
GQTPPGLV\PRGEAGQ1IPGGAAPGLSCI 
QEVTGRWWW*GKKPLGHrPPCGAVRCPS 
S*PHPDTMFKGTGREGCLPPTSPVGVTG 
QMS \EGSRPCEWTLTMSPWGE *SPRAS C 
AMLHFCPGSRVGGYHGCPPLHHMGPQST 
AGQAGQPHTIiDIKAS 


9928 


23829 


A 


10001 


1818 


6682 


IKFPEAEEEEIQEVLVLLFRVAEEKENP 
SRVGNSQREYENQWLYSICNQLRyRHN 
L/RHVKKDERGYYEELIiNYSRDHLMLYP 
YRLLDIMVKGFSITPFSYYTGIMENIMN 
SGHNFTAADCLRLLGIGRNQYIDLMIQC 
RSSKKFFRRKTAGDLLPIKPVEIAIEAW 
WWQAGYITEDDIKICTWPEKCATDKTV 


























DSGPQLSGSLDYNWHSLYKKGFIYUDV 
PMSDDSCIAVAPLEGFVMNRVL/NFDYF 
ETLLYKI FVSVDEHTNVAELANVLE IDL 
SLVKNAVSMYCRLGFAHKKGQVINLDQL 
HSSWKNVPSINRLKSTLDP*KMLLSWGG 














GESRRPVQEASSATDTDTNSQEDPADTA 
SVRSLSLSAGHTKHIAFLFDSTLTAFLM 
MGNLSPVQSTGEGEAQRYFDHALTLRNT 
ILFLRHNKDLVAQTAQPDQPNYGFPLDL 
LRCESLLGLDPATCSRVLNKNYTLLVSM 
APLTNEIRPVSSCTPQHIGPAIPEVSSV 
WFKLYIYHVTGQGPPSLLLSKGTRLRKL 

VLTMLNDALTHSAVL I QGHGLHG I GETV 
HVPFPFDETELQEDSC\NMGVHKALQIL 
RNRVDLQHLCGYVTMLNASSQLflNRKLS 
DASDERGKPDLASGSDV\NGS TES FEMV 
lEEATIDSATKQTSGATTEADWVPLELC 
FGIPLFSSELNRKVYRKIATHGLV * EKR 
AFKMLLHSSRKLSLQVLNFVHSFQEGAS 
TLDIHTEPSFSSLLSQSSFADMGVPLPA 
KKFN/SLKIVSY 


9929 


23830 


A 


10002 


439 


1135 


MAVDFYNFVTKLWTTGYLRISFLAYKF 



1150 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W-Tryptoplian, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FSFPFLDSI.S/LLSPRLECSGVIPASLQ 
PLLPGFKRFL/CLSLPSS*GYRRAP.PPC 
PALLYF*VETGFHHVGQftGLELLTS<3KL 
TRLGLPKCWDYR 


9930 


23831 




10003 


216 


572 


LTLVS I FTAKQS VTLPI I CMLEMRKWLL 
NDTSAF/PII*KVLFPKIFFFFFF*KGV 
LFCP PGGREGAQi-GLTKPSPSCSLKH I FL 
PNPPGGWKQRRVPPFAGYFCFFIKREVS 
LLWRGLPN 


9931 


23832 




10004 


3 


354 


YSNNPKDRKGETGINEKTNNKMVDLSSY 
TSVITtNF*ML*/D/KIN*KFVKSP*KK 
PTLCCLLETYFKYGIGGLKVKRWKKI *H 
ANTIGKKTLVAILI TKQTSELS * YPNKE 
KHYILIKE 


.9932 


23833 


A 


10005 


187 


455 


ASIHYSLGVRICIYIPGCIHPGEKSQKC 
MTCGKNSTQKPELVN/HR*IiKGHRCMK* 
NEGGKTFTRVQSL/ACHKRVYTERKSYK 

C 


9933 


23834 


A 


10006 


134 


456 


IWNIiPLGKPCAWKQVTSDIiISPDAIFIN 
AAIHSLSFFPFPFETKELFVPQVGGPWH 
NLN*LKLLPPKL\SDFLVA*VSKKVGIT 
GPPHHPGLVIWGFKKKGGSPMLPR 






A 








CYYTIQTTEQPAKDSMLGIKPHVS ILTL 
NVmStMITPPKRLGVADWI K/TQDPAF CC 
LQETSLTCNRTHRFKVKG*/WKKIYH 


9935 


23836 


A 


10008 


169 


462 


YPCWVDMAKNTSYNKCWQGCGEIRILPH 
CRMEYKMaSPLWK/SQFLNK*NMELP*D 
LPIPLLDIDPKEWKTGVQTKTCR/RMFI 
AALFPTSRS 


9936 


23837 


A 


10009 


490 


724 


EMPP*FMEKNFLAPGG\VTPPGVNTTRV 
NPKTAFDPKRSSSPFLEKSQKPGLIEGL 
THLKTLir,NPT,KKDESGEIPVLF 


9937 


23838 


A 


10010 


306 


588 


KQF IR AFT EMSWWLTPVI PVLWDTKADG 
SGIRDQPSQHGETPSLLKIEKLAGHGGA 
GL* /SQLLERLRQENHLNPGGGGCSEPR 
SCYCIAAWVTE 














I SLGQDSTD 1ST INI PRMAFKLSSKRKS 
HMSFSLNQK/L/EMIRLS/EEAC*KPRS 
QKLGLLSQS*PSCESKFLKETKSATPVN 
TQMI * K*NSLITNK\LKEVIiMVWIKDQI 
KHTIS LRQSVI *CKA]:iPLFNSMKaERGE 
ESAKC 


9939 


23840 


A 








FWLSLIKKPLSSVTQKKINRET\ILLTT 
KKKKKDCNFLEGGLLIGGSVCRLVYRHY 
ATLYSVFCVDSSKSERGSVDPTQVFVET 

*DKCFDY 


9940 


23841 


A 


10013 


203 


453 


keelyisrerksstslllnqsqk*likl 
seqampnaktigdrlkarplspnshlikt 
kqkflkev/kcatp 


9941 


23842 


A 


10014 


237 


483 


TLIRYIILQGLFSYIPPFFFETESRSVA 
QAGVQW*DLGSLVPGSRHSPSSASQVAG 
TTG FHY/HAWL I FCLFCATGS 


9942 


23843 


A 


10015 


157 


875 


DDPVRGRGEESAMPSGGRRNRSVSSSWV 
GSMAGITTIEAVKRKIQVLQHQADDAEE 
*AEHLQ* EAEGKRWAWEQAEAEVASVHG 
RIQLVEEELDCAQECLATALQKLEEAGK 
AADESERDTKVIE IWAIiKD/E /EKMELQ 
EIQLKEAKHIADEADGKYEEVACKLVI I 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Phenylalanine, 
G=Glycine, H=Hlstidine, I=Isoleucirte, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopiian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EGDMGCTEEEAELAESRC*EMDEQIRLM 
DQNLKCLSAAEEKYSQKEA*CEKERKIL 
TDNLMKSPMR 


9943 


23844 


A 


10016 


579 


30 


CGISTKKNKWMIKQNSSELSTCVHNQIT 
LHKIAMATQWG\KNSFFNK*C/W/EN*V 
ST/CKKIKLSPSVTCTI*KQYLKWIKYL 
NIKI*QTS*IS\NGGKYDIGLGTNFLLQ 
KTKNLMSFS*L*NQMHKEQI lEKWDYIK 
LQNFCTSKEKFSGVKMSPKKW/DE I FEN 
HI FURS * FSQYINNV*NSTKSE 


9944 


23845 


A 


10017 


44 


228 


EFVCRVRGLWW \ CWLASQLLWEAEAGG 
SLEPKSLELQ*AMIM\DCTPPW 


9945 


23846 


A 


10018 


471 


40 


PGEGRFQQTRFPPFLPPWGPKGGPFFKK 
PKKKKRKEMIKLS* KPRQANQ*LGLLCQ 

/ TS Q WNAKEKFLKE I KNAT PVNTQEVR 

KLNSLIADIETA.'LWGTEDOTSHNIPFG 
QSPIQINILAVFHSMKAEQSKEATEEKF 
E/-IRS 


9946 


23847 


A 


10019 


217 


486 


KK I S * P VYLSLYLS I YLSVCLSVYYLS 1 
YLSIYLSIYLCIYLSTIIYYYLSIYLPV 
YLSTNLI.SI/CLSPIYLLTYHLSSYLLS 
YLPTYLP 


9947 


23848 


A 


10020 


"32y 


ni5 


TENQQGVLftLAALSKIRMMPTRLKWFLN 
IDYTKFCEAFHVFKKRKKKKINKNIKQL 
KMPLTKRIYLAFTEYSTQ*QQNTH\FFL 
SADETHGKIE/HVLGPKTRHNKFKR 


9948 


23849 


A 


10021 


329 


490 


GESPTDNHCRSPKSQVEKVIYTFFETES 
RSVTPAGV*WCOTjGSLQPPPP\GTSD 


9949 


23850 


A 


10022 


3 


394 


YRVLEVLGYMLNIRFWLGCSSCBQAAQ 
FREGSRCTTACRHPAHTLHaAKMSRRKV 
S3AKCR*RETPKKRSARWSAKPAP\QSE 
TKPKKSAGKDKSSTSSXFSSSSSLKaEK 
GGKEKQAEVADQETKDLSAE 


9950 


23851 


A 


10023 


448 


477 


SQVFETSLAS*MSKIIIL/WAGTGflHTCN 
PSTLRGQGGQIT*SQVFETSIiRS 


9951 


23852 


A 


10024 


198 


455 ■ 


SLKTNHREIPEDFHGQPCGKLVHSEKPR 
VRMGQTESKYASYLSFIKVILRRGGVRA 
S/TRNLIMLFQTIEQFCLWFPE*GTLDL 
KDW 


9952 


23853 


A 


10025 


469 


26 


PPRAVQPPFLPSSSLHTCVPLCRTVFSA 
TSRPDSLLPALQVSAKMSPVL*SSLLLP 
SLGQGFPMASIAPRL/PHRSLRSCWAA 
AHSBAELDRHEAFLQCSLGQWPRNDFLF 
SFFRDRVLLCHPGWSTAA*S*LTAASNS 
GAQVMEFQ 


9953 


23854 


A 


10026 


222 


489 


KKKKKKKKFCFIARVKSNGAISAPCNLC 
LSGSNYFPASTSLVSGNTGACPLARVWF 
FY*NL\FLVKMGFHHIGQESI 


9954 


23855 


A 


10027 


313 


2 


QDWEISKLYSYCHS*VRRL/IFLKIRLS 
LYYILQHSLNCNI IG*LQFES CYIKLAG 
DKIAFRFLTLKTNHQLGLVAHAYNSSTL 
GGRDGWIT*GQEFKPSIiaNM 


9955 


23856 


A 


10028 


49 


388 


TPKIRGVFPFKGWFLGVFGAFPFPQKDI 
FYFPRTPMGTRLVS PGKKKKKEV*RVLY 
GLKKAKKI FVLKVKFSHKRGFFGE I QFK 
KLFKW/HLKRNPPPKKKKR 


9956 


23857 


A 


10029 


394 


231 


GRQIASAQEFKTSWGNMAKLCLFI*RDR 
VSPCCPSWS*IPELKQSACLSLPKY*DY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

™pond 
ing to first 

residue of 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K— Lysine, L=Leucine, M— Methionine, 
N=Asparagine, P=PrDline, 
Q=GIutatnine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=-Tryptoplian, Y==Tyrosine, 
X==Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














RC/RWTTPGL*HLLFRCT,LIII,PSYLFF 
CKSFKLLYMDI 


9957 


23838 


A 


10030 


265 


2 


HDNMRGSDYFPMNGLAVPFKRQVFTIA 
SNFLEKSPPRIiAFT*LQVD*ERKWAG\P 
VfiHTCNPSTLGGQGGWITRGQEPKTSLA 
NMPKP 


9958 












krgklfffteitfkrgkkaggggsflkp 
hlfgrpgggnhrvril*krgfykmgrw 
lkslpcdspplasqkggvsrksppprpf 
cpf*rlfp*kkkafrlfipkk:v**ai.v/ 
slpfigegktlffff*drallcrpgwna 
vapsglklssclsllrs*dykhvppcla 

MF*KFF/CRN 


9959 


23860 


A 


10032 


167 


424 


GREFDLRMStiMVKPPLgFFFFPFSETMS 
LSVTQaGVQW*DITALSASWTPVISDSP 
VSASRVAG\IGTTGVRHHTQLI FL 














VEGCLSSGVQDQPEQHSETESLQKKIFF 
\*IiaGHGSMCL*SQLLGRLRWEDHSSPG 
G*SCSEP*SCHCSPAWATE*DPISKKKK 


9961 


23862 


A 








CPTCPFVWI PLCSEQPVQLYMWHTHTH 
THTHTHTHTHTHA/HLLFSLSFSFLRQG 
L3LSPTLWRSGMI/CGSTQSQSPRLKQF 
CSPRYSRG*GWRMA*/DPGGRGCSE 


9962 


23863 


A 


10035 


200 


448 


LYLHLVQLSPLDFETPLFRVECPVSLSL 

clsfs*fe*vcffflvslglf*grvllc 
rpgwsawqsqltad/sdfpvlkhsscl 


9963 


23864 








463 


MILGISKISFSLAITLANFPPTLNLFIF 
LDTGSCSVAQADVQWHMQGSL* /SELLG 
SSDPPTSAS 


9964 












glssqltdghllallrvggsevgsrees 
ereeagreregerrdsrgg*rererees 
gareerekkekrer/erdreek 


9965 


23866 


A 


10038 


1 


490 


pnqiqnavlimisfsicfvsviffksei 
yngqpkfksffff*descsvaqagvq/w 
cg/lqsqqprppg\ssnp\ptsasgvag 

ST 


9966 


23867 


A 


10039 


255 


446 


FFLRQSCYVAQARVQ/YAIHRCSHSTLQ 
LPALASSNPSALLAPRVAGTAGMQHHNQ 
L*YFFRIHS 


9967 


23 868 


A 


10040 


157 


390 


DHTCPPPSTADYRLS*FLYFLLLLFRRS 
LALPPMLECSGAISAHCTLCVQETRETD 
AAHFKAARTIRAPHPAQLTIAFQVQTHV 
YQDGH/DGSWWCDLRTHPPRS 


9968 


23869 


A 


10041 


290 


30 


tlkkrkefssvlcprgyivllllvhqfl 
ffqtgscsvtqagmq/w/cdqsslqp*t 
pgXshnppvsasqvagptgmhhgflhte 


9969 


23870 


A 


10042 


190 


929 


^MRYISYLTRLVKIKNFflHKSWLESLSP 
TLKHS * /W * CK*V* SLWKTVWHYLVWL\ 

kddltfdpaipflmiyliemcaqvhpe I 

YARMFIAVLFEIFKHQ\NNPNKLWYIHT 
RESYKTIKIKE 


9970 


23871 


A 


10043 


457 


724 


hfgrprqvdclssgvqdqpgqhgevqsv 
lkl\kklaghggahl * sqllwrlrhenh 
lnlggrgcseprwchftpawtteqdsv* 

NHNNNN 


9971 


23872 


A 


10044 




429 


kpdedttgrenyrpvfftntdanls KI K 

CMYPCISVH/HVHMYGKIPEMMFT* Y* * 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequete 


Predict- 
ed end 
nucle- • 

location 
corrcspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysinc, Lr=Leucine, M=Metliionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 














*LFLSFFFYEMESGSVPRLECSGVISAH 
CtJLSLPASSNPPPGFPR 


9972 


23873 


A 


10045 


30 


220 


LGVLYKKNS VFNKC YWDN* I S I WKRM \G 
LDP YLTPHTKINFKWI KDLNI I TKTL I C 
CSWMKTKA 


9973 


23874 


A 


10046 


203 


361 


TYKLLVIFIVIDHF/LFSFLSFFFRDRV 
SFLLPRLECSGAISTHRNL*liPGSSD 


9974 


23875 


A 


10047 


198 


358 


IQRKCFTFFFFFEV*SCFVAQAEVQYCG 
SPG\SSNLPFSASEVSKTIGWILPS 


9975 


23876 


A 


10048 


270 


272 


RKNQRFFKIARKRLNKMTRISPLISIII 
LlSIVSGIJSrFPLKRYRLAEWTKKKK\DPII 
CCL/QKTHFAG/RDIYRLKIKGWKKIFH 
TNGSQ*QRSEFF 


9976 


23877 


A 


10050 


256 


1 


GVTTNLYFWKHFLNPPFLVKKCENCP IN 
KVFP*KPPLLKKTLFFSKKKFCGV/HPT 
EKFFFFFFESLALSPRLQCNGTILaHCnsr 
LR 


9977 - 


23878 


A 


10051 


112 


359 


S KVSEEPTENEVEHNLKV* S FI LPliKQY 
SM* *QKKNIHIH\ I SFYKK*ELTWPGMV 
AHACNPGRLGTEAGRSPEGQEFETSLA 


"9978 


23879 


A 


10052 




2 


RVSSYTLITLNLIIRLlSnMLlitRYRRLNGF 
FFLKKP* PNS ICCI QKTNLI YKSTYSLK 
VKGWKHIHXHailGLKKQAEVAI \LFISD 
KKDFKSKIVKRDKEGHYVMTKGSIQQ*D 
RTIL 


9979 


23880 


A 


10053 


376 


6 


TRPSQHPDFILLNKCLLFEATPFLVICY 
SGHRKLRPVAPPIRGSKSQIIPMPPAVL 
PRWHLSCLS FR I P S S FF*LRLPRLCKSR 
/SAWGPVSPHLPQFLFFQAISFFFFL*D 
GILLCHPGWSAVA 


9980 


23881 


A 


10054 


102 


347 


FLSFFVDGKLCLEAKIWVLAPAVFIVTR 
VLLIiliSTL/RQVELRNIYLCSNVHIHFH 
LYFCIYLSAYILQTMSLY*HL*YHSNV 


9981 


23882 


A 


10055 


251 


51 


ISLVNWLFILENMKLDPYLSPYTKINIR 
/WQ*DL]!IVKNGTTKVLEEN*GTIYVMVG 
WKRPILACQQPKS 


9982 


23883 


A 


10056 


169 


342 


SQKQYSTCQNVFCFVLFLETRSHLV\TT 
LECSGAIMAYCRLDLPG*RHPPTSAS*V 


9983 


23884 


A 


10057 


1 


586 


AAARPAQGKARPGLLLRRGRWELRAGF 
LTAKGAFRWWLSQiOEIVKMSYPPQSRCGC 
GGMGAAGPPSLTVHQELCGAYSPDGTTE 
ALPLASWPRARPS PAKAPAYDTAKLPAL 
IGCGSRRPPGVNPGASSLKPGACVSEGA 
GPTGTLESAGSRPPTPLPPPV/CCPGPP 
ARADH*FCHVTPQCR*/PPPRPSP 


9984 


23885 


- 


10058 


393 


43 


HARPGAECERPAEGEAARADAFSGRHAC 
PHS LYCRL* RGFFVHKLHLNKA/RLIKH 
RKADNIKCMQRCGAPGTLMHC*WDYETM 
GLLEQTVFS FE IRSHCARQAGVQRNNHG 
SLQEF 


9985 


23886 




10059 


228 


466 


GLLHGPPIYDLPEPVSTTPSLYHPPIVP 
*AS PLGP SAILTLPFHSPP PA\ PLFSP P 
HSVLGSSGPRSLPT/SPTHLPTLTPH 


9986 


23887 


A 


10060 


293 


9 


DILQFYTFSPIQ\CPVYILHL*YISLQR 
L*FHQKYLIGIYFIE TWSCS VARLECSG 
RITPRCNLNFPGPTYPPTSASGVAGTTG 
VCDFLGLPWSR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 


Predict- 

location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metiiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9987 


23888 


A 


10061 


199 


428 


TAYCRTLPLKKSNPTISGSFFFFLETGP 
RSVPG*RAGG\DLMAHCSFDLPGSRDSP 
ASPRQWGATGPCHHPt«.IFE 


9988 


23889 


A 


10062 


191 


2281 


lqtrgklfkykekgictklhrseiiigk 
qwkeavcpptgqaqsqggldpksrealr 
gghgfvpgggvcghqel*lhsdglkavh 
pagsepsdpggvcjaaacqnaatggtplq 
pmpetargc/tatassreagpgmdp/re 
a/gpeawqlgkftrsswggsstsggpawv 
firdtpswwgipgtvsppagprcppptr 
pqweqlgsgrpg*mqttaeetnaaprag 
qeetp srgp sadpgvs qhvlshrrscpk 
ceersvppngXagptlwtelhpsdasvs 
altpl*gdvevbpy6\cegyikp6rrth 

QHKWLDLDLHPAASRTCRKMLE]1^>RV*F 
LLRLPELAETYSSTPPACAQRPRGTGGG 
/PPQRCVLMSGSHQGGAS*KGRDAPSLG 
SGLERPEEGGPGWLS/RQPSPHAVQPCK 
HMLCLGPAF*RGRGARGVPVPVRASHFP 
FHGAAWWPGDHMSQP*WQMLSRPARGHR 
PSWQVRRQCTVLGALKPGLPDLKCPGPT 
AFLPG *GVS /EENGQIPNVSTQLYLQNL 
PLPREQN*RRFPRDLSSGGAGPGTG*CG 
GGGGHALSPTPSRRIiRAHSKDNSWAQPL 
CCWMPAASGC\PGIGCNGVPPVAaS*Qa 
AA*TPPWSEGSLPAG\*QPSAHRCVTRV 
PGGRI PHLLAQAVPPDDRAFPNCQASGQ 
LLGVHSRPSLSAAGCRQPLAVSGIGCNG 
VPPVAAS 














I SREFLFLETKSLY IMLKGS IQQEDI IF 
VMI YTLNTGAPRY / 1 K* ILLELKRQIES 
DTVIPEDDHTPLSALNRPSRQK/VDLIC 
TIEQMDLIDIYRTFHPTASEYLFFSAHG 
SFSKKYYMLGHKICLETFLN 


9990 


23891 


A 


10064 


100 


466 


LGILPLQSRPSGPGGQCTPTLPFMVSLP 
LPVRGCFLHLCWCLPYSGALQSPCHTQL 
VCWHIPCF*CCGVSP*RSSCLEVHLYTH 
THTHTHTHTHTRFVLM*MRTPLRKGKRQ 
VMRLCGGGGRA 


9991 


23892 


A 


10065 


226 


377 


GLAGSPCISGSVSFCSGCIYXRGHGIMF 
ICNDCKVFRFCKSK*1ICFKEHNP 


9992 


23893 


A 


10066 


292 


458 


KVSSICLKMSFLFLFFFRTGSHCH*KVS 

SICLKMSFLFLFFFRTGSH/SSPRLECS 
GMI IAHCSLKLLD3SDLTASAS *VAG 


9993 


23894 


A 


10067 


252 


36 


TGFNPLHF IYYYYYYYY\FESKSCS HAQ 
AGVQCCDHGSLKPQP\PSSSHPPTSAP* 
VAGSRNAHQANSTAWPMVS 


9994 


23895 


A 


10068 


56 


481 


GILKNCHNLMIKLERMRSCVLMGEQRK/ 
WIY*DESTPGEDTYMTVKITINDLEYFI 
NLVEKTVA/GVETVDSNFERGPTVGKM\ 
I VCYRE I FHEKKS PSMQQL\YF*KFPQP 
LQPSAATTLVS *QPS \SKQDPLSARRLQ 
FTAGSD 


9995 


23896 


A 


10069 


263 


435 


HFSFLSFFF*KTESHSVT/RVEGRGGIS 
VDCKLCRPGSEDSFASASRVPGTKGISQ 
GRG 


9996 


23897 


A 


10070 


116 


474 


RGAGAWGVSGRSSPPLSALPIQPPPHLS 
TPPPLETP/PPVLH^PLKDLGGWGALLA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQH) 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


ed end 

otide 
lociition 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienylaIanine, 
G=GIycine, H=Histidine, I=lsoIeucine, 
K=Lysine, li=Leucine, M=]VIetliioDine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, V=possible 
n ucleotide insertion 














Q * EL PG * HS PKKKKKKKKaPRPPYK*QP 
PKTPVDPPDGPFLDPPLFGWPPPP\VFL 
PGERPPPL 


9997 


23898 


A 


10071 


159 


377 


KSHMPLTIiNQKLEMIKLSEE/GLMSTAE 
IGQKL/GLLLPNSQWNAKEKFLKEMKS 
TSPG1JT*TI IKWPGAVADA 


9998 


23899 


A 


10072 


ISO 


473 


AIEIQLLSRPLCLVJPSGRFPDAQVAWIR 

V W V AV* GRPr ,R * V3 p A K PGKQR PS PWGF 
PS I S / i-JDI S PTKS S SASPPGLSADCKFT 
CIIYRCRALVGLDCCG 


9999 


23900 


A 


10073 


98 


440 


gqlnklsgpypenvgytlpvlinplaqp 
viystifagtlitalssh*fft*vglei 
nmlafipvltkkVinprsteaaikyflt 
qatas i illiailfnnilsgqkkkkkgr 

PF 


10000 


23901 


A 


10074 


358 


404 


PCLYTYHLHNSNST*SLRLVSIMIEIVT 
SNLHRTVPHSYTPPTQLSINLVMAIHLS 
AGAWIISYPSMIKNAriVHFLPQaSPTPL 
IPILVIIETISLLIQ\PRPLAVRLTGNI 
TAGHLLMHLIGSATLAISTINLASTLII 
FTILILLT 


10001 


23902 


A 


10076 


3 


419 


KWRKKMWCLQKMEYYSS*K3KKU:,SHVT 
IWMDPEEVMLS/EIS/SVTE6QILFNST 
YV 


10002 


23903 


A 


10077 


335 


414 


DKSPF/D*RFFLPPNSNGFLVKMPPSVK 
RLPFFFIFPFSFFFFETESRSVT\RLEC 
SGTGLftHCNLRLPGSRDS 


10003 


23904 


A 


10078 


279 


1 


KDTQLPFIQFFFETGSCSVPEAGVQWCD 
HSSLQPPPP\GSSGSSPLSL*VAGTIGM 
LIFKIFSHNPLNFFFFLRWSLALSPDWS 
AVARSWFT 


10004 


23905 


A 


10079 


397 


3 

[' 


KSSSLFQNHLWANSSTYSHLIPTLGYIL 
NHGNQFDPDTL/RDKM*NFFCTIVCPHY 
ELPSLEQW/VSLGSLNYDTILPLDLFCK 
RQGKWSEIPYAQDFMTLYQNLTICQT/P 
QNPPPPPKESSKYELDIIDDPPH 


10005 


23906 


A 


10080 


281 




IFLGEWGPFFPPQKKSFPPKIPQWVFFT 
PPYRKKIFFFLPR*NWA/HPKIFFKRPP 
PFFFFFFFFFFFGFKENCFILLPPKEHI 
TQAPAWGFAG 


10006 


23907 


A 


10081 


207 


35 


QENRRC * RGRG E S / GTL IHGWRE CQT VQ 
PLWKTVWQLLKRLNTEFPYDLAILLLGE 
FH 


10007 


23908 


A 


10082 


2 


408 


laplhsslgdrarlrlkkkkkkgkirpl 
lalffknqnpsfkknshsfygalmepVp 
pphippfffwkvqpplfpfrgtlefpgp 
igtcklqtlgspg*wdppcplphqspgr 
gglipgrkgedppsfrglppphf 


10008 


23909 


A 


10083 


195 


2 


ECKQRPQLEMVSFRRRSHEPFFFETGPR 
SVAQAGMQWH/DLSSLKPPAPG\3GDLP 
TSAS*VAGTI 


10009 


23910 


A 


10084 


219 




RLKIKLLKIIITTTIDSGICSFKSCKL* 
H/LKIQNMWLDKLAHaYNPSTSGGRGRQ 
IT*GQEFKTSLGNMVKP 


10010 


23911 


A 


10085 


364 


413 


KKKGGE KGPLLKKER * KTNGQFLVHTNF 
/RLPGLKYFF/CPPPPSKWGLRAPPPKP 
GDFFFFFFFFLVFLVETGFHHVGQAGLE 
LLTS* PIPLGFPKCWDYRRE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVl=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=po5sible 
nucleotide insertion 


location 
correspond 
ing to first 

rlsidue^of* 

sequence 


10011 


23912 


A 


10086 


200 


3 


INFIT.IJ.FI EVLI.IFETGPCS lAQATVQ 
WRDHQTLQPQPLGLKQAILP/ASAP*VP 
RITDVHHHARLN 


10012 


23913 


A 


10087 


154 


329 


FSLSDFKTYYKflAQHRQSCL*yQHFGKP 
\RRTDPLRPAVQDQPDQRGETPSLLKKK 
KKK 


10013 


23914 


A 


10088 


157 


390 


NSSSFFPPSCQAQ/KSEVKKSSGPHSSS 
S SG *RGE /CLPLLVSGGSWHFLAYGSLT 
PVSACVFT*PFSMSLCVYIQISVS 


10014 


23915 


A 


10089 


305 


• 


KKKRAQSLPGQKGGRGFSQPVTAGDRS F 
RPKIRKEALGLGQLGLTDLCRTFPPTAT 
G*TFFSSAPFPG\YPPV*PTKKKNTNIiN 
*FTRIKirSRIFSGHEK 














RGFFQPLGFWDPGPISNLGGPPKNLLLK 
AEIMGYNFCVPPFPPGGLPTTHFFFPPR 
NPPF/CFPFTPSLGFFSPNPPGFFFFFF 
FFLSRVSLCHIiGWSAVA*SWLSAfiS 


10016 


23917 


A 


10091 


84 


408 


EKKGENKLFWGFLLLGTLFIGGYRFKI I 
FCT,VGYFIFIYI.FGEME/FSI.T.I.PGL*C 
NGTCSVHHNLPLPGASQSPASTSGEGQI 
TGMRRHPRLY* lYLGRWSFALVARAVMQ 
WHVLGSPQPSPAWCKHFSCLNLRRGWYY 
RHAPAPPSLLNIQKI ILGFWLAPVY 














FQINSSVTRRSFVIjAAQAGVQWRDLGS p 
LLPPP \GSSDSPASAS * VAGI TGLHVGL 
SHK 


10018 


23919 


A 


10093 


298 


386 


KLCEK** ITPIRMAI I * KKREQENS \KC 
W * G CGE 1 GTLVYCL * E YKMVQ P LWK I VW 
QFLKILGIEL 


10019 


23920 


A 


10094 


310 


1 


I CMNEMI LYKQDFGRT III RKI VI YFW I 
FN*FGH* IRIY*NLLRYKTCML*RG** I 
NLPQRSFKNLRT .KT * *W/WPG'rVflHAYN 
PNTSGGRGGRIT*DREFETS 


10020 


23921 


A 


10095 


40 


407 


EHTHTTACEPGGI * IRPVD*[WLISWL* 
HGTITCIOWTLGETGQKPQGI SLI IIFF 
FFG\TQSLFVPRNGVHCRGPISAR*KLC 
LPGPPLS PT * AS * VAETTGACYCTWFEN 


10021 


23922 


A 


10096 


189 




ppffffsnlfkksfyeqglalsprlkcr 
giiia/hcs/l*tpg/yk*ssclslp\s 
swddrcapprv 


10022 


23923 


A 


10097 


260 


401 


DHLRSGVQD*LGQHGETSSLLKVQKLDG 
RGGLQLI P\RRLMHENHLN 


10023 


23924 


A 


10098 


187 


464 


WYMTRCGSLGPSSAAHESHHP PPQQLMN 
P\PSPSSAAHESHHPPPQQLMNPTIPLL 
SS*IPPSPSSAHESPGRVGRPEFTGGAH 
GYLHLSPP 


10024 


23925 


A 


10099 


397 


1 


FFGGAPCPPPRRKGKTPPPPRKLFPPFF 
PKECPPLPKiasrC*NKNPLLGPKKKNKRPP 
PGRFFKKGFF*NPPKAPLTNFWEKKKKK 
\SPFFKGVFFPPGEPKKPPPFFFFFFFF 
FFFFPTRPPRVRPRVRPRV 


10023 


23926 


A 


10100 


253 


3 


NAINKNKMQ*TKTSEWIO/YADHNATIKQ 
KQKRKCQVL/WRGCGETETLVACWWECQ 
MEQPAPVEN/W*HFLENIKIQLPrDPAI 
HE 


10026 


23927 


A 


10101 


121 


404 


VLAS WS I FFFFFFWKGGLI FS PRLEGG 
G/LILG*LKPPLSGLGPFSCLTLPEGWE 
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SEQII) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alaniiie 
C=Cysteine, D=Aspartic Acid, 
E=Glutan)ic Acid, F=PlienyJalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lyslne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LKP P P PDPGFFFFF* GKTGFPRGGQEGL 
NFLTWEPPPLAL 


10027 


23928 




10102 


468 


1829 


1 RDT YTQNPDKAKGYFSC I FFE I EETTM 
SRKQNQKDSSGFIFDLQSNTVLAQGGAF 
ENMKEKINAVRAIVPNKSNNEI ILVLQH 
FDNCVDKTVQAFMEGSASEVLKEWTVTG 
KKKNKKKKMKP KPAAEPSNG IPDSSKSV 
S IQEEQSAPSSEKGGMNGYHVNGAINDT 
ESVDSLSEGLETLS IDARELEDPESAML 
DMLDRTGSMLQNGVSDFETKSLTMHS IH 
NSQQPHNAAKSLSRPTTETQFSNMGMED 
V/RPRHQ* KS *VPI FEKSVKDLQRCTVS 
LAEYRVWKEEMDAS I KKMKQAFAELET 
LINGSRKWALLAEMDKVKAEAMEFLLSR 
QKKAELLKKMTHVAVQiyiSEQQLVELRAD 
IKHFVSERKXDEDLGRVSRFTCDVETLK 
KS IDS FGQVSHPKNS YSTRSRCSSVTSV 
SLSRPSDASAASSSTCASPPSHTSANKK 
NFAPGVY 


10028 


23929 


A 


10103 


254 


2 


KKKDFPLTPLQFGGLKIQLFFFLQKKKN 
FYFPP*1FPKIFFFFPFWFGTQSCPVT\ 
RLECSDRIPAHYNLHLQSSSHSRASVSH 
E 


10029 


23930 


A 


10104 


256 


419 


KVIELHTGR/CAVGQHVGRPRRVDCLSS 
GIGDQPEQHGEPHIjY*KNAQSSQAWM 


10030 


23931 


A 


10105 


153 


439 


TTVTSLCIYYTIYYYFRLLSTHAYTHIY 
THIYV*KLTIKQPQAGPSEGF*KKEF** 
RRW/WVGTVAHTCNPSTLRCQGGWMT*G 
QEFETSL-OlCJMVKL 


10031 


23932 


A 


10106 


424 


157 


S SHLGHPRWVDHLS PGVPDQPWQHGETP 
SL/LKLAGGACL* SQLIGRM/RLS SGGR 
GCSEP*SHHCIPVWETEQDPISKNKQTI 
PRMFKN 


10032 


23933 


A 


10107 


213 


3 


NYKCPPTPNLNTLKALYFKKLILSPKA* 
AIKEKID*/IIKIKHFYASNIINRMKRQ 
LTEWKKIFANHVSDKiG 


10033 


23934 


A 


10108 


300 


1 


TPKQRGKKGSPHEKPFPVNLPGKTLF/Y 
EV* FMMPLK*QKKKErFQGFFFFFLICGT 
VYHQFTSK*ERGVPGTVAHTCNPSTLGG 
RGGWIT+DQEFETSLG 


10034 


23935 


A 


10109 


120 


421 


KKIKKKlSrKNHMII\SIDAE*AFDKIQHR 
FMIKTUSKLGPEGKHI/KIMKAIYGRGG 
QDGQLHTAKK/EPLPLRENKISGKP 


10035 


23936 


A 


10110 


102 


418 


TLVCVeVCVCVCVCVSLSVFLSLSLfiLS 
QAGVQWCDHSSLQPRPSG\SGDS PTIAS 
WGGITGVHCHIWPTFYFLFLIRLKSLM 
L**TSSEMaiiTKMGVHSHGLFN 


10036 


23937 


A 


10111 


382 


2 


VWNATMPSKCSTEKSHMSLTSNQKLEM I 
pnjSEEGMLKAKIGLKLGLI.C/QVSQVVN 
AKKKF*REI *NPTPVNIR/DMQKVLV/ V 
KTEEQTSHNIPLNQILIQNKSLTLFNS I 
KAERSEEAA*EKCEDSRG 


10037 


23938 


A 


10112 


61 


427 


ENTIYQVQCSIKEEYLQRRTLKYASLFQ 
KHICGSLAFLHLL*PKLYHSRSNAESDW 
I IQLFS VQPNVKE ISKHKIGW*RGHACN 
/RQHFERPRHKDHLRLGVRDHPCQHGET 
PSLLSLLKIHI 


T0038 


23939 


A 


10113 


71 


399 


NVLGKEARSVGWEAVRGQISRVS IQMKW 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ]VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=V aline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














KGLIKGEKLKMEKSQACICIYFFFFETE 
TNFIAQAGGQGHDLGFMEPPSPG\ *SDS 
PA*APHEQNGL/TGMHHNPRVIFV 


10039 


23940 




10114 


362 




SKKKSWGSHSGSKCHTSFLSYPTKEQKS 
YLS LFYKCNLQS LTFKELLDHFRPVCLP 
TIY*imiWP/RDLPLKPSFKTLML*NQT 
L/WPGWAHACa!rPS ILGGQGGWITRGQE 
LKTIKANrVK 


10040 


23941 


A 


10115 


314 


3 


WRERRCGAHLMRCaSNMAHDKPSEMLNL 
*KPKMA.T/RNTDHDNFW*GHGETEILIS(H 
YMWDYKIM*PLWKTI*QFPKMFNIKLPY 
VSYIQLLGIYPRHLKINIHTKT 










44 ! 394 


PGFQRVSQDGLDLLTS*SAHLGLPKCWD 
YRHEPLRPAGIYSYPAVLPVLSTYRASN 
IT*MLMTPKPTSG\YSFLLSERFSSFSY 
LRDISAYMSLSCLKLNVFKSKPIIFLSP 


10042 


23943 


A 


10117 


398 


3 


MESCCFMGIEFVLKEEVLEIGCRTM* lY 
IjHY*SLHLRMVKMVTSLLCVFYHN*TIF 
KKSSVFLYAGNKQL* I * ILKNYI YVMIP 
FTITSRNMKRHNLTKDGKDMSTETYKTQ 
LKEITD\*NKWRDTTFMHQN 


10043 


23944 


A 

. 


10118 


104 


386 


kvpvwatcgvkmhrlaghggwrlwtrlp 
wrlrrdnrlssgsrgaXaaeilplchct 

PTWVTEHDSITYKY*KKiGAPCWRASS*C 
TLSGHALFFFA 


10044 


23945 




10119 


258 


401 


ynypfltcvmkekigagmaahacnpstp 
ggrgg*\trgqhletsptnmv 


10045 


23946 


A 


10120 


310 


384 


gvfpfltflyggikggpppprfflf*fa 
flffff/etgshsvaqancsgsvsahrs 
fhlpgpgdpptsas*vtgttgichhs 


10046 


23947 


A 


10121 


245 


479 


tsslyrkiqkrflymralevlkicntyf 
mnterfplklfgqfs/friisi*ffpfl 

FPKI YHYFLRDRV* LCHPGWSAV 


10047 


23948 


A 


10122 


268 


469 


nsvfwlgwaha/w*vnpstlggrggwi 
trgqelettpanilpp 


10048 


23949 


A 


10123 


221 


423 


gqagavahacnpnalgrprredslrpgv 

GDQPGQHSKTPSL/LKSDI!IF*KMGWLGP 
WRWWLTPIIPALW 


10049 


23950 






73 


183 


AGGGWALT1JTVINPT\PPSTPFTPCLSY 
S CT I S TASAFRFPG \ S TSTAVLTHVR I L 
KSTPDAPHPPLKI PRARRNIQRDHTLS S 
NLFYYNHQLNVTEELTYYDLT* INPTAT 
IHAIHTMPLLFLHNLNCIS I 


10050 


23951 


A 


10125 


342 


2 


PNLGPCPPPRGTKGGPFFPSFHPSI\IG 
PPNFFSF*RP*FF*MLKSLGVP/SPTGP 
IPKFPFFPGFKNYFSPPL\PKGPLLKPP 
RALFFFFFFNRHRVLL\CPGWSPTAGLK 
ESSC 


10051 


23952 


A 


10126 


85 


442 


KKVINIKCW*RVWRTGSLIHCSWG* kmm 
PHWRAVWQSFR/SLSMYLSYEPALPLPG 
SYPRALRTTCVCTHSRCFSLLELPQA/W 
NH 


10052 


23953 




10127 


180 


484 


KSFFFFSQSAQPIKVIPAP*NFHFFGVK 
RSSCLSFLRKWGPRWVPPPPFFFFFFFL 

ifvemrshyaaqaglkllass\vpppsg 
iprtld*sypd 


10053 


23954 


A 


10128 


376 


1 


WSPGAQLLFISCLYYNLWSLHTGRTPAK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ingto first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaiiiic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PP*GWTLQKTTPLSCPAP*LPPPLPSPL 
QQIDLKEKLVFTAPACSIi/PHSPNSSPW 
PQRRWPSVS/P/PVAHPAAPRLPPPPVC 
ARPLWLPEAPLLGIPPH 


10054 


23955 


A 


10129 


140 


454 


CLIRSSINSVHNMIYIHSTPDRHLGCFQ 
FLWTKNAAVSRIiAYVF* * I * I PTYDHI 
HTHTPAGY I PRHGI AGSQLTNI FNFS S * 
LSSNSF\QVGCTSYFLFPSHI 


10055 


23956 


A 


10130 


340 


473 


AGAEHSVLRPRPPLAGSSCPSS /CGGHC 
VALGVSCPGGGCRAACVPGCCCLLCGLP 
SDVCVC/ S VCWGGSMGSHLGGWAAPGF * 
GNSREGPGNCSSGRKAGLAGGGLGPAWA 
EQAPDILCCGPGPHWLGflPAQVPLAPPL 
SPPPCLYGCLYSDRGDL 


10056 


23957 


A 


10131 


280 


2 


DSLALLPRLEYNGVNVAHCKLKSSWIRR 

SSHLSLPiaiLDYRSMSH\LPGLEIF*YS 
YCB;-*CLLTSPYFEFLMKSLEDI3SCFIP 


10057 


2395S 


A 


10132 




3 


IISTDLISHSWTLYLISSLVYVFLKQHS 
LAVIiPRIiDCSGMIIAHC/KLQTPGLKR/ 
FL/CLSliPST*DYRHTPPNRANFYYYYF 


lOOSS 


23959 ■ 


A 


10133 


272 


3 


VGKPRVFPYEIAFNFGRVTS II,GPHIKK 
YHFQCRGGKHCFF* KaPPFFFF\FNETG 
FCSVPRLECSGTIIALCSLNLQGSCNSP 
TSVSQV 


10059 


23960 


A 


10134 


217 


3 


FPRFRPIiVSPALTELGQF/HFLTRGL*K 
KKPFPFLFFFFSETESCS IA\HLW6SGM 
ISAHCNLCLPGSSRTRG 


10060 


23961 




10135 


229 


3 


KNDYLKCWQGCGRTETQMLGCWDIK/'LI 
QP/LWKIIWQFLLKLNMHLLCNPTLPLL 
GIYP*EMimVHTQTCICTDAW 


10061 


23962 


A ■ 


10136 


84 


352 


RREFKTGLATWQNLVSTKKYKTTVRYHL 
TSTHMDIIKGKR*QVCW/RKC/GDKGTL 
UDC / WMRMLVQP I WKI V* VWKFCRK* KL 
NLFFPPPPQ 


10062 


23963 


A 


10137 


280 


2 


KCVWPEERILREKSQHLGKIKQEGRLN 
AWLIQPGQHGETLSLQKI /QLAGHGGA 
PLRSQLHRRMRREDHLSPGVRSRSEP* * 
LSLHWVTEQ 


10063 


23964 


A 


I0I38 


2 


386 


HIQEKQLPNKDSTLNPYFLLISILKRKD 
EKIQMPSKHMKRCSTSLVIREMQIKIRS 
HFPH*VGKILKSANIKRW*MCRKRG\TL 

ITCW*TYK*TYKLVKTF*R1ILQHPLLK 
THQSYYPAIPHLKETQA 


10064 


23965 


A 


10139 


229 


1 


TRSYHESLTILHTNHHLNCSPFPQAITP 
*SRFPYSNRA\WLGSVAHTYNPSTLGGR 
GRR]Sa.*GQEFKTSLGNLPK 


10065 


23966 


A 


10140 


261 


429 


LSHLFHVALI*LGLVW/SFVFVFFETGS 
LPELECSGVITAHCSLDLLGLSGLPTSA 

F 


10066 


23967 


A 


10141 


377 


34 


WVFGSSGPPPPGFKKFFSPPPPKNKNPR 
GPPPPPIFFFFFCKKGVSPLFLPPGGGP 
/HPFLSPPPPGVKPENFFDPGGGGFP*! 
KIFSRPPPPGGKQNFFFLKKKKKKKEHQ 
IKR 


10067 


2396S 


A 


10142 


208 


424 


FFSLFMQTFHIHIRFYFFLAGGGGAGVD 
EVS/LLSPELECEDTISVHCYIHLPGSK 
QFSCLSLLS?I*NYRHAP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

peptide 


Predict- 
ed end 

location 
correspon 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

G=GIycine, H=Histidine, l=lsoIeucine, 
K=Lyslne, Ij=Leucine,M=Methionine, 
N=Asparag!ne, P=Proline, 

T=Threonine, V=VaIine, 
W=TryptopIian, Y-Tyrosine, 
X=Unknown, *=Stop codon, /""possible 

nucleotide insertion 


10068 


23969 


A 


10143 


■ 


297 


PTESPQVSNLKHKSFISFYLFIDLLSVC 
LSVSVYFETSLTLLPRLECSGAITrHLS 
LLSS*GHRHAPPCQV\FFVETRFHNVSQ 
SGIELLSSSHLPTPA 


10069 


23970 


A 


10144 


185 


488 


SGFPRGTELIGAVCVCVCVCACVCVCIK 
LGYI*1JHKRI YI *LLyiYSTYYIFNLSL 
*LSVY\SVVNRNSM*DRSQSI*IFILPK 
LKTCLENTHTLKHTHEGG 


10070 


23971 


A 


10145 


283 


3 


liGWPGRKRDGSGKLTIDYGRLDKLVSPX 
I * SAI SDMASTTKAVLQAQRDQYS VLDL 

ahaffsirXnqsqfsfirdspqyaftlp 
seehlnsldgq 


10071 


23972 


A 


10146 


146 


380 


cflwdnffffiftektgscsiaqagagt 

Q/WVGIISYCSLKLMGSRDPPTSLSCVA 

gttgmphhaqviyffifl*rqgl 


10072 


23973 


A 


10147 


275 


3 


IKGMLKAKTG*KLGLLCQyTSQVENAKE 

eflkeiksatpvht/rmirkrncliaem 
ekv*vtlidqtghwiplsqslpqskclt 
lfnskkaer 






"A 








ki flpqppqemvfqgfappgg* i flvf* 
*kqpfpplgrfvfk/lpggipfffffff 
fffffeteshait\rlecsgtisahchl 

RLSGSSDPTRP 


10074 


23975 


A 


10149 


413 


139 


S LRVWP I FGLLQANKKKFFW ILFLS / 1 R 

plf ffsrerggtplfp pfflrnpgkks f 

PPG*PPPPLKIQFPGKYFFTSRPAPFFP 
LFCQPPGKNLNYWGLGPGFPKIFPPPPL 

fffff*drvslcrsgwskvars*ltwgk 
afftwvs 


10075 


23976 


A 


10150 


100 


431 


GPRLTDHLRSGVPNHPGQHGETSFLLI/ 
HWYSGG*GRKIAIMRLrjPKGGGCSEPKSC 
HCTPSWGKKRMFVSKKKKKKRE 


10076 


23977 


A 


10151 


134 


417 


GLAAPLVG / WGKEN P P P * P FG *GE KTGW 
GPHPGKIFLKKKLLFFFFF*KGGLSFPP 
GGRERAHFWLKKTPPSRKKEILPPPPPR 
RGGGGPPPPPFL 


10077 


23978 


A 


10152 






lafpilqevpqcfcnllhslrmysstgs 
klfplaspqphsllyhltlahapnilee 
lqspntalllkyl*eaalkkksr*hits 
sl*kvdlsffletkprsra\rlkcsgti 
sahcslrllgpsnscatasqvag 


10078 


23979 


A 


10153 


382 


297 


gln*eswsgekgtl/lhcmweckfvqpl 

WKMV*RL 


10079 


23980 


A 


10154 


209 


15 


sspkf*ffflspkkiplpff*piffftk 
tp/hfff/sffffvcvsgshsvprlecs 
gaitahcssts 


10080 


23981 


A 


10155 


153 


422 


rtsgkngnrgqiilvtmvtnwlfsffff 
plktgphfviqaggqgrahgslqa* ifg 
\ s gdllaiivslqagtagfhhhaqls fff 
kktglf 


10081 


23982 


A 


10156 


257 


3 


rrlqnskdcclffpleaprgatarcqpe 
lscmrclsa\llggvsqsgymgvrdphk 
eav*plaelkr*agrssapfkavrqgcl 

S 


10082 


23983 


A 


10157 


37 


386 


datrfrinsvgtaeggldriepndlq*y 
inlvgtavrrpdriesnfesstvgkkls 
wkivcyrsffceresqltqqtsmlyyfk 
klpqlpqpsaittlh/srqdppsahrlq 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 

correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIaIanine, 
G=GIycine, H=Histldine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
1=1 hreoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=UnIcnown, *=Stop codon, /=possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














LGQ 


10083 


23984 


A 


10158 


407 


272 


HKGGKRERKRKKKEKKEC3GREKGQERRK 
ERB/RERKKGRKREMKKVNRYSKCTQ* * 
LLSN 


10084 


23985 


A 


10159 


248 


478 


VKWHRLLLDPSVEDPWKKSLKGPLQP* 
/RL* I FFF*NSGLLCHPSWRAVAQSRFT 
GVSTSQAQAILPSFSLPSSWDR 


10085 


23986 


A 


10160 


458 


129 


GFSVLNRLEDEDLRQE1SILLMPAV*DQPG 
QHSKTPISTKNNMNPAGHGGVYP*SQLL 
RTVRPEDHLY/SNKPCSYLCAPAWATET 
LSLKNAFSCCSNHYGLSFQTSFHRILMF 


10086 


23987 


A 


10161 


82 


291 


YSKTKSIiFFAGHGGTCIYSQLFRRQVQV 
RQEDHSNPGV*GCS/DTMIMPMNSHCTP 


10087 


23988 


A 


10162 


6 


409 


lrinilpepislSpltapavfmpwtnys 
scdtfslasmtllplhflstslalpwsl 

FLAPSLFQISKHQLYLSPVLLTLSSCYF 
LNP CFRNHI LMNLGRPRQANYLRSGV* D 

qpgqrgetpsl/lkntkitwpww 


10088 23989 


A 


10163 


249 i 415 


ap p ec s lpllvlllllllll flrwedhl 
gpgg*gcsepksyhctsawvtr\gdpvs 


10089 


23990 






1 


405 


rsewrlrqperqsbtvkkkkkgffffsr 
gggggkifglmepspsglgqfflftple 
nggyrppplggpfffffeocggfplcgpg 
gfefpalgtppllfpkglnfrggptppa 
rffffwgpvfsvfspkii*nppvfrgkng 

KNRPPKKKKPGRG\WGPPRKFNPFGNKR 

ggvpragnskppgpqrgnppflkkkkkg 
ppkgggr*ppfsrgvnkkncpkpegegs 
inpki fppppprekkkkpffffltvslc 
rsgwrnlhsl 


10090 


23991 


A 


10165 


175 


404 


ai kmvwiqavp sqclhsvgfylprsvcv 

KTP*NKRHRQGWAHaCNPSTLGaRaGW 
I \RGQEFETSLTNMVKP 


10091 


23992 


A 


10166 


317 


2 


PFKNSPLLLNEMTNIHFLEPTCNYVNVS 
PFLK*HKFLFF*EMESCSVT\RLECSVT 
IIAHCCLKLLGTSKPPTSASWAPGMIIG 
MRHCPAPKVCSHASAHASAHAS 


10092 


23993 


A 


10167 


325 


2 


KMGEKQGHFIKEEIRKANKHIERCSRLL 
AIEEM*IKATVTI/RKHQTKC*QG*RES 
GSLVHG* *KHKIVHPLWKILSVSYKTKH 
S 1 1 1 IGYSSVLLGTVLVHSGSDAW 


10093 


23994 


A 


10168 


245 


3 


GEPFLFCFPI KKPKVFFCPYFNQAVFFL 
ETHRGFFEFFFLF * KKKKKS /LGMVAHT 
YKPSTLGSRGGQIA*AQEFKTILGNMS 


10094 ; 23995 


A 


10169 


164 


1 


MNKMSQyTFPYKN*QWGG\SVAHPYNPS 
TLGGQAMQ I T * AKE FETSLSNMAKP C 


10095 


23996 


A 


10170 


214 


419 


LS*PPPPFPGFKKFNRVLRFYGKSKPGP 
/FMFVAIFQKGGPFYQNIPPPVKLEPFP 
PTIl3HVSFFF[i*DGWLCHPGWNAVA*S 
QLTASSTSRVH 


10096 


23997 


A 


10171 


168 


1 


WTWWLFPGFS I TNNYCSHFSS LHLAIW/ 
LGTVAIIACNPSTLGG*GARGQEFETSLT 


10097 


23998 


A 


10172 


139 


3 


PAH/CES* PPGLKRSSYVSLPSG*DYR/ 
HTPPH\QLIFVFWEMGFHQPA/SA/FP 
KC*DYRS\DHHKWPT*HL.K/SFHFIKK* 
II*LICRYrACSRLWYLFCLIRCMV* *1ILD 
DLSlLYLHLWKFQFLFIYIiFFEMES 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


JM 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, ]>=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10098 


23999 


A 


10173 


253 


2 


FPLKICNTLPPQIFSTQRGLLSLPFTIF 
LLSLILLRPFFFFF*TESHSVPECSGMI 
SAHCSLRLPGSSDSCASAS\QVAGKTCA 
R 


10099 


24000 


A 


10174 


300 


3 


GRAFFFKGLVKWVFLTPHPIPFLLKNFL 
KRVFFLAPSEKFVFFKNWPP*LSFLIFG 
EKNFFFFFFFFSEMESCSVA\RLECNGM 
ISAIICNICLPGSSDSP 


10100 


24001 


A 


10175 


117 


381 


EILRYIVSIRIRRTNLKFQQEELRVNIK 
NNLSSLGNMVRPRL/ECSGVITAPTAVS 
APGFK*SSHPSLPSSWDHRRAPLHLGDF 
FLIETRSHHVAQAGQVIFNVYSKFLLLK 
FKISPSDPYRNNISQDFLAIGPTNMVPL 
QPYIVLCACRTKNNKIIDLDSEQSSRT 


10101 


24002 


A 


10176 


194 


412 


TLKNLLLSLHCLSFVPTDLYQPERLSKP 
PRPAKKYKFLIL/WLGWAHACNPSTLG 
GRGGWIT*AHEFDINLTK 


10102 


24003 


A 


10177 


100 


339 


VBVKSLLLCVCVCVLLL*RISDHLPTLY 
MILG * CFFS YLVIS YS \NS T Y CI LQRHL 
LAIQKQP/ CS FLRQyHLVGWI.HKKKIITH 
THTHTHTHTHKSNTHTHTHKSKDLTGTQ 
NTTEFLLK 


10103 


24004 


A 


10178 


229 


2 


KBKYIKSKLHFEARHNGSHL*SQHFGRP 
RIiMDCL\VQDQPGKCGE]SrPYLQKKTKIS 
QAWWLVPWPVYSHEASKTLS 


10104 


24005 




10179 






SGS S PS TS PQGRPDPLGLFS AKI VLGQL 
ATIPLPLCFLLKIFHPLTLPPLLLLGGE 
FNLS PQQQDS ISLPWFIiCYRYGPTYKAF 
FYV*ASVPEKI\FFFFFF*DTVSLCHPG 
WTTVAQSRLTA 


10105 


24006 


A 


10180 


398 


1 


QNIMLYTINTYNFICQLKKKTIQWGENF 
FFNKWC*EWWIFTCKEiasrLHPYLIPYTK 

tnskwiselnlXkslkvnrggnihdfqf 
gngfiigdtksktk/akisklefikiknf 

CVLNDI IKKVKRQLIEWKKYLQ 


10106 


24007 


A 


10181 


252 


419 


garfkgsnftsaggqgiiffmgppklis 
rpvf*hrgegktpgvtqiinrleehplfa 


10107 


24008 


A 


10182 


101 


380 


FCWIYWSWLIGYLFFPFRTLNILaMCX3G 
S CL* S QYFGRQRQAnHLS PGVRDQPGQY 
GETPS/LTKISWAWMHVPWPATQEAEV 
GGSLEPREVK 


10108 


24009 


A 


10183 


2 


377 


PGQVDCFSSGVQDQPDQYGE ILFLLKLQ 
KLGGHGG/ IR*EDHLSLGGQGCKEPRLY 
HCTPAWVTEQYLVSQKKKKKKKK 


10109 


24010 


A 


10184 


140 


3 


DGVLLSLPRLECNGAISAHRN\SPSCDS 
PASASQAGCCGTCP*SQL 


10110 


24011 


A 


10185 


364 


3 


SVCPPRLGGVSQSGDMGVRDPFEEAV*P 
LflELERCAGRSTAFFRAIR*AHLSLLK\ 
FTRNHPFPHVPCPRVMSR*FGLLPFFQR 
CPAQRGEIWRRSLAAVALRS CGGRLQVR 
TCWQLCLHR 


10111 


24012 


A 


10186 


2 


249 


SVRLNKNKENTQ IRNKKGDTINLTE IKR 
I IKEYSEQLFANKLH/SLDEMNKFLER* 
NP LHPGAEAQRLVS *RFWRATAIDHMRP 


10112 


24013 


A 


10187 


270 


2 


pwaqrgcvfktnqkiffgffei phld* p 
cvfqrpps\f*vrlifigggfkpfffff 
fhmkscsvarlecstisivhcklelpgss 

DSPASA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

sequence 


Predict- 

location 
correspon 

last^amino 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid,'F=PhenyIaIanine, 
G=Glycine, H=Histldine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Metliionine, 
N=Asparagine, P=Proliiie, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=UnknD\vn, *=Stop codon,/=possible 
nucleotide deletion, \=possil)le 
nucleotide insertion 


10113 


24014 


A 


10188 


428 


28 


PI FDKG/E*GN* WGKKS IFNKWCWEKCI 
C/ KI^liRDSSLTSY/TRINLKWI KDLNVR 
TK/STKLL3ENIDIV/IYDLGLGIGFLD 

HTLIL'YR-Vi'K/AKIEKLDKIKIGNFYASK 
DPKiaiKS*FIjiFIEQGK-B"JNUvSDDTS 


10114 


24015 


A 


10189 


85 


243 


DRVS/LLSPRLECSGMISAHCSFDLPG* 
SDPFTSVPKSSWNYRCPPAEIKLS FT 


10115 


24016 


A 


10190 


96 


440 


LQSNHINTLNVNRLSSSMKRFRLSDEIF 
\KKHGPALCCa:,*EMYYKDKGTCTLKIKG 
WKNIYHRNflMNNKT/GMVLLSGKIRLES 
N/TVIRVKEG*LVI/INGRLI/QEDLIL 
LFILNVTAP 


10116 


24017 


A 


10191 


137 


423 


AKDLDRQFSKEDIHMTNRYMKTC/SGRQ 
LIRETQ IKTTMRYPLTPLDLRKSKDKKS 
** * YEQKGTPAHC* *ECKljVQPLQKI I * 
TFPKLLKINL 


10117 


24018 


A 


10192 


221 


72 


AKVK*WVFFCFLFFF*DGVS/HVWAQ 
ECSGAISAHCNLCLPGSSDSVRVG 














SDKTKYW*RCRAIKTLTHF*LKSKLVPL 
L*KTI * CYLAKLKIHIVFDSAI PL*DTY 
FR /E * FCICAPKTYSNMVIEALFVPAWQ 
TSKKKNWK*LSCPQIVDNFmiHLMKYH 
LE 














ATMPSYQRIiLLTAVHELESPE*MDQFLD 
TYNLPRFNQEEIENLNSPIMSNEIQSVI 
KCLPFGPNPGPNGFIIEFYHTYKEE/LP 
ILL* LFQKIEG\ ILPNSFHEVS ITL IRK 
TDKnAT*ETHKP ISLMNTDA 


10120 


24021 


A 


10195 


3 


353 


EAAPMPVKFFQLLEPSGPPGLSMSCLGD 
CNGYACFSVSLHQSANQSIDQ/SIYQSA 
IHLLIY/VSIIY*SI\YLLL*SLI*PMY 
VP I YPLSCIiS I YIiS BWPLSCIiS FNNLS I 
TRHGSMCL 


10121 


24022 


" 


10196 


118 


395 


DFNAGLLPSYQVLSLYFCF*NGV\CHCT 
P/RLGDRPRLR/PPKKKKKKTPPPPLGG 
SPPGGEPYTPRPGVGPPPPPFEHPPREN 
PGCFHPPPKAWP 


10122 


24023 


A 


10197 


437 


160 


FSYPRVFPLPALLKRGPGILFWGeNKlCN 
YSSPARGSKICFFKRGPPFFFFFYFFF* 
QTESCFVAPVGVQWCDHGSLQPQPPG\S 

SGPPPSTS 


10123 


24024 


A 


10198 


259 


3 


RQGNHVTHVYSSRLQLILSPPASLAVFI 
QKIYVNLCTYIHIVNLYR*MCICVCICY 
TCIL\ICIYSLVAVAHTYNPTTLQAKIiG 

GS 














1 1 SAHCNLHFPS SWDYSRTP P YPANFCI 
FSRDRVSPCWSGV*HHLEQHSKMLSKK* 
NKIDMHGGMCL* S * LLK/RAE * DVLRQE 
WC LNPGGRGCGE PGS RHCTPAWATRAKT 
LS KAS YAS S S S S PKKKKKKS PS 


10125 


24026 


A 


10200 


369 


3 


TAPPFFFFCRDGLRTSLIIFPSRATPYS 
LVT3G3PLPAGAGGRGL3LEIRVLDLVQ 
DGCPCSQAWAAALGGPGWWAPFLCPPET 
ASCS VAQEGMQ * CNLGSLQPPS PG\S SD 


10126 


24027 


A 


10201 


314 


2 


LHHMRPEV*FTPSKLGVTGYiyiVSNISYP 
WRCRAHK*SQVLVSLRHKHLFPLEMESG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 

peptide 
sequence 


Predict- 

nucle- 

location 
correspon 

last^amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Meth ionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SEVQL\DTNRCISANCNFCLPGSSNSPT 
SAS * EAGTTDS RQ PDAWADAW 






A 








KIGILLKTH*PSIC/VSITNF*yNDTGE 
LKVKG* KNIYYANINQK/KAGMVMFVLD 
ETDFRAKK/DYQGH*IMIEVSVHKE/E/ 
LVILNLYASNKRVTKYMK*KDTDIKGEI 
DKSTTIVGDL 


10128 


24029 


A 


10203 


351 


1 


PKFPPLNPAWGTTIKIFFKKKNAHNWFS 
PRKEKALQKIHKKG/RLLAKEISFLGGF 
*TDYKTNPDSQSIRTLGEKKKKIRPGTV 
AYAYNPSTFQG*GRWIT*GQELKTSIiRN 
MVGRV 


10129 


24030 


A 


10204 


2 


392 


FLEGVLLLLPRLECNGAIPAHHNLHLPS 
PRFKRLSCLSLPSSWDHRHRPTRPANFV 
FLVETGFHHVGQDGLDLLTI.*SPCLGLP 
KCWDYEH\DHHAQPSF 














PLSLSNLFPIPIjNSILKIFFIIiK*L*IH 
GKLHKMYRKV/LV*TSPSPSQC*HLR*V 

*HNIQNKKLMGLGAVAHftSNPSTLGGRG 
GWIT 


10131 


24032 


A 


10206 


281 


2 


ISPPLFFLFSPPKKIFFSQKHIFLGPPL 
FYPPPLFFS PPP\PHFLWGFRPVPNPMP 
KIDPQTWAKNKDKGPNFPPPFFFILPPK 
KNIFFTKTYFFGPPPFLPPPPFFFSPPK 
KK/SYPFPLKKNSPNPPPS*K*AAISLP 
PPFFYFFFFFFFSFFLENTWKMADAW 


10132 


24033 


A 


10207 


340 


1 


STKKLVQKLGEHTQCWQGCEATGTLIHC 

rwecklvqplwk/sawqilrql*nfmhl 
pynptiqlqgsysre\qhtcpykdlyva 
alftiaknc*qakflstge*inklwcih 

ITE 


10133 


24034 


A 




136 


3 


K1TRGPPPRSGKFFPFLPIjFFFFFF*DRV 
\YPSHPGWSAMVPSQLT 


10134 


24035 


A 


10209 


327 


2 


INKKYYEQLYA*KFDNLDE lYQFPGRYN 
LPKLTQEKANNLNRPVSLRETKS TINNL 
PKRKVPDPDGFTGELYQTFKEEITP\FS 
TEAKGILPNSFCEVTIIVELFLLRQ 


10135 


24036 


A 


10210 


89 


485 


EIVPLLRFPLVIFIASPTTFYTYGFVSG 
EKSQYKKREKLCDKTLEQSRSLLGMHST 
CGIALHPKREGQMCTLCACFY* fflkes 
HSAAEAGVQWHNLSSLQLP?\PSSSNS 


10136 












IQQIGSNTGNIPAMELALKRNSKNINRL 
AGWNGP*VFAC/PKKVLFYLHP* LFG* I 
* IYQLKITF*ISFYEGMVP/FVFRFLFETR 
SLLSPRLECSGTITTHCSLELVGSINSS 
ISARAEFLQRGANL 


10137 


24038 


A 


10212 


62 


469 


SIiLWKLRSSGASTSLRCLSA\LLGDVSQ 
SGYTGVRDPLEEAVCLLSELERGAGRTT 
ALFEAVRQGRLSLEKL\LLPFVQ ICPAP 
GGGIYR/GQ*ALLSCGGLCPV*AS* */m 
CLPC/EA*TRLLKPPPGS 


10138 


24039 


A 


10213 


197 


1 


WTQPNEWCVTNKDASFQQRCTKKIKT/ 
WPGMVAYTCNPSTLGGRDGWIT*GQQFK 
TSLANMVKPH 


10139 


24040 


A 


10214 


31 


213 


HASLDEGFHHVGQDGLNLLTS * S TCLGL 
PKCWD YRHE P / PAPNWWKKFLS S KAFKR 


10140 


24041 


A 


10215 


442 


29 


ptinptmackclskrkshtsftlnqkle 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prollne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y==Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TIKLSEEGMLKAEAG*KIiDHVH/QVSQI 
VNAKEKFLKE IKSATSLNI *MIRKQKRF 
IVNME/KKVLMFWTEDQTSHHIPLSQSL 
IRKRP*LFNSLKAERSEEAPEKLEASSG 

G 


10141 


24042 


A 


10216 


195 


533 


LVWFWLRVTYGVAAKMSSGTTI IGRLSW 
GWRLHF\KMflHL*GYWWEASVPIiHMKLS 
SCHGTG\SLRVKDPRERLRQKLQCFYDL 
VLEVTYWHFYCILFIRSKSr.SPAHIQSK 
RI 


10142 


24043 


A 


10217 


274 


494, 


CVLKGLPFALPWFFFFF*DKVSPSPRLE 
CSVAl/SAHCSLNLLGSSDPPASASQIiA 
GPPGIiRR 


10143 












VTI KIYLHPHDTTTPLRQGGSRLKEILV 

sy*qqkvetrpshvifftvistqfkktv 
ttrsgavahtcnpstlggregqi/t/rg 
qefetsmtnpvk 


10144 




A 




163 




AVYFS FWRVCEHDIASLYFENSGKRDNK 
FVFLVLQAQGPAWRQGEVHKHIiLPCVYH 
FAFWLTQQPQSFF*DG/QSHSIjAR/LE 
CGGVISAHCNLYLPGSSNS 


10145 




A 








V* KVQM/ WPGMVAHTCHPS TLGG*GGW I 
T*DQEFETSLADVAKPC 


10146 


24047 


A 


10221 


211 


473 


VTWLLLFFS I FRAS S LSTD S VS YVI AFQ 
FYFYLFI YFYLFI FETESCSVA*AGAQW 
CDLCSPQPPPRGS\SEEPSTLQMTSGSS 
NL 


10147 


24048 


A 


10222 


142 


394 


GVCQSLLGSLCQLGGTGVRDQLEEAVCP 

laelepcawrtlllriccslqsqqagif 
/rfc*syihicpfpqvllssemgvlsir 
p 


10148 












KRKTKKSFWTPSGFGLRPP IPKKKKFFN 
SRGKQGGFGRGSGSRGGGGSTSGRGGGY 
MGKVFGMGQ *NFTGDQGLAKGGFVFVFL 
GRTTIGIKGALKRLFA/NNEHDLR- 


10149 


24050 


A 


10225 


155 


443 


GQAREGTLKSQFPFPSPPGWPLILPGGV 
FSPKFFFFFFETRV/CAVAQAGVQWCNI 
DSSNSCASSS*IARI TGAHHH \TEL I FV 
F*ISPPSQ 


10150 


24051 


A 


10226 


290 


485 


KRGEGPVQKKKFFFFFFETESCTVAQAG 
VQWHNLSSL*PPPPG\SSDSPAEFPR 


10151 


24052 


A 


10227 


228 


424 


KLYQNLHDTLFGFELSHYEEVH* KMLKG 
SSWEFVMHSS/VSNKCILICSAKK3CKKK 
KKKKKKKK3CKKKK 


10152 


24053 


A 


10228 


269 


454 


DLFAASQGHAQLLEELMATTLSLTHTHT 
HTHTHTHTPWSLF*SR/SALSLVC 


10153 


24054 


A 








FYI FSKFLYVWLNVSHLDSHI CFCIQS V 
I ISCFGVFYYYYY\FF*RQSCSWHTGV 
QWCDRSSMQPQLQGQNDPPTSASQVAGS 

A 


10154 


24055 


A 


10230 


18 


472 


GPEEFGTRRERDRERERETRDSRERVER 
VRERER* /RQRERERERERERER 


10155 


24056 


A 


10231 


52 


361 


WREaiYYFQIKITVYLENTKDFIQKLHTK 
CW* G* KE I GRAIHCWWECKM / VIQPVWK 
R/VWQFLRKLNMLLPHKPAVELLDIYPN 
ELKTHVCTKftLYEAGRGGSRL 


10156 


24057 


A 


10232 


3 


402 


APRLPLGGWI S FS S PMGAMGU^RATG I M 
GSFFFFFLKRKSPLGPQGEGRGQNLG*G 
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SEQID 
NO: of 
nucleatide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NPGPRG*THFPG*TP*KAGWNGEGPPPR 
GNFGFLKKKGVPQKGQGGSKP/RDPRGT 
PRLGPPKGLE*RGGPPPPAPQK 


10157 


24058 


A 


10233 


48 


470 


PGRRFTTAGLGKQEPAASGPRAPRPAQT 
PLKREAARPSLEHLPEGA*APRSSARSP 
LLIPAASLRSPRRRTCRRRLRHFGI ITR 
PFQPGVRLPDSGPGLA/RWACAA.VPAAS 
RALPAIiRRPL*RRPGHGAPSSKSHGHSR 
SP 


10158 


24059 


A 


10234 


169 


446 


CPPGPAPSHLLEDRDDPQAGRQSSFYAG 
WWDRWREKTLFLFLRQSLTLSPTTPFFF 
\FEAESHSVTHSGMPGVISAH*NLRLPG 
SNNPPTSAS 


10139 


24060 


A 


10235 


264 




KLKPIFKMVFLKIMFMSYIMEVFFFIET 
G&RSVTQAGVEWHDLDSLQP*PP\GSSD 
PPTSVSQVGKWHKTWEAEIAVSRDHATP 
RAKL 


10160 


24061 


A 


10236 


306 


3 


SQDMVKSPKKKFP*LV*GLTEI*KIEA1j 
VFNKKFFFFFFFPETGSTGSCSVA\RLE 
CSGTIIAHCMLQLLGSSNPPASASRVIN 
SIKKLFANFIVSANPRPT 


10161 


24062 


A 


10237 


20 


469 


QCGAIPRRGDHLRSGV*DQPGQHGETPS 
LLKIQKLSRHDGH/CRRLSHKNGUSIPGG 
RGCSELRSCHCTPAWATR\GNSVSKKKK 
S 


10162 


24063 


A 


10238 


313 


3 


TRPRPTQGPPHSGGRACLCI FSPHGPPQ 
TRLSTPVSPSVRPLGFQVLLSAHHGFLL 
CQHGMF\PSLFTMSNITLWPGAVAHTCN 
PNTLGGRaGWIT*GQEFQTS 


10163 


24064 


A 


10239 


274 


49 


KGNPFFFVETGSHSIAQDGVQWCNHSSL 
QP * PP\G\SSDPPTSASRVAGTTGVCPC 
EDKLSHILRRCYAQVFMLAG 


10164 


24065 


A 


10240 


144 


454 


VCKIKI IYLIVLLEDYIN*FVKSSL*LK 
S*K/LIQQMWAIISLPECLNSWCLGKV 
G*KRCSQQGTVAHTCMPSTLGGRGGMIP 
R 


10165 


24066 


A 


10241 


289 


178 


FNYS SI KMWTDGWVWWCLLPVI PALWEV 
EAIESF/CSRDEIPLSCPGWSPVPKLKR 
FYCLNLP * CWDYRQEAPPHPAI S I IFDA 
TVIELFS 


10166 


24067 


A 


10242 


352 


294 


PPRTTKMW*DQPGQHVEAVTLQKMQKL 
VERGGKHPQSQLC/REDH*SMGGGGCSE 
PRP * P CT P AWVTERDHVS KIK*NKIK 


10167 


24068 


A 


10243 


293 


1 


VSLCRE S SNS FI WQ CP WS H I LALC FQR 
EKKRLDLP/S*MVGK*EIPFLVPLLFLP 
HWPLLLMK/YM* IFFFGNRVSLCHPGWS 
AVAQSQLTAASVPRR 


10168 


24069 


A 


10244 


218 


3 


LERH I PVRP III CD YKWVWRF/HFLRPL 
HN*FLFSFFSFFFFFFETGSYS-VT\RLE 
CSGT I SAHCNLCVHLLL 


10169 


24070 


A 


10245 


185 


1 


KKQPQQTKNPLNNVKKEGKGQHFFFFLE 
T*SRSVAQAGVQWCDLGSLQPPPPG\SS 
DPPPS 


10170 


24071 


A 


10246 


2 


265 


AHKKKPAKHIAPSTFLYSETEARRSMPK 
AAQLKCGRAGIILFVFKTESCSVAQDAV 
LWavHGS lqp*fpg\s SDPADSACRVAR 
TTSI 


10171 


24072 


A 


10247 


2 


430 


EAGGREALDGCGLTGRESGVQRRRDSMG 
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SEQm 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamie Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Tsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














CQGTAGRGTRAGKEGPWRPEHLSRRPPA 
SSVLLSWAIHLAWVEGS*TGSLCDLEAP 
GPTLSPWCSWSPELLQLHV/HHHTHTHT 
HTHS/HPNTHTDTQSLCHQPSFSVSPHA 
RTVr, 


10172 


24073 


A 


10248 


201 


16 


TPPGGVFFLGFFFLFFFFETESCCVAQA 
GVQWCDLGSPHPPASRRPGNRSSI /AAA 

*GYHFK 


~Tor73 


'24074 " 


A 


10249 






GFDPFEC3MLSYLEKEET/WPGTRAHTCN 
PSSLEG*GGWIT*GREFKTSLTNMEKP 


10174 


24075 


A 


10250 


92 


389 


FQPGQHGEILSLYEIKK1+/ELAGC1DM 
CLWSY/LLRRLKQEDCSST*VQGCSEP* 
CLSCIPSWVTEQDPDSGRQI*KKKKISL 
GRRGGIFLFLKIWHVFFL 


10175 


24076 


A 


10251 


221 


415 


KKKKRGGRFKGSNFTSPGWRGNIFFMGP 
PKIJSISH&GV*QRRD/WEKPGGPQFNCPG 
SNPPFPPGGK 


10176 


24077 


A 


10252 


94 


383 


GKLNRCGIWLLGRCQRSPAVSNALWLHR 
VAPEEEAGEAWAGTLSLQLSCKSKITPN 
KIVY* KKNKT/WLGTVflHVCNPSTLGGR 
GGWII*GREFETSL 


10177 


24078 


A 


10253 


249 


2 


YPTMIFKCSSKRKSPTSLTLNQKLEMMK 
LSEEGMSBCAELS+KLGLLH/QVSQWNA 
KEKFLKEIKSATPJOTTGMIRKRNGLIA 


10178 


24079 


A 


10254 


132 


414 


NNTYFQGKIFILNSQVKGMLICCWWECK 
*VEPLWKA.VWRFLK\DIiEQNSKEYKL*T 
WNQPRYPAVVDWIKKMWYIYFMEYHA\A 
IKKNEIMFFAA 


10179 


24080 


A 


10255 


359 


3 


YSLRHNI EMRPVSNPTRASKCSHERRSV 
ISLTLNGKREIIQEASKEGTLIOVNKPKA 
KLFVPVRQWMAKEKFLKEAKSAaPLNX 
RMIRKQNSLTADTEQV*VIWIEY*TSHS 

IPLSQSL 


10180 


24081 


A 


10256 


283 


2 


slqpwklyvhvflpkyatyppfgtLpkt 
hffhkilpskagsdlnsiyamtyshrhi 
tiffeteshsvtrlecsgai \sah*nlh 
lpgssnspatk 


10181 


24082 


A 


10257 


216 


386 


PSKNPPPLINPFFFFETDSCSFA*AGVQ 
WRNLSSLQTPPP \GSNWSCASAS /RIAG 

I 


10182 


24083 


A 


10258 


263 


1 


FSQWLILPSPLSYSFLFLRQGFALLPRL 
ECSGEITA\PSSAS *VAEI TGACHHAEF 
LNFFVEMEFHHWQAALKPLGSSEPPTP 
ATK 


10183 


24084 


A 


10259 


248 


47 


IPKSRVKRENRLNPGGKGCN*PRSPPCS 
PPWUAKQNSLSKK/NKVKI YQWLHTETQ 
TVHGDIRGQEK 


10184 


24085 


A 


10260 


159 


2 


KRDMSSLKLEY*KKE/RIWPGAVAHTYS 
LSTLGGREWIT*GQEFEISLTNMVK 


10185 


24086 


A 


10261 


190 


405 


ASLHMFMYFFFFFFPKQIFFFVPQAENQ 
WGNLGKRNP P P PGLRD F\ PAS P PKEAGF 
RGGPH*PGLFLVF*GE 


10186 


24087 


A 


10262 


39 


293 


PSSSTSRIYPEDTPLITQKYICKILFIV 
ALFIFPKHWKLCKCPHIG/E*IKLQCIY 
TWRYYVAVQKNEENLYQLMYWDYQEILV 
SD 


10187 


24088 


A 


10263 


282 


3 


AFYNRFFLKIFFKRGPRGKEKKGGKKEK 
KKWGKKKKKECFFFFSPGQIFFLGGVFFF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

sequence 


ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FLFFFEAESRSVT*AGVQWCDQ\SSSQP 
RPPRLKPFFQ 


10188 


24089 


A 


10264 


196 


376 


kkggghi infpphkgykpgpekkkffff 
ffemescsvaVrl*csgmi/s qahcnlc 

LPGSSN 


10189 


24090 


A 


10265 


514 


157 


gltlsfrleysgmmsl*e*lpg/fr*fs 
rlslpsswvfplmyvyhyaqqspciffv 
btgfhrwqaglkvlglrqsahlglpec 
w\dyrcepl*ltklicftyglr*rskpp 
ffvdgiqlf 


10190 


24091 


A 


10266 


57 


378 


RSRIW*DSVILCS* LVS V/ VDQSyFy*V 
TEILEQIKLLGSKIMCFCILQLHFLCFL 
CGSFVGST\lFIFSRISriWLGTVAHaasrP 
STLGGRGRWIT*DQEFETCIASMVK 


10191 


24092 


A 


10267 


1 


212 


LIQKDTNrYYIKEAL*IEN!ari/KKISS 
IVSHYKMQIKIMR/YHTEMAIIKRTDKC 
LYECJRETRTLMHCW*GC 


10192 


24093 


A 


10268 


3 


399 


MaNKHIKIFSASLIRMEI*IHATMRE/H 
*FIPIKWLVLKSKRESNKCWQGCGRTGI 
VIHFQRE CKIVLLL * KTV* QFLKKLNIR 
LGAVAHSCNPSTLGCLRPRVQDQPGQHG 
KTLSLLE IQKLAWHGLHLSPS 


10193 


24094 


A 


10269 


190 


386 


FSPFLPLVTILECDKAICRRGKSFDFFL 
LGPSFYFFKLLL/FIFYF*ERVLLCGPG 
WSTVAQSMLTA 


10194 


24095 


A 


10270 


166 


327 


KTNSKTAEVCPLSVITLNINGIiNSPIKR 
QR*TDF*NPI\ICCLQETHFRFKGTN 


10193 


24095 


A 


10271 


180 




I FFLFEVETCSVA* AGVQWNDLGSQQPP 
PP\GSSDFQLIFAFLVETGY*IMVPSSS 
CRT 


10196 


24097 


A 


10272 


273 


3 


flglfsgnplkftnkpvchltlpi lqri f 
fffesefhsWaqagvqwcdlgslkppp 
pg\ssdfpgrrr* q*flmitplhssrgdr 

VRUiLKKK 


10197 


24098 


A 


10273 


399 


27 


YHFVPTRM2U:iRRKKrN*CW*TCGEIGNL 

vycwweck/lv*llrkivwqflk/dvkq 
nyhpvilllsiyri*fdajflckshvem 
* LI sgwvgawwevigsrewflmsgfti 

SSCHQDCEFSQDLIV 


10198 


24099 


A 


10274 


275 


452 


kinlvlqldfffvemeshsvtqagvqwc 
hlsspqpppp/gssgdsrasds*vagit 
svhqhagl 


10199 


24100 


A 


10275 


2 


379 


rgpggrglgsgtcfhpgrggepivyqhl 

F * FFGP \ PENYIL ILPGFGI I SH/I VT Y 
YSGKKEPFGYIGMA*AMISIGFLGFIV* 
AHHIFTVGIDVDTRAYFTPPYHTFEEPG 
YIISRRRRKTKGFLF 


10200 


24101 


A 


10275 


2 


407 


fpspspspfasvslslsfhglplpehtw 
fidgsptkpnrhsqakagyaivpsisii 
eata/papfttc*qaelialtraltiak 
glciniytd/shilhhhavmg*k/gflt 
tqgssiidvslietllkrfllpkeaga 


10201 


24102 


A 


10277 


203 


545 


flwrgqslalllrlecsgsill/ahckl 
gphrfkrfslpqp pe *lsnw\dyrrlpp 
cpanfctfsrdgfttlgqaglqlltssd 
lptlaspkcw\dyrheplviigltfnrsf 
nfff 


10202 


24103 


A 


10278 


250 


3 


akwflemkstpgedaaniiem/ tkdldy 
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SEQID 
NO: of 

sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=IsoleHcine, 
K=Lysine, L=Leucine, iVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DINLVDEAVAGFGRIDSNFERSCTWKM 
VSNSITCYRELFHERKSRLMCQM*LSYF 


10203 


24104 


A 


10279 


333 


3 


S L I LLLFQ SMNHTHVRVTF I DQ FCVCSN 

YSTDWLFrilLSriSLKTrYSLRYNIKIR 
PINHPTr-rarXSKCSSERESRISLTLNQK 
\LElTKJj/SiiEGMS KAQ IG * KLGLLCH 


10204 


24105 


A 


10280 


251 


3 


I TS KVIDSWPTTS 1 1 TiiNLJXGLNTSF* R 
QRLL/DWASSPK*DSIICyLQEMNFKyK 
I FKKIGRRI YYANI SQKKVGVAVLTPDK 
V 


10205 


24106 


A 


10281 


22 


226 


TKINSKGNKELNVRAKTIKLLEENIDRN 
LCDIiGIiGNCFLD*YCIL*TKINSKGNKE 
LNVRAKTIKLLEENIDRNLCDLGLGNCF 
LDMTPNAQTTKGR\IDKLDFIKI*NFWL 
DT 






A 






383 


DAKRKVSSTEGDSEGVAHERRSAHLSAK 
PCPAEVE/AECPQKaAEKG*SSE 


10207 


24108 


A 


10283 


127 


377 


RIPGGSALGKEIRQCFGVKGQREIGKIE 
MSQPTQEREWEGIGSQKAAQVDHLRSG 
VQDQPGQYGETPTLLKIH* lAGHG/GRC 


10208 


24109 


A 


10284 


328 


370 


VGPGPGF*SSGRVSSCCPGWS\PSELK* 
SAHISLPKCMDYRCQPLRPATFFFKTAS 
HYVAQAKVQWCNYSSLQPGTLRLKCCIF 
KKIQWQW 














KWCVLTAPLSGRFSVCLPFIjGPPYFLQ 
HSDIEIKPINYPTVASKCSNERRSHTSL 
TLN*KLETIKLIEEGGMSKflETGRKI.SL 
L/QSVSQWNAKEKFLKEIRSITPVNT* 
MI 


10210 


24111 


A 


10286 


199 


329 


LFFSFLNCLSPHRIEESY* KE IKSAPP I 
TPGIPRK*TSLIAGKKIIVFMDWKDQTNH 
NIP/LSQSKTITLFNSMRAETVEEAEEK 
L/ERGCFMRFKERTHDHNIKMKGPGWA 
HTCNPNTLGGQGRQIT 


10211 


24112 


A 


10287 


358 


387 


YTNALLLYV*DQPGHLMGKDSLFNKWC * 
KNEYSHAKKKQNH\P*FSLYTKTNSKWS 
KDLNVRAKP I KILE KK 


10212 


24113 


A 


10288 


111 


373 


SAQCQPDQHGKTPTLLRIQKLAGPDGAC 
L*SQLLR\*GCGEPRSRHCTPAWAT/EG 
KTPSQKKKKK 


10213 


24114 


A 


10289 


197 


375 


iriglckfir*fletklhtvakvecssa 
isaygtlslvdsgdspasas/rwgttg 


10214 


24115 


A 


10290 


118 


371 


QCLLMMSPGLKT*MTGRIjRQENRLNPGG 
GGCGEPRSCNCTPAWATK\GNSASKNKQ 
T\KNKQQQQQN 


10215 


24116 


A 


10291 


214 


368 


CGYHQKNKKTNPKNSKCWQGCGE IGTLM 
HYWKDHKQV/Q/PL*TIVQQFLKTRr 


10215 




A 


10292 


2 


376 


PFPLMAQRPTHLVPLSASSGTSLLMPST 
LLWLPWLYKAHAVPTVPLSKLFDHTKPY 
AVSKNKLGALSr*SY*VrPVPPRSRF/S 
FIRDKPTS 


10217 


24118 


A 


10293 


1 


379 


NILHNLDFFKNTTLEPQSSRLQRAVIAP 
LHFSLGNIVRRCFKKKSGWAQWLTPVIS 
ALWEADVAGSLEFKSETTLGNMMRPCLN 

AAPF*SSGRVSSCCPGWS\PFELK*SGH 
ISLPKC*DYRCQPLRPATFFFKTASHYV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

seTn'^n'ce 


Predict- 
ed end 

otide 
location 

ding to'' 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Argjnine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possIble 
nucleotide insertion 














AQAKVQWCNYSSLQPGTLRLKCC IFKKI 
QWQDI 


10218 


24119 


A 


10294 


201 


384 


GEERLSVQCSAFK*LR*HINISLQTFFF 
LRRE SHSVA\RLECSGVI SAHCMLCLPG 
SSDSPA 


10219 


24120 


A 


10295 


161 


400 


VCVIESVKANHLVDEHALKSLSFHLSLL 
FGV*GVOTCVCVCVYMCF/CVC lYVC 


10220 


24121 


A 


10296 




433 


GEGGGGCSGWSGVETAGSRPAPGAEQDL 
LPAPALGRWQGLNSVCYPAWPQPNRRQP 
LALLCAAGLGPGQRLECSGMIMAHHSLD 
LPGSGDPPMSAG\S IAGTTGTCHHaW\E 
FHHVAQGGLELLGPSDWPPTVLGLQM*A 
TTPGH 


10221 


24122 


A 


10297 


209 


1 


LTFELLYHHYTLYFF*ESGSCSVAQARM 
QWCDLGriLQPKALG\SGDPPASAS*DA* 
STGVDYYSWHTCI 


10222 


24123 


A 


10298 


225 




RPGMPLSALPCTGHftiPARQPHSAACSGP 
KCPWCQSPQPAVGWTV*VCPLPLSPCLC 
/FCVPQPWHGGLPLPCPPV 


10223 


24124 


A 


10299 


212 


367 


VTAGSGHMCYTWWECELEQSL* KTVWRF 
LK* LKVELRFDPAVPLL/S I YPED 


10224 


24125 


A 


10300 


97 


406 


LHRCTPAWWERGCLKTTNQNNGQHNVE 
GEACSSRPFTTI C/EEKIHLVCALTEED 
♦RLAAETTANTTDIPIGSAYRILIKKLK 
PSKLSTC/YVAKQLSPGQLQTK 


10225 


24126 


A 


10301 


3 


422 


YRSSSKTFLIIRRGFIPSSTFLLAHLST 
TLSTLSHFQKGWVKLFFFFFFKKGFPFW 
PPGGGGRAKP*F/HGNPPPPG*KNFPAL 
PRQGGGKKG\GPPPPGQFWFFKKKGVSP 
WGGGAKTPELGNPPPGPPKOGG ITGGPP 
PP 


10226 


24127 


A 


10302 


330 


1421 


RTKGRPliLflLPVLLLnFLHLAFFSAPP 
GPSSLSSTEDLKWTLLSPAQSRKLQPQS 
LHRAGVRDLQWQGRAGAELTAAC/WPAP 
PRPETAVLLQGLPC/QEGRLLLESLWGF 
ALWSAEGTIFYASATIVDYLGFHQTDV 
MHQNIYDYIHVDDRQDFCRQLHWAMDPP 
QWFGQPPPLETGDDAILGRLLRAQEWG 
TGTPTEYSAFLTRCFICRVRCLLDSTSG 
FLTMQFQGKLKFLFGQKKKAPSGAMLPP 
RLSLFC lAAP VLLP SAAEMKMR S ALLRA 
KPRADTAATflD/SKMRS*LTLSESRLPN 
*FEIWDSAAPGTLPSLPFFGAFDGQHIF 
CIHTRALRC3STRTRQCIAAALEGASFRN 
HVEL 


10227 


24128 


A 


10303 


172 


383 


KI ENKWQC3YEE I ETLVYCWWE CKMVQL 
LWKTTWWLKILKLGPP*\DSIIPLLGI 
WQGTEIRFCKRYSNT 


10228 


24129 


A 


10304 


205 


2 


NFYPA*KTDFFLVGSLFIYFE*ASHPrT 
\RLECSGTIVTHCSLDLLGSCNPPASAS 
HVSGTAGAHHHCI 


10229 


24130 


A 


10305 


423 


30 


LLEGKLTNRKDIRTKTPSVRRLHQSPKV 
DRTAKMGKKQSRKSGNSKNQSAYPPPKE 
HSS S PAMEQSWMENDELREEGFRQS3 /M 
SKLKEEVHVNGK*VKTREKRLAKSLTRI 
TNA.E KYLKDLMELKTSVLRPL 


10230 


24131 


A 


10306 


211 


387 


IFFFF*IKACSVAQAGGQGRHLGSLQPP 
PPGLKQDSC/SPSRGVGTTGMTYHAQFI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFGF 


1 023 1 


24132 


A 


10307 


229 


382 


ERAWQPG * HGET/ PSLKKL * KLSRCGG 
MCL YSXLLRSLR * EDHLNPRSWGC 


10232 


24133 


A 


10308 


168 


417 


GSYCFFYRRNLLIVKYMRSVHLLFIMLL 
YRGD*WYy'i"iQyrJFCILQKC;EKLIYKIA 
/HVRIMPSSLHGGGG* IT*GREFKTSLAN 


10233 


24134 


A 


10309 


253 


402 


KGGGGGVKTPARGNPPPGPPKGGEKRGG 
PPPPTQKGFF/CFFFKKGTRPPGGKTGE 
RAKPNEI PAPGBCKE FFI.PKPPEKGE IKG 
/PRPPGQINF* FLKKTGFKKSGGGGSKP 
RPGGTPPPGPQRGGKNGGGPPPPPKKGF 
FGGKKKKKNPPR 


10234 


24135 


A 


10310 


182 


400 


LHVTMIIFWLYLIKNMLLNFTYF/CYIF 
SVATSTFKIVCV/WLGAVAHICCPHTIX5 
G*DGWIT*GQEFKTRLA1SM 


10235 


24136 


A 


10311 


371 


400 


FrWGNI*ASNKHIKKT\STSLAFRKIQV 
K\TTMRYLSPPTPIRMT/I/I*KSYSSK 
CW*GCEEIRSLIHCKMIQPI*KT/WQFP 
KIINIQLLyMPAIALLGNRTRG 


10236 


24137 


A 


10312 


113 


404 


QRCGKIGTFTHCWWEYKMVQPLWKTDWN 
LLQRLNIELPCDLAIPLUSLYPREMKTS 
IYICT*MFIAALFII/AKWHQP/KCSSA 
DE/IGKMWHICIMK 


10237 


24138 


A 


10313 


146 


1 


FIiTQKFFFFFETESRS *AWQWLDLGSL 
QPPSP\GSSDSPASAF*VAG 


10238 


24139 


A 


10314 


157 


434 


FFFSEIFFFGPGVFSPPPFFIGGPGIFF 
PCPLKKKILPP PRGKKFFFFKKGP/HP F 
FFFFFP*EGVSLLLPRLEC»IGAILAYCK 
IHLPGSSDLP 


10239 


24140 


A 


10315 


277 


2 


ffftafflgvpgvfppprffkpgpgfff 
lgp*kknffppprgk/ikffflkgppsf 

ffffff*egvslllprlecngailaycn 
lhlpgssf 


10240 


24141 


A 


10316 


1 


388 


KTITHEVITECIWLKVLLLYFPNCMCIY 
VRMHICVCVCVHISI*MDHIVSYKCIHI 
\LCIY 


10241 


24142 


A 


10317 


134 


399 


LGLPGGDTRGGGTIENPPFERALNSQNG 
GK\ WP LLGP KTRP LGWGLCST PKFWEKP 
RVRWDLN* TTPCLE KRAPWLKAKKTPA 
FEIRI 


10242 


24143 


A 


10318 


233 


3 


EFLFSLICAICL*CALKQFSSCFFCLAL 
lELLGSMGLLFH/SSLETFQPFFLQIFF 
/SFFPLFFGANYTYNKPFEYSLI 


10243 


24144 


A 


10319 


221 


S 


CDQPKSPPLSGVQMGDILWLVFFLFLF 
AF*R*GSHCVAQAGLELPGSSDP/PQAT 
SQVAGTTGMHCTWLVF*F 


10244 


24145 


A 


10320 


1 


407 


rs tpd pvylgftsggcktekragcfflw 
klhtkgaparcqpelsgmrc\r*pllgg 
v/sqsrymavkdsleeagcplselkrca 
grytglfkggrldhlnllrlcpqttlcp 
galsqedgsfiykpltgaaaflsym 


10245 


24146 




10321 


115 


398 


adeelllmnkqikyllemestpgkddvn 
tvemtkeseysi/nlkavagferidsnf 
erlsavgkml*nsitcykeifh/rksql 
msqtsllsyfklp 


10246 


24147 


A 


10322 


137 


454 


FFSLYFQGFFFFFFLGEKFFFSPGGKGI 
CPKPPFGPPPPPGKGFSPPPPPGKAKKK 
GPPPPPP\GFLFF*KKPGGGFLGGFFFF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 

ding to 
residue of 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIetiiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptoplian, V=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGGKIFFSPAGGKGGPQKPPFP 


10247 


24148 


A 


10323 


246 


372 


ISIKSYQLW\LQGVAYACNSSTLGGRGK 
WIT*GQKFKrSMaNM 


10248 ■ 


24149 


A 


10324 


415 


1 


IWGAPRVFPPPPF*NPPPEKNFWQPKKK 
KKFPPPPG*KIFFFKGPPPLFFFFFFFF 
SETESCSVT\RLGCSGGISAHCNLRLPA 
SSNSPPPSSLRMPSTPVLRPHGDLLSPQ 
LSAWVLSWPFLPSLIiLPPDLDFIVQ 


10249 


24150 


A 


10325 


3 


559 


GDGITGLSPTISKFLFFVFESISLSPRL 
GSSQAIMAHCSLTSRREPPCRSINFFVS 
VITriNVNHLTNGKIGRMDFF*MM*YTIQ 
ICNLQNTHCRP/KDTNCFKMKRRKEIIH 
ANSNQKRAGVAILIPDKIDFKPKNVTER 
HRMLVRES/ISQEDVTIIHTY 














NQGLGARNQKSQRVFLAGFFFFESSSCS 
VAQAGVHW*DHSSV/PGSSAPPASATW 
VGTTGVHNHPWLIF/VFFVETGSH 


10251 


24152 


A 


10327 


145 


396 


CDYFGEMFSYUliIiNIYLCDDPAIPLLPY 

vpyenentdslkdmtdahssfihnspkl 
etkcpgtdewi*mtkksdyckhw*gyrv 
tgtllyyegecki v*llwgnv* lf\lln 
iylcddpaipllpyvpyeXmktrihskt 
*pmpiaalfiiapnwk/pkcpgtdewin 

TL*HTHAVQYHVAVER 


10252 


24153 


A 


10328 


94 


639 


AS I S S S I KWDNI S F nVGLLGELNE lENI 
CKAVNMKQAIiNKWKLILLFVIENNNVSA 
SSDINFKIFYAYYIAKA.VLLIITEHITLS 
IVLILKKQKNTRAVTKCW\RGGREKETL 
IWLVG*ECKLVQP]:.WREVWRFLIKVKIE 
LPYDPT/ILLGIYSKERKSVYRK\NICT 
PMF \ VAALFTTAKTWKH 


10253 


24154 


A 


10329 


143 


3 


SAWFFFQDRVLL*/SPRLECTVSISNHC 
NLRVPGSSKNYCLSLISIWDR 


10254 


24155 


A 


10330 


153 


389 


rqgltlfpglvSnfwlpvlsshlglpkc 
wgyqcyhawps*cf*nrsi/wsgtvtha 
ydpstlggqgkpit*gqefets1a 


10255 


24156 


A 


10331 


382 


422 


rykldl* klntelpydptvpllgnypky 
*t*af\sktct*mfiaaysqppkgrskc 
sspne*inkmwpiyimr/yysaik 


10256 


24157 


A 


10332 


194 




CKRQCNKEDHNI/CSK**WPTILF*TTK 
RKG/WLVAVAHACNLSTLaG*GGRAS*A 
QEFKTSRANM 


10257 


24158 


A 




348 




NSGQSWI MYPLLKKGVGRVS P PDPDI VN 
S I SMWCGGRLFSNERFASLLTI KKKLA 
GHGGTCLQSLPLGRLRREDRLSQGGQG/ 
C*GYSE*LCCRCIPAWAAEGDLWSWSPN 


10258 












IPGSPKRGPGGEKKKTFFHYLFLRAAPP 
LPRFIFFPPKNPPLKGGGGFFFPPPPKL 
KTPRLRKKKKKENFCPWKKFFKKKQTPQ 
RGRFFFFFFLRQS FAVAQAGVQWCDLGS 
LQPSPP\G* SYHPARTRG 


10259 


24160 


A 


10335 


3 


396 


LLRPPCSLRHNDIEIMPHDNPMIASKCS 
SKKKSCISLTLSQKLDVIKLSGKGMLKA 
KTGQNL/GLGHLYRTVRHAVNVKEKCLK 
EIKKAALLNTQMTRK*NSLIADTEKM*V 
V*IQE*AS/HSMPLRQNLIQNK 


10260 


24161 


A 


10336 


165 


442 


GQMELLIIARNFGILLHFNLSIYYRRLF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 
peptide 


Predict- 

location 
correspon 

litst amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, I>=Aspartic Acid, 
E=Glutamic Acid, Tf=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FS I PEYLFIFTYLFIYLFIYFESVAHSD 
AQVE CSGT IMALT* KKNASLP/ASAPRA 

AGATG 


10261 


24162 


A 


10337 


33 


430 


TGSTHASANFCVSLVWTQL I SRS SGQGL 
ARLQLRCLPCCILFFSFFSLVIYRLHSY 
LRLPWRRICLQVPSGC*QNSFPaiFRIQ 
GSLLL*/MQQW 


10262 


24163 


A 


10338 


284 


3 


SQHFGRPRWVDHLRSGVRDQPGQHGETL 
SRETP\SLLKIQKLSGHGSACL*SQLFQ 
RLRQENCLNPRGGGCSEWRQHQCTPV*V 
TEQDSVPKKK 


10263 


24164 


A 


10339 


414 


2 


ERAGPFSRKIFFSPPLAKKVKYPFPPTP 














FFPSLW*BOQ*GPSLGRLFYPV/SWGIFP 
SKP*6FPGPRPKVFCFLG/LPPGPF6GP 
FFFFFFGDRVSLCHSRWSAVAQSQLTAR 


10264 


24165 


■ A 


10340 




422 


ASLGNGGRPSPSLAAALQFDL*LRiVGII 
FALFKaARQGDLSL*ELLLSP\VCLCPA 
PRGGAYRGRQAFLSWGGLHPDRASRLFC 
LPKQAWAMftGAPPPATiTiLPCSLISYCRA 
SNQQDSGGIGPSEPSAGYKLLVPGFLLP 
AEK\RTIYGGVTRFFRGP 


10265 


24166 


A 


10341 


116 


432 


SRQRMPSTYHVQGSGDTKTGEKPCBNGN 
KGCGSSILRSCLHRKRPSF*SLKALVWC 
GGACLEAQLLRRLRQEAHLSQGVPGCNE 
L*PGYCTPAWATEP\DSVSKKK 


10266 






10342 




458 


SLWVCVLTTLPSSCSPISLPIiRGPPYSIi 
RHRNIEIRPINNPTMMKCSSKRKS*MS 
LTMrQKLE\MINIiHEECMLKAKTG*NLG 
LLHQIVDQWNaKKKFLKKIKSAIPVNT 
QMVHKQN 


10267 


24168 


A 


10343 


53 


408 


SFQFFCSVFSPSLWFYIiLVFDDGDVQM 
GFWCGYPFCLLVFLLTDKTLSCRSVGVP 
CRVRCQCA\LLGGASQLG\SRGSGVRDP 
IiEEAVCLLSDLQLRAGRTTTLF*AVRQG 
HLSLQRLL 


10268 


24169 


A 


10344 


265 


421 


NNFPPSPPRGGGNPGPPPPPRFFFFFLK 
NPGLPPF*VIYLL/YFTLTPSFCLWCYT 
ISVYGCGFVFFFFSFFFFFFEKKIFFFP 
RVEPORIPVYPKILFPG+NNFPPSPPR 
GGGNPGPPPPPRFFFFFLKNPGL/HPF* 
KKGAKFPDLINPPPWP 


10269 


24170 


A 


10345 


207 


420 


WQKGCPVTFMELGWGPLYPCFLHLWIQ 
PTSHRKYIHL/WLGMWHTCWPKTLGGQ 
GGWIW*GQE lETS LTD 


10270 


24171 


A 


10346 


115 


3 


APRGAVKKR/RPWMGAATHVCKPS TLGG 
RGG*IT*GQEY 


10271 


24172 


A 


10347 


41 


421 , 


FTFVSVCVHFKG I FLS FLS FFFETEVPG 
SPLGPIOIGGGGGNSGPWNPPPPGLRGSP 
PPPPQRTGKGGPYPPPPGNFFFKFFFRK 
RGV/PPL*HPGGPQSVT*GTPPPGPPKF 
WGAQSGGPVWGKNPLF 


10272 


24173 


A 


10348 


228 


1 


SKQVKEKYDSFLHKIQKQAT*NRTFFRV 
TKKHVEITKK/WAGHWAHAYNASTLGGQ 
GGWITLGQEFETS LANMVKP 


10273 


24174 


A 


10349 


405 




GGFLKRGPPFFFFLRQGLSLLPRLECSG 
MTVAHCS PNILDS SNLSLRRSWDYSHQA 
QLIFLVSF*VLGSHSVAQAGVELLAS/P 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartlc Acid, 
I>=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=iVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptDphan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KCWDYRHEPL/WPSQSQNLNSFFLDGFS 
LLLSRLECNAAI SAHCNLRLLGSS 


10274 


24175 


A 


10350 


301 


546 


RHIFMNLPFHNLVKIIFHCRGKTLFLDI 
SNPYLKKQVSSSKPNCK/HLKEI*LFLE 
AKDYWLGWPHR.YNPSTLGGQGRS ITG 


10275 


24176 




10351 


873 


1258 


FCYQCYFHLLHHLHHHRPSPRLEPPPPG 
AALPPVPEGRI IPLPCPSSGPRQTTGPG 
KGRRHPAPHRTPAASAASLPARRTLPAA 
PW* PLSFQNTTPGSA\ PGGGNQPASGAG 
GNSRT 


10276 


24177 


A 


10352 


184 


576 


GLSPLSHLKMAASSSEISEMKGVEESPK 
VPGEGPGHSEAETGPPQVLAGVPDQPEA 
PQPGPNTTAAPVDSGPK2VGLAPETTETP 
AG\ PQKQPRPQTSA*AQBGNQRPTAAPK 
THAQETVSKPEVSKEATAD 


10277 


24178 


A 


10353 


110 


384 


VPGSPRKTRSPQQMERVLAPKTAKKGCP 
RPGDHRSRLCJKrCRDTRARIKWflDSVERP 
PWAG\PVAHACNPNTLGGEGGWIT*GQK 
FKTSLVHM 


10278 


24179 


A 


10354 


123 


422 


RKQIANDKSKLFIINKYLNVKFEINSPIK 
/RERIiAGWIRKEKHDPSICHL* KTHLT\ 
KDTNSLKMEG*KKIYHANNY/QKKAGVA. 


10279 


24180 


A 


10355 


36 


426 


PPERPQYSARVCVCVFYTECRGLAPKPE 

crgaiimcllnvlgsq*ssnlrvwss* 
dytqlifklfvqtgshciaqagfell/s 
skdpptla/fpkhwdyrh\dhhsw 


10280 


24181 


' 


10356 


193 


3 


FQTRIFKISFFFKEFLKFKKFFFFILI* 
ESRSIAQAGVQWRDLGSLQPPPPXGSSS 
PPASASLY 


10281 


24182 


A 


10357 


205 


377 


SPWFQPYYRLTIPYQKC*LATVCI*VY 
* FFKL ILYPAI LLNS 1 1 1 WINFTSDPL* 
FSSHITG*LSLIKNVWVGPGMLAHA\*N 
PSTLGG/RGGWIT*GQEFETSLANMVKP 


10282 


24183 


A 


10358 


51 


376 


LCRPFQLCHGDRTDMMCPFPEELQDRCS 
CRSTSMRVAKGTTPCFLKVLQNIYTV*M 
FCRTFRKHGWPLATRMLV\ I YKNSGPV 
TLQGRGTTYLSCHHGKAGRVCRATQHAA 
GWVTKQRACI 


10283 


24184 


A 


10359 


295 


376 


ASELTSQSPGSLFENQKKGRCIAFLPC* 
KCNSE ILVHCWWGCKM\WKIVW* FLKTL 
\RLELVYDPASPLLAIYPRCI 


10284 


24185 


A 


10360 


451 


1 


CQDIKTILRWEQEVLVIPSVTKYKQVGK 
S FVSRRYLWSGCVLCYAFVTEAS KTWHH 
FP YRS PGFWGRSGGRQFDS * PKSQADFR 
APQGPPTPEPEGAERREPEIAPPPTADP 
SPKPT/HSPPRSGAPPSAPFPLLGEAER 
GVG\PGQGRPRP 


10285 


24186 


A 


10361 


2 


776 


f fflrws lals prle cs gai s ahcklrl 
agsrhspasasrvagttgaryharli/l 
/ylflvetgfhhvsqdglnllts *sacl 
glpk\c\wiyrrepprpayfifflrrsl 
alsprlecvqwcdlrst,qavppg\lhhs 
pasaslvagttgahhhtrivlill\fyy 
flrwsvtlscrle\agvqwhnlgslqap 
ppgftpfsclslpsswdyrcpppclanf 
lyf**rrgftmlarmvsis*prdpppsa 
sqsahdlfivgq 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

nucle- 
oiide 
location 
correspon 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10286 


24187 


A 


10362 


188 


365 


CPQLHTQS1ILINMDASC2RYQFELKIQAL 
/WAGTVAHACNPSTLGSRGGWI T*GLEY 

ATSL 


10287 


24188 


A 


10363 


146 


1035 


GWAVYQTLQAAWLGPRTESGVSFHGRFS 
LPHHPPQPTLHAWPAPPLQPEPCVPLRE 
DPFMPESW*LPPSIPAEGVSF*ECDFSR 
RMMGKGS PVPTVAP PLGPAPEPFCVTAG 
CAPMHTPTHTQN PPHCS I FLPKIS FKS K 
HFy*LLFCKCLSSSVFSLPESILLLFIV 
ESVCQPGERCLALNMGWDGKGSGRRWAT 
KSQGVMDIAATQTPA/VFSPV/PATSGP 
AGVSGGK 


10288 


24189 


A 


10364 


229 




plenaksiywgkgrllkn]srarlftva*s 
vinqpqfslmvdwikkk\mwrmyttesca 

ai * h /de ivscrgtwmeleamy 


10289 


24190 




10365 


186 


449 


RFFFFFPKFFWVFC-LKSPFPFFG/RLGI 
*EKDGGFFFFFFFFPEt4aSGSVA\RLEC 
SGAVSAHCNLQFPGSGSLPASAPRVA 


10290 


24191 


A 


10366 


445 


3 


RVDNGVRGRYRGKRGERSR* TRWGWGHP 
QRWDQDRSRERPRETGREEAAGRRHETE 
AAETGRRRRERGRSDRNHK/RGRPRERM 
IEAGRnaGRRER*ARQESWTEKKTARQE 
EGDHD/RQGHQETHAHKGWGIIN*D*GRQ 
ABRSENRRMY 


10291 


24192 


A 


10367 


1 


491 


ktapfptslesafvwavqdrvs/llsp 
gpecrgaiXlthcc/lrtpg/fkpishl 
s/lps/swdyrhvpppldklilcrdnvs 
qsglkllassnpltsafpst/ydyrret 
pgprasp*tqmqpcfmp/rs*sqp*tqi 
a/r*qsnwgsq*vijsinrelafgtmffgq 

RPKAFF ■ 


10292 


24193 


A 


10368 


572 


40 


lhsfrqhfsntypqctpgtp*vtqslaa 
rpaglpeptsasggagavtppprtd\pp 
svewsgkkgldpvlrgqaprpspanpap 
apqgpaprkgtecvptasktprpggrwe 
paapphahpgkdlqppstlvkecpeaec 
praltgdlpssgenrnglsafqgqrrpa 
qksgencen 


10293 


24194 


A 


10370 


36 


517 


pflfsdidflfsfyflsffff*krglil 

PP/6PEGQGGHKG*WPLHPRG*REPPNL 
TPQTRGNKGNRQTGPVIFGILRKNGFPP 
FGQGGFKPPGLREPPGLRLQRGGNNGCiN 
PYP/P/PGKFF*RTKRGEKSPQPPYNKP 
NKKFNPAPMEEFSSFKEGRGPSIiN 


10294 


24195 


A 


10371 


60 


403 


DVEQPKLRSYPTLKCKPRSAILTICPWA 
LQYRSHGLIFGFSWGFLSFETESCSVSQ 
AGMQWCDHGSLQT/PTDSTSQAQAIDPP 
ASAP\QAAGTTGRCHHTCLIFVFL*RWG 
LAKP 


10295 


24196 


A 


10372 


358 


33 


DSLGVPGAPLLSPGVTPHKQWFFRPYTQ 
IKKLGFFPQIFFGFKV* KAPFFFWGFGN 
LKKKGGFFFFFFFSEMASGSVA\RLECS 
GAI SAHCNLQFPGSS SLPTSAPRVA 


10296 


24197 


A 


10373 


264 


3 


RQTSVSGSNDGKSWPPADIDTI \KYVIG 
NRLITTQQ*LIKLTSKQMITIHNTKGRT 
*SLILVSLIIFIATTNLLGLLPHSFTPT 
THCI 


10297 


24198 


A 


10374 


298 


384 


TLVI*KDGVHTTPHHQQHaYPGPKEYML 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 

location 
correspon 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HKAPLLHSVSRDPSPFAQSSNCYNRSIK 
QEPVDPLTQAEPVPRDAGKMGKTPLSEV 
SQNGGPS/HK*GQYSGDV 


10298 


24199 


A 


10375 


84 


518 


IMKLRQKSRSSLKIMRS KLQHTRAS /GT 
QLR/PMLGGNFIALH/AHIKKLGRTQIN 
NLTSQLKELEKQD/RNPKARGRQEIT\K 
1*AELKETET* KKE IQKI YE FRV/ MFLE 
NINKP\PASRLIQRMIERHTIRNDKGNV 
TSAPPDV 


10299 


24200 


A 


10376 


70 


363 


FHQKNLVPGLWVQLLGRRTLQSQHPPAP 
PPSSSPEVNVPSEVQAAD*LCCIFYDVK 
VRKCSTPQKKTRKRKKGCLF/CAPGQTR 
/CLC*iaiGEKNLVGKT 


10300 


24201 


A 


10377 


299 


2 


YPVAGRGNR*GCGAHaLGVGGAAMPLGA 
TISSVSLTISSLCAFPRRGGLEPSLPAP 
HTPPPPPE/P*ASSPSRGEWHLVPPCMQ 
PGLPPLPPPRIiSPLV 


10301 


24202 


A 


10378 


302 


3 


WPTVATTCSCERKSHTSLT*NQKLEGTQ 
LSEEQMSKAEIGQKPGLCHQT/VQWNA 
KKKLLKEIKNATPVHTQITRKXQNSLIA 
DT\EKV*VIQIEDQTSHV 














NEYVFLEKSLKLAGRVGSCLLGRPRQVD 
HLSSGAQD*PGQRGKTPSLQHIQKLSRH 
GGMCP/RRLRWEDHLS/SRGG*GCSETR 
ST 


10303 








458 




ITISTISCKATSFEEAPRASRPPPDEWV 
SQAGPAPLRGWEN*GTRHEDDWPEASLK 
DPRRSGEPEAPL/PLRLML*SRLPPPLR 
WRRRPPPPDSHTPSPGNGKSPRVPRTGS 
KDKRPEALPP* FAAGEPLEANVPPVQHG 
GQSVRSRGSYIiTAT 


10304 


24205 


A 


10381 


327 


2 


LSAPKKFPPQKSLFLGANFRSPPFPGRP 
FTNRFSDUIVNPLTKKIYFSHKSSNPPT 
FFFFFFETGSNSVT* SGVEWRNHNSLRP 
ESPG\SGDRP*V*HTTGAPYHTWLV 














NTALETDTTFSGGGGGHQLCVRSRVSGG 
G*LIMPDLHNHPLQQTSGRNQRLQGEVT 
DPSPGMSGGAQTGPKAPPRPPSALRHSG 
SLQGQP*FAHLYCGTIPYPCYQTAPLPR 
APPPTPTGH*CPGEA\PQDELPVCGCSP 
WPRPPGQAQAGRNKH* FSGQRSSRDTTA 
CSPCSQSKQSPSRPPCVWDS/PGQGPGP 
KAPAGPPPALPEASLGSVHCSP*VGVMT 
PTRPGHFLVPGKVTLEQftPGGGASGGPV 
P 


10306 


24207 


A 


10383 


39 


433 


SHDQTYTEEELLFMDEQRKWFLLMESTP 
GEDAWWEITTKHLEYCINLVDSTVA/ 
GVSNFERRSTVGQMLSNSIVCYGKIF/R 
*KR/SQSMWQTSLLF\NKLSQPP*PSTP 
TILISQQPSMWRQDPPSEERL 


10307 


24208 


A 


10384 


283 


1 


NVFSCKPTSSCLISPDSPTPDRRFSKAD 
TYRNFI«IFPLRRHTHMHTHTYMHTHVHVC 
TQTiCLHLRPH/lTHGLSL/TCAPPVL*R 
PSFKPLHPTPV 


10308 


24209 


A 


10385 


12 


471 


SPPWWTHGFRYWEPTPPRFSVPHDALGQ 
FHTALFRLTRRFQLLSSCINISPHTCLP 
PCLGLCVGGPLEPAVCWAR/CPP*RGMV 
SGWKASPLG/CPPTLLPPC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

peptide 
sequence 


Predict- 

nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalaiiine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionin6, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threouine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=UnknDwn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10309 


24210 


A 


103S6 


287 




SFCVRSRPAPTSGRTPFPAAPPFPAAPP 
FPAATTPSTAEDGVPG*G/PAPAANCSP 
HSSQRGSAAMGIPLPQAS/PPAPTPAPP 
SP*SLPFPPAPMY 


10310 


24211 


A 


103S7 


336 ■ 


3 


LIEGKLTNRKDIRSKTASVEHHHQSPKV 
DKPTKMGENQSRKAENS \ KI KSASS PPK 
ECSSSPATEESWMENDELREEGFR*LVI 
TNFSELKEKTGKLNPAALVCGRSEDQTT 


10311 


24212 


A 


103S8 


15 


470 


APSPDAMGR\FTEEDKATITSLRCKVNV 
EUAGGETLVKLQVVYPWTQKYFDSFGNL 
SSASAIMGNPKVKVHGKKELTSLVDAI K 
HLDDLKGTFAQLSELHCDKLHVDPENFK 
LLVNVLVTVLAIHFGKEFTPEVQAYLHK 
MVTWANSIiRYK 


10312 


24213 


A 


10389 


3 


438 


QTQREPTMVLSPADKTNVKAAWGKVGAH 
AGEYGAEALERMFLAFPTTKTYFPHFDL 
SHGSAQVKGHGKKVflEALNWAVAHVDDM 
PNTLSDIiNDLHaHKLLVDPVNFKLLllHC 

lletldghlsaeftp\avnasleqftaf 

EITWLT 


10313 


24214 


A 


10390 


2 


444 


TMSFNTIiRICLDCLCSTLQPTRSIPGYP 
SSPLPGHPTPPMTPSSSVPYMSPNQEVK 
S PFLPDLKPNLNSLHS SPSGSGPCDELR 
LTFPVRDGW/SGALPPAAQPGC 


10314 


24215 


A 


10391 


405 


3 


KM I LTCLALPAQHCPPSACPQTMAQP PL 
PLS I KGAftSMSDKLPYKVADIGLATWGH 
KALDIVEN/EMPQLMGMQELYSASRPLK 
GACIADCLQITVETAILIETLFSLGVQE 
QWSSCSIFSTQEHAVAVFAEAGMP 


10315 


24216 


A 


10392 


248 


5 


RLNKVGGHGGNYEHLYVNKLETLQEMYK 
FLDIYWLPNLSLEEIE\NIiNRPITSNKF 
ESVIKHFPTKKISGLDGFTAEFLTNN 


10316 


24217 


A 


10393 


262 


3 


LPEDRQAGRS QGPWPQAVGRARKMPPF 
ISYQVYSKNFFEIESCSVTQAGVQWENl 
SSMHPPPP\6SSDSPASASQAPGIPGAH 
HHA , 


10317 


24218 


A 


10394 


287 


410 


ALKGSSRILIMLdAWLTPVIPA/LWEAK 
AGRSSEVRS 


10318 


24219 


A 


10395 


175 


1 


GGREVDDEIPLVRESTIFHIFFFFFETE 
SHSVAQAGGQWRDLGSLPSS\PPG\SCH 
SPA 


10319 


24220 


A 


10396 


661 


963 


STLIAFIVISTLFPLLDMTEIYFSLLDE 
IVDTLGEGAFGKWECIDHKAGGRHVAV 
KIVKNVDRYCEAARSEIQVLEHLNTTDP 
NSTFRCVQ\MLEWFEHH 


10320 


24221 


A 


10397 


1 


268 


PQTHEEATMGLS TADWTNVKAGWGKGGA 
HAGHDGAEALERMFLS FPTTKTYFPHFD 
LSHGSAQDKGHGKKVaDALTII\AVAHVD 
DMRNAL 


10321 


24222 


A 


10398 


33 


302 


SRGATLIYVDNENGEPGTRWAKDGLKL 
ESGPS IKRLDGRSQVSTPRFGQTFDAPP 
AIiPKATRKASGTVNRATEESV\KTKGHL 
VQEHP 


10322 


24223 


A 


10399 


16 


233 


PTDYEFNTTTYRECGPRRPEFSTSLDLL 
SQPCRAVYM\VGEK1IDIPFELRIVDLIT 
GKS ILGFGEPKSQEGDR 


10323 


24224 




10400 


177 


1 


KLLDLGQPQSPALAACLEYSGAIAAM\C 
NLCLBGS SWPPTSASRIAH I TGVHHAE P 
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SEQ ID 
NO: of 

sequence* 


SEQ ID 
NO: of 

sequen^ce 


M 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

loca^ion''^ 
correspond 
liig to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

"tide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

G=Glyeine, H=Histidini', I=lioleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /'■possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YA 














HVNKDTLESHLVWKLKSALLGRKS ES VT 
GPS CHPEPSDKTVEL/WQGAVAHACHLS 
TLGGQGGWITLAQEFETSLDNIV 


10325 


24226 


A 


10402 


104 


3 


GIILIPKPGRDTTKKENF/RPLIPIDAK 
ILNKILftN 














SOVMAVAljpAPtiAtiARPELDlLQFLQRFL 
QILKVLFPSWSSQNALMFLTLLC/LTLL 
EQFGNYQV6LIPNQYYGVLGNKDLEGFK 
TLTFLAVMLIVLNSTVESFPLSVSLVLS 
SPV 


10327 


24228 


A 


10404 


3 


404 


ARAQRRGDLSATGRNWSPLPPAGLPATV 
VLRHSGSLMaATCEISNIFSNYFSAMYR 

s Nedytlasvppaatfgaddlvltlsnp 

QTSLEGTEKflHWMG\EQSQFWPKTQ6LD 
WIS FQAEKNKYEASAIGFPRWDK 


10328 


24229 


A 


10405 


254 


443 


nqlssimvmfkkirsfeedfndpekvyg 
sgdkvag\rlivevcedtrvkavrilac 
gvakelrm 


10329 


24230 


A 


10406 


26 


461 


GEVARRSCCGAMAGTALKRLMAEYKQLT 
LNPPEGIVAGPMNEENFFEWEALIMGPE 
DTCFEFGVFPAILSFPLDYPLSPPKMRF 
TCEMFHPNIYPDGRVCISILHAPGD/DP 
HGLREQ\PERWSPVQSVEKILLSWSML 
AEPNDES 














qqaakmaens<Sragkssg/t /pargrgr" 
c/paeqviagfnrlrqeqrglaskaael 
emelnehslvidtlkevdetrkcyrmv6 
gvlvertvkevlpalennkeqi qki let 
ltqqlqakgkelnefrekhnirlmgede 


10331 


24232 


A 


10408 


180 


446 


kkaaasvsaasgshlsnsfaepsrsngs 
mvrhs s s p yvry psdkpflns dlrrs ps 
kptlgypesnsraifsal/knlqd/kir 

RLNL 


10332 


24233 


A 


10409 


1 


417 


eaglvtmeeigilgekaqdeipalsvsr 
pqtglsflgpepedledlysrf\kklqq 
eleflevqeeyikdeqkklkkeflhaqe 
evkr iqs i plvigq fleavdqntaivgs 

TTGSNYYVRILSTIDRELLKPNASVALY 


10333 


24234 


A 


10410 


305 


53 


leaqphappllghhlpcwripshlgtcr 
rpramrvqgseeegva/gqapplpwvhr 

GSLPLVRRRPAGKPTGGNAREEKGKVEG 


10334 


24235 


A 


10411 


419 


2 


NNSRKFGNGTLKLVEVE I YP L I S E S I PD 
KSELLLTPQDKKHAPLPSAVPNPSA/CP 
DLEPHTTPAS PVS \ PAKTVETRPSAPQG 
PLPGPVRRPHGRFS VRAAS PQTS GRPGA 
PRPPGPALREATDAPRAATPPIAALAGH 
S 


10335 


24236 


A 


10412 


74 


322 


MDKMATTQ I S KnE LDE LIE AFAKVDLS S 
NGFICDYELHELFMEANMPLPGYKAHET 
IQKLMLDGDRNTDGKIS\YDEFAYVSM 


10336 


24237 


A 


10415 


1 


415 


PQTHREPTMWASADKTNVMAAWGKVGA 
HAGEYGAEALERMFLTFPTTKTYFPHLD 
LSHGSAQVKGHGKMVADALTNAVAHVDD 
MPNAHiYALSDLNAHKLWLDP/VUB'MLLS 
HCLLVTLVAHLPDE FTPAVHACLDKLL 


10337 


24238 


A 


10416 


19 


454 


APSPNAMGHCTEEDKATITSLWGMVNVE 
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SEQ ID 
NO: of 

sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 

peptide 
sequence 


Predict- 

otide 
location 
correspon 
ding to 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V^'Valine, 
W=Tryptophaii, Y«^rosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














DAGRETLGELLVVYPWTQRFFDSFGK\Ii 
SSDSAIMGNPKVKSHGKKVLTSLGYAIM 
HLDDLKGTFAQLSELLCDKLHVDPEWFK 
LLGNVLVTVLAIHFGIEFTPEVQASWPK 
KMNAVA 


10338 


24239 


A 


10417 


2 


395 


ALSSSQPSAPMHPFYTRAATMIGEIflliA 
VSFISKFLRTKGLTSERQLQTFSQSLQE 
LLABHYKHHWFPEKPCKGSGYSMIGVirP 

\kmislxlqacieggi 


10339 


24240 


A 


10418 


2 


374 


LSTPHAFGTGKWGTFISLLFLFSSAYSR 
GVFREnSHKSEAAHEIKDLGEEDFTALV 

viafaqhlqqwpfedhXvqlandvtefa 
ttcaadessencdqslhtlfgdllcsra 
tldeshcvqdgep 














DPKAQLPEPLRVLWQLTYAMAAGSR/TS 
LL/LAFALLCLPCFKAGPPNRPLSRLFD 
HAMLQAHHRTTAIDTYQEFENLYPKD/Q 
QYS/FLMTPTSSALDSIPTPS 


10341 


24242 


A 


10420 


166 




NFLKKPGPFQKFFQRGLPLTLTFFFFF\ 
FETKPHSVSL\LECGGTVSAHCNLCLPV 


10342 


24243 


A 


10421 


2 


251 


LGCTQHRSQELVAAATSHQTCIQASEDV 
KEI/FARARNGKYRPLKISIENGQLMIG 
SY/SSQPSDSWDNDYDSFVLPLLEDKQL 
CY 


10343 


24244 


A 


10422 


251 


1271 


KEDLSPRAPMSGTQSTITDRFPLKKPIR 
HGSILNRESPTDKKQKVERIASHDFDPT 
DSS SKKTKS SSEESRSE lYGLV \ QRCVI 

AAMRA\GVQTGDRI I KVNGTLVTHSNHL 
E WKL I KSGSYVALTVQGRPPGS PQI PL 
ADSEVEPSVIGHMSPIMTSPHSPGASGW 
MER ITSPVLMGEEMMWHNQKVE I LRKM 
LQKEQERLQLLQEDYNRTPAQRLLKE I Q 
EAKKHI PQLQEQLSKATGSAQDGAWTP 
SRPLGDTLTVSEAETDPGDVLGRTDCS S 
GDASRPSSDNADSPKSGPKERIYLEENP 
EEQKKG 














QRNMVGQRLVEPRRLKPGFINVKSYNGD 
WEWHQGFFLVCFFFLRWSLALSPRLECS 
GAISTHCNLCLPGFKQFSCLSLPSSWDY 
RHAHHCTQL I FVFLVEMG\ FHHVGQAGQ 
GFFSI.EKSLTII 


10345 


24246 


A 


10424 


2 


343 


PQTQREPTMVLSPADQTNVYAAWQKVGA 
HAGEYGAEALERMFLSFPTTKTYFPHFD 
LSHGSAQVKGHGNKVS\DALTNAV7iHVD 

DMPNALSALSDLHAHKLLVDPWFKLLS 
HC 


10346 


24247 


A 


10425 


31 


342 


RAAVMPREDRATWKSNYFLKI IQLLDDY 
PKCFIVGADNEGSKQMQQIRMSLRGKAV 
VLMGKNTMMRKAI \RGHLENNPALEKLL 
PHIRGNVGFEITKEDLTEIR 


10347 


24248 


A 


10426 


196 




KNLIISQTKKVICPNTLFLVFPKRVFFF 
FFFFETESRS/VSPRLECSA\SISAHYN 
LCLPGLSDSS 


10348 


24249 


A 


10427 




405 


RLECWLEPPHGAGLQGLGWVWSCSVSTG 
PTMQALV\LLLCIGALLGHMSCQNPASP 
PEEGYPDPDSTRALEDDEDPIFIDPENK 
LPEAGSNFGYDLYRLRAIMSPTSNVLLP 
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SEQID 
NO: of 
nucleotide 


SEQID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

sequence 


Predict- 
ed end 

otide^ 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide Insertion 














PLMVATDLWDLSLEAQQLTESI I 


10349 


24250 


A 


10428 


223 


361 


EGPTEENMAflKVFES/ IGKFGLALAVAG 
/GPVH SALYNVDVGHRAVI FE 


10350 


24251 


A 


10430 


3 


359 


LTQREPTMMjSPGDMTHVKAAWGKVCAH 

agdygaealermflyfpttNktyfphfd 

LSHGFAQVKGHGKKWDALTNAVAHVDD 
MPNALAaLSDLffiUHKLRVDPVWFKLLSH 
GLLVTLA 


10351 


24252 


A 


10431 


273 


1 


NHKDGECKQSGKTEKSKNQSASPPPKERS 
SSPATEQSWTENDFDEFREEGF I \RQSN 
YSELKEEIRTHGKEVKNLEKKLDDRLTR 
ITNAQK 














SFLWKFCIiRGIPSHVSCQSA/LLIiGGAS 
QLGYSGVRDPLEEAVCLFSDLKLHAGRT 
TTLFNAVRQGHLSLQ 


10353 


24254 


A 


10433 


281 


2 


TNQEKKKGDPNKIRSGKGDTTTDNTVTI 
I RD YYG/ HTV TNKLKN/LEE VDKF LDTY 
YLPRFNQREIENMNQPITSNEiaSVIKS 
LQTKKSPGPMAS 


10354 


24255 


A 








SFLWKLRLRGAPGCMRCQSA\LLGGVSH 
LGYSGVRDPLEEAVCPYSDLNLCAERTT 
TLFKAVREGCLSLQKF 


10355 


24256 


A 


10435 


133 


2 


FPKPPGNFFFFFETESHPVTXRLECSGA 
ISAHCNLHLPGSSDSPA 


10356 












KGGVFFFFFFFFETESdSITQTGAQWHT 

\YGSLQPRPP 














KGTRKSLTFJEQLLSAKYWAAGCMHYFR 
GSGKYSNGYPVFVFVFIFSETESCS\VA 
QAGGQWCDLVSLQPPPP\GSSDSPGASF 


10358 


24259 


A 


10438 


354 


2 


KRG LKKM PFFNPRNMGE FL KLWGPNP KK 
I FKNCKI FKMGLKKKVLGFYKNSKNLKR 
GKLWPFVIFFFFFFFFFFFLETGSYSIA 
\RLKCSNSGTNTAHCSPELPGSRDLPDS 
TTQVAG 


10359 


24260 


A 


10439 


126 


^ 


gmydcvclfcfaikegrerekerekerk 
rerqreker/ererdkerkreerk 


10360 


24261 


A 


10440 


29 


347 


I QKPTADTKKFFKKKLKYTTRENHLHNK 
EDRKEGKKEEKTTKQHGSSKSLLINKTL 
NINGLKSPIKTHRVAE/WNKNKDPMLYC 

iqetpftykdthrl/riqgwkkifh 


10361 


24262 


A 


10442 


405 




IFFPHPFFFKKSFRLSPQVEIQGIFLGP 
WKLFPPGFNNFS/CPNLPKNLGFKGLPP 

rpsyffrifckkkgf/h/rvypegfpfl 
algffplfpfkswgfqgdppswgpffff 
fffvlvaegtlsegcrlirgrwknlvp 














LYNDP IS lYQVTLYRRVFFWVLYSAlTOS 

lrwkanflffsffkmasdpvaqagvqwc 
dldslqpqpp\gl\sdtptlasqvsett 
g 


10363 


24264 


A 


10444 


171 


3 


rhptmagssemflgsgfgcrltvfvlfe 
tescsvt/regsgaisahgylslpgssn 

LP 


10364 


24265 


A 


10445 


173 


3 


ci»inwcfsififlkinfileteshsvsq 
ag vqk \ l i ti/hcslklmgssdpp ssasq 


10365 


24266 


A 


"10446 


187 


329 


SFLWKFCLRGVPGH\fRCQSA\LLGVASQ 

lgysr\tjjpleeavc?fsdl 


10366 


24267 




10447 


27 1 226 


ysypvylllhiflcwkgslyfiyfvfem 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid' 
residue of 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threoiiine,V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ESCSFAQAGMKWCDLQPLPPG\SSDSIiA 
PASQVAGVDAA 


10367 


24268 


A 


10448 


119 


296 


SP IHYILVI I ICFIHVILLI IIFSEVGS 
HS VAPAGV\SGAI lAHCNHKLLGS S S P P 
VSVS 


10368 


24269 


A 


10449 


111 


363 


slkklvfffs lythnffpffdptlknst 
rhslylslslslflshsv\srlecggti 
iahcslnlpgssdphtsmsqvVagptgs 

C 


10369 


24270 


A 


10450 


218 


1 


QRFWGPGPNKVFKNLGSAFFLGRGKLFF 
LE IWGGFFFFFLVFFLRQSHSIAQAGVQ 
WCD\ LNSLQPLP PRFK 


10370 


24271 


A 


10451 


179 




GPFKLTLFFMGKFALGREKCHFFFFETE 
SCTVAPiMSLQWRDLGSLQPPPPVGSSDS 
PAS 


10371 


24272 


A 


10452 


71 


232 


SS PLSDLLFNIVLEVIiAKDIMQEVEy I K 
LILIGKDEVKWSLIVDKI ILYVKNSKH 


10372 


24273 


A 


10453 


153 


3 


ARGELYFLEIGGQFFFFFLVFPLRQSHS 
IAQAGVQWCD\LNSLQPLPPRPK 


10373 


24274 


A 


10454 


338 


221 


MGFLHVGQftSLKLLTSICP\LGLPKCWD 
YRHBPPRPVIPS 


10374 


24275 


A 


10455 


214 


366 


YNYKYLIHYYSYLFFIFI FF/NI IHYI F 
FILFFYLTSFIFLSYFFFFFITLF 


10375 


24276 


A 


10456 


155 


2 


DRVSLLLPRLECNGAILAHCNLCLLGSR 
HS/PRVAGITVAHHHARLIFCIF 


10376 


24277 


A 


10457 


870 


1249 


EGPRWADHLRPGVQDHLGXQHGETPST, \ 
QKIQKIiAREGGTC/LCQSQLLGRLSQEN 
CLNLGDGGCSEPRLHHCIPTWGDE\GDS 

PKKKKE 


10377 


24278 


A 


10458 


222 


404 


KQRPGRAQWLTPVI PKLWEAELGR \MLE 
ARTLRPAWAT 


10378 


24279 


A 


10459 


122 


362 


LSEDQLNCYPQRLHHF IFLPVI TRVQMS 
HYLCQQSCLFHFF/SFFFLYFEMESSSV 
TNRLECTGMISAHSNLYLPGSSQSSG 


10379 


24280 


A 


10460 


3 


102 


AASTLALSPRLECNSAILAHCKLH/LPA 
FTPFSCL 


10380 


24281 


A 


10461 


133 


339 


KLS KFQGSKKPE IKANSLDLALFYHNYI 
TRQCFTSYSNFFGDRVSLLSPRLECNGV 
ILAHCNL/RTPGFK 


10381 


24282 


A 


10462 


3 


193 


ASNMMTELKSQIPILTLNVNGtiNAPLKR 
HRVAGW I NDPTI CYLQETY FT/ CKNNHR 

LTAK 


10382 


24283 


A 


10463 


126 


314 


RLGGS T I KVQRDLS SDRSKPGRFLYTSN 
SSYVRKGKRNKACFTERVP/ PVNDI I 


10383 


24284 


A 


10464 


1 


332 


KKNLFLSKDCLSSLFRFSKSPPAILGPP 
RGWTPKTRFIGPKAFFFKKGSLSLGRKY 
HFGLKPVP/CPPFLSKTRFLPGVPFFFF 
FFFETESHSVA\RLECSG/AISAHCKLR 
L 


10384 


24285 


A 


10465 


173 


365 


RFVCSTIKVLRDLSSDRSNPGRFLSLQI 
PP XLRKDKRNKAYFTK/RPS PVTTD I IS T 


10385 


24286 


A 


10466 


335 


3 


PFKKFYFKISTRALLEPVFFGKKIRVPP 
QRLSPIWAPLFKRGPPNSFQVNGVPLFF 
FFFFVEHGLALSSGVECSGTRLECSGAI 
/STAHCHLCHLDASHPPTSASWVARPTG 
T 


10386 


24287 


A 


10467 


818 


1008 


VHTDVLADI LKS lOTTAEKEQT/RRCFLS 
\MLQGHCPFLTVMMQHGLCGVELGGLPG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nu cleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unl4nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PGDFCKRPR 


10387 


24288 


A 


10468 


111 


1 


MVSFGRPRQADHKV\GVEDQPGQHGETP 
SLLKNIRI 


10388 


24289 


A 


10469 


204 


1 


LWGPILKIFCQILKTICEGNFRGPTFFI 
FFFEMESQSVTQARMQWCDLSSLQPLPP 
6S\SNS RASAS 


10389 


24290 


A 


10470 


190 


330 


ERIKKQDLSICCLQVTHFTFKDSQRLKV 
KGWKK\ IFHTNKNQBCRIWT 


10390 


24291 


A 


10471 


162 


2 


SCLQAS.VQMHDLSSPQPPTP\GSSDS 


10391 


24292 


A 


10472 


120 


3 


SWGFFFFFFXEVESYSVAQVGVQWCDLG 
SLQAPHVMILVK 


10392 


24293 


A 


10473 


298 


1 


RRALPCPANFLYFYFLSLLLFFETEFHS 
\VAQaaVRWHEIiGSLQTPPPG\SSDSPA 
SGEYSANYTGEYSANYTGEYSANYTGEY 
SWYTGEYSVNYTGEY 


10393 


24294 


A 


10474 


1 


380 


AVTGRAGSMVAPR\PLRRLVLFYQGKLR 
SMAGNFWQSSHYLQRILDKHDLLNERQK 
DLKVLSEEEYWKLQRFFTKVIQALCEHL 
KLRQPGIATATAYFKRPYARHSLKSIDP 
VLIJIAPTCVCLASTAN 


10394 


24295 


A 


10475 


289 


2 


TQGFPPINNS FYALYPKQKI PFFSDKLF 
RVKGFKGPVFHYGVKDFKNKDGVFF IFF 
FEI!tASGSVAQAGVQWRlILGSLQPPl\PG 
SSSLTTSAPRVAS 


10395 


24296 




10476 


138 


402 


ITGRAPPKPFPLKRGYLSGGLTNIGFS F 
SFSFFETESRSVTQAGVQWRNLGSRSWL 
EST/SPDSSNS/PAPASRVAAIT 


10396 


24297 


A 


10477 


2 


336 


ARETNDTTQTRLLPANNTMTRSDPYI S I 
LTLISrVNGLNTPIKR\KRVASWINKQGLM 
GCCLQETHLSSHETHNDTHKLKIKTWRK 
lYQflWGKQKKflEW/lLISDKTDFKP 


10397 


24298 


A 


10478 


51 


250 


RKVS LFKKWCWNTLFI CKK\LS LDTYRF 
AK INFEWI TDLNVREKT/ I KLLKGNI KE 
NICDLGLGKDFF 


10398 


24299 


A 


10479 


216 


2 


gftrkalsgffktgwfwvfpnfsklffk 
iojfserfffffffemeshsvtXrlecsg 
p i s ahcnlhl pgs s lv 


10399 


24300 


A 


10480 


191 


3 


FSTDGNVWCLVKYSiyiyiYFFFFLIFF 

femks CS /vs prlecngalsahcklclp 
gs chs para 


10400 


24301 


A 


10481 


135 


768 


lggqvlscasqpvslfplffflisflgc 

hpppkvlsshp\ppcsrp/cpqppkg/p 
plpkhacpp 


10401 


24302 


A 


10482 


126 


333 


mwgqrycvmcfvi wlclfllv/www 
flw/csvyivywygyawlfiffffffs 
llcfavclfvwlldrf 


10402 


24303 


A 


10483 


240 


2 


ateknmglgtsvrglsgsnpsygrvrer 
I GGGVCLWSQ I llrgc \lrwedhlglgg 
ggcseprsshctptwvteqdpvlv 


10403 


24304 


A 


10484 


136 


2 


nrpsyifsfffeteshsvaqagvlwhdl 
sslqplpp \gssds para 


10404 


24305 


A 


10485 


161 


328 


EKNFKPYGSCRHFYLFlyFETESRSVAX 
ELECSC-TISAHCNLRLPGSSN/SPAS 


10405 


24306 


A 


10486 


"193 


347 


ILF3FLLFIYFFETESCSVAQAGVQWHD 
HGSLRSRAPG\SYNAPALDSQGTE 


10406 


24307 


A 


10487 


114 


2 


GAWQSEVQWRDSGSLQAPPPG \SRHSP 
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SEQID 
NO: of 

sequence^ 


SEQID 
NO: of 

sequence 


M 
eth 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 

location''* 
correspond 

sequence 


Predict- 

otide 
location 
correspon 
ding to 

acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

^SlydrK;fl^His«dine*l=Iso1le^^ 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=TryptopIian, Y=Tyrosine, 
X=Unknoivn, *=Stop codon, /=possible 
nucleotide deletion, \ppossible 
nucleotide insertion 














ASASQVAGLV 


10407 


24308 


A 


10488 


314 


49 


IRRSTERSEPRYSMVCRRQPRQGGPCLX 
RSQLLGRLRQEDHLS PIAGDCSEPRLH\ 
LACTPAWATEQEPVLKKMSPTLFCLFLW 
PLPCQ 


10408 


24309 


A 


10489 


291 




KSPPPPNRRGLTLRSRGQIGNGVSFYYE 
APVTDFI FFFQSLVLSTQAGLQWCNLSS 
LQLPPPG\SSDSPASASRVAPVSQGHAT 
APQPGRQTLSSC 


10409 


24310 


A 


10490 


226 


3 


GSRCTMPRRRLAHAFPAARMPKRKVSST 
EWATEEEPKRRSARLSAEPAPAKVETK/ 
PEKAAGKDKSSDKKVQTKLV 


10410 


24311 


A 


10491 


2 


153 


arglvffvylvemtlchvaqaavkplts 
seppXlglpkcwdyrrepprpt 














THYLCLLRFFFLDRGFLLLPRLDC/NAP 
ISPNLNLRLPASSDSPASASRVTGITW 
NHYARLTLLFIFQNETLF 














I PGWPFKNFSLFPLFFFFFEMESCSVAH 
AGV/GSSDFPASASQVAGTTSMRINAWL 
IFVFFSRN 


10413 


24314 


A 


10494 


373 


1156 


KVQLKVYLFFFLRLSLALLPRLECSGTI 
LAHCNLPPSRFEQFSCLSLPSSWNYRRP 

pphharlifvl/svetgfhhvgqa6lel 
ltsgdppasa/fpkcwdyrhe 


10414 


24315 


— 


10495 


181 


3 


sflwkfSlrgvpgrvecqsa\llggasq 
lgysglrdplqeavcpfsdlklcagrtt 
tlfk 














aasttrlgdrdqpgqhsetpsllkiqkl 
g\sggghlrlrqenclnp\ggrgc 


~i04i7'~ 












T qswfnt lksihv thhsnrlk/ e vnHi V 
stdaekaadkiqhpflikkq 












LVKKMLTGSTLGKSYRHSPFS INQGHNA 
LRiaAG\PLPRI<AGY\IjQGFSPLRYGL\ 
WDGKDLTIHQPDTREGSVLSRISKRGRP 
L/CSCLPLGTECLSI 


10418 


24319 


A 


10499 


138 


1 


WYGLIYLLLEMESCY VAQAGVQWHDLP S 
LQPPPP\NSSWSPASSC 


10419 


24320 


A 


10500 


138 




WEGLI YLLLEMES CS VAQAGVQWHDLGS 
LQPPPP\NSSNSPASSC 


10420 


24321 


A 


10501 


200 


343 


LEAHSFLSFFIFIIIFFDTKSHSVT\RL 
ECSAAISAHCSI.LLPGSSNS 


10421 


24322 


A 


10502 


84 


339 


PRDGKRRDKQKQETGGEHYKSQLRGRLR 
QEDHVSPGGRGCSEPHSHHCTPAWAT\E 
SNSVSKKS 


10422 


24323 


A 


10503 


141 


362 


EWNCCVCC/CCCCCCRCF 














tikkqkntgglifnsyilpplflepgdl 
\rfldvdnrvklpv 














gypcnpilyymstvgfttslldfdiple 
hpilkvcpthtralrdlssdhsnpgril 
stskssly/ekdprhkayftk/rpspvn 

DUST 


10425 


24326 


A 


10506 


16 


375 


KKFFKKKKKKIFFPPFPIiKNFFFPKRVN 

ffgrgwpkksppkkkvffkkfpggfkkp 
plkkkki ffsppvkmgppkgffkkgppp 
ffffffffffffffffffffl/lclipr 


10426 


24327 


A 


10507 


156 


2 


AASTTRLGDRDQPGQHSETPSLLKIQKL 
G\SGG6PLRLRQENRLNP\GGRGC 


10427 


24328 


A 


10508 


313 


2 


QSKWLLEEESTLGEDAVNIVEMTTKDLE 
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SEQID 
NO: of 

sequence 


SEQ JD 
NO: of 

sequence 


M 
eth 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

location 
correspond 

residue of 
peptide 


Predict- 

otide 
location 

ding to 
last amino 

peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-^Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














YraNLVDKARAGFERIDSNFESSSTVAK 
ILSNCIVC/Y/REILHEGKSRPMWQTSL 
SYFKELPQTTQHSAIITLTSPRA 


10428 


24329 


A 


10509 


325 


1 


ECAKEMNAETKRNIAIDATETQRI IRDC 
YEHYNKME1ILEA^^DKFLDTHNLPRLNHE 
EIENLNEPIMSNETETV/I/NSLPSKKW 
SGSEGFTAEFYQRY KTPLI LKLFKK 


10429 


24330 


A 


10510 


126 


14 


DSSGQ VQWLKP I IP V/ LWEPEAGRSPE V 
RSSRPACPTW 


10430 


24331 


A 


10511 


435 


769 


PFKFPPLSHACDGGSLFCRLRLECSGVI 
T\AHCKPQLPG/SQSDFPASTSQELGP/ 
TGACHYTRLICCREEVL\HVAQAGLELL 
CPKDPPTSASQHARDYRH\GHPAQL 


10431 


24332 


A 


10512 


147 


412 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYPTK/RPSPVNE 














QTCPTGSPASLLELPNSIATQWELVRSL 
IAGNALKDEND\SAVKMLASLIMSYKAA 
AAEDYKADCPPRNPGP 


10433 


2433 4 


A 


10514 


286 


1 


SFLRKFCLRGVPSRVKCQSA\LLGGASQ 
LGYSGVRYPLEEAVCPFSHIiHLHAGRTT 
TLFKAVRQEHLSLQRILLPLVGRVGRPN 
DSRCDRVIGH 














RF VCS T I KVLRDLS SDRSNPGRFLS TSN 
SSLY/EKDKRKKAYFTK/RPSPVNDI IS 
T 












2 


IKKPADSESAGALILDFPASQTVRNSFL 
LLTSHS VCYFCYSS PNGLRQQLKI 1 1 FF 
FFLRQSCSV\SRLECSGMILAHCNLCLP 
GSNN 


10436 


24337 


A 


10517 


62 


292 


PATLFSGVILPPTNSINAFSGSDSKSAH 
K/CEMLGTENFNFFTCHTH IIAEHSNS T 
HTHTHTHTQTHTHTHTPLLFYE 


10437 












RGKKI FFPLEKGKNLVWWITALFFFFFE 
TDSHSV\TRLKCSGTILYHCSLCLPGSS 
DF 


10438 


24339 


A 


10519 


42 


194 


ILVETF\CLRFVSLENLVKMRFLKLKKK 


10439 


24340 


A 


10520 


120 




I FAKRWP IKFFPFFFFFFET VFHS \ VAQ 
AGVQWCDLGSP 


10440 


24341 


A 


10521 


202 


375 


SFLWKLHLSGVPGHVRCQSALTGG\ASQ 
LSYSGVTDTLEEAVCPFSDLKLRAGKTT 
TL 


10441 


24342 


A 


10522 


272 


402 


PKIFYVTKKAWNYYPYTITEYTCSF/ IR 
KFFIH lETKYEDNKGSN 


10442 


24343 


A 


10523 


147 


439 


LILYVNYISRK3TVYDSRHPCTYYLIPE 
KWCYKYHCMFIiCT/SSSIPFPTPSTPSR 
ASPKKKKKKKKKKKKKKKKKKKKftRGG 


10443 


24344 


A 


10524 


7 


415 


RFVCSTIKVIiHDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/HPSPVNDIIS 
T 


10444 


24345 


A 


10525 


321 


442 


TSFSFLFFF/UETEFHSVASRLECRGTI 
LVHCNLCLPGSSNS 


10445 


24346 


A 


10526 


477 


3 


IEGKLPNRKDIHTKNPSVCHHHQRPKVJ3 
KTTKMGKKQSHKTGNSKNQSASPPPKER 
SSSPAMEQSWTKKDFDKLREE6FRGSIY 
SKL/REEIRTN6KEVKNFEKELDEWITR 
ITNAKKSLKDIiMELKaEAQELRDECTSR 
AASANCVAVPQAFDRSHGVQ 



1185 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 

amino add 
residue of 

sequence 


Predict- 

otide 
location 

ding to'' 
last amino 
acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=IVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10446 


24347 


A 


10527 


265 


411 


rfvcstikvlrdlssdrsnpgrclstsn 
s/spvrkdkrnkayftk/rpsprn 


10447 


24348 


A 


10528 


81 


390 


rfvcstikvlrdlssdrsnpgrvlstsn 
ssly/ekdkenkayftk/rpspvndii 


10448 


24349 


A 


10529 


253 


23 


RIFLFYFHSFLRQGLARAEGQPHNHSSL 
QPPPPG\SGDPPASASRAARTTSAYHHI 
PLFFFRSYCVAQAKIINMLSLL 


10449 


24350 


A 


10530 


10 


398 


RFVCSTIKVLLDLSSNRSHPGRFLSTSN 
SSLY/EKDRENKAYFTK/RPSP 


10450 


24351 


A 


10531 


289 


423 


RFVCSTSKVLRDLSSDRSSPGEFLSTSN 
SSL/QRKDKEHKAYFTER 


10451 


24352 


A 


10532 


152 


401 


PHCS IHSQVRIQANVHKGHRQRTYGSVI 
PHILPLHVLKKTFSLRDFHFSVSLKICNL 
VliTCLHLFL/GVRTPENDPFVSMMLLFT 


10452 


24353 


A 


10533 


2 


515 


LDRISLLLPRLKCSGPVSAHRNLYLP/G 
FKRFCSLGLPSSWDYERASPRVAKFVFL 
VDTGFHHVGQAGLKLLTSSDPPSLTY/P 
QCWDYRREPHLAAF 


10453 


24354 


A 


10534 


469 


566 


PTKNTKISREWWCVPV/IWEAEARESLE 
PGKQRL 


10454 


24355 


A 


10535 


354 


47 


KPKNLPGGGGPPFYSSFSKGKKIPLPWK 
AKVPINQILPLPPHPGGGKKTPFPKKKK 
KKERKEOOKMATVQKGM/PHKYYHGKTGS 
YNVIQHAVGNAVTNRTRGS 


10455 


24356 


A 


10536 


1 


409 


TPKKRFFPKKPKGFFFPPPKKKKKIFSP 
PPKFWPPQKNFKKAPPPFFFFFFFFFFF 
FPFFLICSNPFFFFF/SKNPILKPPLPPL 
FRVFPPPKKKKKKDP 


10456 


24357 


A 


10537 


204 


435 


TSGSLYKLLGMGDLGLGICTTIFKQ6GHQ 
WSS\GWHPEATTG/WWAFKGLHWDPKA 
HjaHiQLWNIAL/QER/FGKMTKACY 


10457 


24358 


A 


10538 


208 


382 


RFVCST I ECVLRDLS SDRSNPGRFLFTSN 
SSLFER/DQRNKftFFPK/'RPSPVNDI IS 

T 


10458 


24359 


A 


10539 


63 


409 


GWRLTNFPVGKTPS TSGTGLLFI FLLLN 
LMGRLKPSSISMKREFRKQNRGQKC/EF 
MKKKKKKKKKKKKKKKKKKKKKIGGGAF 
LKI FRGGPNFPGGEKI FFFFLGGGI KSR 
GGGFG 


10459 


24360 


A 


10540 


3 


1322 


R6YAWPNGALPASTVPCGFAACPGEFLC 
SVNGRQAPAGSSGFLPPVPSLCPHTVCR 
ATFQCPCEDSTCISLRRFPWGSETIILLCP 
APIPSVPGVPCGTFTFQCEDRSCVKTLS 
ASPLPLLLCPPHHLPPSPDCGLQGPSSR 
IVGGAVSSEGEWPWQASLQVRGRHICGG 
AT , I flDRWV I TAAHC FQEDSMASTVLWT V 
FLGKVWQNSRWPGEVSFKVSRLLLHPYH 
EEDSIIDYDVALLQLDHPWRSAAVRPVC 
LPAH/SENREQGLQ/CSWITGWGALREG 
GP I SNALQ KVDVQ L I PQDL CS EAYR YQV 
TPRMLCAGYRKGKKDAC/QGESPG 


10460 


24361 


A 


10541 


154 


3 


GVGNFFFFFLRWS LTLAQAGVQWRS LGS 
LQP/LTSGSLFRQFSCLSLPSTRP 


10461 


24362 


A 


10542 


3 


406 


LMVLWTAHLPALAPGSRTS LLLAF \ALL 
CLPWLQE\AGAVQTVPLSRLFDHAMLQA 
HHAHQLAIDTYQEFEETYIPKDQKYSFL 
HDSHTSFCFSDS I PTPSNMEETQQKCNL 
EDLRISLLLIKSRLEPVQVLTSMF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
Q9/S15,l 
26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Add, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IVI=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10462 


24363 


A 


10543 


3 


414 


HSSAHAYGTYESSGKRKYLQRPSRTAEL 
KKALKEKKNRLLLQQRSKSVTSS SSS SR 
DSSARDSSSEREETSTSSSSEDSDTDES 
SSSSSSSAYSTNSSSSFDSDSDSSS\SR 
GRSTSTDSSSADD/STNEEESELE 


10463 


24364 


A 


10544 


240 


509 


TFTSFI IGTPHLGEFCKCrorFGI /GKFR 
MBCVACQIETLGILSFFFETESRSVTRLK 
CGGAILftHCKL 


10464 


24365 


A 


10545 


275 


2 


GCSRPFDTAIPLMGPYPEEKKLLFEKNK 
/DTCTCMFIAftHFPVAKMWIQPKCPS IN 
EWIKNPVTHIYIHTHTYMCIYIYTHI CV 
CVIHIYI 


10465 


24366 


A 


10546 


233 


2 


WSWMAIQSVLPQTAFTELKNKYSCIQET 
PILFYFLKQSLPLSPSLECSGVIIJiHCN 
/LQ/RPGSTTFPASASKVERTTGA 






A 








KKKIPNEGGCLKEDTKAFLKTPPMLQRR 
TPPAFPKGPGGFLPDETSIPLPPSKGNPPI 
YSWERFFPRFFKNIWAPVPRANLLSFFF 
FSIFWRTGSRYIAQAGLELLGSSYPPAS 
CLPESWDSER\DHRARPTI 


10467 


24368 


A 


10548 


190 


1 


AASTFFACVCFRLFFVFVFETESPSVAQ 
AGLQWHSHSSLQPPPNLG\S6NPPALAS 
PVAGNC 


10468 


24369 


A 


10549 


155 


1 


ARGEKYFLEIGGGFFFFLLVFFLRQSHS 
IAQAGVQWCD\LNSMQPLPPRVK 


10469 


24370 


A 


10550 


223 


392 


RFVCSTIKVLRDLSSDRRNPCRFLSTLN 

/SLAVRKDERYKAYFTK/RPDPVYDIIS 
T 


10470 


24371 


A 


10551 


183 


3 


WQKLLFLFGTESCSVARVGVQWRHLSSP 
KPPPP\GSSNS/PASASRVA\TGAHLHA 
RLIFVFL 


10471 


24372 


A 


10552 


297 


10 


TDEELLCMEEKRKWFF\DMASIPGEDTV 
NTDEMTKT/DLEYYINLVDKVSAGFERT 
DTNP\ERSSrVSKMLSNSITCYRKI/F/ 
RERKSQQMWQTSFLSF 


10472 


24373 


A 


10553 




3 


LLTDRTLSCRSVGVPCSVRCQCA\LLEG 
ASQLG\SRGSGVRDPLEEAVCPFPDLQL 
RTGRTTALFKAVRQGHLSLQRLLLSF\V 
CLCPAPRGGAYRGTQASLSCGGLHPVEA 
SRLLCLPKQAWAMVGTPPP\TSLPPCSL 
ISDCCASSQRDS/VGVGPSKPGAGYNLV 
VRRF 


10473 


24374 


A 


10554 


262 


453 


GWGILTANLLVFGGGSLSISERIFFSPL 
ATGFCSVAQAGVQW\LIIAHCSLEFLGS 
SD 


10474 


24375 


A 


10555 


112 


372 


KKKGGGPPFFFFFIFFLIRQRKAKLWL 
AFNERGLRKPD / 1 FYYAGVGKIVKKKYS 
GFF I E LVTT CGKNYLMCALAVI D PRDSN 
I IRS 


10475 


24376 


A 


10556 


50 


358 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDI IS 
T 


10476 


24377 


A 


10557 


203 


350 


qlhadkvsrfathaecrgvimayfnlnl 
ph/fkrssclslwsswdyrhap 


10477 


24378 


A 


10558 


98 


387 


iqdtdlirgvyryrdkkraeekdspper 
egerereiekqrkker/erkkerkkest 

HQKE 


10478 


24379 


A 


10559 


237 


3 


EVFFPCNI FE r BSCS vtqaraqgqdhgs l 
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SEQID 
NO: of 
nndeotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

pSsr' 

sequence 


Predict- 

nucle 
otide 
location 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














QPPTPGVSSSPPTIiflSQSAVITGVNHHT 
QSGLTFQSPKQKALGKMCISSGG 


10479 


24380 


A 


10560 


37 


359 


RLKCGGPVIAYCKLEILGSSNTPTSTSW 
VAGTT/TCHHAQ 


10480 


24381 


A 


10561 


170 


3 


RPFFLPWGGKKFQGVSFFFFFFLVFFEI 
GSHSVP\RLKCSGTITAHCSLDLLGSSN 


10481 


24382 


A 


10562 


1 


388 


LPEPLMVLRTAMLVAiyififGSRTTLL\LA 
FALLCLPWLQEADAVQTEPLSMLYDHRM 
LQflHEAHQLDIDTYHELEETYIPEDQKH 
SLLHDSQTSFCLSDSIATPSNMEETHQK 
SNLEMVRISLLLIETWLE 


10482 


24383 


A 


10563 


224 


392 


CVDPILKWFPLRTATRQGCPLS\LFNIV 
LEVLATAIRQEKEI KDP/QIGKKEVK 


10483 


24384 


A 


10564 


185 


2 


RYFGGPDSPQNGKTVFNPLFFFFETRV\ 
HSCCPNESNGTISTHCNLCLPGSSNSPA 
SASQVA 


10484 


24385 


A 


10565 


175 


324 


APGYGTKYRVILCKILLLRRLRHKNHLN 
PGGRGCSEPRSHHC/ITPAWVTE 


10485 


24386 


A 


10566 


3 


367 


HELLWSTIKVLPYLRSDRSNPGGVISTS 
NSSLC/EKDKKNKAYFTK/RPSPVNDMI 
ST 


10486 


24387 


A 


10567 


128 


391 


GETCEGVYGPQQGGKQTWKVFHSKKDEG 
PCSKTWDVGLGSSPQFIFLKKVRFFFFF 
FETESHSVTRLECSGTPGT/HCNLCYPG 


10487 


24388 


A 


10568 


251 


485 


SFLWRFPLKGVPGHVRCQSA \ LLGGAS Q 
LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLSLQRFLLPLFSY 


10488 


24389 


A 


10569 


176 


3 


fvppilkkmgpflkkkegkpfffffsem 
esrsvtVrlecsga/tisahcnlclpgs 

SNS 


10489 


24390 


A 


10570 


212 


386 


rfvcstikvlrdlssdrshpgrflftsn 
ssly/ekdkrnkafftk/rpspvndiis 

T 


10490 


24391 


A 


10571 


134 


489 


slfflpsprdrwsltqstemppswkfpg 
reirspllamlhslllqptaflcpsstl 
llslkpdslrtgslflsffsflyfeacs 
va\rlkpsgtisahcnlclpsssdspas 
ayivsg 


10491 


24392 


A 


10572 


1 


470 


gqsrgipllltldlekpvslllsvtnly 
sknsaqfstilqtlsfpatftpspsipl 
ssayffffsdrvs /llsprqecsglnlg 
ahcrpqp prfkrfs clsps s \ swgltrr 
htttpglifcif/sveagfhhvaqtgle 
llslsnppasa/fpkcwdhrr 


10492 


24393 


A 


10573 


242 


410 


vmlrmqiyfslhn?iffffetgshfia\ 

RLECSGM I PAHCNLCFPGSSNAPASASQ 


10493 


24394 


A 


10574 


236 


1 


artffihilkascgdnaitgvlwhratw 
cpsvlldrdslservceilcihac/vivir 
ihehthtyiyahthththththt 


10494 


24395 


A 


10575 


241 


35 


RS S YLF I FNFFVE TR SVAQ PGLKLCCSA 
WAGTPG\SSDPPTSSSQSAGIIGMSLNL 
FLQWLIIDQAGNTN 


10495 


24396 


A 


10576 


294 


72 


FKNFKSPLFSLFtFLILGTRFCYVAQAG 
LKXSWPQGDPSALASRVAGTRGVHRHTQ 
LQVSFNYYICVLAMHSGSQL 


10496 


24397 


A 


10577 


1200 


1625 


PDGWSYSSGHSWLPEEASNGRKRWSSCW 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

test^amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GlutamicAcid,F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VPPQLSPPQAPPSPATHLGPTPPLA/SP 
PQAPPPPATHLGPAPPPPLSSAP\PPPD 
TLLGPAAPPPPSSGPDPFRYSP/EVPPQ 
LSPPQAPP 


10497 


24398 


A 


10578 


226 


325 


ERHTHTPHSHTRSHTRLTHTHA/YTHTH 
AHTHA/YTHTHAHTHFPS 


10498 


24399 




10579 


108 


320 


TIOTFLKVNVMKLSYLRFNTRNCSFFFL 
AGVQGeDHt3ST,QTQPPG/SSCDPPSSAS 
RVAGTTGAYQHTQLI F 


10499 


24400 




10580 


1 


288 


ARGERERERERERERERERERERERERE 
REREREREREREGLSFFFSRSRGQGKLW 
CWC/RPSARPRERERHAVFSAPPSLQKK 
KKVFGGREGTHSLFYPPCGNSLRR 


10500 


24401 


A 


10581 


1 


476 


REWGUUjSPRLECSWIIAHCSLGSLGL 
KRFSCLRLPSWKDYRCAPPCPSLNLFFV 
ET\GPHYVAPGWSLLLGSSD/PPPLASP 
KCRDYRHE 


10501 


24402 


A 


10582 


210 


346 


PHCSIHSQVRIQANVHKGHRQRTYGSVI 
PHILPLHVLKKT/FSLRDF 


10502 


24403 


A 


10583 


316 


461 


lpnilgwlffvsetgpcsvthagvqwc 
dehgslqsqtfgXssdlptsd 


10503 


24404 


A 


10584 


172 


383 


NLDLYLTSYIKINSKWIIDLNVIAKTIK 
LLE\iCNVAGENIPDLGIiGKEFLD\KTQK 
ALS\TNKKIDKL1!IFTR 


10504 


24405 


A 


10585 


273 


374 


GPGTEAHTRNPSTLGGRGGAIT/R/GQE 
FKTSLAKM 


10505 


24406 


A 


10586 


150 


344 


KLKCKII IKLGIKI SFAVIVWEIGFHF 
CYLGWVQW\PIRTHPGSSDPPASASQSA 
GITSMSHHT 


10506 


24407 


A 


10587 


199 


3 


SFSYIRVHLGHIKIITRHKASCEVYANCK 
AKGI ITGIKKK/RFWPGTVAHACNPNTL 
GGRGRWTTRLV 


10507 


24408 


A 


10588 


216 


1 


PRGAPTCMRCQSAL\LGGVLQLGYTG\V 
RDPLEEAVCPISELKRCAGRTTALFRAV 
RQGRLSLQKFLLPFLV 


10508 


24409 


A 


10589 


190 


3 


ISAQPWAAFEKRFIFIYRAPA/LKFMIF 
FSPFKGVPFFFTFFFFFEMESRSVARLE 

CSGVISAH 


10509 


24410 


A 


10590 


369 


40 


KTERNSININKNDVHTKTPSKGRQHQRP 
KADKSREMRKNQHKKAENS KNQWP S S P P 
RDHNSSPARKQNWMENEFDELTEVG/FR 
KVGNSSELKEHILTPCKETKNRTRGW 


10510 


24411 


A 


10591 


114 


398 


RFVCSTIKVLRDLSSNRSNPGRFLSTSN 
SSLC/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10511 


24412 


A 


10592 


132 


1 


I LYLWCVCVCVCVCVCVYLL I LV/ CKVL 
SCLGEPALSVGAFQMS 


10512 


24413 


A. 


10593 


3 


287 


I GF ITLNVGL YL YLLCY I YVY F FQVNY I 
VFI LVLAVFLLKNRLLRYWTAL YNEEVT 
REKISHYFI /NSYIHTVGKINKIID 


10513 


24414 


A 


10594 


2 


363 


RAEVGVMAPLYLLLLLSGALGLTDTWAG 
SHSLRYFSTAVSRPGRGEPRYIAVEYVD 
DTHFLRFDSDTAI PRMEPRKPWVEQEGP 
Q1WE\WTTGYAKMIAQTDRVALRNLLRR 
YWQSEAGE 


10514 


24415 


A 


10595 


156 


3 


ngleldsddgciallffyfetefcsWa 
qagvqwcdlgslkpqplgfkptrp 


10515 


24416 


A 


10596 


80 


306 


MDEELLFRDEQRKWFLEMESTGEVAVNI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
otl 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 

Iminoacfd 

peptide 


Predict- 
ed end 

otide 

location 

correspon 

last^mino 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cDdon,/=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














V/EMPIKDLG/YYRNLVDKAVPGCERVD 
FNFEISSTGWVWWLTPAIPVL 


10516 


24417 


A 


10597 


232 


389 


ILCWVFCCCCCCCFET3FLVIQAGVHWC 
DLDSLQ/SPPPGSSDSPASAS 


10517 


24418 


A 


10598 


463 


154 


mgqgkprfplffhpfyiffflkqgfsls 
prvqgrakkvpgtlaswgqkillpqppr 
/rvrqencustpggggfsepkippsfpaw 
gekggsllknlkkkktkhic 


10518 


24419 


A 


10599 


208 


38 


LGPPQGFFKTAPPFFFFFETESRSLAQV 
GVQWWDLaSLQLPPPa\SSDSPaLSKKF 
LL 


10519 


24420 


A 


10600 


200 


3 


CQPE LS CMRC \RQS LLGGLS PSGGME VE 
DPVEEAVCPLAELKRCAGRALLVRICCS 
LQSQQflGTFK 


10520 


24421 


A 


10601 


73 


427 


RWCSTIKVLRDLSSDHSNPGRFLSTSN 
SSLY/EKDKENKAYFTK/RPSPGNDIIS 
T 


10521 


24422 


A 


10602 


193 


401 


GEVSLLSPRLECNGVISAHCNFR/LPGF 
KRFSFFFL 


10522 


24423 


A 


10603 


181 


395 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10523 


24424 


A 


10604 


384 


10 


PRRPGGECVHCLGDSLVTDRRYSGLRSS 
DQTLSGDPMEAVPWSNSLSCSHYRLKPA 
SGITWGCKQflGFFSPFLFFKMESHSV\T 
RLECSGAISVPCNLCLPGPSNSPAPGGS 
LGPRSSRPPWTT 


10524 


24425 


A 


10605 


215 


379 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLC/EKDKRNKADFTK/RPSPVNDI I 


10525 


24426 


A 


10606 


1 


163 


QTHREPAMVLSPADKTNVKAD \ WGNVDA 
HAGEYGAEALERMFLSFPTTKTYFPR 


10526 


24427 


A 


10608 


283 


2 


IQKHGLIIRCPQETPFKNKDVGMQKVKG 
RGEKSIMQIL/YFFFFIJIS/HLSPRLE/ 
CSGAISAYCNLRLSHSSDSPA/VDGTIG 
ACHHTQLIFLFLVE 


10527 


24428 


A 


10609 


144 


327 


CSWSWFCGCVRVSTLIKKKNFFFEMEFR 
S \VAQAGVQGRDFGILKPPPPG\SSDSP 

ASASRV 


10528 


24429 


A 


10610 


137 


1 


HKCPSTDEWDERV/ WCMYTKEYCSALTK 
EEIMSFGKTWMELENIMI 


10529 


24430 


A 


10611 


201 


334 


SFFWRFCLRGVPSRVKCQCA\LLGGASQ 
LGYSGVRDPLEEAVCPF 


10530 


24431 


A 


10612 


146 


2 


KTPLKKGLKRVPFFFFFETESCSVTQAG 
VQWHNIGSLQVPPPVGSRHSP 


10531 


24432 


A 


10613 


205 


259 


TTMPGLSFCTRTKSNKNLSFLTLDGSAR 
VDLFLYLLLFRTIFQNF/CGRDKVLLCC 
PGWSQTPELKQLICLDLPKYWDHRYEPL 
CLAYPFALGQNLTKI 


10532 


24433 


A 


10614 


153 


2 


GVLMVSDGVLSMLANLFFSFLFF/LFET 
QSHSI\SRMECSGVISAHCNLCLP 


10533 


24434 


A 


10615 


213 


8 


KPPFFFPFFFLFFFLRWGSHCWSGY/C 
KGT 1 1 VHCGLELLGSS CS PPAFQVSG I T 
GLCYCASIFIFEE 


10534 


24435 


A 


10616 


146 


1 


ILTIRDYMVKITLFFFFEPKFHS/VLPR 
LECSGAI SAHCNLCLPASSDS 


10535 


24436 


A 


10617 


208 


2 


NNFWFSSSGKYGQYFSSETESRSVTWGH 
DLYSLQPPPP\GSSDSPflSASQVAGITG 
SCHHAQLILVFIVE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10536 


24437 


A 


10618 


120 


2 


SFFFFFETEFRSVAQAGVQWHDLSSLQP 
PPPG\SSDSPSL 


10537 


24438 


A 


10619 


162 


1 


SFHWKLRLRGAPGCMRCHSA\LLGGVSQ 

^GYSGFWHPijEEAVCFFSDL'i'LCAG 


10538 


24439 


A 


10620 


193 




T FF FFE'i'ES RS VAQAGVQV;aDLGS LH/ S 
QPGQQSETP 


10539 


24440 


A 


10621 


313 


460 


KPGLWRGTRSQKQVLFFEPESHSVA\KL 
ECGGTISAHCNFCLPHSSDFSA 


10540 


24441 


A 


10622 


243 


505 


PTGCPKQEQVQMLRPWFFFVFETQFN 
S/VPRVECSGTISAHCMLRFPGSRDSPA 
LASRVAGICRR/CATTAQLIFVFLVETG 
FCHLVG 


10541 


24442 


A 


10623 


67 


430 


LAHRTWKNQHSSTHGCDLFFFFFFFFFF 
LIKKGVFFFPPEGGHGHRFFFINIKFLN 
YKRFFFCLFEDVLMISVGPPLAFFFFFF 
WRGGFSYLLLRCV/LTEtALGAPLFAPM 
FFLFLFGEEL 


10542 


24443 


A 


10624 


3 


207 


PGGQLGSECSGVRMDEDVL\TLKSLIIG 
ESGVGKSSLLLRFTHDTFDPELAATIDP 
SSWNPARNAATP 


10543 


24444 


A 


10626 


151 


380 


kmlfSOvQwltpiipalweaeaggsleS 
lrssrpalkrsetss ppslpcqkkkkkk 
kfphaggqmlevpfpgrvgag 


10544 


24445 


A 


10627 


210 


3 


nfcqvkgfwkqkssqikgfgknkkkkgg 
sffffffeteshsvt\riiecsgtiivhc 
sldfpgssdpptsa 


10545 


24446 


A 


10628 


237 





LCFLSGASCSPYIDVYTEVNGVNPTFI I 
LTNKTNTPGPLFLII FFFGETES LS IA\ 
RLDCSGAISTHCKLCLPGSRHSP 


10546 


24447 


A 


10629 


1 


352 


RGPLS IQDYISKALFFLLLFGFLLLLLF 
FEVRSCSLTQAGVQWCNHSHISSLQSPT 
PG\RSPSDPPASAS 


10547 


24448 


A 


10630 


48 


200 


aikpknflffsetksssvtqagvqwrdl 
sspqppppVsssdspapan 


10548 


24449 


A 


10631 


209 


376 


S FLWKFCLKGVPGCVRCQSA\LLWGASR 
LGYLGVRDPLYEAVCPFSDLKLCAGRTT 


10549 


24450 


A 


10632 


141 


1 


AGFFFFFLFFFLETESEFVA\RL\QCSG 


10550 


24451 


A 


10633 


18 


522 


PLYSLTKRHTPREA/ KDNLKSTQLLSVI 
DAISEGPIEGPVDGLKSVLLNSTPVLDT 
EGNTN ISGVTWFRAGEQEQTPPEGFES 
SGSETVLGTEVKYDTPITRTITSAWIDR 
LRFTFGVQALVETTSKGDRNPSEVRLLV 
QIQRNGGOTTEKDITIKGKTTSQYLASV 
V 


10551 


24452 


A 


10634 


127 


2 


NLSPLFFFFETGSRSVTQAGVQWSDLGS 
LQTPPP\GSRHSPTK 


10552 


24453 


A 


10635 


336 


452 


RI FFFLFETEALSVAPAVAQWFDLGSLQ 
SPPP\GSSESPA 


10553 


24454 


A 


10636 


259 


3 


gtriffpfnssplsfppvqkksvspfff 
sf/lffffetesrsvt\rlecsgaisph 
cklhlpgsrhslasatrnvlrplwriha 

SCA 


10554 


24455 


A 


10637 


96 


413 


gdntsqhsailinyfiffftflff/etg 
shsvaqavrleyssvitahcnfrlrgss 
nppkkkkkggavlkdpwggqslpglaty 
yffpyrganknllgdfwegplfv 


10555 


24456 


A 


10638 


180 


473 


cymgrkawlvlsnlltlkkkkkkkkkkk 
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SEQlD 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamlne, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=^possible 
nucleotide insertion 














KKKKKKKKKKK/EMPVKQKNRGRALYKK 
KKGGRPFPIFWPFFFFFFLGGQKKHPGG 
FFRKKFFFGGGKKKK 


10556 


24457 


A 


10639 


312 


480 


ETVDSTSGEDDVNIAEMATKDSEDYINL 
V\GKAV2iEFERTDSlIFE\RGKMMSKSI 


10557 


24458 


A 


10640 


2 


516 


MKPLENLSQTASLftRGATLLRPVLRRLC 
GLPGLQRPAAEMPLRARSDGAGPLYSHH 
LPTSPLQKALLAAGSAAMALYNPYHH/E 
ERPRISTSTLDLGKLQSLPEGSLGREYL 
RFLDVNRVSPDTRAPTRFVDDEELAYVI 
QRYREVHDMLHTLLGMPTNILGEIWKW 
FEAVQ 


10558 


24459 


A 


10641 


56 


455 


PLCSYIQFIYHSKFNSNQISLVHTIPLS 
KLFGYFSQLYS lYS YLCQHF IHLLSIFI 
TVFC I VFYWVFVHL I SVPMY I CVCL CTN 
VCVCV/COiW 


10559 


24460 


A 


10642 


272 


33 


GRWFLFLGPAKYFLTGGRFPSDFFGPLK 
INPPFFFFFFFDTESHSVAQAGVHWHNL 
GYLQ/SPPPG\SSDSPAPAEFHHTV 


10560 


24461 


A 


10643 


285 


1 


MGNFLKRFFKNEKKFFFPILRAHPFNFF 
PGKSPFKTPRRALPLGVPPKHPFFFFFF 
FSETESRSVAELECSGMISAHCNL\NLH 
LPGSGYSPTUj 


10561 


24462 


A 


10644 


100 


466 


FLLKFRLREVPSRVRCQSA\LLGPTSQL 
GYSGVRDTLEEAVCPFSDLKLCARRTNT 
LFKTVRQGHLSLQRFLLPFVQLCPVPRG 
GVYRGRQASLSCSGVHPVRASPPLCLPK 
VPPPS 


10562 


24463 


A 


10645 


260 


460 


LKPIIAERETIDKGRLPYYSFFFFETESC 
FVAQAGVQWHDRYSLPPPP\PGSEGSRA 


10563 


24464 


A 


10646 


360 


504 


QIGHICAYVEKTELRLIFFFLFFFFRQS 
CSVT\RLECSGTILAHCm,CL 


10564 


24465 


A 


10647 


241 


45 


WEVEVPRVAPCHILKRDYYFI ILFLFLF 
FEMES YSFT\ELECSSArSTHCNLCPPG 
SRDSPGIPPH 


10565 


24466 


A 


10648 


76 


462 


FLWE YQE KAH ILWSL?SKLILSRSRFTV 
KEKVRQKPGILFIYLFIFEMESCSVAQA 
GVQWRDLGSLSPQKKKDS/DQSKAITFV 
EGINSKRGGWTGPHF/HCSLKMI FLI 


10566 


24467 


A 


10649 


3 


359 


qtqre ptmeltpadktnvkaawgkvgah 
age ygaealermfls fpttktyfphfdl 
shgsaqvkghgkkvadsltnaeahedym 
lnaltalkdlhahklwvdqvnfklXlsh 

CLLVTLA 


10567 


24468 


A 


10650 


266 


3 


TPEKKKKIGGLGAPFPGYPKKIWPPNFF 
FFKKTKFFFFQRVGWAPPPKSFFFFFFF 
EVESCSVAQAGVQWRDLDSLQPSPP\GS 
RGSP 


10568 


24469 


A 


10651 


278 


461 


KKIICGFFFDTGSCFVAQAGQQWRDLRS 
LQSPPPG\SSDSLTSASRWGMSRHMRA 
VF 


10569 


24470 


A 


10652 


161 


456 


VFFFPGENI AFVENKTTLS YHFMFTRMT 
AAKKMENYKGCWGWET I GSFMPCWLESK 
RVQSLW\3CVWQFFKWLNVKLLYGLAIPL 
TGICPKKLKKTVQGY 


10570 


24471 


A 


10653 


91 


486 


P CFKHGHTTW YMYFF SLSL I L FFVF SL K 
PVl LAYKTF S FP I T IRVLLQRRHRQ KFV 
LDGFNGVDQNNILGSICVSLIKIFFLTE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 
peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














SC FVAEAGMQWOJICSSLQRPRP \GS CHS 
PGISPPVLG 


10571 


24472 


A 


10654 


267 


470 


NLHMRCEKEQNFNKLSFKDLFGLOWLWF 
FIFCFWRRILTLSPRLEC/RG/MIFAHC 
NLCLPGSGHSPASA 


10572 


24473 


A 


10655 


775 


1401 


TFPSQIRYGLRKRPNSLTLSPRLECSGA 
ISAHMQRPPPGFI'Pi'SCLSIiPSSWHYRR 
PPPRPAICVCVCVCVCVCLVETGFHRVN 
QDGLDLLTS /S/ 1 PPASA/ FPKCWDYER 
E 


10573 


24474 


A 


10636 


287 


454 


LPLIYFLYFYETECHPIAOaGVQWRDLS 
SLQSPPP\GSNDSSASASPTVPTSAM 


10574 


24475 


A 


10657 


212 


64 


LFLSKFYFFEMESCSVAQAALQWCDLGS 
VQPLP\PGSEDSPASAWGYLD 


10575 


24476 


A 


10658 


368 


473 


GFIDHTRRERPEfepyLTDEWIKKMWHIH 
TTEYYSAL 


10576 


24477 


A 


10659 


358 


1 


ILAVFCFCGSAPGFEGGKLTPFFNRGRE 
TFLALVKTP PPGKARGPVF I S KNKKI PE 
FKQPPNPNPNSFFFFFFETESHTIAEAG 
VQWHNLCSLQSPPPG\SSDSPGRWSLQR 
TEIAPL 


10577 


24478 


A 


10660 


2 


235 


KRDLIRHYPKEDIYMANKYILK\CSTLT 
MHTETLIRTTMRYHL IL I KVT I F KKTDN 
MTAVGVHI CWPNTLTGHGRGIA 


10578 


24479 


A 


10661 


9 


538 


CVTVRIPSRPTRPLSSDRSNPGRFLSTS 
NSSLY/EKDKRNKAYFTK/RPSPVNDI I 
ST 


10579 


24480 


A 


10662 


374 


38 


SFLWKLHLRGAPGCIRCQSA\LLGGVSQ 
LGYSG\TIDPLEEAVCPFSDLKPRAGRTT 
TLFKAVRQGRLSLQKFLLPFVQLCPAAR 
GGVYRGRQASLSCSGLHPVRASRPLCSR 
R 


10580 


24481 


A 


10663 


268 




ALIPLSSLTISAFHLLLLETVSTSSPKM 
ECSGAITSHCSFNLPDSSPSPTSASR/V 
IGTIGARHQAQLMFIYFC 


10581 


24482 


A 


10664 


64 


451 


FSSERKSHMSLTLNQKLEMIRLSEEGLS 
KAKVGQKLVRLHQTVSQWDAKEKLLKL 
I KS ATPVN IGM I KQHNL I AD I EKFGM I W 
TDCQTSHRTVLCQRL IQSK/ALTLFNSM 


10582 


24483 


A 


10665 


355 


474 


ILFYFIFFETESCSVAQAGVQWRDLGSL 
QAPPPG\SRDSP 


10583 


24484 


A 


10666 


317 


481 


GHTCPWQTFFFFLFRDRVLLHHPGWSAV 
TQSWLTAALT\FGPKRSSCLNLLNDWD 


10584 


24485 


A 


10667 


57 


329 


VKNTQWGKDSLFNKRV\FKNWASIYRRI 
KLD\LTSYAKINSKWIKDLNVRLEIVKV 
LQVEYPSFKILGNGSVLDFVFFYSGIFA 
LHLMGEHP 


10585 


24486 


A 


10668 


86 


468 


ENYKIPLMGGKWFLLFPSIPPYFFFFSR 
LGLTLLPRLKCSGDHCSLQPRPPGLKRS 
SCL\GFPKCWDYRMEP/CVPR 


10586 


24487 


A 


10669 


254 


25 


GSHICKVPAAIYSNTCTSSEHGDG6WG 
GCSSGSTTHPSPVADSFFWVETVSL/S 
VTQAGVRWCDLSSLQPPPPGI 


10587 


24488 


A 


10670 


206 


3 


YGRPRKLCNIGLKSGFLDPTHWGQHRVI 
SFFFIMETDSRSVAQA6VQWHNLGSLHP 
PPPG\SSDSPAS 


10588 


24489 


A 


10671 


20 


355 


GFTSQSELLYSIDPITRPTDQVTIGISA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutaiiiic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QSYCRVHVHNRVYDLDVESGHPNGAPAI 
KGSVCSTIKVPGDVSSGRSNPGRFVSTS 
NSSLY/ EKDKKWKACFTK/RPS PVND 1 1 
ST 


10589 


24490 


A 


10673 


112 


359 


NKAQQCVHENHFKLKDMTLNI KVWRNl 
CHSSPNQKKYGLAILNLDKSGFRSRKDT 
GDEE/HFIKIKKSVIQEDFSIINIYA 


10590 


24491 


A 


10674 


317 


2 


TGPGFFPQIWKVFPPFPLKIFFTQKFLV 
SFWGFKTIKVGPFVFFPKGPQPRFFSPF 
LGVQAEKFFFFPFFFFFFFFCERGSHSV 
A\RLECSGAISJiHClILCPPGFK 


10591 


24492 


A 


10675 


240 


495 


DHRPEKTKSSTCQAGEVPLLGLFVPFRS 
STDWWPAFl /MEG/HICPTQSTNSNIN 
PWKHPHRHTQDHVWPNAWSPHGPVKLM 
HKLI 


10592 


24493 


A 


10676 


271 


484 


NPAGQTCRIKSFFFSFFETBSCSVTQTG 
VQWHDHGSMQPQSLGP\SDPPTSANSSA 


10593 


24494 


A 


10677 


364 


121 


KQEWRSLSHHALCNDQflSPLPGSGHWK 
SKQKLSKAAPCftGSSKHKHL/HKECWKH 
THTHTHTHIHTHTHTHIHAHTKDRFA 


10594 


24495 


A 


10678 


354 


539 


FFFFFFIVFEESHAIAQftGVQWCHLSSP 
HLR\SPPPGFN\DSPASaSSAGITTLSS 
SVRL 


10595 


24496 


A 


10679 


123 


853 


RWSLCHPRLE\CSGTISAHCK/L/RAPG 
FTPFSCLSLPSSWDYSAEHHARLI FFVF 
LVETGFHHVSQDGLDLLT/SG/ IPPALA 
/FPKGWDYRRE 


10596 


24497 




10680 


251 


30 


ASLGVSAPLCIHVSPCPPWGDLGKTWWV 
GGGFFFFEMEFHS/VLPRLECNGTISAQ 
CSLCLLSSSNSPASASRIPP 


10597 


24498 


A 


10681 


203 


1 


NFPAPGKLSPPRDSLKTAPPFFFFFETG 
SRSVA/EftAVP/W/ CDLDSLQP PTLPDS 
SDSPETSASGVAGITG 


10598 


24499 


A 


10682 


237 - 


397 


DSLTLSPRLECNGSISAHYNLC/RLGSS 
MSPASAS 


10599 


24500 




10683 


8 


388 


LYMCWFRPGFLAHNSHDHGYSLTLSCWG 
KKKKKKKKKKDS 


10600 


24501 


A 


10684 


201 


463 


I YWED IVQGIVADRLTSRKHNS KFLGEC 
LMFGQGFLFFEMDSHSftflEAGVQWRSLG 
SLRPLPPG/SQVILLPSSDSPALASRVD 

EIAGV 


10601 


24502 


A 


10685 


151 


1 


PEKNRKTQKGAPPFFFFFETGSPSVAQA 
GLQWRSHSSIQNEPP\GSSDP 


10602 


24503 


A 


10686 


165 


3 


CVFVEDI ISNFFFFFMETESRSVAQAGV 
QWREHGS \ASRVAE TTGVHHHAQL I FR 


10603 


24504 




10687 


292 


384 


WTE IGWVWWLTT VI PALWEAEAG/ E I TR 
SG 


10604 


24505 


A 


10688 


161 


365 


rppcw irtsrwcwryyrfdgst i kvlrd 
lssdrsnpgrvlstsnssly/ekdkrnk 
ayftk/rpspwdii 


10605 


24506 


A 


10689 


286 


3 


QPSSVWSFRSVI CAPAP FS GMGLS PEL Q 
QCIVGNFASRYTMTKSSAVLFILIFSLI 
FEOiEEL/REAPASLVSFLPPQMLRIKWE 
SEQWLVLLCDRG 


10606 


24507 


A 


10690 


169 


2 


SDTPAWWPRKHVFPPGFFFFFETBSRFV 
T\RLECSSTISAHCNLCLPGSSDSPASG 
S 



1194 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO; of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utaniic Acid, F^PIienylalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaiiiine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10607 


24508 


A 


10691 


240 


406 


DGIRRLSFFFFFFFETKFGFVAKRLEGR 
GTIWVNVNLHLPGSRDSPASASQVAGIT 


10608 


24509 


A 


10692 


343 


3 


QVPLNLSLHLKLGMLFPGCINGSLSQFL 
QFFAQISPYQTGFPSFYKTS lYHNSLHQ 
YSLSFLPYFIFLFFFSFFFFLKQSHSVS 
\RLKCSGMVSAHCNLRLTATSTTQVQVI 

LV 


10609 


24510 


A 


10693 


196 


3 


YDAGHTXKTLHFPTVYPFVLFCWKECKM 
VQPRVJKIVRR/FLKKLNTKLPYDPAIPF 
LSVHQKELRT 


10610 


24511 


A 


10694 


245 


391 


KTDYQPGAVAIIFGRPRREDHPRLGVQDK 
PDQLEKPVSTKNTKLVW/SWWL 


10611 


24512 


A 


10695 


252 


389 


KTGGKVLFFFFETESRSVAQVGVQWRAL 
RSLQPP\QPGTSDCPASflS 


10612 


24513 


A 


10696 


385 


1 


PPNKAKMISSKDNKNLHWGKDTLLHKWC 

weswivtcitmkldshlspytkinpkwi 
kd/lktikilgenikktvldiglhkXnm 

SKTSKftMTKILDLIKLKSFCPAKEIISR 
VNRKSTEWKKVFASYLSN 


10613 


24514 


A 


10697 


296 


430 


khiOarrggs/cgn/rqhsgrprradhl 
rsgtoeqpeqpgekphl 


10614 


24515 


A 


10698 


280 


468 


DYLCL/SLS lYLS lYLSIYLS I IHQF I Y 
HLFIYHLSNLSSISIFFTKWLS 


10615 


24516 


A 


10699 


15 


393 


RSVGVLGPVRCQCA\LLGGDSQLG\SQG 
SGVRDPLEEAVCRFPYLQLCTGRTTALF 
KAVRQGHLSLQRLLIiSF\VWLCPAPIGG 
AYRGRQASWSCGGLHPVRA/SMLLCLPK 
EAWAMAGAPPPASLPPLS 


10616 


24517 


A 


10700 


176 


1 


DWTTNFTTFLYNFKPSSLMPYLSHLFKT 
LR/MWPGAVAHARNPSTLGGRGGWIMRS 

GDE 


10617 


24518 


A 


10701 


494 


80 


FNKKDIHSDTPSEGHQLQRPNVETLKKM 
GRNQCKKGENPKNQNASSPKDHNSSTPR 
EQNWMKNESDELIEVGFRRWVITNSSEL 
YKG\DVLTQCKEAKWLENRLGKVLTRIT 
SLE KNGHGLME VKN I AQELCEAS AGWR 


10618 


24519 


A 


10702 


264 


410 


KKGPLFTPPGGGGG/ PQiOCPPGPLNPGG 
QRDSSFFPPPGGGNTGETPPGG 


10619 


24520 


A 


10703 


125 


3 


NRGNKGQV/ QWLMPVIPALWEAEAGRSP 
EIRSRDQPRQHGETL 


10620 


24521 


A 


10704 


184 


2 


RLRVLAPCRHLPRAPRTWPKRPFFFSFF 
LFFPQTQSHSVA\RLECSGAISAHSNLC 
FPPTRP 


10621 


24522 


A 


10705 


187 


406 


LFLWKFCLRGVPGHVRCQSAL \LGGASQ 
LGSSGVRDPLEEAVCLFSDLQLRAGRTT 
TLFKAVRQGHLRLQRILL 


10622 


24523 


A 


10706 


82 


410 


i lrgkfgkhyfnridwrealrqslslfn 
fi i fsnflaslhkpemetelkgsfielr 
kalfqiinardasllstvdsdfsfcrkfs 

r/cskcgq 


10623 


24524 


A 


10707 


390 


2 


SKDCRTAKI AACS FLWKIjRSRGAPARCK 
PELSCMRCLSA\LLGGVSQSGGTGI RDP 
LEEAVFPLAELERCVGRSAALFRAS RQE 
HLSLLKMHPQLPLPSGALSQADGSFIYK 
PLTGASAFLSEMPCQERR 


10624 


24525 


A 


10708 


229 


392 


YS WWRQMHS VAHAGVQWCD/ LVS LQ PQP 
PG\SSDPPASASLAARTTGAGHH1TQLIF 


10625 


24526 


A 


10709 


193 


2 


HYLCYQDHVILLLLLLLLFLRWSLTLSL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

correspond 
ing to first 

residue of 
peptide 


Predict- 

otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cj'steine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PhenylaIanine, 
G=Glj'cine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=GIutaniine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














RLECSGTIMAHCSLDS\SSNPPISASQA 
AGTTGACYH 


10626 


24527 


A 


10710 


180 


11 


LLFFFETRS/HVSQTWQCGMl SGHCNL 
RLSGSSDPPPLASRVAGTTGKHQNIWLS 

R 


10627 


24528 


A 


10711 


234 


408 


MKLIMLKVILFPQIHLQIQPNFYQNIFF 

FETELCS VAQ SGVQ WHNLS SLQHPPP\G 

SSD 


10628 


24529 


A 


10712 


312 


470 


TGPHCVT\RI;ECSGAITAHCST,DFAGI.S 
TSPTSVSQVSGTTGTWMKLETI ILS 


10629 


24530 


A 


10713 


64 


392 


PKMVIRI SSETSLYASLPLQMKGQRQK/ 
CEPQP KKKKKKKKKKKKKKKKKKKKKAR 
G 


10630 


24531 


A 


10714 


169 


427 


^^S^QKTMSrKMVGVSFYLSIIIL[IVNELNA 
PIQEHRVAEWIKKEKKKKK/DPGICGLQ 
QTLFFYEDP/HDPLRLKIGGW/RKYYPS 
RGTQIK 


10631 


24532 


A 


10715 


176 


410 


ARSSWGLHTAVPVAFFSriRFLLEIFLDR 
DTRCSPPAAFFFETGSCCVA\RLEGRGA 
ITAQCSLNLLGSSNPPTSASRVA 


10632 


24533 


A 


10716 


234 


389 


NSGNMDRYKDVQNI IQNPICWPGTTOHA 
CNPSTLGGRGGQIL/RGQEFETSLA 


10633 


24534 


A 


10717 


190 


2 


GPFPPHPGGFLRGFFVPNTIPPPTFFFF 
FFLRRSLSVAQAGVQWCDLSSLQPLPPG 
P\SNSPCQ 


10634 


24535 


A 


10718 


201 




WPLFLPKMFFFFKRFPHMGPSPWVGAQA 
KKKKIFFFLTGSHSVT\RIECSGTISAH 

CSLNLPGPSH 


10635 


24536 


A 


10719 


195 


419 


EYHTSLVTCGNPCVYRSOTJKU-IQTSRRK 
VITKIRAELNEIETEK/LQGSGETKIWF 
FEKINKTGLELLGSSDPPVW 


10636 


24537 




10720 


218 


407 


GKKNriAFKKKKRKEKKRKKSPSQSNMNS 
AKIEARTNI KLWKHGWKNCEI IDA\LQ 

KAFGDNA 


10637 


24538 


A 


10721 


231 


409 


GTQLHLGGFFFSEKELSFCFWF/MFALF 
EMEACSVT\RMECSGTVLAHCNLRLPGT 
SNSSA 


10638 


24539 


A 


10722 


40 


401 


PLCPSESSGNTLMASSDPSTPAVPPPNT 
THPPLCLSKSHLPLRPKQGLPSGNLLQL 
PLTLL I PLLGAPVACWQLPQQCTLSTFF 
FETKSHPVAQaaVQWCCLGSLQPLSPG\ 

SRDSPDSAC 


10639 


24540 


A 


10723 


210 


14 


HVMGLLLLFLNKLTVNNFGWLAGFWCLG 
FFWFFSFFETRSGSVT\RLECSGMISAH 
YKLCIPGSSH 


10640 


24541 


A 


10724 


347 


462 


TFFFFFFETKSRCDIQAGVQWCDLCSLP 
PSS\PDSSDCP 


10641 


24542 




10725 


169 


386 


dkkqakti kwgknsfsnkwcwnnciatg 
krmkldp\yltpykk/inskwikdlsi 


10642 


24543 


A 


10726 


257 


54 


PLFFFFKRQGLTLSPRLECWGTVTAHYN 
LKLLGSRDLPTSASP\SAGITGISNPAH 
LFTYLNPTHLQT 


10643 


24544 


A 


10727 


2 


401 


NNYDRAETQIYQYMCLNPTFYCLQETHL 
TCNDI YRLKVKGRRE IMQI ENKRVGVAI 
LVSDKTDFKPTTVKKKLHYIIKGSIQPE 
DLIILCTYSPNIRASRFIKRIPDLRKEI 
A/HTVKVGDFSI SLNRLSRQNT 


10644 


24545 


A 


10728 


154 




PMCSLLGLSKGGI IFFFFFEAEFCSVAQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 

ding'to"" 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucinc, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGVQWRDLGSLQPPPPG\SSDS 


10643 


24546 


A 


10729 


34 


151 


PRPPVPSLLDRGRLQLWRQRGLRHRAHS 
NG F I GGKQQ I MKLLKNYVERP VG / VAVA 
IMFDPDPRYPRSWIEDDFNYGGSVASAT 
VHIRMOSLVEUNKS 


10646 


24547 


A 


10730 




408 


vflltvrtlicrsvgvwwrstpdlvclg 
issggcrtaniglyqmllpdhssgsfcl 
rgvpghvecqsaSllggpsqlgysrvrd 
pleeavcpfsdlqlhagrtttlfkavrq 

VHLILQRFLLRFVWLCPAPRG/GVYR 


10647 


24548 


A 


10731 


274 


89 


enipivfkfppvcspllwqpqe/ceypkl 
cklclklhkhhvfy\flffffetephsv 

ARLECSQ 


10648 


24549 


A 


10732 


251 




TFQMMQKCFSHRKI FHNLLDKASYKI VY 
KEDPFSPSSLSSSVSLKHNFFFLETESC 
SVTQAGVLWCHLSSLQPPPPG\SRDSP 


10649 


24550 


A 


10733 


375 


1 


APFPPLWVRGSPPFSPLVFPHPBCLKPPL 
AS ILGPPKERiCWGS IRiCPCLFIWKKLVS 
VWPGLELQIFKFIGEFPFSFPSLVGRKP 
NFFLGPVFFFFFEKESHSVA\RMECKGT 
ISAHCNUILPGP 


10650 


24551 




10734 


82 


398 


SFLWKLRLRGVPSHVRCQSA\LLGGASQ 
LGYLGVRDPLEEAVCPFSDLQLPAGRTT 
TLFKAVRQGHLNLQRFLLPFVQLCPASR 
GGVYRGRQftSLSCGGLHPVEAS 


10651 


24552 


A 


10735 


2 


341 


tfcaiswlesgvedgprsrisyristff 
fffkgapepkevhgrpkpgiaptslpga 
hgttglsgwttpsi ggwqppppprenpk 
gehppapvagdtfqXpbocpppilkvffp 

K 


10652 


24553 


A 


10736 


171 




HSSIQSQTSG\SNDPPASAYRVftGTTGV 


10653 


24554 


A 


10737 


190 


1 


EHKTTVRKIPKMEEMMSGSNMLRIIIF 
LFLFFETKSCSVT\RLECN/GSISflHCN 
LHLPGSSMS 


10654 


24555 


A 


10738 


252 


413 


GLLGLQNFSYKVHLTKAHLKKCSWLDAV 
AHAYNPITLGGRGEW/IQEFQTSLTNV 


10655 


24556 


A 


10739 


101 


249 


AGS TSRCI /QELSDLLEHLEQENCLNPG 
GRGCSEPQSCHCTPAWVTETQKK 


10656 


24557 




10740 


185 


403 


LGLPVHTMKSNYHYHLCHHHHHLQHHHH 
HHHCHCHYHS/HHHNSQHPPPPPPPPPP 


10657 


24558 


A 


10741 


119 


425 


IKFHLHSMERTHLFLWHCWAHKQRNCFK 
ADFVITDDIKQLCPPQSWWTRAGKLPLG 
AGRGGSHMLSHHFWRPRHADILYLGVLD 
QPGQQGKTPSTj/LKNTKSSW 


10658 


24559 


A 


10742 


112 


2 


GETFFFETGSSSVA\RLECSGAISAHIM 
LHLPGSSKSP 


10659 


24560 


A 


10743 


188 


3 


PPPLFFFFFFFFFKTGSGSAT\RLECTA 
HCNCCLPGSSHHPTSAYQVARTTDVCNH 
AWLIFV 


10660 


24561 


A 


10744 


176 


3 


SLYSKNKNPSHLLFLPIPIKFPFFFFET 
EFRSVAQAGVQWHDLSSLQPPPPG\SSD 

SP 


10661 


24562 


A 


10743 


250 




GPRRIFFLKEFYPRFVFGICNPAPGGFFS 
GGKKPGPFPFNPRPIKFFFFFFETEFRS 
VAQAGVQWHDLSSLQPPPPG\SSDSP 


10662 


24563 


A 


10746 


155 


3 


PPHPFPFNPLPIKFFFFFFETEFRSVAQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide^ 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arg!nine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl4nown, *=Stop codon,/=possible 
nucleotide deletion, \=pos5ible 
nucleotide insertion 














AGVQWHDLSSQQPPPPG\SSDSP 


10663 


24564 


A 


10747 


157 


1 


TPRPFPFNPPPLKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10664 


24565 


A 


10748 


157 


1 


TPRP FP FNPHP I KFFFFFFETEFRSVAQ 
AGAQWHDLSSLQPPPPGXSSDSP 


10665 


24566 


A 


10749 


153 


2 


PPPLSWTPAPLRFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10666 


24567 


A 


10750 


157 


1 


PPPPFLPFPRPLKFFFFFEETEERSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10667 


24568 


A 


10751 


155 


3 


NTFPFFFTPLTPKFFFFFFKTEFRSVAQ 
AGVQWHDFSSLQPPPPG\SSDSP 


10668 


24569 


A 


10752 


86 


406 


SFLWKLHLRRAPGCMRC3UiA\LLGGVSQ 
LDYSGVRDPLEEAVCPFSGPKLHAGRTT 
TLFKAVRQGHLSLQKFLLPFVQLCPAPS 
TGVQEGRQASLSFGGLHPVQSSR 


10669 




A 




364 




GQFLHSLD/SHWKKSEDFCFLMFPHSSN 
VSAIHQDH 


10670 


24571 


A 


10754 


250 


51 


GEKKKKKPRVFFFFFFFETEPGSVT\RL 
ECSGVLSAHCNLRLPNPNDSPASASRVA 
ASAKLG 


10671 


24572 


A 


10755 


66 


446 


SFLMKFCLKGVPGCVRCQSAVLLGGASQ 
LGYSGVRYPLEETVCPFSDLKLRAGRTT 
TLPKAVRRGHLSLQRLLPPSVCLCPAPR 
GEAYRGRQASLSCGGLHPVRASRPLCLP 
TQALAMVGAPPPGSL 


10672 


24573 


A 


10756 


252 


482 


RL PRQP VRKMVAG VRGCCVWGWS KS LQ 
RSTTLDWQGPPQRGGPILLFFFFETLCH 
/TRLEGNGE I SAHCDLCLPGSN 


10673 


24574 


A 


10757 


205 


488 


PLESLRSPLGPLSWRTASSVSPVICIYY 
I Y I Y I Y I YTHTH Y/HIHTHTHTHTHTHT 
QYI FVC WFLKDRVLLCHPRQSAGARSW 
PTRTSASQRQEM 


10674 


24575 


A 


10758 


272 


471 


YSYVLFFIFLGIESCSV/AFSAGVQWHN 
HS SLQLQTPGLKQS SHLSLP /ASASPVA 
GTTGMRYHARLIF 


10675 


24576 


A 


10759 


17 


342 


GTLSSGAQVLIGRIESIVWGLKPWfiT.G 
GCPSPRAVHWLLASSDWRPSLQDGAEGW 
KKGEANGNHKR/GIAVISDQIDFKTKTI 
KGDXKKSHYVMIKGPIQQEAITIINI 


10676 


24577 


A 


10760 


42 


498 


EFRERGREKEREKERKTEERMEDRERKA 
EREREKERQEGREREREKGRQRERERKR 
ERERKRERE/RHEPGSL 


10677 


24578 


A 


10761 




305 


ASWDDPAHWKIWFHI PGGWAHFFFCRLL 
DCPPWAPRPPAPR/VLL/TPAAAAAAAA 
AASRP EKKRAEENVGAI P PRKRP PRWE E 
RRGPRKGSAPPGRGAGRRAR 


10678 


24579 


A 


10762 


364 


2 


FPQKQNGQYLPLFPKPTGLCGKGKTRLG 
FFLNYFGKKKKKRGRELGFPPFCQIWGT 
S lYRGSMGYFFFFFFWRQVSLALLPMLE 
C\RGMIRVHCSLDLVGPSNPSTSASQIA 
RTTGTCMCHH 


10679 


24580 


A 


10763 


152 


484 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVfflDIIS 
T 


10680 


24581 


A 


10754 


208 


420 


SFLWRFPLRGVPGHVRCQSAXLLGGASQ 
LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLSLQR 


10681 


24582 


A 


10765 


319 


85 


GKLLNWHRCPGSKFQGIMFFFGETESPY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

peSde"' 
sequence 


Predict- 

"oMtion 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=UnIcnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ITQSGVRWHDFGSLQPP\PPGSRDSPAS 
AS\RGITGARKHTQPIFLLVSNS 


10682 


24583 


A 


10766 


122 


3 


KFF FF FE TE FRS VAQAGVQWHDLS S LQ P 
PPPG\SSDSPSL 


10683 


24584 


A 


10767 


240 


86 


RAPPFFFFEMESCS lAQAGVQWLSLGLL 
QAPPP\GSCHSPASASCELMFSKI 


10684 


24583 


A 


10768 


244 


85 


RAPPFFFFEMESCSIAQRGVQWLSLGLL 
QAPPP\GSCHSPASASCBLMFPKI 


10685 


24586 


A 


10769 


35 


296 


EVKSPSARQPPRLQSEEHLRPAftAPSQR 
EVGGQPPPGQPPCPGGEG/PPPPGSPDW 
EVHSPSARQPPRLGGEPNSSLRTGHDDD 
GGFV 


10686 


24587 


A 


10770 


260 


484 


MDEELXiLMNEQRTWFLEVESSPGEDAVS 
I IQIATRDLEYDLNLVEKGAAG/LERKH 
YSFERSSTVDKILSHNTACY 


10687 


24588 


A 


10771 


248 


393 


TQEGKKLINWPGTVAHAOJPSTLGARGG 
RTTRGQELETK\LaHKIKPCL 


10688 


24589 


A 


10772 


239 


2 


SFLWKFCLRGVPGRVRCQSA\LLGGASQ 
LG\SQGSaVRDPLEEAVCPSSDLQLRAG 
RTSALFKAVRQGHLSLC2RLLLSF 


10689 


24590 


A 


10773 


158 


455 


LFFTLCPSLLQHIAVMLELGLIOSPKSIQ 
P I FWVFLQGTE P /HFLVTPVR/ CCLPLL 
KLFCLLVFGMESPSVPHRGECSGVISAH 
CNLCLIiGSTDSSASPSRV 


10690 


24591 


A 


10774 


166 


403 


KKTFLGEPLFWGGAKKKKPGKKNPGFFP 
RG/IKPRVFFSRFFFFGPPPKKGFPQKS 
FFLKSLPGFFLFGGCPPPFFFFFFFFEM 
ESCSNTRLECSGVILAHCNLCLPGSSDS 

P 


10691 


24592 


A 


10775 


208 


1 


RLCFFYFRKAILGKAQIKNIAFFPRKGS 
FFFFSETESRSVAQAGL/LDCSGAI SAH 
CKLRFPGSRQSPAS 


10692 


24593 


A 


10776 


192 


29 


IFRKEFPCLNFFLLFFETE3R3LAQAGV 
QWRDLGSLKAPPPG\SRRSPASRDTGV 


10693 


24594 


A 


10777 


275 


490 


KPGFFLLQMAKVYMI FFFFEMEFCS \VA 

qagvqwhdlgsmqhrppg\sgdspsclp 

SI/AGIPG 


10694 


24595 


A 


10778 


340 


3 


LKVPVGKPRSTLLGVKKVPFFGWKRKKF 
IiftLFFFPRPPPGEGFSTAFLAQKPTPRV 
VPALQFPKMQSPLSPFFFFFFETKSCSV 
TQAGVQWCX3LRSPQPMPPG\SSDSPASC 

L 


10695 


24596 


A 


10779 


313 


1 


anpfggpsggdpfssrvfpppgpknetp 
ffknkttatkktgnkggwapsphrggpk 
kgpalwdkkgknlwpffffeteshsvt\ 
rlqcsntilahcnqclpgs 


10696 


24597 


A 


10780 


116 


422 


ILEDTNIQTIETLLAIREVQIQTTLRNH 
FTLTGMAI I /RKTDNNKCMRECGKI ETL 
I CCWRECKMLCNEVGTALE I WQFLQSL 
NIELSYDIiAMPGFI IFPRE 


10697 


24598 


A 


10781 


120 


418 


TQTTGAPQLHLASRWLSRSGGLTSSPQE 
IPKLFWSIESPLGSSKHLSLQVFVCLFV 
CSFVFEMESCSVARLECSIVISAH\CTL 
HLWGS SHFHAS ASRVA 


10698 


24599 


A 


10782 


134 


1 


ssffflcqtescfgvqwhdlsslqpsts 
raqa\lslpsswdhrr 


"10699 


24600 


A 


10783 


2 


435 


CSHRGDSSSYSQLSGIRAGDLGGGGKDI 
FRLLPTTLHIFAGKESYDWCVTHERMC 



1199 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucientide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














LC/SCLFIYMCLCMCVHV/HYIHEACVF 
MCLCL 


10700 


24601 


A 


10784 


432 


161 


FSRAGFH/RVSQDGFDLLPS \CLPPLGL 
PKCWDYKREPPRPAWKLMCRQVHRKCTW 
LFIQIGSTLFKTNGGLSAVSHAYNPNTW 
GGRGERIA 


10701 


24602 


A 


10785 


270 


442 


NVELRLGLALSPTLECRGTIMflHCRLDF 
PGLMQSSHLSHRVAGTTG/TCHHAWLIF 

KFL 


10702 


24603 


A 


10786 


12 


424 


L IQLPRLECSGALITUICNFTY/ SLGSGD 
PPTSASQ/VLKTTGVCHHAQL 


10703 


24604 


A 


10787 


350 


3 


DSSSVQTNKGIPGQVLSDILSCAVKASV 
LVI IPLHNALSSWLVPALPLWKSWQVGK 
SQQAHSSVSGLC/MHYRSHTHaHTHSPH 
RHRHTYTYARAHTHTHTQMLSAYLPSKQ 
PSGSLS 


10704 


24605 


A 


10788 


155 


3 


HAFFALCIRNRLECNGVILAHCaTLC/RL 
LGSSDSPVSASQVNGIAGACHHAQL 


10705 


24606 


A 


10789 


188 


2 


RRDLSSLQPPRGGQ/MRGCIYTHTHTHT 
HTHTHTHTVHWGWGKRHHVPKGKMSANG 


10706 


24607 


A 


10790 


19 


405 


IRPTISRVERGINSLVASEGQRLPWDGI 
ACSQGLVWQQTWGPLGPFPSLLGMPHR 
PTFRDLNSEPAPGVANVSGTLSTPLPGA 
SHGLLVFFFETESHFVAQAGVSWGDLRS 
LPPPPPG\SSNSPVSAS 


10707 


24608 


A 


10791 


157 


. 


kpgpfpfypgplkifffffetefrsvaV 
rlecsgtisahcnlhlpgssdsp 


10708 


24609 


A 


10792 


157 


1 


QPRPFPFNPRPPKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10709 


24610 


A 


10793 


83 


387 


sflnklhpggapacmrcqla\llggvsq 
lgytgfrdlleeavcpfselkhhagrtt 
avfsavrqgclslqkfllpfvqlcpapr 

ggvyrg/rqallschrlhpv 


10710 


24611 


A 


10794 


204 


419 


kgvyghsgs fs ppaplacfrdktlfpvs 
lwkefvhspwckctlpqplwktvwrylk 
nfkme \ i p yvpe i pllg 


107U 


24612 


A 


10795 


207 


2 


RRGFTMLVGQNSLDPSTSRSAHLSLPKC 
WDYRCKP/ PAP 3/ LREGFS YLLHSLLH P 
TPAPGSHHLWAALIC 


10712 


24613 


A 


10796 


2 


237 


FFFLRERILLALSPRLECSDANMSHCSL 
NLPG/FSQSFCLSHP\SRWDHRHMPPYP 
VKFFGIFVGDRVLALFPKLISYII 


10713 


24614 


A 


10797 


342 


40 


DRVFFCS PRLECSGAI lARCS / LRTPGL 
KQSSHLSLPKCWNHRCKPPRPARCSLNE 
SHSAKKWPGQSTD S SACEEVQ P TS P F P T 
VPYPTLLS PSAGHGEGR 


10714 


24615 


A 


10798 


209 


2 


CSVLRVILNPECIYWPWLELLAHDPIP 
RLVCPKSFFFETKFCSVAQAGVQWHDLG 
SLQPPPPG\SSNSP 


10715 


24616 


A 


10799 


142 


2 


IFFFFFEMESRSVARPGVQWSDLGSLPP 
PPP\GSSDSPASATPSPMQS 


10716 


24617 


A 


10800 


276 


2 


ILPIIIRNTCCCFFLKESRSCSVTQARV 
QWHII/GSLQPQTPG\SSNPPASAFQVA 
TGAHHHTSL lENTLI I INWEKNTHKLSG 
CTSMHFRFL 


10717 


24618 


A 


10801 


12 


364 


LPPLGLPYSLRHNNIEISPIHMPPIASK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sr.. 


Predict- 

location 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PIienyIaIanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop eodon, /=possibIe 
nucleotide deletion, \=po$sible 
nucleotide insertion 














YSRERMSCISLTIiNQTLVMIKLNEESLL 
KANSDHKLCLLR/SVYQWNAYEKFL 


10718 


24619 


A 


10802 


235 


3 


HRPSEDLDFPPHWAVMRAPQYPSWDTRE 
GGPRSFLFLFLFFEMESYSVAQL/EGTI 
SAHCNLCLPS S S DS PTSRALCS L 


10719 


24620 


A 


10803 


349 




NQTPFFFFFFGGTETTSTTLCS\YGLLI 
LLKYPEVA/ESASQRDPEWEAAVWRWLE 
GPGSAQPPSAPAKGQELDPWGQRPVPS 
PDDKVQWPYTNA^/LLEIQRFISWKRTL 
TLDTLY 


10720 


24621 


A 


10804 


365 


3 


IDVCNVCVRKQYRGFCDQKYRAWISPVY 
PHKCGWHRVYHTPTPHCETEWLWVILHA 
QEH/TFSLTGRHTHTHTHTHTHTHTSKL 
APPASRALFGVAHVEAQKMiASPSSGRY 
LAITMFVQPCI 


10721 


24622 


A 


10805 


406 


96 


CPPEFSEESPRLLKFRVGGYLTPQVSKC 
GLGVWIFKVFFWSPPKVQTSLFFFSKT 
GSHSVT\RLECGDTIIifiHCDLCAPGSGD 
PPASATRVTVTVGLPPCPAR 


10722 


24623 


A 


1080S 


58 


369 


FFFFKGDRAQNNS/WGERCLLNKGYWDI 
WISTCKKMKSTPYLTIiHTKISSKGLKDL 
IIRAKRIKLLKKYIGINIflDLGLK\DFL 
NMTP KTLATKEKIDTLDFIKI K 


10723 


24624 


A 


10807 


126 


1 


KEPFFFFFFETESCSVAQAGVQRCmSS 
LQHQPTG\SSDFP 


10724 


24625 


A 


10808 


204 


415 


HLGFDSLTTCISLSDGLKYKATVFLVFF 
FERESNCXAVWAEOTGPISVlSrCNIiRLPG 
SGSSPASPSRGVEIT 


10725 


24626 


A 


10809 


168 


2 


KISKRPFFFFFFETGSHYVA\KLECSGV 
ITAHCSLDLPGSSNPPTSASWVAGTTGT 


10726 


24627 


A 


10810 


97 


389 


lAVSPSLSLSLSLSLSLSYLGLPYSLRR 
SN IE I TP INTPAEGSVCSSERKGHMSLS 
FNEKIEVITLSAQDMSNTKIG/RKLDLL 
CH/TSQWNAEEKFLK 


10727 


24628 


A 


10811 


24 


416 


LEYIARRYLGWWLFFFFFLNRQGEKSR 
GPFKFFFPRGFFSTRNGAPPGGPWGPLP 
WGGGPPVGFQKQGKGGAP\PPKKNRFPK 
GGPLTQPNLPKTPIENPKGPPTRGFFPS 
GPPPKKGAGPPPIFRVGPG 


10728 


24629 


A 


10812 


60 




KKRKNFPQKKISPYFYPLKWFKTPPLWV 
IC^QTPPV/CCFFEAPFSLKKPPRGLKKG 
/WGKIIFPPPLVPQKKKKKDTAGVAILVS 
DKKDFNPTKIKKDKGHYIMVKGSMQQEG 
LGCPDAWVPS 


10729 


24630 


A 


10813 


618 


1519 


FWFGVMCKSVCVPFLPVLCFWCVMCER 
GVLSQRSMGQAQSKPTSLGTMLKHFKKG 
FKGDYSVTMTPGKLRTLCEIDWPALEVG 
WPSEGSMDRSLVSKVWHKVTCKPGCPDQ 
FPYIDTWLQLV/YRPPPY 


10730 


24631 


A 


10814 


179 


15 


KVKRLKTP FFFFFFETRTCSVAOARVQW 
HNHSSLQPQPPGPKRS\PTSVSREAGT 


10731 


24632 


A 


10815 


61 


422 


NCFFLKGPPLFFFFFFFFFFLLQI IVFI 
YKFFAFLFQMEFRSLPRLDKCNGAISAH 
CNLSLPSSWDYRNLPPRLANFFLFLVET 
RQPASA/FLTCWDYR 


10732 


24633 


A 


10816 


170 


440 


RQGLSLLPRLECSSMTTAHCDLKLLSSS 
NPPISAPHIALG/LTGLCHHTQL 


10733 


24634 


A 


10817 


322 


443 


FIFIFYFFETRSHAYAQAGVQWRDIALL 



1201 



wo 01/64835 PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ]VI=lVIcthionine, 
N=Asparagine, P=ProIine, 
Q=G!utaminc, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \^ossible 
nucleotide insertion 














QPPPP\GSSDSPA 


10734 


24635 


A 


10818 


134 


3 


KKaHLPFFFFLEGTESHSVa\RLECSGT 
ISAHOSrLCLPQSSVSV 


10735 


24636 


A 


10819 


76 


245 


FliLRKGTRQGGPLSS/LLFNIVRDVLftR 
AIRL\EKKIKDIQIQKEEABLSLFADMI 
IF 


10736 


24637 


A 


10820 


233 


440 


FFEPGaLKKWKSMTHKlWERetaiTTLN<3 
QGIKAAKTASLKKHLQQWGKVMLMANKL 
LRGERAW/FPPRIRG 


10737 


24638 


A 








RNPFFFLETEPHSVAQGEVQWCNLSSLQ 
PPP\PGS 


10738 


24639 


A 


10822 


168 


1 


PLRGLSDSSLGIMKRMATDLSSLQPLPP 
G\SSDSPASASRVAQITDSHHHAWVIF 


10739 


24640 


A 


10823 


97 


405 


LCVNICFNFPWLGVEWLNHIIGVGLTFF 

FEMESCSVARLECSGAISAHCNLL/LPP 
SFGSSDSPASVT 


10740 


24641 


A 


10824 


186 


2 


EPGTISLVALKLQKWPRR/SDHLRLAW 
DQPGQHDETPSLLKNTKI SWAQWQAPVI 
PALEPV 














KKTQIGGAPGGALFFFFFFETGFPFVA\ 
RLQCSGTNQAQCSLNLLSSSNPSFPaLQ 
VTGTPSACP 


10742 


24643 


A 


10826 


252 


38! 


GQQEFFFPCSVPQAGGQWCNLSSLQAPP 
PG\SHHSPASASRVAG 


10743 


24644 


A 


10827 


77 


427 


I PQVHCPMS PPVPMAC IPRVSS FTSWVF 
HNLLPPSECPLGPLVPASSHPRPCVCCR 
PCTSWS\CPLWPRPPCSNSPV/TCVPCL 
PCLCISEIPSCVPWP/WTYSSLCPMSHV 
PDSPCPLP 


10744 


24645 


A 


10829 


20 


518 


SFAFSLLQHLTETSFfllNSCSBATLLFL 
SVFL/RAQTLTAPCQTRGPRRGKDRGSG 
SSSFSGPKATKESSVERRKSFDSWGHRF 
AA\QRLMDNQAERESEAGVGLQRDEDDA 
PLCEDVELQDGDLSPEEKIFLREFPRLK 
EDLKGNIDKLRALADDIDKTHKKFTKAV 


10745 


24646 


A 


10830 


351 


3 


LHFSPFLQHRQNIKVWLMRASMQRHSRT 
HGAFPLHQDEIQTRPRLRSAASSGPSLL 
SDHILSTLPAFTCANLSSRFCSSSPCSC 
SCLRA/CCTCHFDLSTLIHAHTRTHTHT 
HTHTMY 


10746 


24647 


A 


10831 


8 


376 


gmlpahladvlrhnsvgrpkhmrvmaga 
legdlfigpkae\ehrg 


10747 


24648 


A 


10832 


336 


38 


GVATEGVGEAAQGGEPRQPEQ/PPPQPY 
■ PPPPQQQHEEEMAAEARQA/AGAPMDDG 
FLSLDSPSYVLYSDRAEWADIDLVLQNV 
GPNPWQIIYSDKYTLWK 


10748 


24649 


A 


10833 


206 


1 


TYFFPFPPGLFFIAGIFFFFFLETGSH/ 
SSLRLECSGIITASCSLNLPDPNDPPAS 
ASQVAETTCATMY 


10749 


24650 


A 


10834 


226 


376 


RISQAIISFYFYFLFETESHSVAQAGAQ 
WSDLSSLQPPSPGV\SDSPALPS 


10750 


24651 


A 


10835 


213 


1 


DRVLLLSPRLGCSGMITAHCNLHLPWFK 
RFSCLCPPE/SSWDYRCPLP/PPRLTSV 
FLVETGFHHVGQAGIiR 


10751 


24652 


A 


10836 


351 


447 


RENLWLTPVIPALREAKAGI \LEPRSSR 
PAWAT 


10752 


24653 


A 


10837 


71 


472 


SASTAPMAPVKKLWKGG/ K/ KKKQVLK 
FTLDCIHPIEDGIMDAA/NSTNYEQFLQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=MetliioHine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertiDn 














ERIKVNRKAVNLGGWTIERSKSKLTVT 
PEVPFSRRYLKYLTKKYLKNHNLCDWLC 
IVANSKESYELRYFQITQDKEEEK 


10753 


24654 


A 


10838 


37 


382 


SRCIMAFYLYGIRSFPELWKSPYLGVGP 
GHSyVSLFVAGRCCSVENQQRI.CSVKTMS 
PQNTBamVIAKARYLRKDEGSNKQVYS 
VPHFLIflGAGKERSQMNSQSEDHK/LA/ 
PVRNAV 


10754 


24655 


A 


10839 


313 


617 


AFFLIVFQYVARERERVSRS/VHCiGLYQ 
HCYFRTL 


10755 


24656 


A 


10840 


309 


511 


WEQGSWEHVLQAPCSIiIAPRITHRYTHT 
HTHTHHYTHTHLYPH/ APTS I L 


10756 


24657 


A 


10841 


269 


450 


TPWPLKKEFFFFLKWSLALVAQAGVQWR 

dllgssdspNlqpsrfkqfscislpssw 

DYR/PC 


10757 


24658 


A 


10842 


420 


1 


LERGENGTRRDRRKGLSHCHQPlXDDSVIiP 
PLCHPPPLLVTMEEEITMLFIDIGSSMY 

kagfag/ddasramfpsivrcpwhhgvm 
vgmsqkdsyvgdeaqtkhsiltlkypik 
hdi itne\wnnmekiwhhtfynklhvap 

CI 


10758 


24659 


A 


10843 


431 


1 


GEHSWASDLAEDVTKVTRGPLGPLFWGL 
CSGNLSFSGCVPGLPAGAVPPWVPVPFQ 
GGA/SMVWKGPSPWTLHLLLRTWGLaGG 
VGGRSLGRWRAWPGNPGSCJGQGAPPAHQ 
ATGTPRSRTGSTGGlETVTIIiEGSHVSG 
MGIR 


10759 


24660 


A 


10844 


55 


297 


QRWPGTLLGVQPG/APPDSTSAStiSGGA 
RGGPVPTLEGGNTGSRKWEDPCWGSQNS 
PSSDGSKPPPGPLTSKVCDDSRLSE 


10760 


24661 


A 


10845 


109 


427 


QTGPSAAGLLEFARGPLQTLFAWVPAAV 
AAEQQIFVNRECCCLIVPLEFCLRGVPC 
CVRCQSA\LLGGASQLG\SRGSGVRDPL 
EGGSCPFSDLQLHAAHTTALFTA 


10761 


24662 


A 


10846 


17 


416 


S FLWKFCLREVPGRARCLSA \ LLGGAS Q 
LGYSGVRDPLEEAVCPFSDLQLRAGRTT 
TLiFKAVEQGHLSLQR I LLP FVWQCPAP I 
GGVYRDRQASLSCSGLHPIRA/SRAAVP 
TQASAMAGAPPPDSLPPCSLSSN 


10762 


24663 


A 


10847 


265 


429 


LSGASCCLFSLPIVLFALLLLSVLSFSF 
CAWCVPSWSLSPTLFLTFVLHFL*RFIS 
CF*WSNPKNLTFMVNFQGKGMKIGLAL 
ERI CI CGCSLVFPEAILETSCQNLFCTY 
ACAGVLS SVYDYLALLAVSFLFL*FFSL 
FFFFLYYLFLFALGASLPGLYLLLYFLL 
I.CCISYSI.F 


10763 


24664 


A 


10848 


12 


462 


QTLGTKMNEGLFATFIAPTILGLPDALL 
IILFPPL1.IPTS*YLINNRLIITQH*LV 
KLTSKQMITIHNTKGRT*SLILISLIII 
IVTTNLLGLLPYSFTPTTQLS INLA^4AI 

pl*agaevigfrski:<nalahflpqgtp 
tplipilvii 


10764 


24665 


A 


10849 


2 


462 


ttlhafgtmkeeafitllclftsansrg 
vyardahks eaahr i kdlaekdflalvl 
iayaqylqqcpfedhvkl*wevtefakt 
cvadesaetcdkslhtlfgdslctvatl 
retygemvdccakhepvryecflqhwdd 
cpnlp*vvrpevd 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop eodon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


10765 


24666 


A 


10850 


164 


1 


RGFQVRGE*NSGRETER*RERE*KLARE 
RAIGQEREREKSREAGRERERERKKE 


10766 


24667 


A 


10851 


417 


100 


MMLWGNLETRLWSQAQTIPCTKQRSSL 
FFPVLNAAIFRSKSKVLNCLEYFLGNTL 
FSWDLIPEILFPLSHYHHYHHHHHNHQY 
*KS*QWSASNTRFQGILDKNSY 


10767 


24668 


A 


10852 


339 


3 


ETTHSKARRGRSAAASCRGSALRRGRFP 
ESRRGREAAPVCPRHVLL*GAQSKQAAV 
AGKRSGTRHASRWPKSLFTPRRRRISLK 
RALHFWQQSADPSPSVSRIPPHGVGSS 


10768 


24669 


A 


10853 


1 


373 


LQQKGMRKRAGQSEMAPAGVSLRATILC 
LLAWAGLAaRDRMYLHPFHLVIHNEST* 
EQLTKSNAGKPKDPTFIPAPIQAMTSPV 
DEEALQDQLVLVAAKLDTVDKLMAfiMVT 
MIAIFLGFRIYG 


10769 


24670 


A 


10854 


1 


423 


VSCSFLKLKTMKHGLLLLLCGFLLKSQG 
VNYTEEGFFRARGHRPLDKKREEAPSLR 
PAPPP ITGGGYRARPAKAAATQKKVER* 
APDAGGCIflM)PDLWV*SPTGCQLQEAL 
LQLERPITNTVDELNNNVEADSQTSSSF 
L 


10770 


24671 


A 


10855 


343 


3 


RGCEAHPLPRSEGPAGSALAQPVMYCTI 
FAGTLITAISSH*FFT*VGLEINMLAFI 
PVLTKKINPRSTEAAIKYFLTQATASII 
LVIAILFNNILSGQ*TLTISrTTNQ*SSSI 
I 


10771 


24672 


A 


10856 


147 


1 


TRTPTGQCVSPKSMFLGAVMSCNPITL 
GG*GRRIT*GPEFDPSLflWMV 


10772 


24673 


A 


10859 


189 


422 


NHTMDDFERRRELIRHKREEMRIiEAERI 
AYQRNDDDEEEAAR*RRRRaRQERI.RQK 
QEEESLGQVTDQVEVNAHNSVP 


10773 


24674 


A 


10860 


90 


273 


SHQEIEQNSAMAPRKRGGRGISFMFYCL 
RNNDQRYMT*RL*SGIGWMLLSSGRMGY 
ALPG 


10774 


24675 


A 


10861 


25 


411 


APCAKPCGDWRSRGSLVWAMSGCNARKG 
DCCSRRCGSHL *N* I PTDWPLN* FFLTS 
AKVKKCFPKKESI YSQTVY* SPG* KM* T 
*DKEHPRYLIP*LCIQFYHLCWVTGTGG 
GIILKHGDEIYIAPSGV 


10775 


24676 


A 


10862 


33 


302 


SRRATLIYVDMENGEPGTRWAKDGLKL 
GSGPSMLALNGRSQVSAPRFGRTENAPP 
SLprATIRALGTWRATEKFVKTNGPLR 
Q*QPRS 


10776 


24677 


A 


10863 


1075 


1521 


YCHTGKGEQLGERFCEGVSRRGPAERGS 
DSQTPWPWPLCAAAGTSAGTSHSGSSSG 
AFS SWPCWTAAEPAARKRGRPAGSWS S P 
ATGAPGRCRHEILSRGAGGSAGFVCSGL 
AESGL*ESSSPGRSQG*PQGQRHPQPNG 
LPAPPSTSV 


10777 


24678 


A 


10864 


245 


418 


TSKLAFPrsrPVIYANKVCP*FSKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKVF 
FFEGPGFFF 


10778 


24679 


A 


10865 


132 


398 


LNMGKGDPKKPRGKMAS YAFFVQT* RE E 
HKKKHPDA3VNFSEFSKKCS*RWKTMSA 
KEKGKFEDMAKADKARYEREMKTYIPPK 
GETF 


10779 


24680 


A 


10866 


115 


455 


LLTRNMDRLLI LAGGMPGLGQGP PTDAP 
AVDTAEHVYISYLALLKMLKHGHAGVPM 
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SEQID 
NO: of 

sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

location 
correspond 

residue of 
sequence 


Predict- 
ed end 

location 

last^amino 
acid 

residue of 
seq'^uence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkiiown, *=Stop codon,/=possibIe 

nucleotide insertion 














EVMGLMLGEFDDDYTVRVIDVIAMPQS* 
TGVSAEAVDPVIQAKMSDMLKQTGKPEM 
D 


10780 


24681 


A 


10867 


64 


483 


QGERPAAAMKI * SLTLLSFLLLAflHVLL 
VEGKKKVKKGLYIKVDSE*KDTLGNTHI 
KQKSEPGNKGKFVTKDHTHCTWAVTEQE 
EGISLKVECTQLDHEFSWAYAGNPTLCL 
KLTDE rVYWKPVARNLRLQNDIMIYSPQ 














DTT ILKIiGRL I ILQWLLS EGKSHMSFTL 
NQK*EIIKLSKECMSECAELSQKLGCFCQ 
VGSQVMHAKEMLLKEIKSAT 


10782 


24683 


A 


10870 


934 


515 


KKVLLCLPGLGCGGVILAHGSLALPGSS 
SLHLSLQSSWGLTGM*YDARLIPVYFPR 
EMGSCHICQAGLNSYNSSIPPTLVSQNV 

HFGKQENRIFFFFILFPGGSRDEDRVA 


10783 


24684 


A 


10871 


373 


424 


EFKTNIAMyiG*CLKSQHLGRPRCAiqHMR 
LGVQNQLGQHGE I P FLLK 


10784 


24685 


A 


10872 




469 


RSRSGDSLPACDRPSGASALATCETIFS 
AISCFWDLPAPSLRLTPSCQPTMSSQIR 
QNYSTDTEAAWS LDNWYLQAS YTYLCL 
GFYFDRDDVALEGVSQFFRELAEENREG 

GKTHNAMTAAMGLETK 














N I YSC KETF SVPLliAI HVYHFLVGRGGS 
QGTEMLWHRVDLRYREQAGHSGSHL*SQ 
HFGSPRQVDQLRYGVSDQRGQHQE 


10786 


24687 


A 


10874 


83 


427 


ISLNMIRIAALNASSTIDDDHE*SFTRH 
NTQTKEAQEAKAFALYH*ALDLQKHDRI 
EESAKAYHELLEASLLREAVSSGDENEG 
LKHPGLILKYSTYKWLAQLASQREDLET 


10787 


24688 


A 


10875 


94 


■ 


KSQKACNPSTLGG*GGWIT*AQEFTTSL 

ANT 














GVTRGFHMRIEKCYFCSGPIYPGHGMMF 
VRNDCKVFRFCKSKCHKKFKMKRNPCKV 
RWTKAFRKAAGKELTVDNS PEPEKRRNE 
PIKYQLELWNKTIDAMNRVEEIKQKRQA 
KFIiyiNRL*KT*ELPKVQDIQEVQLN 


10789 


24690 


A 


10877 


65 


417 


RFAGAGA 1 PEAHAWPTDVHAAEE E KEMD 
LPDLASRVFCGRILSMVNTDDVWAI ILV 
QKNMLDRFEKTLEMLLNENNLASARLEQ 
MSERLLRHTRTL*DMKPDLGSLFRPIRT 
LEWKL 


10790 


24691 


A 


10878 





442 


APTPDAMGHFTEEDKVTITGLWGKVNVE 

SGSVIMGNPKVKAHGKNVLTSLGDAIKH 
LDDLKGTFA*LTELHCDKLDVDPENFKL 
LGNELETDMRIHFGQDFTPEVYAYLQKM 

VTWAN 


10791 


24692 


A 


10880 


1 


419 


GKHIRQYHEEKETGQRINIHEYYLGNML 
AKNLLFEKEREAEEKEKSYE IPTKNIQG 
QMTPYYPVGMGNGTPCS*KQNRPRSSTV 
MYICHPESKHEILSVAEVTTCEYEAVIL 
TPLLCSHPKYRFRASAVNDIFCQSLPG 


10792 


24693 


A 


10881 


54 


335 


RE I FTMSGALDVLQMNEEDVLKKLAVI T 
HLCCTKTDSPMEQIIYGS*TEYIYSIKL 
NRT * QNLMLAVP *NYDI *MHDEVTVMTF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=l.ysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NNTLQMFEP 


10793 


24694 


A 


10882 


216 


3 


MS IKSMS ICCLRKKGKHGFQ IRGK*NSG 
RETERERERERKEARERARGQERERQKT 
SEGGRERERERKKE 


10794 


24695 


A 


10883 


3 


390 


GELKCSKEKCQSMSAKSRTIWI IGAPFL 
KGQPRGGVEEGPTVLIKAGLLEKLKEHE 
CDVNDYGDLPFADIPNDSAFRIVKNPRS 
VGKASEQLAGKVAEVBCRSG* I SLVLCVN 
LGGGLECLCGHAKVYSR 


10795 


24696 


A 


10884 


3 


375 


STMRAWIFFLLCEAGRALAVPQQESLPD 

ETEWEETVAEVTDVSVGANPVQGEVGE 

FDDGA*ETE * DWAENPCQKHHCKHGKV 
CELDENNTLKCVCQDPTSCPALIGEVEK 
VCSYDTLTFYSS 


10796 


24697 




10885 


95 


2 


KCTQGPSAVAHACNPNTRGGRGGWIT*G 
QEF 


10797 


24698 


A 


10886 


1 


114 


MGFHHVSQDGIiDLVTL*GACFSLPKCWD 
YRREPTHLAIl 


10798 


24699 


A 


10887 


1 


368 


EPTMELTTAEKTNGIO«V.WRKVGAHAGEY 
GAEALERMFLTFPTT*TYFPHFDLSHGS 
AHVKGHCKKVaDALTNAVAHVDDMPNAL 
SALNYLHAHKLRLDPVIIFKLLSH*LLWT 
LSAHLHVEFT 


10799 


24700 


A 


10888 


336 


9 


FRGQVIPAIFFPFQTTSLSYFQTGQPN* 
TPFFQKILiCLTRGGGKLLDPQTPGRLRQ 
KREDPLFPGV*NCN* P*SHPWWGTKQNP 
VSKKKKKKLKTSS IFVHRVSMSP I 


10800 


24701 


A 


10889 


3 


362 


GFLLAPLEMQELGTPMNRILQLTIAEQE 
TFLTPALLL P I PHQTYSTASAVPLAKPD 
T*PKDVGI]:jaLQVHFPAQYEDQTDLENY 
NNVKALICYTEDLGQTLMGFCS VQED INS 
LCLTVEQP 


10801 


24702 


A 


10890 


66 


333 


TLPGNIGISFVERVMEVLRPQLIRIDGR 
NYRKNPVQEQTYQHEED**DFYQGSMEC 
ADEPCDAY*VEHTPQGFRTTLEAPS*LY 

TPIVG 


10802 


24703 


A 


10891 


405 


49 


I PWMGTERTRASAQRDVAGPAENPACPL 
PEQCFPGPAEWTGPLTPPGSC*LRPGER 
ERHGPGCWGPRERLSSGQTHPPS PQGCA 
NTCPCHKHQQELLAGGVAVFYKLFYRYG 
WSPKWDL 


10803 


24704 


A 


10892 


2 


197 


PQPLRVLWTAHLAAMAPSSRTSLLLAFA 
L I CLP WLQKAGAVQT VPLCTL F YHAMLQ 
THRAHQLVIDTYQELEETYIPKDHKYSF 
LHDSQTCL*FSDS IPTPSNMEETQQTSN 
L*LPDIPAPGFCPDLPA!:ASKGWCRPWR 
SAMHAFLPRYAPNASRAPTGH 


10804 


24705 


A 


10893 


3 


334 


DQL PEPLKVLWTAHLLAMAPGSRTSLL P 
PYALL*LPWLQEAGAVQTLPVSRLFDHA 
MLQAHRAHQLAIDTYHEFDETYI P*DHK 
*SFLHDSQTSFCLPDSIPTPSNMEET 


10805 


24706 


A 


10894 


3 


341 


LLTPGVSDAICYILDSGYIIMSDTFTAY 
VI G*RFEVNG*Hfl.TVRFAAWPPVALPW 
LGV*WDNPERGTYDGTHEWTVYFKCRHS 
TGGFFIRSNKVNLGTDFVTADKNLYVLD 
Y 


10806 


24707 


A 


10895 


25 


351 


AMI QTRDLQGGRAFGLLKAQQD* RLDE I 
CTQLLDDLKYSNDEDLPSRLEGFKEKYM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

seTu'ence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/5 J 5,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=pos$ible 
nucleotide insertion 














EFDLIGNGHIDIMYLKLMLYKLAVPQTD 

LQLPRSIGEVYSGSGETISYPDFLM 


10807 


24708 


A 


10896 






FSQFKIAQFLKKIPFPGFSPFFQNGGNF 

LKFSKFF?NICRGF?PHPFF*KRGPFFCP 
KWGPQGHIG.^GPPAPQGQKIPPFQVPK 
KI'IGIKGPTPPPGFFFFFFFFFFSW 


10808 24709 


A 


10897 


135 


3 


STLKHR*RPGMVAHACNPSTLGGRSGRI 
T* GQQFKTSLTNMVKP 






A 


10898 


193 


269 


GNKYCLTILLLLLLLSFKRQSLTLLPTL 
ECSG*TRTPGPKGSSLIiNPS*VAGHGGV 
CL*SQLIRRIiRWEDHLSPGV*GHSDPRL 
HHCTLTWATE*DSVS *KTTTTTTTK 


10810 


24711 


A 


10899 


316 


374 


CCCCCCCCCC*CCCCCCCCCCYKETNFE 
QQ 


10811 


24712 


A 


10900 


38 


206 


VYCVLVFTICTLLCNTSLGLFHPEFFFF 
ETESCCVAQAGLQWHHC*SLRLLPHRI I 






A 


10901 


364 


2 


DYQHISPEKHCRPEGSGMVYLMCRKKKK 
RKKKNLSTKIENPQPSFIL*KPRNSAEK 
S*QKEITKISADSLirKKRKRIEKIYIN 
BTMSWFIEILNKIDQPLaRLTMIERKDS 
TKFRNERG 


10813 


24714 


A 


10902 


132 


3 


ATSSPWGYSAYRVAILLFLYFSNKLAFT 
VLYGFV*NYFLQEI 


10814 


24715 


A 


10903 


234 


352 


NFCFFETGSCSVT*AGVIMAHCSLDLPG 
SSNSLTSVSEE 






A 


10904 


250 


356 


TMBMMLDKKQI *AIFLF*FKTGD3CAAET 
TLNINNAL 






A 


10905 


280 


322 


QT*SLILVSLIICIATTNT,I.GI.LPYSFT 
PTTQLSINIiAMAIPL*AGAWIGFRSKI 
KNALVLCTSTPLPTSTMLPVHMMDTSSD 
IHPKI*SIR*QTCI QHRLLLWLKC 


10817 


24718 


A 


10906 


271 


365 


KWRPGWAHTYWPS ILGSRGGWI T*GQE 
FKT 


10818 


24719 


A 


10907 


135 


358 


LRYKLTNPKLTLLTCGLIFLKGNIVNIG 
QCNGVHM*YQHFGRPGQDCLSPGIQGQH 
GQHKESLSL*KSLKISWS 


10819 


24720 


A 


10908 


107 


212 


I KNEGMGQVRWLMPVIPALWEA* VGGTP 

EVSHSWL 


10820 


24721 


A 


10909 


165 


383 


PKNRP I IPLKRSS ISNPGDFQKSLRPGL 
TPMGPHFKRGNYSYDENLENLCPVGGNK 
GPGSH*GLLTCETCRGF 


10821 


24722 


A 


10910 


119 


291 


GGYRFYVKDTF*K*GFWPLTLFirLKPL 
AGHGGACLWSWRLMRLK*EDHLSLGGRG 
CSELWLCTLA*ATE*DPVS*KVRLLALN 
FVYNTKTFSRAWWCMPWLATHEAEVGG 
SLESGRSRLQ 


10822 


24723 


A 


10911 


283 


2 


SDNTTDTFLPFIYKYIFPITWEQGETWK 
KHPRNSRLKKTLRSETIAQIPLKCNLWP 
GMVAHACNPYTLGD*GGMVT*GQEFETS 

LANMARPSC 


10823 


24724 


A 


10912 


1 


360 


PHAFGTMKWVTFMSLLFFLSWANSRGGF 
RQNAPKSEVAHRLKDWGKKNFKAWG*MA 
LAQNLKQGPFENHGKLGKEVPEFAKPGV 
ADDAAENGDKWIiNTLLGNYLAPVAAVRE 
TYGERAEC 


T0824~ 


24725 


A 


10913 


270 


361 


SKTWP6TVAQTCNPRTLGS QGGW I T * GQ 
EF 


10825 


24726 


A 


10914 


157 




VFFQLSRLAWKGFSRFFFVFETESHSVA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ingto first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Senne, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrnsine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














*AAVQWRDLGSLQAPPPGFTPFSS 


10826 


24727 


A 


10915 


1 


357 


LEEWGPEREFAEEEKEKKNTKKKKIKNS 
TPPKKPARKGEEKPGPFKHAITTIFVAR 
IPLFCLKGFLWPSC* I IGRSSPTPGSKG 
HTGAPNCPPCPLAAFPKDVPNFNPKMVE 
AVKNPIL 


10827 


24728 


A 


10916 


102 


471 


PSTPIILTSSYPHYVKSSVTSTFIISLF 
PTTIFMCLYQEFIISN*HLPTTQTTHLS 
LSFKLDYFLIIFILLTFLVTWSIIEFSL 
*YINSNPNINQFFKYLLIFLITILILLT 
ANWLFQLFIG 


10828 


24729 


A 


10917 


2 


138 


RE PTMVLS PADKTNVKAVWGKVGAHAG* 
YGAWLFTfttiTLNVISD 


10829 


24730 


A 


10918 


178 


393 


LVLCTVLLMWRFHFHDSLSILKftQDVTE 
NLSNPTVLRGET6PSAVAHTSNPSTLGG 
*GKWITQGQEEETSrA 


10830 












TAWADWGSNETP FLLKLPKKLT* GGGVC 
LE FQVLGRVEPKNPENLGNQGFN* PKFR 
PWTSTWGAKQNFV*KKKKKNKNKKP*NS 
RHTPPP 


10831 


24732 


A 


10920 


221 


243 


1 1 IGSPLIDIKI SKCFLKIVAFGQAQWP 
VI PAliWEAEAGRSPEVGSPRPA* PGSLK 
VHVDNNWESIN 


10832 


24733 


A 


10921 


33 


292 


GTMWLEIRHAVEVQCVLVSELVIPTSGD 
KPEQC*DHYI.IT* YLILGKWGI ** ISGA 














FI 


10833 


24734 


A 


10922 




3 


KLAVYGSVCL*SQIiLGRLR*EDH]:jNIX3H 
GGCSEP 


10834 


24735 


A 


10923 


106 


2 


KLAVYGSVCL*SQLLGRLR*EDHLNLGH 
GGCSEP 


10835 


24736 


A 


10924 


195 


2 


ASLFKSQMIEAFIYLFIYVFIFETGSL* 
PMLVCSGVITAHCNLHVLGSGDPPTSAS 
QVAETMVL 


10836 


24737 


A 


10925 


127 


2 


YVCILKNLKGRPGMVAYACNPSTLGGQG 
GWIT*GQEFKTSIj 


10837 


24738 


A 


10926 


639 


282 


FFLSIKGWVQQFMPIIPTPWGLKQEDHL 
RPGLRDQPVQNSKTPSLLKI*KLARRGG 
ACL * SQLFRKPR * ENCLSTGELKPKGGI 
F I RL VQNKI PTTRG EKRKQSHTGS YQC P 
KIKKKG 


1083S 


24739 


A 


10927 


1 


363 


ALLTQALTCRQAGAEAPHAGATPSLMPP 
SLPQGFRDCSPSDAAYTMEMTIDKKQNQ 
VI ILFKFKMGHKAAQTTRWINNAFGPE I 
ANKGTVQWRFKNFCKRDESREDDE* YAQ 
PSKVATDQL 


10839 


24740 


A 


10928 


53 


379 


TEAELLTLYLLPNALLNHFTSPPLMFAD 
RRLFCTNHID IGTLYLLFGA* AWLGTA 
LSLLIRAELCQPGNLLCNDHIYNVIVTA 
HAFVIIFFIGLPIIIGGFGN*LFPL 


10840 


24741 


A 


10929 


180 


350 


EPMAKGKTESPGPKRCGP* I*WISQRG 
TLRFRGAGLFFMGKFLRLGENLLEI PRG 
A 


10841 


24742 


A 


10930 


361 


379 


RRYWWLG* VQWLMPVI PALWEAKAGRS P 
KAENL 


10842 


24743 


A 


10931 


121 


543 


HRNTGSTHASAHAYHI VHTNP * PLTGAL 
SALLMTS SLAMG IHFHS ITLL I LGLL TN 
TLT I YH* WRDVTRESTYQGHHTP P VQKG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 

eth 
od 


SEQID 
NO: in 
USSN 
09/^15,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Predict- 

location 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 

nucleotide deletion, V=possible 
nucleotide insertion 














LQYGIILYIT**VFFFAGFF*AFYHSIL 
SPTPQLGGHWSPTGITPINPLKDPLLNT 


10843 


24744 


A 


10932 


227 


339 


VGGVKSVQLLIJSrCHFSR*MKKKKKKKKK 


10844 


24745 


A 


10933 


64 


409 


DQRNKSAEffiRAHLKKKKiCKKBCKKKKKKK 
KKKKKKKKKKKKKKKKKKKRGEL*KKIN 














WAFFLPLLKRKFVGENFLKKNFFWGGNI 
GGQISSKKTfRGGGK 


10845 


24746 


A 


10934 


2 


362 


ITOI,SGPYPEKVGYTLPVLMiilPLAQLVIY 
STIFAGTLITALSSH*FFT*VGLEIDML 
AF I P VLTKKINPRSTEAAIKYFLTQATA 
SI ILLIAILFNWILSGQ*TITSTTSQYS 
SLIILRAM 


10846 


24747 


A 


10935 


154 


3 


GCPLSPLLFNIWKVLVEAVMQEKEIKS 
IQIGLEVK*SLLTDNMILYLGN 


10847 


24748 


A 


10936 


250 


363 


TAWPGYLYSFSFLY*ETTKIWLGAVAHA 
CDPSTLGGRG 


10848 


24749 


A 


10937 


199 


339 


NVLGIIiLVFRIIVEIYKCXSRLWLGTVAH 
TCNPSTLGCQRGQTT*GEE 


10849 


24750 




10938 


130 


240 


KNECiDPRDL*DNDKWPNIHVieVPEEDK 
DNGTERVFD 


10850 


24751 


A 


10939 


35 


235 


FI LVENTKKMCI CPLLNMI YAI LYLFVP 

SVFL*EENKQ*GITEKKKKKKKKKKKKK 
KKKKKKKKIKPGGL 


10851 


24752 


A 


10940 


179 


12 


DQPGQHNKTLSLQENKNKSSWVQCCV* P 
QLLGRLRWEDCLSPGGRSCEEPCSHSG 


10852 


24753 


A 


10941 


297 


278 


INQDNELTLINQSSKQIKHIOJINQTLRT 
KMNENLFA*! lAATILGLSATVQI ILFP 
PIIiIPTSKYLIllNRLITTQQ*liIKLTSK 
QM I TI HNTKGRT* SLLGGD 


10853 


24754 


A 


10942 


3 


318 


FGGGRGVRMYPDAFVLSVLLPSRLLFPH 
LFPHLFPLPPFLGIiAPYCFLTL*YF* K* 
* LYYSQS I IFFFYREMKNKLLVFWAINL 
FFLYYYNYMEMWTLICYWKN 


10854 


24755 


A 


10943 


190 


3 


KLSHKQSVYFKTPFI*KNHVLFK*KNLW 
VYPNDKFFFFFFESLRLSPRLQCNGTIL 

AHCNLR 


10855 


24756 


A 


10944 


281 


1 


KKWLFSSYSSLYGNGLB1.IPPFLRVSGF 
GKGFWKKFFFLRARGALFWGSPLKGFFL 
GFFWVFFFF*DGVSLLLPRLECNGI I SA 
HCNLRFPGS 


10856 


24757 


A 


10945 


169 


319 


NGVEDAFKNMVIiGWAQWVTSVI PALWEV 
KVGGSPEVRSFRPTRPI*KYGFRIiGTVa 
YICNPSTLGGQGRRIT*GQEFQTNPflNM 

VKPC 


10857 


24758 


A 


10945 


312 


1 


FARPGLLKSWDFQP * PLDPVWGFFFKRF 
PENWLEFEIFPFLI IMAKRKKLSKSLFP 
FPFLG*FKD*GQKTPFFFFETVSLLLPK 
T.ECNGAISAHCNLCT.PGSSD 


10858 


24759 


A 


10947 


208 


329 


I FTGDRSRNNRIG*ARWLTPVI PSLWEA 
EAGGSLEPKSLR 


10859 


24760 


A 


10948 


116 


374 


FYFGYLLFFFCFFEMAPCSRG* SWSARG 
QSFCNLIFPGSSDLYVSAFRVARITGAR 
DHACIIIVFWDGGCVDTITSGLAGWLR 
RG 


10860 


24761 


A 


10949 


264 


434 


QLAFCTDTLTSVREQCEQL * KCVKARKH 
IELCDEQ*SCRSHTD*DCTDELFDFLHA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GlLtaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysinc, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














T 


10861 


24762 


A 


10950 


226 


429 


NFGFRLDKTVFFSKIRINAWAQWFMPVM 
PELWEAKAGGIT*AQELETSLSNIVRPW 
LYKKFKN*VWWC 


10862 


24763 


A 


10951 


188 


433 


YSABWKIDLGI EVFWVGKMT YKQ* FFWF 
VFKFIiFF*RQGRTLSSKLEWMSVITGHC 
WLDLLGSRETFASAFQVTWTIGSCHS 


10863 


24764 


A 


10952 


298 


445 


LFSKCSSKSIKEEGPGWAHACKPGTLG 
GRGGWIT*GSEFESSMNMVK 


10864 


24765 


A 


10953 


244 




KTPQGLWFCLFKKSPLKGIKGVSMGVL 
ICFFLK3RINLFNPRGLGQRKNFFFFFF 
ETDSYSVAQTQVQ*HNLGSLQPPSC 


10865 


24766 


A 


10954 


199 


3 


QETKKEQNKENKQIK*RSTRKKHRQGTN 
KTKERGERQTPPVGNRQTPTLGIHARPR 
RRATTSPHA 


10866 


24767 


A 


10955 


157 


3 


YSYTFSFIITTSILIIQTLSGHGGACL* 
SQLLGRLRQDNHLNLEHGGCTRA 


10867 


24768 


A 


10956 


263 


357 


GLSLKVLTQWLGAVAHACNPSTLGGRGG 
*IT 


10868 


24769 


A 


10957 


333 


2 


KWKADMLIPLVIKSFFLGIFFKMFTNSK 
RYIFLLQLIjAQHSSFFSFIRCMEL*WPY 
PIPLCYGQSNQPSV*LAYCRDLFNELIF 
FFFFETEFHSCCPGGISAYCNLCLSC 


10869 


24770 


A 


10958 


130 


228 


GSSFLGGSPSVAQA6VHLPDHGSLQ*DK 
SPSVQKMSKWIGCSSACL*SQLLHRLRC 
QGRLSP*GQACSEP*SGRCTPAWATEGD 
PPKKLLPYTTDS 


10870 


24771 


A 


10959 


250 


3 


VQACSITGALYQRRKAEDEPQTARPISG 
FTTSIAMRLMLLTCSGHIWPVADLAFSG 
IMP*GLFFPSRQNLALSPRLECSARA 


10871 


24772 


A 


10960 


199 


326 


VLGRMWSNQKNYTLLARM* KIK* *UNTR 
CWGGCGATRR IIHC*QECRI* KCQP QSL 
WRTV*QFLNTLNIR 


10872 


24773 


A 


10961 


323 


3 


LLFFHLPDNWQH*YVFCHYNLSFLQFYI 
N*IIQLWFCVWLLSMSIILKSHPCHFS 
K*LSTVT?FFRQSRS VSRLEWSGAI SAH 
CNLCLPDSSDSCASAS*VAGSC 


10873 


24774 


A 


10963 


120 


3 


PFFFFFF * EMRSCFVAQARVQ * CDQS SL"' 
*PSTPGLKQSSC 


10874 


24775 


A 


10964 


168 


441 


LTPVIPAL,WEAEAGRSPEVRSSR*SPP 


10875 


24776 


A 


10965 


19 


207 


APLKLNVAMELSLGQWDVSRSNLWEIPL 
KKGDTGRVQWLTPVI PALWEAEVDRTPE 
VTS VTRC * PLKLNVAMELSLGQWDVSRS 
NLWEI PLKKGDTGRVQWLTPVI PALWEA 
EVDRTPEVTSVTRC 


10876 


24777 


A 


10966 


209 




TKSALSNMWTTHIGLCKNFCIVL*DRV 
SVTQIGVQWHDLSSLQSLSPRLKDPPTS 
ASRYAGITGMHLV 


10877 


24778 


A 


10967 


109 


1 


DYLRSGV*DQP6QHGKTSSLLKIQKliAR 
LVGPIISG 


10878 


24779 


A 


10968 


141 


2 


aLflMLPRLENIIFRPGTOUIACNPNTLG- 
GRGGQIT*GQEFKTSLSC 


10879 


24780 


A 


10969 


175 


348 


LVYECKHILCVFNTSLFFPH*TCYK*RG 

ViWLGEVAHACSPGTLGGRGGWIMASGD 

RD 


10880 


24781 


A 


10970 


108 




KT KKLHMRS G WAHT C1'I?STLGG*GGWI 
T*GQDSC 


10881 


24782 


A 


10971 


184 


2 


S FLWKLHLRGVPGRVRCQSAP TRGASQL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=PlienyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \^possible 
nucleotide Insertion 














GYLGVRDPLEEAV* LFSDLKLCAGKTTT 
LFLV 


10882 


24783 


A 


10972 


245 


339 


SQISGPGTVAHACNPSILGGRGEWIT*G 
QEF 


10883 


24784 


A 


10973 


295 


340 


ICRVFF*ARKKNPFLSPLWGLTLENPLGF 
RLFLNP FLGLKI SQFFFPRQI * PIiAFFF 

WGTLPRPLNLKFGPFFFFFFPFFFP-*DG 
V3FCCPGWSQARRYIEINISLSTCPRA 


10884 


24785 


A 


10974 


IS6 


340 


RFYLTIRFKGHPPNEPIQ*SFFVLFCYC 
FDTESRSVTRLECSGSNSAHCNL 


10885 


24786 


A 


10975 


253 


491 


FLECTSLLKRQRVHPGQHSKAPPTKELQ 
TTGRGGALLQSHLLRRLRQEHCLGPGVG 
SYSEP*LHHCTPAWVIEGDSSICK 


10886 


24787 


A 


10976 


141 




SFSFFKFSPT6DMI6FF*HFFFFLRRSH 
SVAQftGVQWCHLSSLKPRA 


10887 


24788 


A 


10977 


3 


138 


HEETGFHLVSQD6LNLLTS*STHL6LPE 
CWDYEREPPP6REGDF 


10888 


24789 


A 


10978 




323 


YESRSATQftGVQWCELGSL* PSTSRFQ* 
FFCLNLSSTWDYSGLAPS 


10889 


24790 


A 


10979 


183 


3 


IKILFPFFFLRLSLALLPRLECSGTFSA 
HCNLCLLGSSDSAS*VAQITQTTTPSPM 
QLV 


10890 


24791 


A 


10980 


309 


55 


KNFGPNWVKFLGGKGGEMflFLGKFFFPF" 

FFFFEKESWFVFQG*MQWGDFRSLQGPP 

PGVTQISRLGKIGGPLFPKKKKKRER6N 

M 


10891 


24792 


A 


10981 


637 


838 


SQHLGRPMRVDQLRPGV*DQPGQHGEMP 
S LL R I QR * AGHGGTHL * S Q VLRRVRQDN 
CLNSGGGGCSE 


10892 


24793 


A 


10982 


275 


2 


RNRTLKMEFLSWFWGFWNWLLKMIRPKN 
VKDSTSKSMENTDSPWHELFKELGKINA 
FDTPDSLLVRGKFSDSIHNTFDHM*RTK 
EYNEARA 


10893 


24794 


A 


10983 


98 


346 


GHGHATLRGLCVLTFSFHITALSVSGTN 
DAEDCCLCETQKP I CGYIERNLLYLLIK 
DVCRVPAW*WERVYSLISRYSLWRD 


10894 


24795 


A 


10984 


30 


410 


LPEFTGRPKRTRTRGFSTNHTDIGTLYL 
LFGA* AGWGTAVS LL I RAELGQ PGNL L 
GMDHM YWAI VTAHAFE I I F F I VL P I I I G 
GFGN* LVPL I IGAPDMAFPR INWI CF* L 
LPP SLLLLLASAIAE 


10895 


24796 


A 


10985 


141 


360 


QTLRTKMNENLFASFIAPTILGLPAAVL 
IILFPPLLIPTSKYLINNRLITTQQ*LI 
KLTSKQMITIHNTKGRT 


10896 


24797 


A 


10986 




347 


HELRTKVKEHL IAS FMGPTSLGLPAALL 
IIL*PPLLIPTSKYLISNRLITTQQ*LI 
KLTSNQiyiITIHNTKGRT*SLILESLIII 
I ATTNLLGLLP YS FTPTTQLS IWLAMAI 
PL 


10897 


24798 


A 


10987 


46 


317 


KSMTPIRKINPLIKLINHSLIDLPTPSN 
I SA* *NFGSLLGACLILQITTGLFLAMH 
YSPDA*TAFSSiaHITRDVNYG* IIRYL 
I-IANGAS 




24799 


A 


10988 


245 




VSCLLEGKLTNRKDIHTKNPSVHHHHQR 
PKVDKTTKMGKKQNRKTGNSKKQTASPP 
PKK*SSSPATEQSWMENDFDELREE 


10899 


24800 


A 


10989 


255 


1 


SCLPWAHWISVQNQHKKTFLVIRTFFPI 
GSFHVT*LL6PPLILISPPRIFFFFETG 



1211 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 

last amino 
residue of 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G!=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SPTVAEAGVHSGAISAECNLRLLGSSNS 
A 


10900 


24801 


A 


10990 


166 


307 


GEKPWGPLIWGGLVGRSSWASCQVLELL 
*EMVFRHVAQAGLKLLSSSNPPASASQS 
AGITGVSH 


10901 


24802 


A 


10991 


109 


330 


TNQFKTKKEREAGKKKVKELERERERGK 
KRKEQRKNBKERR*P*NMD*RRIRER1IH 
SFDVYEFLICGLLNLLHV 


10902 


24803 


A 


10992 


244 


326 


GRYLKGWLDGPA*aWLGT7LLSLLIRPE 
LGHPAILLGIDLISSVIVTAHAWSILLV 
RVIPIIIAGFGN*LVPLIIQAPDMaFPR 
INSISF*LLPPSLLLLLPPPI*QMRAGV 
GEREVRVR 


10903 


24804 

24805 


A 

A ^ 


10993 

T0994 


101 ; 2 


KTFWARFVGTCL*SQLLRRPRQEDHLRL 
GGRGC 

TPSLLFFVNI ICFCLNPGGGGYSEQKLC 
LCTPAWVTE*NSISRPSQNNSRQSRQV1T 
HLF 


10904 


10905 


24806 


A 


10995 






KEERKNPRAIRW*PWGFFFPLFKEISL 
FLFVFRFWHGYPFPGAFFSTQKWLVF*T 
MVLFFFFFS«VSLLPRLECSGViaAHCN 
TFLPGSSDC 


10906 


24807 


A 


10996 


184 


379 


LCGMVYHFPS ILLLYL*CKLFFSPKLEF 
SSCCPLECNGTVIAHRSLRLPASGDSPT 
SASRVAGHGGTCL 


10907 


24808 


A 


10997 


136 


3 


GRVDGQHIMTHQSHaYHIVKPSP*PLTG 
ALSALLMTSGLAM* FH 


10908 


24809 


A 


10998 


274 


363 


pwayaepltlhdatysct*sliraSftl 
fia*ttrlri.lp *s ftpttqlcmnlama 
ipl*avapdigyssmitnalshllpqct 
ptplisilgiietvslliqpitlgvrrt 

ANIT*CHLLMHLIQSATLSISTI 


10909 


24810 


A 


10999 


305 


1 


NLHTTPSQIPLT*PLPNINIHNLHLTTT 
Q* IKYLILIPSNLFHNKPNLKTAYSVPP 
SPNLHLPSSSNSPASASQVAGKAGaRHY 
AWL I FVFLVETGYSLLV 


10910 


24811 


A 


11000 


154 


2 


IWGILQSRRCGSRL*SQHFGRPRQftDC 
FTPGVPDQEYFTM.PEWFFLV 


10911 


24812 


A 


11001 


142 


367 


GVCLYQRISIYIKKTKKLSQGLFYFLLL 
LLLF*EGGSHSVTKLKCSGGVSAHCNLC 
LLAS S HP P TS S SQVAGTTG 


10912 


24813 


A 


11002 


60 


385 


GNTLLSHTLRAHLIENLHAEFIDRTILG 
LPAA*LIILFPPLLIPTF*YLINIRLMA 
NRH*LIKLTSERMITIRNSIGRT*SIIL 
VSL 1 1 T I ATMSTLLGLLP YS I TRTT 


10913 


24814 


A 


11003 


214 


83 


SKKSASPLTFNIVFQVLDNSVREEEIRY 
TEMEKEEIKLS*FVD 


10914 


24815 


A 


11004 


241 


368 


SYLL* PS P * PLTGALS ALLMTSGLPMRF 
HFHSITLLILGLLTNTLTIYQ*WRDVTR 
ESTYQGHHTPPVQKGLRYGIILFITLEV 
FFFAEFF*AFYHYSLAPTPQL*GHWPLT 


10915 . 


24816 


A 


11005 


3 


396 


HEPHALGMPLTADLPSMASCSQTSLLLL 
LHLLHLP W I * EAEAYQAflG CS KLFDHAM 
LQAHRAHQLTIDTYQEVEETYIPEDHKY 
SFLHDFQTSFCFSDSIPTPSNTEETYQK 
SNLELLRISLLLIESWLEP 


10916 


24817 


A 


11006 


156 


335 


FGCCCCFLFF*I.VYCCYVMMLYYIYVLI 
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SEQm 

NO: of 

nncleotide 

sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

locttion 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possibIe 
nucleotide insertion 














SFGFYF*YWIM*YFYRLVYNIYLVLFYL 
LYMS 


10917 


24818 


A 


11007 


107 


2 


KCSSSKHFTKEDSQITNKHIEKCSS*LL 
VEEMQII 


10918 


24819 


A 


11008 


129 


412 


KTRSGFKKTPGQKGKTPLLKKTPKITGP 
GGGALKPPFSGGFRPENPLNLRDKGSSG 
PKTPTFFSPWGKK+NFFQKKKKNLLYPG 
KIFFFWMKNI 


10919 


24820 


A 


11009 


63 


268 


CTFKFKNHCSTIQSTT*LDRSEKKKKKK 
KKKKKKKKKKKKKGGPL*KNFSLKGG 


10920 


24821 


A 


11010 


154 


422 


NEFSFFFFFWKPKPPFFAQVGdQYRNWG 
*LKPRPPWLKNFSGFASGEAGITGGVPH 
PG* ILVF* VKAGEHIENLVWNSKTGNQP 
ACASQ 


10921 


24822 


A 


11011 


23 


375 


TRYLTKIKKKKKKKKKKKKKKRGAPFLK 
TPWGAPPFPGSArFrPFFPGGPF*TPLG 
FFWETLFFWGGKILGHLSPKILPFWGKK 
NFFWVKGGKNLLNSPFLKIFFLGVFFKK 
FFPPG 


10922 


24823 




11012 


151 


361 


ACTTTPSSFFSNIDTVSLCCPG*VPGHN 
NSSLINFPSFLRSWYYRHAPPHPftHFFP 
ILIRSRYVAQAGLEFLRLRDTPISDCPC 
TVHYPPSLAN*SVFPFPLTQSPVSLPRL 
KRK 


10923 


24824 


A 


11013 


146 


378 


NFWFSIEFLHI CY* I YS FI OiIKVFL* 
* KKKKKKKKKKKKKKKKKKKGENLKGAG 
GGKKFLGGGKNNSFFRNGGCF 


10924 


24825 


A 


11014 


82 


264 


TELFLYTI CNWSAI LYILC* RNKVFLS F 
KKKKKKKKIERKI 














10925 


24826 


A 


11015 


381 


2 


GRTALFTLLKGP IHRPQMFFTPLRVFPG 
SYTGISIRQSGFFFIKRVIFFRL**KRL 
FLLPFLLPFFSFFLKIGTYSVRQECNGM 
1IDN*NFELLGRRNPSTSASQIAGNTSA 
Y*HTQLVFLCEDGI 


10926 


24827 


A 


11016 


1 


251 


PNCI VHSLRTOTLNFI ILLBLCFTFE * F 
LNKNLIFESQKKKKKKKKKKKKKKKKKR 
GftPLKKTPGGPKFYPASKKKISPQKGG 


10927 


24828 


A 


11017 


234 


1 


DINIKGSWVKGIPKPSVLSLQLFCNSKI 
FPN* KLI KKKWPNE PGVMVHVCNPSTSG 
GRNGQMA*GQEFKASIiftNMAKP 


10928 




A 


11018 


107 


342 


FQLCIGAIVHCFLFMKQHSEYKSTDHEA 
SSKCHTVQGSSQLSTSAFKQK*THKKKK 
KKKKKKKKKKKKKK * KKKGGAS F 


10929 


24830 


A 


11019 


264 


1 


CFKPLLFKYPGI IKAVNLPVSTASATFH 
KF*YVTFSFS* KKFLCVGGKTGSHSITQ 
A3CSGTIKAHCSLELPGPGDTPTPTSRV 
AAEA 


10930 


24831 


A 


11020 


1 


288 


RTRGTSS RSRAATLFFFFP FKKGGFFNG 
NLPTTFGKLGNKKFPLNLFPGGPPLPHG 
GLKKGPGAP QVPP LFLKKP QTSPP*GLR 


10931 


24832 


A 


11021 


115 


287 


STCEISFGCSRGCCWDYLMWEFSHFYF 
*FLETECCSVA*AGVQWSNLSSLQSLHP 
E 


10932 


24833 


A 


11022 


186 


1 


RPKVDKTAKMGRDQSRKa*NFKl!TQKftSS 
PPKEYNSSIjRRBQNWMENEFELTEVGFR 
KSVITN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
efh 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 

location 

correspon 

last^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, IfePIienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=ValiDe, 
\V=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10933 


24834 


A 


11023 


180 


353 


IFCFCIFSRDGVSVC*PGWSQSPELMIH 
*NTKIS*AWQRVPKIPATQEAEAGESLE 

PG 


10934 


24835 


A 


11024 


155 




PQARRGGTCLQS * LLRRLRLRQEDRLS P 
GV*GYSEL*SCPCTPAWETERLV 


10935 


24836 


A 


11025 


22 


325 


KLI LLYDPQLTFI FLNTRAWAHCAFCC 
WPIHFVFGNYGFVFSCLFTFIFVYSGKA 














RGLKNSWGGPIFWGFPK 


10936 1 24837 


A 


11027 


127 


259 


GQVQWLTPVIPTLWEAMVGGSTKLRSSR 
PFFSTP*PCIVMILYG 


10937 


24838 . 


A 


11028 


37 


408 


lASGALFFFFKAAGKRGDFKTEGAYQRQ 
RTIFKNKKRARREKTGRENLRGNYKNMG 
RGLKTPRGALGGPYLDKKGPLRGKGPFQ 
GGTLLGGGPKIKIQGTL*IRR]miPYIR 
RYNRFKKRQKNM 


10938 


24839 


A 


11029 


146 


3 


LLVLKDLPRIMVRSAPTYIYIYTYLYIY 
IYIYTHTHTHTFIT*VLFC 


10939 


24840 


A 


11030 


268 


435 


LMGIilNETN*F*KVE*WLGAGAHTCNPK 
TLGG*GGRIA*GQEFETRLRNIVRP 


10940 


24841 


A 


11031 


322 


2 


KLPHLQQQCJVKVWCYKPFLKNdieiC3HGG 
VCL+SQPIiRRLRQElIHSNPGQCTSAWVT 
QRDSVSKNKKKRKMEKYFSSVRQNKPVK 
HTKRMSPTKADTKSTHCGGGRV 


10941 


24842 


A 


11032 


251 


380 


HKRLHTIYYFVPLKKKMGPGLVGHTYNP 
STLGGQGGQIT*GRE 


10942 


24843 


A 


11033 


64 


324 


FWFFFFFFFFFUIPPGGGKKKKKPPPPL 
GKKKKRGKKGGEKFFSPPPHGGGGGPKK 
ISLKRVLKFF*NPPGGTBaCGGGPFLIPP 
GSP 


10943 


24844 


A 


U034 


420 


534 


CPGWAHAyiiIPSTLGS*GRQITIDREIE 

TILANMAKPH 


10944 


24845 


A 


11035 


196 


1 


KNGHSLPPGPQKGTPFPSOCKKTPKINLS 
ELGKSKAYSWPGTWHACNPSTLGG*GG 
WIN*GQEFE 


10945 


24846 


A 


11036 


10 


248 


PSDR* LFSTNHKDIGTLYLLFGA* AGVL 
GTALS LL IRAELGQPGNLLGNDHI YNVI 
VTAHAFVIIF 


10946 


24847 


A 


11037 


32 


405 


D YVS KRKEKREKKHKVI LE TS I S SHLVE 
WMLCSRYHPLIKKRSSVRILIALLCPQK 
VRDMS * GIGSKKQCWDSWLS IWIKMESD 
PFLIPYPKINA*RI,KDLCERIHLKI IIK 
REYLHDYRVKKV 


10947 


24m 


A 


11038 


337 


448 


JQIPRGFFGKNPFFWGGPFGGPPPPKKMG 






FGEKKKF* FKRSKKKKKKKKKKKKKKKK 

KKKTG 


10948 


24849 


A 


11039 


307 


415 


RFVCSTirVLRDLSSDRSNPGRVIiSTSN 
SSLK*KKK 


10949 


24850 


A 


11040 


213 


403 


VHRGIKYYFLNKLPDFPRQTFKK*G*RL 
GAVAHACDPSTLGG*GGWITLGQTFETS 
LTRMGKS 


10950 


24851 


A 


11041 


104 


2 


SAFFF*ETESCSVAQAGVQWCDLSSLQL 
PPPGFK 


10951 


24852 


A 


11042 


177 


3 


AYTAYICVCIHIYLYK*MYICINVHVCK 
I T YL Y I YKYLKYNAI YLHI GFFSFLHTI 
S 


10952 


24853 


A 


11043 


314 


395 


TVYILNLSINSVQSVLLSVFHRLRHQPR 
GRVQWLTPVIPALWEAQAGGSP*VRISQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IKTPA 


10953 


24854 


A 


11044 


310 


403 


RGKGGKKPRNPGGGKS * ALTKKKKKKKK 
KKKKKKKKPGGALKKKPRGGQKKTGGGK 
KKFLPKRGAKKKPPGDFGKKNQIGGGEK 
RGKTPPKKKKP*GKKKNLKGKGGKKTPK 
SGGGKKFPPRW*KKKLPPGG 


10954 


24855 


A 


11046 


181 


3 


EGPLSLSLLFFFF*RQISSLLIJILKYSG 
TI lAHCS EKFPSSElSPPTSASQaASTTG 

VHH 


10955 


24856 


A 


11047 


392 


41 


KESRSLSQGGREKGPFKFLAPPAPKFKR 

I FRPHPLKKLGPKGPPPSPS* ILSLKKK 

GGPPILARWFWNSCPQGFPPPGPPKRLG 
FKGGPPPPPLF*K;<KJPFLGGt:NQKVKN 
FFFFF 


10956 


24857 


A 


11048 


147 


307 


TVI FVFLLRKGLTIiLPRLESGMIMAHLK 
S * PPGIiKQSSCHSLLSSWDYRBRRG 


10957 


24858 


A 


11049 


118 


1 


HGKIPSPLQ*HK* lAGYGSACLQSQLLR 
RLRWESRSISI 


10958 


24859 


A 


11050 


54 


284 


RKIRRGGLHLWSNLLGRmWEDLLGRLR 
WEDRLSPRRGGCSEPRLCHCPPAWATE * 
DPVSKKKKKSLFVEKPQGGG 


10959 


24860 


* 


11051 


130 


340 


HNMHFAAHGSRINFDFFFFFFFFERGFP 
FCPPVGRAGTHFGIiLEPFPSRV*KQYYW 
LGTVAHVCNPSTLGGHVGES*GQEFKIS 
liANIVK 


10960 


24861 


A 


11052 


140 


337 


NIIMFFFFFFETVSFLLLGLEGKGTIWV 

N*NLCLPG* GDSPLGCS 


10961 


24862 


A 


11053 


197 


428 


IRSINWPTVASQYSSEWKSHIPLILNQN 
LDM I KFS EEGMLKAK I GLLRQTVGQ WN 
AKEKFLKKNQSATPLSI *IRK 


10962 


24863 


A 


11055 


180 




AI lALLYSRLGDYARRFHLRRKKEQWQG 
MVTHVYNPSTLGGQGGRTA*TQEFKSSL 
GNIA 


10963 


24864 


A 


U056 


158 


426 


FFPLPLPLPTVSLFPRSPSDAEPKLDCT 
AAISAHCNLPA*FSCLSLPSACNCRHAP 
PRLTASASRGAGIADGVSFTQCSMVPRL 
ECSGV 


10964 


24865 


A 


U057 


220 


421 


KKKKKKKKQNKKDPGGAVYKKIP 


10965 


24866 


A 


U058 


162 


464 


HYY YYYYYYY* SLVFM 


10966 


24867 


A 


U059 




134 


APENRVD PRVRKTLVPL I LP 1 1 TTLANP 
CKKD*YPYYVKISIAC 


10967 


24868 


A 


11060 


382 


273 


SSCL*PQLLRRLRQEDYLS*GA*GCNEL 
**CHCSPAWVTEQDPVSKNIHTYIHSQS 
INYGSMIGYIHEQKGIIADHKPIIAEPE 
DSTYPRDHT 


10968 


24869 


A 


11061 


298 


378 


SCSASPCCSSRHGWSPCG*QTCL1.LAIA 
LRCLPWLREAGALHTGPGCRLFDHAMLQ 
AHKAHQLVIDTYQEIGENYIPQDQKNSF 
LLESHTSFCFSDSITTPSNMEETQQKSN 
LKLLR I SLLLI ETWLEPVRFLTRIVANN 


10969 


24870 


A 


11062 


39 


486 


RPTRPDAYHIVKPSP*PLTGAIjSALLMT 
SGLGME FHFHS ITLIiI LGLLINTLT I YQ 
*WRDVTRESTYQGHHTPPGQKGLQYRIM 
LFITSEVFFFAGFF«AFYHSSLAPTPQL 
GGHWPPTGITPLNPLEVPLLNTFGIIiAI 
GGSIT*AHH 
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SEQID SEQID 
NO: of NO: of 
nucleotide peptide 
sequence | sequence 

! 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to jfirst 

residue of 
peptide 


Predict- 
ed end 
nucle- 

iocation 
correspon 

last amino 

residue of 

sequence 


Amino :icid sequence (A=Alanine 
C=Cysteine, D=Aspartle Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T^hreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


10970 i 24871 


A 


11063 


269 


429 


atwqklplq i qknypglglvahacnpst 
lggqggrit*gqeflnqpawm™th 


10971 ! 24872 

1 
1 


A 


11064 


248 


55 


I lEGQAQWLMPVIPALWKAEAGGSPEVG 
SSKPA*PCDPPASASQSA6ITGVSHRTQ 
LILNS 


10972 1 24873 


A 


11065 


176 


-j 


LFPPPKKVPPWGMTQFKS ICLYTTHFFF 
FETESRSVARLECR6TISPHCKF*LPGS 
RE 


10973 1 24874 


A 


11066 


2 


397 


MVIWTAHLRAMRPGSRTWGLLTFALLCL 
PWLQEAGAVQTVPI SRLaDHAMLQAHEA 
HQLAIDTYQEFKETYIPKDHKySLLNDS 
QTS FCDSDS I PTPYNMEETQQKTNLE VL 
RMSLLEIDS *LRPAQSNKR 


10974 


24875 


A 


11067 


172 


399 


QILYL*KyAYTVTMQHLSHPIL*KKKKK 
KKKKKKKKKKKKKKKKKKKKTGG 


10975 


24876 


A 


11068 


192 


3 


Pia,TPMLKLIVPHIILLPLT*LSKKHII 
* INTTTHS I IIS 1 I PLLFFNQITSNLFS 
CSPTFC 


10976 . 


24877 


A 


11069 


204 


345 


DKSGEHSKTSSIQKNIjRLGAVTHTairPR 
TLGGQGRWTT*SQEFQTSP 


10977 


24878 


A 


11070 


3 


416 


HELPQPLHVtWTAHLGATaASSRTSLEir 
DFALLCLPWVQEDGAGQTVP * SKLFDHA 
MLQAHHSHRLGIDTYQEWETYIPEDRK 
L3FPDDCHTYFCF*HSIPTPSHLGETLL 
TSNLELLRICLVLIDSWLEPARILTS 


10978 


24879 


A 


11071 


108 


2 


PSPFFFETESCSVT*AGVLWRDLCSLQA 
PLPGSSC 


10979 


24880 


A 


11072 


400 


278 


KVK*LRQENQWNSRGRDCSEPRSRHGTL 
PWTTEQDS I PKNKTKP YKI S 


10980 ! 24881 


A 


11073 


139 


3 


KLAGCGRAHLSFQLLRRLRREDHLSPGG 
*GYSEPWSRYCTPASRA 


10981 


24882 


A 


11074 


17 




DHVIGQPRWLMPVIIALWEAEPGGLLEP 
KVQDQPGQNCKSLSLLTIKKKRRENFPG 
RGGTQLCSQLLRRLRVGCCIDPKLHNCL 
PAWMTERDPTFKRE*KKNDVLEAKNFFT 
ILTFIWTTKEKMVAQRIGSLFHG 


10982 


24883 


A 


11075 


412 


1 


QFFFLGVCCPTQIFGVGKKGEKKKKKRG 
FPPRVFFFFNPQIFSPFFFPGPFFFLGK 
FPPFFGERFLFFSKSKFFFPGVFKRGFP 
FPPKNFFFF*RIF*KFWFSPPKPFFFFF 
FFF*DRASLCHPGWSTVAQ3*LT3C 


10983 


24884 


A 


11076 


279 


410 


VPCCVAGRDGAPGVICAHFPAAQKPWDGG 
FPGP*SAGVGPPPIPR 


10984 


24885 


A 


11077 


12 


386 


lAHLLLFSFYHKDTGTLYLLFGAGAGVL 
STALSLLIRAELGQPGILVGNDHIYiWI 
VTAHAFG IIFFIIIPII IGC YGN*LATL 
I IGAPDMAPPRINNISFGLLPTSLLLLL 
AYAIVDAGARTGW 


10985 


24886 


A 


11078 


412 


3 


LAILSLSLSLGFPYSLRYKNIKIHPINN 
PTMASKCSSKRKTRTSFTLNQNLEMLKL 
I E [<3TAKAKRLKVRP LVPVSQWNAEE K 
FLKEIKSAIPLNIQMIR*QTSLIADMEK 
VSVL*IKDPTSYNIPLSQS*IHSK 


10986 


24887 


A 


11079 


78 


443 


RHRLPSDTCLTAAVP *PCKGSKKKKKKK 
KKKKKKKKKKKKKKKKKKKGGGGFKKNS 
GGGQNFAGGKKIFFFFFGGVFKNPLWI F 
WKKTFFWGGKISGQLSQKKLSLWGEKKI 
FWGCGGETSF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 
ed end 
nucle- 

locatiOD 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10987 


24888 


A 


11080 


395 


2 


KRVPKKKPKKHSPPGGRKGGSPP*SPPF 
GGPDQPIF*VGGFDPPWLTPENPFFFKK 
KNN*PGKGGMPLILKSLGGWGKKKIFTP 
GGGI,PKN*KETPPPPPGGKKKIPFPKKK 
KKKKKRAAARDLELVPTRP 


10988 


24889 


A 


11081 


24 


182 


TVS TTKCFSLL I P YTORKDME * KI * 1 1 Q 
KKKKKKKKKKKKKKKKKKKKKKKKKEGG 
PP 


10989 


24890 


A 


11082 


173 


3 


NNVICyiLriPSVPKyKTTLNMRGSAFF 
FFFFFI F* DGVSLCHLGWSAVARSWLTA 


10990 


24891 




11083 


284 


391 


S VAHACNPGTLGG* GEQ ITRGWEFETS L 
TNMEKARL 


10991 


24892 


A 


11084 


257 


3 


I KKLREVNMLGWLYYMHEEARPVDYVHW 
RGPEYSSLTKTIRHVIVKGHQHY*KaSA 
GRGGSRL* SQHFGRPRRADYLKSGVEDQ 


10992 


24893 


A 


11085 


42 


171 


ARRMEAGIHRVSQEGLDLLTS *Sja3LGL 
PKCWEDRREPLRLA 


10993 


24894 


A 


11086 


176 


325 


FKES KKKKKKKKKKKKKKKKKKKHGGAL 
RGGALLKKFLGGPQISGGGKRFFFFF 














10994 


24895 


A 


11087 


67 


379 


KKKKKRKKPKTKKRTGPVRQNVRWGRSL 
SPGHAPAEGPQGGSTESPPLLVGRCLIL 
GLREPGVPPAAYGRFLHDYMNSIQGKLE 
KQRPQ*GLPS LTGAAELIKH 


10995 


24896 


A 


11088 


128 


326 


KKKSGGGVFFPI IGGQGHFFGGGRFFFF 
FFLGGFFFKLFFFF*EKFFFLGGNLVKH 
SSKKSSWVGW 


10996 


24897 


A 


11089 


109 


2 


KTHSRPGTVaHTCNLSTLGGQGEW IT* G 
QEFETSL 


10997 


24898 




11090 


52 


347 


DHSLLELQMKLYASAQAAITKCHRMGGL 
NNTHFLFLFIiSLFFFFLKQSIAIiPPRLK 
GRGP I LVN* KLPLRGKRDSPASPPQCVG 
ITGLAHPPRLILFF 


10998 


24899 


A 


11091 


386 


48 


IHLQPPLQICTLKFFTNKNPHLFLTGCP 
KYALI FLCFGALVKTGFHHVGQAGLQLV 
PLCPPRLGLPKCSD*RREPPHPAKKQFF 
I DRVMLCYPGWC PALGINP S S P VGLP KR 


10999 


24900 


A 


11092 


164 


3 


VYSLLLRCP FS * ALTAKF INRCTYIHVC 
MYMCEPYAYTCLYTSTHIHVCMYIY 


11000 


24901 


A 


11093 


148 


325 


CLKLQPFGLGWAHACNPSTYFISYQ*L 
SPV I PALWEAKTGGS PEVRS SRPARP 


11001 


24902 


A 


11094 


162 


3 


LLGLKRPIFFFWGGKFFFFFPFFFEIES 
PSVAQAGVQW* CNLGSLQPASGRV 


11002 


24903 


A 


11095 


259 


358 


LRPGTVADTCN*GQARWLIPVIPALWEA 
KAGGSLEVGSLRLAWP 


11003 


24904 


A 


11096 


254 


2 


SWGFIPMEERERENRYWDRQQGHAIVSN 
IVINAFLYRYTEREKPGVSLSPRVECSG 
M I VAHCNLS LVGS S HS PAAAS * VAGTTG 


11004 


24905 


A 


11097 


132 


3 


KKKKRQPRWAHACNPSTLGGCGGWIT* 
GQKPETSLVNMVKP 


11005 


24906 


A 


11098 


2 


455 


LNLEQLI EPLEVLWTAHLEALDPS SRTF 
VLLAPALL*LPWLQEAGAVQTVPLSRLF 
DIIAMLQAHRAIIQLAIDTYQEFEETYIP* 
DQKYSFLHDSQTYFCPSDSITTPYSMEE 
TQQKSNLELLRIYLLLIESWLEPVQFLM 
SMFGMHLEYDT 


11006 


24907 




11099 


2 


375 


QLPEPLMDLWTGDLVAMAPGSRTS LLLA 
FVLLWLP*LQEAGAGQTVPVSHLFDHAM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 

correspon 
last^mino 
residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PlienyIalanine, 
G=Glycine, H=Histidine, I=]soleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlirconine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=po$sibie 
nucleotide insertion 














LQAHRAHHLAIDTYQEFEETYI PKDQKY 
SLLHDSQTSF* FSDYI PTPCSMEETQQK 
S^3VELLEICLLV 


11007 


24908 


A 


11100 


1 


359 


LQLPKPLTVLGTAHLPAMCLAYATALIL 
GCALLCLA*VE*ARAAQTVPQSNF*DHA 
MLQAHRAHQLAIDT YHEFEETY I PQDQK 
YSFLHDYQTSFCLSYSIATPSNME5TQQ 
KSNLELL 


11008 


24909 


A 


11101 




370 


LPE PLTDLWTAHLVPMAPGS RTSLLLAF 
ALLCLPWLQQAGADQTVP *SWLFDRAML 
QAHRAHQLAIRTYQEFEETYIPKDQKDS 
FLHDSQTSFCF*DSIATPSNMGETPQKS 
NLELLPISLLL 


11009 


24910 


A 


11102 


2 


361 


AHLLAMAPGYRTYLLLAFALLCLPWLQE 
AGAVQTVP * S EL FDHAMLQAHRAHQLAI 
DTYQEFEETYIPKDQKYSFLHDSQTSFC 
FSDSITTPCNMEETQQKSNL*LIRICLL 
LNESWMEP 


11010 


24911 


A 


11103 


8 


366 


PLHALWTAHLAftRAPGSRTSLLLAIALL 
CLPWLQEA6A*QTVPISRLLDHaMLQJiH 
HAHQLPIDTYHEFEKTYIPKDHKHSFLH 
DSQTSFCF*DSIATPS1SIMDETQQKSNLE 

LLRIFLL 


11011 


24912 


A 


11104 


3 


341 


EPLRDLRTAHLVAMAP6SRTYMLLAFAL 
LCLPWL* EAGAAQTAPLFRLFDHAI1LQA 
HRAHQLYIDTYQEFEETYIPQDQKYSFL 
HDSQTSFCFSDSISTPCNMEETHQRSNL 




24913 


A-^ 


11105 


240 


329 


ENWPGAEAHTYYPSTLGGRGRQIT*GRG 
FE 


11013 


24914 


A 


IIIOS 


2 


316 


HEERERERERERESAIVEQIHTQGEGLT 
ELVFPYRRPAPSGLKFTFVYVQEHILGV 
GGNGLSPLQIGVILEDGRGLAHIRCGTR 
NKILRILKSQGRAPDLS*DLY 


11014 


24915 


A 


11107 


159 


1 


EDGHHSGLHSKQQKQPAWPSMVTHTCNP 

NTLGGRGGWIT*GQEFETSLTNMEKPCL 


11015 






11108 




355 


GMS LTGHSN Y 1 1 LJM L]M FLWPC V VAHTCN 
PMTLGGQGGRFT*GQEFEimiiAYRGK 


11016 


24917 


A 


11109 


145 


383 


EVNKDKGKYHVSQGIQSFSYVAYVYFRK 
LIYTMMTIVNSALLYTELVICQGRRVLV 
FDFGF*FWFPETESCSVIC3AGEQ*CDLG 


11017 


24918 


A 


11110 


199 


400 


TRA6FFPPFKKFLKGPGPGAPPFNLNFL 
GGRGGGFF* IPKKKIYPLLLPFLKVQKT 
PSKPPFFFFPFFLRQVLLCCPGWSAMAH 
SRLTRA 


11018 


24919 


A 


11111 


373 


177 


QFETSLGTMSGPSFYLKN*KSARCGHTC 
MRFQLLQRQSREDHLSSGVHRCSNP*LH 
RCTTTWVTK 


11019 


24920 


A 


11112 


204 


399 


NHFWDWYHFSSPFFFFFFEKNLCSVPR 
AEDQGMNLS *LNPLPPGLKQFFA* P SQE 
AGIIGPGHH 


11020 


24921 


A 


11113 


192 


16 


KKrEmiKR*RGYGETETPAHCGEACKLG 
* PLWKPV*QFLKKLKMKLP YDPTIPLPA 
IFS 


11021 


24922 


A 


11114 


113 


369 


AGGQGPNLFLQKPGPPGLKGPPPLPLPG 
GGNYGGGPPSRGIF*FFEKKGPG6VPKV 
GPNLRG*RDLPAPPSKaAHPL6NGFFSR 
N 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ion 
correspon 
ding to 
last amino 

rifdueof 
sequence 


Amino acid sequence (A=Alanine 
C=Cystein6, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, r=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11022 


24923 


A 


11115 


250 


387 


ILCDVEHSLWPGVEVHAYKPSTLGGQDG 
RIA*AQEPETSLNNIAEL 


11023 


24924 


A 


11116 


192 


1 


VCDLFWKPSPHALTRRRTSVFHLKR*FS 
WLGAVAHTCNPSTLGG*GFWRIT*AQEF 
ETSLGNMV 


11024 


24925 


A 


11117 


289 


202 


VFYCQ*KFIRKKKKKKKKKKKKKKKKKK 
KKLSKKI KKMDQQKLRSPTI K 


11025 


24926 


A 


11118 


412 


421 


RA*PTEQVTLGITEQSYSRGHINWRVHD 
LDVGS*HPDGAAAIKGSFVQRI.KGY 


11026 


24927 


A 


11119 


16 


472 


DRLKP SATHRFADAYHIGKANP * PLTGA 
LLTLLMTSGLTM*VHLHSITLLILGLLT 
NTLTIYQRWREVTRESTYQGQHTPPVQK 
GLRYGIILFITSEALFFAGFF*AFYHSS 
LAPTPQLGGHWPPTGITPLNPLEDPLIiN 
TSVLLASRVSIT 


11027 


24928 


A 


11120 


304 


346 


GLWHATGITP*PLTGSF*LLLMTSSLAM 
*FHYHSITLLILSLLSSTLTIYH*WREV 
TREST YQGHHTP PVQKGLRYG I ILF i TS 
EAVFLAEFC*AFYHSSLAPTPQL*GLWH 
ATGITPLMPL 


11028 


24929 


A 


11121 


174 


420 


FISALLKSAIQRNNAMFSTYYALTCTVG 
TAVSWSSVS*PQRNMMMS*VHAPAMAGR 
LVCHyiRVYREFEHVCKRAVHGQYP 


11029 


24930 


A 


11122 


299 


466 


IFIUIEICITFFFETRHFCH*AEVHWCS 
LGLLKPPPPGLRCDPSTSPSRVAGTRPS 


11030 


24931 


A 


11123 


138 




NPFNE EOJPPPKGPGPKKRFFFFFFEMES 
CSVAQARVQ*HDLGSLQP 


11031 


24932 


A 


11124 


241 


499 


NQRERS*HLYLLYNTVCLTHGLPILIIY 
SAYQLAIYDLLLRRDALARLQADLADRR 
RGLQPKKRFAFKTRRKDflASSTKVDAAP 
GI 


11032 


24933 


A 


11125 


252 


391 


PLSLDLQSlSIASLSHPrSPPLMFADR*LF 
STNHKNl *TLYLLFGRWR 


11033 


24934 


A 


11126 


45 


459 


AFLKNHPLET*SNHSFIDLPTLSNISA* 
GNFG*LLGACLILQITTGLFIjAMHYSPD 
ASTAFSSIAHITRDVNYG*IIRYLHflNG 
ASIFFICLFLHSGRGLSYGAFLYSQT*N 
IGIILLLATIATAFIGYVLP*GQISF 


11034 


24933 


A 


11127 


109 


301 


QDSLMRKKSKNRSPSHER*RSKCLERKR 
SGDKKKKKGGPPKKKIGGFPFGGQMELK 
PFSILGGF 


11035 


24936 


A 


11128 


322 


2 


SCHHVSSLGDRARPCLQKEKKKERKKVG 
DFNTPLS IMDGYQHWYQARKI 1 1 MDGTT 
K*KITRK*KIRNQ*DLTDVYRTLHPRVK 
YTFLS CAYGTFSRTHRI RGQAI 


11036 


24937 


A 


11129 


182 


2 


FLPRGFQPPPVWPGPLGFLFPPPSSRD 
PGFPSMPFFFFFF FETESCSVS * AGEQW 
RNLG 


11037 


24938 


A 


11130 


138 


1 


GWVFFLWFSFSFLFFFLR*GLSLSPTL 
ECSGAITAHCSLKLPGLS 


11038 


24939 


A 


11131 


304 


83 


WKKTDFPRKGRNSASRLPSDSNCNS3L3 
LQGLSLLYRFWTYHAVTGKGSQSRPQEQ 
VLGYHARKNSR*VHRVK 


11039 


24940 


A 


11132 


247 


225 


KNFWGGPPLRGGKKNFFFFFGGGYKK*L 
LLCLVNMYI*CV*YI*HKRII3DPKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
K 


11040 


24941 


A 


11133 


242 


3 


KKKKI FFPPRYFWAPP YFFFLGP PP FF F 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIetIiionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=UnI{nown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFVF* TQFHLS 
PRLECSGTI SAHYNLHLPGPSDC 


11041 


24942 


A 


11134 


361 


2 


ASQSAGITGVSaRTRPTHGF]<Dt*RKPA 
EPEVTIH*I*ITIiMSCNIESPEKVCaDL 
IRGAKEKNLKVKRPAGTPIKASEITTRK 
TPCEGSNTRDRFQMRIHKWLTDLHSPFD 
GRVGGRV 


11042 


24943 


A 


11135 


1 


370 


IRQTFFQKRE IVGI ILCYWNVGYySQIH 
♦SSRMLWIFHPyRy/ISVKVILVILVPH 
EKKKICKKKKKKKKKKKKKKB 


11043 


24944 


A 


11136 


273 


2 


GGRAGQQILFCRGIWLGPPRSCQCTFRS 
PPGLAL*PWECFSLPPALKIRFPSFLPF 
FE'EXEFSRLSPRLECNGVTVAHCNLRLP 
GSSNFP 


11044 


24945 




11137 


130 


354 


GQLNKLSGPyPEKVGyTLPVLINPIiAQP 
VIYSTIFAGTLITAl,SSH*FFT*VGIiEI 
NMLAFI PVLTKKKKKGGPL 


11045 


24946 


A 


11138 


366 


2 


LVEM6FIiRVRQNGLYIiLTS*SARLGLSK 
CWDYEREPPCPASDWVFILTSPLIHALD 
GKEHTHTHTHTHTHTGLGICQSSLGKQS 
GGWGWLSANRGQFSPFAVCLWSFLPEV 
PWTSALFT 


11046 


24947 


A 


11139 


228 


1 


CHAGRGLPAAVLIILFPPLLIPTSKyLI 
NNRLITTQQ*LIKLTSKQMIAI PNTK6R 
T*SLILVSLI IFKATTNLLG 


11047 


24948 


A 


11140 


157 


337 


HIPTPPHTTCPERPSIRHNSIYYLKRYC 
LLRIILSLLPLQTSP*PLTGALSLIJiMT 
SGLSM*SHLHSI?LLaLRLLTyTLTIYQ 
* WRDVSRHSTYQRHHTPPVQKGLRYGI I 
VF.TTSKDIVFSGLF*AFyHSRLAPTTQL 
♦CHWPPSGITPUSr 


11048 


24949 


A 


11141 


T88 


3 


I PTPGGPPPLPPQKVLFKPGNPIiAPPGV 
FFFFFETDSHSWRLECSGVI *AH*NIiH 
FRGSR 


11049 


24950 


A 


11142 


224 


3 


svsspcisnthkvdvrssallqestSsr 
evmtghfhyliiyet*lrslaqsarlac 
sgtisahcnlrlpgssn 


11050 


24951 


A 


11143 


262 


371 


SYCGPGWAHAyNPSTLGGRGRRIT*GQ 
EFETSLTN 


11051 


24952 


A 


11144 


244 


379 


TIGKQRCHAGIEEVSDSSQAPWLKPVIP 
AL*EAEAGGSLEPRSLR 


11052 


24953 


A 


11145 


1397 


1565 


DRLESLLEMHIP GVYPNQWNTNFYLF I Y 
FEAESHSVAQTGLQ*RHLGSLQLPPPQV 


11053 


24954 


A 


11146 


284 


1 


GRGRQGGDSSGKGEGNRAMD I TGGLVGK 
GQGAGEAPVEVWilKELRGFKVEGEGDMD 
KERWD* GL*GGRERERERERERERERE 
RERERERESLV 


11054 


24955 


A 


11147 


64 


286 


THALAWTRNQRIGSGHGAMCLYSHLLRK 
LRQNDHLTPGV*KCLEL*LHH*TPTWTT 
KPTAKAI FFNLKKKDSWD 


11055 


24956 


A 


11148 


230 


358 


VSVSLI I FIATTNLLGLLPYS FTPTTQL 
SINLSMAIPL*AGAV 


11056 


24957 


A 


11149 


81 


367 


QTLRTKMNKNWASLIDPTILGLPAAEL 
I I L FP PLL I PS CKDV IWNRLMTTQQ * L I 
ELTWEQMITIHNTKGRT*CLILECLIMV 
lATTKLLGLLP 


11057 


24958 


A 


11150 


257 


2 


SVCAVITKYLRLGCFADAADTQRPLPPA 
TVKLTMFFNIATHGEPLGCIF *LFVDKV 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V-possible 
nucleotide insertion 














PKTAENFHALSTGEK*FSyKGSCFHRIL 
P 


11058 


24959 


A 


11151 


257 




WRCGEKETLIRCWWECKLVQSLGRTVWR 
FVRELKLELPFNPAIPLLGIHPKEKK*L 
DPCTCMFIVALVTIAKMmiQPKCPSMDL 


11059 


24960 


A 


11152 


17 


367 


NHKDGKKTEEKKQSTSPPQKDCSSSPAM 
EQSWTENDFDELREEGFRRSNYELQEEI 
QTKGKEVKNFEK1!T*DECITRITNTENCL 
KELMELKAKflRELCEECRSLSSRCDQLE 

ERVSA 


11060 


24961 


A 


11153 


213 


2 


TFCHFFFFFETESGSVAQPGVRWCDLGS 
LQAPLPGFTPFSWRQRLQ*AEIVPLHSR 
LPCLHFKKKAGVFSC 


11061 


24962 


A 


11154 


1 


347 


GTSNHFPFLYTyQVHNSTDYPRNRCCLN 
PSLRFHTSSKPLPARPHIMTHQSHAYHI 
VKPSP * PLTGALLALLMTSGLAM*FHFH 
S I TI J , I LCT iLTNTLS I YQ* WRDVTRES T 
YOG 


11062 


24963 


A 


11155 


223 


3 


YS'nini:iKILRNHFILFF*DRVLLCLPGWS 
AVASSRLTANSASCLSLPSSWDYRRAPP 
HPANFCTDRFHHVGQVG 


11063 


24964 


A 


11156 


143 


1 


RGFLNQCPSQNFPFFPFETESCSVTQAG 
MKWYDLS«MQPPPPRFKLV 


11064 


24965 


A 


11157 


149 


355 


LQPLPPGFKRFSCLRLLTSWDYRKDVDS 
AIjLSNyVT*KYISKTLCRHQSPEIVREF 
LTAMKSHKLTKVG 


11065 


24966 


A 


11158 


156 


370 


RSr)GCQKIPRl]MLGQWSVLHHTVDEQ*G 
PNRYI KKCSASLI lEEMHI INTVRECLT 
PATMATVREKTTSVG 


11066 


24967 


A 


11159 


298 


4 


HFQWIMWRQLVKGHLQRQKNAAFTDSH 
THLCYLKCTYFCVCVCMCI CVFVCV*SR 
AL* *SLYLFLCIKAVKaGHSVSGLESQH 
YGRLRQftDHLRSGV 


11067 


24968 


A 


11160 


364 


285 


NSWGKIKILB3K*PPPPKKIFFPKYPQS 
FFLSPP* KKKTNFPHPEKYLSPPGI FLK 
APPPL IFFFFFFFFF?FFFFFFFWG 


11068 


24969 


A 


11161 


198 


406 


KSVYMTLKKKRGSFHERGDVPAICaiFFF 
* IHI YMCVYIHICVYIYIYTHIYIYTHT 
QIYIYIYIYIYIY 


11069 


24970 


A 


11162 


95 


376 


KriTKHNIARTHTPYTFCIMN'LEITLQG 
B PKLRPPKPDKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKRGGGGFKKKPGGGP 
KIRGGEKIFFFFLGGD 


11070 


24971 


A 


11163 


343 


391 


WR* LSSTKHNDILTLYLLFRARAGVLRS 
AVSLLILAEQGQPGNLLGDDHIYSAIDP 
AHAFAIMFFIVIPMI IGGFDN*LLAVRN 
GAPiJMAI PH INNI SC *RLPPS LLLLLAS 
AI VEAGAGTG* TAYP PLE 


11071 


24972 


A 


11164 


1 


158 


QPMTPNRGPLSP* PLTGALSALLKTSGL 
AM*FHFHSITLLI LGLLTNTLT I YQ * WR 
DVTRES 


11072 


24973 


A 


11165 


265 


456 


KLLTS LS FFNLLRETWQVGLrjE LS QAKY 
KKTPRLGTVAHVCNPSTLGGQGGWI T* A 
QEFETSLANMA 


11073 


24974 


A 


11166 


204 


458 


KRCLLPLPPDSWTQSSQCRREQIGRAIR 
KILTFETQNIKRLKTEKQLLKLLE*SGK 
ITAHCMLDLPGSSSPPTSASQSAGI TAG 
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nucleotide 
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SEQID 
NO: of 

sequence 


M 
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od 
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09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamlc Acid, F=Plienylalanine, 
G=Glyclne, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
1-1 hreonine, V=Valine, 
VV=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














S 


11074 


24975 


A 


11167 


256 


3 


KILFFKPGVLSPPWGGVNFFQLMPFQGG 
PPKKKRGPLSPPKKGPFFFFFFFF*ESR 
SLLPRLECSGAISTHCNLCRLGSSRTRG 


11075 


24976 


A 


11168 


1 


457 


HRTPSVRTPNCRGDPRVESTNHKDIGTL 
YLLFGA*AGVLSTALTLLIRAELGQPGN 
LLGNDHI YNVIGTAHAFVI IFFIVIPII 
IGGFGN*LVPLII6APDMAFPRIIS!NISC 
*LLPPSLLLLRASAIVEAGAGTG*TVYP 
PLAGNYSHPGAS 


11076 


24977 


A 


11169 


343 


428 


GR*LFSTNHKDIGLLYLLFGA*AGGLST 
ALSLLIRAELGQPGNLLGNDHIYNVIGT 
GHaj'VIli''FIEIPIIIGGFGN*LVPLII 

G IGEAGAGTG * TVY P PLAGN Y SHPGAYV 
DLT 


11077 


24978 




11170 


91 


2 


PRQDPSFFFFFF * T E S GS VAQ AGVQWCD 
L 


11078 
11079 


24979 
T498lj 


— 


11171 


219 


1 


IFKSKILCLPPPEPNFFFFFFFFFFFFF 
F FMRHGGACCS PS YLGG*GERMT*ASRV 
AGTTGVCHYAWLI FVFL 




11172 


292 


365 


RI*SLGQAQWLTPVIPTLWEAKVG 


11080 


24981 




11173 


251 


1 


ICDLQILTHGITTCVKMGLTTLFVSGIT 
FLIHHSLAIKFGV*KNPS*LGTVVHTCN 
LSS *GGRGGRIA'' AREFETCLGNIVRP 


1I08I 


24982 


A 


11174 


200 


407 


KI FETDVYVKCSNI LCLWAS I S FSNYFG 
GPFYLNL**FLSYFSFLETESHIVAPLE 

CSGMILAHCNLCL 














GTNFFLFGETGSGGVLQAGVQwPGLDLL 
ASSHSPASAS*VSGTTVDAA 


11083 


24984 


A 


11176 


140 


T 


PSTFPDWAMRRVSCLSKGDKGQV*WLT 
PVIPAIWSAKAGGSPESI 


11084 


24985 




11177 


210 


1 


TDEELLLMDEQRK* FIEMESTPDEDAW 
IVEMPMKDLEYYI1IWD*WAGLKRTDC 
SFERSSTVGKMLSN 


11085 


24986 


A 


11178 


133 


148 


STSTR*LFSTSRKDIGALHLLFGA*AGV 
LGTAVSLLIRAWLAQLGSNKEA 


11086 


24987 


A 


11179 


235 


1 


KFWKKDPLTKGGCPLKKKGGGGIFSKKK 
KNVFFPKKKFKGGKKKNFFFFKKKKKKK 
KKKKKKNKE*LFISFAHIAVHL 


11087 


24988 




11180 


210 


1 


QSLGQENI,QVFFPLFGP*PNYLDKGEKW 
IRVGKIWGHLFFFFETESRSVAQSPRLE 
CSGAiaAHCNLCLP 


11088 


24989 


A 


11181 


292 


348 


KKGALFFSPGG*KKNFPPGGKKKGPLFS 
KKKKNPP*KKPGEKEKTRFPPFLGGLGG 
GFPLAPGGKPPRNRVLAPPPNPGGKKKK 
KTPFSQKKKKKKKKP 


11089 


24990 


A 


11182 


168 


3 


KPEASMLKMDFFFF*EAGSHSVAHAGVa 
WYYCSSLQPPTPELSDPLAPSSKVGGIT 


11090 


24991 




11183 


87 


351 


KGIKWSLNIQRGVQPHLRSYNYSFFSFF 
S*DWVLLRLECSGTTTAHRSLNLPGSSD 
PPTSASQAAGTTGMHCHTRL1F*ETGFH 
HAVQ 


11091 


24992 


A 


11184 


55 


344 


TSLNLCFFFFFWGEKSPFLPPGGDEGPL 
FGQLDTRPWGIEKIFFAPPP*KGKRGVP 
PGGEEFBCNPPAPKGEPPLS *KK*NQPGQ 
GAP PCYP P FLGGGAKKI FSIPQGRVSN* 
PKRGPSSPPGGKKGDFSPQKKKKKQRFR 
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NO: of 
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sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 
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nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


C=Cysteine, D=Aspartlc Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iVI=IVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EV**PNSPSGAP 


11092 


24993 


A 


11185 


344 


190 


L VE TRFHRVS QDGLNHLLTL * S TRLGLP 
KCWDYRCEPPRPANTANSLNFPM 


11093 


24994 


A 


11186 


135 


3 


IFPKVFGGPFFFFFFF*ETESRSVAQVG 
VQWCNLGSLQAPPPGFT 


11094 


24995 


A 


11187 


305 


14 


IMLITSIRRSRLSSTAEKGILSHLRICG 
RVAKYLINNRLITTRE*LIKLTSKQMIT 
ILNTKGRT*SLILVSLSWYGYHLF*G* 
FD*SLLLY 


11093 


24996 


A 


HISS 


-m 


1 


LEQTFWMMFPPNKSIISIPSFGFLFKIS 
FPFLLFFF*DEVFLLI,SRI,ECNGTII,AH 
CNLCLPGSSD 


11096 


24997 


A 


11189 


^52 




QLCNTPLPKTNHAMLTSSNANTL I YERS 
AIAAASSPPILFPASKILIiNSRLITPHQ 
*LIKLASKQMIAIPNTKGRT*SLILVSL 
I ICFEVSLISHCWWISRLLMRYLEVGI 
NRGG 


11097 


24998 


A ■ 


11190 


270 


137 


CIINRLYVCVYIYIYIYIYI*IYFYMYV 
YNILCILGHRRESGVI 


11098 


24999 


A 


11191 


18 


342 


ACEKRKMSSKFALRATGNCYYSCTFCSI 




























KITV*K*SKTKKNIKGLGAR*ELSLA*L 
LGEPPIFAKGHPLMLKWVEICKRP 


11099 


25000 


A 


11192 


169 


3 


ERSGIIRVYQD*WQC[aAWSLA*KIIHPW 
LaAVAHTCNPSPSGGRHGWIT*GQEFK 


moo 


25001 


A 


11193 


407 


1 


KRACFRWLTLLISTLWEAKQEtrCUIAGV 
HDQPGQHGETPSPQKIKKSAWHGGPRL* 
EVRWED* LSQGG * GYSEPCSHHCTPAWA 
TERDPV+KQKRSIPWKELASERKDSGRC 
RWLVREMWPSCQ2iSVIQVRRIS3 


11101 


25002 


A 


11194 


110 


412 


VGGFRLLHSPAILMPLIWPSYFHRHSNT. 
EIKPISWPTVASKRASERKSCTSLTLNQ 
KLEMIKLSEEAT*KAEVGQKLGLLHQKV 
SQWHAKEEFLKEIESA 


11102 


25003 


A 


11195 


34 


155 


AWHEGMHHVSQDGLNLLTL* DACLGLPK 
SWDYRRKPPCPA 


11103 


25004 


A 


11196 


289 


453 


NQPGKSFMISSTFHFMMEFTLVHAGV*W 
LDLGLLQPPPPRFKGFSCLHLPSGWDR 


11104 


25005 


A 


11197 


177 


432 


KWEKQAQTLGGGNFPKGPKTRVGKNFEK 
LKGNPPFLCL*KRGRKGDTGPWKKPAPS 
GERGIPGKKGGKTGPPGFRGPMEKIKRG 

K 


11105 


25006 




11198 


139 


3 


AMVSLSPHLECSSMtMAHCSLGSLGSSD 
PPTSAP*VARITGMHHP 


11106 


25007 


A 


11199 


150 


492 


GDPLPGHHPV*EVRRPFARQLPRLRSEE 
PLRPVATPSGK*GASLSGSHPIQEGGGG 
GQPLPGHLPHPGWRSGVKPPPGQSARPG 
GEGHLLPATPTGK* GGPQSGQ PHLSGGR 
RE 


11107 


25008 




11200 


239 


64 


YQKYYYYYPF*BTESRSVAEVKSSGLIS 
AHC3SILHLPGSRLQLCLFPTSAVFGQBSH 
IT 


J 1108 


25009 


A 


11201 


31 


396 


I LTMRE I VH IQBGQ CGNQ I GAKFWE VI T 
DEHGMDPTGTYHGDSDLQLDRISVYYNE 
ATGGKYVPRAILVDL*PGTMDSVRSGPF 
GQIFRPDNFVFGQSGAGNNWAKCHYTKG 
AELGDSVLDV 


11109 


25010 


A 


11202 


85 


407 


VLLPHSEALEGADTiyiPHSYPALSAEQKK 
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residue of 
peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PhenylaIanine, 
G=GIyclne, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=posslble 
nucleotide insertion 














ELSDIAIiQIVSPGKGILAADECVGSMAK 
RUSTQIGVENTQENRRLYRQVLFNADDRA 
KKRIGGVIFFHETL*QKDDDGVT 


11110 


25011 


A 


11203 


302 


115 


THLDHVAIYLSIYLSIYLSIYHLSIYHL 
SIYLCRF*LID*DRVSLYFSGWLQMPGL 
KPSSCF 


11111 


25012 


A 


11204 


130 


348 


GNAYGGPCS VLVQRT* * I PALTFPSTCL 
DS KFQRDLVTLRTD FSNFTSNTVAE IQA 
LTSQGESGLDGALGWGQ 


11112 


25013 


A 


11205 


232 


62 


ISARNEGFNTIM*RTFFFFFFSSHCLSR 
LECGGMILAHCNLQLPGESLEPGRQSLQ 


11113 


25014 


A 


11206 


161 


2 


KKKGFIiLPLIEPFCIQEIQLRLGAVAHT 
YNPSTIiGGRGGWIT*GQEFKTSLAN 


11114 


25015 


A 


11207 


251 


3 


PNIILPKAPSPNTTLRIRFQIILNFRGAQQ 
TFGLQHKLTSKQMITIHNTKGRT*SLIL 
VSLI IFIATTNLLGLLPHSFTPTTQL 


11115 


25016 


A 


11208 


145 


287 


GFRLKL*INQKYSFLKKKKKKKKKKKKK ' 














^^^^^^^^^^^^^^"^^^ 


11116 


25017 


A 


11209 


436 


110 


ISGRGEQBCFKYFSPAPFFRGINPPPLGG 
E*NQNLKPP*KKIFLGSPKVPKKKGGPQ 
KILKKGPGGGGPPLFPPPLGGQGGGFPF 

S QGFKTPPGKKKKPPPKKKKKKKK 


11117 


25018 


A 


11210 I 217 


1 


DKW3VLHINYNILSSSLVHVLFKYLEN* 
K * KKIJL WPGMVAHAL'NTJMTLGGQGGR IV 
*AQEFETSMGNTVKC 


11118 


25019 


A 


11211 


146 


3 


QKVSHAQYRCLSWGYFFFILFLETRSRS 
IAQAGVQWRDLGSL*PPPA 


11119 


25020 


A 


11212 


187 


384 


GLICFYLTCFYFIFYFILFYFSLLCYVM 
LCYVLCYVMLCYVMVCWMLCYVI*DRV 
SFC3HPDWSAA 


11120 


25021 


A 


11213 


373 


482 


MIKLRHRLGTVRHTCNPSTLGGRGQWIA 
*GQEYKTS 


11121 


25022 


A 


11214 


64 


362 


ITGVSHFTQPLPFIGGLALSPKLECHGM 
I lAYCNFELPGSGDSCHLSLPSSWNYRY 
MPCMPS*LKKNF**RQGLTVLSRLEKDY 
LL*DFSALRSFPWGR 


11122 


25023 


A 


11215 


181 


368 


CDSAVPLLDIYPTEFKftDLKKIRIPMFI 
IALFAVAKR*KQPTYPSJDKWIKKLWAG 
CGGSRL 


11123 


25024 


A 


11216 


268 


369 


TTRPDMWHTYNSSTSGGQGGTIT*AQQ 


11124 


25025 


A 


11217 


90 


3 


IGQAQ*LTPlIPAIiWEPKSGGSPEVRSL 


11125 


25026 


A 


11219 


277 


272 


SLTAPaL*SQLLRRLRWEDHLSLGQSSF 
SEP*SCPCTPAWVTQ*DSVSKKKEKKKE 
KKRKEKKKVNQTPKATESPGQALIGSHS 
LQVQKRVYAHPIL 


11126 


25027 


A^ 


11220^' 


19 


343 


LELEVEPEDVSELLQSLDKNYLSCFLKR 
FLEMKYTPGEDWNTIKMTTEDLKYNTD 
LVDKAAAGLERINFNFEGSSSVGKMLSN 
S IAWYSEIFYERQSQFMWQT*LLA 


11127 


25028 


A 


11221 


266 


356 


FFEMESHTVA*AGVQWCDLGLLQSLPPG 


11128 


25029 


A 


11222 


105 


2 


RFVCS TI KVLRDLS SARSNPGHFI *TSN 
SRPRKS 


11129 


25030 


A 


11223 


145 


350 


RKGRQLLD*DLGWV*WLTPIIPILWEAE 


11130 


25031 


A 


11225 


155 


348 


PLFFFCFHRNTFY * KS VCKFPKKKKKKK 
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G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=iyiethionine, 
N=Asparagine, P=Proline, 
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nucleotide deletion, \=possibIe 
nucleotide insertion 














KKKKKKKKKKKKKKMEGGGGGANTKFLY 
GENRKPPLFFFF 


11131 


25032 


A 


11226 


271 


3 


NFL S VFNFTR I CLGMCFLS IFLLIFGGS 
LKYQLFLAQGFFFFLHYFFTSSIHSSLL 
Y*MLDFLDLFFLYLFFCNFNFSVFLFCI 
LSKFF 


11132 


25033 


A 


11227 


3 


413 


SPAEPHRSYTMWNVKVGVUGFGRIERLV 
TRAAFNS GKVD I VAI NDP FI DLWYMVYM 
FQYDSTHGKFHGTVKAENAKLVINGNPI 
TI FHERDSCKI KRGDAGAEYW* S TGVF 
TTMDNAGAHLHG * ATRGI I SAP S AD 


11133 


25034 


A 


11228 


48 


413 


VFVLVSCGFFFFFFWEKKICFFWPGGKK 
RGKIGGTGTFGPKGKGNPRGPPPKMKEK 
GGPPPPPNNFGFLKKKKFYQGGQGG*KT 
RNFGKPPAPNPPKGGDLGGGPPTQGQNG 
FLKKKKKQKK 


11134 


25035 


A 


11229 


172 


404 


EFENS CS PGWQRRGVMI PGVTVEDMNQQ 
EFIRALVALLKKSRKLKVPEWVDTTKLA 
KRKELTL*DEKWFYT*AftSTT 


11135 


25036 


A 


11230 


247 


^ 


QPKKKSVSKKKICKELPCBPAIPLPGIYP 
KEKKSVYKKnNLHLEVYCEAVFTIAKIW 
NQPKC*SGDK* IKKI *YIYTVDLRR 


11136 


25037 


A 


11231 


1 


372 


TETTVEVAWCELQDRKLTKSEEQRFKEE , 
AEMLKGLQHPWIVRFYDSWESTVKGkKc' 
IVLVTELMTSGTLKT *VHQYYKS * PRSM 
REFDFmFRSTSFTLWQPKKGTVLPTV 
TVLAYFLAHKVL 


11137 


25038 


A 


11232 


311 


464 


KKATPENLCKWPGTVAHACNPSTLEGED 
GWIT*GQEFETSLTNMVPP 


11138 


25039 


A 


11233 


64 


418 


VPVASRRRRRCGRVGGGKAMADLDKLNI 
DS I r QRLLEA*GAKPGiCKVQLHENE ITG 
LCLEPREIFLRQPIIiL*LVAPLKICGDI 
HGPYYDLLRLFEYRGLPPKSNYLFLEDY 
VDRRQH 


11139 


25040 


A 


11234 


233 


421 


PTTSNMRSRLLRGASS*KKTAGPQQRNL 
BPALPRRWGGRSADNPPSGSLRKSGKNM 
QKTPGTA 


11140 


25041 


A 


11235 


343 


1 


NWTHGLSHSSVGENPGAVGSGSCMSGIK 
LSEGLRSLLEAVCISRAAVIKCHSMVAS 
KNRHFFSPSSGG*KSEAKVWAGPCSL*R 
RVPSLPLPASGGSRHCLVCX3CVTPFSAF 
TW 


11141 


25042 


A 


11235 


122 


419 


RTPRGPKPNPQLQRKTFFLIGQRIKTYL 
GrFGKDLYFWGGKNWDTLLPKI*RSREN 
KNF*REKWVNLPPYPGCLKN6ARQQHFA 
HLLILRDTKTYMLAF 


11142 


25043 


A 


11237 




379 


AFNHLHAGHGLSSftAMKSLVLLLCLaQL 
WGCHSSPHGPWLIYEQPNCDDP*TEEAS 
LEAIDYIWQNLPWGYKHTLNHIDEVKVW 
PQ*PSGELFDIEIDTLETTCHVLDPTPV 
A*CNVRQLNEHAVQ 


11143 


25044 


A 


11238 


1 


397 


QTMTPNRGPLCPSKDLRSSHVISLPLHS 
ATHTRPTNQHTNHIPMivlARRSTRKHIRR 
APHTTCPKRPSIRDWPIYYLRWFFLRRI 
FLSLLPLQPSPYPPI RRALAPWRHHPAK 
S PRRPTP * PLTGALSALLKTSGLAM* FH 
FHSIALLILGLLTNTLTIYQ*WRDVALE 
STYEGHHTPPVQKGLRYGIILFITSEIF 
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peptide 
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C=Cysteine, Df=Aspartfc Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFAGFF*AFYHSSIjAPTPQLGGHWPPTG 
ITPLNPLEDPLLNTSVLLASGVSIT 


11144 


25045 


A 


11239 


267 


3 


FVSVRITPIYASAATWLS YVWVWASS PN 
VCPPGDRVALSPRVECSGTI SAVRNLRL 
PGSSDYPASA*PADGWAPPQYRCPPPRL 
ANF 


11145 


25046 


A 


11240 


391 


31 


VFFFFFFFFFFFFFFFFFFFFFFFPP PF 
LKNPFKPPPNFFFFCPKF*ILFLN*I *F 
* IKF* I* KI PGFFLKPPPFFPPFFFFPP 
GALFFFFFF*EGVLPCSPGWSAMSHLLS 
EEEKLTPHMSyN 


11146 


25047 


A 


11241 


371 


491 


KIFLCWARHDaHACNPSlFRGQGGHIT* 
GQEFKTSIiANMV" 


11147 


2504K 


A 


11242 


1 


221 


HRIDPLCRNSAHAFFFLSTGCHRVSQDG 
LDLLTS*SAHLGLPKCWDYRCKPPRPAL 
NCFFRTISQLLNLCIKD 


11148 


25049 


A 


11243 


382 


333 


TYKIKDFKTF*RWFHPNISRVEAEKIiFL 
SRGQRGDFIifiRPSESSPGGFTLSVR 


11149 


25050 


A 


11244 


169 


477 


TQKDEKESTTTTKNftENSKGQSAYSSPS 
DCNTSIiARAC2Iir*EEAEMDELTEVGFRRR 
VIMNFVKLKGHILTQCREAENHDKTLQK 
LLTSITSLEWNIKDLMELK 


11150 


25051 


A 


11245 


200 


462 


SLIYNELCTIHILYQYYVA*QRS*CFQQ 
LMSFY*RTLKNLKESLCKVALLYPSFLK 
WKRGWLGTVADT*KPSTLGGQGGRIA*A 
QEF 


11151 


25052 


A 


11245 


484 


301 


I FCEKKGL* FQLLRRVRWEGPFNPGNQG 
CS*LLSSPCPPGWITE*DPVSQKKKERK 
RNQVMELVSAQEKKNKPRSVGLQACAHS 
MLAFSTTPR 


11152 


25053 


A 


11247 


226 


458 


NTVAVDWGPRCFFSLLLSHAENCDMWFT 
KIELFIFTNHLLLKQDLEMLW*6QAHAC 
NPSTLGGRGGWIT*GQEFETS 


11153 


25054 


A 


11248 


304 


436 


SPSRCRK*KIG*AQWLTPVISTLWEAKV 
GRSLDPRNSRSAWTMY 


11154 


25055 


^ 


11249 


216 


451 


THKYLFYYLGYNPMLYIERETGSCSIA* 
AGVQWYSHGLLHP * PSGLKRSSQVAGVI 
CLAWFFFFFLKKWSLWLPRLG 


11155 


25056 


A 


11250 


260 


474 


DKYFLLYPLKALNWLGflHINI*KV"SKI 
MC* K*DKNMYNFFFSF *DGVSLLLPRLE 
CKGTILAHCNLCLPC 


11156 


25057 


A 


11252 


193 


496 


MTSCYSFYL* IKDASMVKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKaSSSQKAQDYS 
KASDYK 


11157 


25058 


A 


11253 


348 


478 


DLGWLKLGTVAHTYirSSTSGGQGRRIT* 
GQKFKTSLANWKPC 


11158 


25059 


A 


11254 


174 


475 


MHHWVLILQDIDDNT*NCNLHFSLCSQL 
KS I QKEKKKKKKKKKKKKKKKKKKKST 


11159 


25060 


A 


11255 


292 


48 


LPLTCHRPAVSHTHXiAFSNPTAGITIF * 
TRSSSWWAGTFriFKIVTFFLFFFF*ER 
VSLKLTRLERSGAIPAHCHLHLPEF 


11160 


25061 


A 


11256 


347 


16 


TQ S ADWC I YNP P ARHKSSP S PHPTQ * P S 
WLHLVDPAPGPWVELPTSPAPSACTPQP 
LGSRWDQPPQSKGQCPSGRLGPQGSPPQ 
VGLGKAGWRSQALPVGLTRSI ESHHW 


11161 


25062 


A 


11257 


238 


590 


APGAYIFPQQHNPTDSQVIFSLTASHLL 
ICSPLCISVLHFNGMNIDDENRNAHVHA 
RACAHAHTHTHTHTHTHTLYLMNYRETL 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
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lyiLLKTRGRATGSHARCI FS * SLLQACP 


11162 


25053 


A 


11258 


310 


461 


LSFGDVLQLESDRPGAVAETCGPSTLGG 
*GGWrT*GREFETSLAMILKPR 


11163 


25064 


A 


11259 


461 


174 


MGFHRVSQDGLDLL T * S SAHLGL PKCWD 
Y*CEPLRLAPVGKLFKCVGYKLVLCHGS 
GYMSMVIKIRMPGLIWSIjEGGSAAWHR 
YWSIWEISFGC 


11164 


25065 


A 


11260 


343 


472 


LLRRIiRQKDCLSSGGQGCSEPRLDYCTP 
AWVTK*DPVSHKQE 


11165 


25066 




11261 


229 


3 


VKMKLQ* VPVKVKI *KYTPLLGIYLKEL 
KAGFGRNTCTLMFIAALFTIANRWMQPR 
LP*IGECV]!qiJM*YIHTMES 


11166 


25067 


A 


11263 


163 


399 


KLLPGTRNUIftMEHPIFGWLRNPHaTAQ 
G*HPLSQSSLAIiHGRADHICYPELCTSS 
SSCIIAGYPN*EGMFASQHHRVH 


11167 


25068 


A 


11264 


316 


427 


TNKICHII*IINTTTHSLIISIIPLLFFNQ 
INHNLFSCS 


11168 


25069 


A 


11265 


184 


418 


KTPPGFFEKNPFFWGGKKGPPPPPKNLP 
LGGKKKFLRGKGGKTFFFFFF* KIGSSS 
SPPPPPGGEQAPPPPPKSPPHE 


11169 


25070 




11266 


219 


13 


KFFFSLEHQNEtteeOVPLFPPPKEGFLP 

kkpkqgfitpplkqkikppppggysgpp 

RVL*KiGPPHIFYK 


11170 


25071 


A 


11267 


3 


173 


SHHARPETGFHRVSQDGLDLLTS * SACL 
GLP KCWDYRREP PRQARKLYCLGMRTNE 


11171 


25072 


A 


11268 


157 


2 


PVniTHVKTTVGLGMVflHACMPSTLGGR 
GGRIT*ARGSRFIiLPPHWTSELR 


11172 


25073 


A 


11269 


291 


468 


ISEGKRSLFVILHFILFFETGSCSVAQA 
GVQWCDLDSL*PHIPGPK*SSHLGLHTG 

ACH 


11173 


25074 


A 


11270 


29 


478 


GWNPYNQQDIGTLYL*SGAEAGVLGTAL 
CLL I RAELGQ PGDLLGNDHI ynvivtah 
AFAIIFFIVTPIIIGGSGN*LVPLIIGA 
PDMAVPRINNIS F*LLPTSLIiLLLASAI 
AEAGA* TG* TVYP PLAGNYSHPGACVDL 
TIFSLHLAGV 


11174 


25075 


A 


11271 


61 


474 


P YNQGGNDAHAAWPHLTKWRSQGTTE I c 
REP EGAWQR P S Q PRLHKGRS PGQLHALS 
QNLPSYSGGSHLLSAYYVPDTISKHFIN 
TIGWLGTVAHACNPSTL*GEDRWIY*AQ 
EFKTSLANVL 


11175 


25076 


A 


11272 


277 


478 


KIYQPVKIHGPRPGAVAHTCNPSTLGDR 
DGWIT*GQEFETSLANMNSPSL 


11176 


25077 


A 


11273 


350 


11 


KEKTAGQARWLTPVI PALWEAEAGAS pe 
VRSSRPA*PPP 


11177 


25078 


A 


11274 


138 


3 


NYINLYAWLGAVAHACNPSTLGGRGGWI 
A*AQEFKTSLGNMAKP 


11178 


25079 


A 


11275 


359 


219 


FVIKMGFPQVGRDGFNFLAS *saplglp 
KWWDYRH* PPRLALFFFF 


11179 


25080 


A 


11276 


262 


3 


FCMSLVHLFSHSQDNYFTSFSL*IFNIP 
SGPFLFAFQDTLQ*P*DFIYFYFETESP 

s vtrlecsgaisahcnlrllvs sns pas 

AS 


11180 


25081 




11277 


285 


1 


vkkgdsvskifrkkkkrkdkirlhfmit 

TVNNLGI SGNLLNMIKEI YKNPTANI I L 
NGERLKAFSL*SGTRMSFLTPTFF*ILV 
DTQQVYIFMSH 


11181 


25082 


A 


11278 


160 


3 


TDKEVPLINAQRKWFLEMES IPGEDAMN 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 

last amino 
acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GlutaiTiic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, - 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IV'MTTKELQYYINLVEKVIjARPE 


11182 


25083 


A 


11279 


334 


488 


PLNTCGY*NLN* *NFQPGSVAPACNPST 
LGGQDRWIT*GQEFETSLaKMVK 


11183 


25084 


A 


11280 


527 


356 


FFVFLVETAFHHVAQDSLDLLTS *SSRL 
NLPKCWDYRHEPQRQAKKVGEFSETRS I 
Q 


11184 


25085 


A 


11281 


486 


248 


VEQLFSLLLPRLECNGVISAHCNLRLPG 
SCDSSASAS * VARITGASGSQAWLQVQ 
CLQPVQPGELLEVDLFQLWLQR 


11185 


25086 


A 


11282 


38 


157 


STGLHHVSQDGVDLLTS*STLPKCWDYR 
HDPLRPASQRL 


11186 


25087 


A 


11283 


474 


339 


FVFLVQMGFHHVGQaGLELLTS*SACLG 
LPKCWD*RHEPTDPAK 


11187 


25088 


A 


11284 


2 


462 


LETTSTARWU SVIALRHHNS CPYKMAWL 
IDKCVCSD YS KDWP FP I FLPLtRP P YCL 
RHNNMETRSHJNPPWPSKCSSEKTNEEG 
MLKVEI G* KLCLLCQTFS KDMNAEKKFL 
KEIKSATPMNTSMIRKQNRLIHDMEKVL 
VIRIEYQASHNIL 


11188 


25089 


A 


11285 


65 


280 


SRAVEFNLLTT*SACLGLPKCW3YRCEP 
PHPASP*FSKLLSSNLKTYLSSL*DSHS 
GF YS FVCSLGL I IL I 


11189 


25090 


A 


11286 


244 


488 


MRLGRVPSVLFCYVRMQREGIILEAENK 
PSPDIESVGALILDFTASRTIRNKILLF 
IIFPG*GILDSPNVP 


11190 


25091 


A 


11287 


345 


445 


KRPGTVAHACNPHTLGG*GRWIT*GQEF 
WTILA 


11191 


25092 


A 


11288 


223 


500 


SPGTRPKVLECQSTLdRQAILLLGEGWL 
MGQSHRLGVQSQLCLCVLWDLTCSLSCS 
FQIIKLR*RPGVMVHTCNPSTLGGCGGR 
lAQGQEFK 


11192 


25093 


A 


11289 


157 


22 


LSCQEVKGEIGEAEICWAQWLPPllPAL 
WGAKVG*SPEVRSSKPA 


11193 


25094 


A 


11290 


305 


497 


KLVNCIYFKGVSFMVRELYCNKrV*WLG 
TVAH I YNPS ALGGACGWI T*GQEFETSL 
TNMAKPRL 


11194 


25095 


A 


11291 


478 


1145 


QHVQACPERPQMMGTLERSRAVAS KIGH 
SYSLDSQPAEAVGKPWPQQACTRVTELT 
EATGKLIRTSHIGKPHPQSFQPARATQK 
LRPASQQGVQMKTQGGASNPALQIGSHP 
MCKSSQFKSDQSNPSTVKHSQPKPFHSV 
PSQPKSSQTKSCQSQPSQTKPSPCKSTQ 
PKPSQPWPPQSKPSQPRPPQPKSSSTNP 
SQAKAHHSKRGQKRGGKH*R2USISRDL 


11195 


25096 


A 


11292 


352 


468 


TFPGQHGEaPSLIiKIQKLAGHGDACL*S 

RLSPR 


11196 


25097 


A 


11293 


295 


24 


ENIQGSKKNERQHMHNGVGTWNSNPGFV 
STWVYRNCLACLDISFSFIKWVKSHKDT 
SMGMFTAALFTIAKT*NQSTCPSMVDWN 

STTRTS 


11197 


25098 


A 


11294 


274 


429 


ECEPVQTLWKIRKFLKKLKIELPSDPAI 
PLKGINLKEVKSVP*SDTYTPMCI 


11198 


25099 


A 


11295 


168 


469 


PSLGNKSETPS PKKKKNLFLGFPYFREK 
LVGILLKTVLQKGAGGVFFGPKNPPEKN 
PDPPLDSPPKTFRRLGAFVKTFPKGLKA 
AWFPALV* PQGQMGG 


11199 


25100 


A 


11296 


37 


348 


IKIFNKTLKYFKKKKTIKKKKKKGGGPL 
KRPRGGPRIYPDGARKNLPHIGTAKKHW 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
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Q=Glutaniine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tInknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGYWEKNPHGGGGNTGPTPPQKTRPLGG 
KNMLKRVQGEKPRPRLGGGKTRSQLFSC 
HA* P * VQGE KPRPRLGGGK 


11200 


25101 


A 


11297 


93 


388 

^^543 


DEYASDNCCYFPGSSHDSQVKQIGSQFT 
TQIQAHLNLaKPRRAI.*NHS*PMEMRSK 
CLLIiAYKKKKKKKKKKKKKKKKKKKKKK 

PKYLSSRGKDHF 


11201 


25102 


A 


11298 


61 




YYIHRT'^VFMCMN*G^,Kr:WVDK*TTnLC 
LCKKKKKKKKKKKKKKICKKKKKKKKKKK 
KKKKKKKKPQGGGA 


11202 


25103 


A 


11299 


250 


341 


IYMSLXLGLQNL*EIHLTHKDSHKLQVK 
GWKNIFHANVNQKQTGVAILISDNIDIK 
ATTVKKKKKHENPTILNIYVPNTGAPKF 
MKHLLLNLKNERDGTIVLG 


11203 


25104 


A - 


11300 


3 


285 


DAWERKKRSQMNQKTTNKVAGVSPYLS I 
MAWNVNELNFLIKRHRVa*CIKKTKSMI 
CYLQETNFIKDTHRLKIRGWKKIFHPNR 
N*KLAGWy 


11204 


25103 




11301 


175 


407 


AENNPDYLQLKA*TSRGTFMP*ILIiRVK 
RGSNSRlilNRCWQGWRGTGiyiLLHC*QECK 
LVQPLWKTVWRILPDLESEIL 


11205 


25106 


A 


11302 


75 


280 


YH]:jILVLyPHt'PlCNRVC*lCKKKKKKKKK 
KKKKKKKKKKKKKKKRGGPLKKNLLC3GP 
FFPGGKKKNFFF 


11206 


25107 


A 


11303 


299 


3 


IHASFLFRGCITLAFTVRCSHRSITSSE 
TSDLTHYYYYYYYYF*LftSQSVAi:iAGVQ 
WDQLGS1IHPPTSAS*VAETAGASHHARL 
IPNFFVEMEVSLYC 


11207 


25108 


A 


11304 


137 


323 


FFTVNGRHHKEKKSNKDQIiNLGLHDPLD 
NPLLLF*KyLGREKKKKKKKKKKKKKKK 
KQKKKK 


11208 


25109 


A 


11305 


64 


481 


RPTRPHVYHIVKP IP* PLTGALSALLMT 
CGLSM*FHFHSITLL1LGLLTNTLT1YQ 
*WRDETRESTYQGHHTPPVQKGLRYGI I 
LFITSEDLFFAGFF*AFYHSSLTPTPQL 
GGHWPPTGITPLNPLEVPLLNTSVLLA 


11209 


25110 


A 


11306 


142 


330 


FFTVWGRHRKEKKSNKDQLNLGIiHDFLD 
NPLLLF* KYLGREKKKKKKKKKWKKKKK 
KKKKKKN 


11210 


25111 




11307 


259 


440 


RPWIRDNPIYYLKSCFLRTIFLSLLPLQ 
PRPYPPTRRALSPNRHHPAKSPRSP*PL 
TGALSALLMTSDLTMGLHFHYITLLILG 
LLTNTLTIYQ*WRnATRESTYQGHHTPP 
VHKGLGYGI ILFITSKWFFARFF*AFY 
HSSLDPTPQLGGHWPPTGITPLNPLEVP 
LLNTVRL 


11211 


25112 


A 


11308 


433 


3 


FPPPDIRHSRSLGFPPSFGRAGFMFAF 
SKGQEPALPGGFYYPSPKKKNLAFCSPP 
LFYWGKFGSKFSLGVGPriFFP**SPKSF 
CQFLENKGFFPEKSFFKFFPPAGGCVFF 
FSEGGFPFCLFLFFFDRVWLCHPGWISV 
ARS 


11212 


25113 


A 


11309 


2 


423 


GRVGDR*LFYTNHMDIGTLYLLFGA*SG 
VLGTALSLLIHAELGQPGNLLGNDHIYN 
GIVTDHAFDIIFFIAIPIIIGGFGN*LV 
PLI IGAPDMAVARINNISV*LLPPSIjLL 
LLAYAIVEAGAGTG*TVYPALAGNYSHP 


11213 


25114 


A 


11310 


234 


1 


tffgkfpfclkfkpaqrggentpgpfpf 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tInknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LVFFFFEMESRSVAQTGVQWRDLLGSSN 
SPTSAS*VAGTTGACHHARLIF 


11214 


25115 


A 


11311 


406 


211 


LSLFLHSLISTHGARPTDPVGLVPTQDG 
LDLLTS * SAHIGLPKCWDYREAPPCPAC 
GFDLHLPND 


11215 


25116 


A 


11312 


379 


398 


KT *NFTVRGS IPLLNNIPMASMTFSKKK 

KKKKKKKRGGPLKKNPGGAKFWGGRKRK 

IFSLKGGEKKNPGGILEKKPFFGGGKKG 
ENP P KiajKGLRE KKKF * RGKGGKKPQKG 


11216 


25117 


A 


11313 


390 


489 


RYRLGWAHftCNPS TLGSRGRWI I *GQE 
FETSV 


11217 


25118 


A 


11314 


286 


434 


ILWYMNYISIKLLKIIIYKLGAVAHTCN 
PSTLGGRGGWIP*GWEFETSL 


11218 


25119 


A 


11315 


322 


3 


FTF^GCRQFQVF* IVTKNDRIIFSSi'TK 
NLLKI YLGLY* FYYVPLLRTKSLKFCTGS 
FESNLSFK*VFLWSGTVSHTairPSTriGG 
RRGWIA*GQELE'rSLTlSIMERPR 


11219 


25120 


A 


11316 


251 


389 


TFFFFEREFHFVPQVGVQGGSLGSLQAP 
PPGSHSLA*ASPRCWDGG 


11220 


25121 


A 


11317 


313 


80 


El?lCPRCLSPGFliD*PC3QHeETPSLQKL 
QKSAGCGGAHIiRSQLIiRELGDRVRHCLK 
KRIYLSLEKTSLVLSMNRFQI I 


11221 


25122 


A 


11318 


145 


410 


APKTLKVALTSAAAARESICQLGVLDEIi 
VIFLELNRSTDORPGAVMETCNPRTLGG 
HGGWIT*AQEFKTNIiftNMVKTC 


11222 


25123 


A 


11319 


393 


3 


NPFKIIFTLSPPLPLIFTPPFPPFFKKI 
PPKPLPFGVFHSQTCPQISFPKNPYPFL 
FWFSNPFFLLFSPSRKGPPPFSRPFPFK 
VGNPLFFLFFFFFFFFFG*EGVSLLLPR 

LECNGTl SAHYYLRPRV 


11223 


25124 




11320 


135 


1 


HRDVKLEYAIRKMFVRPGVAAHACNPST 
LGSRGGWIT*GQEFETS 


11224 


25125 


A 


11321 


282 


2 


NSESVLIKEKGDEEEKVLSFLLFSKRKV 
LESKTFPTLHSGCKKRSYFLKITTSSFL 
*PRVECSGVIIDHCKLKLLGSGDPPTSA 
FPVARTTGM 


11225 


25126 


A- 


11322 


149 


30 


WGSGGQVRWLTPI IPALWKAEAGGSPEV 
RSS*PVWPCLY 


11226 


25127 




11323 


316 


399 


S TLGGRGAQ I T * GRE FETSIANMPNP I S 


11227 


25128 


A 


11324 


223 


397 


IIKAYQ*DVNREQIKTIIIYFQNKLKCI 
KKM I WPGAWHVYN P STLGG * GGW I T * G 
QE 


11228 


.25129 


A 


11325 


3 


396 


NYS TAAL FiRGHI WGWRWGTL YYGS S P I 
YCRGLNSTLGLYPMASSNPHPRKLCKP 
KIYTHIAKCPIGHK1IYPYANTPIII*HH 
THTHTHTHTL*DS LLWPGWAHTCNHNT 
LGSWGRQIT*VQEFKTSLA 


11229 


25130 


A 


11326 


113 


442 


KRY IQ IRAHSEVLCGHELWGTLHNPVQT 
IQTWEPFPYLPLCSRVTSGKSHHRFVPL 
LFHL*NRNWSHHHHHHHHHHHHHRNPC* 
ECCRAHSDSYAVWSDKVSQEPVAAM 


11230 


25131 


A 


11327 


145 


3 


ELDIKPPNPFFCHTGVQ*CDLGSPQLLP 
PGPK*FCLSFPSSWDYRCA 


11231 


25132 


A 


11328 


189 


1 


KTNLLLIHSKLRLHLQSSVFQC*SKSEK 
SNVCT*CAFSCI.PKLFWLGKVAHACHPS 
TLGGR6G 


11232 


25133 


A 


11329 


193 


405 


ID*QTLTKKKKKKKKKKKKKKKKKKKKK 
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C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
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T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AYTSKKKT 


11233 


25134 


A 


11330 


209 


11 


GaKRGTPPTINNIFFNFFF*EKGSCCSS 
RAC* CNGE I * AHCPLRLPGS SNS PAAAS 
RVQETATTPS 


11234 


25135 


A 


11331 


38 


390 


VDQSTVCKMGQSAGCGWGSDKGIKAGCL 
K*QRQKKKKKKKKKKKKKKKKKKKGGGG 
LKKKLFLGPGGEKFFFFFWGPFFFFGGR 
FLKRGC3GKKPGYKKNKISGHPPPFLGGG 
KKKKR 


11235 


25136 


A 


11332 


38 


326 


VDQSTVCKMGQSAGCXSWGSDKSIKAGCIi 














FIKKIFFGPGGEKIFFFFWGGFFFFGGG 
FKKRGGEEKNLF 


11236 


25137 


A 


11333 


1 


234 


RTRGKNWMRFIKEGWVPTGRMGGSFIKH 














PHDKIKDLFDPV*NKKQNISIP 


11237 


25138 


A 


11334 


244 


398 


INHLYSNKNDLIFLLVFYLFFFP*RRPG 
EPPLIKGWLPYLGGVIiNLRKDPL 


11238 


25139 


A 


11335 


174 




HQVYPyVKQEHYRQDSCQCYIKYMIWWP 
DAVAHGCNPGTLGAQGGWII*GQKFERT 
RG 


11239 


25140 


A 


11336 


148 


411 


LFFFPHCYLLNAPSILYUSrEPKWyNLH 
LCVSAVCLLVPVHSPHRTCLY*KKKKKK 














KLLP 


11240 


25141 


A 


11337 


237 


453 


GQRPLLSPRERGSFFFLLETEFCFAPQA 
EGQGHNLG*LQLPLWG*SNSLVSSSQEV 
GITGAGHHPRLFFVF 


11241 


25142 


A 


11338 


65 


347 


VCRVDDFVPEARTRFFKS I * EAWNNKNI 
KPLLSTFSQVPGSENEKKCTLDQAFIGI 
LEEEIINHSSCENVLAIISLAIGGVTEG 
KYGSVLFCLK 


11242 


25143 


A 


11339 


158 


393 


VPCTHSEGIRKGRKCLPKTRENAREERD 
KKIPPGLC*AGKLFPQEKSSTWPGKVVH 
AWWPSTLEGRGGWIT*GLEFKTT 


11243 


25144 


A 


11340 


117 


3 


GIRPGAVAHVCNPSPLGG*GRWISRDWE 
L*TSLAWIR 


11244 


25145 


A 


11341 


292 


60 


SGQGLRLRGAFRGKVQAKEWMPIPKLGC 
LVKDMKIKSLEEIYFFSPPIK*FEIIDF 
FLGASFKDEVFFLFVCLFVCF 


11245 


25146 


A 


11342 


149 


30 


TQIFPPPPPPGKKKKFFFPKKKKKKKKK 
ERKKDRKNKQK*KGPGVGVSPLFPPPLG 
GQGGGFPQGLGFKTPIjAPKGKPWFP*KT 
QKFPRRGGGPPKIPLFGGGGEKKPFSPG 
GQGSNKPKFFPPPPPRGKKKNFFSPKKK 
KKKKRKKERKTEKTNKK 


11246 


25147 


A 


11343 


120 


504 


GVHDRCLWDDMTRRPPAAAATACPAGRP 
PPRREEEHSQLLLISFQGFRWD*DQDVN 
TPNLDHLAREGVKAKYLMPPLVTMTSPS 
HFTAITGKRHSAHFTRCPSNPQRPSFPV 
IRSKSSVSSREVEVA 


11247 


25148 


A 


11344 


213 


399 


ALVIHCLTTTIOVSGLSGSKYLMRQGMV 
AHGCHPNTLGGRGGWIT*SQESETSLAN 
TVKPCL 


11248 


25149 


A 


11345 


308 


3 


NPTPRSGDLWVEEGSPGAGNWGCLSEQT 
LRAI IKATTSYSFYCYFFETGSHSVAWA 
RVQWYNPGCS I CGSSHLPT SAS * GAGT I 
GMHHHAWLI FLFFVENE 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PI)enylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutainine,R=Arginine, S=Serine, 
T^^TIireonine, V=Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=po$$ible 
nucleotide insertion 


11249 


25150 


A 


11346 


189 


433 


GHCRAGTSLTLTHLGGSAMELI IHLGLC 
LGRMKFIHTTCI ILFLFFLSATQAGGQG 
GDLS* LQPPPLGFKQFSCFSLPRSWD 


11250 


25151 


A 


11347 


257 


3 


KWGELCFLT I KEERMGFPLAKKRKIKGV 
F*FGLFFFFETESHPVAQAHCNLSLLDS 
SYSPASAS*VAGITGACHHNQLIFIFYE 


11251 


25152 


A 


11348 


159 


434 


LSPSCRDVGTYSARAHPPHTHTRVRAHT 
HACTHVRAHTHTHTTAHGSHLTKEQPGL 
CTLPGSSW*PGLGGRAIATAQAWVHCAG 
IITAAHPP 


11252 


25153 


A 


11349 


240 


406 


AFKKLAKNWSMGVNRSLSRPGWAHAWir 
PSTLGGRGRQIT*GQEFETSLftlIMVKP 


11253 


25154 


A 


11350 


163 


360 


PQLLQLMKFLQN*LIiERLRPQISrGLNLEG 
LACSESRSRPCaPVWRTERDSVSSPQPR 
PGAGGRPSFF 


11254 


25155 


A 


11351 


145 


406 


PLNSGAKPTLPPWSPRRPELNAPPPPPG 
EKEFFFFLGRGFLFAPQEEAGGENIV*G 
KPPPPG*REFFAPPPGGDGNKGAPPPPP 
LFL 


11255 


25156 


A 


11352 


122 


337 


ILWVLLLVRESLKFFFFLLLLPPLFFET 
DVA*AAVQ*SDHSSRQP*PPGLRGSYTS 
AFRIAGAICSl CBiHLY 


11256 


25157 


A 


11353 


325 


416 


KCVYIMGWA*WLTPVIPALWESfiAGRSL 
EV 


11257 


25158 


A 


11354 


260 


415 


LEELQNTACQKLEPFLSRTETKQGCLLS 
PRLFNIVLEVLa*AVRQEKEIKGI 


11258 


25159 


A 


11355 


160 


2 


ISEISTKRNNYFLKNWLGWAHACNPST 
*HG*GGQTTRGQDFQTSLAMMVKP 


11259 


25160 


A 


11356 


160 


3 


ENLKLYHGS PFLKI I WLGMWHTCNPST 
LGGQGGRIP*AQEFKTSLGNVRRP 


11260 


25161 


A 


11357 


235 


408 


GQPLPAALFFYQWHLHFKELLAHPWGKT 
VWRFL*PLKKEGPHDPAIPLLDIYPKNK 
KW 


11261 


25162 


A 


11358 


11 


393 


QTERTPTPKPHLYVSFPPNSQDPQLQGG 
KLTNRKKTHTKTPSVCHHHQRPKVDKTT 
KIGKKQSRKTGNSKNQSASPPPKECSSS 
PATEQSWTENDFDELREEGFR* SNYTLL 
KEE VRTHGKEVKNL E 


11262 


25163 


A 


11359 


283 


473 


FMVSVSIKVYFFIFIFLFFEMESCSVTQ 
AGVQWHDCLSFLC*VLASSPGNSSPVR 


11263 


25164 


A 


11360 


266 


407 


LEIHHSKTL*FWLGIVAEACNPSTLGG* 
GRWIT*GQEAEIILANIVK 


11264 


25165 


A 


11361 


93 


1 


DVLSILLPRLECNGAISAH*NFRLPGSS 
NSS 


11265 


25166 


A 


11362 


131 


377 


MEEMIKGRGLSKPPALSSYLTLKSS*LK 
RVRARRRLFLKSRFSLDLQVAGFFVCLF 
VFWDRVSLCHPGWSVWQSRLTAftLT 


11266 


25167 


A 


11363 


212 


392 


QAEECAPLFIGVKEPPBaSILTHWLGTVAH 
TSNPSTLGG*GGQITWGQEFETSLMnyiR 

KPCV 


11267 


25168 


A 


11364 


141 


3 


NELWLNHQNVLEWLGTVAYTCNPSTVGR 
WIT*V*EFETSLGNMVK 


11268 


25169 


A 


11365 


215 


384 


ANFNIRPEISLASKCPSERKSRTLLPLN 
QKLEMI *LSEEATSKAKTG*KLGLLHQI 


11269 


25170 


A 


11366 


48 


373 


EPWTRKEVLKISSSLXIHCYLSILCLIL 
MPTLECTAAIMAYSNLELPGTSCPPTCA 
S*VAGTTGACHRAPTVLKIFLRDRIFFF 
FFTjaVTYPWCHTPADPAIFKAWAH 
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sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=AspartitAcid, 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginiiie, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, V=po5sible 
nucleotide insertion 


11270 


25171 


A 


11367 


310 


2 


GLICLNRENAYFFVIRQISMSCLEKEKP 
AHYSSPYAKSVTRGEKRRQRFFLSRAPP 
I PFLFFI * IV*AESRSV2iQAGGQWIjHFS 
SLQTPP*RETSDSPASAS 


11271 


25172 


A 


11368 


303 


50 


MFALTNRGVQGSPVIFLC3VFLISKMGFT 
CFKGFSRFKVLVFFLGVFPSFFFFLRAR 
VLLCCPGWSPTLGI K* SSCLGLPQCVLT 














QSNASLSHFTSPPIiSVADE*LFSTDHKD 
IGALYLLFGA*AAVIjCTAISLFILDWLG 
QPCNLLRNDHIYYVIVTAHAFliRIFFIG 
IPIIIGGFGN*LVPLIIGA 


11273 


25174 


A 


11370 


112 


218 


GGRFKGSNFTSAGMQRNIFFMGPPK*NS 
RaGV*QRGEGKNPGVTKrj!IRL 


11274 


25175 


A 


11371 


823 


1078 


SQHFGMPRWADHLRSGVQDQPGQHGETL 
SLLKIQKLAGRGGRCL*SQLLRRLRQEN 
CLSLGGRGCSEPRSHHCTPAWATQ*DSV 
S 


11273 


25176 


A 


11372 


184 


420 


IKQQKNLFIiALVKELVQLLWKTVWQFLK 
KLNIELLCDPAILLLGIYPRELKMYVHT 
KTCT*MFIVELFIKSKK*KQPTC 


11276 


25177 


A 


11373 


234 


44 


LATITDRLIiALSDSAIPPLGIYAKEKKS 
VYQKDNCTYIFVAALFTVAQI*NQPKCP 
STRTRGS 


11277 




A" 








PPIGSPLVPYIPSPALSSPQAPRMGSHP 
CLIPSASCHPPLETDFLSLLPTHFCLAV 
FTKERFSPPPPSPYPRF*KFLRS*KFSF 
FLPPFLFFGGTVLLCLSGWSAVAQSWLT 
AASTSLGSSDPPTSTS 


11278 


25179 


A 


11375 


43 


456 


EFFHHVGQDGLDLLTS * SAHLGLLKCWD 
YRREPPRPASDGHY*TDATGSLPSSGTT 
* IRTKPSQAPASWGLWNLAHHPPRSHPS 
CPMAWLICSTLSSFDGGSPGTGPGGWCP 
LGLSGSPARAVFKDSSCSLHPIATGI 


11279 


25180 


A 


11376 


288 


484 


EWVLSAVGGSQYGVCLPFLHCFIFFETE 
SRSVAQDGVQ*CDLGSGSSLPPG 


11280 


25181 


A 


11377 


129 




NFNALNLIRAGAVAHAYNPSHLGGEDGR 
IA*GQEFETSLSNTA 


11281 


25182 


A 


11378 


307 




DSEIPSLGLAKFWNHRREPPCLALFLTL 
NQKLEMIKLSEEGISKAKTR*KAGFLCQ 
RVSQDVNAKEKFLKEIKRATLVKTQMMR 
HQNSFIADAEKVWMAW^5K 


11282 


25183 


A 


11379 


200 


397 


CISLFSHYYKDLPETGSFIKKRGLIGSH 
WLGAVAHACNPSILGGQGERIT*GQEFE 
TSLASMVKPC 


11283 


25184 


A 


11380 


167 


1 


SNGLYSREiyjMVQHTHIRPGWAHTCNP 
STLGGHGGRIT*AQELKTSLGNIVHPC 


11284 


25185 




11382 


12 


395 


AHSSFLSLDLSVFATCPHPRAETQTGHR 
FSTLLPLSALYPK* DYFKKKKKKKKKKK 
KKKKKKKKKKKKSGGGA 


11285 


25185 


A 


11383 


160 


2 


TMEMMLD IK* I*EIFLFEFKjyiGLKAAET 
TQtflHHTFGPGTANEQTAQRWFNK 


11286 


25187 


A 


11384 


274 


389 


GQAQWLTPVIPALWETKAGGSLE*GQAQ 
WLTPVIPALWETKAGGSLELKNSRPSLG 
NMVKPP 


11287 


25188 


A 


11385 


389 


37 


WSYESSWSFIFKSFLFSHyiISAYDM*YA 
SQILFTPKLYRTHPYFLLVLFLILSILV 
ILETGSRFVSRLECGGGISAYGSFK*LG 
SGNPPAaMLSSEIPYVHQSCWSDIiAFN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
.N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PEFEM 


11288 


25189 


A 


11386 


214 


374 


KGGGRFKGSKFTS PGGGEKNFF 1 GAPK* 
NSRAGV* KRREGKNPGVTQLKKGVG 


11289 


25190 


A 


11387 


142 


410 


HFVCSTIKVLRDLSSDRSNPGHFLSTSN 
* KKKI KKKKRGAPFKKKKTFFYHGGGGKY 
FFFGAPKIN51.GRVFQPGGGENPGRSPF 
KGAWG 


11290 


25191 


A 


11388 


144 


1 


CLSNEDSIKTPKGLGTVARTCNPSTWGG 
RNGWIT* GQEFKTSLANTVK 


11291 


25192 


A 


11389 


167 


2 


RNISSHLLNMAITKQTNBIH1ICM*GYREI 
ITHIHC*WEFKMMQLLQKLVPQKVKHT 


11292 


25193 


A 


11390 


2 


412 


SDQVQDAGRLISSEASLLGLPTATFLLC 
LPTGFALCTRVPGVFLLLRTSVILREGP 
TLVTSFYLNYLLKDLVSIQSHWGLGLQP 
MNFGETQFSL*QKLV*KEKESMPGMVAH 
ACNPRNLGGQGGLIT*GQEFKTILa 


11293 


25194 


A 


11391 


1 


206 


T6RLAGHRVSSLPIR6GRAEAPLTSRTG 
RLNQD6IiNLr.TS*SARLGLPKCWEYRRQ 
PLRLAVISGFLT 


11294 


25195 


A 


11392 


262 


388 


VSK*KKKKKKKKKKKKKKKKKC! 


11295 


'25196 


A 


11393 


78 


414 


ICFFLRAPPFFFFFFFF*TKIFFFFPAL 
NSMANFFFFWTPPFLGGIFLPPPFFLEG 
GGPGLASPPGGAFFF 


11296 


25197 


A 


11394 


238 


11 


IKWETAGRSSACL*S*LFERLRRE3SrPLN 
PGVHG*DEI**HHCTPAWVTVRPRI.SKY 
INKNKK+ARLAARCGGSRL 


11297 


25198 




11395 


340 


1 


KHILVPPFINVSEIQHYSLV*HLLFVGV 
*NLFFLFLS FKKPVL CSM*IICHEIISP 
PSKPKCKASHLSFV* KKNSNLWLGAVAH 
ACNPGTLGARGGR IT * GWE lETS LTNME 
K 


11298 


25199 




11396 


363 


245 


QMDLNRHLS KDDK * MAWQCMNR CLTSLA 
IRQMQIKTIMI 


11299 


25200 


A 


11397 


292 


2 


DPKKEAGRGGAQL*FQLLRRFKHETHLN 
PGG*GCSESRSHSSPPAGATK*NGVSKK 
K*Y*APKEERHPRPVLVEPRVPQVPSPG 
SQTYRQDHSPRV 


11300 


25201 


A 


11398 


121 


321 


KKLYTHKSKSDVMIFLKIETVSWLYCPG 
WRVMAHCDLKLLGSSKPPTSAS*VAETS 
GTCHHARLIFL 


11301 


25202 




11399 


118 


394 


QALPQPIPVLILLQFFFFFF*GTGFCFV 
AQIGGQGGHYG*LEPPIRGLKPSSRRTL 
RGGWD*GGHHNAQLILGFFEICTGSCFW 
QAGLKALA 


11302 


25203 




11400 


273 


379 


AEPGWAHSYNPGTLGCEGGGIT*AQEF 
YTSLGNM 


11303 


25204 


A 


11401 


360 


38 


KIFGQARWLTPVIPAFWETEVDGSPEVG 
RSRPA*PS 


11304 


25205 


A 


11402 


254 


404 


GPAGHDFHMI TPS KSTETGLGAVAHVCN 
PSTLGGRGGWII*RQDLETSLA 


11305 


25206 




11403 


161 


3 


ILTPTLQYIKINLRGQAVWFTHVIPALW 
KAK*SGGXiLELRSIiRPAWVXEGDL 


11306 


25207 


A 


11404 


1 


170 


MGVSLLSPRLEasrGVISAHCNCHLL*FT 

FK*FSCLSLPSSWYRCVPPCLATTFLIY 


11307 


25208 


A 


11403 


249 


3 


NAKS RI QVI S Il-JVFTVKFFHI .YYLCNFS 
Q*NEMAGGQS IFRKIOiLDFSLEHSGMMT 
THCSLDLPGSSDFPNSASSVGGTRH 


1130S 


25209 


A 


11406 


70 


401 


CGKHGWLIIHSFLNHLKYLLLLQMANL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predicf- 

nucle- 
otide 
iocation 
correspon 
ding to 
last amino 

reSdueof 

sequence 


Amino acid sequence (A=Alanrne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FISQFLLTCVSEPLLF*YCHQIIKVGRL 
HEVKQMRGTKKQEKMFP YWWNDKNVC PG 
VPAHACNPTTVGG*GGWIT*GQEFNT 


11309 


25210 


A 


11407 


257 


389 


GGSLLQKYLRPLVAFLFSFFFFSFETES 
*SWRLECNGMISAHC 


11310 


25211 


A 


11408 


364 


420 


SQPFGRPRQADCIiSSG*AQWLTPWPAL 
WEAKAGRLLELRSLR 


11311 


25212 


A 


11409 


312 


413 


TSGMGPGWAHACNPITLGGQGGWIT*G 
REFETS 


11312 


25213 


A 


11410 


283 


425 


HGHKDYVHWLQAVARAC53PS ILGGRGMW 
IS*GQEFETSLftHMVKPCL 


11313 


25214 


A 


11411 


389 


3 


CRIDGLFFNLVFLRLWWKF*VIFFLSY 
FSFLVIILQFLTYMIiNCTFF*QILNLFP 
ILVLLTKILNTLLSTENYCHFSLLYCLE 
FWFHPI I KHKMYHLFLI * FI LI ffl LRQS 
LARS PRLKYSGTIS IH 


11314 


25215 


A 


11412 


267 


3 


QEFQQEDTSEIJDLEIRSLCKRKGGLKQG 
DTKDNLSKRVWETDKGKAAWWPGAVTH 
ACNPSTLGGQGGWIT*GQEFETTLTIIRC 
PRV 


11315 


25216 


A 


11413 


3 


281 


GAMSflRIPGBGSLSASQAIQPFSLYPVH 

KKKGGGGVLKKIYFPPRGEKKFFFFWGP 
PKKTGGGGF 














11316 


25217 


A 


11414 


275 


2 


KTMFQLFNSDFKFPGSRNPGQSWSPRA 
KQSFCTQRIj:iKYFI.K*iyiNEMKLNKSRSG 
AVARTCNPSTLGGQGGWIT*GQEFETSL 
ANMAKPR 


11317 


25218 


A 


11415 


319 


1 


rnlgvqvyeVwlvykycsvssnnkmkm 
krdllvi *vpgivihliyrlikenpely 
islfiysmrslkkltyrpgavahacnpn 
tlggrggwivcsqefetslgnm 


11318 


25219 


A 


11416 


121 


428 


LSCDKWGNRDMTKMLSKELKKKKKKKKK 
KKKKKKKKKKKKKKLGDEEEDKE*QSSW 
GGGGALIKNILFPSRAASGFFLSLLKYK 
KLGAAVYI ILGEKTI LWL 


11319 


25220 


A 


11417 


79 


253 


KIFNFQFSNMLEMNSHTTRIimYFPGRE 
RWLTPVI PPLWEAKAGRSLEVRS *RPSC 
P 


11320 


25221 


A 


11418 


180 




EVGFMKPGLCGVFLEAGSHSVALTIVQW 
CDLGS PQHPPLGSRDPPP*ALQIVGNTR 

RVP 


11321 


25222 


A 


11419 


154 


3 


STLKDKENQKLSIQPDTMAHVCNPSTLG 
GRGGWIT*VQEFEISLNNMVKP 


11322 


25223 


A 


11420 


436 


478 


AI PLPH* RKKILHAALFFFFLKGS FVLS 
PRLECSTKIWAHCSLPPPNISNPPASSS 
QEGGTT 


11323 


25224 


A 


11421 


143 


398 


AAFGFWFVLIRFLRQRLALSPRLDCFFA 
KS ASCQPRP P* FKRVCWLS LPKSWDYEF 
DLLCAAGLVWVYLSVSRLHINEKNQPPP 
P 


11324 


25225 


A 


11422 


1 


138 


MVFHRVGQDGLDLLTS *SAHLGLPKCWD 
YRHEPPRPAAHSYFLLHK 


11325 


25226 


A 


11423 


2 


390 


LELPILLLNTLILFLFLILYIYLVFSPN 
AFRILFITIVLESFSFILLVPSKPWKYS 
LI LSNAFVLCMMISS PPFPLFSLSTTRI 
NKKLDFWLGAVAHAYNPSTLGSQGGWI I 
*GQEFETSLTNMAKPHL 
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SEQID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

iocation 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Tlireonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11326 


25227 


A 


11424 


267 


414 


VFQITFCIKNFWPGWAHACSPSTLGG* 
GGRIT*AQEFETSLGNMVKPC 


11327 


25228 


A 


11425 


189 


73 


HQOeLDLLTS *STRLGLPKCWDYRHKPP 
RSAHPNAtraaP 


11328 


25229 


A 


11426 


318 


33 


KKTISSHARGRKLGSFKTAAPFFFFF*T 
GSCSLaQAGVKWCSHSSMQPRPPGLK*F 
SASGSAGI IGMSHHTRPIAGFLLAKLKK 
KSNSTGYGGRHV 


11329 


25230 


A 


11427 


291 


393 


DRVSLCLPGWSAVLPS*LTAAQISWAQV 
IFLPQP 


11330 


25231 


A 


11428 


51 


399 


KRVARQNPPNGQGFGWCSCS PI LYGQSR 
AQGENIYKLLGSGEWPGQLI IDVEREQL 
EDWRQGDLGRGMWMTKARVHKGPGLVAG 
TCNPSTLGGRGRWIT*GQEFENSLflHMV 
KPCF 


11331 


25232 


A 


11429 


242 


420 


VKKKKRGGRFKGSKFTSaCLQGISFFKG 
APKLNSRAGF*QRWEGKNLGVPQFNRFG 
AHP 


11332 


25233 


A 


11430 


77 


405 


TKRGDSSFCLYRIWIIDSVMKWNQSECAA 
AAAAFFFFLRGTLLFLFRLKGKVGASVI 
*NFCLRGWGGSPASPSKEVGMEGACHNS 
GLFFVFLEKRRFPLVSQEFKTTRflW 


11333 


25234 


A 


11431 


121 


439 


LPLPTS6PEDPLALSFKPTPFPSTLSGM 
lYTGHPSPEHTSHLMPAPPVUHNKACRF 

KKKKI FAPPGGEI FFFFWGPKK 


11334 


25235 


A 


11432 


109 


283 


LTPWNLDSRCEIKGNGKPQENHLNLGGG 
GCSEPKSHHCTPAREKRQNSVSKK* INK 

IK 


11335 


25236 


A 


11433 


296 


1 


FLKKPSKGFSKLPHLPQNLNLFFPPK*K 
CPEAKKDIPPPSF*KISSLFGEG*GFTG 
SRPPSLNLENPPKRIFFFFFFLDRVSHC 
RPGWSAMASSCPRV 


11336 


25237 


A 


11434 


385 


418 


QSETLSQKQKTK*E*SWLGAVAHACNPS 
TSGGRGGWIT*GWEFENSLTNMVKPCL 


11337 


25238 


A 


11435 


3 


405 


LLHQGRLMLGQWGNGRTKTLLMRENVGK 
FPAEPSCSALGVWLQALACYPGLVRRQ* 
ANHWE KKKKKKKKKKKKKKKKKKKKSGG 
G 


11338 


25239 


A 


11436 


221 


3 


KHEFQTIQMESMK*NTSIKICIYAS*IQ 
DCLRYQVIQRPAAVAQAITEHFGRPRQAD 
HLRSGVSDQPGQHGET 


11339 


25240 


A 


11437 


288 


366 


I<MFFSWA*WLMPIIPELWEAKAGGSL 


11340 


25241 


A 


11438 


160 


370 


LWYHYFPVRDGLTILVTYQLKAIRRLLT 
LKSRSGAVAHTC1IL1ITLGGRGGWIT*GR 

EFETSLAHMEKPHL 


11341 


25242 


A 


11439 


230 


404 


MQHGQVGIYSQGGGWRPVDKKNSQLGTV 
AHTCNLITLGGRGGWIT*GQEFKTSLTN 
IW 


11342 


25243 


A 


11440 


302 


54 


LSGVWFLTPPPKGGGFPTP*NGGAPGFF 
PPPPFKNPPPEGKLGGPKKKKWFPPPGG 
EKLVFLKGAPPFFFFFFFFFFFFFFL 


11343 


25244 


A 


11441 


271 


416 


KRTLSTKLQYMCCLLIKSKSIFK*KKKK 
KKKKDKKKKKKKKKKKKKNPPSLP 


11344 


25245 


A 


11442 


254 


392 


SVTGMTIMTSSCCYKIQKCTKIQKLARH 
GGTCa.*SQLLRRLRQENS 


11345 


25246 


A 


11443 


390 


150 


LLKNQKTLGGPARNPPPLGGPNGGVPKA 
GNWGPPGPPGGTPFFFKNQKITPGGGGG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

otide 

location 

correspon 

Smino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Gliitamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














P*FPPLRGGGPGNPLYPGGQGCH 


11346 


25247 


A 


11444 


122 




PAPLFFFFFFETQSCSVA*AGVQWRNLG 
SLQPPPDDGTRL 


11347 


25248 


A 


11445 


272 


403 


RPKVDKTTKMGRNQSRKAENSKNQSASS 
PPKDHNSSPAME* SWM 


11348 


25249 


A 


11446 


70 


397 


LCHCAPAWATERDPVLGEKKRTMMSYCH 
PPMRMVEKKIVGDIKC**GYGETGFLTH 
C*WECKiyrVQIL*NIV*QC3:.KIIIiNVHSPY 
ELRIVLLGIYPREMKTYIFTKKTSM 


11349 


25250 


A 


11447 


3 


401 


HASAHASAHASAYYRLCAYPKSSFLYPS 

SFLLWSNFPLNILDTTIGPKATETLFI 

ICIVMHDVi SMQLKS KTFYNYFi-'LLSLNV 
'F YF VL S C C KRRKP. 3 Q LGMVSH I CNP S T L 
GGRGRWIT' GQSFETSI.ANMA 


11350 


25251 


A 


11448 


333 


1 


KSGCIPPKGQWLFFI,QKTAKIGSPKPV 
SPPRGKGQFFPQGLRGFY*FLPPPPGVP 
EIFFFFF*DSSHSVTLSPRLEYSGMNTA 
HCSLHLPGSSNPASDSHIAGSTGTCHH 


11351 


25252 


A 


11449 


104 


402 


WCRPLLGGVSQLGYM6VMDPFKEAVCL 
FSEUCHCAGRTTALFRAVRQGCLSLKNF 
LLPFFQLCPAHRGGVL*RQ*ALLSCGGL 
HPVHASRPLCLPTQA 


11352, 


25253 




11450 


198 


2 


KYKFMAGLGGSCL*SHHFARPRQQDGLS 
PGV*NQPGQHGETPSLQKKVSWAWWHAS 
VEAEftGEQP 


11353 


25254 


A 


11451 


171 


3 


TPPPRATSLFFFF*TESHSVSQECasr6TI 
SAHCNLCLPGSSDSPASflSRVAGITGA 


11354 


25255 


A 


11452 


177 


461 


PMTGGKYISNSTCSKPNSLSPHLPLKTQ 
TKNMEHFGRLTRGDCLSPGV*DQPGQRG 
KTLLLQKISOaiiWDVTAHaWWPGLPTR 
ERRRSLSFPTG 


11355 


25256 


A 


11453 


247 


3 


EGECCGIESR*ILHLKGRICWHINYLKI 
TKIKN*VILKKMYKQKLRPGAVAHAYNP 
SALGG*GGRIARAQEIiETRLGNIVR 


11356 


25257 


A 


11454 


328 


57 


EVHSNICATLSTHQMKILTKGTRHQHIS 
*GFF*EGEVLTLSTKLECSGVIMAHCNL 
NLGGSSDPIiALVSQVAETRGASYLLLAI 
CPKVNK 


11357 


25258 


A 


11455 


70 


435 


ATRAKLHLKKKKEGRKKEFFPHWQGQPT 
PALVFPQPRTHFIFKVSIFSLKKNREQL 
PYTVQTQSLLPLNSHWGQVA*AGVQWRN 
LRFSCLSLPSSWDY 


11358 


25259 


A 


11456 


112 


401 


KCWDYRRERPCPAPNICIF*LTFFFLGK 
KLNQS FFFFFFLKGS LVLPPRLKGGGAF 
LFN*NLTLQGKNNSPASPS*RAETLGLG 
PHAKLWFVFLKK 


11359 


25260 


A 


11457 


198 


3 


E KT S VKQ SGDSNKKAWLEMKTNKKS LKR 
*GWAHACNPSTLGGRGGWIS*GREFET 
SWTNMEKP 


11360 


25261 


A 


11458 


179 




FFHMYNHIVCEYTITLSVNINSQGPGPR 
AHaYNPSTSEDRWIP*AQEFKTSLANMV 
KPH 


11361 


25262 


A 


11459 


210 


5 


RLIHYFFKNMLPSGGGSTA*GQEFEIRL 
DNTVRPHFLKK*KISQVWWCMVWSATQ 
EVEVGNGLSPGV 


11362 


25263 


A 


11460 


139 


2 


RIIALSKLFFFFFKQESHFFLPMLECSG 
VI SAHCNL* LLGSSDLS 


11363 


25264 


A 


11461 


388 


272 


S * EOCKKRGGRFKGSMFLSPAGQGNPFFM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 

otide 

location 

correspon 

last°aniiiio 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutanilc Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutainine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














GPLKS I SLAW*QRREGKNLGAPHI TRL 
GAHPLFAGWHKTQGAEF 


11364 


23265 


A 


11462 


2 


387 


CVWKSNNKKAPPCVAALRFRWKDDMDEM 
APGKHWGGLTSKVQSGGLMGRYTGPLTK 
TENS ILLKNP YFEAGHSGSRL *HQHLGR 
PRWEDYLRPGGQGCSEL*SSHCTFT*VT 
EGDPVS KKLICRTI FGG 


11365 


25266 


A 


11463 


325 


398 


SLPTWAVCAPRPSG*GSGIPELKTMIAG 
VILEDYLDIKNFGAKWGLSCTLATGST 
LFLGKVGPFVHLSVMIAAYLGRVRTTTI 
GEPENKSKQNEM 


11366 


25267 


A 


11464 


174 


2 


YLEILFFFETGSRSITQAGVQWSHP6SL 
QTPPPCPSNS PASA* TOAHASADAWADA 
W 


11367 


25268 


A 


11465 


105 


411 


TDVE LLLMDE * RKWFLELETTPGEDDTO 

IVKMTTKDFGYSINLVDiaVSEFKRIHS 
YFERGSAYGIC.7LSII3IVC*GEIFHERKS 
PLMQQISLJjTFILFYFSE 


11368 


25269 


A 


11466 


291 


404 


SQEGVKIKRQGTVAHTCDPSTLGG*GGR 
IT*GQEFKTS 


11369 


25270 




11467 


64 


236 


THAS6LDLLTS*SSCLGLPKCWDYRHEP 
PRPARFSSFYSGSLUSTYLAKI I KRDAFC 
I 


11370 


25271 


A 


11468 




416 


NPQISCLKQHVWGAVMJICNPSTSGG*G 
GRIT*GQELKTILV1SIMVK 


11371 


25272 


A 


11469 


178 


325 


RPEVKDQGASCRDQPHRVGGLLPVCSDE 
RVPQKKKKKKKKKKKKKKKKKSKKKKKK 
















11372 


25273 


A 


11470 


264 


60 


DINHQSSWLSGRWTPLSWVS ISQREDRR 
GFSASIYIYIYIYIYIYIYPPIHIYIHT 
YT*SYHFNQNT 


11373 


25274 


A 


11471 


95 


399 


DFPHLLLFLPLSQFCTACCPVPENRCFM 
YFFLIFSALCWEMNSRISHSNTAEVEQC 
CKVFKILQMKSVLGQVQWLTPVIPALWE 
AEAS * LPS IRSRNQ PGQ 


11374 


25275 


A 


11472 


139 


2 


WHVSTEKLSLHTMAHTCNPSTAGGQGGQ 
IT*GQEFKTSLATMAKP 


11375 


25276 


A 


11473 


179 


3 


RARWNPQVFC *GFLPVFPKI LPIWPLRK 
GVPFPPNFFFFFIiRVSLCHPHWSAVARS 

RL 


11376 


25277 


A 


11474 


282 


12 


KYIRNE*IVLFVIYFCRNRISWCCPGWS 
*TPELKRSSCLCLPSCWDYRHEPLCPAE 
MNKLDRLNKMDTTKWIQQKKKNHRTQRQ 
DNKII 


11377 


25278 


A 


11475 


145 


3 


VCMQLRTDFQPGAVAHACNPSTLRC*GG 
WIT *GREFKTGLliINiyiEQPC 


11378 


25279 




11476 


61 


397 


PQTPRLKQSYHVSLPSSWDYSWIYHERL 
VNPGRETDWATCYSGGNIQLQENKLNTP 
TDS TLWVSLCCPRLECSSAI PADCSLNL 
PG*SDPPTSTSHVAETTGACHYAQLIFG 


11379 


25280 


A 


11477 


156 


3 


FIFLKSSQNSYI *KCWPGAVAHACNLDT 
LGG*GGWIT*GHRFKPTRRPRV 


11380 


25281 


A 


11478 


224 


383 


CWVCL IHRAFLAPRP VLGCQ * Q KKKKKK 
KNKKKKKKKKKKDSRGGG 


11381 


25282 


A 


11479 


339 


58 


NFKNKLCFCASGYLDSKDSQHYIIFFIR 
WSLALLPRLRWEDCLNPGGGGCSEPRS Y 
HRTPAWVIE *DPVSKK* IK*NKIK*KQQ 
DWLRLHDKIF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

peptide 
sequence 


M 
eth 
otl 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

location 
correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptopfian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11382 


25283 


A 


11480 


244 


407 


SFGKKIGFLFGPKKFNGGSYGNYYKINR 
RFFFPILGEGFSPHVYF*NLETRTTGGG 
LLETTKTTFNLLGRKLGSFLGPRSLTAA 
PTGTTIKLTGGFFFQFWGKVFPRMYIFW 
KDEWKKGF 


11383 


25284 


A 


11482 


308 


30 


S PQKNPPTYSGFKKNLSASPGQKAKPRF 
FLKNQFLPGIMGGGRQFPLLRRVRPPNC 
FNFGGKRCa!I*QKLGPCPPGGPKETVSKK 
KKKRERQQ 


11384 


25285 


A 


11483 


226 


407 


IIHWNKK*Ft*PKVAHTYNLSALGGQGGRI 
T*GQEFETSPSNMRPHAS 


11385 


25286 


A 


11484 


10 


411 


QPPI YFLSLRISLFWVEHINGI IQYVAI 
CVWLVSCHNFSRFI CVBACDQDFIVFLW 
LNNIPLCGYNILCIHSICENWRTTGLFP 
LIjAIIIKL*TFVKKKKKKKKKKKKKKKK 
KR 


11386 


25287 


A 


11485 


184 





TDEELLPMDEQRKWFLEKKPTSGEDAVN 
IFEMTRKDIECyVH*VDKaAAMYERIDS 
NFLSN 


11387 


25288 


A 


11486 


435 


222 


AKMLMRHFSKKVI*MaSRHIKRCSISLT 
I RET * I KS I IR* HVI PGQMTFI QKTGDM 
FGSVSPLKSHHIiKL 


11388 


25289 




11487 


317 


408 


FRPGTVAHAClSrPSTLGGQGGWIT*GKEF 
KT 


11389 


25290 


A 


11488 


340 


469 


GKSDTKCHLWLGTWEIACDPNTLGGQSR 
RIA*GQEFKTSLSNM 


11390 


25291 




11489 


38 


389 


KRPTSLKKSNFCSLQGFFFFGLSQKPGK 
SNRSPGKKKNPFFSGGKaP*KGF*KIFL 
SPGLGGSPQKPHLLGGLGEKIJYLTLEKG 
GCRDPK* CPCFPPWAGEGNPFSKKKKQK 
FPFFG 


11391 


25292 


A 


11490 


235 


2 


FVLMtJLMPKLHIPVTEFRRLFAFPEFL* 
FLFCFVLSYFVLLETGSHYVTYAECSGA 
ISVHCSLNLPGSSDPPTSVLV 


11392 


25293 


A 


11491 


25 


417 


GTLCLRIGFINISHHCMILKENSLFFFF 
WETKFYFAPQaD6RGP]!ILG*WNPPSPG* 
RGSPGPTSRKEGNG6GGPPGPVIFGFLR 
KNGVPPGGLKGPKSIiAPGIGPPGPPKDR 
GLRGGPPAPGLLKKNPKVW 


11393 


25294 


A 


11492 


234 


438 


MWI IAQQCDCTECHY*LKMAKMINFML 
FIFYHTHTHTHTHTHTHTHTRERGPSYI 
YGKVSFKLTHL 


11394 


25295 


A 


11493 


344 


477 


CFTMFIS * I KT* KFNI SQLVAHTCaSIPS I 
LGGQGGRIT*GQEFKT 


11395 


25296 


A 


11494 


35 


482 


G1PGFHHVGQNGLDLLTS*SIHLGLPKC 
WHyRPGPPRPAISITPST*AYLLGNQKI 
GVTHFLAILM.FFSGTKAMISPKYVCII 
FYLLVEEHLGCFYFIAINTATCVCVYKL 
FDTYMYQVESHATHEFISYYFFKILCSL 
SIRLYAVLL 


11396 


25297 


A 


11495 


307 


484 


LSPSERHELKCVLWPGAVAHAYNQSTVG 
GRGRQIT*GPEF 


11397 


25298 


A 


11496 


487 


349 


FLIKTEFHPIGQEGFDFLTL*FARLGLP 


11398 


25299 


A 


11497 


224 


417 


RTDGKVF* FLSAKCEGI TFFLS FFFFFL 
EVLFCSPG 


11399 


25300 


A 


11498 


17 


411 


KRLVPGGRGRTDRFISQYV*AEYVS INL 
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SEQH) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etti 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

peptide 
sequence 


Predict- 
ed end 

otide 

location 

correspon 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l/=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tiireonine, V=Valine, 
W=Tryptoplian, Y=Tyroslne, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














LIFLTKKKKKKKKKKKKKKRGGALKKKP 
WGGQNKTGEKKKNFFFLRGGKKKPWGDF 
KKKNFFWGGEHWGKPPPKKKTPWGKKKI 
FKGKGGKTPPYFCFLKKM6 


11400 


25301 


A 


11499 


117 


370 


VSESADLAWSLGICFFIEFPSDADVAGL 
ETAFFLGGWTGS CS WQPGV* WCNHSSL 
QPRPSGPRQSSCLSLPSSITOYKYMPPCL 


11401 


25:i02 


A 


1)500 


244 


1 


EELFECYSCSPTFSSDPLTTPLLILTT* 
LLPLTIMRSQRHLSSEPLSRKKLYLS IL 
ISLQI SLI ITFTATELI I FYI FFET 


11402 


25303 


A 


11501 


130 


55 


RFVCSTIKVX,RDLSSDRSNPGRFLSTSN 
SSLKKKKKKGRPF*RDNSAILF 


11403 


25304 


A 


11502 


186 


421 


SFRLKIAGIIGVHHCTWLWLIIiNLVLL 
K* ITFALKIMMFSKYSDIGQAQSLMPVI 
PftLWEAnRGGSLQPRSLSLGKK 


11404 


25305 


A 


11503 


1 


213 


GELLFLYIFA*WLFLLPKLr.KFVIVCLC 
ELQFNERFIMVLCIIIILGVBGFFLFCF 
LL*FCFDFFFFFWRG 


11405 


25306 


A 


11504 


82 


6 


AGVQWPRSWLRLP*LFFTQVLMIFFPPF 
YHQNFFFP*KGFFFLGGLSHFFPPP]!IKG 
FFFNYPQGFFFS PSLKKKFFFFSSPI FL 
APPGIFIi*GPPRFFFFFFFFFF*E6VSL 
CHLGWSAV2Ui.ILAPPAGFTPSSCLSLPS 
S 


11406 


25307 


A 


11505 


245 


440 ■ 


WAPCIG*QLLKDPQVLFAGYKVPHPLEH 
KIIIRVQTTPDYSPQEAFTNAITDLISE 


11407 


25308 


A 


11506 


256 


370 


GKLM*GHHV(SQAGLKLLTSSDPPALaSR- 
SAGITGVNRHAPPRLNVTLYTKGQTI IL 
YHNQYNQ 


11408 


25309 


A 


11507 


1 


149 


GCMWPRTVLFAFQHKFVNFLKTLGDF6Q 
VQWLMRVIPMiWEAEAGKLPEARRWRPP 
♦ PRTVLFAFQHKFVHFLKTLGDFGQVQW 
LMRVIPALWE2^EAGKLPE 


11409 


25310 


A 


11508 


316 


420 


YTYI YEPG* VQWLTPVI PALWEAKAGGS 


11410 


25311 


A 


11509 


136 


2 


LDNRLWPGAVAHACNSRTLGDQSGWIT* 
GQEFETSLANMMKPHL 


11411 


25312 


A 


11510 


167 


420 


PTPRSGNTALQPPHHTHTHTHTHTHTHR 
HTHTHTHTHTRAC I * KVRVRAL FS P KKK 
PLFCLSVSQIYVGRCLLKRLLFIFLYRH 


11412 


25313 


A 


11511 


124 


4)5 


lYISVANTIILIFIAIHVLAISFFFFKK 
EVLFLPPRREGGGAF*TOI*NLCFWG*GN 
FPA*PSLKKGITGALYTPGLFFVFLKKT 
GFRHVGQAWLDFL 


11413 


25314 


A 


11512 


121 




PPVKFWAPFFFFF*IGSPSVAQAGVQWW 
NLSSLQSPPSCL 


11414 


25315 


A 


11513 


245 


13 


VLFILSGFFGPIWYPTSPPPQLPFPASG 
NHPSTLYLHEFNCFYFYFLFIYFF*DRV 
SLCRPGWSAVARSQLLVPSDS 


11415 


25316 


A 


11514 


434 


2 


RPPFFPVWGSPPPFFFFFGEGIWGQEER 
KGASS PPWSRGILRGGGPFGGPGAPPT 
PLKKPPLFFFLAPPFLGVPPRFFSFPTV 
FFLPFFVCPKKREALLGVPKGPRPFFFR 
SFLFFFSGD*VSLCRPGWSAVARSQLTE 
SLAS 


11416 


25317 


A 


11515 


247 


390 


QGLRIYSQDFFFFLETKSRFVAQAGSQG 
GNLG*LEPPPPS*SDFMASP 
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SEQID 
NO: of 
n ucleatide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l,=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11417 


25318 


A 


11516 


269 


368 


FFFERESCSVTR&GVQl\fHRLSSL* PLPP 
AFRRF 


11418 


25319 


A 


11517 


256 


404 


NILKI FYLRI *KKKNQIiGWAHACNLRT 
LGGRGGWITRGQ* FKTSKEKMVK 


11419 


25320 


A 


1151S 


211 


404 


YLCFTRCSVHSHLK* VFLHKCVI *MyRP 
HT*TGL* CMQVHTHTHTHTHftHTHTHTP 
LSQGGDCI 


11420 


25321 


A 


11519 


41 


232 


ESAffiPRRHRVPRAQITPIiHCSIiGDRARP 
CL*SQLLKRLRHENLLNPGGAGC31EPRS 
HHCTAAWATEQDPV*KKKKSLNPKTVRA 
INLKMLGALISFHLFLTOIWF 


11421 


25322 


A 


11520 


146 


2 


LIDQQGVNQLPRMECSGAITAHCCLELP 
GSSDPLTSAS*VAGRPPTRP 


11422 


25323 


A 


11521 


297 


437 


NPSTLGGRAHRIT*AQEFE 


11423 


25324 


A 


11522 


244 


460 


NCATITTNSRIFLQLQKKHCTKl'JLSLSI 
SWQSLISF**ICFVLFFETNLAVSLRLE 
CSGAILAHCNLCLLSS 


11424 


25325 


A 


11523 


222 




ESKDLDIFRCQLAPRNSSCFVLFCFVFP 
LEIGSCSVAGAGAQWCNHGSLQPPPPIA 
SAS *VAGTTNVRHYVQLF 


11425 


25326 


A 


11524 


308 


424 


I TFLTMFKI YLQPE PAIPLLGIYPKDY* 
SCCYKDTCTRM 


11426 


25327 




11525 


98 


2 


NIGGRPGAVAHAYNPSTIiGGRGGQIT*G 


11427 


25328 


A 


11526 


113 


399 


LDRFLTLLPRLRREDCLSPGGQ6CSEP* 
SHRCVPAWATE*DLLSKKKKKGVLEKPF 
LPPPSVGKPPFSPPERLFFFILRPPL6G 
VFPPPSKKIIS 


11428 


25329 


A 


11527 


280 


3 


PKYCQI*VLQCIAQLFBCLSIYNTYTHTH 
THS LS LS LS LPPPKVWKWLMS SEFTLTF 
KiyiYTFGPGRVTHTCNPSTLGG*GGWIT* 
GQEFETNL 


11429 


25330 


A 


11528 


284 


2 


FWGPQKKNFSFPPPGLKIGSFKRAPPFF 
FFFFETKSCSVTQAGVRWCDLGSLQPLA 
WATERD* LNK* IKGMSKKLHT* VCMFRT 
RGRTRGRTRG 


11430 


25331 


A 


11529 


134 


2 


D'l'LLPRLECNGT I TAHCSLKL PGSGDP P 
ASAS * lAGTTGMCSHA 


11431 


25332 


A 


11530 


104 


1 


GRGFFFFFFFFF*EVESCSVAQAGVQWL 
NLGSIQ 


11432 


25333 


A 


11531 


157 


373 


CVFYTYSTSQLRCYI FIGNTRSVFRCHE 
lYT* KNRFWAS TVAHTCNFSTLGSRGKW 
II*G*EFMTSLANWR 


11433 


25334 


A 


11532 


237 


389 


ICKCSKTGFG*DFQEFETSLGNIVRPCF 
YRKIKASQV*WIIAPIVLATWEAE 


11434 


25335 


A 


11533 


259 


122 
_ 


HISPSVQRHFFLFYF*TKPHLVAQAGVQ 
WCNLSSLQSPPPGFKRF 


11435 


25336 


A 


11534 


164 




SSWSQALILLFLKKTCFWPGAVAHACN 
PCTLGGRGERIT*GQKFETSLASINL 


11436 


25337 


A 


11535 


106 


310 


GGDKGTVWDTLHFFDGQKRLDHNKDHWM 
TIHAELPHELSA*CYAFKKECISGIGSI 
*AEKEYKIQFDG 


11437 


25338 


A 


11536 


167 


378 


ASFFGTTITYKTILVYVYMFEKFIIYIH 
MYTF*RQGLVLSKLECSNVIIGHaJLKH 
LGSSNLPTSASRVA 


11438 


2533 9 


A 


11537 


101 


396 


VHNLKQRIGLSKRGEKKLLVLISGFGGG 
FPGAGWIKKGKPPPGFPQGLGP*KRGPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 

correspond 

residue of 
sequence 


Predict- 

location 

last^amino 
residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vidine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, possible 
nucleotide insertion 














GKKGNFNKMGFNPKTLVFKNQNFGPGGA 
KPGTPGVRGPLGDL 


11439 


25340 


A 


1153S 


18 


361 


QIGPRLY*DQWGETPSLLKIQiaCKIPGF 
GGGPLKSQLLGGLGWENH*NPGGGGCRD 
PK* PQGLPPWGTNQNFVKKKEGGREKGK 
EGRKEGGGEKKKTLPKGIGALNLGGLLT 

SP 


11440 


25341 


A 


11539 


134 


365 


PGFQNKKTGGGGPFRPKTKTNFKVF* KY 
LKICFGFWPKREPPPGFFILETRSLFFI 
FFLGFFF*ETKSRSVTRLECSGVILftHC 
NPRLVGSSHYP 


11441 


25342 


A 


11540 


2 


318 


TTERFIYRITFGPKETEVNDLPGAPSEP 
VAaiiRLEARSFILKL*SFSLVFDFFFLP 
F*KTGSCLSPRLECRGEIITVHCSRELP 
S*SHPPTSASQSSGSGGFHLG 


11442 


25343 


A 


11541 


294 


409 


GTISFFRTCRLGMVAHTWNPSTLGGQGG 
*II*GQEFQT 


11443 


25344 


A 


11542 


1 


123 


GKQRQEPALSYDHATVLLTS *SAtIIjGLP 
KCWDYRSDPPCLA 


11444 


25345 


A 


11543 


69 


350 


VGAKHWIHMDTKIETADP6NYYSGERGK 
RPRAEKLPVCYYALYLGNGI ICWPKPPC 
CWKGHDFIYIYVCGHIYIHIYICTHIYI 
YFF*DGGLLC 


11445 


25346 


A 


11544 


103 


340 


LFCAFFLSPSKLDFYHLSFYV*TLFLKD 
LISLAFPFLFWEAGSCSVTQAGVQGPNS 
THCNLCLPGSNNPSTSASQRA6T 


11446 


25347 


A 


11545 


256 


354 


PYPPGKPKRGDHWGLGV*NQPGQQGETP 
SLLKI 


11447 


25348 


A 


11546 


196 


335 


EIVARLKRSPVGEAQWLMPVIPALWKAK 
ESGSPEVRSS*VKKrTCRPGTVAHaCNP 
STLEGQGKWIT*GQEFKTSL2U!IMVK 


11448 


25349 


A 


11547 


202 


377 


KILGWARWPMPLIPAFWESEMGGLLEPK 

SL 


11449 


25350 


A 


11548 


396 


26 


PPRIICKEKRSPNGAFYPFPCPJNF*KKKN 
LSLAKRFPRDSVFFFVSW*GSYLPTTFL 
QLLRWYVLTSHIAVSFKHLCEPHKNFTL 
QHSIPYLYIIEIEVYSYFINRNTIYKIN 
MDFSIVGFVFL 


11450 


25351 


A 


11549 


232 


371 


REVPFENIKIGQVQWLTLI IPALWEVKA 
GGSLEARSL * KYQNW PGAVAHAYNP STL 
GGQGGRIA*GQKFEISPANMAKP 


11451 


25352 


A 


11550 


174 


381 


NRDEGFRYADRAEVQRLLTGTILEHCGL 
QRPGL*RSSCFTLP CGWDYRH * S P SHVA 
GTTG I SHHTRL I KKAFFSQTGVS S C 


11452 


25353 


A 


11551 


262 


359 


DYSNLGLVWWLTTVI PALWEAKAGRSSE 
VRSS*PALWEAKAGRSSEVRSS 


11453 


25354 


A 


11552 


70 


380 


LHKCS Fl YI NHCNNP I YGL I S LYPKQL F 
SYENFKDFCNKI*P* IPNFELKTSTP*L 
EKL*NWAMARHGGSRL* SQHFGRPRRAD 
HMRSGVPGQPGQRGESPST 


11454 


25355 


A 


11553 


319 


56 


TFFLGPPGFF* KI FFSVGPPLIFFLPKK 
FIGEQKKNAPGKNFFFGFLFSLFFFFLL 
TLSSALECSGAMSAHDNLCLPGSSNSPA 
SAF 


11455 


25356 


A 


11554 


230 


399 


MMGEIINQANIISVCQIIRRLRHESHLI 
PGGRGYSELRSYHCTSTWVTE * DSCLKK 


11456 


25357 


A 


11555 


376 


396 


F*IFWRDGVSPCCPGWSQILRLQQSTCL 
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NO: of 
nucleotide 
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SEQID 
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od 


SEQID 
NO: in 
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26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, r=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lys!ne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V'^possible 
nucleotide insertion 














GLPKCWDYRPPHPATAASCPYFYHYYLL 
K I WRYS FDLS CLWNLWWGLAF G 


11457 


25358 


A 


11556 


176 


384 


KAWPNVRNNYR*NFFFQCCLNIIFRKRLK 
CFRCGADKFGKTGFSCLQYALKPTNWGR 
EOWLTVNPTNPSK 


11458 


25359 


A 


11557 


295 


391 


PYKLLITHTHTHTHTHSRTHTI*HTTLS 

LYTF 


11459 


25360 


A 


11558 


1210 


1753 


NTWCMERLAVAFGSFNRNSPLRGWALWL 
TP I ILALWEAEAGGPLEPQE FETCLGNM 
VtaiCLYKNLKNVSGV*DLPGQHGETLSL 
QKFKKCVRRGGTCL*SQRLRRLSWEDCL 
SQGGQGCSEP*SHQCVPftWVTELDPDSK 
KTKTKKFFGLPSPFLEVCFLTHIYVNKE 
YAFVLAEEASGKTTSKLTMVTSRNGLGK 
TKKFFGL 


11460 


25361 


A 


11559 


172 


3 


KQKFFFPPPSHQIKIFGGGAPEMPFFFF 
FFEKESRFVA*flGVQWBNLSIiHI>PPHP 


11461 


25362 


A 

— 


11560 


143 


1 


GQAQWFMPVIPVLWEAKflG* SLEAHSSR 
PVCATATPPQLIPK*YEGL 


11462 


25363 




11561 


84 


2 


KPLDTLGGQGGCIT *GQEFETSLANMV 


11463 


25364 


A 


11562 


12 


387 


QGILLPCFSMSSEERGRIYSNTFSFFFF 
FGKGGQINPQGGGQATEPRLREPSPSGL 
TLQGPGNGGPPPPPGQNFFFKKKGGYPG 
GAGGVIiNPGPKGHTPPGPPKARKKRGGP 
PGPGQKFL*NPKG 


11464 


25365 


A 


11563 


230 


375 


RAKGKLLSESLLSIQSFKKPSVHTOIPS 
TLGG*GRWIT*GLEFTTSIiA 


11465 


25366 


A 


11564 


287 


55 


PPDGRVQWLTPWPGLWiiAKEGGSLKSR 
S S * SQINP VGS VP YSLRVLR 


11466 


25367 


A 


11565 


115 


411 


FFFFFFFLGENSFFSFRGGPRGGGHI FW 
NFPPPG*GVFPPPPPPGGGNKGGAPPPG 
FFFFFP*KTGFPPLGPGGPPPPLFGPPP 
FPPPKGYNFRRDPPP 


11467 


25368 


A 


11566 


139 


356 


CLLIGWYNTGCITQGSTRKIGTTFFLRQ 
SSSVAQAGVKCSGMILCYRNIiRLPGSSN 
LPDSAS »VaGTTATYP 


11468 


25369 


A 


11567 


101 


410 


FLPFFFFFFLKKIFFFPPRGGGGGGN*K 
KQTPPLGGKKNFPPPPPKRGGGGTPPPP 
PLLFLVF*GEGGFPGGAGGGKKTPKGNP 
PPPPPKGGEMGRGPPPRGG 


11469 


25370 


A 


11568 


417 


56 


EDCLRTGVRDLLPNPHPNLQ lAGRDGAR 
L*SQLQEHHLSPGFRGYSEL*SCHCSPA 
WATE*AYRKQNKTKQNYRSSALLHLYTP 
MflTAWlKIAFLRAGDIFLSVLSFLPNSY 
PLFSHPSQ 


11470 


25371 


A 


11569 


301 


443 


EDTTFVCFEVESCSVAQATDSPASAS * 1 
AGI IGVCHHARLIFVFLVE 


11471 


25372 




TT570 


44 


408 


RGRGHGPHSPWRRRRRLRWEGRKATES 
KRRAYKYPLPALLCSHQWKPNEWLDSQI 
KIPTGL**LK*PSFIFIFIYLFWKQGLS 
LAPMLECRGAVLAH*HLRLPRSSLSHAS 
ASQVAVTTG 


11472 


25373 


A 


11571 


201 


1 


AERPVTGPPVYAEPGLQGRQG*KNMFKN 
MQHPRWADHLRSGVQDQPGQYGETPVST 
KNTKITQAQCI 


11473 


25374 


A 


11572 


343 




FGLLFAFTLEYKLYESINHFAHCCVCRF 
SSGTLHVWCNRHPNKLDEWNSGGGGRQ 
DLTLTPRLGCSGVIMAHCNLVLLGLSNP 
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nucleotide 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 

relLeof 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaiiiic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSAS * VAGTIGflHSHaWIDKLFFVKTGP 
LY 


11474 


25375 


A 


1 1573 


209 


3 


VGPPPKSLPGGFPFFFCPKCFPRLKKKK 
KKPGLVAYTCNLNTSGVRGKKIT*AQEF 
KKSPSNIVKPWY 


11475 


25376 


A 


11574 


312 


430 


QRYARPGPWHCYNSSTVGGRSRGIVA 
QEFATSLCNMT 


1147(5 


25377 


A 


11575 


125 


3 


NSLGPWHACNPRSLGG*GGRITSAQEF 
ETSLGKIHEELY 


11477 


25378 


A 


11576 


190 


362 


CVNNYID*EKKKKKKKREEEKKKKKKKK 
KKKKKK*GGGAPKKKKKFrPGGGKNFFF 
L 


11478 


25379 


- 


11577 


231 


340 


KRITRVNQDGLDLLTS*aAPLGLPECWD 
YTMKPCLY 


11479 


25380 


A 


11578 


60 


3 


RMQVSV*IFTPMSLQAYLIYSSSVaAGA 
QSGIEECKYQFAWDRWNCPERALQ 


11480 


25381 


A 


11579 


285 


406 


KKGNWPGAMRHTCNPSTIiRGRGGWIA*G 
QEFETNVGDMAK 


11481 


25382 


A 


11580 


121 


1 


NLNCPGPVAHASNPNTIGG*CRPIV*GQ 
EFYTRIANTVKP 


11482 


25383 


A 


11581 


199 


384 


KKTPPGGIFFCPKEAGPAPPL*TPFC*T 
PLFFSPAPFKPKRALKFFLKPKRGFFQI 

PFFFF*DGVSLCHPGWSTVARSCPRV 


11483 


25384 


A 


11582 


677 


934 


YGSRHCICFLQAISEILFLKNPARHGGS 
QVQDQ PGQQ S KTLTLRKKS K * ARHNGS C 
LSSC2NFGRLRQEGHLSPGG*GCSEL*PR 
HC 


11484 


25385 


A 


11583 


104 


2 


KKIGGGPPLFFFFETESCSVARLE*SGA 
ISAHCN 


11485 


25386 


A 


11584 


349 


9 


GGRSLGPRKTRVQMGIPEFPPLHLGGKA 
KPCFPKKKKKKVKGLNBCHYSKEDIQMVW 
RYMKKCSTSLPRERKIKTIMRYHFPSIR 
MAIIRKIKDPKDQ*GCIEKGGSVVRPMY 

L 


11486 


25387 


A 


11585 


163 


1 


TAVRIKHNSYTLTPCLRHSRYLINVTCW 
PGPVAHACNPSTLGGRGRW IT *GQEF 


11487 


25388 


A 


11586 


192 


41 


SKSKYHIYGDSALQKSTVYKWITHFKKG 
*DDVKDKAHSTRLSMSICEBKE 


11488 


25389 


A 


11587 


30 


374 


PFQKKRGGGGGSS P PFP VLKRVRPENRL 
YPGGGGFR*PKQGPCPSAWGAEPDSLSK 
KKKGFGQKKKKKKVLKEKT 


11489 


25390 


A 


11588 


66 


245 


SLCFFVFFVFF*TESCSVTQVGV*WYNL 
GSLQPPPPKKKKNIKKQKKKINWGRGFK 
RRL 


11490 


25391 


A 


11589 


243 


402 


FLHLKKKKKKKKKKKKKKKKKN*KDAG 


11491 


25392 


A 


11590 


205 


3 


IRLLGIIKNPTPAHCQ*K*KTGEPLW*V 
VW*FFRRLHTELSYDPAIPLLCINSTEV 
WLAPGIPPRFR 


11492 


25393 


A 


11591 


404 


152 


MGFHHVSQDGLDLLTS*STLLSLPKCWD 
* CWDYTREPLCPGHFAISENKWLNTTT 
WLYFYLI ILHCI YCPYKLLFLFPFYI Y 


11493 


25394 


A 


11592 


311 


386 


tvypplagnlflhaga* agi vgtals il i 
raelgqpgallgddqiywivtahafvi 
iffiaipiiiggfgn*iivpliigapdia 
fprinnisf*llppsflij:,lassiveag 
agtg* tvyp plagnlahagasvdlt iff 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
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Predicted 

nucleotide • 

correspond 
ingto first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11494 


25395 


A 


11593 


39 


397 


EGCPLPFFVPFFFFFFGGKKIFFFPPRG 
GGGAPHKIPGPPPPGGNFFFPPPPPMiG 
GKRGPPPPPGFFFFF*KKGGVSLFPRGG 
VFFFSFWRGGRGGFPPPPPQKKIFLKGP 
PPFYKKK ■ 


11495 


25396 


A 


11594 


243 


46 


GILSCCFHDSLSLNSL*CD*VWISLCLS 
YLVS lELLGQCGLI FFIKRES FWLLLLK 
MYLQPLSFLS 


11496 


25397 


A 


11595 


327 


2 


PRAILRIQFLLWPMRPYVMWSQPCSLCF 
CHTAKTVPTSRPLLLGFFSLPGIflHISM 
WLVLSYGLGFSSKVTLWPGWAHVCKPS 
TLGG*GGWIT*GWEFKTSLflNMVK 


11497 


25398 




11596 


292 


384 


LGTVMTCNPSTLGGQDGWIT* IQEFET 

SLR . 


11498 


25399 


A 


1159S 


125 




PFQGQRSGGTRGPGAVRHTYNPSTLEG* 
GGRIT*GQEFETS 


1 1499 


25400 


A 


11599 


302 


408 


TGGWLGWAHACNPSILGGRGRQIT*GQ 
KLETRLA 


11500 


25401 




11600 


409 


131 


GRSGGVPKSGVLKPPGPPGGTPFFFKKP 
KITRGGGQPPLFPLLQEVRPENCLSLGS 
GGCH*TKLPPCPSTWGaQ*KLVSKKKKC 
EGGGRIYS 


11501 


25402 


A 


11601 


196 


330 


HHCMLAWVTGRPYLNIYICIYADIYIHI 
yiCIYIYTHTEVYIYLIiAI*IYIYVYMR 
lYIYIYIYVYIYTHTHGYIYIY 


11502 


25403 


A 


11602 


41 


252 


TPGRARGQRLGHSSTGLDLLTS *SACLS 
LPKCWDYRREPPRPAEDIKFMF*AILSE 
TILVRGVRSRATPS 


11503 


25404 


A 


11603 


271 


440 


KKKKRGGRFKESKFTSAGLQGNI FFIGP 
PKLIPRaaV*QRGDWKlILGVTQLNRLCH 


11504 


25405 


A 


11604 


429 


2 


FSSFFAEKGCHCVSQAGFKLLSSRDLPT 
SAPHLGIRLLGRLRQEDPPSPGV*GCSE 
LQSRLCTPAWITKRDPVS*KFKKQKTKT 
HTH*RKPTKGQDHQYHCLPPPHVMPLKV 
FRGSNVHGAVRTRGSRGSIPELSHA.VPH 
AP 


11505 


25406 


A 


11605 


113 


2 


TEATHLVCTYKN»KLGWflWWIMPVIPAL 
WEAEVGRLL 


11506 


25407 


A 


11606 


301 


3 


WVMVCSVKNIALGHACGKFHYEGRGKPA 
EHPSWVLLVLESLQKXLFLSYFLFTFLL 
QPPSPRLT*PVFRFFFFRDRIMLCCPGW 
SAWPERVGRAGNSE 


11507 


25408 


A 


11607 


117 


431 


SCDFCLSFSLSLFFFFFRKGIFFLGPRG 
EKKGGNKIKWTLGPGGKGNSRPSPPKGG 
GKRGGPPPGANIJDF*KKTGFPQGGRGGE 
KPPPPGGEAGGAPQRGGKPAG 


11508 


25409 


A 


11608 


277 


88 


RRTTSSWLAHNCIQGPAGYHTGS*TPDI. 
KGSACLS FPKCWDYEWRLDL 


11509 


25410 


A 


11609 


200 


27 


VRLKPGDSRDPDMQWYYI*ENRPARCGG 

SLL* S *HFGRKRKADHLRLGVQDTRSSA 

R 


11510 


25411 


A 


11610 


17 


414 


TVWLVMQDPENEWIVSGWG6KEEPGRYF 
MCEARSSSMDRVFFFF*KKTLFFPPGGG 
GGAKIWWGIPPPQGSPLSVAYPLGVWE 
KKGPPIHPHKKRVPPV*PGGVIjFSEPGN 
PPPGPPKGVGKKSSPPSPVL 


11511 


25412 


A 


11611 


296 


428 


RIYENWRGTVAHACNPSTLGGQGGWIT* 
GQSLKTSTGTVAKP 
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NO: of 
nucleotide 
sequence 


SEQ ID 
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sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
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26 


Predicted 
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nucleotide 
location 
correspond 
ingto first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 

iocation 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=lInknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


11512 


25413 


A 


11612 


20 


362 


NFWGVARGSRGGATRCEWVGLRPVGMA 
YGRSPSWLL*VKPSAASHRPPLPRAADT 
PGTAPAPTPTPAPAAAPALTPSSGPGSA 
ALTLEEELQEAIRRAQRWGLTMLPRLAS 
NS 


11513 


25414 


A 


11613 


15S 


2 


LMLSRTPLQQL^YTLLELELPRLLAPDL 
LSNGSSLKDLKWTHSNYRASKETK 


11514 


25415 


A 


11614 


3 


410 


RSEVYGLVLTTWYQVNRVPWTHKftARFT 
RVSRWARPRSKISESCYWLPQAHRTKS* 
PLVRTLSGLCSLLGFHPLLFSLLATDLV 
PLPSFALLLLLELFFLPGMPKLAP*SPT 
KRS FS FYFFRDRVLLCCPGWSQTP 


11515 


23416 


A 


11613 


189 


397 


YKVLFLVEKNGFFFLFFETKPCCVAPTG 
VQGPNFG*LQAPPPGLPLFSAIiftAPEVG 
ITNGTPPPPRFFF 


11516 


23417 


A 


11616 


158 


1 


KGKPTPKTFFCQKQ IFFLVFFF*TESHS 
VAEAGVQWRDLDSLQAPPPGFTPR 


11517 


23418 


A 


11617 


319 


425 


ICVWLYSQAV* KVWLYSQAVAIITYNPNTL 
GGQGWRIA*AYEFKTSL 


11518 


25419 


A 


11618 


94 


2 


CLTISWLGTVAHVCaiPSTIiGG*GGHITR- 

SR 


11519 


23420 


A 


11619 


381 


9 


FFPLPLPLPTVSLFPRSPSDftEPKLDCT 
ARISAHCNLPA* FS CLSLPSACNCRRAP 
PRLTASASRGMIADGVSFTQCSMVPRL 
ECSGVISAHYNLHLPATSLGLPKCRDCS 
LCPDEIVDPRI 


11520 


25421 




11620 


144 


431 


IWIWIVGKGEKGRKIPE*LIRCNGWN*S 
WDRWVAEDHVLREPSENERLARVRCLTP 
VSQHFGRRRRLDHEVRSLRPACATWENS 
VSTYNTKSGWAC 


11521 


25422 




11621 


174 


449 


VFSLPLLGGASQLGYrjEVRDPIjEEAVCP 
FSDLKPHAGRTTTLFKftlRQGHLSLQRF 
LLPFVWLCPAPHGGVYRGRQASLS *GGL 
HPVHASRP 


11522 


25423 




11622 


316 


412 


GGPSAVAHACNPSALGGRGGRI S * GQEF 
KTSL 


11523 


25424 


A 


11623 


183 


2 


PKQVYLRNGLLSHS KCNK IHHINRLKRE 
KrQPGWAHAYNANTIjGGQGRWIT*GQE 
FENE 


11524 


23425 




11624 


355 


168 


DLLIVCYDPNAINL*SLLLRRLKWEDCL 
SPGG*GYSEPRSCHCSPAWAAEPDPVSK 
KDRNVEFIYCC 


11525 


25426 


A 


11625 


154 


3 


TADVLFLGQAGRGNFWMKKNCRPGTVA 
HACNPSTLGGRGG* ITRGLEFK 


11525 


25427 




11626 


412 


3 


IFFPPPKKDKRTRGGQKKNIPPAPPRNF 
LGRGGNFPPPQKP F FPPLF* I FPRPP* C 
KGSIPPFLFPLCFQKKKTPSFFFLPNGS 
PPICEKIFLFPXKGFLKFKKPRFLKKKG 
P FFFFFGNAVLLCQ PGWSAVAQSQ 


11527 


25428 


A 


11627 


290 


423 


VFEYFFKFFKKNWSWPGWAHTYNPSSL 
GGQGGWIT*SQVLRPP 


11528 


25429 


A 


11628 


213 


3 


LSGIYSGNRRLVRYSKSVSVIHHINRLK 
KKNHIIISIDGEKPLEQIQNSFMT*KRN 
KLKWEDHLRPADA 


11529 


25430 


A 


11629 


152 




FQKNRDLEPSKAGWPGMVAHACNPSTLG 
ARGGQIT*GQEFETSMVNMVKP 


11530 


25431 


A 


11630 


422 


189 


PLSGFCPSTWGAKQRALSKKKKRSEAGH 
SGSCL*SQYLGEI,KQKDHPSPGGRGFSE 
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sequence 


Predict- 
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location 
correspon 
ding to 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S^Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptop]ian, Y=Tyrosine, 
X=Unltnown, *=Stop codon,/=possible 
nucleotide deletion, V'^possibie 
nucleotide insertion 














LLNCTPAWVAKQDPVPKKRLK 


11531 


25432 


A 


11631 


239 


424 


KEDNNRVFVIGPLGGLNELAHDKAFKTQ 
NHHWLGVGMACNPSTLGGQGRWTT*GQ 
EFTTSL 


11532 


25433 


A 


11632 


343 


401 


AREREHKSAACQ*GMERSVIKHCHVATE 
*EREQKRERQQERASEKERQQESERDSK 
RERARESERERESTRAQLVRKIVNQIS 


11533 


25434 




11633 






fqgcaki cltntllpyahgyliwrhdgk 
gc:hekmtakiihscklspdal*sqlgvp 
l* thththththththtfsekkglvp 


11534 


25435 


A 


11634 


235 


433 


AALTSFSGPFSFCSYLINTIiGKNFPNHE 
ISTT*FTLQRL*KIIFWPGMVAHACNPS 
TFGGQGRWIT 


11535 


25436 


A 


11635 


93 


385 


FTLKSEHRKHSVPQGKKQNKlKiLSPDC 
KLENKRFGWARWLTPVIPAI.*EAKaGES 
PEGNGEREEATEPFGSIILGFFFCTQIA 
HLHRSDYEPNBDS 


11536 


25437 


A 


11636 


190 


398 


NTKKSQKAFFKTQGKPPPLDPKKKASKK 
TKFFLPKKISTRKFRTGPWRGTNFSLKK 
AKKNI,GKTRiaaFP*RPPFPSISSLFWP 
FFFPLFF*KDKVPLCHRGWSAWQSQLT 
ATSISQMY 


11537 


25438 


A 


11637 


190 


380 


TPKKWPKAFFKTKGKPSPFHPRKKALRK 
TKFFLPQKFP*NPLSPLIPaiiFWPFFFF 
FFF*KDKVPLCHRGWSAWQSQLTATSI 
SQMY 


11538 


25439 


A 


11638 


261 


91 


EKTLWASHQGTCL* SQLLGRLRWENHLN 
LGGRG*RSRHCTPAWVTERDSVSNK* IN 
K 


11539 


25440 


A 


11639 


-^ 


334 


FRANRTVKDAHSIHGTNPQYIiVEKIIRT 
RIYESKYWKEECFGLTAELWDKAMELR 
FVGGVYGGNIKPTPFLCLTLKMLQIQPE 
KDIIVEFIKNEDFK*VQCSLANIRGMY 


11540 


25441 


A 


11640 


193 


457 


DGILLCGPGWSAVAPSQLCSGTISAYCN 
LCLPGSSESPVSAP*YLGLQACAHAPLM 
FVFLVYTGTCMLPLSAHHSRAANSSHFS 
YPLL 


11541 


25442 


A 


11641 


257 


373 


NTISCFRSLRPVAHTCNPSTLGG*DGQI 
T*AQEFETCLG 


11542 


25443 


A 


11642 


211 


367 


DTTSHPFEWL*FKKSRK*RVLDRIWGN* 
MLVHCWWEHKMVQLL* KTVWWLFK 


11543 


25444 


A 


11643 


112 


367 


LQIKTIRYHHILTRMAKIQNTDTNKCYQ 
ECGATGTLIHCWWEFKIIPVIPALWEAE 
VGGLLEVRS SRPAWPTRENP IFTKI I KN 
Y*KYKMTPYIIR*LQIKTIRYHHILTRM 
AK I QNTDTNKC YQE CGATGTL I HCWWE F 
KIIPVIPALWEAEVGGLLEVRSSRPAWP 
TRENPIFTKIIKNY 


11544 


25445 


A 


11644 


310 


335 


YHLIAGSLGDIERLSML*IGKAVCHNKY 
* *GCGEPGFLLHCWWEWKMVQSL*KIVC 
QILQT 


11545 


25446 


A 


11645 


143 


455 


KRKGPSLWLLGKSMGGIRFF*KKVFFSG 
FKSLWFFFWPLWKPLERLN*IKISFYPP 
AKKKGGQRKAQFFFFFFPRSFSLAGCSG 
SISAHCNLCLLGLSDSPTS 


11546 


25447 


A 


11646 


182 


458 


SRGGIRFF*KRVFFSVIKSIWFFFWPLL 
KPLGRVN*IKISFNPPAKKRGWERKAQF 
FFFFLPRSFSLAGCSGSISAHCNLCLLG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
25 


Predicted 

beginning 

nucleotide 

location 

correspond 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidlne, I=lsoleucine, 
K=Lys!ne, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginlne, S=Serine, 
T=Threonine, V=Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possibIe 
nucleotide insertion 














LSDSPTS 


11547 


25448 


A 


11647 


319 


3 


RSTKYSPRGWEPSMWFLFLHVFSLPFRS 
ILGPPLLGTPLRGSPPPPPPFEGKGFIY 
GLTFFFP*EPPCEIF FFFFLDRVLL YC P 
GWSAWQSAHCSLELKDKVCI 


11548 


25449 


A 


11648 


165 


2 


DQSGRPRKILCAHTHTHAHMHTSLGQT * 
QDPV*THTHTHTHTHTPLYAAAQRIN 


11549 


25450 


A 


11649 


290 


3 


SKQNTTHWATLKSYIYPSQFWRLGSPSS 
RWQLDGFLVRALFLVCRQPPSLRILTWW 
EKTEERKREREK*DREREHNEIERNERE 

RERERETQRMY 


11550 


25451 




11650 


220 




TPTLPKGTGLTALDRIPGRDPGPRRAG3 
PF*aGVAAENCPDPA3GGSTRPLGAPGR 
GSAPP'RHPGPAGGRR 


11551 


23452 


A 


11631 


29 


387 


FLIFDVTIVIDLACHEPHSYKMAHLINV 
VCVLTAPPTSYFPISFLGPPySLRHNDI 
E IRPINNPTRTSQC^'SERKSHTSLTLNQ 
KLEMTTLSKDGMSKAKTG*EPGL*C*TV 
SQAVNAK 


11552 


25453 


A 


11652 


267 


2 


KFFFPVFSTSKLNFYPGTPFPLFLFSFF 
FFFFFFFMEEGFAVTWVPGAVAPGAVIjA 
HCSLELVGSGKPPISAS*GAGTTGMHYH 
TRMY 


11553 


25454 


A 


11653 


50 


411 


EAVAGGMEKSMMNLPKGPDTLCFDKDEF 
MNEDFDVDHFVSDCRNRGQLEELRDNPQ 
LYYKLLKTAMVELINKDYADFVNLSTDL 
VGMEQALNQLSVPL*QLREEV1SL*SFV 


11554 


25455 


A 


11654 


303 


403 


VRNMVEWLGVAAHACNPSTLGGQGEQIT 
*GQEF 


11555 


25456 


A 


11655 


356 


553 


SVSPCPFPGPASTLSFLVADFRRRGVDV 
SQVAWQSKGDTPSSCCIINNSNGNRTIV 
LHDT*GPRASPCY1IPPISWLNQFPHSPP 
WASQLPPSVASVYQK 


11556 


25457 


A 


11656 


442 


2 


GRPILMRPAVL*SEPRTPACVDIQQQIM 
TIIDEWKACAKVQTLSAPINSASEMQS 
IRHWYILKDSSARPAGKGAI IGFIKVG 
YKKLFVLDDREAHNEVEPLCILDFYIHE 

SVQRHGHGRELFQYMLQKERVEPHQLAI 
DRPSLV 


11557 


25458 


A 


11657 


109 


474 


DINMCGRVVLVAERM*KRKRDKYJMYEET 
E * VL* KKVRHSQLQKEWAQNAPWQNA 
AYIDQPS PAHVGQQGLSKLPSRPGDQGV 
EPQNLRTLQGRSVIESATNTTLPHMLMS 
QRVIAPNPAQLQGQRGPPKHGICRTTTP 
NMNPAWV 


11558 


25459 


A 


11658 


342 


3 


EHQLKWGEHSKSQSASSPPNDHNTSLAR 
A*KWAGAEAEIALLTEAGFGRWVIMNFA 
ELKEHWTQCKEAKVHDKTTQELIAGFE 
RNI T YLM * IiKI TTRE PHI AITS I NS RMA 


11559 


25460 


A 


11659 


204 


17 


STCTPLPTA*SQSRPHPPPTPTPPAPWR 
SSNGPLSRGSSSRNSTSSWSIALATGPW 
THRSEL 


11560 


25461 


A 


11660 


505 


514 


GS * GNHFNPDGASHGGPQDSDRTGPRGP 
CLPSPWGAQ ILRDWYPQEEMGTLGKQ 


11561 


25462 


A 


11661 


80 


374 


FLFDIYNRCYLIFINII*YLSSNFD*IF 
IIIYLVAIYRLLCLGLFRAPRQCKHPRP 
QFS FNKI GTKR * AWPGAMAHTCNPNTLG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue uf 

peptide 

sequence 


Predict- 

otide 

correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PlienyIaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, l^Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threunine,V=Valine, 
W=Tryptapliiin, Y=Tyrosine, 
X=L'nknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GLC5AQIM*GQEFET 


11562 


25463 


A 


11662 


142 


437 


GQRVQIFFFFWKRFPFFAQGGTEGAQFR 
LIKPSPSC3LKEIPPPNPPRKGE*RDGPP 
FRGDFVFLRKNGVPPCGQGGSKPPAPGE 
PPPFTPPRGENNRG 


11563 


25464 


A 


11663 


120 


353 


QALQVMLKHVNFFFFLFFLFWKQSFVPL 
PRLECNGATSVHCNLCLPG*RNFPASAQ 
EFRIAGSYDHAI CPPASGKBB 


11564 


25465 


A 


11664 


1 


404 


NTWSDLIISYFHPRPSAFFLRQGLALSP 
RLEYHGTITGHYSLDLWGSSDSPASISR 
VAGTTYRCM*FQLLGRLRREHHLSPRGR 
GCSDP*LCPYTPAWVTERDPVSKKKLRG 
GGENSLLSSQTMY 


11565 


25466 


A 


11665 


493 


3 


NIWIPDVI.PGHIiARQ*IiGSPGKLSAGF* 
PLQFPQTLSSARAPPFCLPRLQVRNTVG 
FLPSPHTSRPSHCLPACPLPSRTQPWVP 
VKPGPTACRGFLQHPPTPGSPSP*SFQG 
SSWWDTDLRALECLLRILRGHNCSVFVT 
TQGQLSHEGMNKEFLLTCAKKKK 


11566 


25467 


A 


11666 


3 


401 


CMERAVTVLLPGSATQSPPVYAPR2\LAR 
LWLTAAMMISGFIADYEA*SSRCSSACP 
AGDSLS YYHSPADS FS IMGSPVGAQDFC 
ADLGVSRMJFIPTDTDMSTSPDLQWLVQ 
PALVYSVGPSETIAPHPLGVP 


11567 


25468 


-A- 


11667 


1 


446 


LSCAKPPQRPLRHGIIKIFNVFEETRAN 
KHLLGVSKLKPRGFPALFLSVSSFGQKK 
HVPQWLQHSHSHILVISLDFFFFFFFEK 
KFPFFPPGGRERDQTPEN*TPPRGG*RE 
SPPFPSRGGGTQGWAPHCPIIFRFFQKK 
GVFPLHAR 


11568 


25469 


A 


11668 


18 


403 


DPLQRPQYRGGLRSVGCDLQSPTPRGPS 
GRWPNPAQARGPGEAQAWAWPGGGPREA 
FSGQGRPPPLGLHPHRRRGflGPPGPMGP 
*HEGQGSC* GASCRGGLACTAFPAASS P 
PIFGGLTPPNLNFPEE 


11569 


25470 


A 


11669 


157 


1 


PRPQGFVFVFFFFEMQFGRLECSGAISA 
H*1!ILHFLGSSNSPASASHVYAPLY 


11570 


25471 


A 


11670 


396 


7 


EKVGQARDKSACEGHSCMIARSCLHLRV 
DQGGQHGSHGACPGRRLRGPDTHRQSCY 
QCRCNRSPDRLQPGEAGTP*CAEAAAEP 
DAAVHPGQPAPARLPPALPGRGLLPLPH 
PGGSQRVYCVECIGSKLR 


11571 


25472 


A 


11671 


209 


3 


DDLGTHRPGKSQVRVTPREAGENPSAVL 
PCYLHLVPFV* EDEVKPEDS I PDMPGNE 
YAREFLAHAPLY 


11572 


25473 


A 


11672 


270 


101 


LSLNRWILGAYriFFETESHSVAQAGVR 
*HNLGLLQSLPPGSRRPLAFKVAFLFIG 


11573 


25474 


A 


11673 


421 


592 


SSCTPGVIIKIPCVRSNHMKLPGPGMVA 
HTCNPSTLGGQGGRITWSE*FKTRLGNM 

A 


11574 


25475 


A 


11674 


176 


6 


LTFVYMPCFCALS SLIMTYGLAM* FHFH 
SMTLFIIGVLSVALArYQ*WLDVSRVSV 


11575 


25476 


A 


11675 




418 


NTVCVCVCVCVCVCVERJCAGHRLTDSFP 
IQAGPGAQARTLCLRESTGGALSSGLPA 
SRYQDPGRGALHPHCVWTPRGWNWRLNY 
PPARS PSREAVCYFRQTLKKWRS PRGQG 
GRILRVIRTASTFRDTS*TESACLGARP 
CLDGK*VCESVPCFSLHTHTHTHTHTHT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
begmiurio 
nucleotide 
location 

ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Tlireoiiine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HCI 


11576 


25477 


A 


11676 


305 


698 


SFMGRSPRKIDQFCNSRNMVHGSVTFRD 
VAIDFSQEEWECLQPDQRTLYRDVMLEN 
YSHLISLGSSISKPDVITLLEQEKEPWV 
WSKETSRWYPGK*E*SRQGEAIIVPDS 
PCAQRGVTPLRCGLENFLQ 


11577 


25478 


A 


11677 


107 


397 


GGDGRETLRAPADDGCI 'QERAETPLAI 
SCPRSSSTCRGRARRYDGCRTRDGHQLI 
QRGQQQESSRRTAQPGSSKLCRGPGGHR 
PSLGTVGVHGCSL 


11578 


25479 


A 


11678 


514 


554 


RAKPTPP*GGGEGQNFGLLKPPVSGLPP 

LACPTPPKRWKPKPPPPAPPHi'LFF**K 
P.GL: J * PGEVFKLRNWKLPPSPPQHGGN 
KGRNQPPPRGTFMF 


11579 


25480 


A 


11679 


266 

_ _ 


3 


GPPLFLGLKN*TFFSPTKV*IFFSQVFS 
SFPQGLKGGVREIFFFFFF*DRLECNGT 
ISAHCm,RLPGSSDSQASAGTTGVCHYT 
RMY 


11580 


25481 


A 


11680 




408 


NTCALRHHI3HFSFWSDGKTKEGEERIVQ 
ENWLISWHNFSVPETLLTLSCPLGKEGM 
PGEDGTAGAGKVPGEDKIPG*DGTAGED 
GTEDENGTTGEDETAGVGKTPGAGGTTG 
EDDTESEDGTTGEDETAGKGGTAG 


11581 


25482 


A 


11681 


244 


420 


KDSESTKAYIRDNISSTKKKKSWPGAGA 
HPCNPSTLEGKGG* IT*SQEFETSLANM 
VKL 


11582 


25483 


A 


11682 


294 


443 


QMNTCTHIQPAYVH*EKyFRPSAMaHaL 
KPSTLGGRGGWIT*SQEFQTRP 


11583 


25484 


A 


11683 


346 


1 


YNTNQFTLRGTQftSVYTCLTALIVLKLL 
NQPYTLSAVLUIAGNQPFHAFPSPPSLH 
PDAFHSKI MSHI I I LLLGFI PFLPASQS 
LTLS PSLERSGTISAQCNLCLPGSSNSP 

ISA 


11584 


25485 


A 


11684 


140 


423 


SSHQASSPPTTASHSMKISVAAIPFFLL 
ITITLGTKTESSSRGPYHPSECCFTYTT 
YKIPAQRIMDYYETNSQCFKPGIVFITK 
RGHSRWTNPK 


11585 


25486 


A 


11685 


163 


3 


IKIPCKRLLKFGPPSCCGFFFPLLPPFF 
FFETLSRSVAQAGLQWRDLGSLLAL 


11586 


25487 


A 


11686 


179 


1 


PIVFLC-M]:,FVAFAFRGPFESVTKPPNFF 
FFFEMESCSVAQAGVQWRDLGSLQARPP 

RST 


11587 


25488 


A 


11687 


192 


1 


EQGFDGHLASWISLKDTAKWKNGDELCS 
YYQS CCTDYTAECKPQVTRGDVFTMPED 
EYTVYDDG 


11588 


25489 


A 


11688 


89 


3 


SGGYCCCLCCRCSRYSCCCCRLREGPTK 


11589 


25490 


A 


11689 


168 


2 


QDRLQPHESKRKLGPAPQLRRSESDTPS 
WNFQSTESQLMSKGDEDTKDDSKETV 


11590 


25491 


A 


11690 


301 


397 


HFVFSLVTGMNPLSPYLNVDPRYLVQVR 

LRFY 


11591 


25492 


A 


11691 


18 


421 


TKALQITCYLHS TMS EESDGKLI EDSL I 
QLRCHFTWKLLIEAPE I PDLENRIWEE I 
QFLDTKYNVGIHNLLAYVKHLKGQNEEA 
LVSLKKAEDLIQKEHMSIQADIRSLETWG 
NFAWViTYHMGRIAEAQTYLDKV 


11592 


25493 


A 


11692 


188 


3 


PLQGKGJj'GAKERGQKKGBIjKGGVHASRP 
RPPFGPSLFFFFFFETDSHSVAQDGVQW 
CDLGS 
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SEQID 
NO; of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515.1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 

last^amino 
acid 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11593 


25494 


A 


11693 


24 


391 


APRADAMGHFTEEDMATITSLWGQGNVE 
DAGGETLGRLLDVYPWTQRFFDSFGNLS 
SASAIMGNPKVKAHGKKGLTSLGDAIQH 
LDDLKGTFSQLSELHCDKLHVDPENFKP 
LGWGLQTALQ 


11594 


25495 


A 


11694 


138 


425 


WSGVKAAFQLNPGNPNLEGKPGPGFKTH 
PGQLGETHFLLKIHYLSGLVGKPWESQL 
LGNLNEENHWTPEROaPRDIiRHCHQIQP 
WATNGNFVLKKK 


11595 


25496 


A 


11695 


158 


838 


CGTVHSGDAGQRQHAtAPSHPCDHGBTQQ 
PILyRVLCCQLFWILCFVFSHSIMSTKC 
PSVS PVSGEMKKRKAITLEMKLKI lAQH 
EGGKPVMAIARELGLWQSTISTILRDKK 
QISDAAKSSASVKSTVITKKRAGPIDDM 
EKLLVMWMEDQIQKRIPLSLLMIQAKAR 
SLFNMLKDRASDPTYTQMFICaSHGWFQR 
FKRRHNFHNVKI TG2A.^Rj\GNEGAIAFK 
EQL 


11596 


25497 


A 


11696 


834 


1431 


SSACQGGSQGWPPRQPWWMWGWSRRAPS 
VTHRTGERCSTSGSAPLPLLLSSLCSSL 
PRSWEHSTLRQPPHPPUjPPGCLPPGRE 

aqrlsgaqhqpgprpaaasyipdtqadr 

kqqqhhpgpdqhvghreglaledaathl 

glvi\alscwll3raeagyqv?hghdhpk 
dqhpqadggqrivr.'\iglglghhvsgrr 

ARP 


11597 


25498 


A 


11697 


143 


1 


ISKERGASRFSGPWVFFFFFLESRSVAQ 
AGVQWCNLGSLQAPPPGFT 


11598 


25499 


A 


11698 


5S 


459 


KGKEEKVKRKEAEQNFSPYAQDKQERIK 
GNENDEKTKQQKETIIDIELFKGLDETG 
ENMDSTLTRTPFEPLENNKQILVLGLDG 
AGKTSVUlSLASNRVQHSVaPTLGEHAV 
CIWTEYSHMEFLEIGGSKPFRS 


11599 


25500 


A 


11699 


215 


1 


GTKBCALWAGGGGFFPYLPPMGPLGPICG 
GVGKGSPGLGCPIFFFFFFFETESCSVA 
RLDAQWPDLGSLQSP 


11600 


25501 


A 


11700 


300 


420 


KPKILFQNVFAAPHMENLKCRGETVAKE 
ISEAMKVKM1C 


11601 


25502 


A 


11701 


351 


466 


RIKNADLSQAQWFTPVIPALWEAKVGRS 
LEVRSSRPVW 


11602 


25503 


A 


11702 


172 


400 


SNRLENKIQGVFLNDSSISPFILRKQSI 
GQAWWLTPVIPALWKAEASGSPEMDSTT 
LLPSSSQVPSLVKMEKLNYS 


11603 


25504 


A 


11703 


284 


408 


ASVFSSFFFVCLFVFEMESRSFAQAGVL 
WRDLGSLQAPPPG 


11604. 


25505 


A 


11704 


20 


447 


ESQVTLNSAICVLSTVLIMEFPDLGKHC 
EEKTCKQLDFLPVKCDACKQDFCKDHFP 
YAAHKCPFAFQKDVHVPVCPLCNTPIPV 
KKGQIPDWVGDHIDRDCDSHPGKKKEK 
I FA 


11605 


25506 


A 


11705 


1 


455 


HSCSLQFTPTAWDCTGSVSSEQGERPAA 
AMKICSLTLLSFLLIAAQVLLVEGKKKV 
KNGLHSKWSEQKDTLGNTQIKQKSRPG 
NKGKFVTKDQANCRWAATEQEEGISLKD 
ECTQLDHEFSCVFAGNPTSCPKAQDERV 
YWKQVARKLRS 


11606 


25507 


A 1 11706 




428 


DAEADKMAAAAVEGGRSGGSGGCSGAGG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ JD 
NO: in 
USSN 
0?/515,l 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 

lociition 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucire, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=V aline, 
W=Tryptophan, Y=Tyrosine, 
X=Unl£nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ASNCGTGSGRSGLLDKWKIDDKPVKIDK 
WDGSAVKNSLDDSAKKVLLEKYKYVEUF 
GLIDGRLTICTISCFFAIVALIWDYMHP 
FPESKPVLALCVISYF\rMMGILTIYTSY 
RE 


11607 


25508 


A 


11707 


1 


422 


RSQAESSAAAAARASVPLRGSPGPSAIM 
PMFIVNTNVPRASVPDGFLSELTQQLAQ 
ATGKPPQYIAEHWPDQLMAFGGSSEPS 
ALCSLHSIGKIGGAQNRSySKLLSGLLA. 
ERLRISPDRVyiNYYDMNAANVGWNNST 


11608 


25509 


A 


11709 


88 


423 


AADAMKQAEEflMGAFMRIiAYQELQIDRL 
KEDKMLLNLEGNKREHRERLGMGLVSRS 
SVSHSVLSEMLVI EHETPVSAKS SRSQL 
DLFDDVGTFCSGPSKYTDNPFSLWESIG 


11609 


25510 




11710 


393 


130 


rCPSATAQSFHr.AAEKT,STI,DRPGSQAPQ 

KPHPSPKPGWLPASTNAVPTPLAEATP 
SICAHPAISLLSTE3GIF:CAVP3PASSCS 


11610 


25511 




11711 


331 


440 


KI FFLNFLI SRAWWCAPLVLATWEAEVG 
ESLYPRRS 


11611 


25512 


A 


11712 


122 


478 


SGLCPQQPPRANSCPPSSMASCAEPSEP 
SAPLPAGVPPLEDFEVLDGVEDAQGEEE 
EDEEEEEEDDLSELPPLEDMGQPPAEEA 
EQPGaiiAREFIiaaMEPEPGPSPVPKEWL 
DILOIGL 


11612 


25513 


A 


11713 


102 


2 


TNLaSTPRRPPPFFFFETVSLLAQAGVQW 
CDLGS 


11613 


25514 


A 


I17U 


126 


2 


FFAPFLKIFFFFFFSEMESCSVLQAGVQ 
WHDLGSLQNEGSQ 


11614 


25515 


A 


11715 


178 


449 


LSGOILFGl I1IJ.YCA1 IGGKLLGLIKL 
PTLPPLPSLLtayLLIiAGILITNIPVINDN 
VQIMHRWSYSLRSIALAIII.VRAGLVLE 
SKALEK 


11615 


25516 


A 


11716 


48 


417 


GSGGNHSVCCBTMEGGGGSGDKTTGVLA 
GFFGAGEAGYSHRDLAGVPLTGMNPLSP 
YLNVDPRYI-VQDTDEFILPTGANKTMGR 
FELAFFTIGGCXJMTGAAFGAMNGLRLGL 

KETQNMAWSKP 


11616 


25517 


A 


11717 


103 


2 


PKSPTQWLMAVIPALWEAKAGGSRESRS 
SRPAL 


11617 


25518 


A 


11718 


1 


413 


WPSGQVLVGCLSFSLYCWKRSLKRKNPG 
FEVLLKIFLKNHPRCHTWRMIQLTATPV 
SALADEPAHIRATGLIPFQMVSFQASLE 
DENGDMFYSQAHYRANEFAEADLNHAAS 
LGGDYLGDLALRLLCTLEPVFPQSY 


11618 


25519 


A 


11720 


116 


462 


TTQSEEEEKSGLGAKIPVTIITGYLGAG 
KTTLLNYILTEQHSKRVAVILNESGEGS 
ALEKSLAVSQGGELYEEWLELRNGCLCC 

SVK 


11619 


25520 


A 


11721 


167 


407 


EIYSLTRFIEVKMSKKISGGTWEMLGD 
EMTR 1 1 RETLKKKIiI FP Y VESHLHS YDL 
GI EMRDATNDQATKDALEAFNKP Y 


11620 


25521 




11722 


254 


423 


NQLSS IMAMFKKIKSFEWFNDPEKVYG 
SGEKVAGRVIVEVCEVTRVKAVEILACG 


11621 


25522 


a 


11723 


3 


424 


VSCDTMEGGGGSGDKTTGGLAGFFGAGG 
AGYSHADLAGVPLTGMNPLCPYLNVDPR 
YLEQDTDEFILPTGANKTRGKFELALFT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 

IISSN 

09/515,1 

26 


Predicted 
beginning 

nucleotide 

location 

correspond 

residue of 
peptide 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, l=IsoIeucine, 
I<=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutaminc, R=Argininc, S=Scrinc, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IGRCCMTGAAFGAMNGLRVGLKETQNMA 
WSKPRNVP I LDMVTRQGALWANTLGALA 


11622 


25523 


A 


11724 


2 


343 


AFGTMKWVTLVSVLFLFSSAYSRGVFRR 
DAHKSEVAHRFKDLGEENFKftLVMIAFA 
QDLQQCPFEDDRFSTSEVPEFATTCQDD 
DHFEDRCRRVHTLVWQLCTPATLLETD 
Y 


11623 


25524 


A 


11725 


1 


359 


HAFGTMKWVTVISLVFLFNSAYSRGVFR 
RDAHKSEVAHRFKDLGEENFKALVLIAF 
AHYLHQCPFEDHVKLVNDVTEFAKTCDA 
DESAENCDKSLHTLFGDKLCTVATLQET 
YGEMADC 


11624 


25525 


A 


11726 


1 


349 


QAiyiSDRKFSAPRHGSLGFLPRKRSXRHR 
GKAKSFPKDDPSKPGHLTGFLGYKAGMT 
HIVREVYRPGSKANTNEVAEAVTIVETP 
PMEVADIAGIffilETPRGLRTFNTAFAEHM 
SDEC 


11625 


25526 


A 


11727 


81 


349 


TKGS WS CCVCLCVCLCTWSHLCRLVTW 
LPDMPDDVLWLQWVTSQVFTRVLMCLLP 
ASRSQMPVS SQQAS P CTPEQDWPCWTPC 


11626 


25527 


A 


11728 


264 


388 


QADPKDIMKFPGPLENQRLSFLLEKAIT 
REAQMWKVNVRKM 


11627 


25528 


A 


11729 


2 


471 


PGCSASWSKRGSGPDMLSSMAAAGSVKA 
ALQVAEVLEAIVSCCVQPEQRQVLCTKP 
TGBVLLSRNGGRLLEALHLEHPIARMIV 
DCVSSHLKKTGDGAKTFIIFLCHLLRGL 
HAI TDREKDPLMCENIQTHGRHWKNCS R 
WKFISQAIiLTFQTQIL 


11628 


25529 


A 


11730 


160 


377 


lqgf6rpsvyhaaivifleffawglltt 
pmltvsiaelglcfvrerdkflgtyhcv 
cld'tclgvaldsdlkq 


11629 


25530 


A 


11731 


45 


438 


KLGQRKBATVTKBSCKSCSRKVGSPDRF 
RSPQKRSKSRQDCFTLFCFWKVLDKNME 
LIS PTVI 1 1 liGCIiALFLLLQRKKLRRP P 
CIKGWIPWIGVGFEFGKAPLEFIEKARI 
KVCGRGRRGLQRRQCFLF 


11630 


25531 


A 


11732 


169 


292 


DSILLVNLACSAVISAHCNLRLPGSSDS 
PASASRAAGGAHL 


11631 


25532 


A 


11733 


473 


600 


KFWEKRTEGRGKSKSKNSETGE IVS ISA 
LSTTEVAMHTSTSS 


11632 


25533 


A 


11734 


19 


349 


APSPDAMGHFTEEDKATITSLWGTVNVE 
DAGGETLGRLLGDYPWTQRFFDS FGNLT 
SASAIMGNPKVKAHGTKVLTSLGDAIKH 
LDDLKGTFAQLSELHCDICLLVDPEWF 


11633 


25534 


A 


11735 


234 


359 


FYHLKSFTVSSVQSRWLTPVIPALWEAE 
AGGSPEVKSSRPAS 


. 11634 


25535 


"A"" 


11736 


15 


372 


KLPLKALTGEEKTHINIDI IGHVHSVKS 
TTTGHLI YKSRGI DERTIE I FEKEAAEM 
GKGSFKYACILDKLKAERERGITIDISM 
RKFETSKYYVTIIDAPGHRDFIKDMTTG 
TSHADCA 


11635 


25536 


A 


11737 


1 


357 


IWKAflMASPAGSWARPPRPMREPQTLAM 
PTNAAEDQKLKLERLMKSPDLAVTl PEK 
MS E WS PG P P P E FDRDVMGS RAGAAS GE F 
HVYRLLRRREYQRQDYMDAMAEKRI LDA 
EFQRRLE 


11636 


25537 


A 


11738 


24 


335 


APNANAMGHYTEEDKATI TS PWGKGNVE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beyinnina 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
1 nd 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A==Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Phenylalanlne, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonlne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=pa$$ible 
nucleotide insertion 














DAGGKTLGRLLDVYPWTHRFFDRFGNLS 
SDSAIMGNPTDKaHGQKVLTSLGDATKH 
LDDLKGTFAQLSELHCDKLH 


11637 


25338 


A 


11739 


141 


335 


MQFLPCIPILKSLEKSVASHSQTVHSDI 
IS TVEFNHTGELLS TGDKGGRWI FQRE 
QESKNQVHR 


11638 


25539 


A 


11740 


7 


337 


APSPDAMGHFAEEDPCATITSLWCKVNVE 
DAGGETLGRLLAVYPWTQRFFDSFGNLT 
SDSAIMGNRKVHAHGTKVLT3LGDAIKH 
LDDLKGTFAQLTELHCDKLHVDPENF 


11639 


25540 


A 


11741 


182 


360 


SRHSISPPVNQIQLGASVTEELTWTKT 
ARVSRAQWLMPVIPALVSIEAEAGESPEVR 
WSR 


11640 


25541 


A 


11742 


174 


-J 


HFSDHSLFVCVRQDLTLLPRLECSSLIT 
AHCSIiHLLGSGDPPTAASWVSGNTGVHY 
HA 


11641 


25542 


A 


11743 


75 


218 


Kl ILGRAQWLMTVl PALWEAKVGRS PVS 
ASQSAWDYREEPPCPSCTI 


11642 


25543 


A 


11744 


I6S 


2 


LAPLWSLGPVLGGVGQGSPGPGFPFFFF 
FFFFETESCSVARLECSGPILAHCSLR 


11643 


25544 


A 


11745 


83 


2 


RBfiRFLLPFVQLCPAPR(3<3VYRGRQAS 


11644 


25545 


A 


11746 


156 


3 


FHSGSGRVBNPSFFFSFFBTGSCSVTQD ' 
GECTGATLAHCDLCFLGSSNSS 


11645 


25546 


A 


11747 




330 


KDRAQWVTPVIPALWAAKAGRSLEVRSS 
R 


11646 


25547 


A 


11748 


201 


1 


TSQPLIRITLSTFFFFSRDGGLTMLPRL 
DPELPGSSNP PS CSASRVDGTTGMCHHT 
RLIFFSQTDKK 


11647 ■ 


25548 


A 


11749 


262 


3 


VYTSLTPPPISVNLLTQVSPRSVSYEQN 
ILCLLFVTYFTCTFFVCLFVCLFVCFFK 
MESCSVAQAGVRWRDLGLLQAPPPGFTY 
TT 


1164S 


25549 


A 


11750 


225 


3 


PI INFSVPQFLHLYNGII lESTPPSCCI 
LKCQSLGQAQWFTPVITTLWEAEAGRSL 
EAWTLKTILANMAKPHL 


11649 


25550 


A 


11751 


185 


2 


VSTPFNSFPSPPWDLVFLKGFFFFFFFF 
FFETEFCSSPRLBRNGAPLAHCNLRLPR 

SSNSP 


11650 


25551 


A 


11752 


261 


336 


DRVSLWSPRLECNGAISSHCNLRLP 


11651 


25552 


A 


11753 


282 


381 


TDLLYKKKCGLGAVAHAYNPSALGGQGG 
RITCS 


11652 


25553 


A 


11754 


202 


45 


NPRKVKLQWALILPLPFNLGGQLKSRFQ 


11653 


25554 


A 


11755 


293 


357 


LT PVI PALWEAEVGGS PEVES 


11654 


25555 


A 


11756 


264 


1 


TLPINWCCDHSTGPSLISLPLLSLPYS 
LRHNNIEIRLINHPTTACKYSCLKKSHK 
SLTLDQKLEMIKLGEEGLLKAQIGQKLG 

LLHQ 


11655 


25556 


A 


11757 


126 


1 


ILPGFLKELKIELPFNPAILPLGIYPKE 

KKLLYQKDTCTHMP 


11656 


25557 


A 


11758 


399 


226 


lEKGTGRVWWLTPMIPALSEAKAGGSPE 
VRSPRPACIGLLKFWYYRCEPQHPARLS 

EI 


11657 


25558 


A 


11759 


235 


380 


DTTVLKLGQL ITLQWASKCSSERKSRTS 
LTLTQKLEMVKLS EEGMS KBD 


11658 


25559 


A 


11760 


249 


357 


WEASGQTWCPTPIMPALWEVEAGGSLEP 
RSLRLTWA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, r=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutaniine, R=Arginine, S=Serinc, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possibIe 
nucleotide insertion 


11659 


25560 


A 


11761 


168 


372 


KLHCLLCFYLSEFFVFCFLETRSLSVAQ 
SKCSGTVITHCSLKLLCSSDPATSDSRV 
AETTGTGYHAGL 


11660 


25561 


A 


11762 


303 


377 


QFAGCRWT,MPVI PAIjWEAKADGSPE 


11661 


25562 


A 


11763 


135 




ATTPGLFFIFYFFETESHS lAQAGVQCM 
ISVHCNLCLPGSSDSPV 


11662 


25563 


A 


11764 


219 


2 


KIiRS SQLSEPEVRPDSTLSMLMHAEflHT 
HTHTHPHLFLPHPCSRPHTSTPTCMPIV 
TESPIYLKFHSHTYIL 


11663 


25564 


A 


11765 


102 


1 


NRPIHNPTMASMCSSERKSHKSLTLNQK 


11664 


25565 




11766 


245 


3 


PGEPEEDQLVKFRPSLLWVFMFLFFLRR 
SLALSPRLECSGVECSGVISAHCNLCLT 
GFKQFSCLSLLSGWDYRHVPPCPS 


11665 


25566 




11767 


350 


97 


GSVPASPQVTRPTLMTSLfiDKWFSLCLT 
SLRVNFGVALILSGSHSVSQDTLDLLTL 
FSTHLGLPKCWDYKRVPPCSAQFLVTFL 


11666 


25567 


A 


11768 


188 


391 


LGSVAGDLLCFFGGVIFPCBEMFPMSLC 
YYLYIWYNSCVFLFFKTEPHFVTHSGVH 
WCDGLLQPPPPN 


11667 


25568 




11769 


131 


2 


ytlyipnkspieniGLSwaqWlvpvipa 
lcqaetgelleprss 


11668 


25569 


A 


11770 


165 


1 


VLIPLIHLCIYIYIYTYICVYIYTRIYT 
HICIHTHVYIHIYVYIHTYIYXYMYTY 


11669 


25570 


A 


11771 


72 


1 


ILLFFFFFEAESCSVAQAGMQWCN 


11670 


25571 


A 


11772 


121 


3 


tsfvlfcfetefcscccpglecngaila 
hcnlrlpgssd 


11671 


25572 


A 


11773 


152 


3 


HVCLNLTLPFLFEKNIFSLCVCVCVCVC 
VCWLCKKKKCEKDFYFERTF 


11672 


25573 


A 


11774 


190 


2 


gfsprqrgaprvppplagpfppifffff 
ffemwshsvapagvqwcnhsslaaltsp 

DSGDPP 


11673 


25574 


A 


11775 


287 


1 


gahrrklfitpbeslrdkqiagfehrgg 
ekkktlfykkkkkrkkkrkekkgkekkm 
lpkcnytmknhqvpys irptrecadlrv 
fyfikfqilky 


11674 


25575 


A 


11776 


121 


3 


kcasrdlskffffffletesrsyaqaqv 
qwcdlgslqal 


11675 


25576 


A 


11777 


142 




ektlhvrntihnsegslvikihgrlgw 
ahacnpstlggrggqitrs 


11676 


25577 




11778 


179 


3 


shqpvpgtldlprgpqklqstseaesea 
smseassedlvppleagaapyreeeeaa 

XK 


11677 


25578 


A 


11779 


147 


6 


ktpglkknsnffffffffetrshsiara 
gvrgcdlsslqppplglk 


11678 


25579 




11780 


184 


3 


gfgfyiipnyrlfffslligpffpliff 
etepcsvarlecsgvisahcnirllgsc 
dspa 


11679 


25580 


A 


11781 


120 


319 


vkslakflvnyqlhrqtcsgqagrgalr 
qrfgrprqvdhlrpgvrdqpgqhgetps 
llevrklssg 


11680 


25581 


A 


11782 




3 


yiclsliythththththththiyiyvc 


11681 


25582 




11783 


221 


332 


srlgmvdhprsgvrdhpgqhgetpsllk 
iqklarrsg 


11682 


25583 


A 


11784 


243 


379 


lkysvpiknckpwpgtwdhtrnpstlgg 
qggwitldqkfetslan 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ED 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 

correspond 

residue of 
peptide 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glj'cine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11683 


25584 


A 


11785 


327 


5 


GRVDSQPIYPKNPPGSFPKIYPGSFKTL 
FPYLQSRHSFSIPRSKLMFHQAKPLFC 
SKPSQSLSPHSELKKKDLSQAHWLTPVI 
RALIVESKAGRSPEVRSSRPTWPT 


11684 


25585 


A 


11786 


245 


359 


FKGMDRLRSGVRDQPGQHSKTPSVLKIQ 
ICLAGHGGMRL 


11685 


25586' 


A 


11787 


356 


67 


RDIAS ITARLRYDRGVGITRLEVLSKSD 
EYTKGSVLDFSRETEPIGYIYIYIYIYI 
YIYIYLHSLPLNHMEISLGHPPLSNNKF 
VQNFDSPKLSYH 


11686 


25587 


A 


11788 


103 


3 


KYI IPVCIFFFFETESCSVTQSGVQWCE 
LRRRG 


11687 


25588 


A 


11789 


136 


1 


VHQFTIRILNYYYYYFFETESYSVTQAG 
VRWCNLGSLQPLPPGFK 


11688 


25589 


A 


11790 


291 


163 


sllllprlecmstisahrnprLlgssds 
pasgrsleprssklq 


11689 


25590 


A 


11791 


107 


2 


I FMYVFFETESRSVAQAGVQWRYLGSVQ 
APPSGFS 


11690 


25391 


A 


11792 


329 


3 


KNPRNIIGKSRGPYITNFPGDKGPKNSS 
PRGNDLS INKNWAQAFQPRGQKKLFFPK 
KKKKKETQSRSIPRLECNGPTIAQCNLK 
LLASSNPPTSASHSAAIISMSHSG 


11691 


25592 


A 


11793 


213 


3 


ISPWLFLLLQATLFRSQYPGPFPPFSLS 
PYLFPFKQKLGSGFFFFFLFFFKTESRF 
LARLECSGIITAQ 


11692 


25593 


A 


11794 


1 


400 


KRAAPQPAPEQRDLKKKKKKPPLAPSSL 
FRFME KLLP SFQFRQPRGPFLGTRGVSF 
AFHRRRFWQYGNMGEKQTWGNPGSSSNP 
PPGTAGSLSGNRGPGWGGKIADRFWLYA 
RDSFGHSRGLKGS CCQAHLGK 


11693 


25594 




11795 


3 


769 


RKEQTRNARAEVLRQAKANFEKEERRKE 
LKRLRGEDTWMLPDVNERIEQFSQEHSV 
KKKKKKDKHSKKAKKEKKKKSKKQKYEK 
miESSDSSSSSEDEWVEAVPSQTPDKEK 
AWKVKDEKSGKDDTQ I IKRDEWMTVDFM 
SVKTVSSSSLKAEKETMRKIEQEKNQAL 
EQSMEIFQSKLEDAEKAASTKKDYRRER 
WRKPTYSDKAQNCQESRESDLVKYGFCS 
RDRYATTDTAKNSHNEKFIGDEKDKRPG 
SLE 


11694 


25595 


A 


11796 


110 ■ 


13 


HTHTHTHTHKHWitHTHTHKDRSGKIKC 

VP TV 


11695 


25596 


A 


11797 


184 


2 


SEIFGKFHVFYILTATHFMPPTQVYVERD 
VLGQEQLLTPVIPALWKAKAGRSPEVRS 
SRPA 


11696 


25597 


A 


11798 


83 


387 


GERRRRRRRRLWAPLQKKKKKKKKKKKK 
KKKKKKKKKKKGGAPLKKPPGGPHFSGG 
RQKNIPPLKGGELKRAPAGDFKNPGRGK 
lARGGFFEKNLSWGGEK 


11697 


25598 


A 


11799, 


153 


2 


IPWEDTTVYSGYPQFLFSHSFFYFYFFE 
TGSCSLTQTGVQWCNHGSLQPR 


11698 


25599 


A 


11800 


93 


3 


VWGRAWWLTSVISALWEAKVGESPEVRS 
S 


11699 


25600 


A 


11801 


115 


1 


NPLFFFFFFFEMESCSVAQAGVQWCNLG 
SLQPLPPRFK 


11700 


25601 


A 


11802 


295 


!37 


RCFIFILFINKLYFVTLFIYFFEMESHT 
VAQAGVQWRDLGSLQAPPPNKLYF 


11701 


25602 


A 


11803 


287 




MGPHAPFKGTCGGFQDFRFKNLEISAGL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=AsparticAcid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K^Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














THSPLLAQNFRVGAVPSPRYSPRFGSKG 
GGGFFFFFFFFETESHSVAQQAGVQWCD 
LGSLQALPPGF 


11702 


25603 


A 


11804 


202 


3 


WSVCCFKQGLGLSPRLECNGAPGFKQSF 
PLSLLSNWGYRGPPPRLNFFFFFFEMES 
CSLAQAGVQR 


11703 


25604 


A 


11805 


3 


315 


FMLLI LTLFLLRNDRLVQCDVRS S WCL 
GFLLGWSVI LYPLMaRFMPTMWI FLF I L 
FSMLFFYWFLFLLFFLLFLYSFPLFCF 
FFCFYLFRFFFFFLIFFSSP 


11704 


25605 


A 


11806 


126 


3 










DLGSLQALPPGV 


11705 


25606 


A 


11807 


130 


2 


QSIKMTCSSLYFFIKKNVAWWLTPWPA 
LWKAGAGGSHEPKS 


11705 


25607 


A 


11808 


149 


1 


GGKKAILFFFFSLSPRLECSGAIPTHCK 
LCLPGSRHSPASAFRWGTAS 


11707 


25608 


A 


11809 


258 


3 


KYYYQGNRQIKFVKaKSNKNLPSYSQKA 
SWFSSRNIRMMGQHQPMHLDNPTSFIKK 
LGQAWWLRPVTPALWEAEAGGSLESRSL 


11708 


25609 


A 


11810 


153 


285 


CASPIRSHQKLNKNGWVQWLMPVI PVPW 
EAETGGSLQLRSSRLA 


11709 


25610 


A 


11811 


160 


3 


NHLFLLLNRYKSIFIGRVQWLMPVSSAF 
WEAKAGRSFEPRSSRPAGQHGKTL 


11710 


25611 


A 


11812 


89 




AFFFFFETVSCSVAQGGVQLCHLHSLQL 

N 


11711 


25612 


A 


11813 


254 


336 


LLGQPQWLTPVlPAIiWEAEAGRPSEVR 


11712 


25613 


A 


11814 


108 


2 


CVGVLGFFVCLFFETESCSVAQAGMQWH 
DLLGSLH 


11713 


25614 


A 


11815 


166 


289 


SHSGWNAVWRDGSLQPLTPGFKRFPCLR 
VPSSWDHKCAPPC 


11714 


25615 


A 


11816 


192 


2 


DKDPGKVTRPFLLLC I FYLF I CLFLRRT 
IjAVSPRLEFSGRGCSEPRSHRCTPSWVT 
EQDSCRK 


11715 


25616 




11817 


228 


343 


LLQYS IQNEDAGWAWWLTPVI PTLWEAK 
AGRSLEVRSL 


11716 


25617 




11818 


322 


407 


VLRMLLHCLRECKLVQPLWKTVWQFLKD 


11717 


25618 


A 


11819 


119 




WEKIVRGKKCQKENTHTHTHTHTHTHTH 
THTYRESKRERLV 


11718 


25619 


A 


11820 


269 


1 


FVQVFYI LWTQSFCLTCCQWFLLGYHL3 
SVSNRFMRSGKSTFFFMTSEFFFFEMES 
RCVTQARVLECSGSISAHCKLHLPGPRH 

SPALV 


11719 


25620 


A 


11821 


217 ' 


389 


EHIETVYGKLGISEHWGKGGLFNKWLTI 
WKKDYSWQWWLMPVI PALWEAEVGGSRV 
H 


11720 


25621 


A 


11822 


188 


1 


GSHMPCRVISSVHESMNEFPAVPTSYPA 
NPQPRERAWRNQREKEDKKERSQRSVGR 
VQAGLV 


11721 


25622 


A 


11824 


126 


3 


KLQGQVQWLTPVIPAPWEAEAGRSPEVR 
SSRPASTWHKLV 


11722 




A 




224 


3 


ALIIKKEVSAMELEGSNRIQAYGVQSIT 
GARDYHAAASRVPAI KGTHHHARVI FVF 
LVETGFHHVGQAGLDSC 


11723 


25624 


A 


11826 


3 


364 


HELPEPLRVLWTAHRVAMAPGSRTSLLL 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLQAHRAHQLAIDTYHEFEETYI pkdqk 

ysflhdsqtyfcfsdsiptpcnmeethq 



1257 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














KSNLELLR 


11724 


25625 


.A 


11827 


2 


376 


ARELPEPITVLRTAHLKaMAPGSRTYLL 
LAFALLCLPWLQEAGAAQTDTLYMLFDH 
AMLQAHRAHQLAIDTYHEYDETYI PKDH 
KYSILHDYQTSFCFSNSITTPYNKEETQ 
QKSNLELLRISLL 


11725 


25626 


A 


11828 


288 


3 


IHTKNPSVHHHHQRPKVDKTTKMGKKQS 
RKTGNSKKWSASPPPKEHSSSPATEQSW 
T3NDFDELREEGFRRSNYSELQEEIQTK 
GKEVENFEKN 


11726 


25627 


A 


11829 


107 


1 


KRSGVRLQRGLVRWLTPVIPTLWE2VEVG 
GSFEVHC 


11727 


25628 


A 


11830 


220 


354 


QGNKSFKNIIAPWLGTVAHACNPSTLGG 
QGGRITGHEIETILAWM 


11728 


25629 


A 


11831 


273 


352 


ENLKTGQIQWLTPVIPALWEAKAGRS 


11729 


25630 


A 


11832 


239 


488 


SQTHCSGKTENSHGTNSDRNQVPTAIiHK 
RPSSPARFLTEAVPVPPVDYLQSVAAFA 
VSAVASQWERTGKPFNPLLGETYELIR 


11730 


25631 


A 


11833 


189 


294 


DQTRWLTPVIPALWGAKASGSPEVRSSL 
HFPLLF 


11731 


25632 




11834 




332 


GTSPEPLTVLWTCSDLLftMAPGSRTSLLL 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLQDHRAHHIiAlDTYHEFEETYIPKDQK 
YSFLHDSQTFFCFSDSIPTPSNMEET 


11732 


25633 


A 


11835 


330 


175 


PRELAQLGSPNVRVSRVKPRGQPQMFFF 
FFWRQESCSVAQAGVQWHEPEQQE 


11733 


25634 


A 


11836 


96 


1 


VFVETGSRSVAQDGGQWCDLSSLQPQPP 
RPEA 


11734 


25635 


A 


11837 


267 


334 


SWVQWLTPVILALWEAEAGGSP 


11735 


25636 


A 


11838 


180 


2 


SVLEKKEKNLYKNLFTYKLLKKVQGKTQ 
CEGRAQI CSVCVCVCVSVCTCVHVCAYV 
SSC 


11736 


25637 


A 


11839 


183 


2 


AAPLTSSAPQAPGWKPHLAPNSPFKNFT 
GRGQAQWLMPVIPALWEAKVGRSPEVRS 
LSSC 


11737 


25638 


A 


11840 


54 


330 


DPNGQLPEPLKGLWTAHLVAMAPGSRTS 
LLLAFGLLCLPWLQEAGAVQTVPLSRLF 
DHAMLQAHRAHQLAIDTYHEFEEPYIPQ 
DQKNSFLD 


11738 


25639 


A 


11841 


117 


344 


IQYVKQINETEMRNNESYLNHTSLTITI 
HTLCLMGSYLEHFQNQCKGRARRLMPVI 
PALWE AKAGRS PDVRS S KPA 


11739 


25640 


A 


11842 


750 


968 


RAQGPRWILKIPFSSPFRLFPLIPLVFL 
YNSPPLFPPFPRTNATNNSPFPPPLPPP 
LPFSPPPPPSPSRPPPS 


11740 


25641 


A 


11843 


757 


1000 


WELWYCCLFSPFFPLPSPPLFPPPPP 
SPPHSFPPPSPFRPPLPLAPTRPPPPFS 
LFQAPPLLPSFCSPPPPPPPSLSFL 


11741 


25642 


A 


11844 


278 


361 


TLLFFFFFFEMESHFVARAEVQWRDLGS 


11742 


25643 


A 


11845 


260 


2 


GS CLLEGECLTNRKDIHTKNPS VHHHHQR 
PKVDKTTKMGKKQSRKTGNSKKQSTSPP 
PBCERSSSPATEQSWMENDFDELREEGFT 

RA 


11743 


25644 


A 


11846 


194 


2 


TKFAKEPSPPPCWQEVFKTSPLGFFFCP 
RSFPVNGFPRLKNLFFFFETESCSVAQA 
GVQWCARA 


11744 


25645 


A 


11847 


219 




SLGHTLKQSESSLSFSPGGTPLLEFKPS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amilio IcUi 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptDphan, Y=TyrDsine, 
X=Unl<nown, *=Stop codon, /=possible 
nucleotide deletion, V'^possible 
nucleotide insertion 














VYDPSVPPFQVSKGGCVGFFFFFQTESC 
SVAQAGVQWHDLGSPRA 


11745 


25646 


A 


11848 


126 


3 


RFFFKGLFLHFFFFFFFFCS'ETESHSVA 
QAGVQWCDLGLV 


11746 


25647 


A 


11849 


216 


339 


KCFKVGICVyKLNRETFYLAQNFFDRNWA 
TQKMWKTLLELI 


11747 


25648 


A 


11850 


190 


334 


VSYHIEMSFFE-ZYNKKIHDLIjVCKDENG 
QRKQPVP.LKQFIICFEFLFL 


11748 


25649 


A 


11851 


198 


352 


SRQGTWGIHGLECFFVCLFVLFFETESH 
FVPRLECSGAISAHCTLHLPGEL 


11749 


25650 


A 


11852 


371 


1 


PPKKLGIQVFTPFAPRAFWPVLVFKKKQ 
FPQSNLVLP6FFSKVSIQGVPPFCRKPF 
SGWGEAFQIGFFIWRGKKFFFFPLGFFL 
KKVFLKFLAKFFFFFFFFFETESHSVAQ 
AGVQWRSLGLV 


11750 


25651 


A 


11853 


176 


3 


KKGFPLDWNSMVKNIIVKFGEDLQISKT 
LHTFFFLTESRSVAQAGMQWCaDLGSLHS 
C 


11751 


25652 




11854 


281 


375 


QLTFKKYFLGWARWLTPVIPALWEAKAG 
RSP 


11752 


25653 


A 


11855 


146 


356 


KCGALIAEIEVPLFSELRDFLLAYSRMT 
ELCFKRCVPSLHHRALDAVDDDCLHSCS 
CTDYPINRSLMNAY 


11753 


25654 


A 


11856 


136 


1 


LSILCEFFKWLIEMEYRSVSQAGVQWCD 
LSSLKPPPPELQQCHSC 


11754 


25655 


A 


11857 


277 


361 


IVLRGAMWLTPVIPTLWEAEEGGSPEVR 


11755 


25656 


A 


11858 


295 


152 


WFGFGFFETESHFVAQPGVQWCNLGSL 

CSLRLPGS SNS PASAFQVA 


11756 


25657 


A 


11859 


1 


342 


GTRLPEPLTVLWTAHLGAMAPGSRTSLL 
LAFALLCLPWLQEAAAVQTAPVSRLFDH 
AMLQAHRAHQLAI DT YQE SE ET Y I PKDH 
KYSLLHDSQTCFRFSDSIPTPYNMEDTQ 
HK 


11757 


25658 


A 


11860 


244 


332 


TIEMMLDIKQIQVI FLFEPKMGRKIAET 
T 


11758 


25659 


A 


11861 


1 


339 


GTRWTICQVQLHaYAHFLYFFEMEaRS 
VAQAGVRWCDLGSLQPPPPGSSSSSSSS 

S 


11759 


25660 


A 


11862 


311 


394 


GMIiGAVAHACMPSTLGGLGGWI TCSQEF 


11760 


25661 




11863 


130 


1 


VPPSVRTFFFFFFETESRSVSQAGVQWH 
YLGLLQAPPPGSTSC 


11761 


25662 


A 


11864 


3 


320 


IMMYALFLLSVGLVMGFVGFSSKPSPIY 
GGLVLIVSGWGCVIILNFGGGYMGLIV 
FL I YLGGMMWFGYTTAMAl E E YP EAWG 
SGVEVLVSVLVGLAl/IEVGLVLW 


11762 


25663 


A 


11866 


165 


309 


GL I LLPMLECSDRI SAHCSL YLLGS YDP 
SDSGSHVAGTTGTCHYAWLR 


11763 


25664 


A 


11867 


208 


314 


GSRNKLSGQAQWLTPVIPALWEAEVGGS 
PEIRSSR 


11764 


25665 


A 


11868 


102 


1 


KKFFFFFETESHSVAQARVQCNLGSLQA 

PFPRFT 


11765 


25666 


A 


11869 


126 


3 


JINALQLHSSYCKKPFFFFLKTESRSLTK 
AGVQWCDLGLLQ 


11766 


25667 


A 


1 1 870 


1 


273 


KQLPWFLNm/RLELTGLLCTLNLSKPC 
MIFIILVIVKYWFLLFCNIFKNHIFSQA 
QWPTPVI FALWSADMGGSHGYWITTIVD 
FMCATLT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V^possible 
nucleotide insertion 


11767 


25668 


A 


11871 


126 


2 


FPKALLWFFFFFFFFETESRSVAQARV 
QWHNPGSLQCPLR 


11768 


25669 


A 


11873 


269 


9 


QVSGKFWPDIFYKTGEKELQCPNFPQGH 
PPRGLKKKSFSKKKKKKKKKSIAVGQAQ 
WLIFVIPALWEAKVGRSPEVKSSRPACP 

TW 


11769 


25670 


A 


11874 


114 


1 


LEAKPGVFQSGVLLVl FFFFETESHSVS 
QAGVRWCDLG 


11770 


25671 


A 


11875 


146 


^ 


GNHLSSRVGVQDQPRQHGKTLSLQKIQK 
VAGPGGAHLQSQEVQVGGSL 


11771 


25672 


A 


11876 


2 


159 


SLQFFFFFFRWSFVLVAQAGVQWHDLGS 
PQPPPPGFKHSPASASQVAGRGGR 


11772 


25673 


A 


11877 


124 


240 


FRAFSSGQAVWLTPVIPALWEAETGGSL 
EARSSRPDWRT 


1 1 773 


25674 


A 


11878 


135 


5 


QVLPYPYFSDSFTLSPRLECSGVTLPHC 
NLCLPGSSDSCASAS 


11774 


25675 


A 


11879 


170 


2 


GFFFPEKWGYKNS FLCFFFFFFFEPEFC 
SVAQAGVQWLHLGSLQPPPPVFKVDRAP 


11775 


25676 


A 


11880 


97 


267 


GHGHATLRGLCVLSLFFHIPAPSVSGTS 
DAEECCLCVIYRLICGYISRNCLYFSDH 
R 


11776 


25677 


A 


11881 


1 


292 


LPEPLRVLWTAHLQAMAPRSRTTLLLAF 
AliLCLPWLQEAGADQTVPLSRLFDHTiML 
HAHRAHQLAIDTYQEPEQTYI PEDQKYS 
FLHDSQTYFCFSD 


11777 


25678 


A 


11882 


93 


2 


KYQMGWAWWLTPVIIALWEAEAGRSPET 
SC 


11778 


25679 


A 


11883 


145 


2 


PPLGLRLQVQAPTPGFFFFFFETESRSP 
RLKCKGAILAHCNVCLLLV 


11779 


25680 • 


A 


11884 


202 




TWWRWGVTVLVRLVMNCRPCDRHKSASQ 
LIGRVRQENGLNaRVGGCSEPRSRHCTP 

VWVTSENPSSC 


11780 


25681 


A 


11885 


42 


155 


GERSGLS PGVQDEPGQHSGTSSLQKILK 

LAGHGGTCP 


11781 


25682 


A 


11886 


215 




STMARHCPLSPMLFNSAMEVLVRAISQE 
KEIQGIQIGKEEVQLSLFTDDMIFNLEK 
RKDCSKNLLQLMNLV 


11782 


25683 


A 


11887 


214 




GYFFGLNEVLGKLE KPSLKVPQNPS PKR 
PFFLGFFFFETESRIVARLECSGAISAH 
FWLCLPGSSDSPVSC 


11783 


25684 


A 


"Tim— 


215 


3 


WGPGFPFFSFFFFFLRQGLAVIRLKCSG 
T I TAHC S LNFLGS SD PPAS VS LVATTTG 
HEPSITQFHSHGSC 


11784 


25685 


A 


11889 


118 


1 


RFFIMGENPTKFFFFFETESLLPRLECS 
GVISAHCNLSC 


11785 


25686 


A 


11890 


267 


83 


HCLRSGVQDQPGQHGKNPS IQKI QKLAT 
SFKKSLKIVIPPLPFKNDKTDSKSSCSS 
LSPHS 


11786 


25687 


A 


11891 


134 


2 


DRLAVLPRLECSGMIFLLPLPSRFKRFS 
CLSLPSSWDYRGAPRA 


11787 


25688 


A 


11892 


116 


1 


SKGVGHFLFFFFFFETESRSVAKTGVQW 
CDLGSLLCLV 


11788 


25689 


A 


11893 


397 


475 


HFVCSTIKVLRDLSSDRSNPGRFLST 


11789 


25690 


A 


11894 


115 


331 


KNVCLFVFVEMKNHLNPGDEGCSKPRSR 
HCTPSWAAQQDSISKYICIYRYLYLDIY 
LSBOiWRLRINQLPNS 


11790 


25691 


A 


11895 


134 


251 


INPPVSRKKKKKKKKKKKKKKKKKKKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

nucleotide 
location 

ingto first 

peptide"' 
sequence 


Predict- 
ed end 

otide 

correspon 

last amino 
acid 

residue of 


Amino acid sequence (A=Alanlne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamlne, R=Arglnine, S=Serine, 
T^Threonine, V=VaIlne, 
W=Trypfophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKGGGL 


11791 


25692 


A 


11896 


186 


402 


PGKNLTLENVPRENKVGEKAPGQNEPPV 
LGGGEYQDPGGNVKGVGPPPAPGFGKDG 
PKRLADTFDLIEGDGD 


11792 


25693 


A 


11897 


100 


2 


KGPPFFFFFFFFFFFFFFFGQSGQVKLK 
SPKCK 


11793 


25694 


A 


11898 


92 


3 


SVARIMPYALFLLSVGLVMGFVGFSSKP 
S 


11794 


25695 


A 


11899 


283 


362 


MTYTMTIVHALWASVCLLLNHMmPLD 


11795 


25696 


A 


11900 


248 


1 


VMSAQPGLSVILRFMGKWSNWTGMQYTE 
SEVERYRDRGRARETERQRDRERERENP 
RNRKLFFYGRFKHSPQDEFMPQKVHF 


11796 


25697 


A 


11901 


253 


2 


KLQENPFKLLNFI INLCSVSUilVPI ILN 
INIKPSFIAPKPRCPSKFQRNYIYIYPS 
lYLSIYLSfYLSIYLPITISKVLKDVE 


11797 


25698 


A 


11902 


179 


3 


LAVTKNEIMGFSTTWMEMGDINFFFFFF 
ETKSHPVMRLECSGVISAHCNLRILVSS 


11798 


25699 


A 


11903 


1 


403 


GTSSQESFGGCCVSGLIAMGTKAQGERK 
LLCLFILAILLCSLM.GSVTVHSSEPEV 
IIPENNPVKLICAFYDLSFFSSSFSYFF 
HLSFFIYYPYYLFVSCFTFYIVILLSFI 
YSHHLSSVSYYSYVLLIYTFIL 


11799 


25700 


A 


11904 


37 


242 


KBPTRDQLQHPKftRLPAPLRVLWTAHLA 
AMAAGSRTSLLLAFJiLLCLPWLIYDGAS 
HIVSLFMLFIIF 


11800 


25701 


A 


11905 


-i 


298 


GTSQHRGRKDSRTGSHSSSDHPGAKLLS 
TEEKQAAETMRPPSAPPRRGCI PWQGLL 
LSSS 


11801 


25702 


A 


11906 


225 


30 


AC YLQKDGAFAI LPDVLPDMFKGIIS S LY 
PCQHAECDHIKNIYNCVCVCVCVCVCVC 
IVICKT,N\f 


11802 


25703 


A 


11907 


196 


320 


HSSSSTTPSQDTHTHTHTHTHTHTHTHT 
HKIPQRELLPSVPD 


11803 


25704 


A 


11908 


130 


3 


NGFVFSPFPFFFFFEMESHSVTQMVQW 
CDLGSLQSLPPGFK 


11804 


25705 


A 


11909 


381 


2 


ASICGQKLIFFQHWKKI ILVQQVGLQKK 
NSTLKRAWEFLSFFQPAPPLGSPLKGIN 
ILNKGAKKLKGKTNFPGPIVFFFSVLPP 
FFGLKKFQKKVFHQGKSPFFFFFFFETE 
SRSVAQAGVQWHDL 


11805 


25706 


A 


11910 


138 


2 


SVPLKEFIISQARHLTPIIPALWEAEAG 
RSPEVRSSRPASTWRNP 


11806 


25707 


A 


11911 


231 


54 


IRASFGIQIRICLTIFFFFETEPHSVTQ 
AGVQWRDLG3LQPPPRRQEQNSVSKQTK 

TN 


11807 


25708 


A 


11912 


3 


461 


DAWGRVEGPPLRPPATSRRWAGPTLWRM 
EVTGDAGVPESGEIRTLKPCLLRRNYSR 
EQHGVAVSCLEDLRSKACDILAIDKSLT 
PVTLVLAEDGTrVDDDDYFLCLPSNTKF 
VAMAINEKWAYNNSNGGTGWISQESFDV 
FEAYSGATLLFFF 


11808 


25709 


A 


11913 


318 


407 


LCGRLWWLMPVIPALWEAEAGGLLKLRS 
MR 


11809 


25710 


A 


11914 


340 


5 


DRVAKFSEARLFRLFFPFFPLKI FCFPR 
GFKIFRGVCPLLFPPEFWGLFQKGPGMR 
LFFPPLGGFFFFFFFFFFFFFFS FFFFF 
FFPFFFFFFFFFCHKNKTLLKKVHKS I 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \-possible 
nucleotide insertion 


11810 


25711 


A 


11915 


274 


385 


IKPQRWGETRAEKLKSLKIRVPLLLQRII 
iiAPHQQWNK 


11811 


25712 


A 


11916 


259 


441 


DTKLPKVNIKLNFHALKKKKKKKKKKKK 
KKKKKKKKSSSLRG 


11812 


25713 


A 


11917 


254 


402 


LIVSVIDFLRWRLPLLPKLECSGMISAH 
CRLRLPC3SYHSPASGCQLLGR 


11813 


25714 


A 


11918 


223 


1 


NTNSPRKKFFYEEVSSVPVFFGVFPPSP 
LKGSPRAFLKLAWRPPLPLFFFLEMESR 
SJ4AQPEVQWCDLGSLQPP 


11814 


25715 


A 


11919 


97 


3 


KPFFFPFFFFFFEIESRSVTQaSVQWRD 
LGS 


11815 


25716 


A 


11920 


154 


384 


KEFFLMLFFFLPPPPPFFFFFFLGKGLF 
FFPPGGGGGGQFFS IGPPPPRGKKDPPP 
SPPKMGGKRGTPPPPGYFFFF 


11816 


25717 


A 


11921 


225 


369 


GTLNLLTYKIKSWSGRVAHYNPSTLGGW 
GKWITSGQEFETSIANWKP 


11817 


25718 


A 


11922 


116 


372 


MEYTWNKMAMSTNIfllYQTYKNSVIEHP 
TGVLPHSRVIMVMNIVLCIS KQLKELLR 
RLMWEEHLSPGGGGCSEPRSRHCTPAWA 
T 


11818 


25719 


A 


11923 


271 


409 


KKKKKKRKEKKKKKKKItKKKTaGGA 


11819 


25720 


A 


11924 


121 


228 


KKKKKKKKKKKKKKKKKKKKKKKKKKKR 
GGGFKKKPWGGQK 


11820 


25721 


A 


11925 


55 


423 


NKPKKKWFLKKKKKNFFFPFYPLKFFFF 
PKSMTFFRRWPKISPPKKKFFFKNSPS 
VFFFPPLKKKNFFFLTPLKFGPPKNFFK 
RPPPLFFFFFFFFFFFLSGNCIEKL 


11821 


25722 


A 


11926 


253 


443 


YQHQRPKVDKSMKMGRNQRKKSENSKNQ 
NTSSPPKDHNSSPARQQNWMENEFDGLT 

EVGFRRW 


11822 


25723 


A 


11928 


175 


413 


KKKKKKKKKKKKKKKKKKKKKKKKAC3GG 
A 


11823 


25724 


A 


11929 


490 


182 


RKQKIKGCKKPNPLAQGGVKKKGKKGPP 
FNFFLKQKGQDFLKNWNLGQKKKIPIPP 
QVFSLRQKGAFPGRIFPFFFFFETESCS 
VAQAGVQTLSYKNKNKIK 


11824 


25725 


A 


11930 


no 


2 


KK I PRKP P FFFFFFFF YKAGSHLVAQAG 

VQWHDLGS 


11825 


25726 


A 


11931 


159 


987 


YLE KIMSEHSRNSDQEELLDEE INEDE I 
LANLSAEELKELQSEMEVMAPDPSLPVG 
MIQKDQTDKPPTGNFNHKSLVDYMYWEK 
ASRRMLEEERVPVT FVKSE E KTQE EHE E 
lEKRWKNMAQ YLKE KLKNE I YANKEES K 
GSSNIQETDEEDEEEEDDDDDDEGEDDG 
EESEETNREEEGKAKEQIRNCEHNCQQV 
TDKAFKEQRDRPEAQEQSEKKISKLDPK 
KLALDTSFLKVSTRPSGNQTDLDGSLRR 
VRKNDPDMKELNLNNIENIRK 


11826 


25727 


A 


11932 


161 


389 


SVQTHPNLRSCSVLKNAMHFYLLGYVIS 
GCTEPAICAIKPIDRKSVHQICSGPWLS 
LSTAVKKIVGNSLDAGATNI 


11827 


25728 


A^" 


11933 " 


134 


289 


ASTQNMGQMLRLMSSPTLLSLSLSHTHI 
IIKI-ITRMIITIITHTIITIITYTPTKKV 


11828 


25729 


A 


11934 


128 


1 


IKKGEFFFFFFFFETESHSPRLQCSGTI 
LAHCNLHLLGSSNS 


11829 


25730 


A 


11935 


131 


19 


MYIGWVQWLTPVIPALWEPKVGASLEAR 
SLRPARTSE 


11830 


25731 


A 


11936 


104 


3 


LKGQL IRAQWIjMPVI PSLWEAEMGRSP E 
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nucleotide 
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ing to first 

re^idrof 

sequence 


Predict- 

otide 

location 

correspon 

Emino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^GIycine, H=Histidine, I=Isoleiicine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginlne, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 












VGSSR 


11831 ; 25732 


A 


11937 


146 


2 


GGRLGIIYHAWNPEGFFFFFEMESCSVG 
QAGVQWCNLSSLQPPHPGFK 


11832 : 25733 


A 


11938 


149 


1 


DKCFLCVCCLFIFIYLLIYFYFFETESC 
SIAQAGVQCCNLGSPQPLPLiA 


11833 i 25734 


A 


11940 


176 


380 


QSVAQLPHLECNGMIIAHCSLHLLGFKR 
FSCLSLPSSWDYSHLRNYQGGRRWSKTH 
QAWCQSLTILWN 


11834 25735 

11835 1 25736 


A 


11941 


133 


415 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKSG 




11942 


222 


420 


QGDKFLDNNNSTILYMEKKKKKKKKKKK 


11836 1 25737 


A 


11943 


163 


425 


KWPPKNSQLSPSTLKKKKKKKKKKKiaCK 


11837 


25738 


A 


11944 


143 


3 


KMGKKQSHKTGNSKKQSASPPPKERSSS 
PAMEQSWTENDFDELHEE 


11838 


25739 


A 


11945 


200 


3 


QNSQVGYKGAPHKEKTDOALQaRVNLGPP 
RGPLKRPALFFFFFETESPSVTQDGVQW 
HDLGSMQPP 


11839 


25740 


A 


11946 


221 


3 


RKIEGGPSTSRKSVRGVRCFQPLGLGFP 
GGLFFFFFFYETESCSVMLECSGWSA 
HCILHLRSTNEGRHGR 


11840 


25741 


A 


11947 


68 


177 


ISSSFKNKCLCBKKKKKKKKKKKKKKKK 
KKKKKRKR 


11841 


25742 


A 


11948 


21U 


329 


RHTHTHTHTHTHILSLSLSirrHTHaHTL 
MYFLAFFDLRS 


11842 


25743 


A 


11949 


183 


20 


LGTVAHS CNFVFSLSRGELHVGKNGLRL 
PT SDD P P TSASQSAGTTGMSPCTWPE 


11843 


25744 


A 


11950 


232 


329 


VGIKMSISSDEVNFLVYRYLQESGFSHS 

AFTF 


11844 


25745 


A 


11951 


412 


239 


LFYEKGSRFVSQAGLELLELKQSPCFGL 
PKCWDYRHEPPRQADLPFCVPSSAWASW 
L 


11845 


25746 


A 


11952 


310 


393 


PHTDISGTPEIMHYVHVHRVTTQPRNKP 


11846 


25747 


A 


11953 


193 


3 


LSSWPTLISGAFKAEENAVEGNDDSRIL 
RGFFVLFCFVLFFRQSLALSPRLECSGV 
ISAHCNL 


11847 


25748 


A 


11954 


185 


416 


SVQTHPNLRSCSVLKHAMHFYLLGTEPA 
KAIKP IDRKSVHQICSGPWLSLSTAVK 
KIVGNSLDAGATNIDLKLKDY 


11848 


25749 


A 


11955 


100 


1 


AHLSKVFPIFFFFFEMESHSVTRLECSG 
TISAH 


11849 


25750 


A 


11956 


35 


521 


KEKFFFH&GVYWGPPRNFLKRAPLFPFF 
PFFFFFFFFFSFFFS 


11850 


25751 


A 


11957 


214 


1 


KKKIFFFKEIFYPKLSRKKKNFFRGAGK 

ILNLSIFPEKKKIFSPFSFFFETESCTV 

AQ AGVQ'/J H V LGS L Q P 


11851 


25752 


A 


11958 \ 114 


1 


RYFSGQAQWLTPVIPALWEAEAGRSPEV 
SSSRPAWLTW 


11852 


25753 


A 


11959 


188 


12 


SKCQCTFFSNFKKTEMKSEYLAQAGLEL 
LGFSHLPTSVSQTVGITGVSHCAQPKAH 


11853 


25754 


A 


11960 


241 


556 


SSIPFLPNKHLLLSFSTLSsLeHEVSIQ 
DMCQGTYQKQPFPFPLPHVFHLS IHDGE 
LCVGRDVHLTCQTDIAQVFVCLFVLQVL 
LSLPRAGVQWCHLSSLQPPPP 


11854 


25755 


A 


11962 


362 


462 


KIGQALTP 1 1 PLLWEAEAGftSPEAESSR 
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nucleotide 
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SEQ ID 
NO: of 

sequence 
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od 
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NO: in 
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nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
Ust amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoph an, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SAWPT 


11855 


25756 


A 


11963 


43 


2 


CCCYCCCCCCYCC 


11855 


25757 


A 


11964 


30 


415 


GLGFKQKIFFFFFLGQGPNFDLLRAKKR 
PFLPPPFFPKWPKNRGVFLLGPSGVPKP 
RPRQGDVPPLKPRSLGVFLTSKKTPLGP 
WANPPPPPGDGP 


11857 


25758 


A 


11965 


149 


2 


SKTWLKLRKYLKNTYPGMVRWLIPIVPA 
LWCJAGftGELHEPRNLRPAWAT 


11838 


25759 


A 


11966 


179 


3 


KKNIFFPPPVKFGPPQGFFKRPPPLFFF 
FFFFFFFETESHSVAQAGVQWLSIGSLQ 
AP 


11859 


25760 


A 


11967 


245 


382 


DVTDSKMHPDFHSSHVKMWLGWAHACN 
PSTLGGSRGQEFKTSIiftN 


11860 


25761 


A 


11968 


264 


10 


LSTLECKGVSPGVFSPKGWHKGLGYIFS 
PFLEKPHIPPLGISMYFFFFFDGVSVAQ 
AGVQGRDLGSLHLPSPGSSYSCaSASLS 
S 


11861 


25762 


A 


11969 


326 


406 


RLDKGWaLWLTPVIPTLWEATVGRSPE 


11862 


25763 


A 


11970 


120 


1 


KGYGFPFPPKKPLGPPFFFFFATESCSV 
AQDGVQWRDLGS 


11863 


25764 


A 


11971 


84 


370 


RNGAELKTPLAGQTQITINCRYSWVTLV 
LGDLESILCWHIFKKTNLFICGFFSSFL 
LGAGRMKSHCWELECRGMISAHRNVCL 
LGSNDSPCSAF 


11864 


25765 


A 


11972 


196 


3 


SRGKMGEYPFVLFKNPSWAKAGGNKtSWP 
FFFFFEAESCSVAQAAVQMCNLGSLQAP 
PPRFTPRA 


11865 


25766 


A 


11973 


559 


644 


KIGCSAYGVAILLFLYFFNKLAFTLWKK 


11866 


25767 


A 


11974 


3 


391 


HEAQLPE PLMVLGTGHLAEMAGGSRTYL 
LLAFGLLCLPWLQEAGAAQTVPLSRLFD 
HAMLQAHRAHQLAIDTYQELEETYI PKD 
QKHSFLHDSQTSFCLSDSIPTPSNMEET 
QQKSNLELLRISLLLIE 


11867 


25768 


A 


11975 


138 


365 


LKFECHSTLCANHGKQTFTFVFYNKFLL 
KYTFYFSVLTYAINWYIKNLINVFKKKK 
KKKKKKKKKKKKNFKKKKN 


11868 


25769 


A 


11976 


275 


433 


MSILDLSKARNFFLSFLETGSCSITQAG 
VQWLNHSSLQPQTPGLRDPPASASQ 


11869 


25770 


A 


11977 


108 


3 


RCGLFFFETRASPCRPGWHNLGSLQPLP 


11870 


25771 


A 


11978 


270 


408 


FENNVWLGRAQWLTPVIPALWEAQAGRS 
PGPENQNHPDYKGNPWPY 


11871 


25772 


A 


11979 


228 


441 


QALKFVIEMLLSLKKCLDVSVIFNRHKK 
lELLQKKKKKKKKKKKKKKKKPSQKKKD 
SSRGKDS 


11872 


25773 


A 


11980 


2 


447 


GALALNKTTADVWRLNFLVSGLHWKRWL 
QQTSLSKWVKIKECS ILKKKKKKKKKKK 
KKKKKKKKDKK 


11873 


25774 


A 


11981 


124 


445 


KYGEMSQNPARGGPKFSKNQKYSEHLRI 
HCCPPFTFLISKKEIGDRKYSICKSGCF 
YQKKBEDWFCCPCQKTKTSRRAKSLKRP 
KQKPVAPPGGVKAPAKPRSLPRF 


11874 


25775 


A 


11982 


48 


429 


KESNGSQDRLLPKIHSPDVSGGKSGGMS 
QNPARGGPKLSKNQKSSEHLRIHCCPPL 
PFLISKKE IGDRKYS FGKSGCFSQKKKE 
DWICCPCQKTKLKGKIRPPPKKNGPGGS 
LNGRTTWVSGLFVHN 
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SEQID 
NO; of 
nucleotide 
sequence 


SEQID 
NO: of 

peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last Rinino 

acid 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=PlienylaJanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlcnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11875 


25776 


A 


11983 


419 


500 


SLLVICVEKQWPGAVAHACNPSTLGGPG 


11876 


25777 


A 


11984 


362 


496 


LWSPIRILLLRWAWWHVPTVPAIWEAEV 
GRLLEPRSLCNMVEP IS 


11877 


25778 


A 


11985 


142 


433 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 

KKKKKKKKKGGGGVIKKFLGGPIEX3GGG 

KEKFFFFGGGFIN 














11878 


25779 


A 


11986 


177 


2 


CQRKvVri iSHKTKS TTRPLGQLKKGWSP 
GAVAHT YNP S FLGGQGGWVTY WRTRGR 
TR 


11879 


25780 


A 


11987 


178 


443 


DRKKKKKKKKKKKKKKKKKKKKAGGGLF 
KKI 


11880 


25781 


A 


11988 


329 


410 


LIFFFIFFERESHSVAQAIVQWHDLGS 


11881 


25782 


A 


11989 


146 


1 


DRASALQPGRQGRNFVSKKTKNNNVI YI I 
NTLTTT I FLGTT YDALS PEL 


11882 


25783 


A 


11990 


240 


420 


GVFAPLLGDVSQSGYTGFRDPLEEAVCL 
FLELEC3iaERTTALFRAWQGCLSLQNL 
SVAF 


11883 


25784 


A 


11991 


239 


410 


GTLFIYLLGDLFIYFRNKSLTLLPRLEC 
GGVIVAHCSLDLLGSGYPS ISALPS SWD 
C 


11884 


25785 


A 


11992 


126 


3 


YTGEYKSFCHKDTCTHMFTAALFTIAKT 
WNKPNGRVGGRV 


11885 


25786 




11993 


300 


22 


SQLLGRLRQENHFNWGGRGCSEPRSCHC 
IPAWATRANSIFCGFQASSVEVRRSARK 
KLFSDILKRHNTI SWRVSGLLLVDS YFG 
RLATPVRAQ 


11886 


25787 


A 


11994 


303 


2 


EGEEIFLPSPPFPPGQENPGSNCSPPLF 
PFGTFLDIRVAGSHKVQAKGVFKroPPS 
LHLFLI KKRFFFFFFETESRSVPQAGVQ 
WRDLGS LQTPPPGFKR 


11887 


25788 


A 


11995 


171 


457 


S FSDHL ILGW IGFOroiLS KIALCSS PI K 

SYQDSS 














11888 


25789 


A 


11996 


357 


204 


EKTGFPHVGQAGVDLLTSRSTRLSLPKC 
WDYRHEPMRLAGHLYYFYPQMK 


11889 


25790 


A 


11997 


219 


1 


PRWWAPPPR YP PGALFWAPRPVGVS LG 
ARAPTKAGPKQKGARGTFFFFFEMESHL 
SPRLECNGTIWAHCNLC 


11890 


25791 


A 


11998 


254 


1 


AHLRGNRQLPKHTFFQYMTTNLKCAFSV 
GRQSYSIPWRSFYTSLFFKYSCVFHTHT 
HTHTHTHTHTLYFQ liVlVLLL PS LRKKGS 


11891 


25792 


A 


11999 


167 


2 


NFKS FFQGLSRGVLNPKVNLFFFLETES 
CSVS QAGMQWGDLDSLQPPPTRPPTRP 


11892 


25793 


A 


12000 


104 


3 


RPWTFFFFSETKSRSVAQAGVQWRDLCS 

LQPPP 


11893 


25794 


A 


12001 


178 


2 


KI FQRGAKNS PWGLPFRGGENKKGAPPC 
QKGFGRFFFFFETESRFITQTGVQWCDL 
GP 


11894 


25795 


A 


12002 


163 


282 


GIGGEWCLSKYVIKYVKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKK 


11895 


25796 


A 


12003 


195 


2 


KWGRIPLKKGGFCFTKKKSGQTLLIKSP 
PPAFFVFFIFFFLETESRSVAQATVQWC 
DLGSLQAP 


11896 


25797 


A 


12004 


152 




CFVDVSABKLRPKDFM'nCTPKaMaTKAK 
IDKWALTKLKSFCTAKETIIRV 


11897 


25798 


A 


12005 


359 




EWPIWPPPKRRVLSKVSKQFFISAPIR 
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nucleotide 
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NO: of 
peptide 
sequence 


M 
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od 
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26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asparti(! Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Froline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,y=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKPFPCQVGLILAPFRVLLIGRPFFFFF 
GVLWEIQRWSFISLWWVRARIMMYA 
LFLLSVGLVMGFVEFSSKHCPVYGALVl 
VSIGGRV 


11898 


25799 


A 


12006 


189 


12 


DGRLPGSVCYYLIFLRQSFTLLAQAGAQ 
WCHLGSLQSPPPGFKRGRKVSRIRAVEG 
DQ 


11899 


25800 


A 


12007 


131 


3 


KIFFFFFFFEMESHSVACJAGVCJWRnLGS 
MQDPPPGFMPLLFY 


11900 


25801 


A 


12008 


150 


2 


I ILFLVPIEAVFFFFETVAEAGVQWHNL 
SSLQPPPPGFKQSPASASRVA 


11901 


25802 


A 


12009 


305 


3 


KFFFSLKASIRGRWEGLFFTPPKKGFLP 
KI PHQVFKGGPLWEBCLQLGKAGLNLGPT 
KGFFKGftflRFFFFFFElESRSVAQAGVQ 
WRDLAHASVHRSAHAS 


11902 


25803 


A 


12010 


263 


517 


DKGFQLPPRWCPGQDPLFFYGGIVFHGG 
YGPPFFFFFFKAESRIiVAQAGVQWQDLG 
SLQPPPCRTRGSP 


11903 


25804 


A 


12011 


107 


374 


WMGTWGSLPVLLTTRSTCPPRSPRLHRR 
TGATIWIFFRLFETGTVLPMLECSTVIM 
AHCRLQLPGSSSSPTIASQVAGTKftHLS 
DCFVY 


11904 


25805 




12012 


144 


261 


FPKKINNFYFTFLQDPTGIFSLDKTIGL 
GTYGRIYLVS 


11905 


25806 


A 


12013 


371 


3 


IFHLRKIFTFLPGIFWPIRLLSEKKKPK 
RFWVKIKKFKNGYPIiKNPINPKYKGPRF 
FFKGEKNPNGKVPVLKKIKIPSFSKALG 
FLQKKASKPFPFPFFDTESCTVDQSGVQ 
WCDLGSLQAP 


11906 


25807 


A 


12014 


101 


3 


RGAFFFFFFETRSRSVIQAGVQWCDLGS 
LQAP 


11907 


25808 


A 


12015 


203 


2 


VFNPSGINVMYVDAI ILNKRLATR IQQH 
IKQRILHDQVGFIHGMQGWFS I RKS INV 
IQHINRPKDKN 


11908 


25809 


A 


12016 


168 




GCVCQFQKKLGNGGLNGFFFPLFFFFFF 
EMKSRSWQAGVHWRYLSSLQTPPPEFK 


11909 • 


25810 


A 


12017 


311 


2 


RGLRFGQWKTQMENPNTPFSALRPGQLS 
S I RSGLLHTYPGiGOKSBPLS CS ILS SS 
KYWWQAVTSALSSNKPGASQGHWKDDF 
FLFFETES CSVTQAGVQWH 


11910 


25811 


A 


12018 


383 


247 


LVEM6FRHVGQDGLDLLTS QS AHLGLPK 
WWD YRREPPRLATI EVL 


11911 


25812 


A 


12019 


2 


379 


RVLWTAHLAAMAAGSRTSLLLAFALLCL 
PWLQEAGAVQTVPLSRLFKEAMLQAHRA 
HQLAIDTYQEFISSWGMEAYITKEQKYS 
FLHDSQTSFCFSDSIPTSSNMEETQQKS 
NLELLHISLLLIES 


11912 


25813 


A 


12020 


3 


389 


PEPLRVLWTAHRGAKGAGSRTSLLLGFA 
LLCLPWLQEAGAVQTVPLS KLFDHAMLQ 
AHRAHQIAIDTYQEFEETYIPEDQKYSF 
LHDSQTSFCFSDSIPTPSNMEETQQKTN 
LELLRIFLVLIELWLDP 


11913 


25814 


A 


12021 


247 


386 


PMLGHVSQSGGNGVRDPLEEAVCPLAKL 
KHCSGRSTALFRAGRQKR 


11914 


25815 


A 


12022 


17 


371 


PLRDLWTAHLEAMAPS FRTDLLLAYALL 
CLPWLQEARAVQTVSLYRLSDHAMLQAH 
RAHQLAIDTYHEIEETYILKDQKYSFLH 
DSQTSFCFSDSIPTPSNKEETQQKFYLE 
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ing to first 

residue of 
peptide 


Predict- 
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location 
correspon 
ding to 
last amino 
acid 

residue of 


Amino acid sequence (A=AIanine 
C=Cysleiiie, D=Aspartic Acid, 
IC=GIutamic Acid, F=PIienylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LLPISL 


11915 


25816 


A 


12023 


3 


359 


LRALWTADLGAKAPGSRTFLLLASALLC 
LPWLEEAGAGQTVPLSKLFDHAILQAHR 
AHQLAIDTYQELEETYI PECDQKHSFLLD 
SQTSFCFLDSIPTPSHMEETRQKSNLEL 
FRTSLLL 


11916 


25817 


A 


12024 


2 


363 


PEPLKGLWTDHLGALAPSSRTSLLLAYA 
LLCLPWLQEAQAGQTVPVYTLFDHAMLQ 
AHRAHQLAIDTYHELDETYMPEDHKYSF 
LHDSQTCFCFSDSITTPYNMEETQHTSN 
VELLRISL 


11917 


25818 


A 


12025 


314 


393 


GQAWWLTPVIPAFWEAKGGRSLEARS 


11918 


25819 


A 


12026 


80 


392 


PLIACSFFLFLFFFFGKKILFLPPRWKG 
GGGIWVKGNPGFRGQAFFIiASSSKKPGM 
GGPPLTPGKRFVFLKKKGFSPGGPGGSK 
FSAPGTPPTGPPKGWEI PAK 


11919 


25820 


A 


12027 


165 


2 


WQKLLFHFGTESCSVARVQVQWRHFSS P 
KPPPPEFKQLSAPASRVADRRPPDAW 


11920 


25821 


A 


1202S 




338 


VFVALKFLMCLLSVCIFSLENIYSNSLP 
QKNCLPLIFFFLFFEMEACSVTQAGVQW 
GDLGSLQPPVSHLNLGGGGCCEPRECHC 
TPCSRPGDRDFVFNKNTMIQDNHIMELT 


11921 


25822 




12029 


2 


315 


HEBRERERERERERERERERERV6RTVG 
GVGETYKAELPRVGGAAQKRAHFSARGR 
LFMEICQDMCGEKPGKPPHSLTEECLSR 
CQERFFDTSLAITRGCaQSV 


11922 


25823 


A 


12030 


216 


3 


ERIIPRGVRQRNRLFSGRDTASPPI.FYP 
PPRQIHKRGVEDREHERRGERERERERE 
RERERERERARAAR 


11923 


25824 


A 


12031 


114 


329 


QTEHNSININKKDIHTKTPSVBHQHQRP 
KVDKTTKMGRNQSRKAENSKNWKRSSPP 
KEHNSSPAREQKrWMES 


11924 


25825 


A 


12032 


177 


1 


IKKKTPFWQNIPLPSKKKrCEQKFFFFF 
FFETGSLLPRLVCSGAVLAHCSLCLPGS 

AFL 


11925 


25826 


A 


12033 


127 


1 


PSFFFFFFFNETESRSVAQAGVQWRDLD 
SLQAPPPGFITPSC 


11926 


25827 


A 


12034 


47 


314 


GAPVASVSISCPSCSATDGWRNGKSTA 
GHQRYLCSHCRKTWQLQFTYTAS QPGTH 
QKI IDMAMNGVGCRATARIMGVGLNTI L 
RHLKN 


11927 


25828 


A 


12035 


15 


408 


GAIPGAMGHFTEEDKATITSLWGMVNAE 
DAGGETLERIiLVDYPWTQRFFDSFGNLS 
SASAIMGNPKVMAHGKKVLTSLGDATKH 
LDDLKGTFAQLSELHCDKLHVDPENFKL 
LGNVLVTGMAIHLGKEFTP 


11928 


25829 


A 


1203S 


122 


2 


KLVDVLPPPGGARVFFFFFFPLETEHCS 
VAKAGVQWHDFG 


11929 


25830 


A 


12037 


154 


1 


GKPPPKLGFWGGPRQRVRGRGPPFFFFF 
ETESPSVAKAGVQWCDLGSLHPL 


11930 


25831 


A 


12038 


58 


254 


DPRVRQQRTTAHCSLNLPGLQRPPSLSL 
PSGWDYRHAPLCATQKILAYRVAQRERS 
AHGSYYQAS 


11931 


25832 


A 


12039 


270 


124 


DGVLLLLPRLECNCAISAHGNLRLPGSS 
NSPASNKTKQNNNKKTLSNKF 


11932 


25833 


A 


12040 


63 


355 


LGRGTAHRRLWRPLSRPASRVSYLSAAT 
NKESFAPPSRAFPWDNKWERGGFYFFET 
GSHRRRPGRMECSGAITAHCSLDFPGSE 
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nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=passlble 
nucleotide insertion 














MGFTMFPSLVSNS 


11933 


25834 


A 


12041 


184 


3 


GVSRLLGGVSQLGYMGVRDPLEEAVCLF 
SKLKHHPGRTTALFRAVRQGCLSLQKFL 
LLFA 


11934 


25835 


A 


12042 


118 


3 


DRVLLCYLGWECSGTISAYCNLHLPGSK 
RFSCLSLPSS 


11935 


25836 


A 


12043 


40 


436 


LSEGKLTNRKDNHTKTPSVHHHHQKPKV 
DKTTRMGRIQSTKAENSKNESTSSPPKE 
HSTSPATEQSRMENDFDELREDERRSVI 
TNFSELKEDVRTLCKEAKMiEKRLDEWR 
TRINSMEKTINDLMELKTMA 


11936 


25837 


A 


'12044 


101 


2 


HLGQVRWLTPVIPILWEfiEVGGSPEHRS 
SRPAW 


11937 


25838 


A 


12045 


123 


3 


LTVIKELPLGQARWLTPVIPALWEVKVG 
GSPEVESSRPA 


11938 


25839 


A 


12046 


3 


432 


PIFGGGGERIFPFFFFFSQGGGWFPKT 
KNKGFFPFLGFFKKIFLENFFFLFFPFF 
FUSTPFFYFGAPPFFFFFPGGGFFFPLFS 
RFFKFFFQNFPQGFFFGGGFFFFFLGFF 
FFFFFL 


11939 


25840 


A 


12047 


325 


164 


KNKRKNRGQARWLTPVlPALWEAEAGGS 
PEVRTTGVSHGTRPHLLLNTVS EY I 


11940 


25841 


A 


12048 


187 


381 


LMDKRVSLWGDE1III.KVESNSGCTTLYL 
LVCFEMESRSITQAGVQWRDLSSLQPPP 
SRFKRFSCL 


11941 


25842 


A 


12049 


84 


464 


SYRVPSHPDTLVLSRISAQEAGEKSPFC 
FPERVWPCPRPLSDLQRRLKLECGPDLL 
DSTFLSFFFGEGLKTGSHSGALECSGVI 
RDHCGLCLSGSGDPPISaCLRVAGTTGV 
SHHCIFCRQDLPGS 


11942 


25843 


A 


12050 


232 


20 


LLQCSSRAKIHTSLTGNQKLEMIKLSEE 
GMSKAEISQKLGLLHHLGLVANAKQRFL 
KVIRSATPEFRHAE 


11943 


25844 


A 


12051 


401 


518 


FFETESCSVTQAGVQWRNLS PLEA 


11944 


25845 


A 


12052 


191 




LFPLKKEKI FFLGPGTNGAPPMFFLKAP 
PLFFFFFFFFFFFFFFFFFFFFVNLFLL 
LSFLHILEP 


11945 


25846 


A 


12053 


12 


369 


PLP I YSAC I E VGSNPQGPGIDAKS VSHN 
NCYLKEKKKKKSNGSQNNLLPKIHTPNG 
SGKKTGEKIQNPPRGGPKLSKNQKYSKN 
LKKHCCPRLTFLISKREKBNRKYTPGKS 

GCFYQKK 


11946 


25847 


A 


12054 


89 


513 


NFTVRGS I PLLNNI PMRNLLLLI VP ILI 
AMAFLMLTERKILGYIQLRKGPNWGPY 
GLLQPFADAIPLFPKEPLTPAPSAFALC 
FAAPALALPFALFLCPPHPLGRP 


11947 


25848 


A 


12055 


158 


3 


SQLTWPIMAQTDSMCLWIVYFDDNIGWA 
WWLMSVIPTLWEAKVGGSLEHSN 


11948 


25849 


A 


12056 


113 




KSHGGHGVEVLVSVLVGLAMEVGLVLWV 
KEYDGARLA 


11949 


25850 


A 


12057 


44 


253 


ARIARSAHEGKMPRKYTGAEKKAENRPQ 
RELQLKASRSTIDLAEHACNALWYQLFL 
ISVGSWKNYILFYLTYT 


11950 


25851 


A 


12058 


142 


1 


KKIFFRYKKEGFSFPKFFRISLFFFFFF 
ETESRSVAQAGVQWRDLGS 


11951 


25852 


A 


12059 


12 


347 


QTERNS ININKKD IHTETPSEGHQHQRP 
I VDKS TKMRKNQCKKAENS KNQS AS S PA 
KDHNCLPAKEQNWTENEEHKLTEVGFRI 
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NO: of 
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nucleotide 

location 
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ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H^Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaiiiine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














ITNSVELKEHILTQCKEAKNLEQHLEEL 


11952 


25853 


A 


12060 


245 


3 


MPEGSFFLEIPEITRVFVKKPGKFLGGV 
GPSLLfFfPFFlfFFFiyiRQSIiAVSPRLSC 
NSAISAHCNLCLLGSSDSRSSGSR 


11953 


25854 


A 


12061 


94 


3 


KGGIFFFFFFFFLRQSPSVAQAGVQWOD 
LG 


11954 


25855 


A 


12062 


108 


5 


NNLSSGVRDQPGQHGETTSLLKIQKLAR 
HGGRHL 


11955 


25856 


A 


12063 


298 


1 


KKKKINTHAQKRGPPFFFKPPPEKGKAP 
PPQRVTKKC3GEKDSFIPKRREKKNPPPL 
SFFFPLVRVHVRERERaEAREREHERER 
EREREREREHERERAR 


11956 


25857 


A 


12064 


122 




VCEGVCVGGWVRDRERERERERERERER 
ERERERERERAR 


11957 


25858 


A 


12065 


126 


1 


ATCTKFRNDNRALLRVRVFBRERERERE 
RERERERERERERESR 


11958 


25859 


A 


12066 


48 


351 


FFFFFFFFKKGPKIVPPGGGGGGDPIFL 
EPPPPPLKKFWGPPLRGRGNKGGGPPGW 
VNFGIFLKKKGLPWGPGGVKTPAFKGSP 
GPTPPKGGNNRKNPPPW 


11959 


25860 


A 


12067 


1 


365 


GTRLTVLWTAHLVAMAPGSRTSLLLDFA 
LLCLPWLQEAGAVQTVPLSRLFDHAMLQ 
AHRAHQLGIDTYQEDEETYIPKDHEDSF 
LHDCQTSFCFSDSIPTPSNMEETRQKYH 
LELLRISLLF 


11960 


25861 


A 


12068 


174 


1 


PAWVMQHNPVSLFFCFFETESRSLTRLE 
CSGTISAHCNVRLPGSSDSPVSPSRVAA 
RA 


11961 


25862 


A 


12069 


270 


375 


TRIKRCNGGRAQWLTPVIPALWEAKAGE 
SPEVRSS 


11962 


25863 


A 


12070 


197 


350 


KKIYIFLTINTIMDLIIPFLLDNILGFW 
LGAVAHACNPSTLGGRGGWI TRS 


11963 


25864 


A 


12071 


3 


378 


HEGQLPEPLKGLWTAHLLGMAPGSRTSL 

LLAIDLLCLPWLKEAVAVQTVPLSRLYD 
HAMLQAHRAHQLAIDTYQEIEETYIPKD 
QKLSFLHEYQTSFCFSDSIPTPSNMEET 
LHKSHLELLRIYL 


11964 


25865 


A 


12072 


219 


1 


FSHPFPPGVFQTPLVFPRPWEVLGSPGG 
GGGPPKKWAGVQKWVFFFFFFFl\rEMESC 
SVAQAGVQWYDLGS PRA 


11965 


25866 


A 


12073 


134 


3 


VFGPPFFFFFFCETESHSVAQAGVQWRD 
LCSLQAPPPGFMPSC 


11966 


25867 


A 


12074 


175 


357 


MPINQPVKKMCVCVCVCVYIYIHTPLYI 
RVYMYMCMYICVYIHVYIWMCIYIHTHV 
STYW 


11967 


25868 


A 


12075 


193 


350 


K I PHLTS L YHTQN YLKYC IGQAWWLTP V 
I PALW EAEAGGS P E VRS SRPAW PW 


11968 


25869 


A 


12076 


252 




GVFGPFPKKGGFFWVKGFLGTPQIFWVG 
VFFKKKGGALWVKGLLGKPPPPPLFFFF 
FETES CSVAHAGVQWRDSVKKKITIARA 


11969 


25870 


A 


12077 


273 


366 


I FI YLFIYLFBTESCSVSQAGVQWPNLG 
RLR 


11970 


25871 


A 


12078 


127 


2 


KKKKTLFFFFFFFFEME3RSVTQAGVQW 
CDLSSLQPPPPLV 


11971 


25872 


A 


12079 


206 


334 


HNRVTIVNNNLIVHFKITKRCWTQWLTP 
VIPALWEAKAGGSSE 


11972 


25873 


A 


12080 


135 


2 


KMKHKFGEDSQIPKTLQTFFFLTESRSV 
AQAGMQWCDLGSLHSC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 

correspond 
ingto first 
amino acid 
residue of 
peptide 


Predict- 

lo cation 
eorrespon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11973 


25874 


A 


12081 


147 


3 


ATTPGLFFIFYFFETESHSIAQAGVQCM 
ISVHCMLCLPGSSDSPVLV 


11974 


25875 


A 


12082 


106 


2 


YFI PINEGYCFIYLFFETESRSWQAGV 
QWCDLV 


11975 


25876 


A 


12083 


164 


346 


CPYKKRKKAKQAKWLTPVIMLBIGRPRRV 
DRPSSGVQDKPGQHGETPSLPKIQKKSQ 

AWWCA 


11976 


25877 


A 


12084 


186 


1 


KSLFSKGNPLFKGPPPPPFFFFFFFETR 
SRSVTQAGWGAVir/iAHCCLNLPGPSDPP 
TPAPRA 


11977 


25878 


A 


12085 




CPFFRVNFFFFEMESRSVAQAGVQHRDL 

GSRA 




25879 




12086 


154 


2 


SPTPEKGVWELPSPFFSWGPKIWGCIFF 
FLYETESGSVAQAGVQWRDLGS 


11979 


25880 


A 


12087 


326 


3 


KTSVITCGLPMGWSWGPFPKGPWWCKKF 
CVCKFWMrQTLKKKLGaGRFQKPFSAAA 
FLEMGGFPFKFFRFFQGSQFFFFFFFET 
ESCSVTQAGGQWCNLGSLRPLPP 


11980 • 


25881 


A 


12088 


154 


3 


KEFSFFAPGGKQRGEIRSLRGPPPQVKP 
LFFFFFFETESRSVAQAGVQWR 


11981 


25882 


A 


12089 


174 


2 


SWKAILQYSLEIlLYLPSFOlFLdFSHT 
HTHTHTHTHTHTLYSQIMVLLLPSLRKK 
GS 














11982 


25883 


A 


12090 


265 


30 


WF I VYSE I SQLW6LPSFNTLGTWQS LS F 
IFVAEMRSCCVTQAGLELLASSDPPVSA 
SQSARIRAMSPSVAWVIPGRSR 


11983 


25884 


A 


12091 


257 


1 


C5SVMRMHTEEQYPENKVEQSSSGFIRPH 
LVYRVCFYLSVCLSVCLS lYLS lYLS I Y 
LSFFLSFFLLRKSLYLSIFLSFFLLSPR 
V 


11984 


25885 


A 


12092 


311 


404 


LYHQNPWLYSVFFFLETESCSVAQAGVQ 
■ WCD 


11985 


25886 


A 


12093 


331 


83 


GDDYKGARENSRDDKLFyVL IMMWIQL 
HVFVKIHKTLQLNGYILLCKLYLINLTK 
NNFQKPKTKSQKTLHI FHPDSTAANI 


11986 


25887 


A 


12094 


82 


2 


LRE PRLLNPGGEGCNEAEWCHCTPAL 


11987 


25888 


A 


12095 


255 


393 


HG I LFS S FVCRLRQ FLSQHWLM PV I PAL 
WEAEVGGSLEPRSSTIiflW 


11988 


25889 


A 


12096 


80 


3 


PFFFFFFFFETE SRS VARLE CSGMX 


11989 


25890 


A 


12097 


367 


1 


PRVDKPTKMGKKQGRKTGRS KKQSPSAP 
PKERSSSPATEQSWMENDFAKLREEGFR 
RSNYSELQEEIQTKGKEVETFEKNLDEC 
I TR I PNTEKCLKELMELKPKAQELREEC 
RSLRSRCDQL 


11990 


25891 


A 


12098 


94 


236 


MWADYFCMSPSIDEGLKKKKKKKKKKKK 
KKKKKKKKKKKKKKKGGAP 


11991 


25892 


A 


12099 


410 


212 


KSLHPSTSLFRFSSHFHIKLISGQAQWL 
TPI IPVPWEAKAGGLLETRSSRLAWAAQ 
EDPISMPVCK 


11992 


25893 


A 


12100 


140 


324 


NICQAQW-.MPVIPARPWEMVGRSLAVR 
SSSNSPASASQSAGPTGPSHHNWLMFLQ 
LLTRL 


11993 


25S94 


A 


12101 


247 


357 


KQIQGEAWWLTPVFSPLMEAKAGESPEV 
RSLRPAWPT 


11994 


25895 


A 


12102 


245 


2 


DVLVGGLCLPAGGTLHLTS IFLLSGCWC 
GSKDEEAPCKQRISVQRESQSRTLRAGV 
SPKICflHPCEMCGLILEDVLHFPDLQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

■ng to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, X=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arglnine, S=Serine, 
T=Threonine, V=V aline, 
\V=Tryptophan, Y=Tyrosine, 
X=TJnlcnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11995 


25896 


A 


12103 


230 


379 


KEASFFETESHSVSQAGVQWRSLNPLGS 
GNP PTSTSRVAGTTGMSHHVWL 


11996 


25897 


A 


12104 


86 


2 


DGVSVLSPRLECNGLILflHCNLRLPGSS 


11997 


25898 


A 


12105 


116 


2 


SQKTPFFFFETVSLLLPKLECNGAISAH 
CNLCT.PGSSD 


11998 


25899 


A 


12106 


271 


416 


QFTFSKAFNLVFIKPSFHISFYRTHYSQ 
AWWLTPVIPELWEAEAGGSP 


11999 


23900 


A 


12107 


194 


3 


HVIiGTVLVAEDTASNKIRSLQGCLLLPL 
LFSWliEVLARAIRQEKEIKVIQIGREE 
VNRRRQR 


12000 


25901 


A 


12108 


142 


3 


LCLLYWDCKRHAWWRGQVRWLTPVI PAL 
WEAEftGGSPEVRSLRPAW 


12001 


25902 


A 


12109 


233 


346 


HQRSLIGWAQWIiMPVIPALWEAEAeVSP 
EVRSLKPAWP 


12002 


25903 


A 


12110 


234 


349 


GFSPFYDFCLFVFETESCCWQAGVRWH 
DLGSLQPPPP 


12003 


25904 


A 


12111 


136 


333 


RLNFFYFYFFETASCSVAQAGVQWHDLA 
HCI PAWETEQDS I S KNTVQKRKKBCVTRA 
GISKTKNENF 


12004 


25905 


A 


12112 


,345 


110 


QDGLCLVMTLQETQPILAYSLWFPIVFP 
TTKACNVQGDSKFLLKKKyLQQVQWLMP 
VIPTLWGADVEGSPELRSLEPA 


12005 


25906 


A 


12113 


108 


285 


YNAMKNRFLKTILNNKNSIGWARWLMAV 
I PALWEAKAGRSPE VKS SRSRPGAVAQV 
DAA 


12006 


25907 


A 


12114 




364 


RWAAEMGKFMKPGKAALDLAGRYSGRK 
AVIVKNIDDGTSDRPYSHALVAGIDRYP 
RKETAAMGKKKIAKRSKIKSFVKVHNYH 
QLMPTRYSVDIPLDKTWNKDVFRDPAL 

KRKARREAK 


12007 


25908 


A 


12115 


120 


3 


TQIWGAFLTLFFFFFFEAKSHSVAQAGV 
QWCSLGSLQA 


12008 


25909 


A 


12116 


120 


3 


TQIWGPFLTFFFFFFFEAKSHSVAQAGV 
QWCSLGSLQA 


12009 


25910 


A 


12117 


231 


1 


FLSPFSCYSAITKLLSLSYFLLGHILLL 
LGSTPEAAAQWQWSFADSDIVPPAST 
WVFPTLG IMHHNKQATENAKE 


12010 


25911 


A 


12118 


216 




LIFPPLLNFCYGEVLGFQGFLALFFACM 
FFWPRFS PKTFFFFFYEKEFHSAAQAGV 
QWCDLGSLQPLPRGFK 


12011 


25912 


A 


12120 


267 


1 


KKKKKAFPSPPLLGCFIRLQKHLLGILV 
YRSHLISSLLCLEGIILSLFIIATLITL 
NTHSLMNI VP lAILVFAACKAAVGLAL 
LVSIS 


12012 


25913 


A 


12121 


167 


372 


ISGQDLLKETKRVKRPFQQDDVPFINIF 
VPTPGAPRSLRQILELRGKTDPSPIIVG 
DFNTLFSALDRS 


12013 


25914 


A 


12122 


124 


2 


GLLKFNIFAPFFFFFSETESHSVTQAGV 
QWCDLGSLQPPP 


12014 


25915 


A 


12123 


121 


1 


RFTAflSAWSGAVSAAVGYMGDPGS E 1 1 E 
SVPPAGPVSSW 


12015 


25916 


A 


12124 


176 


2 


RDLRGFSRFSNYKFWNFGRVGGI FKFVF 
GKGVKRFFFFFFETESCSVAQAGGQWRD 

LG 


12016 


25917 


A 


12125 


137 


2 


KLGFFGGGG6KI FAPQKNFFFFFFFFFE 
MESCSVAQAGVQWRDIG 


12017 


25918 


A 


12126 


103 


3 


LEENLQNTIQDIGRGKDFTSKTPKftMAT 
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peptide 

sequence 


Predict- 
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ding to 
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residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Piienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














KAKID 


12018 


25919 


A 


12127 


212 


351 


LITFEILNICLGDNILDLGLARWLTPVI 
PALWEAKAGGSPEVRSLR 


12019 


25920 


A 


12129 


187 


3 


RGSTMQQTNTRTPGVHVLyYILMIATMA 
IPYIPMANnLLLIVPILIAMAFIMLTER 

KILGY 


12020 


-^wi — 


A 


12130 


13 


329 


ASRVTIRVIvlASNSTKSFLADAGYGEQEL 

DajNrSALMKLDKGLR3GKLGEQCF.AV\T?F 
PRLFQKYP?PILINSA?LKLADVFRVGN 
NFLRLCVLKVTQQSEKHLEKI 


12021 


25922 


A 


12131 


231 


330 


MDTDEIYLGEAPWLTPVIPALWEAKASG 
SPEVR 


12^2 


25923 


A 


12132 


95 


2 


KGKCFPPFFFFETESRSVAQAGVEWEDL 
SSL 


12023 


25924 


A 


12133 


129 


1 


DLALLPRLWCSAMIMflHCNSELMSSSDP 
PISASQIPGTIGMCH 


12024 


25925 


A 


12134 


196 


361 


TIMEYYARMKKKSESPLLHFQGKKIjNQD 
GGWAWWPTPVIPALWKAKTGGSLEPRS 


12025 


25926 


A 


12135 


120 


1 


PPPPGLFFFFFFETESYSVAQAGVQWCN 
LGSLHPPPPEFK 


12026 


25927 


A 


12136 


235 


3 


KKG I RI ERDRLKQLS VCQRFQLAKF P P F 
PSGVGEKQNFFFKKKKKKKGRAQWLTPV 
IPALWEAEAGGSFEVRSSRPA 


12027 


25928 


A 


12137 


185 


1 


IWCPFFCPGPQICSERCHFFFPFFFFFF 
LEAETRSFAC2AGVQWCDLGSLQSSWVTE 
LAAAS 


12028 


25929 


A 


12138 


158 


1 


LRRGGVFSIFFCGGTMVLSPADKTNVKA 
AWGKVGAHAGEYGAEALERMFLSF 


12029 


25930 


A 


12139 


76 


3 


KVLAEAIRQEKEIKDIQIGKEEVK 


12030 


25931 


A 


12140 


157 


2 


FFSSPRLLKRRPGNFLGARENFFFFFFS 
ETESPSIAQAGEQWRDLGSLQAP 


12031 


25932 


A 


12141 


234 


388 


LSQLMSANLFFVLLFEAEFHSVNRLECS 
GMIMAHCNLHLPGSSESPASTSQ 


12032 


25933 


A 


12142 


112 


453 


LGRRQAASMREGISIHVGQAGVHIGNAC 
WELYCLEHGIQPDGHMPSDKTIGGGDDS 
FNTFFNETGAGKHVPRflAFVDLEPTAlD 
EVCTGTYRQLFHPEQLITGKEDASMNYA 

RG 


12033 


25934 


A 


12143 




443 


AKLGTRKLPLKAKMGKEITLINIWIGH 
TOWGKTTTTGHLIYKCGGIDKRTIEKLE 
KEAAEMGKGS FKYAWVLDKLKADRERGI 
TIDISLWKLDTSKNYVTIIDAPGHRDFI 
KNM I TGTS HADCAVL IDAAGV 


12034 


25933 


A 


12144 


3 


386 


REAATMRECICIHVGHAGVQNGNACWEL 
YCLEHGIQPDGHMPSDKTIGGGDDSFNT 
FFSETGAGKHVPRAVFVDLEPTGIDEVR 
TGTYRQLLHPVQLITGKEDAAKNYARGH 
YTIGKEIIDLVLDRIR 


12035 


25936 


A 


12145 


3 


386 


AGATYIDRLRVALFEATTFTLWVRPELS 
SGEATTMRECMS IHVGQAGVQNGNACWE 
LYCLEHGVQPDGHMPSDKTIGGGDDALN 

GTGTYRQLFHPEQHMM 


12036 


25937 




12146 


3 


385 


GRATYIDRLRVALLEATTSTLRLRRELG 
SREATTMREGICIHVGQAGVQNGNACWE 
LYCLEHGIQPDGQMPRDKTIGGGDDFFN 
TFFIETGAGKHVPRAVFGDLEPTDIGEI 
RTGTYRQLLHPEQLI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Ttireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12037 


25938 


A 


12147 


11 


381 


HTPEGRTRVPVLIAGWYCQEALRDWGR 
VTASSTGAMAFLRSMWGVLTALGRSGAK 
LCTGCGSRLRSPFSFVYLPRWVSSVLAR 
CPKKPVSSYLRFSKEQLPIFKAQNPDAK 
TTELIRRIAQR 


12038 


25939 


A 


12148 


321 


509 


YSMDQYFIPFYCQIMHLCFFLKTRVSLL 
FPWLECSGATSaHC2JFCI.LSSSNSPASA 
SRVT6IT 


12039 


25940 


A 


12149 


51 


400 


AATMRECISIHVGQAGVHIGNACWELYC 
LEHG IQPDGQMPSDKTIGGGDDSENTFF 
NETGAGKHVPRAVFVDLEPTVIDEVRTG 
TYRQLFHPDQLITGKEDAANWYARGHYT 
IGKE 


12040 


25941 


A 


12150 


25 


399 


EATTSTLGLRHQLGSRESTAMRECIS IH 

VGQAGVHI GNACWEI.YWLEHGLHPDGQM 

PGDQTIGGGDDSFDTFFSETGAGKHVPR 
AVLVDLEPTVIDEVRTGTYRRLFIIPEQL 
ITGFEDAANNYAR 


12041 


25942 


A 


12151 


1 


402 


TKEEELCLACRQRRADGGWRSQMNAGSD 
PVGIVSAARTIIGSFNGALTAVPVQDLG 
STVIKKVLKRATVAPEDVSEVIFGHVLA 
AGCGQNPVRQASGGAGIPYSVPAWSCHM 
ICGSGLKAVCLAVQSIGIGDST 


12042 


25943 


A 


12152 


3 


400 


TDRLRVALKKTTTSSFCFPHHLGSItETA 
TMRECISIHVGQAGDHIGYACWELYCLE 
HSLQPEGQMPGDKTIGGGDDYFNTFFSE 
TGAVKHVPRAVFVDLEPTVIDEVSTGTY 
RQLFRPERLITSKEDAVNIY 


12043 


25944 


A 


12153 


2 


398 


GRCALDHNTFIFARFGCYI.IASGHPGEK 
LMDMDMS PLRPQNYLFGCKLKftENDYHF 
IVANDENEIIQISLTTASLRAGANDDMNI 
VEAEAMNYEGTPIKETLATLKMSVQATD 
SLGGSEITPPSVIJILKCGSR 


12044 


25945 


A 


12154 


203 


405 


FTCPSRIICVISRRISPCCAPDLNPMLS 
ANAMLDFVFTVEDPGGWDSKNLQKKWSH 
YSFLTGIRPKI 


12045 


25946 


A 


12155 


229 


397 


TKIAHHKAGFALISKKITKNLKMFLSKF 
LPIHALWVTGSSGMQPYPLWGHYDLGK 


12046 


25947 


A 


12156 


2 


399 


GGVPHCVWATAWGMRPGLPGPTGLCAQT 
S SRGQKSVLKQKES CGIWQLYHFLSRKQ 
EPRWEPCVSGSSSGDGAVADLADELRGY 
PALCCTLPVHSYRSWAGIRPQIMNGPLH 
PRPLVALLDGRDCTVEMPIL 


12047 


25948 


A 


12157 


85 


414 


ALLPQSEALQGAVTMPHSYPALSAEQKK 
EVSD lALRI EAPGKS I liflADESVGSMAK 
RLSQIGVENTEENRLLYRQVLFSADDRE 
KKCIGGVIFFHETLYQKDDHGVPFVR 


12048 


25949 


A 


12158 


85 


407 


GLLPHSEPLQRAVTMPHSDPALYAEHKK 
DVSDIALRIGSPGKGILflAYESMGSMAN 
PLNQMGVENTKQNRRLYRQDLFRAEDRE 
KKSLAGVIFFHDTLYHKDDNGVP 


12049 


25950 


A 


12159 


S5 


406 


VLLPHSEALEGAVTMPHSYPALSGEHNK 
ELSDIALRIVAPGKGILAADESVGSMAK 
RLSQIGVZNTEENHPLYRQDLFSADDRG 
KKS IGGDLFFHDTLYQKDDNGGP 


12050 


25951 


A 


12160 


275 


146 


EKTFIISFFFFFETESHSVAQTGVQWHD 
LGSLQAPPPWFRRIA 


12051 


25952 


A 


12161 


212 


2 


KRSRPFFFFFKEMGSCYIAQVGMQWLFV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 

last^amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonlne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,7=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GTITACYDPELLGSKDPPTSASQVTGNT 
GTHTHTQLIFVFLV 


12052 


25953 


A 


12162 


253 


2 


GRVDSKCCLMIGNAEKSIFCCLLVYLFY 
LSSRQSHSVAQAGVQWHDLGSLQPPPPG 
FSLLSRVAGTTGARHHDWLIFVFLGET 


12053 


25954 


A 


12163 


228 


1 


LLGLWFEQLGRFTKIAKTKGERSQTQSG 
HSMDMRVPAQLLGLLLLWFPGSRCDIQM 

TQSEPAVSASVGDRVI ITCR 


12054 


25955 


A 


12164 




1 


RLLFFETESRSVARLECSGAISAHCSLR 
LEGSGSSPASAS 


12055 


25956 


A 


12165 


239 


350 


GQARWLTPVIPALWEAICASKSLEVRSLR 
LAWACaiGGS 


12056 


25957 


A 


12166 





359 


TDKAISKRDLSVLREIIFFEMESRSVAR 
MECSGVISARCNL 


12057 


25958 


A 


12167 


209 


396 


QFSGIPFDQYKCGFICNYIYIYIYIYIYI 
YIYIYRKYTRAYTKRYMCVFSRATHNIV 
YLCAYI 


12058 


25959 


A 


12168 


327 


130 


GGVGFLGGPGEKPPRPSPFFFFFFEKES 
RSVPQftRVQWPDFGSLPAPPFGFTPFLR 
ESFLFSSLMI 


12059 


25960 


A 


12169 


317 


422 


RGEKPLFFFLQKKKKKKKKKKKKNIKKG 
GRRYKPS 


12060 


25961 


A 


12170 


260 


375 


EKVTCSWTQWLTPVIPALWEAKAGRSLE 
FRSLRSAWAI 


12061 


25962 


A 


12171 


304 


3 


KFFFFFKGFFFLGGVGPIFPPPKKRFFS 
KIPPGVFFFPPLKKKIFFFFPPVILGPP 
RVFFKGAPLFFFFFFFFFFFSETEFRSC 
CPGRLECSVAVSAHCK 


12062 


25963 


A 


12172 


105 


3 


LIFLRQSaALLPRLEGNOAISAHCNLCL 

TGSSD 


12063 


25964 


A 


12173 


146 


17 


RIFFFFFLGQSLAVAQAGVQQRDLGSLK 
APPPGVHAILLPQEY 


12064 


25965 


A 


12174 


234 


2 


LFPRKPKPSVSLSLSLSLSLFRLRKLSS 
FMRHNNI EVRP INNPAMTS KCS SERKSL 
VSFTLNLKPGM IRLS EEGMLK 


12065 


25966 


A 


12175 


292 


380 


VRGLGRVAHTCNPSTLGGQSGWITGGQE 

F 


12066 


25967 


A 


12176 


109 


1 


GRI KKVI KGQAQWLMPVIPALWEAKVGG 
SPKVESSR 


12067 


25968 


A 


12177 


170 


291 


LINFYFYFLRQSHSVTQTGVQWCNLGSL 
QPPPAARRRRRG 


12068 


25969 


A 


12178 


260 


380 


LTY lALF I FLRRS LTLLPRLE CSGMIS P 
HCNLCLLGSSDS 


12069 


25970 


A 


12179 


54 


166 


PKRGFHRVTQEGLNLLTLGSPRLGLPKG 
WDHRQDPLP 


12070 


25971 


A 


12180 


273 


359 


APAGHGGSCLQSQHFGRLRQADHLGSGV 
R 


12071 


25972 


A 


12181 


27 


225 


IGQAQWLTPVILAFWEAKAGGSLEVHSS 
RPAWPIYFLVLLASYLRRFCLIQGHSDL 
LYTSSTRGS 


12072 


25973 


A 


12182 


1 


155 


VHKIFIAYKYSGFQTVFRGRAWWLTPVI 
PALWEAKANRSPEVRSSRSGRRG 


12073 


25974 


A 


12183 


91 


2 


FNLFFFFFFETGSRSVARLECNGAISAd 


12074 


25975 


A 


12184 


122 


2 


GQAQWLTPVIPPPWEAKVGKSPEVRSSR 
RTRGRTRGRTRG 


12075 


25976 


A 


12185 


389 


20 


TDRGRRAERACFCGKVFDGELSFALKLA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














REMGRPDWRAMLAGMS3TEYADWHRFYS 
THYFHDVLLDMHFSGLTYTVLSLFFSDP 
DMHPLDFSLLNRREADEEPEDDVLMQKA 
AGLAGGVLFLI 


12076 


25977 


A 


12186 


180 


393 


LNPGSGGCGEPRSRHCTLaWVTERESVS 
KNKIKIKINKQIEEGFLFQPPGQGFIWC 
YFPAKEFIIAIiSKT 


12077 


25978 


A 


12187 


37 


443 


PDFPIPFPPKKVQHLAKPSPKTAHPAWT 
IREILPPPPKKEHPPGIAEGKFGPRELA 
KAGLAKECPAGGPTPAVEFATPTRGGGG 
RVAGFQTPMQIPMQKARLGLLTFLGFAS 
GWIGAJfRPRETLCQGELVDTLQ? 


12078 


25979 


A 


12188 


3 


411 


AFPENAATGSTFQDPVPASKGRAEVGNM 
RLSVAAPISHGRVFRRMGIiGPESRIHLL 
RNLLTGLVEHERIEAPWARVDEMRGYAE 
KLIYYGKLGDTNQRftMRMADFWLTEKDL 
IPKLVQVLAPRYKDQTGGFTRMLQ 


12079 


25980 


A 


12189 




414 


QEFGTRKRATFISLLFFFSSVYSRGVFR 
RDAHKSEVAHRFKDLAEETLRALTLIAF 
AHYLHQCPFADHVKLGNEATEFAKTCVA 
DESAENCDKSLHTLFGDiCLCTDAPLRET 
YGEMADCCAKQEPERNECLLQHKDD 


12080 


25981 


A 


12191 


3 


514 


PRLLMEAGPHPRPGHCCKPGGRLDMNHG 
FVHHIRBNQIARDDYDKKVKQAAKEKVR 
RRHTPAPTRPRKPDLQVYLPRHRDVSAH 
PRNPDYEES6ESSSSG6SELEPSGHQLF 
CLEYEADSGEVTSVIVYQGDDPGKVSEK 
VSAHTPLDPPMREALKLRIQEEIAKRQS 
QH 


12081 


25982 


A 


12192 


182 


3 


RGLFFIPPPPIKTKIWGPFWKWFMGFG 
PPLFFFFEMESRSVAQAGVQWHDLGSLQ 
ALP 


T2082 


25983 


A 


12193 


53 


400 


PETPSWLGPVERFYFI IIKSVWKWKKKK 
KKKKKKKKKKKKGGGPFKKTLGGDKFYW 
GVKKKIFFFLGGSKKHPWGFFEKKLFFG 
GGKCGAPPPKDISCLWGBCKNFLGAIGEK 
TCCC 


12083 


25984 


A 


12194 


182 


3 


KGIFIQIAPPKKKKNWGPLWKWFKGFG 
HPIFFFFEMESRSVAQAGVQWHDLGSLQ 
ALP 


12084 


25985 


A 


12195 


200 


378 


KSSKSQWLGYLFIGLLIYLFIKFFLETR 
SCFVTQAGILAHCNLKLLGSSDPPTSAS 
QAA 


12085 


25986 


A 


12196 


341 


54 


SLSFHGGLSVLCIFSTYKKLSQEQWLTP 
yiSVLWEVKVSRSPQVRSLRPGDPPASA 
SQSAG ITGVSHHARPHLSLNPRLEMI KL 
SEEGMLKRMTG 


12086 


25987 


A 


12197 


1 


462 


GGPPRPFRMKAAVLTLAVLFLTGSQARH 
FWHQDEPPQSPWDRVMDLATAYVDGLKD 
SGRDYASQFEGYALGKQLNLMLLDNWDS 
VTYTFSKLREQLGPVTQEFWDNLDKETE 
GLTHEMSKDLEDVNAKVQTYLYDFQKTW 
QKELTFTAESKPVP 


12087 


25988 


A 


12198 


80 


1 


SNKVFFLETESCSVAQAGVQWCDLGS 


12088 


25989 


A 


12199 


221 


3 


TMGIMLDKKEIQAIFSFKFKMGHAVAET 
TCNINNTSGPGTANKVTVQWWFKKFCKG 
DESLEDEEHHGRRLEV 


12089 


25990 


A 


12200 


239 


336 


FGALPITTCKFGKKKKKKKKKKKKKKKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 

correspond 
Ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 

location 

correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidlne, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKKKK 


12090 


25991 


A 


12201 


279 


366 


KKKKKKKKKKKKKKKKKKKKKKKKKGFK 


12091 


25992 


A 


12202 


153 


40 


GFLFLI FFFFETKSRS I IQAGVQWCHLG 
CTIAWATETD 


12092 


25993 


A 


12203 


317 


406 


LGKLKCNGAVSGHCNLRLPGSSDFSGSA 

S 


12093 


25994 


A 


12204 


2 


925 


RSGPDHELTLDSGKGCTRSMVPASAAS E 
DRRKLPIIVEDEGGPTRSRACSSPARGS 
RPPPSAIGCSPVAQASDSAAGPARRTAL 
QSLSSWLGYQIDRHSVPVYVFKSPLFSV 
IMAPKHKSSDAGNLDRPKRSRKVLPLSE 
KV3CVLDLIRKDKKSYAEVAKIYGKNESS 
I RE I VKKEKE IHASFAVS PPTAKVTATV 
RDKCLVKMEQALHLWVEEMNRKRVP IDS 
NMLRQKALSLYQDFSKGCSETDTKPFTA 
SKGWLHRFRHRFSHHYKKKKKGIMAQVA 
VSTLPVEEESSSETRMWTFLVSALESM 


12094 


25995 


A 


12205 


272 


1 


KMAR2iGLLVIEGKVWRTVYYRFATREER 
EGKMSTNLMNKLDTIGFDNKKDLLISVG 
DLVDRGAEKVECLELITFPWFRAVHGNH 
ERQPGQ 


12095 


25996 


A 


12206 


166 


478 


ISASGLLPTSPLTGTSKLQDPNEHLNLL 
MLNRVSLLLPRLECNDTIIAHYNLRLPG 
SSNSPDSAFQYRGITESFTCEINRLKDS 
S QVALWS FRCEMDHSS ICNR 


12096 


25997 


A 


12207 


102 


497 


PLLALLFGSQKTFLdQLSLKPTRSSLL 
SPGSAGNPEWEAPCPQLNPEATSLKKkK 


12097 


25998 


A 


12208 


131 


2 


FGFLGGDBCAKGKGPGPPFFFFFETES PS 
VAKAGVQWCaDLGSLH 


12098 


25999 


A 


12209 


221 


499 


QTRHKLTNSRSGRTGLQILIKEQHSSGR 
WCGTGDILWQSLEDTICYSVPKSLLDFL 
LFIYFETESHFVTRLECSGAISAHOSLR 
LLGSSDSPA 


12099 


26000 


A 


12210 


297 


398 


HNFFFFLKTES CS I SQAGGHWYDLSS PQ 


12100 


26001 




12211 


341 


452 


NWSPGLKQSSRLSLPKCWDYRREPPCIA 
GLLAYLFRR 


12101 


26002 


A 


12213 


214 


468 


FFFFFFWGKGVFFFFPPLEGRGGNSVYW 
TPPPRGKKNFPPLGLGKMGNWPPPPPPR 
FFFFFFKKKGVFFLGPGGVKTPSLRETP 

P 


12102 


26003 


A 


12214 


223 


477 


CPTETQLQLIFAREKHKPSPVLLDLDLY 
GFFFEMESHSVSPLECSGPISAQSNLYL 
LGSHNSPA 


12103 


26004 


A 


12215 


263 


55 


TPKRYGKMQTATYLPKSKIYIIISDLRK 
QDYIVYKQSKKPEGWAQWLTPVIPALWE 


12104 


26005 


A 


12216 


328 


471 


GVIHAQVLHAHTHTHTHTCTHTQVHAHS 


12105 


26006 


A 


12217 


434 




WDHSASP PTSVGTPT ,R ADGPHPRT ,TAPG 
HWGSSAMLSAPAAGRRGLSLGRWCDRD 
KLSQRGPQPKWILWWGNLPGGGFLHRHI 
HKPLPTI PI PSTVHGEEVLQQLLHSAS F 
ILFIFIFLDRVSLCHPGWSAVARSRFTT 
TSTS 


12106 


26007 


A 


12218 


304 


43 


EKP I GVAPS RVKKRAGGGE KTE CCLL KG 
PT,QGFQGVT,KGPI,KI,NGPI,FFFFFLRRS 
LALFRLECNGTIDAYCNLCLLGSSDSPA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


nucleotide 
location 
correspond 
ingto first 
amino acid 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unltnown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SEI 


12107 


26008 


A 


12219 


171 


365 


ALVQPRAGREAVTVPACCSCSDVFLYET 
NKVARIQSINYGTI KFFHVIVFSYVSKE 
HLKRKLEK 


12108 


26009 


A 


12220 


171 


452 


ARYKFLHPKFAFGLYPSSSQIQGAHHQL 
WILIKCSFYLLFLRQCLTLLPQLECSNM 
ITAPLQPQLPGLKQSSHLSLPSSWDYRH 
EFP 


12109 


26010 


A 


12221 


323 


458 


FRVQRSLFLFIYFFETESHPWQAGVQW 
HDFGSLHRISP 


12110 


26011 


A 


12222 


189 


56 


DRVSLVQPTLECSGTISVYCSLNLLGSS 
DPPTSASFVASTTGGI 


12111 


26012 


A 


12223 


246 


494 


FLCSSYAVDDHELEPFFPIHLILDITAF 
SQDTIISNLLFQDACSVPKKAAMLGQAQ 
WLAPVIPVLWEAEVGRLLEVRSSRPAW 


12112 


26013 


A 


12224 


55 


353 


RIQGCSCWVKLCPKVEKWVQRTDAEQES 
QTKAEIQDMKQELSAVNMMDKFARSARL 
ERKINKMMNKLKTHVKVQTAQSGMLKWV 
ISVAFYKLPQTVIRL 


12113 


26014 


A 


12225 


199 


23 


LKLSSIYYVPQTILKYFTGWARWLLPVI 
PALWEAEVGGSPEVRSSRPANSTSWIER 
HQ 


12114 


26015 


A 


12226 


369 


14 


DCVRVGLSYPQTVCPSCESVRKRPFSSR 
PHRGVLESGQPRCKSRRVDIFFSMERER 
DRERERERERERERGQKRTRRVWGGERE 
KLGYSHAWGWRGGReLGWGGMCGIPPPC 
TSDVHQ 


12115 


26016 


A 


12227 


206 


2 


QNGHMNGGPCRVSLSLFLETRSFLFFLA 
SLSMLECSGAILAHCNLRLPGSSNSPAS 
ASQVAGITGACH 


12116 


26017 


A 


12228 


231 


501 


RSWEGQAIiHGSDPLASCTRIQSWYMaLQ 
RINQELEDKLYRMGQBTYEEEKRALSHEI 
VALNSHLLEAKVTIDKLSEDWSAAHTW 
PMEDCR 


12117 


26018 


A 


12229 


86 


455 


DLPTFPVPPHPGFSGPLLGQSELPSNCQ 
TGGRSHLSVSVLCPTLATTPLGLDFPTC 
GTSQLPLGPLQLAHRSELAGLSLPGLST 
NCTSQRLPDRPHRHSDAPHTHTHTQTHT 
HTHTHTPATVT 


12118 


26019 


A 


12231 


211 


96 


KSHFKAGGGGPPRYSPPFGGGGGGVFRV 
GGFPPPGLTQ 


12119 


26020 


A 


12232 


228 


3 


KRWFIKGFLSKGLGPPPPKLKFLGFFFF 
FFFEQSFVLVAQAGVQLECSGTILVHFN 


12120 


26021 


A 


12234 


137 


3 


VPLPGTLKNHSMVPCFLFVCLFVCYETE 
SHSVAQAECSGAI SAH 


12121 


26022 


A 


12235 


139 




FKLCFGQAWWFKPI IPALWETDASRSLE 
VHSSRPADAWADAWADAW 


12122 


26023 


A 


12236 


80 


1 


VLL I FI FLEEMGFHHVGQDGLDLLTL 


12123 


26024 


A 


12237 


187 


494 


TERNSFNINKKDIHTKTPFVGYQHQRPK 
VDKTTKMGRNQSRKAENSKNQSTSSPPK 
DRSSLPATEQSWTEMDFDELTEVGFRRS 
AITNFSELKEHVLTRRTE 


12124 


26025 


A 


12238 


252 


503 


CQGVGADDLQSQAE I QCELCAES IDRFD 
ILFFFFLGLGPPHKKPPPPTPQKWKPGK 
APPLALWDIPRGAKGPGRQPGRPAIHTC 


12125 


26026 


A 


12239 


2 


471 


RIALCPAVRIRHEERERERERERERETP 
GHTQLSPGARRTPPLEREEAFNDCFSLN 



1277 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 
sequence 


Predict- 
ed end 

location 
eorrespon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptopIi,m, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














yPGNLCPGDLIEVFRSVYQHCALYLGDG 
YFINITPVYGIPAFFTSAKSVFFSRALE 
KTHLLKDWGSDTYRINYICYETLPSLS 
VEEIITRSEFVIGQEA 


12126 


26027 


A 


12240 


14 


149 


FMTACRIRHEMGRGLLRGLWPLHIVLWT 
RIASTIPPHVQKSALPH 


12127 


26028 


A 


12241 


588 


671 


RSRPSFQEQAFESSQKYKEGKYI lELNH 


12128 


26029" 


A 


12247 


384 


464 


RGGQVWWLKPWPALWEAKAGRSLEAG 


12129 


26030 


A 


12243 




439 


TYLFIYLFIYFETESCSVAQAGVQWRDL 
VSLQPLPNSPALPTN 


12130 


26031 


A 


12244 


323 


457 


SLQRECQPADTFFEMDSCSVAQAGVRWC 
DLGSLQNSP 


12131 


26032 


A 


12245 


241 


466 


GALFFFFFFFFFFFFFFFPFFLSNKAVY 

F 


12132 


26033 


A 


12246 


385 


37 


QARRECLQTSEMEALGRWSYKVTIEFPL 
QILFFPPILKYKVIFGIKITPNFHFLEE 
YEEGQRLGKSTVSWVFNEDTRLINIMMR 
HITVNPRTNSQNRMLKMAHESKYSQVPF 
PLQA 


12133 


26034 


A 


12247 


318 


436 


LILKYFMYVCMYVHMYVCMYIKSCSVAQ 
DGVQWYDLGSLQPPPPGL 


12134 


26035 


A 


12248 


362 


473 


SPCQNNMNSAKTEAETNIKFMAKLMWKN 
GAIIDALQK 


12135 


26036 




12249 


310 


463 


FS IKTRKNALGKLQSLLRCHQFYLYVCV 
CVCVCVCVCVCTRTYAILTCIAV 


12136 


26037 


A 


12250 


169 


54 


EAKSGQARWLTLVIPALWKAKAGGCPEV 
RSSRLAWPIW 


12137 


26038 


A 


12251 


49 


449 


GIPATSTSCVQVILLPQPPLLLGLQDS F 
LPQEIIIKVEGEDTQSLTIPSQEGVNFK 
IVTVDFTREEQGTCNPAQRTLDRDVILE 
NHRDLVSWDLATAVGKKDSTS KQRI FDE 
E PANG VKI ERFTRDD PWLS SC 


12138 


26039 


A 


12252 


130 


428 


RWGLAMLPRLVLNSWSQVILPPQPQVGA 
TFGGFPNPPGHWKLLGGISSGEEAGVEE 
AEEGQALGFLGQFPPSSPQLLLQTSWCL 
RTTRTCRWSLLWKAH 


12139 


26040 


A 


12253 


204 


54 


PGMHKTQKSDLGQVQWLTPVILGLWETE 
AGDSPESRSSRPAWAKHKIVR 


12140 


26041 


A 


12254 


375 


470 


SLCL F ?s f:-' FQTE PRCVAQAGVQWRHLS 
SLQP 


12141 


26042 


A 


12255 


432 


512 


GQVRWLTPVIPALWEAEGGRSPEARSS 


12142 


26043 


A 


12256 


232 


1 


KGALSKKTQSFWGGQKFFPRPGVFKGVF 
FWGSGFCFFVPGGFFFFFFFFETESCSV 
AQAGVQWCSLGSLQPLPPRFK . 


12143 


26044 


A 


12257 


22 


123 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 

SSSTRP 


12144 


26045 


A 


12258 


71 


362 


QTERNSISINKKDVHTETPSKGHQHQRP 
KVDKSTKMRKKQRKEAENSKNQNASSPP 
KDHNSLPAREQNWTENEFDELTEIDFRR 
WVITNSSELKEHV 


12145 


26046 


A 


12259 


375 


3 


SPNIEAPPKAFFFFNQRHPVPCFFSCRA 
ENTHEIVWVKVYPFSNQSKAKLFFSFFV 
QRDRS S PDS YLMPLQLQMMDTAGQERFR 

S TAQSYYRSANALILAYGITCEESFRCL 
PEKLRQIEQYA 


"12146 


26047 


A 


12260 


98 




KYQ I DLC-GRGS SICS PASASRVAC- 1 TGTH 
m-ITQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

imilotcfd 

peptide 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
lastamino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, IVI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireoniiie,V--Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12147 


26048 


A 


12261 


125 


3 


LFLIQKYITGQERWLTPVIPAPWEAEAG 
RSPKVRSSSPAW 


12148 


26049 


A 


12262 


228 


385 


SCLYLQMIPSS lEKSQGMYKKATRGRAR 
WLTPVIPALWEAGAGRS PEVES SR 


12149 


26050 


A 


12263 


227 


369 


MLKHKSNKNLQDLGQMQGLTPVNPVLWE 
AKVGGSPEVKNSRPAWPTW 


12150 


26051 


A 


12264 


102 


487 


QQNRFFGTERGRGTGRGVSSFPQLSPGS 
LELASSPSPSVGRLGSAaSQRSRGGQER 
PKAQLRRPPRPSSSPRLRGRENCAHHAR 
TPGGILDRPPPTLGTNLLKGGRPAWMAG 
NPPEAHKTANGPGEQA 


12151 


26052 


A 


12265 


391 


476 


PRSYEVECSGSISAHCNLGLLGSSNPSS 


12152 


26053 


A 
— 


12266 


378 


476 


MSSRLGSLEVGILGRAWWLTPVIPALWE 
AKEGG 


12153 


26054 




12267 


166 


58 


NFFFFFFETDSCSVAQAGVQWHDLSHCN 
LCLPASSN 


T2T54 


26055 


A 


12268 


209 


49 


TPPNFYIKISRSGQAQWLTPVIPALWEA 
EAGRSPEVRSSRPASPTRRNSISNS 


12155 


26056 


A 


12269 


127 




SSSFFPLKTFGFLFFFLCFFESESRSVA 
QAGVQWOILGSLQA 


12156 


26057 


A 


12270 


154 


27 


IQTPGKKEFFFFFPETKSCLVAQAGARL 
CPKKKEKENSNFVL 


12157 


26058 


A 


12271 


180 


22 


VLKIHNIGRAQWVTSVIPELWEVKAGGS 
QALRSSRAARTRWDACaiFVEKTQVY 


12158 


26059 


A 


12272 


207 


422 


GDSSTRGADEKPKEGVKTDNJNJSLINLKV 
MGQDCSMEQFKIKSHIPAGYGGSHRESQ 
HFVMPRWADHLRSGV 


12159 


26060 


A 


12273 


59 


479 


NSLGGGGVYGSRFRFTFPGCRALSPWRV 
RVQRRRCEMSTMFADTLLIVF IS VCTAL 
LAEGITMVLVYRTDKYKRLKAEVEKQSK 
KLEKKKETITE SAGRQQKKKI ERQEEKL 
KWNNRDLSMVRMKSMFAIGFCFTALMGM 


12160 


26061 


A 


12274 


354 


439 


VKTGECSIAKTWNQPKCPTTIDWIKKMW 


12161 


26062 


A 


12275 


366 


499 


TRAQYFWWGIGSQCVTLECSGMIMAH 
CSLDLLGSSDPPTSAS 


12162 


26063 


A 


12276 


136 


37 


KFPGQVHWLKPVIPALWEVKAGGSPEVR 
SSRPA 


12163 


26064 


A 


12277 


419 


48 


DGKLNVS FLTVS S SGCPLI FLLI VRPPY 
FLRYNNSEIRPINNPTMTCKCSSKRKSR 
ICLTFNQKLEIIKLS EEGMS KIKTGQNL 
GI LCHVSQWDEQHKFVKATKSATAKIN 
LSISKYIMSGA 


12164 


26065 


A 


12278 


190 


1 


RKGVFSPQMGPGLGKGKSPFLAKLNFPF 
KKTFFFFFETGSCFVAQAGMQWLDPSSP 
DAWADAW 


12165 


26066 


A 


12279 


276 


389 


WFNLSFFCFCFLFFETESCSVARLECSG 
AISAHCNLT 


12166 


26067 


A 


12280 


425 


1 


PPHPPPGKPPPPPRPELFFPKTKHKKER 
VLFFSPPPKKNFFSLTTPRFFFS PPQKK 
KKKLNPPPQKGPPPPVPIKPPPPLFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FF P FFFFS FGLTVRV IMTCWLIVDIGLL 
I 


12167 


26068 


A 


12281 


253 


90 


IWPPQGSFKTAAPFFFFFVFLVEMGFCH 
WQAGVKLLTSSSPPASASQTAEKLF 


12168 


26069 


A 


12282 


61 


387 


PKSSSSAPFQISSWFSLPASTSASTMS I 
RVTQQPYKVFNSGPWAFSSSSYMSRPSA 
HISSLIVSRVGSSSTSFQGGLGAGMGLP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 

ing to first 
fcTu'cnce 


Predict- 

otide 

location 

correspon 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HjHSSIiANRARLSQKKKKRGGRS kep 


12169 


26070 


A 


12283 


186 


305 


LHHSTVQVREEEHKSIiHTQVHTHTHTHT 
HPHLVIMSSHS 


12170 


26071 


A 


12284 


2 


190 


PRVRFDRVNNRSLNPLISRTMRIEP I PE 
NPKFSVKKKKKKKKKKKKKKKKKKKKKK 


12171 


26072 


A 


12285 


17 


197 
















KKKKKKKKKKK 


12172 


26073 


A 


12286 


158 


405 


GVASYAEQRKYFLEMEYFPGGEAIS IVE 
MTTEDLHYSINIiDDKRPTGFERVYSHFE 
RTCTDDKMLSYCITCYHEIFCERNKIi 


12173 


26074 


A 


12287 


364 


142 


GCAFFLGGPLKERRKRHAVPPLFFFNFF 
PETESCSVTQAGVQWCNLGSLQPLLIDW 
WGQPGTVAHACHPGTLGG 


12174 


26075 


A 


12288 


171 


380 


VFEAGHAWLQENLTSPDFWGFSFFPTDT 
GMLE KKKKKKKKKKKRKKKKKKDR 


12175 


26076 


A 


12289 


194 


2 


GPRGFWQFWVTKGFFPKKGFVKFYPPAG 
GWFFFFPKRGFPFPSFLFFFDRVWLCHP 
GWISVARS 


12176 


26077 • 


A 


12290 


179 


416 


1.IFLFTFACAVCFQLARFHFKIHIETAL 
RCLIILSCPFTHPLYKMYREGQARWLTP 
VIPALWEAEVGGSPEVRSLRPAC 


12177 


26078 










LAYGFHDRTT YDTP SALMtWLLVDELI T 
GNKNGSGRAPEYLPEDFRDGEDDAAVTL 
EKQEDLTTLVTLPVTLGEHQRQRETQLE 
AKLLKKRLELGSLLDLVEDLELIIQLKK 
KKKKKKSPLWGAPFKGPLGAPI 


12178 


26079 


A 


12292 


119 


278 


PKSIEAGVKSVLDHPLPNICAKTHLSCKK 
LKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKK 


12179 


26080 


A 


12293 


119 


3 


MLFTGFHTKVMSLNSITGWAWWFTPVIP 
ALWEAEVGKS 


12180 


26081 


A 


12294 


205 




ILHFYSTETKQPKGLFLALFYSDCEQEK 
RKGGKLKKI FKKAQGQARWLTPVI PAJW 
EAKVGRSPEVGS 


12181 


26082 


A 


12295 


215 


1 


AFKIKLTMHIFSSPLCLPQDEFHPFIEA 
LLPHVRAIAYTWFNLQARKRKYFKKHEK 
RMSKEFHGARLEGNQ 


12182 


26083 


A 


12296 


185 


411 


DVPLMFHWLHLHSDFRIISTEIGVETVG 
KRVLHIRILLLLLLLFFEMESCSVTHAG 
VQWCDLGSLQAP P PGFTPF 


12183 


26084 


A 


12297 


281 


388 


RILFFFFFETKFCSVPQAGVQGRDLNSL 
QPPPPGFM 


12184 


26085 


A 


12298 


178 


3 


TLLSVTQVP ILPGSASSPLLFSRKDIKI 
PTSVFFS FQTE PRSVTQAGVSWCDLGSL 
QP 


12185 


26086 


A 


12299 


199 


450 


TLI RHMI CNYFLLSHRS PFS SVD YFLCY 
REIYSLIKTLNKLGIGGFFLNIIKVMYD 
KPAAS IIRSSEKQKAFPLRSGTRIRCPL 


12186 


26087 


A 


12300 


3 


407 


EFKDHSTAMDTEPNPGTSSVSTTTSSTT 
TTTITTSSSRMQQPQISVYSGSDRHaVQ 
VIQQAI.HRPPSSAAQYLQQMYA2^QQHL 
MLHTAALQQQHLSSSQLQSLAAVQASLS 
SGRPSTSPT6SVTQQSSMVQTSI 


12187 


26088 


A 


12301 


227 


3 


KRGTEGQTLFFPAPEARKSFPPPKMPKK 
VEKEFPKQFFFFFFFETESAPSPRIiECS 
GAISAHCSLNPmSAHAS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

peptide 


Predict- 
nucle- 

location 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamlc Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12188 


26089 


A 


12302 


292 


380 


KC YRWARWLMPVr PALWEAKAGGSLE PR 
N 


12189 


26090 


A 


12303 


13 


186 


VDERHIRLSEMAWVALYLLSLLWATAGT 
STQTQSSCCEYDVFPEKSGKQLPWGRGC 
DL 


12190 


26091 


A 


12304 


253 


409 


ITAVIAAAAAWGVTIIMTNFRSLPRLK 
CNGAISAHRNLLLLGSSDSPASAS 


12191 


26092 


A 


12305 


153 


64 


GRAQWLMP 1 1 PALWEAKAGGPT JDVRS SR 
PA 


12192 


26093 


A 


12306 


265 


420 


GTLMSRNTKLQLCKMNKSRTLGQPWWLT 
PVIPTLWEAKAGGSLEARCSRPAW 


12193 


26094 


A 


12307 


295 


-j 


RPCLKKKKTSPHWKKKFANPLFGKAPVF 
PLDKKPFKPHYRESPRPLKKKNGPKGFF 
FFFFKTKSCSVTQAEVQWRDLGSLQAPP 
PGSYHSPSKLIIIOSr 


12194- 


26095 


A 


12308 


247 


421 


SWCCLKKKKKKKKKKKKKKKKKKKKKKK 
PGGGALKKKKIFS 


12195 


26096 


A 


12309 


168 


3 


GQALLLLCLTVAFSKTTWCHAPILGWA 
WWLTPVIPALWEAEVGRSPEVRRSRP 


12196 


26097 


A 


12310 


117 


3 


VA7NIAVFRGGVSLLLPRLECSGVISAH 
CNLRLPCSS 


12197 


26098 


A 


12311 


256 


101 


EPPTPFFFFETGSCSITQAGAQRRDHSP 
PQSPTPGLSNTPAPASRACBSGR 


12198 


26099 


A 


12312 


146 


3 


FVALCIQLFPLVWVICBHKKHKTGRARW 
LMPVIPALWEAKAGGSPRV 


12199 


26100 


A 


12313 


207 


3 


VRVRTTGVIMALRGLCFSVEESMTKDEL 
lARLRSLGEQLNRDVSLTGTKEELALRV 
AELKQELDDSR 


12200 


26101 


A 


12314 


282 


2 


NCIuAEICVKFMVSAI SLQPQGISILFLSF 
FFEKESCSVPQARVQWPNFSSLQSPPPG 
FKLFFCLS PGGGGCSEPKS CRC IPGWQQ 
WETPSQKKK 


12201 


26102 


A 


12315 


298 


84 


KTPQPPVFALSKNFGPPKTAPIFFFFFF 
FETKS YSVAQAGVQWRDLGS LHSSLGHR 
ARLHLKKRKRKS INL 


12202 


26103 


A 


12316 


124 


1 


SKGHFFFFFFSETESCSVAQAGVQWHNL 
GSLQVPPPRFTPF 


12203 


26104 


A 


12317 


132 


398 


WTNAVF S I VT F S P S VCH S E VALAAY KW 
LVCYLLRETYQKLNQEIKPPTLSVPKKK 
KKKKKKKKKKKKKKKDKKK 


12204 


26105 


A 


12318 


262 


392 


PHYCQNQPQARRSGSHQHFGKQRREDHL 
SLGVRDQPGQYSETP 


12205 


26106 


A 


12319 


168 


3 


EAPKAISNCTTALQPRLQGKKLSQNKEG 
QVQ CLTPVI PALWEAKVGGS PEVRS S 


12206 


26107 


A 


12320 


48 


385 


DLGLSGHSTLASS F I SLLLLS YKKCLS P 
STMIMRLPLPCGSPIKPPFLPKKKKKKK 


12207 


26108 


A 


12321 


218 


83 


NLSPIFPEGNFFFFFFEMESRSVAQATV 
QWRDLGSLQAPPPRRKF 


12208 


26109 


A 


12322 


197 


1 


KETWPFPPKKEQGLFSFFKKRIFFFFSK 
KIPSPPFFFFFFFETESCSVAQAGVQWH 
DLGSLQAPP 


12209 


26110 


A 


12323 


128 


36 


KGLFFFFEMGVSLLPRLECSGTTSAHCU 

LP 


12210 


26111 


A 


12324 


140 


3 


ILNSNLKPIKKNTFFFETESRSIAQAGV 
QWHDPSSKQPPPLGFKR 


12211 


26112 


A 


12325 


273 


369 


RKEWFLKKKKKKKKKKKKKKKKKKTKKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=lJnkno-wn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKK 


12212 


26113 


A 


12326 


140 


415 


RTC3LLGPGRLQGPPYGSREYKGSGLLDL 
NASLLDPEVWAPS LSGPETKPPAHS PSD 
PGVHSSCSASQGSQGPGPLTPSSRIHEP 
KPPGPFFK 


12213 


26114 


A 


12327 




506 


FVPLSETEAPGCSGSRPPELEPERSLGR 
FRGRFEDKDEQLEEEEELEEEEEEEEED 
MSHFSLRLEGGRQDSEDEEERLINLSEL 
TPyiLCSICKGYLIDATTITECLHTFCK 
SCIVRHFYYSNRCPKCNIWHQTQPLYN 
IRLDRQLQDIVYKLVINLEEREKKQMHD 


12214 


26115 


A 


12328 


1 


419 


FRVPS CARCQYLPLLRGflSQLGYSGVRD 
PVEEAVCPFSDLKLCAGRTTTLFKAVRQ 
GHLSLQRFLLP FVWLCPflPRSGVYRGRQ 

ASLSCGGLHPVGASWPRCLPTQASAMAG 

TPPPAGLPPCSLISDCCASNEHGSMGV 


12215 


26116 


A 


12329 


259 


414 


NMENSLROTI-JVPKLAFVLFGASLLSMIL 
QVTGFQIKAFTALRFLSEPSDAVT 


12216 


26117 


A 


12330 


302 


404 


SCYRVSVISQftRWLTPVIPALWEAEVGR 
IPEVGS 


12217 


26118 


A 


12331 


94 


375 


SRDMS PGLLTTRKEALMftFRDVAVAFTQ 
KEWKLLSSAQRTLYREVMLENYSHLVSL 
GIAFSKPKLIEQLEQGDEPWREENEHLL 
DLCPQWSAMA 


12218 


26119 


A 


12332 


7 


246 


NPIVDPSPCGGIRVETPAGRGGPALRVR 
PETWEEAGEKMPSE SLCLAAQARLDSKW 
LICIDIQNCFITRKI SLLPLFCHHL 


12219 


26120 


A 


12333 


150 




YIYFFHLVRGLPGSGYDEYFVWWNTE 
SPSVTQAGVQWCHWSVQHKPP 


12220 


26121 


A 


12334 


34 


426 


EPGFLFVFFFLIRGKKTPLFFWGTRKWP 
KSGKPLFTPPGKSP SEPPKSRGGPLFFQ 
TRSRLTPQSWFGQAPKPPLGGEPPKPWE 
AGLGGDPPTPEAPHRADQTLEQPIQAPT 
RI IQNEQWAIKPFQPMPGG 


12221 


26122 


A 


12335 


185 


3 


TVITPLHFSLGHRAELWSRVPAPPAEWR 
SMEGGLGRAVCLLTGASRGFGRTLAPLL 
ASHE 


12222 


26123 


A 


12336 


60 


316 


GWGPIPSTDVTVLSPQKPILLQGHERAI 
TQI KYNREGDLLFTVAKDP VSVGWRGSG 
RGGILLPGRWVDTPVLPLAGNQRASHCE 
K 


12223 


26124 


A 


12337 


3 


167 


SFRIQVQGNHTSKHYPKIQYSDQAQWLT 
S V I PALWEAKTGRS LE VRS LRAFRAP 


12224 


26125 


A 


12338 


238 


401 


RPLSVMCGRLTS IRVEDSGARCWFYLFI 
YETEFRSVAQAGVERRDLGSLQTPPSG 


12225 


26126 


A 


12339 


354 


433 


GWAQWFMPVIPALWEAKftGVSPEVRS 


12226 


26127 


A 


12340 


238 


382 


LLIVYUMMLLPDAVAFTCNSSTLGGQGR 
RITRGQEFETSLANMVKPHL 


12227 


26128 


A 


12341 


101 


1 


KTKQTNKKTRSVGQTWWFTP I IPAYWEA 

QAGEL 


12228 


26129 


A 


12342 


276 


23 


GSCLLEGKLTNRKDIHTKTPSVRHHCQR 
PKIDKTTKMGRNQSRKAENSKNQSSSSP 
S KE CS SLAATEQSWMPHDFDELREEGFR 


12229 


26130 


A 


12343 


122 


1 


YMGVNERGCGQSIFKSSLSSQLWWWAPI 
IPATQEAEAGEW 


12230 


26131 


A 


12344 


180 


360 


LHWYPFSVCDFtPPPPPPLDDSSALtS 
ISGNFPPPPPLDEEAFKVQWAEVKUML 
GKS 
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peptide 


Predict- 
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acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, r=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionlne, 
N=Asparaglne, P=Prollne, 
Q=Glutamine, R=Arglnine, S=Serine, 
T=Tlireonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=lJnknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


12231 


26132 


A 


12345 


35 


411 


TMTIVDKASESSDPSAYQNQPGSSEAVS 
PGDMDAGSASWGAVSSUroVSKHTLSLG 
PVPGAWYSSSSVPDKSKPSPQKDQALG 
DGIAPPQKVLFPSEKICLKWQQTHRVGA 
GLQNLGNTCFMIA 


12232 


26133 


A 


12346 


278 


386 


IIYCYKQTIIGQftRWLTTLIPSLWEAKV 
GGSPEVRS 


12233 


26134 


A 


12347 


111 


2 


KSQTGQVQWLTPVI PALWEAEVGGLLEL 
RSSRLAWA 


12234 


26135 


A 


12348 


215 


316 


uskriwisfikkkkkkkKkkkkkkkkkk 
kkkkkkkggpl 


12235 


26136 


A 


12349 


265 


404 


SNISYFLNMSIFSKALSUSriKYMISQaRW 
LTPVIPALWEAKVGGLPE 


12236 


26137 


A 


12350 


372 


1 


PVFPLPPKKLGEQLPPPALRFLAVSPLP 
KAAHEQEIKEKVLAVHKNPIDPVYGFPK 
KGPTNFFLFPLFQRVIFLGAQKGVSWFG 
FYVKGLSLAPKLGGPPFFFFFETESRSV 
AQDGVQWCDLGS 


12237 


26138 


A 


12351 


318 


416 


QGRAQWLTPVILTLWEAKAGGSSEVRSS 
RPAPP 


12238 


26139 


A 


12352 


144 


1 


NGYAIVECWLFCFVLFLRQSLSVAQAG 
VQWCaSTLGSMQPPSTSLVQai 


12239 


26140 




12353 


116 


1 


SGRYSFKKMKCGQERWLASVIPALWEAE 
VGRSPEVRSL 


12240 


26141 




12354 


102 


2 


BTRYKKQPGGWERWLTPIIPVIJJEAEVG 
GSPEV 


12241 


26142 




12355 


112 


294 


LISPSCPMWPSPHGPTMLSRLISQHSPS 

NLKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRGGGAF 


12242 


26143 


A 


12356 


138 


2 


SFFLKVFFFFFFEMESRSVAQAGVQWCN 
LGSPQAPPPGSRHCPVD 


12243 


26144 




12357 


148 


394 


PGEPPGEKGEGGKGKKPGGRGWGEPPCR 
GGFKEKFPRGKEKAPPLKENSRENLGWG 
TKIPPLGKKKPPPPQEQEMILGPPQF 


12244 


26145 


A 


12358 


97 


3 


EKFS P CFVRARTHTHTHTHTHTLSLSLF 
GFH 


12245 


26146 




12359 


204 


2 


KHYTPAAC FAP CLPDEAP I lAAAKPATT 
TS EQKMAVP PKYANLGKSARKVFNKGYG 
YGLIULLKTKS 


12246 


26147 


A 


12360 


156 


391 


NRGLFKVEESFSIPCSVRCSIHPSALVS 
PTPPTTDTTMWALFFETESCSIHQTGMR 
WRDLNLLQPLPPGFKEFSCLNL 


12247 


26148 


A 


12361 


179 


1 


ALKKLLI IDDNSHKTKIVKHFSFRWFTF 
LFLLETESRS lARLECSDANPTHCNLRL 
SGP 


12248 


26149 


A 


12362 


218 


391 


TI LKDCTFKMI CLSQAQWHI PWPATWE 
AESHGLLEPTS 


12249 


26150 


A 


12363 


2 


356 


TNSHVDWS ITQKPEFC IPRCCGYIVCQL 
QLVERAFIFNFFFFFLKESFVLLPRLEG 
RGAIFRVTQEGSNLLTI^PAPLGLPKGW 
NYRGDHPGPAYFYFLMGKRLLQIQGGRG 
LKRNLL 


12250 


26151 


A 


12364 


94 


1 


KNPPLFFFFFFFESESCYVAQTAVQWHD 
LGL 


12251 


26152 


A 


12365 


237 


2 


YSSLLFKPAESAKGIIiRHHVTPKSTSAI 
SKILFCFLFFVETEFLSPGLKCSGAIKV 
HC3LNLSGSSNSPTSASQVPGS 


12252 


26153 


A 


12366 


340 


442 


GLFVCFFVWFLETESCSVTQAGVHWCDL 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SSLQPS 


12253 


26154 


A 


12367 


208 


45 


EGGEFFFFFFSVFFFVRAGVLPCCEGWS 
ETPGSSDPPALASQSAGITGATVNST 


12254 


26155 


A 


12358 


60 


443 


CSSCVSSKPQSPHLKKQGDRMGflHLISG 
GCTEATAGKGMRKCLVAWSTCAGftRPPS 
RNRGSQGHSARFQGPELPT WRLP P PP P 
QAKPFCKKWMLGGPISIEPKPQHPPGPG 
KKKWAGKAAPRLVPT 


12255 


26156 


A 


12370 


341 


421 


SGWVRWLTPVTPALWEaEAGGSPEVGR 


12256 


26157 


A 


12371 


280 


393 


NTIDHYVTQWLTPVIRTLWEAEaGRSPE 
VRSSRSAWPT 


12257 


26158 


A 


12372 


268 


399 


TLCSLSSAESIIHSYPEFSCKDNYLMTH 
NEAYKACMRRAFHI PL 


12258 


26159 


A 


12373 


209 


468 


EAVPDETSHPSPADSFSTSSNIACGTHH 
LRRLLKLYVEEELE YYPRKIQCEHFS FF 
FFESRSIiliLPRLECSGS ISAHCNLCLL 
GS 


12259 


26160 


A 


12374 


381 


488 


TITCLFQKCQRGRARWLTPVIPALWEAE 
AGGSQGQE 


12260 


26161 


A 


12375 


276 


395 


GHWICTKKLKLIFVFFEMESCSVAQAGV 
QWHDLGSLQPPP 


12261 


26162 




12376 


295 


472 


VYSSVALNTFTLLCNHHHQPSPEVLTFP 
NKKCRGAILAHCNLHLLGSSDSPTSASR 
VAG 


12262 


26163 


A 


12377 


390 


511 


PGAVAHACNPSTLGGRGGQITRIPSL 


12263 


26164 


A 


12378 


417 


511 


AHRSFFEMESCSVTQAGVQWRRLGSLQP 
SPP 


12264 


26165 




12379 


133 


406 


KLSLNKRDYKKQVCKETKKKKKKKKKKK 
KKKKKKPGAQKKKSGGSPR 


12265 


26166 




12380 


247 


345 


HMWSSQLNKGLINKKKKKKKKKKKKKKK 
KKKKKKKKG 


12266 


26167 


A 


12381 


156 


15 


NFFFFETESCSSVTQTRVQWFDLGSLQP 
GKQEQNYVSKNKQKELIH 


12267 


26168 


A 


12382 


275 


1 


KIVFLKGPPLFFFFKKKFVFFLKVGFKG 

GLNF I KTFPPGGKP I SKKRQI FFFFFFF 
ETEPHSVARLECSGTISPHCNLHLPSSN 
DSLASTS 


12268 


26169 




12383 


43 


379 


LLGYDESRSLSWICLCLSCLGQLLSFLN 
LLVYVFCQTCEVCFHYFFQYFSTLFLLT 
FWDSWVLNIRHFGIVPQVLDDLVFFSPN 
KEIFLGQLQWLMPVIPAFWEPEAGRSPE 


12269 


26170 


A 


12384 


303 


1 


KGPFFFFGLSPFFFLKKVFFFPPFFLRT 
PPFFFFPPFFKNFFPPPPKKKNPFFFFP 
PPPFFFFFFFFFETRSRSVAQARMQWGD 
HGSPQPQPPGSSDPTRP 


12270 


26171 


A 


12385 


141 


1 


KKEPLRAPPPFQLFCFFYFETTPCSVTQ 
ARVQWCNLSSLQPPPLRYK 


12271 


26172 


A 


12386 


31 


417 


CRLADSPSPNDTGQDSRGRAGIKHIPPL 














KKKKKSGGGA 


12272 


26173 


A 


12387 


291 


412 


LSSEMLLFCFVYFLRWSFTFVAQAGVQW 
CDLGLLQPPLPG 


12273 


26174 


A 


12388 


310 


1 


MTPCPFLFFCPKKRKA(3GGFIKKaLFFN 
PQKEVFLGPPPKGPFFLGFQPTPPWGKK 
IPRIiNPLFWRSSIFFFFLLDGVSRLLPR 
LECNGVISAHCaJLRLIiQSS 


12274 


26175 


A 


12389 


117 


2 


KRNPQRGGPLFFFFFFFETESRSWQAG 
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C=Cysteine, D=AsparticAcid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, lj=Leucine, M=iVIetliionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=TIireonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unlinown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














AQWHDLGSLQ 


12275 


26176 


A 


12390 


117 


319 


SFLKKPKDSTKWLLKLINEFSKGSGPKI 
TIQKSVAFIYPNGEHSEKKIREVITFTI 
AAKTIKCPPQA 


12276 


26177 


A 


12391 


95 


2 


SKSWTSLKLRTLGWARWLTPVIPALWEA 


12277 


26178 


A 


12393 


127 


395 


IPGSQGFCLIKQSALHLLNKSSFSFLSF 
lETQSLSVTQAGVQWCYLSSLQPPPPRF 
KLRQKNRLNSGGRGCNKLRAHHYTPGWV 
TERDS 


12278 


26179 


A 


12394 


252 


391 


FFCFCFVLLCFFEMESCSVAQAGLQWPG 
LGSLRQGFAMLA3JLVSSS 


12279 


26180 


A 


12395 


275 


3 


TFRLVI LICKS KETVEINCLETKSHSVAR 
AGVQWCSQDWWEEKAGFNWRPSKARRCP 
TSVLKCHLGWAQWLMPVIPALWEAKAGG 

SPEVRG 


12280 


26181 


A 12396 


111 




GKW KI RGLR IRPLKGWACWFTPVIPAF 

WEAEAGG 


12281 


26182 


A 


12397 


124 


1 


VSRGRTVHQEKPTERAASPRLECNGTIS 
AHHNLCLPGSSNS 


12282 


26183 


A 


12398 


259 


387 


YQQESFSCLFIFIFLFFIFYDTESRSVA 
QAGMQWHDLGSLKPP 


12283 


26184 


A 


12399 


103 


1 


KGRFFFFFFYETESRSVaQRGVQWRDLD 
SLQAPP 


12284 


26185 


A 


12400 


308 


416 


CGNNFEKAGRVRWLTPVIPALWEAKftGG 
SLEVRSLR 


12285 


26186 


A 


12401 


92 


1 


HTWGLTQWLMPVIPALQKAKVGKLFEPR 


12286 


26187 


A 


12402 


265 


388 


GLSWLFRRPGHLFYFIFFETMSVS ITQA 
GVGCCDLGSLQPP 


12287 


26188 




12403 


279 


1 


SLSSKMESGSLTFSTIDIWGWI ITLSCC 
PGHCRMFSSIRGLYPLnaSSTTSSSCMN 
QNSPDMGTCSUfSQLLRRLRWEDCLNTG 
GRGCSEPGS 


12288 


26189 


A 


12404 


387 


152 


NPPPGFNFGGPLKKNFFSPPRGEKFVFL 
KRPPPFFFFFFFFFETESHSVAQAGVHW 
RDLGSLQALKSFCDLVSAFKGF 


12289 


26190 


A 


12405 


262 


464 


1 1 FHWLAC I LVIHS SFDGHGLFLPLAVA 
DSAAVKHSCTSICLNIEYIPGNGIAGSC 
GISVLNFLRNC 


12290 


26191 


A 


12406 


204 


67 


KCFGQAWWLMPVIPAU-JEAEAGRSQELR 
SSRPAWAKELLNNRTRAS 


12291 


26192 


A 


12407 


134 


I 


IQRPCIQLFSTFFLILFYFIFTEIDSCS 
VAQAEVQWHDFGSLQP 


12292 


26193 


A 


12408 


145 


403 


TYMYSLINYYKAUSHVTKVNKHNTARFL 
ESLLCAHPNPPIKLIISLLLREKKKKKK 
KKKKKKKKKKKKKQ K 


12293 


26194 


A 


12409 


65 


415 


RNKR 1 1 S QS WNGP S RKPQS SALLCGLGH 
LTSGVRARVSPVQGCLVRIKGGSGPSKP 
KKKKKKKKKKKKKKKKKSSKKKKAQKGG 
ALKKK 


12294 


26195 


A 


12410 


124 


404 


PWWEFLHHRNCQMWPGTVAHTCNPGTLG 
KKKKKKKKKKKKKKKKKKKSS S S S S 


12295 


26196 


A 


12411 


164 


2 


TGAGLTIMDMVGVGPDLLQLRFCFVLFC 
FLVFFFETESDSVAQAfiVQWCNPSSL 


12296 


26197 


A 


12412 


114 


3 


PGMVAHACNPSTLGGQGGWITRGQVYKT 
GLAKTVKP 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12297 


26198 


A 


12413 


583 


391 


LADTGFHHVGQADLKPLTSCSTRLSLPK 
CWD-ffiCEPPHLAVPDTVLSTRGLISEKS 

FLPLCTFR 


12298 


26199 


A 


12414 


280 


2 


KVSSMNAQTSVLFSTMLGISETLKYRGR 
KKRRERGREGGREGRERKDECRKVRSER 
lEGGKKRGRKEYREEGRRRIEVSANEET 
PRDHPANK 


12299 


26200 


A 


12415 


264 


352 


RFLVHGCDIYICINIYTEITHTHTHSHTH 
T 


12300 


26201 


A 


12416 


359 


485 


TDAEMDWCYQEGTGSQAWWLTPVi PAFW 
EAEAGGSFEVRSLR 


12301 


26202 


A 


12417 


171 


1 


PLFRWWGFELFPPFGYQKKKGYPPFFFF 
ETEPCCVTQAGMQWYDLRSLQPPPPBFK 
R 


12302 


26203 


A 


12418 


265 




WHKSNEEIAREASAPVKARASNTILGPH 
THGKERCPVSLRHVLDEGVKSTNFFSFF 
ETDSHSVTQAECSGAISAHCNLHLMGSR 
DSGA 


12303 


26204 


A 


12419 


278 


388 


SSLTGFFIRRRNEEPGQHGETPCLLKIQ 
KLAGCGGVL 


12304 


26205 


A 


12420 


205 


413 


RAIGQAEWFTPIIPAVWEAKVSRLPEVR 
SVRLSLPKWRITS 


12305 


26206 


A 


12421 


103 


2 


RSLSKHRTETILGLPAAVLI ILFPPLL I 
PTSKY 


12306 


26207 


A 


12422 


372 


478 


NIFVFLKETPCQSQWIiMSVIPTLWEAEA 
GGSLEPR 


12307 


26208 


A 


12423 


174 


416 


KMHYCVLSAFLILHLDTVALILSTCSTL 
DMDQFMRKRIEAIRGQILSKLKLTSPPK 

NYSEPEEVPPEVISIYNSTRDLLQE 


12308 


26209 


A 


12424 


572 


733 


RFSCLSLPGSWDYKNCLNSGSRGYSEPR 
SHHC I PAWVTEQDSVSKQNKTKQSA 


12309 


26210 


A 


12425 


123 


2 


GYIFIFIYFFETGSCSVAQAGVQWRNLG 
SLQLPPPSSSDS 


12310 


26211 


A 


12426 


224 


417 


TADFYGVHYISIKLLPKKKKKRGGPFKE 
SKFTAAGLQKNIFFLSAPNFIALAWLK 
RRDWETPG 


12311 


26212 


A 


12427 


115 


3 


FFETGSHPVTQTGMQWCDCNSLQPLTNR 
LKQSSHLSL 


12312 


26213 


A 


12428 


250 


398 


PWAGHLTSLGLSLPIFWIGIAGQACWLM 
PVIPALWEAEAGRSLEVRSSK 


12313 


26214 


A 


12429 


215 


3 


KKGLWWGANKLWVFTPQKFFLNQFSGW 
ALTPRGKSWFSKRRAPAPFFFFFFWETE 
SHSVAEAGMQWHDL 


12314 


26215 


A 


12430 


345 


2 


FSHHPAKPQGGLFYRQPLGLEGKETRPR 
LNDRFMAGPSFPAQQGLRMPFLGKEWER 
EGCSFVQCPVEGFAMWPYPLGSPPGPRL 
FFFFFFEAGSPSVTQAECSGS ITANCSL 

EL 


12315 


26216 


A 


12431 


348 


3 


LYSSLGDRVKLRLEKTKTNKWCLHSYIS 
TCKIINLDHYLIPCTKTNSEWIKYLNVR 
TNS IKLLE EDI SVNLHELGSDNSFIiAMT 
PKAEIIKEKIYKLHFIKIKIFHVSEDII 

EK 


12316 


26217 


A 


12432 


160 


52 


NTAIQHPHQAIYLSYLSIIYLSIYLSIY 
LSIYLSIYL 


12317 


26218 


A 


12433 


180 


2 


GRSKLHICREHSICaiBHVCGCDTENNT 
NLCQVQWLMPVILAIWEAEAWRSPEVRS 
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nucleotide deletion, \=possible 
nucleotide insertion 














LRA 


12318 


26219 


A 


12434 


193 


3 


PGAENPPPKGGPPFFFPFPRGFPPGVFF 
LKKKGFPFFFFFFFDTESRSVAQAGVQW 
HDLGSLQ 


12319 


26220 


A 


12435 


321 


422 


NVEGDFLQRIKNIRYWPGAVAHTCGPST 
LGGRGG 


12320 


26221 


A 


12436 


116 


2 


HDRFCLFVYDRVSTLSPRLECSSAISAH 
CNLRLPGSSN 


12321 


26222 


A 


12437 


302 


400 


THTHTHTHTHTHTHSGLLYYLPKGKQSH 
EFCW 


12322 


26223 


A 


12438 


329 


399 


LALQAKLECNGIISAHCSLCLPY 


12323 


26224 


A 


12439 


310 


2 


KPILSRGLFCRRPCFRRSNMESFNPSQS 
YQVGKDTFGFTVKVTKLKVLGPSRGPKG 
ARQGVHKGKCLLTLPKMLRFLFFFETES 
CSVTRLECSGAIIAHCajL 


12324 


26225 




12440 


151 


401 


EGMPHLPGSHSSSNILKRGRESGRBAGR 
KGRKERRKESIRERRKEEEKRGRGKIRR 
RKDRGENEEEENNRLSSKVSLEISTCP 


12325 


26226 


A 


12441 


276 


1 


FHPFGRPSVSPRVQPKIGI IIHTCVARI 
EKGGSCKALSLCPQYVGSNFFFFFETRS 
FSVAQAGVQRCDLGSMQPPHFPGSSDPP 
APASRVAG 


12326 


26227 




12442 


140 


1 


NSHFPKNLGSPSSFFFFFYLETESRSVA 
QAGVQWLDLRSLQPPPPG 


12327 


26228 


A 


12443 


310 


3 


LPPCSVgGLYGAFARCPLMDDKQLSQVPL 
WACPLSTAGRTRLCVACIKAAGKAQGFF 
FFFETESHSVTQAGVQWHDLGSLQPPTH 
AKPKHTHTHTQTHTIPDT 


12328 


26229 


A 


12444 


337 


2 


GFFAIVPKKGQKKMFSPFPKEGIFSGPC 
FISSTPTLLGVKKILPPNTSEIRPVLSS 
I INTSVHIREQKKTTYVGIFTGKSRFFF 
FETDSRS IAQVGVRWRHLGSLC3APPPG 


12329 


26230 




12445 


112 


9 


GRVRWLTPAIPALWEAKAGRSPEVRNSR 


12330 


26231 


A 


12446 


172 


3 


IPGELLIGRAKFKAKNWPWAPRGIPLGR 
QKFSFFFFFETESRSVAQAGMQWLNLGS 


12331 


26232 


A 


12447 


221 


418 


RKAERLEVFFRRLCQRALRPFPHCLAAA 
PMPLIVLKKPILGRAWWLTPVIPALWEA 
EAGGWLESRS 


12332 


26233 




12448 


280 


392 


PLGRVRWLTTVIPAHWEALAGGSPEFRN 
LRPAWPDMA 


12333 


26234 


A 


12449 


410 


129 


PSQEIFPYPGPPGKTPFPLKKKKKIJRGG 
GGGLPFP PPQKVKKKKFFYPGRGRFKQ P 
KFNPLPPPRGEKKNPKPPPPKKKKKKEK 
KRKEKKLVT 


12334 


26235 


A 


12450 


159 


1 


KKKRGGFLGLTFIKKTGPDPFFFFFETE 
SRSVAQAGVQWYDLGSLQAPPPGLM 


12335 


26236 


A 


12451 


270 


1 


NPLKFFFFSLFQNPLKKDPAIFNPFFFF 
FPRIFWPFFGGVSKAGIWFFFFPFFFFF 
FFFFFFETESRSVAQAGVQWRDLGSLRT 
RGRTRG 


12336 


26237 


A 


12452 


249 


33 


DKLVKLPLILKKLNSFFFFFETESRSVA 
QTGVQWRNLGSLHPPPPGFKPNAWAYAW 
AHAWWPGLPAQNLR 


12337 


26238 


A 


12453 


190 




THDLGWTQWLTPVIPALWDAEAGRSLEL 
RSTRKA 


12338 


26239 


A 


12454 


318 


2 


PPTFPLKTLFFPVFLPKLFSTLFSPKKK 
FSMFLVSAPPFFFLTLGNSQGVLLKGGP 
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otide 
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ding to 
last amino 
add 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFETESRCVSQVGVQWRDLEKK 
EKKKCNSNGGRHYI STHLKYK 


12339 


26240 


A 


12455 


204 


288 


IQLNTHTHTHTHTHTHTHTHTNETGKMW 
I 


12340 


26241 


A 


12456 


409 


13 


QGGAPP I PIMLS FLQEAGLLHVAQPGVK 
LLTSGDPPSPPSKSAWRQRQETRLNLGG 
RGRSEQRPPPRPPAWATQQDSVSKTNKQ 
TKNNKKIRKSLCKKTLEKEMLAVMIITK 
YLLCTDVPSTVLCTHLYYLT 


12341 


26242 


A 


12457 


204 


3 


KKKSPTPQSAKFPL(SGGVFFFFLVGMRS 
PFVAQAGFKPLGSSDPPIIASQSTAITG 
MNHHTSSAFH 


12342 


26243 


A 


12458 


289 




KAREKKKVGATKNIRIVDLSKGRTSLCK 
KTEMPPVGNENKQKPQNPSNLRISTRPE 
TESRSVTRLECSC3AISAQVILPGSSDSP 
ASNWRQSETLS 


12343 


26244 


A 


12459 


261 


388 


SWHFGRSEQVYCLSLGVRNQPGC3NPVST 
RTTKISCJAWWHAPW 


12344 


26245 


A 


12460 


313 


391 


RVGRVHWLKPVIPALWEAVAGGSPEV 


12345 


26246 


A 


12461 


112 


7 


HICIYFFETESRSVAQftGVQWHDLSSLQ 
PPPAGFK 


12346 


26247 




12462 


389 


3 


ALHEGAARGSPPPGGKKKKRGSPFPMGN 
QAPQVPGKLGGGFGFWFFKKSLPLQPGG 
KPGVFYKPKKVPTPKDPPPPTLWGGWGT 
TGGPPGPLFFFFFFFETESRSVTQAGVQ 
WRHLGPLQPQPPAHRS 


12347 


26248 


A 


12463 


26 


349 


CIMIDSTTGI FIHCWWECBMAHPFWKAV 
WQFLFFFFKEGVYFFFPGGIKGPGFGLR 
EPLPPGIKGVPRPNPLGGGDLRPPPPPP 
INLGFFKGKGVFMVWPGVGNFLI 


12348 


26249 


A 


12465 


187 


54 


GYKNKPGQAQWLTPVl PALWEAKAGGS P 
EVRS SRPAWP IWRTRG 


12349 


26250 


A 


12466 


199 


55 


IIQLRQQNSFLFICFDIESCSVAC2AGVQ 
WHNLSSLQLEVPGFKHAPSH 


12350 


26251 




12467 


107 


3 


FAHCLIGNSFPSFFFFFFEVGSCSVAQA 
GVQWSD 


12351 


26252 




12468 


120 


1 


GNWCRAQWLTLVIPALWEAEVGRSPEVG 
SSTRLGLPTHRP 


12352 


26253 


A 


12469 


105 


2 


YYLYFIFFLFFLETGSCSVTQAGVQWKD 
LRSVTS 


12353 


26254 


A 


12470 


238 


405 


FHICLFILKNQVSKMYTHTVSCSLLEII 
YCCLLQWLTPVI PALWETKVGGSLEVRS 


12354 


26255 


A 


12471 


425 


' 


LNPAPAIFFGGPKKKNFFSPPRGDKFFF 
FKRAPPFFFFFFFFFFFFQKKKKPFFPA 
QKKGRGFPGRALSQWPLAGFLKITQFPP 
WSRVPFSPLLFFFFDMESPSVAQAGVLW 
HDLLHLPGS SD P PT PASRVAGI TGVCHH 
A 


12355 


26256 




12472 


122 


2 


QNKYSELRMNTFGRPQWFTPVI PALWEA 
EAGGSPEVRSSR 


12356 


26257 


A 


12473 


144 


420 


GAPRPISRPRGKTPGGGGKRGKPNKNGL 
GENPFFPKGVKGKNPPIFFPFLGQNGPT 
PLVLGSANMGFYLYLKRKGPGKGREALR 

AEPGQPFY 


12357 


26258 


A 


12474 


80 


1 


KGGQVQWLTPVI PAFWEAEAGGSPEV 


12358 


26259 


A 


12475 


54 


294 


LIKSWQHRQEYVSLMSFIFFPCHIWYQS 
QRSPGTTQI ILE ISDTGKNKYFNCTGSH 
RKHTHTHTHTHTHTHTHAHTHTL 
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nucleotide 
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ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 

ding'to"" 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12359 


26260 


A 


12476 


364 


234 


EEMGFHHVSQDDLNLLNSWSACLGLPKC 
WDYRSETSRLASLPF 


12360 


26261 


A 


12477 


288 


450 


PGTVAHACNPSTLGGPGWSRTPGLKRIl " 
CLGLPKCWDFRRELLPSLPCLLIFL 


12361 


26262 


A 


12478 


271 


396 


TISFLSFFFLFLFFFFFFWRKSGRESRF 
SQKKKKYKKKNIYY 


12362 


26263 


A 


12479 


289 


2 


GSCIjLERKLTYRKDIHTKWPSVPHHHQR 
PKVDKTPKTGKKQGRKTGNSKNQSASPP 
PKECSSSPATEQKWMENDFAKLREKGFR 
RSNYSELQEEl 


12363 


26264 


A 


12480 


323 


1 


YSMVYPYVFLTAKKDQLQVHNTQLTCKS 
YQLYHCINHSTLQTIEsriSTLMILGHIPG 
LCIPVNLSEAWAAl'PALHFVKLLLTELT 
HHVCRALDI I ILAIVSLVALITS 


12364 


26265 




12481 


279 


429 


MNGQRKCVYVYTLYTYTTEYYSTLKRRK 
WPGTMAHACKPNTLGGHGGWTA 


12365 


26266 


A • 


12482 


308 


423 


LTFFKNITGFFFFFFETEFCFVPQAGAQ 
GGDLSSLKAP 


12366 


26267 


A 


12483 


272 


474 


MVSWGLSEAKIPffVLSFFFFFEKKSHFF 
FQGGGRGGDLSWEQPLPPRSKGFSCFTL 
PSTRDYRPAGP 


12367 


26268 




124S4 


"84 


129 


LFKSVCHLLPLSSCSSHVRQACIiPFTFC 
HDCKCSEASPAMLPVQPAEL 


12368 


26269 


A 


12485 


392 


3 


TGKKKRFPFFPKKKIFNSKPQFSWRKKK 
liFKIFSPPGGAPFPPGVQKKKTFFPFYP 
FSPRTPFFSIFFPGKGPPKGGGLGPPFF 
PQKEGGGFPNSKWETWEKFPPLFFFFFF 
ETESHSVAQAGVQWHDL 


12369 


26270 


A 


12486 


76 


T 


KLISWTWWHMPWPATWRAEAGDFS 


12370 


26271 


A 


12487 


204 


















EWHDHGSLQPQPPQSFHLSLPCATMPGL 
FFLFLFLFT 


12371 


26272 


A 


12488 


96 


3 


KKPLGQAMFFFFFETESCSVAQAGVQWC 
DL 


12372 


26273 


A 


12489 


126 


1 


KPPHPFLPFFFFFETESHSVAQAGVQWC 
YLGSLQAPPPGFTP 


12373 


26274 


A 


12490 


107 


2 


lAGRAWWLMLAIPTLWEAEAGGSLEPHS 

SQPTCAT 


12374 


26275 


A 


12491 


302 


2 


GGFGFFPPREKGGFFQTVLFGVPPGFFS 
PPVFKTGPGVFFLGAQKKKIFFPPPGGK 
IWFFLRGAPLFFFFFFFFFFFEMTSCSV 
AQAGLQWCDLGSLQHP 


12375 


26276 


A 


12492 


248 


398 


PTEQVTLGITAQSYSRVHIKNRVyDLAV 
GSGHPDGAAAIKGSFVQRLKSY 


12376 


26277 


A 


12493 


341 


1 


KTSHFREIYPLEHASSLSKKIETGALPC 
S QE LLNS QENS VMNKF P CLNQLP FQDVN 
SCI IRFQTKDTLKHTAIQPEAKTLSLPC 
YTHTHTHTHTHTNNHSIFELLCMQCDSY 
N 














PLFFFETKSCSVAQAGVQWQJ7LGSLQPP 
PP 


1237S 


26279 


A 


12495 


62 


420 


CAAKLCTEAHTAASADTHTSPHGSVSGL 
FCFHFPPHRREQESRPGRKPGAHRLAGR 
ALSQKPVGSGATPHNLHHQIRTQTNGLI 
QLLGDRQAPWVTPVLPVLWEAEAGGSLE 
ARSLRPA 


12379 


26280 


A 


12496 


58 


492 


NSPPPPAPSQRTSPPAAAASPTTGSSSA 
PCPASSSWPRSSIALITFYPDQPCGLSF 
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sequence 
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nude- 
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residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














NNLAWKQNVLCFKERLLEKMFRRIQVLG 
LLVWTLIAGTEYFRVPAFGWVMFVAVFY 
WLTVFFLI lYITMTYTRIPQVPWTTVC 
LCFKR 


12380 


26281 


A 


12497 


2 


198 


RLQGILSAGFLVPGDTSVSSPRPRAAGS 
QSSTSRVLMKLSGAMLRLYVLVMGVSAF 

TLOPAAIITG 


12381 


26282 


A 


12498 


71 


346 


GMFMTSKETFTHYQPQGJSrSDPlfflTATAP 
GGLSAKAPAMTPLMLDTSSRKLVaWDGT 
TDGAAVGILAVAADQTSTTLTFYKSGTF 
RYEDVLWP 


12382 


26283 


A 


12499 


122 


59 


HQIYIYIHTHTHTHIYIYIYI 


12383 


26284 


A 


12500 


220 


3 


PKKESVRCWAMKWILLILCQTSLHWSL 
PQSSLAATPCCIilTLGTVDVLQGLGAVAH 
AWNPSTLGGRDGWITS 


12384 


26285 


A 


12501 


125 


-j— 


HVLVLSHFLFVLFLFFETESRSVTRPKR 
SSAVSAHRNIRLS 


12385 


26286 


A 


12502 


373 


476 


ILTRDGVSLLPRLECSGTISAHHNICLL 
GSGDYP 


12386 


26287 


A 


12503 


440 


573 


STWEDHLSLGNRGCSEPRLHLCTPTWVT 

E 


12387 


26288 


A 


12504 


146 


5 


HPHLQTMNGAPIPQELTPLLEKERDGLR 
CRGNRSPVPGIPPRWSRDL 


12388 


26289 


A 


12505 


140 


44 


AQGLTPVI PALWEDEVGRSPEVRS SRSD 


12389 


26290 


A 


12506 


247 


420 


LEISVTFSCSFGCLPKKHIHTHTHTHTH 
THTHTHTHTHTYIRTIYTTIARPLSALH 
NF 


12390 


26291 


A 


12507 


346 


473 


KAYI IGLNCFYETKSHSVAQAGVQWRDL 
NSLYS 


12391 


26292 


A 


12508 


87 


414 


RTGVYRVGKDGRSRSPDLL I CP PLGLPK 
CWDYRREPPRPACLPLLSHPSPPPHFSF 
LSPFLSFSLPCSPFLYIFFPPPLLPISL 
RWRLI PEGRFMAPLCLQMPLGHDT 


12392 


26293 


A 


12509 


386 


467 


LLLFFEMESHSITRLECSGAILAHCNL 


12393 


26294 


A 


12510 


344 


472 


KFLPFDPAVSLLGIYPKESKSFYQIODTC 
TRMFIAILFTIANTW 


12394 


26295 


A 


12511 


424 


132 


RGEGPPRSGILSPVGPPGETPPFLKKQK 
I TRGGGGGPLFPPLKRVRGENS FP PGGK 
S FHGAKFPFCPPPWATKRMSVSKKKKKQ 
TKKGSRIILYMNE 


12395 


26296 


A 


12512 


351 


510 


GGTDFCKQSKLQSLQPAVEIFIPGRVQW 
LTPVI SAIjWEAEAGGSPEVRRSRPA 


12396 


26297 


A 


12513 


362 


463 


NTITWLGAVAHACNPHTLGGPGGRITWG 
QEFETS 


12397 


26298 


A 


12514 


137 


1 


EEI SLLLPRLECNGAITAHHNLHLPCSR 
NRRPGKAVHTOQAYS FQ 


12398 


26299 


A 


12515 


251 


592 


GAFTGLAFTMAGGRPHLKRS FS 1 1 PCFV 
FVESVLLGIVILLAYRLEFTDTFPVHTQ 
GFFCYDSTYAKPYPGPEflASRVPPALVY 
ALVTAGPTLTILLGELARAFFPAPPLAG 
PV 


12399 


26300 


A 


12516 


3 


413 


SWGGRKFLCPPRSLSGSRELHPAQGDRP 
GPLSSSGKRETGTHRETLGKKKVSSAPE 
AQGAGIiRLSQALGGLCYDLSPLTEPRLP 
LAATAFPRPCPALPHPQPRVTMGSVSSL 
ISGHSFHSKHCRMQNKLRKSSHLK 
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nucleotide 
sequence 


SEQ ID 
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sequence 
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beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartii; Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=IIistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=po$$ible 
nucleotide insertion 


12400 


26301 


A 


12517 


115 


2 


KSHC3GHGVEVLVSVLVGLAMEVGLVLWV 
KEYDGAALA 


12401 


26302 


A 


12519 


444 


34 


YCLRQKHTEYRRRSNLEMASSCLSGRKT 
CTFFTLYQNLKMIEXSEEGTWQAEWLKS 
RPLAPNSKWEAKEKFLKKI KSVTSLNT 
YILRKPNSLIADMENILVIHIEYQTSYN 
IPSSQRLIQSKALCLFNYLEMECRI 


12402 


26303 


A 


12520 


1088 


935 


ILIIFINYRCEFLRSKKSSEBITQYIQS 
YKGFVDRTVMYNSNFFLKFLDYL 


12403 


26304 


A 


12521 




347 


AMAAGSRTSLLLAFALLCLPWLQEAGAV 
QTVPLSRLFDHAMLQAHRAHQLAIDTYQ 
EFEETYIPKDQKYSFLHDSQTSFCFSDS 
IPTPSNMEETQQKSNLELLRISLLLIES 
WLE 


12404 


26305 


A 


12522 


50 


205 


WGESDRQSSGFWASSAHAOIPTWEVET 
GGSLEAHSWRLQCTMFASVKSTRP 


12405 


26306 


A 


12523 


105 


2 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLYER 


12406 


26307 


A 


12524 


279 


455 


KTKNWIiALGGPPCFPGQYGETPSLLKNE 
KIiAGHGGAPLYSRLLRRLRRGNSLTPGD 
RGC 


12407 


26308 


A 


12526 


305 


454 


LYFLLFPFQIIHRMVLFFVFCFFFETES 
HSVPQAGVQWRDLGSLQAPPPG 


12408 


26309 




12527 


363 


459 


CTILGQAWWLTPVILALWEAEAGRSPEV 
RGSR 


12409 

~124T0 


26310 


A 


12528 


182 


36 


QHI WFHKEHGQVRWLTPVI SALWKAEA 
GRSPEVRSSNAAWHACRNSA 


26311 


A 


12529 


388 


530 


KSFLNFFPRbGILNCVPQAGLELGSSDP 
PASAFRVAGTTSVWHHSQI 


12411 


26312 


A 


12530 


134 


1 


GHKKGVLFFFFKTESRSVAQAGVQWCTL 
GSLQPPLPAHATRPRV 


12412 


26313 


A 


12531 


230 


1 


KKFGFFPQAGPQGGLHLFTTLAPWGQAI 
SQKREKPFFFFFFETEPHSVARLECSGT 
ISPHCNLHLPSSNDSQASTS 


12413 


26314 


A 


12532 


145 


3 


KKLEILSAIIPPKFRKEIFLFFFYETES 
CSVAQAGVQWRDLGSLQAP 


12414 


26315 


A 


12533 


365 


2 


LFFFTSLIWQITLGFLIIiNCPCIPGVCI 
Y I QFANTFRKFAFL FTNE I AYWI L I L P L 
TKQDYTNLI KL I KKHS FFSNFLKQLLFV 
FCFVFVSLFAMESCSVAQARVQWHNLGT 
LQPPPPRFK 


12415 


26316 


A 


12534 


313 


475 


FFWSCSSFQQGFGGGITIIIIILRWILA 
LSPRLERKWRDLGSLQPSPSGFKRFF 


12416 


26317 


A 


12535 


124 


3 


MGSLIMGAELSVYRQVCDCKPQGVCVCL 
CVCVCVCASTCM 


12417 


26318 


A 


12536 


333 


58 


ACNPSKVRSCRPAWSNMVKS CLSKNAKI 
TKEWWWANFFFVFVFLVEMMFHHVGQLS 
LKLLTSSDLAASASQSYGITGVNHYAQP 
ARTRGSFR 


12418 


26319 


A 


12537 


214 


3 


SDPREAC31CPKEAPEQQRRLPGGSVPAL 
SIFFFFKTESCSVPLSPRLECSGVISAH 
CKLRLPGSCVPPAS 


12419 


26320 


A 


12538 


146 


1 


MNFLAFTNPPRPPQQILKPLLFILRRES 
HSVAQAGVQWCDLSSLLPPA 


12420 


26321 


A 


12539 


195 




IFIPPQAQKRGDPFLSFFFFETRSPSPR 
LECSDAITAHCSLHLPGPGEPPTPPIPK 
SWDHSHVPP 


12421 


26322 


A 


12540 


151 


334 


LLGRLRHKNHLNPGGGGCSGTIMALCNI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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Predicted 
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correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

locntion 
correspon 
dingto 
last amino 
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LLPRSSSSPTSASKVAGITGACHHACCL 
RYFLY 


12422 


26323 


A 


12541 


314 


402 


NLNSGPAWWLMPVI PALWEAEAGGS PE V 
G 


12423 


26324 


A 


12542 


339 


3 


GS CLLEGKLPNRKDIHTENPSVHHHHQR 
PKVDKPTKMGKKQNRKTGNSKTQSVSPP 
PKERSSSPATEQSWMENDFDELREGFRR 
SNYCELREDIQTKGKEVEIFEKNFERV 


12424 


26325 


A 


12543 


31 


365 


RSAVMPTEDRATWKFNYFLKIIQLLDDY 
PKCFIVGANDVGSKQMQQIRMSLRGKAV 
KGMf TMMRKAI RGRLENTJPALE KLL P 

HI 3GNEGS VFTIIEDLTE IRDMLLGNKV 


12425 


26326 


A 


12544 


278 


410 


?S FICWTAGRWWLMPVI PALWEAEVGG 
SP3VKRSRTTLANMVK 


12426 


26327 


A 


12545 


88 


276 


EVPQAHRKLPSAPQDRPLQAISTKGPCP 
PQSPQPSPRPEGSSAHSRGEKRTRERGK 
EKKRKKK 


12427 


26328 


A 


12546 


251 


3 


PVEDNSKRRFQTGBBKVTQEEQEALTPW 
RTNFFCFSDRVSVAQARVQCCDSGSLQP 
QLPGSSDLPTSASQVAGTTOVIRHHAC 


12428 


26329 


A 


12547 


106 


2 


DEHLPFLTSFBTESCSVAQAGVUWRDLG 
SLQAPP 


12429 


26330 


A 


12548 


225 


1 


GFFKKEPHPRPCKKKTPPKLKPFWGGLN 
KKIPLSFVFSKKGPLPKKFFFPFFFFET 
GSCSCHPAGVQWCBHGSLQ 


12430 


26331 


A 


12549 


218 


3 


VPPPGLVPGLPQI IPKGFPKTKGPPKGP 
PPGDKQKSGPPNKKNFFFFFLRRESHSV 
AQAGVPWCDLGSLQP 


12431 


26332 


A 


12550 


276 


24 


GPLKMVEAPPQNPLSPKNKIFFFFFFET 
ESHSVAQAGVQWRNLRLPGSSDFPASDS 
RGAGITGARPHVQLI FVFLVKTGRKRR 


12432 


26333 


A 


12551 


331 


440 


KSASSWSFNSKGWGWWLTPVIPALWEAE 
VGGSLEVR 


12433 


26334 


A 


12552 


136 


1 


RHTGSKTTATALVDVFLLLFFETESHSL 
SRLECSGTTPGHCNLYE 


12434 


26335 


A 


12553 


84 


1 


IFFCLFSFEMESHSVAQTGVQWCDHGSL 


12435 


26336 


A 


12554 


3 


349 


HASGPEELSKDPSLVSQGQPHRKPGLKR 
CSSRPLGPPDKLGGEGKQGLLGFTLWLS 
GPIKPCDDEEKWKKKKKKKKKKKKKLGG 
GPFKKKLFFPPGGGRNFFFLGAPKFFWA 
GRF 


12436 


26337 


A 


12555 


257 


417 


KQLHLLQGRLFSPSFLPSISKLFFFEME 
SRSVTQAGVQGCDLASLQAPPPGFT 


12437 


26338 


A 


12556 


268 


403 


YMRLSFQDLVMFKDVAVDFSQEEWECLN 
SYQRNLYRDVILENYSN 


12438 


26339 


A 


12557 


286 


426 


DML IKTC CHLVCE KKKKKKKKKKKKKKK 
KKKKKKRGGAIiKKKPLPPP 


12439 


26340 


A 


12558 


114 


1 


PLKEIANRHMKKCSSSLAIRQMQIKTTM 
RYYFTPVTMA 


12440 


26341 


A 


12559 


106 


1 


FFFFFFFFFFFYFFFFFFEMESRSVAQA 
GVQWCDL 


12441 


26342 


A 


12560 I lis 




KWYSNKEiKTNAGRAQWLTPAIPALWEAE 
AGKSPEVRNW 


12442 


26343 


A 


12561 


196 


2 


MCCFSKNWQEGAQMTPGFEPPQDNSEKT 
KLSDTRLLFFFFERESRSVAQAGVQKRG 
LGSLQAPP 


12443 


26344 


A 


12562 


128 


3 


RAPPFFFFETEFRSLPRLEajGAISAHR 
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NLRIPGSSDSPAS 


12444 


26345 


A 


12563 


277 


370 


WSFTLVPQAGAQWCDLGSLQPAIiSRFKR 

F3R 


12445 


26346 


A 


12564 


55 


251 


A^^VIPTVPFNITINSKPLG^:ISFQLFAD 
K?PKTGENFHTLNWKDKGFGSCFHRIIP 
EFiCQGDJF 


12446 


26347 


A 


12565 


129 


3 


KYSVLRPGTVAHTCNPNTLGGRGGQITW 
GWEFETSLfflSVaK 


12447 


26348 


A 


12566 


172 


1 


PPPKKKKPRPPKKKNFFFFPGAISAHCN 
LCVPGSSDSPASPSRVAEITGSRCHREP 
Y 


12448 


26349 


A 


12567 


257 


380 


GICRPLLGGVLQSGATGVRDPLEEAVCP 
LAELKHCAGRSTA 


12449 


26350 


A 


12568 


120 


1 


GVFLFCPVLPETESCSVARLECSGMISA 
HCaSTLRLLGSSNS 


12450 


26351 


A 


12569 


250 


2 


IGKPKTPQFWFLKIKRGGPNAAFSFKKF ' 
FFQLKPPPRVFLKI FSCQKKI FFFFFFE 
TESRSVPQARVQWHDLGSLQAPPPGF 


12451 


26352 


A 


12570 


338 


3 


VSTPEKNFVFHTRGFSKKSSKYKVCNFF 
FQKKLGSPPFSLKKVFPGKPPYSCCSGR 
VSTAPYFLNPGPSTKILPPFFGPLGPEE 
KFLSFFFFFEMECRSVIQAGVQWCDLG 


12452 


26353 


A 


12571 


5 


556 


ICCCLCFKINIFQLHFNKINFFCVTRSG 
PVTQAGVQWCaSTLGSQQPQPLSSKQS 


12453 


26354 




12572 


115 


2 


DRVSLLVLAHCWLCLRGSSDSPASASRV 
AGITGARHD 


12454 


26355 




12573 


279 


377 


GQVQWLMP WPAFWEDEAEGLLQPS SSR 
PAWAT 


12455 


26356 


A 


12574 


117 


1 


KTPLFFFFFFFEMESRSVAQAGVQWYDL 
GSLQAPPNGFT 


12456 


26357 


A 


12575 


253 


374, 


SLPGMWHACNPSPLGGRGRRITWGRKF 
GTSLANMAKPCL 


12457 


26358 


A 


12576 


130 


2 


KFPGPFLGFFFFFFEMESRSFAQAEVQW 
RDLGPPQAPPPRFT 


12458 


26359 


A 


12577 


132 


3 


F.VPGPPLLFFFFFFEMESRSFAQAEVQW 
RDLGPPQAPPPRFT 


12459 


26360 


A 


12578 


3 


370 


LRKNCLNLGGRECSELRSHHCTPAWVTE 
RDSVSKKKTLSVLMEKWPNFPLRPLALS 
RKKFPRGPFEQVTHLVKEWSLTKPCCA 
EGADLDCYDPRTSPLFAKSWERNFPFPV 
HPGPVEGCPK 


12460 


26361 


A 


12579 


131 


2 


FIYFLFYFLFLETESLSVAQAGVQWHDV 
GSLQTPPPRFTPTRA 


12461 


26362 


A 


12580 


202 


2 


RMRKNQHKRAENSKNQNTSFPPKDHNSS 
QTREQNWMEWEFDELTEVGFRRWVITNS 
SNLKEHVFSC 


12462 


26363 


A 


12581 


109 


2 


KrFFFFFFETESRSVPQARVQWHnT,GSL 
QAPPP6F 


12463 


26364 


A 


12582 


105 


375 


LIMILLPLQILVHSGTFCFLFF6FETEP 
HSTPRLECWGVMSAHCHLCLPVKTSPAS 
TSQIS 


12464 


26365 




12583 


106 


3 


RPI FFFFFFQTESRTVARAGEQWCDLGS 
IQPPPP 


12465 


26366 


A 


12584 


484 


700 


SSSWNEAFSRKKDKTWMHTPEALSKHFI 
PYNAKFLGSTEVEQPKGTEWRDAVRKL 
KFARP IKKSEGQKKKK 


12466 


26367 


A 


12585 


301 


1 


TQPKVRTQMKNNIPEHAFLSSTFPWPLP 
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NLPADFYSATSQTKPDILQSMQKAFKNS 
GIGQAWWLIPAIPAALWKAEVGGLLEPR 
NLRPAWYGKTLSLQK 


12467 


26368 


A 


12586 


76 


379 


PLHPPASCPAPSLSCLIPGSPSDPLFSD 
SARPGWGRGWGRAPQLPPSPPFLYNLIK 
KWSRFCHQKKKKKKKKKKKKKKKKKKKK 
KGFFFWEAPPFFFFGGGFFS 


12468 


26369 


A 


12587 


109 


376 


LEFFIMSGWPTAPEQPflSEMENQTKPP 
DPRPDAPPEYNSHFLPGPPGTAVPPPTG 
YPGGLPMGYYSPQQPSTFPLYQPVGGIH 
PVRYQ 


12469 


26370 


A 


12588 


323 


1 


KKFCPPRFFPPLFGFQKKGTNPISPFSG 
GFKKNLGGFNPPPRPPWSPQIGFFLGRS 
KELILGLFPTRIGIGGFFSFFFFCETKS 
HPVMRLECSGVISAHCNLRLLVS 


12470 


26371 


A 


12589 


262 


I 


GDPPTTSGPQTNQPKEHEMNFKSDSQLY 
EDTLAGRSVLIKNLTPQTLQPRWTGPYL 
VIYSTPTAVHLQDPPHWVHRSRIKLCPS 
EKK 


mil 


26372 


A 


12590 


284 


374 


TESEHWGWAQWLTPVILPALWEAEAGGS 
PE 


12472 


26373 


A 


12591 


297 


387 


GQA*rWLTSVIPAFWEAKEGGSPDVRSSG 
LA 


12473 


26374 


A 


12592 


206 


407 


VKPQNPRKAQKLKGLVYIHIYVYIYVYV 
YIYIYIYMYICRYISLTWYTVUSKEKG 
LDTAAHTCNSR 


12474 


26375 


A 


12593 


222 


379 


LTTGSIMGNFSFLILVYTRKVMGSVQWL 
TAVIPIiWEAEAGGSLGSRSSRPA 


12475 


26376 




12595 


301 


189 


KKIQSQAQWLPPI IPALLEAEAGRSLEL 
KNSRSAWAS 


12476 


26377 


A 


12596 


209 


3 


SVKPLFFINYQSQWLFLFFRGRLKRSG 
T ISAHCNLHLPGSRFSCLGLP KCWDYRC 
APPTLKIDLTKD 


12477 


26378 


A 


12597 


306 


3 


NGKKGFYMAPEPPFFFHFFFFSPLFPTL 
FFFFIFPLPFCFALSFLVGSSSLSPRLL 
CPMQHYFCDAWNTFDALIWGSIVDIAI 
TEVNVSTWRLSLTVRV 


12478 


26379 


A 


12598 


1 


249 


WTGAQCPLEFSIQELQLLFLGGGEDGVS 
LLLLRLECNGAISAHCNLRFPGSGDPPA 
SASRVAGITGACRDHTCEPPRPVPPSS 


12479 


26380 


A 


12599 


228 


404 


RLSSLSPVTEFGCLSPPDLILKCDPWPG 
TVTHTCNPSTLGGRGGWITRGQEFKTSP 
VNM 


12480 


26381 


A 


12600 


101 


580 


LSLTKNCALLGEETMMEQEMTRLHRHVS 
EVEAVLSQKEVELKASETQRSPLEQDLA 
TYITECSSLKRSLEQARMEVSQEDDKAL 
QLLHD IREQSRKLQE I KEQE YQAQVEEM 
RLMMNQLEEDLVSARRRSDLYESELRES 
RLAAEE FKRKATE CQHKLLK 


12481 


26382 




12601 


464 


3 


CAI QQTL YEHPMKSSRLGPTQLKIFTCE 
YCNKVFKFKHSLQAHLRIHTNEKPYKCP 
Q CS YASA I KANLNVHLRKHTGEKFACDY 
CSFTCLSKGHLKVHIERVHKKIKQHCRF 
CKKKYSDVKNLIKHIRDAHDPQDMYCGR 
SRGSAEMSRALPS 


12482 


26383 


A 


12602 


140 


1 


WIGSFLSSEEDGTWQKISGVQRSLSLY 
VY IHTHTHTHTHTHTHTV 


12483 


26384 


A 


12603 


112 


2 


ESLRPGAYAHTHNPSTLGGRDGWINKDK 
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nucleotide insertion 














EFKMCERV 


12484 


26385 


A 


12604 


139 


1 


KKRVKRGPPGGFKGLGGKFFFSPKTFFF 
FFFETESCSVAQAGVQWR 


12485 


26386 


A 


12605 


204 


460 


SGQGNGEICHPRGFCTLPKAISSLLIAR 
VCLKHFNKBVASLSQTANGCFFEGFSET 
ESRSVIQAGGQGGDPGSLQPPPP6SCLS 

h 


12486 


26387 


A 


12606 


317 


1 


KRPLFKNI FQKPFFKNGAPRKKGPRCI F 
GPNQKNPLFFPAPLSPPFWETKKGAGFF 
PWGFPFPPTKIIGGQKVPPPPTFPPFFF 
FFETESLSVTSLECSGAISJiH 


12487 


26388 


A 


12607 


248 


367 


TQILSSGSVQSSLHHPPICFIHTHTHTH 
THKHKHPHS 


12488 


26389 


A 


12608 


2 


387 


QKQQQRAGRETSTCSLRI ISAPTMATFV 
ELSTKAKMPIVGLGTWECSPLGKVKEAVK 
VAIDAGYRHIDCAYVYQNEHEVGEAIQE 
KIQEKAVKREDLFIVSKVQWCIWIiAFNT 
IVPKSQWQSLQTPQHC 


12489 


26390 


A 


12609 


258 


23 


MWERGIIRPQTQTLTGEPEGIDHGRSIS 
PNSVMNGIVPHISILTLNINGLNAPLKR 
PGIAEWIRIYQTSMCCLQETGR 


12490 


26391 


A 


12610 


176 


433 


DALSSAWGTVLGTQIPSHFTEILMLSPP 
AWGSSSLTQTLFYVPSGAKRTGSYVLftR 
VGQKYKTLWLGAVAHACNPSSLGGRGRW 
IT 


12491 


26392 


A 


12611 


312 


410 


AHMYRTCSRGWVSWLTPGIPALWEAEAG 
GSLEP 


12492 


26393 


A 


12612 


3 


289 


vfeflsrKlSyilrmfwtfkewfwlerf 
WLPPTI kwsdledhdglvfvkpshlyvt 
ipyafllliirrvfekfvaspiaksfgi 
ketvrkvtpnt 


12493 


26394 


A 


12613 


294 


432 


fmklffffffferrscfvaqsgmqggyn 
aslqplpfgssnfsglpl 


12494 


26395 


A 


12614 


138 


375 


SPNATC6HADPLPLCCLHICQALKRFTR 

qtfffegkpylgillyavffflrlescs 
vaqagvqqpnls sletpp psa 


12495 


26396 


A 


12615 


232 




gvp qraklragl cp pqamdtfs tks lal 
qaqkkllskmaskawavlvddtssevl 
delyratreftrsrkeaqkml 


12496 


26397 


A 


12616 


236 


419 


lemrwgl>amffppvwptlwetegvrdqp 
gqheetlslkiifkisrhggmrlgsqll 

RRLRQ 


12497 


26398 


A 


12617 


197 


1 


LCHE SEGFLSRVSALLFTLLLLVLMES P 

psvsqpgvqwphfsslqpllprfkqfcn 
tlpsgwdyk 


12498 


26399 


A 


12618 


132 


3 


nsfmfhlftyfemescsvaqagvqwrdp 
gplqappervggrv 


12499 


26400 


A 


12619 


255 


2 


mnslnsftdiaviplhynrilphfkiil 
syflkskfqvkgigrvkwltpviptlwe 
aevgrsleprncpahqpgqygkillykk 


12500 


26401 


A 


12620 


141 


2 


fhftyiiiscvcvcvtesrfvarlecsg 
Ai sahcknhrpgs sns pa 


12501 


26402 


A 


12621 


274 


421 


tyvmevcctyyflaqvlslapfnyfi ir 
pdprtdprpdprpdprpeprp 


12502 


26403 


A 


12622 


132 


2 


nqsikpoaelqgfqrsggrgkgqvqwlt 
hiiparweakvgrsp 


12503 


26404 


A 


12623 


390 


86 


IFPFFYQNTNGiVIAPGNQIPQGFSSRFFF 
FLRQVLALWPSLEYTGEITDQCRLKCSS 
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SLSLCS SWDNRCAHP PLVNI LFYVETGS 
HYWHIGLKLLHSRYPG 


12504 


26405 


A 


12624 


297 


396 


I IRVRGRAQWLTPVI PALWE AKAGGS PG 


12505 


26406 


A 


12625 


303 


381 


KFHLVPSINTMSGSQELQWMVQPHFL 


12506 


26407 


A 


12626 


318 


152 


GDKRRIVS KKKKNLWNFGRI QWVP P VI P 
ILLEGEAGGSPEVRSFRPTGQQSMTPF 


12507 


26408 


A 


12627 


48 


388 


QLGNGKVRLLYQHHS SSFFFFWGEGFP P 
GPQGGGGGPNLGPRESPPPGVKGTPPPS 
LLEGGEPPGPPPPPGNFWGFWKKGGLPL 
YPGGPPTSGPKGEAPPTPPKGGGGIKGL 

P 


12508 


26409 


A 


1262S 


116 


3 


KVATNQAQWLMPVI PALWEAKAGGS SEV 
RSSRTADAW 


12509 


26410 


A 


12629 


227 


400 


VLFVLFFCKEWHVFIFLILPVYSYKKKK 
















12510 


26411 


A 


12630 


182 


3 


TNWLFPPPKFGPVPLFFLPPFFFFFFET 
TPA 


12511 


26412 


A 


12631 


134 


473 


ASTIMDLLFGRRKTPKELLRQNQRALNR 
AMRELDRERLKLETQEKKI lADIKKMAK 
QGQMDAVRIMAKDLVHTRRYVRKFVLMR 
ANIQAVSLKIQTLKSNNSMAQAMKGVTK 

A 


12512 


26413 


A 


12632 


100 


462 


QLLLCCCCCRQWTAFNVPASPAPPPAWA 
HMAPSLLEPPSLLVTQICXLSAFSGPSI 
NAFLLSKKKKKKKKKKKKKDRGGAPLKK 














KK 


12513 


26414 


A 


12633 


313 


441 


CWELLKWLVCFLCHI IKTNNCWAQWLTP 
VIPTLWEAEVGGSLE 


12514 


26415 


A 


12634 


129 


400 


VSQQCWPTPPALYSISRQALAASPQGRP 
WDLQPTMPESPLliPPRAPAWPKPLRRAL 
PPAPRCLGPSTAQGLRSAGAWCGTGRHL 

HLRPQC 


12515 


26416 


A 


12635 


308 


399 


E IRGWTQWHVPVI PALWEABAGGLPEVG 

SS 


12516 


26417 


A 


12636 


285 


390 


IFGVLINSFIYFETESRSWRAGVQWRD 

LGSLQ?P 


12517 


26418 


A 


12637 


260 


2 


GQGPFFFGGRGSPKKKPPLK6FFLVGGS 
KTQFFFPRPKKGPF6VFFFFFSQTASRS 
VTRLECSGMILAHCNLRLPGSSDSPASA 

SR 


12518 


26419 


A 


12638 


76 


3 


ALQVQWLMPVIPALWEAKAGGSLE 


12519 


26420 


A 


12639 


219 


484 


LGLQEPDLDPKPILELPLAELAQQLQTE 
ELSLES I LCSyLKQALKVHQEVNCLM I F 
LGECEEELLALKKLKKSERGLLYGVPMS 
LKDT 


12520 


26421 


A 


12640 


33 


462 


EGLSWGYREHNGPIHWKEFFPIADSDQQ 
S P I E I KTKE VKYDS SLRPLSIKYDPSSA 
KIISWSGHSFKVDFDDTENKSVLRGGPL 
TGSYRLRQVHLHWGSADDHGSQHIVNGV 
SYAAELHWHWNSDKYPSFVEAAHEPDG 
LAG 


12521 


26422 


A 


12641 


2 


414 


SGPAAPATPMSI FPELYFNVDNGYLEGL 
VHGLKAGVLIQADYLNLEQGETLEDLKL 
HLQSTDYGNFLAKFEASPLTVSVIDDRLK 
EKMAVEFRHMEKHAYQPIiASFLDFITYS 
YMIDMVILIiITGTLHQRSIAELVPK 
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26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamie Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12522 


26423 


A 


12642 


161 


463 


KEKREGRPKPLLLLPITSATTAMGLTIS 
SLFSRLFGKKQMRILMAGLDAAGKTTIL 
YKLKLGE I VTTI PT I VFNVETVE YMSTIC 
FTVWDDGGQERIRPLWN 


12523 


26424 


A 


12643 


124 


2 


TVFGGQAQWLMP 1 1 PAIWEAEAGRSPEV 
TSLRPAWPRTRG 


12524 


26425 


A 


12644 


258 


454 


NRPELCDYQGLRKDSKRKGBINQTACQF 
LWPRQKMSSQTKFKKDKEIIAEYKAQIK 
EIRTQLVEQ 


12525 


26426 


A 


12645 


636 


182 


APLBRPSMIPPRCSGGQPSDLRSSPAQG 
TPRPALGDRKGTPRIftEPGGAGRAEAPP 
PRGSRQRFGPQRRQTNAHTKRVRTRLSA 
APARHSPAFIACRQVGRPRLSARRSGKK 
PHLTVWCPFSFQDP IQDTTLQI ILLSLL 
GGDTFSDFPCS 


12526 


26427 


A 


12646 


199 


414 


lYHTSRSLLDRKLRAQRYDELPHYGGMD 
GVGVPASMYGDPHAPGPIPPVHHLNHGP 
PFHAKQNYGAHAPHPN 


12527 


26428 


A 


12647 


130 


3 


DRGFLCVHTPLKKKKNTHTHTHTHTHTH 
TNTHTHSHTHTHTK 


12528 


26429 


A 


12648 


192 


443 


LLLCWAQESLGTLGENTASSHTAGLEWE 
SPLFACWFFWGFCLFVWLFVWFEPKSGS 
VLTLKCSSVITAHCSLDLPGSRDPPASA 


12529 


26430 




12649 


393 


179 


LHSRVQRSGCFCSCVSNPVTGGMHFTRS 
SPQSNQEADGQaRWLTPVILALWEAKAG 
GLPELKSSRPAWATP 


12530 


26431 


A 


12650 


263 


2 


KAPLLVYKWEPNPGGGNFSPGMGTPNTF 
GVRFGGETKPLFFFFFETGSHSVIQAGV 
QLCNLDSLQPPPPSVKPSQYSWDHRKQS 
NTK 


12531 


26432 




12651 


355 


111 


QDPCFIRLHLPLSFLSVFLECTKISLLD 
WRMLFPLLGDPPHLMiSSLVSPINTWNY 
S FQQLMMFRDVAVDFSQBEWECLDL 


12532 


26433 


A 


12652 


271 


3 


RQLKNILIFFSHKNMLSIKHIIYNYTLF 
LKSAFIFNLETDHLNCFPGSISAKSYFI 
LFYFFETEPHSVAQAGVQWRDLSTLQPP 
PPRFT 


12533 


26434 


A 


12653 


333 


2 


PPSPGVFGNPPSPGDPLGKTFFFPRGPP 
PRFWGGGFFFILAPFPFGGLFPPFPLPP 
SQGPPFFPFFFFFFFFEMESRSVSQAGV 
QWRDLSSLQPPRARFSDSPLYFLEGR 


12534 


26435 


A 


12654 


310 


444 


TYLSNIFYFILFLVETESPSVAQDGVRW 
CGLGSLGPPPPGFGRFS 


12535 


26436 


A 


12655 


336 


411 


DQPGQHGETPSLLKIQKLVECGGAH 


12536 


26437 


A 


12656 


114 


2 


GIFFFFFGLESRSVAQAGVQLCDmSLQ 

SPPPGFKQF 


12537 


26438 


A 


12657 


191 




KMFILTFSFRGAKWGDVFFPPPGYIQTG 
ENFFFFFFLKWSFTLVAQAGVQWCWLSS 
LQPPPPR 


12538 


26439 


A 


12658 


1 


445 


LRTGSEFSGRDSKGLAAAEPTANFGLLL 
ASIEDQGAGGGGYCGSRDQVRRCLRAKL 
LVLLTEAAGVAGVALGLGVS EAGGALAL 
GPERLSAFFFPGELLLRLLRMIILPLVG 
CSLIGGANSLDPGALGRLGAWALLFFLG 
TTLLASAL 


12539 


26440 


A 


12659 


2 


224 


ESTASRIAFEAWQPETLPKGLNYSGASP 
WLNAVLPLKKKKKKKKKKKKKKKKKKK 
KKKGGGVFKKKKIKGGGG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 

NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

residue of 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12540 


26441 


A ■ 


12660 


96 




GLKDGRGQVQWLTPVISAPWEAKVGGSP 
EVMS 


12541 


26442 


A 


12661 


1 


457 


CWVHLQKGCHFTLGRLHSGQAEALLTSQ 
TGRPGRGAPHIPDGAARQRRSSHPRRGG 
QAEALLTSQTGQPGRGAPHLPDGAAGQR 
RSSPPRRRAAGQRGSSPPRRWAAGQRLS 
SLPRQGGRAEALLTFQTGRPGRGAPHIP 
DEAJ«3QRGGAG 


12542 


26443 


A 


12662 


175 


351 


KGIFFFPPSWKAGGAIWFKGTFAPRGKG 
NPPPNPSGBGGPKKKKPGPQIPRGPPLG 
PPV 


12543 


26444 


A 


12663 


247 


401 


SSNQEPGCKGPCVLTNFYFILGRAWWLM 
PVIPAIWKAGTGGSPEVRSWRPT 


12544 


26445 


A 


12664 


152 


251 


RENSCCFIKKKKKKKKKKKKKKKKKKKK 
KRGAP 


12545 


26446 


A 


12665 


2 


365 


APTRSRCLRHASCTACRTVSTDTSSLRR 
ADPKGRSALLADIQQGTRLRKVTQINDR 
SAPQIBSSKGTNKBGGGSftNTRGASTPP 
TLGDLFAGGFPVLRPASQRDVAGKBEFI 
LAPLWTVSP 


12546 


26447 


A 


12666 


179 


1 


LELSKSGSHWMRHFLSRSFYSLISPQLN 
TTVWPTI ITPILLTLFLITQLKILNTNY 
HLP 


12547 


26448 


A 


12667 


279 


35 


NQAPFKARFFTTFKNVFLKE IKKKRCWS 
GAPPI IPPPWGGGGGGPPWGGNLSPPGP 
PVSTPFFNGAQKKKKKKDRYGGACL 


12548 


26449 


A 


12668 


299 


399 


ISIIKKQRKGPVQWLMPVIPALWBAKAVG 
SPEVR 


12549 


26450 


A 


12669 


1 


385 


KMQNRGFFFLTFLALLGLTSGAAKKKNK 
GKKAGPGSKSPNWPWGPCPPSSKDCGGV 
FRKGTWGAQTHGIRGRGPCNWKKEFEAN 
CKSKFKNWGAGDGGPGTKVRQGTLKKAP 
SIAKGQETIRVTKPCT 


12550 


26451 


A 


12670 


357 


1 


AGKIAKI CPVSSMQAPTCGFPVGGNDNQ 
GQAPDGQFQPPLQQNQTSSPDFSNENSP 
ATPPNEQGQGDAPPQLEDEEPAFPHTDL 
AKLDDMINRPRWWPVLPKGELEVLLKK 
GIDFSKK 


12551 


26452 


A 


12671 


107 


373 


TGNYTPLEDCAQEQMRLIAQVHLLETRV 
KQQQVKl KQLLQENBVQFLDKGDBNTW 
DLGSKRQYADCSBTFNDGYKLRGFYKIK 

PLLNP 


12552 


26453 


A 


12672 


196 


45 


YLYFHGSREKRLAKKYYDKLFKECCIAD 
LSKYKENKVCFPHIYGKRFLFK 


12553 


26454 


A 


12673 


33 


448 


KEGAERAGAAAPWS FLLDRRDSARTRS 

IDANEAIHSGPEENDVPGGPGKPQPCVG 
SESEHTASASAGDGPGVPAHASDHGLPP 
PDISVLSVIVKNTGLSRAGWRPWVDV 


12554 


26455 


A 


12674 


250 


29 


KKQTNMWQSPYEDYRIFYTISSSLYHLS 
lYPPIYLSI YLSIYLSLYLPTHLSI ITD 
YVS ILEHELRPLWRI QP 


12555 


26456 


A 


12675 


377 


442 


SDRQWWCMPIVPATWEAEAGES 


12556 


26457 


A 


12676 


1 


553 


RIFPGRRFRVKLPSCPDPAMGTRLLFWV 
AFCLLGADHTGAGVSQSPSNKVTEKGKD 
VELRCDPISGHTALYWYRQSLGQGLEFL 
lYFQGNSAPDKSGLPSDRFSAERTGGSV 
STLTIQRTQQEDSAVYLCftSSLATAWHS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

lorr'eind 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

relidueof 

seTul'nce 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RLLPAHKPHPSLSLQLLDTLNRGFSLLL 
PPHGKQVDLDLSLSFG 


12357 


26458 


A 


12677 


232 


3 


GWSSDCQQRARVIEEKHSRQKGHIAENP 
SVHRHHQRPKVDKTTKMGKKQNHKTGWS 
KTQSASPPPKERSSSPATCI 


12558 


26459 


A 


12678 


134 


3 


IF IVEFGGKPFS CTS ISLSQWLWCLFI G 
IGELLWGQVSTGTLY 


12559 


26460 


A 


12679 


212 


47 


NGTHPHGYTLHTHKHTHSCAHTHTHTHS 
LLLVRKSFLPQKNTTISFRSCCKLGKP 


12560 


26461 


A 


12680 


224 


2 


MKMASFLAFLLLNFHVCLLLLQLLMPHS 
AQFSVLGPSGPILAMVGEDADLPCHLFP 
PMSAETMELKWVSS SQCI 


12561 


26462 


A 


12681 


35 


397 


KELISQNKIHSVSSSQTGFFFFFFFFGE 
KKPPFAPRGGGKGPPPPLPPGGGNRAPP 
RGKKKGKGKPPPKNPGAFPPQKEKKTPG 
GGGGKKKTPAPPTPKBVDQPLPRREQRG 
PGRLPPRGA 


12562 


26463 


A 


12682 


188 


514 


HTTHLVyVLSMAAFFFFFFEKNFLFAPR 
VEKR6KDLGSLKLPPPGFRHFSGLTLQG 
SGNNGAPPPSPVIFLVFFEKRGFPLVGR 
EGLILPPLQPAPFCISFRGAINGPS 


12563 


26464 




12683 


366 


3 


I SQTQTQWTDWVSLELQGCELAHRDTVA 
I PQTRSFPFPLMIS CMTLFSLLQMDRAF 
PPFPPQTPITIQGQISTSPATWPLTHLH 
STPGLSVEYSSNTHKSSLSPTSQWIRVD 
PMLASPTV 


12564 


26465 


A 


12684 


99 




SSCIRFFLEQAYGQQSYGTYGQPTDVSY 
TQACI 


12565 


26466 


A 


12685 


247 


31 


FLKEKRNQFYLFIFQRWLDPNKPIRKQL 
KRGS PYSLNFRVKFFVSDPTKLQEEYTR 
WVYGYIFLENIVKTLI 


12566 


26467 


A 


12686 


416 


253 


PSPMTI PVTGAPRDADLWSSHAKMIAQP 
LKDSDVEVRFLGSSQIFLCWRPSFNL 


12567 


26468 


A 


12687 


116 


3 


CLLGRMWWLMPVIPALWEAKAGRSLEVR 
SLRPARPMY 


12568 


26469 


A 


12688 


380 


1 


SPESSKLGLWSKHVSPLLCIGVCPPPVS 
MAELRQVPGGRETPQGELRPEWEDEVP 
RSPVAEEPGGGGSSSSEAKLSPREEEDL 

DPRIQEELEHLRQASEEINQVELQLDEA 
RTTYRRILQKSASV 


12569 


26470 


A 


12689 


157 


488 


REFVSQGGSHLKAQVRLEALLLTIGTPP 
WAHLS ILHMTAMGQRENKREAASSRIQV 
LLNHSQAI KQES 1 1 LFFEMESGVWPRLQ 
YSRM I SAS CTLFLSGSNNS PVSAPRI 


12570 


26471 




12690 


514 


3 


PS IRAGLLCGSAENATPFLCGITiVlAAGP 
L YT Y PENWRAFKAL I AAQYS GAQVRVLS 
AP PHFHFGQTWRT PEFLRKF PAGKVPAF 
EGDDGFCVFES lAIAYYVSWEGLRGSTP 
EAAAQWQWVSYADSDIVPPASTWVFST 
LGIMRHNKQATENAKEECMRPLRVSSLV 
GP 


12571 


26472 


A 


12691 


304 


4 


AESLRVAHERLDTRSTSSDIFNFPQTQS 
NLEMNSE I LESWANYQSSTS YS INTELS 
LFSKVNGKFSTEFQRMKTLQVKDQAITT 
RVQVENLVYTVKINPL 


12572 


26473 


A 


12692 


221 


2 


VGPATRDLCFADEPWGRQKQKQWWWVL 
CLSPNRGLAGRPDTLHITCASAHMRTHT 
CMHTHEHAHAHVHTHCI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 

— 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 

corrcspon 

last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucirie, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireoni'ne, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unkno'wn, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


12573 


26474 




12693 


3 


205 


YMLCSILSTQEAFSICLGEKFIYLFTYL 
FRDGVSLLLPRLECNGVILAHCNLRLGD 
RARLHLSKKQK 


12574 


26475 


A 


12694 


233 


3 


IMG PAHL FKGNHWMGGWGLMYKAFF FL F 
FFKTGSHSVAQAGTIPVHCNLRLPGSSD 
S PAPAS RVAGNTNVPHHTL Y 


12575 


26476 




12695 


408 


3 


PAEEMEKFRVCVRKRPLGMREVRRGEIN 
IITVEDKETLLVHEKKEAVDLTQYILQH 
VFYFDEVFGEACTNQDVYMKTTHPL IQH 
I FNGGNATCFAYGQTGAGKT YTM IGTHE 
NPGLYALAAKDIFRQLEVSQPV 


12576 


26477 


A 


12696 


123 


631 


REAVQGGKGSGGGFGSRDSRSSGAVSAA 
VGDMGDPGSEIIESVPPAGPEASESTTD 
ENEDDIQFVSEGPSRPVLEYIDLVCGDD 
EWPSAYYSDILFPKMPKRQGDFLHFLNV 
KKVKTDTENWEVS KNHCRL S KAKE PHF E 
YIEQPIIEEKPSLSSKKEIDNLGASDCW 
D 


12577 


26478 


A 


12697 


429 


629 


LALLYPLKWVGKPVNFMEEDILGPLPPP 
HEITLKHGTKT 














12578 


26479 


A 


12698 


158 


404 


LPLLSVERLVEI YAYTFIQDLFSASlSri P 
YSSSVFRIEMDQPRTHSGPTTASWPAPS 
STNSSSflPSATNSKQERSSSSLSKPS 


12579 


26480 


A 


12699 


185 


3 


HRRPIS IPSSVIHPAMVHLGLQYSQGLV 
SGSM-ARCIALLRALQQVCPILLSLMIQP 
HLVY 


12580 


26481 


A 


12701 


181 


3 


TVWASMFLSAALRARMGLAftHWGTHVR 
HLHKPDMQNGAGGALFVHRDPRENNRDT 


12581 


26482 


A 


12702 


373 




ILVQERDSQRVIRMIDKLVSSRDRVGRG 
VEKQMTDEIRVIYSVRKLQKRITPEPFIP 
SRTTPKLLFLLVLPSGKTPGHPVSSRT I 
PEPPLPTEPLERIPEHPVPSGTIPKPPE 
PPLPIEPHETMY 


12582 


26483 


A 


12703 


272 


508 


TKLGKKQNRKTGNSKQQSASPPPKERSS 
SPATEQSWMENDFDELREEGFRRSNYSE 
LRED IQTKGKEVENCEDNLEE 


12583 


26484 


A 


12704 


341 


1 


VPQHPHRPLAPPSLGPQSWPLMEGSRPR 
SSLSIiASSASTISSLSSLSPKKPTRAVN 
KIHAPGKRGKMiRRDPNLPVHIRGWLHK 
QDSSGLRLWKRRWFVLSGHCLFYYKDSH 


12584 


26485 




12705 


187 


365 


WEPSCRGPSAFKPTRCQSYDWACMCGAE 
GRSAMEQPQEES PE VREEEE lEEMAHAE 

GTP 


12585 


26486 




12707 


238 


596 


LWLVRVKYSIMSGAALGLEIVFVFFLAL 
FLLHRYGDFKKQHRLVIIGTLIiAWYLCF 
LIVFILPLDVSTTIYNRCKHAAANSSPP 
ENSNITGLYATANPVPSQHPCFKPWSYI 
PDGIMPI 


12586 


26487 


A 


12709 


131 


2 


DRVSLSPRLECSGTILAQPSRLKQSSHL 
S LTGSWDYTHAAPC I 


12587 


26488 


A 


12710 


151 


2 


YRQGLILLPRLECRVMIiyiALCWFESPGL 
SDPLTSASRVASTTATCHHTC 


12588 


26489 




12711 


151 




NFPEFDHFTVGCLRAGSSLIHYWWECKL 
VQSLWKAVWRFIKDLKIDLPLY 


12589 


26490 


A 


12712 


28 


411 


RWPAHPAGEPREPHVSWVMKLWPQQAP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

ing to first 

residue of 
peptide 


Predict- 

location 
correspon 

last amino 

residue of 

seq'uencc 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LYGDCWTVLLAEEDKAEDDVWYLVFL 
GSTLRHCTSTRKVSSDTLETIAPGHDCC 
ETMKVQLCAFKEGL PVFDVTEKYFLL PR 
MYRYYRCLYTYPSAAV 


12590 


26491 




12713 


351 


675 


ENPRHTFIYSLPLGLQKHQVLTVDIGFG 
GTAIMTVGKSSFfMLQHIDYRKRWILQDG 
RIFIGTFKAFDKHMNLILCDCDEFRKIK 

PKNAKQPEREEKRVL3LVLLR3EN 


12591 


26492 


A 


12714 


326 


457 


ACSHHQRKSQAPPPAHLKPDCPHPGSLP 
LLKP S VCXPGREVS PT 


12592 


26493 


A 


12715 


200 


1 


TITKVCVFTRTCGLKGSLMAKALFDQRK 
DRESETECQVWWLTPVIPALSEAEVGGL 
LEPRSSRPGV 


12593 


26494 


A 


12716 


208 


405 


ICKERENKKHTWKKGRKKMVHICQWHSSL 
HRKFQGIYRKTTGTGWAQWLTPVIPfiLW 
EAEVGGPPEV 


12594 


26495 


A 


12717 


157 


3 


AKETPTHKGSCKKKFFFFFFETESRSVA 
QAGVQWRGLGSLQAPPPGFTPCI 


12595 


26496 


A 


12718 


1 


472 


SPAILPRLAILPYLLFDWSGTGRADAHS 
LWYNFTI IHLPRHGQQWCEVQSQVDQKN 
FLSYDCGSDKVLSMGBJLEEQLYATDAWG 
KQLEMLREVGQRLRLELflDTELEDFTPS 
GPLTLQVRMSCECEADGYIRGSWQFSFD 
GRKFLLFDSNNRKWTW 


12596 


26497 




12719 


537 


1 


LHTMNGGNESSGADRflSSPVATSVPIGW 
QRCVREGAVLYISPSGTELSSLEQTRSy 
LLSDGTCKCGLECPLNVPKVFNFDPIAP 
VTPGGAGVGPASEEDMTKLCNHERKAVA 
MATLYRSMETTCSHSSPGEGASPQMEHT 
VSPGPPSARPPCRVPPTTPLNGGPGSLP 
PEPPSVSQACI 


12597 


26498 


A 


12720 


369 


3 


AAKI IPATRKKASLELELPS SPDSTGGT 
PKATISDTNDALQKNSNPYITPNNRYGH 
QNGASYAWHFEARKSQILKCMECGSSHD 
TLQELTAHMMVTGHFIKVTNSAMKKGKP 
IVETPVTPV 


12598 


26499 


A 


12721 


230 


3 


kffwvlagltgknsdasas lnqvh i s pf 
lfsrhSpfslgplsspvllqgskrrqll 
atlralesaslsqhphvpv 


12599 


26500 


A 


12722 


26 


110 


reqyaegnmrgpapgkktsglqqknvev 


12600 


26501 


A 


12723 


224 


1 


wqnqdqasdppkysftsqcclsfarlar 
rygdvfqirlgscpiwlngerailqam 
vqqgsafadrpafas frv 


12601 


26502 


A 


12724 


120 


530 


KKVARGRSRSRERSRRRRSRPKAIT\NR 
T*GTRCTPRRWRSTVLGMRSCTRAR*QR 
RSGLSRGHTRSAGLCDHGSVRAGSGDGA 
DGTGGGDRRLGLGRDS LLL S S S Q SAAF S 
SSASGSSSFFSATQPLRMLLEYFWL 


12602 


26503 


A 


12725 


434 


222 


KEEKSRGEVKEQD*EKDREEEVEKSRE/ 
RRRSRRRRRRSEV* YRKGKRRREREDI L 
VAERSHRSFRNSIPSTL 


■12603 


26504 


A 


12726 


268 


370 


SNILIHMESMFFFFGTDIPSV*KNLPA* 
KTPGPDGFIDKLYITFRGELTTFPHILL 
H*FTEGAVLFNSFSK\AASITLTPKPNY 
DIMRKENYSPISSYIWNQCSFFLEQIFP 
LSFKLNCPGLIIPH 


12604 


26505 


A 


12727 


103 


636 


VCFISMKQPHGSRHPPCS/PPRPGS*VC 
QQSRPLGPQPSAFGQLA*HRRGLLGLQG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
!ng to first 

residue of 

sequence 


Prediet- 

correspon 
ding to 

acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S-=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PRSPMQPVPHGGRVSGQTLPKGPTPKRP 
PGPAPRHGGHSWLLK* LVCAQPPPRP * S 
*ASQAVFTLQVPGKPQVWTPCPVPVRAP 
TP\PLSNGGLGVSERH*DGH*QAPTSP* 
PRCQGGAGEGPQ 


12605" 


26506 " 


A 


T2728 


1193 


1638 


GLGFAMLPELALNSWPQGDPSVPASRAA 
GTRGVHRHTQLQVS FNYYKVLAMHSGSQ 
L * S QH FRKPRRSDH LRSGVRDQ S GQHGE 
TPSL/LKNTRI SWAWWHTPLVPATREAE 
ARELPEX PGRQKI AS E PRVKP I CTAS LG 
NTCETPPQKK 


12606 


26507 


A 


12729 


304 


101 


RHLHWPCPSPLAPTPDISHCPEPPKSQT 
SP/CPGT*ClflFLKCPSSCHPA*K/PLH 
PSEPARKSLAPDTCP 


12607 


26508 


A 


12730 


91 


264 


SQRSISGLRVKENLIFVMIMFPPIYSSQ 
TFWSQTPLMLKrVFMGISISICYIL/NT 
EK1ILGQGWWLAPIIPALWEAEAGGLL*L 
RVKENLI FVMIMFPPIYSSQTFWSQTFL 
MLKIVFMGISISICYILKLRKI 


12608 


26509 


A 


12731 


1026 


406, 


LAHFRSQ I FSFSHI LVHFERMVLNYRYY 
LACVERWFRFLHVLAYPGR IVLKYRYF 
LVHFRREVPRFRHMLACERRMVLYCYFL 
VLRKILLIYVTFLTYFRKVFLHHNFLRG 
G Y*RKYFGYAPRWL S FFLYRCQCFLHYF 
LFYLWLRLHLQF/CCFWSFCLFDFLFL 
FSaaCVFSVLQIAIVMFPFHALQLFFF* 
WLCFSNSRPQ 


12609 


26510 


A 


12732 


1508 


157 


QDVGGSGPKVDTHPRGRMAS I FSKLLTG 
RNASLLFATMGTSVLTTGYLLNRQKVCA 
BVREQPRLPPPSADYPDLRKHNirCMaEC 
LTPAIYAKLHNKVTPNGYTLDQCIQTGV 
DNPGHPFIKTVGMVAGDEESYEVFADLF 
DPVIKIiRHNGY*PQG*LKHTTDLnASKI 
T\QGQPDEHYVLSSRVRTGRS IRGLSLP 
PACTRABRREVENVAITRLEGFKGDL\A 
GRY\YK\LSEMTEQDQQRLIDDHFLP\D 
KPVSPLLTCAGMBSDWP\DARGI \WHNY 
DKT\FLIWINEED\HTRVISMEKGG\NM 
KRVILSR\FCEG\LKEVERLI\QERGWE 
F\MWK*AP*EYILT\CPSCILGT\GLRA\ 
GVHVRDPQSPSQDPTAFLKILEKPRTPR 

krghkvvwdiaa\vadvyd\isnidrig 
rsegel\vqividgwylvdcekklerg 
qdikXvppplpqfgkk 


12610 


26511 


A 


12733 


261 


487 


TGSETDCAKPQASFLPREVPTVAEMKME 
LLKNKQFW\RGWAHBiC3StPSTLGGQGGW 
IT*GQEFETSIiANMAKPHLY 


12611 


26512 


A 


12734 


382 


668 


YKRITDIPVDSETVHILINKRQSCRIPG 
FIQLVQLISHQLAAPRDYTVSHSVAQAG 
VQW/RKLGSLEPL?PGFKRPLCLS*HAL 
KNLSSCDTPPQY 


12612 


26513 


A 


12735 


401 


27 


GDR2\EESAEPRAWSHSDNSHRYTTLFIC 
LTHTHVHWPVHS \HTHTHTHTHTHTHTH 
TVSHRHTETPPLLLKQTGLKFY*NSRDD 
TPRSRPGSSGLQRLSSSPPVPFQPGTVB 
ASADFCGHDLLTT 


12613 


26514 


A 


12736 


202 


182 


KYLPIFINLITMNILFFFETHSCSVGQA 
EVH* S * LKPMPPG \ S SDSPASAFQVSG I 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

location 

correspond 

amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienylalanine, 
G=Glyeine, H=Histidine, I=Isoleucine, 
K=Lysine, D=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \==possible 
nucleotide insertion 














TGSPSSSWLIL*FFLKIGFTKKG 


12614 


26515 


A 


12738 


319 


2 


QNTPPHIYEYIPRVDAFLLFLEMSYSVY 
WAVLSPLYFWKLALTKHCRIjMjNSFL 
HKIQEPS\LGSGSG\PLSCErTII*RLCL 
RSLHLVFRSLWILPICDSSSVF 


12615 


26516 


A 


12739 


1363 


2000 


DRVFVIPGWKCSGTIKVHCSLDLPGSSN 
PSTSAP * VAC\ATGTYPPCPGLNFLYF * 

meergftqvaXqagsqnswgskpngtse 
aspicvlgmnkp*nlpnpgrsiln*dgsi 
rykdnypa/weppkrs ykclrqkks i *l 

SAGPRDYEASQPRKNYPISSLLTFCTSS 
LHFNPTELTCSSQLQKSIRRKLKSEESC 
PPLPACSLKHTQAIIKITF*RLHRTA/L 
FYLLK*NCFRN*S* *KyFLLSVSNLKI / 
QWVRSSLHPGAPGYTPTV*RPKHKSNVL 
I*IQS/ITRCLFYILVYMRTIFPLQLFL 
SKHPSTYKEDFA*VLPLSFFFFEMESRS 
AARRSLSSLQPLPPRLKRFSRLSLLSSW 
DYRCAPPGIiASPSILSS*GSCCSIQFIL 
RMSTISHAINVLVLKNTYLVLVSASEHS 
LIKKPC 


12616 


26517 


A 


12740 


1696 


743 


GGQIMRSGVHDQPDQHGETLSLLKIQKS 
AGPGGMHL*SQLLRRLRQEHRL\NRGCS 
E PR/ SRHCTPAWATEQDSVSKKIKK* KK 

QLQDGRRELaADMTL*?QHSKRLGftLPR 
PLLSAYYFNKVHAARRQAYLETPGTFTS 
YQREEGEQaLILGVEAQASSPTVFHEEE 
RQSQALCSHRKSQEAPVRPAHPPRRVPG 
LGKPSSQGLSAHLGQDRAPflPRRASWDQ 
RSQAPI SVTFPSVLDKEESVPCGPGFPG 
HAPAPRGIHGATWEGAHSRGYPGHFLAL 
PQHNSDEQRPN 


12617 


26518 


A" 


12742 


2473 


445 












PEQGRRRGAPTHGPQLIMMDLELPPPGL 
PSQQDMDLIDILWRQDIDLGVSREVFDF 
SQRHKEYELEKQKKLEKERQEQLQKEQE 
KAFFAQLQIiDEETGEFLPIQPaQHIQSE 
TSGSANYSQVAHIPKSDJiLYFDDCMQLL 
AQTFPF\VDDNEVSSATFQSLVPWYSPG 
HIESPVFIA\TNQa\QSPETSVAQVaPV 
DLDOIQQDIEQVWEELLSIPELQCLNIE 
\NDKLVETTMVPSPEaKLTEVDNYHF\Y 
S S I PSMEKEVGNCS \PHFLNAFEDSFSK 
HPLHESIDPNQLTVNSUSrSRM/P'tWrDF 
G*WNFILVF\MaEPSIRQQAWPSPATLS 
HSLS\ELLNGAHGCFLDLFTL/CKAFNQ 
NHPEGTA\EFHGF\DSGISIJTTSP\SYA 
SP\EHS\VESSSYGDTLLGLSDSEVEEL 
DSAPG\SVKQNGS*NTMYSSSGDM\VQP 
LSPSQGAEHFTCMDAQCEEHTRGKDLPV 
\SPG\HRK1IPISQKDKHSSPLGGLISQR 
DEL\RAKAL\HIPIPCRiCKSENLPVG\D 

fnemmskeqfneaql\alir\dirregv 
rnks/aaasgicrkkenweniveleqdl 
dhlkdek\ekllk\ekgendkslh\llk 
kqlstlyl\evfqhayrdedgkpyspse 
yslqqtrdgnvflvpkskkpdvkkn 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12618 


26519 


A 


12743 


6 


457 


rprnrpgirvdprvrgrvghapheglvp 
ermyrgsptacetqaaa*erafgpsspt 
cVrlpiprmstsvpqgHtwtqrvkkdde 
eedpldqlisrsgcftashfavqecmaqh 

QDWRQ CQ P QVQAFKDCMSE QQARRQE EL 
QRRQEQAGAHH 


12619 


26520 


A 


12745 


297 


12 


QERPVRMLVLHFGRLRliADHLKLGVRDQ 
PGQYGETPSLLKI P KLSGM\MWAP * I C 


12620 


26521 


A 


12746 


98 


428 


YNTSFNETVLLLTTPITIDCLYTRKDAI 
APESACGVCPCSLLGVGA*IPKWIRLD 
LSKKHVTTAYGGFMCAKCVLDRINRAFL 
IDEH\KIVPKV*KALSQSSKAFCEHET 


12621 


26522 


A 


12747 


3 


638 


LWLWSLCWVWAVSLPLQFILGSLHPCQG 
QASWREVDLLHEGSQEASSKPSSGSVPV 
GLLLDQEMVTPLLVCICGILLHQEMVSP 
LLVYTRGILLHQEMVTPHPGLYPWDSPP 
LDGHSPPGLF\RGAAPPS*GG/PLPVLV 
YTRGAAPPSGDS\PPPPGLYPWDT\PPS 
GDRHSPPGLYPWDTPPSGDGHSPPALYP 
WDSPPSGDSHSPPGLFPRG 


12622 


26523 


A 


12748 


2 


333 


DMVLLCHPGWSAGS I * KKTKERERMMKL 
HR**IDRERERERERQIQ*KE*KRINEF 
EGSKERFTQSVKQRNK/ELEKIRTLDTS 
ETIYLSIYRN*SLRRGQERRTERERMRH 


12623 


26524 


A 


12749 


30 


333 


KTSYLLPVQWWAQNDNERYSSSKNTIMA 
LPLPLPVFPRSPSDAERKLDCSAAISflH 
CNLPA\DSPASACRVPAZAGARRHA* LV 
FGFFWWRRRFAVMAGLVS 


12624 


26525 




12750 


231 


39 


INDLL/CLF*FKICLIWGRVW*LMPIIPA 
RWEAICAGGSFEPRSLKLQ*AMITPMYSS 
MGGREQDPVS 


12625 


26526 


A 


12751 


49 


273 


HLQVTEVFWFWCVFFRRWGGSHCV/AQ 
AGV*WLFTGTWPCCSPELLGSRDPPAS 
AS*VAGTTGACriAfllirCRGF 


12626 


26527 


A 


12752 


505 


897 


SYLRVQFYSAFSLFFRHEVLPCHTGWISrA 
W*SQLTTASN/CLGPSNLLK3iPHLAW* 
KKI F* RQGLSMLPRLVLNSWPHMILLP * 
LPE*LGLQARATAPGSGFTFCLTQDSIL 
MCSPTVHKLSDSIALETKQ 


12627 


26528 


A 


12753 


335 


542 


CCNEFFLSQVWWLMPWPATR/SAEAGG 
LPGPGSQRLR*ARSKPVIISHCSaGGRCG 
IDPISIKQQKNNMR 


12628 


26529 




12754 


356 


72 


WHE YYLLMEHTKKCHLS *GLYNGLNN* Q 
/WCTHPWPATWEAKAGESLEPRSSRL* 
CTMITPANSHCPPAWATARSCLNQSINQ 
SSNNWQGMITGGK 


12629 


26530 


A 


12755 


340 


127 


NYLFIYFRDSLTMLPRLECSN*FTGSII 
VHI TLKLLGSSLP /ASASQVAGTTGTRH 
HVQLSNYFKIKIKPKP 


12630 


26531 


A 


12756 


290 


511 


KKNQPGT/CGSRL*SQRFGRLRQADHL\ 
DQLGQHGKAPSL/LKNTKI SWALWRTPV 
FPSSQEAEMEELIEPTSSRLQ 


12631 


26532 


A 


12757 


254 


549 


YPGAKKQRPGGDSVRGTHLQSRCWCVLL 
fflrCQ*PKSNTQTFP*PKNLRE*NAT*KS 
TDL/WPGAAAHLCNPSTLEG*GEWIT*G 
QEFETSLGNIPRPQIY 


12632 


26533 


A 


12758 


617 


451 


NKKRRGLTLLSRLKCNGVIIAHCSLKYP 
GSSEPPTSAF*VAGITY\GT*HHAWAY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, Ij=Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12633 


26534 


A 


12759 


91 


393 


KWYTKECLYLLVS I FSRYLLSIYYEEVL 
AQTLGKLSKTVMCPALEETVAVNGLQIK 
S KK* / WLGMVTHACDPSTLGG*GGWIAW 
GREFGTSLASMAKPCLC 


12634 


26535 


A 


12760 


10 


221 


QSHNTNGCLDLHFQKGPWQSVGETLNL* 
TI FLCVCVCVCVCVRHVYHM/CYI YSDF 
ICPSIECEYFSFAHR 


12635 


26536 


A 


12761 


309 


55 


IIRYKNYISIKLEGWKRLYHATTHEKKV 
GVAVLISEBCVDFRAKN/MYQDKEGRFLL 
*R 


12636 


26537 


A 


12762 


640 


896 


TIVLKCNFVLFCFRDEVPLCCPGWSAVA 
/QSQTQLW*TTQPGLKQSSCLSLLSSWN 
*RHVPPHLAMFLKTTFTLLYTCGHLETN 
IC 


12637 


26538 


A 


12763 


1 


357 


IVPLHSSLGDRVKPHLLKKKKTCSTSLV 
IRETQIKITRYTYSNG*S*RKNKAGISINK 
CWHGYTATGTLIAD/CKSKQMLWKTF*Q 
FHIKLNIHY*YNSEIQFLGLYPRDKNIA 

RCGGLRL 


12638 


26539 


A 


12761 


129 


i20 


RWKCVKLOTTGQAWWLIPVI PALWKNQX 
AEGSL^ARSr RLQ*TLIAPLGYSLPG*W 
SGTLSLNK 


12639 


26540 


A 


12765 


617 


435 


GVKGLILPKLQKLLKSIE/I/E*KLPKS 
FYDASLTMIPKUIKD\RPISHVKIEaRI 
LHKISINII 


12640 


26541 


A 


12766 


243 


5 


QKIKKRIKKNKiQNL*EIWDYVK*PNL*I 
IGIPERGEKVYNTEWIFEGI IQ/ESVPN 
L/AYLVREVnUQlQEIQRTPTOYYTK 


12641 


26542 


A 


12767 


101 


16 


RTQLFVSVFWFCFLFCFETGSCSITQVG 
VQWCHIiRSLQLPPPP\GSRDSPPPSSQN 
RG*TNI 


12642 


26543 


A 


12768 


362 


92 


RVPHMDLGBECTSPYMYI*PMTCVFKHG 
*DCasrFCVTYFATI 1 1 YTYI / YLSVCVS I 
* I*CYLCVCVyiYIYIYIYIYIYFIYKY 
MKERRL 


12643 


26544 


A 


12769 


1161 


824 


QAKIILSTHFTKENM*MG/NKQMKRLSES 
LVISEMQ I KTTMRSHLTHTR I T I I KMTS 
DKICP*RCG*MRA/LLQC*WKYKIVSPL 
WKIVWPFLKT\INMP!IISCNLMILFLGFY 
PIY 


12644 


26545 


A 


12770 


241 


237 


K* R IWA\GVMflHACKPSTLflGCDGWIA* 
SQEFETNLSNTGRPCLYER 


12645 


26546 




12771 


326 


6 


DTVSRKNKSGKIFQLSSRV* I YERSQSG 
VKVYKCKTFGKAFTQ/HF*AHMRMYTGE 
KP YKY* E CGKFFI LVLLLLLM IQKYFHL 
I KIVRLYLIRKKVSCKQPSNKILQS 


12646 


26547 


A 


12772 


413 


1 


KKTFYFFFFETTQAGV* *CDHNSPWP*P 
P\GSDNPLTSAS*MA\GLQSCMHHHAQL 
S *FVFLEAGSHYVAQTDGDSPTSASQSA 
GITAVSHHTRPILYILIVLSYAISFFSI 
LYSAYFLQPHLPIHNKFLFSYISSSC 


12647 


26548 


A 


12773 


2 


1055 


FFFFLRLNLTLLPRLE/CVILAHCNLHF 
PG*SNSPASASRVAEITGSRHQARIIFV 
FLVEMGFHHVSQAGLE/SPDLK*SAHLG 
LPKC*DYRRDITPGHH*SF**RYFLKTY 
ISRVL/WCSLLGTFMFMIAFDHM*TW*H 
* QNHFHFLLQIWKLI SERLRSAQGQQAL 
KTVTSTFSSKYVCRAFLAEPGWIAI LYV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

lacation 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamlne, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, '=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TF*SLDSRRTCLWAVSNFFYSTLFLKL 
RQQVLLTG/PYHLFSARPSSCGSDKKAR 
ELNLSLSL\CFFFFGKRVSGTQAGVQ*H 
DHSSLQLWPSRLKQCSHLSLLSSWDYRS 
ITPHLAMF/CNFLSGWSRTPGLK 


12648 


26549 


A 


12774 


315 




FRCFLNSINPLESPHVEKTCflGRAQWLA 
PVIPPFWEAEA\SQMPQVIHSR*LEARS 
SRPAWPTWQKPCLKYKN 


12649 


26550 


A 


12775 


275 


59 


NPSPYKIQKISWAWWYMPWLATR \RLQ 
*flEIAPLHSSVGDRARPCLFKKQKTWKQ 
TKTSGEPVLVQGPATVC 


12650 


26551 


A 


12776 


73 


381 


SLLNSELSSIPRPTWESPSTSDSAWPMQ 
NnASEPLPLP*TTPPPDFPALSA/PE/P 
LPFTPSPElDTIiPPPLSPSYKQAPVLSVH 
PPVMPLKYVPSSPLLLSFLYL 


12651 


26552 


A 


12777 


274 


342 


LNCVIA*WIiMPIISALWEAEVGVSFA/P 
/RSLRPAWATW 


12652 


26553 


A 


12778 


2 


478 


Tl i! ilTKlirrFUVJJNiiUmiiLTAi'FCVCV 
GVSLNGFKNAKDYGGSHLKYSVYITGFL 
LQFSLKFDCFLCVCVCVFP*MA*KMPKI 
M/RGSHLKYSVYMT*FLF*FSLKLKYFC 
DPILRKRI/WPGAMAYTCNPSTLGG*GG 
QIT*AQEFETSLAKWKPCLY 


12653 


26554 


A 


12779 


680 


453 


SETDWRKNKFFPSNFPSNLETNSWTNSC 
S KKTFKKH/RVGLGVIiAHTCNPSTLGGR 
GGWSP*GQEFENSLTNMVNHFS 


12654 


26555 


A 


12780 


59 


487 


SLHKHPERGLQSFQGCWTQEMLGGSHVQ 
QRAWELCAPPNLTCPGYLSFFIiRQnQAL 
LSKLECSQTIPAHENPSLPSPQLKVSSH 
LSFPSSWNSRCTTPHQLLLLFF*DRIRL 
FCPSWSAVAQSQPTVT/SSLPSPQLKVS 
SHLS FPSSWNSRCTTPHQLLLLFFNFL* 
RQVFTMLPRLVBNS *VQA1LPPWPPKVL 
RLQA 


12653 


26556 


A 


12781 


109 


361 


LSPWHHSRNYTLIYPYTUSrFWMSAFPTP 
KEEFFFSFPETRPHSVAQAGAHW\AI lA 
HCGLDLLGS\SDPPTSAS*AAGTTAIIP 

G 


12656 


26557 


A 


12782 


3 


204 


LIDGSLALSC*LFC6GAIIAHCSLELLG 
LGQ \S PVSATRVAETTGVCHHAQLAFVN 
LHLRTRSKHCGH 


12657 


26558 


A 


12783 


2357 


6366 


LTGS\NSHITILTLNI/NMGLNAPI*RH 
RL\ANWIKSQDPSVCCIQETHLT\CRDT 
HRLKIKGWRKIYPSPMGKQKKKAGVAI\ 
LVSDKTDFNPTKIKRDKEGHYIMVKGSI 
QQEELTILNIYAP\NTGAPRFIKQVLSD 
LQRDLDSHTLI \MGD\FNTPLSTLDRST 
RQKVNKDTQELWSALHQADL\IDIYRTL 
HP\KSTEYTFF\SAPHHTYSQNWTT1VG 
SKALLSKCKRTE I IT\NYLSD\HSA\ IK 
LKIjRIiarLTQNHSTTVWKLNNLLLNDYX 
WVIINEMKAEIKMFFETNE/NKKTPTYQN 
FWDAFKAVCRGKFI \ALNAHKRKQE\RS 
KIDTLTSQLKKTREAKSKQHSKASRRQE 
ITKIRA\ELKErETQKKTLQP\LKKISE 
SRSWFF\ERINKIS\RPLARL1KKKREK 
NQIDTIKNDKGDI\TTDLTEIQTTIREY 
YKHLYAXNKLENLEGMDKFLDTYVSLPR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 

bcation 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unltnown, *=Stop codon,/=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 














LNQEEVESLNRP\ITG\SAIVAIINS\L 
PTKKSP\GPDGF\TAEF\YQRYKEELVP 
\FLLKLFQLIEKEGILPNSFYEASIILI 
PKPGRDTIKKENFRPISLMNIDAKILNK 
ILAKRIQQH1KKL1HHDQVGF1PG\MQG 
WFNIRKSINVIQHINRAKDKNHMIISID 
AEKGFDKIQQP FMLKT INKIGIDGT/YY 
FKIIRRAIYDKPTANIILNGQKLEAFPL 
KTGTRTGMPSLTSPLLFNIVFCPIVFWA 
R\AIRQEKEIKGIQLGKEEVKLSLFADD 
MIVDLENPrVSAQNLLKLISNFSKVSGY 
KINAQKSQAFLYTNNRQT/EEAKS *LMS 
ELMSELPFTIASKRIKYLGIQLTRDVKD 

lfkenyklXplikeikedVtnkwkni FI 

PCLWVGRIS\IMKiyiAIL\PKVIYRFNAI 
PIKLPMTFFTELEK\TTLKFIWNQKRS\ 
RIAKSILSQKNRAGGITLSDFKLYYKAT 
VTKTAWHWYQNSMVLVPKQRYI DQWNRT 
EPSEIIPHIYT1LI\F\DKPLLEKNKQW 
GK\DSL\FIKW\CW\ENWLA\1CRKLNL 
\DPF\LTPYTKINSRWrKKKDLNVRPKT 
IKTLEENLGITIQDIGVGKDFMSKTPKA 
\MATKAKIDKWDLIKLKSFC/TICTAKE 
TTIRVNRQPTKWEKIFATYSSDKGLISR 
IYNELKQIYKKKKKTTPSKKWTK\DMNR 
HFSKE\DIYAA\KKHMKK\CSS\SIAIR 
EMQ \ I KTT\MRYHLTPVRMAI I KK\SGN 
NRCWRGCGEIGTLLHCWWDCKLVQPLWK 
SLWRFLRDLELEIPVDPVIPLLGIYPED 
YESCCYKDTCTRMFIAALFTIAKTWNQP 
KCPTMIDWIKKMWHIYTMEYYAAIKNDE 
FMS FVGTWMKLET 1 1 LSKLSQEQKTKPR 
IFSLIGGN 


12658 


26559 


A 


12784 


787 


926 


PQAIRRPRPPKALG*HNSVDLGWAWWFT 
PAIPTLWDYTHEPLYIAKIS*CFKKVYK 
FVLNCIQBrCPGPHAARHS/SGKTS*V*S 
NYLFFSFCFETKFCSVTQftRLQWHDLWS 
LQPPPPGFKRFSCLSLPGGWDYRRIiHTR 
SflNFCIFSRNRVSPSWPGWSPTPDLRRS 
AVLGLPKRWDYRRE PPCPAKI YTIMAPQ 
KVKSHSSQPVSFLS 


12659 


26560 


A 


12785 


248 


285 


C*VIF*CUMILYVLIYVEKITSIRIAVy 
W/CI ITRLFVK* ICSCLLGC*FLFFKV 

ILY 


12660 


26561 


A 


12786 


243 


37 


RRSAGHGGSCL*SQYFGRPTEADHKVRS 
N/RGETLSLVKIQKVSQAWW*EPVVPAT 
RQAEAGAnaWVDR 


12661 


26562 


A 


12787 


75 


289 


DYRHEPRFFFLYTFFEMDSHSVARAGEH 
WCDLGSL*PST\*GSSDSSASTSR\QRH 
HAWIil FGFL*RRGFAC 


12662 


26563 


A 


12788 


83 


256 


RKMYWGLFVTAQN*K*PKYPSTEE/YI 
K/RLWYLHTMEYYSAIKIEKLGTRAQNR 
LILY 


12663 


26564 


A 


12789 


197 


499 


QSLKSVEETVFRNNKQ/SIPTFQVILMP 
WWLIP 1 1 *ATWEG*GRRSPIJIPGV*iDQP 
GQHSKTPSLQKKTKFS*AWWRVPWPA\ 
IGEAEVSGIPLSPERSRL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 


otide 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide Insertion 


12664 


26565 


A 


12790 


394 


91 


FYNSTKFKWKNIQFLITRCRGDQIQVIR 
AHAVQLGFHVL YF/ CLSQNLS I YLYI S I 
YVYTYRYIHXHMC IHI I * YLYI YTY I VI 
KH*KEHWWIKHCFPTERK 


12665 


26566 


A 


12791 


735 


75 


FFFRQEGFLSPPKRGGVPPPHTKKYFSP 
RGS FLWGWGTSRP PPRKCFSPPPPPVFL 
PPPKKKKIISFSPTKLAPPPEYFKSPPP 
PS\PPPPSSPSPSTFFFFKNFPFSPSSF 
SSPPFFFPSPAWEDPPVQSRRIY*FLPP 
PF*IPEIPR\RVLKKKPGEGRSWGLFPP 
PKGPKRRVPLGPEFKTRLGHKTKPRFFK 
KKKKKKRFLISCLWWHMPAIPSTLGG 


12666 


26567 




12792 


'182 




MGVFVFFFi:TGSHSSVTWAGVQ*CNLSL 
LQP/PASSDPPTSDSRVAGST 


12667 


26568 


A 


12793 


405 


169 


fykkkktgsglalsprlecsgti lahcn 
lellgsrslrlXsntptlas * aartvsv 
chqthlif*ffvetgscyvasa 


12668 


26569 


A 


12794 


270 


94 


yvcysykqikicshvkrre*ntfkiyll 
kyialgltghffpl* tfsktvtlwpgav 
ahrcnls plegqgrwi i /r/gqeiktil 
ahm* ipvflhvni flsvcs ys ihttldy 

LFYYSCYILISL 


12669 


26570 


A 


12795 


284 


415 


NFIKIIKKKVMGLFYGKTFLLNLKKKKR 
KKK*WPGTVPHA*NPST\LGGLGGWITR 
GQEFETSTLaTN*NPTKSISRGF 


12670 


26571 


A 


12796 


1387 


32 


apssfairsffsgpmnaffssmvwkrpw 
pnlewsmnlrsifsrarrfvctskdlr 
rvstrflvhiitaflgykagmthivrevd 
rpgskvnkkewkavtivetppmvwgi 
vgyvetprglrtfktvfaehisdeckrr 
fyknwhkskkkaftkyckkwqdedgkkq 
lekdfssmkkycqvirviahtqmrllpl 
rqkkbhlmgdqvergapvpekadXwapr 
ealsskviivtqvfwagik*sbrfiggdpr 

AKGYKGGQPVCWAHPRKLPPQRPHPRAL 
RKGGPVLGAW\HP\ARVA\ FSVARRWGR 
KGLPFHRTEINKKIYKIGQGYLIKDGKL 
IKNNflSTDYDLSDKS INPLGGFVHYGEV 
TNDFVMLKGCWGTKKRVLTLRKSLLVQ 
TKRRMiEKIDLKF IDTTSKFGHGRFQTM 
EEKKAFMGPLKKDRIAKEEGA 


12671 


26572 


A 


12797 


86 


364 


EQDNRIFFSFLSLFFFFGETEFGPFAQG 
GGQGAILGPLKPPPRGL\ S SFPT* gsqe 
VGTTGAPHDIFCFFWKKGETPRLYKKNK 
NWRGGGATP 


12672 


26573 




12798 


101 


440 


HCSRYATGIFEWYSGILVLILLRFGIYE 
PMNLNIGDPYDSPPSRLYKMRLWERMAK 
SLIEVSLKNSHFWLGMLAHACNPRTLRG 
IGGRrA*TWGF\KTSLGNIARPHLYLKK 
I 


12673 


26574 


A 


12799 


364 


195 


NP PGDLGEETLLWGGE IMGTTPP * FPP P 
KEGFLPQDPPGGFBTSPPI /QGRSFSFPP 
PGKFGPPQGFFKRPPPFFFFLNNNQ 


12674 


26575 


A 


12S00 


86 


4S9 


PTAMAEEGIAAGGVMDINAALQELLKTA 
LIHDGLAHGIRKATKALDKRQAHLCVLA 
SNGDETVYVKLVEALCAEHQINLIKVDD 
/HKKLGEWVGLCQIDREGKPRKWGCSC 
IVVKDYGKESQAKHVIRENF*CKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, F=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 


12675 


26576 


A 


12801 


64 


211 


GNHKKSNYSLAPWWYTTWPAT * EAEAG 
GSLE\L*CAMITPVNSHSSLA 


12676 


26577 


A 


12802 


300 


3 


NNVTTGSNIQFHSFPLIRKDICVSYIKN 
DAWN Y I KNSNPNGE TQVYQ* YS FSN*NL 
KIWLGKVA\NPSTLGGHGGRIA*A*KFK 
TSMGNIVTPCLIKTKQK 


12677 


26578 


A 


12803 


208 


394 


SQHFERPRREDCLS \QEGQYSEIPSVQK 
NVKISWAW*CMPAIPATREAEAGGPIiEP 
RRSRLE 


12678 


26579 


A 


12804 


183 


348 


AFILGKGKYEPYNPCLGWA*WCMPVVLA 
SWBAEAGG\SLEPRSSEL*CAMLIGCLH 


12679 


26580 


A 


12805 


397 


238 


ETEFPFCCPGWKHGDLGSQQPLPPGFKR 
FSCFSLPSK\WGQ*SLaFVGKGCK 


12680 


26581 


A 


12806 


1 


293 


PTRPRTSYEKQGYLLLPFVFSIVI.EVLA 
RAISQETEIKSVQMGKEEVKLSL\FVCV 
YIYA* ENPVESTKTVDANLFDTRDWFHA 
IFPQTVGEVGMVIG 


12681 


26582 


A 


12807 


297 


281 


AKNPRRQPRE I PGGVFPTPGFP PYFKAK 
MPEGPFPGGFFFKGGVLWGTPPPFFFFF 
FFSETESRFVTR\LQCSGAISAHCKLRL 
PGSCHSPETC**R*G 


12682 


26583 


A 


12808 


255 


59 


AGCSfiSQtFGRPRGVDHLS PGV*DQSVQ 
HGEILSTKNTKISWVWWSVPWG\EVGR 
SLEPGR*RLE 


12683 


26584 


A 


12809 


247 


310 


PliHSILGTPTTPAPTPGKPLLIRTSILY 
♦KPILSQAWWCAPLVPATQ/EAEARGSL 
EPRDSRRS*AMIKPVNSHCTPSWALPRP 
LPQPLESHC 


12684 


26585 


A 


12810 


101 


477 


IGKEE IKLLLFPDNLMEYTVWF * MYKPL 
ELIS * FSKVTE YQVNTEK/ SHC I YT* QL 
QIENEIAKTI*FMIASKSIKYLKISLTK 
CN\KWRDILCICIGRLSIIKVLVLPKLM 
CGGEKIFNPNPTGFW 


12685 


26586 


A 


12811 


1022 


696 


CFFVLFFKEMGSHYVAQAGVRWCDHGS L 
YPPTP\GSSDPPTSAS*VAGTIGAHHHT 
WLIFKFFVETRSCFLIQAGLKLLASSYP 
SQPRTSPKCLGFTDVKSLHLAWLCP 


12686 


26587 




12812 


166 


143 


RKEERFQINNLGFHVEKLEKEEQI \NPK 
TNRKKKI IKTRV* INEIEYRKTIGKNIK 
KNKSWFF*KINKIDKPLftMLDSGRRDSN 
Y*NITKS IEEGRRRKEEAGKRKEEGRKR 

RRRWRQRWR 


12687 


26588 




12813 


163 


423 


KAFWEFINHKKTGPF*RGPIGKVSPPGG 
NLASS I * KG/DPFPSPLKGPQGSSGGPP 
PKKPPTGFNIiAFGGLRGRETKRGPFFLA 

GLLP 


12688 


26589 


A. 


12814 


2521 


2842 


INTMYFPP*EMLVGWAWWF/RASNPQHF 
GRLRQADRLRSGVRDQSGQHGETPRLLK 
NTK I S WAWWRAPV I PATWEAEAGE S LE P 
GREGCSEPRSCHRTPVWVTRMRLY 


12689 


26590 


A 


12815 


173 


395 


LDQPGQQRETLS LLKKKQTNKQT KKKNG 
FKECFL*LDTWVHTY\NPTPLGGRGGGFF 
*GQDFETTLANMGKRGFy 


12690 


26591 


A 


12816 


43 


346 


CVYVS /GVCVCVCVCVCLASVCMYVCLH 
L*VLFGIFYV*DHNICKLRSFCFLFKNT 
FSHVYF*KI IWPGVEL*VTVFLFLSSLL 
HTSCVYEKSYAILIFSAL 


12691 


26592 


A 


12817 


647 


931 


SQHFGRPRHADHLRSGVQDQPGHHGETP 



1309 



w o 01/64835 



PCT/llSOl/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspai-tic Acid, 
E=GIutamic Acid, F=PlienyIalanine, 
G=GIycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=lVIetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














PLLKI \QKLAGRGGA*L* SQLLGRLKQE 
NRLNPGIRSCSEPGSHHCTPAWATE*DS 
VLKKKIGADKY 


12692 


26593 


A 


12819 


392 


2 


PLFFFFFFFLETESHSVTQAGVHNPPGP 
SKPPTSAP*AAGTTGMHYSaHAQ/LIFF 
L*TELPSCRPGCSAMAYHRHDPFTDQHG 
SAS FLIWAS S PLLRQTGDSS LLGGHHVD 
AKFSVDTQLALCTKAQNS 


12693 


26594 


A 


12820 


1183 


870 


drvslltprlgvavarnlaqwkplppgf 
krfsclslpsswdykhtpptpg*fh/rg 
*ffvflvetgfrhasq\aglnslpqsdp 
ptla/fpkcw\dye\hdhlawprkm 


12694 


26595 


A 


12821 


172 


298 


lYLSIYLPI/YLCIHLSIHPSIYPYLYL 
SINPSYWVSFSREL*LT*L*YLSIYLPI 
lYVSIYLS IHPS IHIS lYPLIHPIGFLF 
LENFD 


12695 


26596 


A 


12822 


51 


254 


YDFLKKHLLSSYRAKDLESLFLmiiHIF 
SGN*SVSYICVCVCVCVCICVCVYICIL 
IKSWCS*S/CGVL*AIC3iGVGWC*LIFG 
VL 


12696 


26597 


A 


12823 


199 


668 


QPLPAPS PSAQKGRAHPTPS PARPQAE / 
SLPSQSLGTRTRPPLGRPP* SPPWGQVD 
LWGWLGPSPTLPAPESLSPASLSTEWL 
CGTRYLVWGRSGAGQA 


12697 


26598 


A 


12824 


33 


442 


GRGKLLKKSQKQRKNALLKYDSKTRVGT 
LFHWTSVPVTRPLLL* INIRLIKLDVIK 
CSIHKIFHSEFLFFKQHSGL/CI*LSK* 
LCS*KIHQSGLGTVAHVasrPSALGGQDR 
RIT*GQEFETN*VNMVGPCLYPPLPP 


12698 


26599 


A 


12826 


363 


75 


HWEPRPAGPTLPSAPGSPSGRPCPPHTT 
SG/PPPPNTHTPMSRSPWSWEGSDQRPQ 
PPHTDTGSPPN*GSRLRHERPPWGRERQ 
RPPPTTLVRPLHSH 


12699 


26600 


A 


12827 


76 


486 


CTVNLFIYFEMESCSWQTGVQWCDLSS 
LQPLIWFSCVPTQISS*IPMCCGEDLVE 
GP*MMGAINSFLCCSYDSK*VS/ARSDL 
FFFFFFFF 


12700 


26601 


A 


12828 


345 


672 


KMKCTADLSLLEKDT*IKLEENR*QERI 
MLRABVNE I ENKI ENINKTGGS SSEKYY- 
KIDWPLTRS IRRKVIT\ KITS I RITEKGS V 
TTNPTE I KRTVKE Y I MKNFM P INL'i'T 


12701 


26602 


A 


12829 


208 




PSNSTPIFKRNENICPHKDPCTLMLMAA 
LFVMSKK*KQLKCSSNDE\KLWYIHMME 
*YIAIKRNKLLI 


12702 


26603 


A 


12830 


649 


172 


FPLKYSLSTTSVMAPSFTSCGGSQQVLL 
APLSRLLAWPNV/HPPPQPRHPAPHSFL 
SPPRSPPNYSP*SSQGPLLQPQSKHVEP 
LPQANARLPSAKNPARGPPGAAQAPRDR 
PPA/VCPPLPPHSPLHHERAPHLPLPLP 
ATEPAALHAVHPGRRQERVT 


12703 


26604 


A 


12831 


1068 


667 


KLSWFFCSRLRLHFGSIFKVPSL/PVPK 
LLLPIPSLCPSLPLQLQPAQPVPFNTAT 
WLCFC * / P* RV* PAPGYQRVGHFWFFPL 
QLNFIPIILQ*HLGF*KSASLSSSSSPK 
KVSLCHPGCSAWRSAHCMLKLLIS*SA 
CLGLPKC*DYRLQSPRPP/PTIFKVTKM 
CLLK^LTGSVSF 


12704 


26605 


A 


12832 


268 


1 


KCFFLLALNWPEFILDYFimW+NSLRT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Hlstidine, I=Tsoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=pos$ibie 
nucleotide insertion 






— 








VTEKFLFMTLFLTY*11MNRPGMVAHACN 
PS /TLGGSGGHIT*AQES *TSLGNIVI P 
CLYQKL 


12705 


26606 




12833 


654 


321 


LVGIRFLKVKYFNIKFPHLLAI ISPSLE 
VFNLQHFYIYFLLFIYF*ETRSCSVA\R 
LECSGVI IVHCSLKLLGSSNPPALASQV 
AGTITAHHCRQQHFCILEKSNLCTYFPL 


12706 


26607 


A 


12834 


867 


201 


RQMSPTIKTKSRDVASRSSPSAWaWGKL 
FTFWDFSiVISLVSLQGPGGPFHSPQP*TQ 
GR*VENNQEPIATPFPGPBRSLPGQPRL 
EPAHSGKRRGLPLE IRPPVHVSDIHGDP 
SGAQWLVRGVEAGPWAVGGQaP*SEGPS 
PSPPLDVSPDPGSSLPRSSSP*MDLRA* 
LSLTF*TFQKGSGN/DPASSPA\GKPG* 
EWRKIKGPPQGLLLGPEMWGSQARVLS 


12707 


26608 


A 


12835 


328 


1365 


YAFCLMMLDKKQI \ *AVFLFEFKMVHKA 
AKTTQNISNTFGPGTANKHTVQWWLKKF 
CKEESLEDEERDMGHGKVDHDPI*EP\Y 
*TT*KITEELSVDYSTVIRWQHSKQIG 
KVKKLDKWVPHELSGNQNYRFEV*SSFM 
LRKMHPFLNRIVTCNEK* ILYN/RPAQW 
LGPRGQLQSQFPKPHLAPKKKRSLVHW/ 
SPVGLIiAPriiDPTYSFLNP/GGETITSE 
KLCSGKLGKMHGKLQYLLPALVNRKGPl 
\LLHDNT/RDCVFAQPVLQKLKELG\YK 
VLPHPPYSPDLSPSDYHFC*HLD1NFLQG 
KH/ SQHGAENAFQEFVKS * STDFYATGI 
NK/LFSHWQKCVDCNGSCFD 


12708 


26609 


A 


12836 


7 


328 


RRERERERERERERERERERERERERER 
ERERERERERERERERERERERERERER 
ERERERERERERERERERERARV*EKST 
RYKSPPPRIPPLRR/RRV*RKHPHaRAL 
SFFLYKKCTGGERAPSPVCVGGKFTW 


12709 


26610 


A 


12837 






pfkht yatws iii snvqvcfmraqd i yl 
liylfeagscsvaXrlecsdmimlhcsl 

PLPGSKTtJPPISAT*! 


12710 


26611 


A 


12838 


189 


359 


LGLDGVMRVQPCRALGPS *EEKSSRVQW 
LTPVIPA/LLEAEAGGSPEVRSLRPAWP 
TW 


12711 


26612 


A 


12839 


368 


77 


KNPNFLKFGSKPMGPPIYSPP\LEG*AG 
GF INPGF* TPPGYMGKPP FFLKYSNLPG 
LftAPGGCSPFPGGIiGRKISFTPEMEVS I 

NPGSPLSLPPGEQN 


12712 


26613 


A 


12840 


16 


173 


KTDVHSKTCTKLFTAALFLIVKKWKHLK 
PP*VD/EINKMWYNHIVEYYLAIKS 


12713 


26614 


A 


12841 


425 


45 


NSFVFFFFLFENRVLTPVAPRLEXCTGV 
IL/APHCTLPALPEFKRFLVPSALLSSW 
DYRPVPVMPWLNFCIFVETGFH\HVAQS 
VLKLVSLKL/PSPTWDVPKC*DFRYSVR 

CCGLFF 


12714 


26615 


A 


12842 


237 


375 


LLLWVYILKKLW\LGWAHACNPSTLG 
GQGGWIL*GREFETSQVNM 


12715 


26616 


A 


12843 


21 


325 


TSFFFLESYSVAQAG\QWCDPGSLQPQP 
PG/SQ/DNPPTSG*VAVTTG/MHHHARL 
IFVFFCRDGILLYCPGRSKIIFSSCIRE 
LFKKISDFFLLTMLIFCNSNKG 


12716 


26617 


A 


12844 


369 


58 


PISPLQFMLPFLKNQSPYGPFFWKKKNF 
PPPFLGGAVLKTPGF*NFLFKKIQRGFP 
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sequence 
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sequence 
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nucleotide 
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correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 

location 

correspon 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, !VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno¥vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PFPKKKKPRERPKPCPPKFG/SPPFFFY 
PGFPPPKKGPPPPFFFFFFF 


12717 


26618 


A 


12845 


211 


409 


NALKPKIHFFFSGGGIjKGIVWGNTLLDI 
GLHKTFFF\*GDFYYAL*I*AENALFGG 
GGDCAPHLQVK 


12718 


26619 


A 


12846 


200 


33 


QKTNNTKCWQGWGATGTLIHF*GEFTRV 
QSHWKIWKFL*/SLNILP/PSSSSSSS 
SSSSSSSVKNLCLHKACTWMFIAALLLL 
AKTWKQP*CE 


12719 


26620 


A 


12847 


416 


125 


KIPTRPGK(3<3PPLYPHPFRGLNKQIGLT 
PEFGTPLGNKGKS FPFFQKNKN\NWPRG 
GGPIGPPSPGG*VGGLTLKGEVSTKLDS 
PPAPPPGGQKWPLP 


12720 


26621 


A 


12848 


85 


397 


KIHIFFIFLVSLLKGLSFLLTFSKN*VF 
/SFIDFLILCVCVCIEFYCFILILFHYF 
CLFGFCLFLMFVS * CQSLDFYFNSF/YI 
F*YKIQCYKF/CILIF 


12721 


26622 


A 


12849 


1 


246 


RPRRHLVASLVLTQLNGSSPLPIKIISW 
AWWLMPVIPTFWEAEVGGLLEPRSSRQ* 
*AMI\SPCPPAWAT 


12722 


26623 




12850 


161 




NKTTSSQIW*PMPIVSTT*EAEVGGSLE 
PRS/L/KLKCAMISPVNSHCTPTWTTK 


12723 


26624 


A 


12851 


65 


244 


STYYLHSLPFLDISYKWNHTTC\SRYFM 
GALFLIAIK*KQPKEPSM)EQI*T/IMW 
YIYTLEYDLAIKRNEILIHAATWMNLKN 
IMLSKRSQTQKT\HW*FHL 


12724 


26625 


A 


12852 


235 


34 


TTIAVSVLILKELMEP*TLQPDFLG/WS 
LITESVDHVQWHAPVIPALWEGEVEGLL 
EPRSLRPAWATW 


12725 


26626 


A 


12853 


168 


453 


CQLRGASGTQGPGLSESQCQQPCP/ PEC 
RPSKPRPCGCRI*SPARTSPQPSPAASM 
ALPTNERTPPPALKW*PK/CPKQSPQSA 
KSKSPVKSTERTAK 


12726 


26627 




12854 


244 


3 


ELQVYMHI*IYI/C*SIYTKfiNIYIKYM 
HMCVYIYTHTHTHVCIHLHIYLLRHGFG 
CWLMLGKVRGCEGMAGLMKAQQGWGW 


12727 


26628 


A 


12855 


376 


428 


KWEPGGHTSFFFFKGLVLNFGRGVLQKT 
RP * GGGQGGKFNPNFSGPREPPP P/ PPP 
GGGKKG P PP P P GVFVFFLE KGGS P I LPR 
GVLNPGAPKGVYTQRGGIKSGNQGATPL 
SFFLKVWF 


12728 


26629 


A 


12856 


129 


395 


APPNTHPFLSEAPQSLSLEARPPSAPSN 
SSSR/PPVWRASQTHPPQDWSSHCSPPW 
GAPPQI*PFSDSLAHPPQDWSSHCSPPW 
GAPPQI * PFSDSLAHPPQDWSSHCSPPW 
GAPPQI 


12729 


26630 


A 


12857 


363 


97 


GLATLRRLVSNSQAQVIRPPQPPEVLSL 
PKC*DYRPGRPPASLIHVNSRQT*KVaP 
/CSKPLAAAAHQGSPGATETTRRCPPSR 

CGPIF 


12730 


26S31 


A 


12858 


322 


361 


KSEKQS * VMLAVCTLDMMKMT \ F I SWF 
LPKTHN/LMSNYKKNTRQIPMEGHSTIY 
LTRIPQNCQGHQKQ*KSEKQSQQESKKK 


12731 


26632 


A 


12859 


88 


417 


HFTFFFFFFFFKRGGGFFSPRLKRLGKN 
FFFLDPPPPG*RDFSPSPFKEGGF*KP/ 
SPPPLVFFFFF*KKKGFPLGAGGF*TSG 
PGETPPLY3PEWFFYGGTPGPTPFFF 


12732 


26533 


A 


12860 


217 


2 


LILINICLANMHCFLFVFETSLT*AGVQW 
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nucleotide 
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SEQ ID 
NO: of 
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NO: in 
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beginning 
nucleotide 
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amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V'^possible 
nucleotide insertion 














HDL/GLLLRLAPPRFKQFFCLILPSSWD 
YRCALLHPATREDEA 


12733 


26634 


A 


12S61 


375 


145 


I,SFFFFF*KPGSCSVAQAGVQWHAHSSL 
QLQTP\GFSDSLNSAFSVAilPVRVHCHT 
RPLSPFNICWRARPKQKVFL 


12734 


26635 


A 


12862 


525 


852 


NLLS I YFFETESRS VAQAGVQWCHLGSL 
QPLPPRF* IKKPSNTALFM*NLLSIYFF 
ETESRSVA\RLECSGAILAHCNLCLPGS 
SDSPASTSRVAGVHHHSHLSSWDYSCAP 
PCLMirFFVFF*VETGPHYVSQDGL/DTS 
* PQVI CPSRPPKVLGLQA 


12735 


26636 


A 


12S63 


3093 


204 


EPDKTGPVLWKVGGGARVPGMAETLSGL 
GDSGAAGRAALSSASSETGTRRLSDLRV 
IDLRAELRKRNVDSSGNKSVLMERLKKA 
I EDEGGNPDE IE ITSEGNKKTSKRSSKG 
RKPEEEGVEDNGLEENSGDGQEDVETSL 
ENLQDIDIMDISVLDEAEID\NGSVaDC 
VEDD\DADNLQESLSDSRELV\EGEMKE 
\LPEQLQ\EHAIEDKETIMMIjDTSSSDF 
TILQEIEEPSLEP\ENEKILDIL\GET* 
RSEPVN*ESSELEQPFAQDTSSVGPDRK 
LAEEEDLFDSftHP\EEGDL\DIASEST\ 
AHAQSSKADSLLAWKREPAEQPGDGER 
TDCEPVGLEPAVEQS/SVAASELAEASS 
EELAEAPTEAPSPEARDSKEDGRKFDFD 
A\CNEVPPAPKESSTSEGAD\QKMSSPE 
DDSDTKRLSKEEKGRSSCG\RKFLGLVG 
LSSTTRATDLK\NLFRQIWGRVGGAPRL 
WTNARSPGSFAVYGFVTMSTAEEATKCI 
NHLHKTELHGKMISVEKAKNEPWKKTS 
\ PKR6SDGKKEKSSNSDRSTNLKR\DDK 
CDRTDDaKRC33DESVEKSK\DKNDQK\ P 
GPSERSRATKSRKSRGPKRTW\MDKSK 
\GVPVISVKTSGSKERA\SKSQ\DE\KS 
ASREKRS WSFDKVKEPRKSRDS \ESHS 
RVSERSEREQR\MQA\QWEREERERLE I 

arerlafXqrqrlereXrmererlerer 
mhveherrreqeri \hre\reelrrqqe 
lryeqerrpavrepydldrrddaywpea 
\ icraalderyhsdfnrqerfhdfdhrdr 
grypdhsv\drregsrsmmg/siregqh 
ylerh\sdpeph\gqdslr*lgwgyel* 

Q*RLS\EGRGLPSFPPGA6RDWGDTWLE 
DEDDPVM6K6TAERGPMMDR\DHK\RWQ 
\GGERSMSGHSGPGHMMNRGGMSGRGSF 
APGGASRGHPI PHGGMQG6PGGQSRGSR 
PSDARFTRRY 


12736 


26637 


A 


12864 


2 


456 


IHLGSGSEGDGSSGGLGRGNSNTSRFSS 
S SWARGDNVPRPPPPAVCSWI segdvqn 

pglgeagagsstpgdggelry/wpgllg 

A*GR6IG6DDDDLRTIiGIja6VQQCGK6/ 

rrgprgpgrgqeprrprdlglrgprraq 
gaa/sataappp 


12737 


26638 


A 


12865 


388 


1 


likmtkvkktaitkc*rgwgrtgafihc 
sweckmiqplwqtvwqllkklniyl\wk 
qpkcplm/vkwikklyismdsysavkkk 
etlmytta*mtlevallsersqtrkger 
ghilwdsiyinfqkmqtnl 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: In 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 

location 

last amino 
residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12738 


26639 


A 


12866 




3 


KKQKQTKNPPKIS*ALWCVPWPATWGA 
KVEGSPE\PRRSRLQ*AMITPLHSSLCN 
RVRTRGRTRG 


12739 


26640 


A 


12867 


415 


973 


NPVNCAQFSLLPTTEYMGHRVEGATGHG 
TC P S VP PNTHLHTGWWCLQHHRS CRAWG 
RGGSHTHRCPQRVPDGEHP* lYIAVHGE 
PP/ESPPQPSPLSCCPPQGNYIALREPP 
QGLPLPGTLPSHPPPFWHICKTHSSESR 
HPPFPGFCGLELEKGVGDFIIHPPLTLP 
KPLPNPLKPEPTPTPQTHI 


12740 


26641 


A 


12869 


715 


893 


PCVAPHSIREERFELTSESTNQKVLWWG 
ID*T/VILILT6NWQKRHLKSFFLGQKP 
GGMAP 


12741 


26642 


A 


12870 


342 


58 


KKKKPKENEQSIWDMWDNFRTSDI *TTG 
VLEEQFRE*GRKIIFKEIVEENSSNLTK 
/QINLLTQEV*TTT/HKNMNKSTPIHII • 
IKLMKTTTKEKI 


12742 


26643 


A 


12871 


110 


401 


SFYSVI!1ILKASRIVICLFT*LYAYIHTL 
YIYTLCIYYAYYVNIYICTHI\NIHI II 
HVYTYYIVYASIYYSMYSAWYNIKVSKD 
RQTYENIALITYY 


12743 


26644 


A 


12872 


439 


256 


SYFMRFGGDKISKLYQFILSRRWKQSTC 
PSMDEWINKMWH/ITIMEYYSAK*RNEY 
SYMRHE 


12744 


26645 


A 


12873 


356 


43 


GTPERHSHSHSQGSPIIGAFM6IRASFV 
YSLLSCVLLMLSPQRVLFYFSAFKQQQS 
\ WSGKVAQAYKSNTYGKflGS I T *GQE FK 
TILGNIARPCFYKKKFLBNS 


12745 


26646 


A 


12874 


341 


205 


RE I KKMVliNKKFFSr.QFLGKKKKFWE FF 
LKKKKKVFKAPPVFQF*VI'IF/CSF*KK 
KVFGFFFSPKIFFFFFFFFFFETESTSV 
TQEFNGTILAHCNLGLQGSSSSPASAF* 
VAVSQDCai ELLGDRGRLCLEKKKKKKK 
KKNFGGKKKSKNLFFLKRKKMGT 


12746 


26647 




12875 


474 


206 


PIRANGPVFSPPPPPPAPGTCPHPLVIM 
LSRHLRQSHPHTTVQPWTSGLLPS PPA* 
LSLETQHFKTAHFIFP*H*SSGLGPGWG 
SPVGEGPG/HPKLCLQPA2«AEAWPPRE 
GTKARRHEGLPPAPCAWGPAPRDAGSRL 
EFPTLPTIPVSQSPGR*HGAGGRPSCLR 
AFVPSRGGQftSAAAAGCRQSFGDLALLP 
LGSPILGQGQNFSARGR 


12747 


26648 


A 


12876 


257 


467 


YRVFTLLPRLEYIMAHCNLDLGS SD P/ P 
QVAGTTLK*LSRASRVAGTTGVHHHSWL 
IFK/YFL*RRWRRFSL 


12748 


26649 


A 


12877 


867 


1409 


GPTVLVGGQDVmSVLQQPRAHSFTPRGG 
AGGLGGGYVLHKPIPFSQELLGAFPCQK 
QYRVPPPLEAQLLRQAHSPAFIY*AAVL 
CQAQGPSPSPCSPMVSPRPSGRAPHLGA 
APQSGQKALASTS S PDAEPV/ S VG * WC P 
SNPALQGGQNGAE PRAVS ITSGALSL YD 
CFLDGWWEGRSLGP 


12749 


26650 


A 


1287S 


1416 


740 


I RNPRWVQLDVFAAAPMGRSS PL FLLLF 
SFLKTAniiRGSMIPSTVPS/FHKGPQACL 
YSLSLRS*SQAAPSPAPSSLPTPSPHPG 
LTQPGRGRRKADIS/CSHSWAFYVP/SL 
PHLPWACPLSLPQPVLFSILFCFVLF*D 
GVLLCHPG\WSAMAESWAHCNLHLPGFK 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last^amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ]VI=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-T]ireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unltnown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














QFSCLSLPSSW\DYRCMPP\GLG*FLEF " 
*VEDGGFnLVRLIWNS*PQVIHLPQPP 

KVLGLQA 


12750 


26651 


A 


12879 


961 


31 


PS PTODSCVAWAGERRGRLGGSGVWRSH 
ARGPGLMRGLPASEPAPCVLGSRPCFRV 
AFETPWTSPASCMFERCPLWRPPALVAA 
GLDP FLLSSTLTLAVCPRDL IRSWKKGC 
DVCDSRNCWSTHSRPACRDAILAPILAG 
RGEC IGLWG/AAAACTGRV/ TPVSGLTF 
stvfdfssarg/v* GGLPRSTGGPHVRG 
PGQPVPLPGLPAGVALAPLEGRPQE PAP 
ALSHSLTGSLFLGAWLCGCTRHGRAPVL 
LEVS IVACGQLCEAAPQIFIQPGLQQLP 
TPSATSPAYLWVDVLLTLPSVIBCQKRVI 
SFLTKE 


12751 


26652 


A 


12880 


232 


28 


PALWVPWARFDGMBETGQNHWSKEKAW 
PGAVAHTCNPNTLGG*GGH/MT*GQEFE 
TSLANMftKPRIPP 


12752 


26653 


A 


12881 


331 


1 


kngpffknppeffpppkwgvppnptpyf 
pwpepif*ltpgplkkqkipfq\kgktt, 

GWGGKMGNPLPFWKWGGPPRESKSSPRF 
YKTPPPIFKKKKKGGSSRSRTSPRV 


12753 


26654 


A 


12882 


370 


287 


VSILAGCCGVAMIDNTVLVHKEVKII,RK 
*IIKK*CI**FRGHSK/HNFSG*AWWLM 
PVIQHFGRPR*GST,KaRST,RPAWATWQD 
PHLYEK 


12754 


26655 


A 


12883 


656 


1168 


EPSRGVWPHEDARINGSKKKKSKDKKRK 
REDEETQLD/IVGIW*TVTNPGEISGTI 
AIEVDEGTYIHALNNGLFTLGAPHK/ER 
lALKPGYGKYLSINSDELW/GRSUAIG 
PREQWEPVFQNQEVRNGGPAEMGEEKRir 
GTKWREDTDHTSFPLFPSTGGQPKAHSN 
WRKVCH 


12755 


26656 


A 


12884 


205 


452 


ASGSAHRFRTFSFETGRVLDKPSQTDHT 
NRKRGGPNVK*SFL/WIiGftAAHACNPNT 
IiGGHGRRTA*AQEPRTSLGNTVRPCLY 


12756 


26657 


A 


12885 


112 


290 


KYNIIQYNIVFSCHDPLVSFNLFFFFFF 
GKGGFPLSPRLKARGRVWGYLNPPLPGK 
RNFWAPPPQGFEIKGLPPP*SPPFGGVG 
QAKNLGP6FQP PPAPM6KTP FF* KNQNN 
PNWGGQPLNFKTLGGGGPKI SF/PPGGG 
GLNNPKPSPQPSTWQKGGNPLSQKKKKK 
KD 


12757 


26658 


A 


12886 


406 


190 


TRSLARGGFGRPLQKPYWNEQSFIMAKK 
\YEQPKCPWTDKWVNKMWSILTVG*YAA 
MKRRDVLTCaTITCR 


12758 


26659 


A 


12887 


440 


160 


RVCVCVCVCVCVGVFVCPALWLSSLIYN 
*VYLFLFVHNIjGRVSFSYFIJ!ILI*FFEH 

vnhv/hnyvqmiqntklylksyletfh/ 
cpc*fppillh 


12759 


26660 


A 


12888 


201 


411 


hiwdvilirhalgnfltsiscyivkiwl 
davahtcnpntlggqgg/wit * sqe fki 
ilgnigdplstkknk 


12760 


26661 


A 


12889 


138 


464 


SCTMNPSEMQRKAPPWRWRHRS*APSTH 
KMNRMVMSEQMKLPSTKKAEPPTWAQLK 
KLTELA/TKKKSLENTRGTQTSKNMLFA 
ALMIVSTGCAGVPSSSKETATIEDKP 


12761 


26662 


A 


12890 


146 


366 


fsvsmkyfv*ih*fyhl/klllmvyl*y 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
etli 
od 


SEQID 
NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 


Predict- 

otide 
location 
correspon 
ding to 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














GFLAINTSQNISCSQKIMKLHL*GRGW 
AHACNPGTLGG*GEWMA*AQEFNTGLSlSr 

TAKPYLDF 


12762 


26663 


A 


12891 


368 


50 


KGRQSGVSSFPVGTVTNYTNLVAHNNRY 
LFPHSPGCQTSEVKVLA\GCSLLKASRG 
EGFFPLPGSGGSKCPLAYGCITPISASS 
CDLLLWVCVFSSYLAY*DTCHWI 


12763 


26664 


A 


12892 


227 


456 


klttllklfekMedrtlsksfyevsit 
*ipssssssssp/islintdakkiwkip 
rnriqqcikkiifhkfst6iys 


12764 


26665 


A 


12893 


461 


89 


iyqnlsrkpdnikcw\*rcteiqtlihc 
w*qckimqllwkiiwqfl*/elntevpy 
dsiplldmy/pkktec/ytstqtliycm 
npieyysaikrdkvlihattwmnleni/ 
iskrrqsqkstycmipli 


12765 


26666 


A 


12S94 


222 


185 


RIVMQYLHLNLGACVCVCVFVCD/CVCL 
CV*MCVHIHI YISTHTYIHI YVCM* I CR 
LS**GEFYS 


12766 


26667 


A 


12895 


186 


166 


ifffffemgshsvtqarvqwhnhsskqh 
ppp\gasdpptsay*vakiniffff 


12767 


26668 


A 


12896 


55 


485 


TCWDCKDEPPRPAIiESVFIiTSFLEDiiFS 
IEIADLVPHPTPCSHPLQPP**VIIYFC 
S\KPLVT6SSSPPRKPFPSLHEPKSASL 
♦PPSSTIQPVRSSPKG*PSPPIiA*VQFP 
KGMTIPTLCLSPVLPVPSTIPLPNVSKP 
LPFY 


12768 


26669 


A 


12897 


923 


312 


VDVIRKVQIEITLRCHLIHVRIAIIKKT 
RflNKCW*GCGEEGAl.VHC*WEC/KYILV 
QPLWKTVWRFLSKLKIELPYDPAIPLLG 
ISPKGNEII/CCTPKFIAAQFTIVKIQT 
QPRCSSMGKWIKKL/W/HIERSIDR*LE 
YYLAFKKKAVLSFAKIWIDLEDIMUira!! 
SQTQKEKYCVISLICGIKKKKVKYIEIE 
NKTVITKVRIRWRK 


12769 


26670 


A 


12898 


320 


3 


KSWRWGRIVRPHSVP6ETWLCPVPPGPP 
NAPDGSFPLAICPVPPSCRSL/CPP/DD 
CIiASAGV*GGFPLPRTEAS PHHSWACPQ 
EQPWPPQCKLKVGAPNSTALSEGS 


12770 


26671 


A 


12899 


138 


447 


I FVFHYKTQNIYNEDTLHVI I INLWCYP 
GSYIKSTGRLRPEVEHGLGPTTMCC*FS 
S IKNIFYF* KLKS YRKL/WLGTVAHAYN 
PSTLGG*GRWIT*GQKFNRD 


12771 


26672 


A 


12900 


279 


1 


inqsih1jnikstyisqckliqp*wkiiw 
rfir/nlkiellyvpatppvgv*lkelk 
sac*rdnctt/m/ftaalfcigygsnlg 
nstaligsrvlfp 


12772 


26673 


A 


12901 


235 


183 


I INYGGGPPEKKKKGGKKI PLI FYKKI K 
KFLWPKKKKLFGGAKNPIV*AHHIFTV* 
IDVDTRADFTSATI \ I lAI PTGVKVFS * 
IATLHGSNMK*SAAVL*ALGFI 


12773 


26674 


A 


12902 


180 


19 


LFIYLFIYLFICEIRCHFVTQAGLQW\L 
IIASCSFALLGSSDPPTSAS*VKGI 


12774 


26675 


A 


12903 


406 


949 


APVPGHGFCVSF/ CGENGVSLYCPGHS * 
TPGLKLCSCL\CLPESWDYR/R*APVPG 
\LGFVFHLTL*KPPFFKDYMKSLFQFLK 
YLIQG*CTLV*GVRYRSSFIFFFFFLRR 
SLA/SVTQAGVQWRDLGSLQSPPPGFTP 
FSRLSLLSSWDYRRPPPCPAWFAFVFLV 
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SEQID 
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sequence 


SEQID 
NO: of 
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sequence 
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od 
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NO: In 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 

otide 
location 
corrcspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lys!ne, L=Leucine, M=Methionine, ■ 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EMGF\TVLTRMVWIS*PCDPPASASQSA 
GITGVSHRAHPKFVLYFKDNGEPLGGVL 
HRSHIAVHWM 


12775 


26676 


A 


12904 


265 


385 


GNGSPSLRWIGVLRGRSPTLELRH*PIF 
LTAASRRSLDRVSVPMWGTFLSEPLSIE 
GLVSRYLTNNLMERIPILYEN/PFNNET 
M/PESLCYRVLIFLSKGYPRVKGRLDTC 
YSPVRRSPAKKASFLPDAPRIjACVKPVA 
SVHPEPQSNRTRQ 


12776 


26677 


A 


12905 


363 


137 


APGMQA*WCM/HCIPGAQEAEAGEWTEP 
RSSRLWCTVIMPMS SHCTPAWATW 


12777 


2667S 




12906 


315 


565 


TPVDHGWLECSGAIMaHCSLKFLGSRDP 
PASASRVAGTTGVCHHSQLI FtFFVGTR 
SNFVAQAGIELLG\ *VILUQSPKVLGL 


12778 


26679 




12907 


376 


412 


DAWPTWEm* FGVYVMTOR* EPRQMLTY 
LTA\PLKYFKSGVMAHSCN\PSTLGGQG 
RWIT*GQEFET\liANMEKPCLY 


12779 


26680 


A 


12908 


213 


2 


NIHINGQKLFL/WLGWaHTYNISTLGG 
RGGKIS*TQEFETSIiRNTVEPHLHKKQK 
lALPLPLPLPLPTVR 


12780 


26681 


A 


12909 


307 


379 


KR*KQPQYPLTEEWINKMRHV\LQYYSI 
LKKKEIL 


12781 


26682 


A 


12910 


279 


123 


KMILSW2UIWLMS 1 1 PAAWEflEAGGSVEL 
RS * \LRLE *flMTVPLRHFTP AWARE 


12782 


26683 


A 


12911 




194 


AKQHI PLLARLECI /GLFTNTVTAHCSL 
ELLASSHPPASAS*VARMTECVHRAWIiE 
WIFYLCVIL 


12783 


26684 


A 


12912 


206 


388 


lELSKQGPVYYKWYVRITPPYFPFFFFF 
BTGSCSVr\RLECS /GTITAHCSLNLPD 
LSYSPTAAS*VAW 


12784 


26685 


A 


12913 


238 


426 


TPLLPSQQWLKKESLWLGRWHACNPST 

i,KG*GGRDS\liGQ*J''K'rsLGNlAKPCl-.i 
REKFLKI 


12785 


26686 


A 


12914 


115 


1436 


AKDRHFTEV*I?!NKQMKRCSTSLGMREM 
QSK I VRYHLSE */R* LFFFNGGNCWQG/ 
CKK/lQDLIHCWECKMVQPLiraiIW\RF 
L * NHT C I CHASG I ALLS I YPRE MKTC VH 
TKTCIRMFIAALFVMVETWRQPIYPSLG 
G 


12786 


26687 


A 


12915 


282 


21 


IKFPPCPSTWGTKKKLFPKKKKEVMKML 
CFHFNLLVQNI F * Y RLGNE ISDKAG\ PV 
AHAWNPSNLGS*GELIT*GQEFDTSLAT 


12787 


26688 


A 


12916 


254 


150 


TLPGRSSDFSLPEV/YLKKMJH/WLGW 

ahscnpstlggkggwit*gpphmlk:vs F 
lptdptsrvkqlynwvphstvlialfpt 
knpg 


12788 


26689 


A 


12917 


181 


24 


aqwcdgi pvskgnl/wlgmvahacnlst 
lggpgrw it*gqefets lrne fhrtd 


12789 


26690 


A 


12918 


75 


298 


LFILFTS*VD*MRPTHIKENliIL\LAQST 
NPNVHL I Q / KH PHRHS QNRVW PNVWAPH 

gXqsschikmtimarpsgsql 


12790 


26691 


A 


12919 


251 


9 


KFMFLARCGGSRL*SQLSGRPREADHLR 
PGVQNQPGQHSESQYLPKSTNFCRVWWC 
TPWPATREAEA\GESTEPGRQRLQ 


12791 


26692 


A 


12920 


198 


211 


RFCFSVGDEREGKRA/RERNRETERKKQ 
TEDTE TQRE ERDRDRKR\ERE TDTERER 
PREERHTEREKK* KAAFPFQTLGCHAQD 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQED 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L,=Leucine, lVl=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutaniine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FPKQLFPSRGWGVGPMTVLL 


12792 


26693 


A 


12921 


438 


127 


RWMERI FVRPH I KDSMWWVRS TLLGRG 
DALNDLFLyCWEKEDPRYKERPXRERER 
ERERERERDRCWEK*APGWHERPVQSLR 
ACHPPWHRFAIiAGPVPSAQ 


12793 


26694 


A 


12922 


158 


460 


EARVELLLSSRCKPRAVITAETQPAVYR 
LQLEIENFPGLSPRLECSGRITAHCSLD 
LLG*SDPPIJSASR\AGTTGMCHAQ1VNS 
CKADWLVLLSSRGLTHR 


12794 


26695 


A 


12923 


33 


484 


GIQAA.CIPGLRLSPPADAHCHRHIPSPP 
PSSTAPRDDVGPMQITPDNFPPQDPPLX 
TPPSQSLLPHRAP* PQGSSVDLPGLLFK 
LPQMPQTPGPEEVDTCVQIHKTSKftRED 
VQMRKTPMAREGRHRAAHRWIGALGKAP 
QSQRRLRHTVG 


12795 


26696 


A 


12924 


335 


332 


RIKVYVRGHAKE * LRPGP/ VAHTCNC3n' 
LGGQGMRITQAQEFETSLGDIGRPHLYK 
KKKKKGGPF 


12796 


26697 


A 


12925 


302 


407 


AQWLMAIIPAPWEAEAGGLLEPRS*\AW 
VTW 


12797 


26698 


A 


12926 


766 


244 


RVPRLPPPAL*LEFSPSGLRLaHSHSFQ 
LRSSPPPVSPPST/PFSPHRSPPLSLLS 
LSVPS /PPLHHSFFSQRAPHILPPPPLI 
FPSSPPPAPPPPPPSSSSPSSSSSPPPP 
/PPINWGTPRFFPPPPFKNPPPBINPGG 
PKKKKIFSPPRA*KFVFLKGPPPFFFFF 
FFFFLGINLL 


12798 


26699 


A 


12927 


127 


720 


WCGLLESTELQLQSLPGLYVSVIPGKIF 
YISGLGI CCTKIFPSALPSAQPAASLPL 
PARSALGIVFFLHFC* lE/OSnf KKLPIF 
HHI*LIKTFSYGLT*LFFFFFKTKSLIF 
SPRLNCRGPFLVN*NLCLRG*GNSPA*P 
FGGAGIEGAPHKAGLIFGGGGGAFLKKT 
G\LHHVAPGGASNSGTGNISHPPLNPPK 
GAGE 


12799 


26700 


A 


12928 


308 


49 


KEHW * AVGHFNI CVTGGP*GERKKGTER 
VFKEI ITPNFPNLIQTIKPKTQETQHTP 
SRRNTKI /TPRYI I IKFLKTSNKEKILT 

TAR 


12800 


26701 


A 


12929 


375 


2 


GCMLSLKTINILEVLANVLAQKKIKRRK 
LND\EQADTTFTI * FCVRQKTKKSTDKL 
VKLI /ERFYKVATYMINFLKKSNS FHTP 
ATINC*KQIPFIMaTKTIKYLRINPRKK 
CVRFVGKKLTFPER 


12801 


26702 


A 


12930 


202 


379 


SSENFSGKTVNLCCLGRLHRMISWLHTV 
AHACNPIT\GWIT*GQEFETSIiADMVKP 
CLY 


12802 


26703 


A 


12931 


209 


376 


TVGEKMYLKVKRI \WLGWAHASNPNTL 
GGQGGWIT*PQEFETSLGNMAKPHLYKK 


12803 


26704 


A 


12932 


280 


31 


I S C KVTKKVTLFE LKY I WNKFY I YERVS 
I VGTTEDAACE VTSKVI TGPGAVAHACN 
PST/RGRWIT*GRELKTSPA]!JIAKPRLY 


12804 


26705 


A 


12933 


1 


189 


QMYTESEP*CKLWTLGDYWV*M*FIS*/ 
IQCTTQAGNVNIRE/RLCMCSGGEYMRN 
LCAFLSILL 


12805 


26706 


A 


12934 


108 


409 


HHSHLLKLPWVWFSFCHF1VTVFLSL*R 
VFFVFY*IFLSSSHFLFSFCFCPIMFIM 
FPFKFILSCNIVFI *NI IFFLF/ VHV*G 
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SEQm 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 

otide 

location 

correspon 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














* STLVSLFVFFFFLLQF 


12806 


26707 


A 


12935 


225 


11 


ILLCMEPSFLFWFGFICFVKTGSHYIAQ 
VEVLWLFSGMIKVHCSL/KLLHSSNPPT 
SAS * VAGNTGSS YCTW 


12807 


26708 


A 


12936 


74 


12 


GTTHASAGVINILVFIIiSVFFLTHSLCI 
HISKYMFVYICVCI CTRTDFYTEFKSFC 
TFCFFGSSFFFLEKKFPLVAGLEEQGKD 
LG*LKPPPPRLKKFSC:riTLPG/AME*RT 
NG 


12808 


26709 


A 


12937 


166 


311 


PHS*WECQMVEAPALENMQFLTKFIRGL 
PLNDAALPIjHM/CHRKSCTQLFI IALLI 
VAKKa}HHKNL*ANEWIIJKM*YIHTTDy 
PLAIKQNE ILIHATTWMNWTL 


12809 


26710 


A 


12938 


51 


296 


LOyiYTLHTJj^MHTCI^A^ICYTCKYIHR 
Y/LHT * I PEDANS VKHKYTHTHI YVHTL 
VYPHTTYLMCSYIYSWVCVCVCVYIYT 


12810 


26711 


A 


12939 


426 


28 


KGFNLVNPRVFPSPRF*TPAPVFYFCGP 
IKKIFFFKAGGVKFDSIiKRAPLFFFFLK 
TGSCFVP\RVECNGI ITAHCNLCLPRSN 
DPPPSDLQVTGVKPPP\AKLFFFFLW*M 
GB'HHILDYQGETCLSQLLGWR 


12811 


26712 


A 


12940 


11 


213 


ATAPGLFSFFETGFCSVDQa3VQ*HDHG 
SLQP* ?P\GSSDPPAIiASQaARITGVSH 
CKGPEHTSLLT 


12812 


26713 




12941 


416 


38 


WLNNHSRLGFPNCWDYICKLPRLGPDVA 
SLLIHNSS\GTWYGATKLESNWYFSIPI 
YKDHQK* FAFTWQGQQYTFTVLPQDYVS 
SAL*H1ITVHRDLDHLDNPQNITLVHYSD 
DTMLLDLMSHKYQAL 


12813 


26714 


A 


12942 


361 


74 


IIHPSFSVSSKNIHKNAYPAALGGYSVY 

GVAILLFHYFINKLAFTLHCGLA/LNSF 

LR* iqepplgVsgsgpfscnvflapteg 

TIVQKPDPMATFG 


12814 


26715 


A 


12943 


125 


401 


S S FE PRC I PLNPE VAMQKE SVMI LC SPE 
SQEFLES RI KI KVLTDLTQDEL* GQAQ * 
LTP 1 1 PS \ I weakaggfle PRSLRLRS C 
HCTPAWVTE 


12815 


26716 


A 


12944 


92 


306 


KRTTNSPWGCSAy/GVAILLPL*IL/KL 
LAFTLLHGLPSNSFLRKIQEPSLGV SGS 
G PI FGNKLVAPGGLMI WS 


12816 


26717 


A 


12945 


322 


24 


NFKKNINLNPPLRGYFLFLKGGPGLTPK 
KVGEPLKKKKKKLNQQFSYFLSTYTVK* 
NN* / WLGAVARTYKPSTLGGRGGWI T* G 
QEFGTSLANMVKKILAK 


12817 


26718 


A 


12946 


2 


448 


GGAPMKRHG SLDRNYRVDTTRPS QL FL F 
QLYHNYLKFAYEATKEYMHRKETLCLPC 
PAIQFS-FPKSNLCYRGYLCMRTCAHTQM 
*MYRYIYTYVYMHIYICM/YRAICTYYT 
FE*KYILQKCILLCIPLFPLQYLKYVSY 
LNRHHFLIAA 


12818 


26719 


A 


12947 


155 


415 


LLETGGNAGCSSWMHVLLGKIHRYGiaGS 
HPl SLMAG* \GPPWLRRVaGLHGRPATL 
GLRYGPDSYGRQQW\EYCTMGASLMQRR 

RVRD 


12819 


26720 


A 


12948 


247 


250 


KYVYKYIHIVYKCVYIFTHVYGYTYLYT 
DVYKYTYLYTDVICICIYKCIYLYWYl 
YIHLCINMYIYIHLCINiyiYIHIHV*IYI 
HIYI\HVYIFIYIFLYTVYK*IHIYEVC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GlutaDiic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metiiionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=llnknown, *=Stop cotlon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EYIYTLWKL 


12820 


26721 


A 


12949 


25 


407 


EALSLLLLLVLWGLLLGIARKMYQQMHL 
VQYCLLVL*P*NL*EQKRFEREVGLDHR 
EP*PHVLGCIHSEKSFRFWSW*EL* IRK 
LTLVA\GGVRHTyNPNTLGHEGRRIT*G 
QEPETSLGNWRPLFL 


12821 


26722 


A 


12950 


370 


106 


ILYLETLLKSPIS IRSFSAVDSHFEKSS 
TVGKMLPNS ITCTDPI \FNKKKSQLMWQ 
SSLLPFSQKLPQSPQPVSQQPSPSRQQS 
I PATR*QLA*GSVQVMLFGSILPTVELF 

S KWES TAEKLLIL INDFSKVS RYKI YV 


12822 


26723 




12951 




DQPGQHSKTLF*VSH*KKKKGWM/PGVY 
AYNPRi'LGGRGGW IT*GQGFETSLANMG 
KPCFY 


12823 


26724 


A 


12952 


337 


113 


I QIAISLLGICPRDLNTVTCS \SDTCTQ 
MFLSVLLMVAKR* KQPKYLQIDEWINKM 
QCIHMESYLVSKGRKYLCKL 


12824 


26725 


A 


12953 


9 


186 


DPTVCCLQDTHFTYKDTNKLKVMGWTKY 
/CHaNGNQKRAQWIQIEyP*SKSLKSE 
IVQN 


12823 


26726 




12954 


46 


381 


NENTYSYKKNTQMFIAALFWTPNWKQP 
TCPS/SGEWINKLL\YHI.MVyYSAVKTN 
/ILVYVATSVNLRllILGKQPDKKRVLT 
V*FHLYRKyKr.ISSDRKQIHGRlEVENK 
GK 


12826 


26727 


A 


12955 


19 


250 


CSDMVLKACIKKLMSYSKMGHIQAMDYY 
TAL/NKEL/UyreaTI *ANLEDIMLSKIR 
QTQKDNCLMVDCIYIRYAEQSSL 


12827 


26728 


A 


12956 


173 


163 


GAKHSASGTGWIRKDFTRPGAVAHACNP 
STLGG*GGWLSRSEFETSLTNMVKL/CL 
YKSICCTRCLLGCFPQALSKAID 


12828 


26729 


A 


12957 


301 


79 


EKDNQPLLTPRNCNVUIMKESENHRIT 
ELKVTLRPDAVAHTCnSPSTLSGRSQ/WI 
A*GQEFE ISLAKKVKPCLY 


12829 1 26730 


A 


12958 


314 


126 


KHHHFKKHNFRVLCVCVCVCVCVCYKHF 
YLFSLSFSQ*P/CICC*CVCCSFLLCCF 
*SATYLKII 


12830 


26731 


A 


12959 


529 


1697 


VPFMGHISNFFFSPFLRQKYLALLPRLG 
VQWQ* SRAHCNLRFPG\ S SNSHAS ASRV 
AGI TGACHHGWLLFVFLVETWFHHVG\ Q 
AGLKTP*PQV\SACFSLPKCCDRRCEPL 
CPATSSILNL*TMDRL/PGEPKNQ*NLL 
VSFMCAVFYREGQQHVYFQRSLWCKREF 
IVRSPWAINLEKGGLRMGAV* IRIMLNL 
TLT*GFCTEDEKTWSRVNQKLTPT*LG 
DE*DVKREKNIIinElKLl,TCNSEYTKSY 
RLGR*IKLGAFCFFM\VPMGQTGENGQL 
ICVQV*TGTVAHVCNPSTLGG*DRRITRV 
QEFETNLGDHSESSSLQKIKLARHGAAR 
L*SLSL*VSWDYHCSPPWP\SNFCIFK\ 
RDGVSPYWPKA/WS*TPDLQRSAHFSLP 
KVLG 


12831 


26732 


A 


12960 




395 


GTREFRILSEKFKKEIDFFFFFKKQSSP 
PPGPKKQTL/C*KQTTPTKPQKTNLGPK 
KFGGQPKKPFKISHPQIGPPKKKNYNP* 
GQPVWNPPGEKKEGPTPPKFKDPGANLK 
VMGLKGGVKRKRGGKTFPQRA 


12832 


26733 




12961 


90 


436 


LWRLTSPKSAGWAGNLETQEESMLQFKS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
ifist amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=TJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EGHR/PAESPLA*SSVLPSLSTDWMRPI 
LI/MEGDLLYSKPIHFNVKISSKNTYRN 
IWNNWPHV*TL*PSQSD/CLK*TITPI 


12S33 


26734 


A 


12962 


415 


463 


LRAKDQVLPKILVC*TLLFFCAQIKKKK 
K/WPGAVAHaYNPSTLEGQGGWIT *GQE 
FKTSLGNMVKPSGIPPHWI 


12834 


26735 


A 


12963 


774 


316 


SISPTCSGKIGGKLNRHFSKEDKTG\RY 
MNKCFISLVISECyLKP*EDTH*TP\LR 
MAKIKRADH/DKC*QRYKMTGTLIQC*E 

erk1mvqplwktv*qflkrlnihlpddsi 
pllg\iy*rkmkacvhtnti iwmpiasl 
lvkak\akkqpkcpstr 


1283S 


26736 


A 


12964 


44 


395 


MYFTLVWGSLGPKFLGGHQPLQGQGPFL 
HFLKAL* * FPFPQTPGNGGGTPPAKTFP 
PFPTPLSSSSPCPSPAEGGSREPSLSTP 
/SCLHLYIGGAASNPSPPSPLPPPQCSG 
LGYPVC 


12836 


26737 


A 


12965 


424 


50 


mFPFrfiCAPDAINNNNKKCCREINNEE 
PPWEPLKWRHIMIFFFEAESHSFAR\LE 
CSDAISAHCKLCLPGSRHSPASAYRCPP 
RHLMTFSAFLVQTGPPRVSQDGKDLQTS 
♦SALLMPPLDPAIP 


12837 


26738 


A 


12966 


245 


44 


Q3KELQNYPLP3PSPS3P3PAFHGLPPP 
SSPSPAFHGLPLLP\SWTVQPQ*RLTAT 
SLPDPPASPHPG 


12838 


26739 


A 


12967 


205 


30 


QEQASLSRXKRGPGVQAWLGIVIVAHACNP 
STLGGQG/SWIT*AQEFETSLGNPHLYQ 
KYKN 


12839 


26740 


A 


12968 


114 


465 


DPVSNPETTKPEPPKKTTES * P/EPKSL 
LWLPS \P*NSS *CAHREQQPPSPQPHLP 
HPLPTSLQVHRLP**SLPPPNLFTPLYD 
/SPEPSLLPViaFPA*TSPP*PPVYSSP 
PSPTCPIfl 


12840 


26741 


A 


12969 


1618 


1038 


VHMVQDKDINLNNQFLSGTMLLFFKETS 
HR*DRGPGDF* SCLYSAPIPTQPLLF* Y 
LPQNFIEG*PS*NSRIKGTSLKTYISRV 
PSFFFSFL*DGVS\TVPQAGVQWHDLGS 
LQPPPPGFBCRFSCLSLLSSWDYRHPPLC 
PANFLDF* *RRGFTMLARLVSNS * PCDL 
PASASQIAGITRiXISHHTLPHVFPLISIV 


12841 


26742 


A 


12970 


197 


411 


EQALRSTVCYCLNPLPPAPLRPRQDC\G 
PCRFRWMPGGRTRWLAPVIPTL*EAEAQ 
VWLEARSLRQAWATW 


12842 


26743 


A 


12971 


616 


777 


MGIHGVEHNAQIMFVFFVEDGGEWQLTC 
TGAGDSSWVLGEP *VFAC/PKriVLFYLH 
P*LFG*I*IYQLKITF*NYEGMVP/FVF 
RFLFETRSLLS PQLE CSGT ITTHCSLEL 
VGS INS S I SAS *LDGNT \GVRHNAQLNF 
VFFVEDGGEWQLTCTGAGDSSWVLGEPL 
RYYKERLIKWYL 


12843 


26744 


A 


12972 


222 


389 


VITEALGSDELEGRQLWW* IKIHSQVQW 
L I PVI PAL/LEAKAGGLLEAKTSRPAWE 
T 


12844 


26745 


A 


12973 


425 


145 


QIIFFLFQKIKNQFNKASFFKKMTLEQM 
TSRQKNSAQISVFKKKNSKWITDLNVTH 
* TIQLTGKNXNIGKNLQDQGLGKEILNL 
TLKAQSTKB 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 

sequence 


Predict- 

otrde 

location 

correspon 

last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valiiie, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12845 


26746 




12974 


322 


411 


NNKKIFMSTS*KFFYCFIVQLIFETILS 
VRVLESIILINYL*YCIICIFPFLFL*T 
*EIRKVTKFTSFPQIEEK*KCRR/WLGT 
VAHTYNPSTLGG*GGRIA*AHEFETSLG 
NTGRLPSVQKIL 


12846 


26747 


A 


12975 


428 




SRKSSSQFRLSFFPSHFLDLGAKGAVSQ 
DRAIALQPPGPRSR*S*RPPSRPRCFRR 
PRPSRSRSNKERAGEKRRPRRSLQPRPP 
HIAGP/VPRRQ6SPSAGPEAPLATDEPP 
PQRRSGIRGGSANIRSKQTSSLRPRVRG 
RVG 


12847 


26748 




12976 


60 


294 


NHCTETVLFQ*LTCVCVCVCVCVCVCVT 
ERI /CFFKFL IGKGTYGCI YLWPKTF IL 
IFiyiLYSPRGRSILKKKFFFLT 


12848 


26749 


A 


12977 


6 


341 


DSLTLLPGLECSQAISAHCSLGLSG/FK 
QFSCFSLPSSWGFTG/VPPRMPQLIFPy 
FSVKTGB'H\HVG*AGLKLLIPPASASPK 
eWDyRREPPRPGDLWNF*PTGFKLRFPQ 
FPL 


12849 


26750 


A 


12978 


296 


42 


SQPASLGFKSVDLPLLAVCPLGKITLTL 
WDFSFI IYrKQE*WPRAQIFMPY\NPST 
LGGRGGRIS*AQEFKTSLGNIVKPHYLV 
K 


12850 


26751 


A 


12979 


20 


373 


KLYGGIWGFFFFRRGLNSATQAEMQGHN 
LG*LKSPPPG\*RNFPASASGEPGITGS 
HHKAGIVFYIKKKPGLKTFTLGKPTPPP 
SFRHWHQPAPMGVYSEILQEKSLTNQWD 
PQTKKG 


12851 


26752 


A 


12980 


380 


96 


PVFPPPWRAQKGGSIGREIRPPRGNQGK 
PGPP* KKKKNYPGGGGGPPSPPFPEG\ * 
PKKWHNPGGGTPP* TEFPPWPSTRGGEK 
KPPSKKKKKEK 


12852 


26753 




12981 


331 


71 


ELLIYSWVWWCTPWPATW*GRKLVBRI 
T\KSRRLRLQ*AKIVPI:jHSSLGDRVRPC 
PPPPTKKKLMTLCPWTEGSLSFSYLLEQ 
FHSC 


12853 


26754 


A 


12982 


383 


189 


RRALKSCiRHSTPAGPKNAALSQAQWLTP 
VIPTL*EAEAGGSLELRSPRLL*AMIEP 
\CTPAMVTE 


12854 


26755 


A 


12983 


144 


6 


ILAQ*KTWLSVVaHTL*S/TLGGQDGRI 
TQAQEFKTSLGNTRRPCLYK 


12855 


26756 


A 


12984 


341 


509 


ETLYTME*YTAIEKNEIiyiSFAGTWLELE 
A\VIL\SNLV*EQKTKMLPCSPLMGWEL 
K 


12856 


26757 


A 


12985 


387 


125 


DLPPTPPKLEPRTCSVAQVEVWCL/GSL 
QPQPPGP\SDPPA*AS*IAGTTGTRHHA 
*EMFPF*VYFVQTSEHII.GINPPGRHTK 
VAWEV 


12857 


26758 


A 


12986 


151 


350 


GRGGF*PLP*IFPPGQERPGFTPPKAPP 
PLGFFFPG/RFFFFLRRSIAIjSPRKWRD 
LGPLQPPPPPPTPGY 


12858 


26759 


A 


12987 


251 




RMSLLEKRRSNN*EARV/CGRVIYVATE 
VIKIMIGISMVAHAYNPSTLGGQGRRIA 
SAQEFETSLGNMTNPCLYKKYTHKKSRA 


12859 


26760 


A 


12988 


320 


3 


TPWGVKDFSPPPPPGEGVFPGRAPPPR* 
KFFGNFWKKRGFPNWVRGVLKFWG\KNF 
FPPNLPKSLGLRFKPPSPP*KAPPFFPL 
FFFFFEMEFRLCCPGWSAVAQS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, b=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \r=possible 
nucleotide insertion 


12860 


26761 


A 


12989 


353 


183 


EYGTHILTAAIFS IAKRWAHR*GSLTDE 
WVRKNSSLQSMEYY\SAIKKMKVLLHAA 

P 


12861 


26762 


A 


12990 


377 


2 


LSI IL INFPRFSLQRGKYPGPPPPKKNP 
PQKTPPRAPKKKKPFFFFFFFSFFKKNF 
PFFFKGQSKGGFFGPP*PLIFRFK\NPP 
PPPPQKIGFKAWVFSTPPPPIWGFFFFF 
FLRWSLfiliLPRLEC 


12862 


26763 


A 


12991 


134 


298 


PPNSYCDGSITLIPKSDKNITRKEKYL/ 
P/IYLMTIDAKVENICVLPNRIQQ*IKRI 

I 


12863 


26764 


A 


12992 


892 




QT* SH/ SLSRLEYSDTI lAHCSLKLMGS 
SDPPASaSQVAKPTGHCTTMPQPHTEGF 


12864 


26765 


A 


12993 


269 


2 


GLWCFVKQLKLTETCPHFAVPQFPLGIS 
KIPISILFFKKLLL/WPGTVJMHTCWPST 
LGGRG6WTT*GQEFGLiyiFHHWPSKFHHG 
QHGET 


12865 












SLYPIiLFQAEIYFLVTIHRSSMESCrYF 
FSVWPLDSLVQIHVKI /HIKGIGML* V 
N* QNCFKINISE* L* YIWPSTVADTLGC 
QGEITRAQGFKTSLSHMTKPHLY 


12866 


26767 


A 


12995 


106 


531 


WLLNLFAFLNDKGFLVPMNRSRI FSQQK 
VKQRSFE6CSSFNDV6SCYVTQAGEQWL 
FTGVIGTL*PGTPG\SSDPPASASQAaG 
TTGS*HHIQIJSIL1LYSSG*QTI*LL*TT 
DSTHLIFASLYAKEQLITFYFQATSLQP 
CT 


12867 












NTEPFLNIFFKDKVS /CSIAQAGGQWRL 
YSRCSLKLLGSSNSPTSASLVAGSKGRH 
HYTRL * LFFYVSRPQDYLPWS PVFFFS F 
FLSFFFWKGSFFLPPSWKARAPIWVNGS 
LPPRVKPIFIA 


12868 


26769 


A 


12997 


3 


379 


YRPSPSETKAVLVFFSFFSFFFLGTPGG 
KGPPKGAPNPTLTWRGKENFFI* KKKKP 
/RLI*KKGPPPKMGFKRERFWVLKPKGP 
QFGKGEKG/SPNYPHPRGDPKAPSLWKK 
PQFPGGGGGGAP KKAS 


12869 


26770 


A 


12998 


230 


409 


SCRPKKLLAFLKSSTEISQRWKQPIYI/ 
PSTDE * INKMWHIHTMEYYSALKRTKVL 
IHATT 


12870 


26771 


A 


12999 


310 


343 


DPVCTI I * 1 1 CDI INL* lECSKVW* SLKF 
YKIL ILFNVCVWPSVGNCHKWLCDCW* 
KQW/LWPGMVAHTCNPSTVRGGGGRIV* 
GQEFETSLGNIVRPSLYNNKSCI 


12871 


26772 


A 


13000 


390 


289 


IKLCQPRGVKLDSFKRAPPFFFFFETRS 
CFVS*AGVQWCN/LGSLQPQTSGPMHPA 
TSA*LSLIFMCSNSYVCIVK 


12872 


26773 


A 


13001 


1 


289 


GILRQISWYLPVGRSLAQAGVQRCDLG 
SL*PLPPKVKQFSCVAGTTRGCHHAWLI 
FIFSREG\GHPVKGASSDSPTFGFSKAG 
ITRLSPRALACT 


12873 


26774 


A 


13002 


359 


105 


KQKKNPSP PPGGGGGQGKPG/ P PKPFPN 
SRDFPPKNQSPFFPKLKKKFFNFY*KAK 
NP*KPNPPLKKGKKPKGFFSPPPPLKKP 
PNV 


12874 


26775 


A 


13003 


243 


293 


mnsqflkrl\riespygsvisllgtyst 
ek*kacpck;tctqmftsalyiiakkw\n 
kcpstvrw/iklwhintmeffcq * kkfii 
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nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO; in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

peptWe"^ 
sequence 


Prediet- 

otide 
location 

last^aniino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, I^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireoiiine, V=Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CIDMPQFVHLTVEGHLSPLFCYYI 


12873 


26776 


A 


13004 


465 


307 


SEACFILKLFMFCG*CCLFVC\SILLVF 
YFVFCFVLFCFAYYMQFFWQCLFG 


12876 


26777 


A 


13005 


207 


3 


DGSLTMLPKLEC/RWLFTGVIMAHCHLE 
/RLGSNDPPASASRVAGTTGVCHHA*LI 
FIFFVKMRSHYPTRP 


12877 


26778 


A 


13006 


2 


56 


DLALLFRL*CNGMITflHCNLK/RPG*LL 
GSRDPPTMVS*STQITGMSHEAHRPSPS 
CDF *RSCSWQAWQWHDYSSL 


12878 


26779 


A 


13007 


402 


74 


FGASLLKRHFLFFLSKPLVYNKKIPRPL 
VFLKKT\ PPPPGEATPFYYCCFFHKEGP 
GFGVKKEG*PKG/SPPPPQR6STPPFFL 
GAFKKPKVTTPFFKKNPPKTLKGPPFF 


12879 


26780 


A 


13008 


356 


145 


RPPPFFFFFETVSHPVAHAGVQWHDHSS 
LQL*TP\SSREPPASaS*VAGTTGVHYL 
PGSQLPILEHSPRR 


12880 


26781 


A 


13009 


227 


353 


GWSELT*/SN*AQWLTPVirAL*E2iEAG 
GSSEPRSLRSAWATW 


12881 


26782 


A 


13010 


249 


32 


NPDSHSWAWWCASWPGTQKARVGGSLE 
SRSLRLR*AMIVP \ CTPAWVAQEPLSLK 
KPPTIETLFPSPQDPSQ 


-nm 


26783 


A 


13011 


247 


403 


TPGGGGKYFFFWGPQKKNRGGGFKMGGG 
GKTRGPP*KQPASSSPFAPPPFFWGAPG 
FSPPPRF\KPPPPVFFLGPPKKKIFPPP 
PGGLFFFFLGGPPSPFFFFFFFLGGGPL 
FFPGLGAMOPMAGFRSLAPPGNSLSKKK 
KSEGLGEGGNSVLTRVLLISSYQIPGNP 
R 


-mm 


26784 


A 


13012 


206 


386 


TREAQTFWFPLYS*KAL/WLGTVAHT*N 
PSTLGGQGRRITSSQESETSLANMVKPH 
LLLEI 


12884 


26785 


A 


13013 


354 


31 


YKMNIWDIIHI*HT\yiPWPKGKERKR 
KKLEEYL/QDIMTKNFLNLMKNMNIYFQ 
EVQQTPSRINSQTSTPRHI IIKLSKYKT 
GREQW ILSVMIALWEAEWG P PEVES 


1288S 


26786 


A 


13014 


1 


419 


EETSFGKARCLPGYPGFHPRRLRSRSPL 
LS*MPLSHPQGPSQPPAGSMSSSPPATT 
STCSCSPPAAGCTRGRTTGRP FWTASG P 
AELDWASGSSV/EALTSGLPAEGLWGQY 
AGLGVPPRLLGPMTSCCMSLPFPSPARP 


12886 


26787 


A 


13015 


109 


426 


tslgniarph*qthmatirsvpppswkd 
st*lrlhpppphmcf/pd*slksn*y*h 
ylvwpXiwpmltllfvclfvcfrdrvsl 
ccpgwsawpsrltaastlwvqvilpp \ 

RLAN* IFFLRWGLAMLPRLV*NSWAGDI 
LPP*PPKVLGLQA 


12887 


26788 


A 


13016 


149 


422 


LKRIFFLKWFTHDTAPGDPDNMCPYSC 
F*NQIIKSGLFFFKKPHL/WLGEGAYPC 
NPTLLGGQGGRIT+FQEFETTPANMAKP 

LLYKKYKN 


12888 


26789 


A 


13018 


162 


377 


QSPGYPERVSKLPTITQRGMWMHQPSFV 
CTSL*KV*TGWAQWLTPVIPAF*ED\AL 
AGGLLEVKNLRPAWST 


12889 


26790 


A 


13019 


205 


427 


IQTGGPPARPFLWPWAPGKK*RVPPEGG 
PPKKKKKTPPPHPGGEREGS/PFSKKKK 
KKTPKKRAGVFFPGGPRPKEGPGWGPPR 
LDSKPVGAPPEPGRLGNGPGKIFFGPGP 
FFPLQPQKGRP*PGGEGGSFFFFLGVPP 



1324 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

otide 

location 

correspon 

last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGEPSIFCLDLPENA 


12890 


26791 


A 


13020 


248 


358 


SRCGGEAQWLLPV/I / PAF*EAEAGGSF 

EPKSLRPAWAT 


12891 


26792 


A 


13021 




ITEAGSDTQEIIIIITLYTIILCVYWRFFTK 
iKAQVKKKX'WVHM* PQLLGRL3MGGSLEP 
RSLRLQGAMTLPVTSTT\QAWAT 


12892 


26793 


A 


13022 


162 


410 


LTQKTAASQIILSFSSTSLEKI PISSES 
GGGGSVSVTQAAEQWRYHSLLQP*TPGD 
KRSSHLGIPHP\SSWDYRISKDWVMPV 


12893 


26794 


A 


13023 


250 


40 


LIKLNIFSCSLAIWTSSCMTSLFKYFAH 
FSFF* IDFYSSYYILDTNPFFRN/GITN 
ILSSSVTCSFILFW 


12894 


26795 


A 


13025 


264 


1 


VSaTKHGKLRLEPEy*C:B^IHH*EEKVFE 
S5fFEY/PCQTFLRKETVEPGAVAYACNP 
STLGGQCRWII*EQEFETSLTCIAflALR 
VTLR 


12893 


26796 


A 


13026 


110 


323 


CIDSSSWTQTDRCKEDRLCPGTTGTCAP 
GLLFLSAPPLGPF*PSQAHPSFHVSLSG 
PSPGLRPQLDIPLHPSTSIAPRPLPGPA 
S\LPRLEPHPSLPSPTGACPFLPPCLGE 
QRHWPLPTSLLWLGLDGHCLATPSVS 


12896 


26797 


A 


13027 


52 


53 


CPTLLQLTLC/LPPQASAMVDAPP*ARP 
LPPSSILAGGTSGEQGSVWGLAEPVKG 
YNLLVCRLLRPLEKCGIWVAVSRFSLYS 
LSWLPLARKG\NPLTACASWVR 


12897 


26798 


A 


13028 


262 


381 


HFGRLRQVnRLSPGVRDSLaJKVKP*QI 
IYTF*FSVSLSVINTYLKGPSTVAHTYN 
PSTSGG*GKWIA*AQEFE/TSLCNKVKP 
CLY 


12898 


26799 


A 


13030 


1 


846 


FRPIQAGRAPWALTLPCMGFTEGSPALL 
SHSSRCDIQMGCNTHA'VDDSTGEGPFHK 
GRMRESCWLRTPRCGLYTKGVGGPGMEG 
RERRSRSRRQHMPGRGHSTMASWSLRQC 
MRLKLQWASDLLSQLEMQIPSLPPGESG 
SLGLTQPRSLH/ CEQT IQLS PT PQESWS 
EELG\GGIVGGGRGTVAGRGGHREAGNK 
PSGTWGRQCCASGDKAMS I* LTSLY\AR 
RGEPGPGRPVRKWLRESSQNRIGGGEGR 
WE*GQRRSREGRECPVGWGGRKKPLRWG 
LMWS 


12899 


26800 


A 


13032 


308 




KHYHDFWCWI ILALSDAQSLDHTCV\CV 
CVCVCVCVCVLRDQGSHS CL* CEG* LVP 
HFVPGPVLCLLSPVIYVCFICSLTCKVE 
SLSNFFKKLNKALLRKCI 


12900 


26801 


A 


13033 


763 


244 


FLGPRI IGLRHE ISVETQDHKSAVRGNN 
THDNYENVEAGPPKAKGKTDKELYENTG 
QSNFEEHIYGNETSSDYYKFQKPRPSEV 
PQDEDIYILPDSY*L/CQNIDFCYWMIN 
IHCNFSTAKTRNQTKC*STVDWIKKMWY 
TYTIE YYAAVKK/DTKLTWEQKI KYHI F 
SLKSGS 


12901 


26802 


A 


13034 


326 


58 


EHYKCNYCQKIFQKL*/HL*SKKHVYAE 
DKYYKYKDGWPLLVSQIILYTFYTRGK 
P*SSCSNFVQNQI lYIGQKSCKCNKFGN 
TFFKKP 


12902 


26803 


A 


13035 


199 


3 


RFIYNHKVCITTFLPHFYF*KLYL/WLG 
TVAHAWNTSTLGG*GGRIT*GQELETGL 
DNTARPHmW 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Jsoleucine, 
K=Lysine, L=Leucine, M=Methionin6, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12903 


26804 


A 


13036 


94 


448 


gscatsmafkdtrktavepgwvhgiri 
kplekvcadlitgkkfknl/kvkgpvcm 
ptktijcittrktpcgesska\wdgfqmr 

IQK*LTNLHSPSKIVKQIISISGA?GID 
SEVTNADG 


12904 


26805 


A 


13038 


964 


1717 


HKLS CVKDLKDFMRQAGE \ VTFA\DAHR 
PK\LNEGWEFASYGDLKNAIEKLSGK\ 
E INGKNINLIKGSK\RPSRS RS RS SDPG 
TQKSPLGPRSRSPS PWRKP *QPVQEAR 
SREPGflRSQVPVPVK*GPPPCPEKEPQK 
\RGST,QSKILSPPSILWIRPEVSGPRSK 
VPRSVDSGN 


12905 


26806 


A 


13039 




725 


SLLFSPVLPLFLFLPLAYILQNLKPLQF 
SPDLKSKHLIFFCNAA+PQYKLDRSSK* 
PENGTLNFSILQDLNNSC31KIGKWSEVP 
EVQAFF\SHQSLPSLCAQCNS/CPNLPS 
FPP\PVPSVPTPSVTESF*SSFSTDPSD 
LSLPPQALLLARPS*VPILPQPPLLHPI 
I LLS PPLLTPRPAYSF I P *L /CPPPPAQ 
QFTLKKVAGAEGIVKVNAPFSLSQIR+H 
LGSFSSNIKIQPSSHILVWQQ? 


12906 


26807 


A 


13040 


1131 


2 


DGVSLLSPRLECSGTISAHCNLHLLCLN 

dspasas*vagitgpattpplifvyfff 

lvetgfhhvgqsglelptd\sdeppasd 
sqsvritgvsh/aylacsqf/ctrc*fp 

II*EQCVNMNINSSK*IFILISYLYRWM 
NKDPI/VIIERLLVRMPEGQSHAAW*NK 
TNKQT*ILELVLSVTSOGVSVRFNLSIP 
* VFPSTK*EGWMRCVNPSTLT* FIPQVA 
*TCQEHGLMKIQT*NPGLDPSIVSDHQT 
Q*EPSVKQVIiDISHDKLPMSLEELYHCR 
HTLASGKQISGVPRTIFLNHFHGFYFIY 
/CIFEMESCSV/SQAG/VQWRDLSLLQA 
RS /pgsr/rfscls /LPVQHPPPGPANF 
\CIFSGDGVSPC*PGWSQSPDLVIR 


12907 


26808 


A 


13041 


97 


626 


I CFTQVKVLNESQVQWCMPWPATWEAG 
AGGVLEPGS*EL*/CPVNSHCPPAWAT 


12908 


26809 


A 


13042 


2 


431 


HDPDTHSQIKVIKT,MQSEFCWGWa.W* LT 
PVI PALWEAE /AG3SPEVRSS 


12909 


26810 


A 


13043 


405 


1319 


RYEKKLQNKPLWLH/CLLLLPNSATR+T 
LQK*SK* ISPLESSA* *KEPPFLSVLSV 
TSVKT/SAIPMPPHWENVKTQVPYQLIP 
LHNQTHEYMEVSHLFGKTMDRNRIKRIQ 
RIQNLDLWEFFCRKKAQLECKKRGVPQIN 
KQMLFHGTSSEFVEAICIHNFDWRINGI 
HGAVFGKGTYFARDAAYSSRFCKDDIKH 
GNTFQIHGVSLQQRHLFRTYKSMFLARV 
LIGDYINGDSKYMRPPSKDGSYVNLYDS 
CVDDTWNPKIFWFDANQIYPEYLIDFH 
*FHFQISVVKEMjFFFAGRFALQSSSH 


12910 


26811 


A 


13044 


23 


450 


RSRAAAI QLGQRLRLP6PDS PRLTGRQL 
EPGTGRGSRAKRGPCSIFRFRRETPPQR 
EGGEALGATSRQAPPPCPPRQPWELDAS 
AKAAAVAPPKKVGFKAVSRDPRAEALGG 
KGSP\GGKGSLKGVAHR*GGLGPIi*GAP 
EAI 


12911 


26812 


A . 


13045 


172 


16 


AHVSPEVDKTKFW\LGPVAHICNSSTLG 
GRGGQIT/R/GQEFETNLTNMEKPHLY 
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SEQID 


SEQ ID 


M 


SEQID 


Predicted 


Prediet- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 






C=Cysteine, D=Aspartlc Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 




E=Glutamic Acid, F=PhenyIalanine, 


sequence 


sequence 




09/515,1 






G=Glycine, H=Histidine, I-Isoleucine, 








26 


correspond 
ing to first 

residue of 

peptide 

sequence 


location 
correspon 

last^amino 
arid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 


12912 


26813 


A 


13046 


181 


9 


YKWGPAMLPRLKCRAI \ IGSNMAHCRLD 
LPGSSDPPSSASQVAGTTGTCHLGARGG 

MV 


12913 


26814 


A 


13048 


2 


3507 


YVRVSLPPPPPAAGRPGAAVADDAREEE . 

EEAAPPPPPPPPPRLAAARPPGSQPRPP 

AAGEAQAAADMNHQQQQQQQKAGEQQLS 

EPEDMEMEAGDTDDPPRITQNPVINGNV 

ALSDGHNTAEEDMEDDTSWRSEATFQFT 

VERFSRLSESVLSPPCFVRNLPWKIMVM 

PRF YPDRPHQK\SVGFFLQ \CNAESDS T 

SWS CHAQAVLKI INYRDDE KS FS RR I SH 

LFFHKENDWGFSNFMAWSEVTDPEKGFI 

DDDKVTFEVFVQADAPHGVAWDSKKHTG 

YVGLKNQGATCYMNSLLQTLFFTNQLRK 

AVYMMPTEGDDSSKSVPLALQRVFYELQ 

HSDKPVGTKKLTKSFGWETLDSFMQHDV 

QELCRVLLDKVENKMKGTCVEGTIPKLF 

RGKMVSYIQCKEVDYRSDRREDYYDIQL 

SIKGKKNIFESFVDYVAVEQLDGDNKyD 

AGEHGLQEAEKGVKFIiTLPPVLHLQLMR 

FMYDPOTDOHriKINDRFEFPEQUiPLDEF 

LQKTDPKDPANYILHAVLVHSGDNHGGH 

YWTLNPKGDGKWCKFDDDWSRCTKEE 

AIEHNYGGHDDDLSVEHCTNAYMLVYIR 

ES KLSEVLQAVTDHDIPQQLVERLQEEK 

RI EAQKRKERQEAHLYMQVQIVAEDQFC 

GHQGNDMYDEEKVKYTVFKVLKNSSLAE 

FVQSLSQTMGFPQDQIHLWPMQRESNQT 

KRPAMLDNEADGNKTMIELSDNENPWTI 

FLETVDPELAASGATLPKFDKDHDVMLF 

LKMYDPKTRSIJSYCGHIYTPISCKIRDL 

LPVMCDRAGFIQDTSLILYEEVKPNLTE 

RIQDYDVSLDKALDELMDGDI IVFQKDD 

PENDNSELPTAKEYERDLYHRVriVIFCD 

KTIPNDPGFWTLSNHMNYFQVAKTVAQ 

RLHTDPMLLQFFKSQGYRDGPGNPLRHN 

YEGTLRDLLQFFKPRQPKKLYYQQLKMK 

ITDFENRRSFKCIWLNSQFREEEITLYP 

DKHGCVEDLLEECKKAVBLGEKASGKLR 

LLEIVSYKIIGVHQEDELLECLSPATSR 

TFRIEEIPLDQVDIDKENEMLVTVAHFH 

KEVFGTFGI PFLLRIHQGEHFREVMKRI 

QSLLDIQEKEFEKFKFAIVMMGEHQYIN 

EDEYEVNLKDFEPQPGNMSHPRPWLGLD 

HFTIKAPKHSRYTYLEKAIKIHN 


12914 


26815 


A 


13049 


30 


477 


VHAEHCAVWEENFEETVRWTSVKPLMTS 
PEIASLSWGQMKVKGSNTTYKDCKVWPG 
GSRTWDWRETGTEHSPGVQPAAC\DVKE 
WEKGVQTLVIGROMSEALKVPSSTVEY 
LKKHGIDMRVLQTEQAVKEYNALVAQGV 
RVGGVFHSTC 


12915 


26816 


A 


13050 


179 


389 


NIETIQSMFPI/DNEMKLEINLKKAIWG 
IHKYASQVLWFMPIPPTLWEAKVGEFLE 
PRSSRSVWETWRDPI 


12916 


26817 


A 


13052 


466 


1489 


PGKGEMRTRSPSPLAIVPRPQRASRPLL 
CAVSPMASASQATAKHEQILVLDPPIDL 
KFKGPFTDWTTNLKLRNPSDRKVCFKV 
KTTVPHRYCVKPNSGIIDPGSTVTVSVM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 1 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

peptide 
sequence 


ed end 

location 
eorrespon 

last amino 
acid 

peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQPFDYDPNEKSKHKFMV/QIFAPLNTS 
DMEAVWNEAKPHELMDSKWR/CPNENDK 
LNDMEPSKAVPLNASKQDGPTPQPHSAS 
LHDTETRKLTEECKRLQGEMMKLSEENQ 
HLRDEGLRLRKVAYSDKPGSTSTASFRD 
VTSLLPSLLWIAAIFTGFLFFFWRRSL 
TLVAQAGVQRRDLGSLQPPSPGFKQFSH 
LSLPSSWDYRRPPPHPANFCIFSREGVS 
PCWPGWS 


12917 


26818 


A 


13053 


892 


189 


SYDPGALGCRPAPRSGRGKGSGATRDMH 
PMSRERAKFVKSGL\YCKTTRELMLHAR 
CCLNQKGTILGLDLQKCSLEDPGPNFHQ 
AHTTVI IDLQANPLKGDLANTFRGFTQL 
QTLTLPQHVNCPGGINAWNTITSYIDNQ 
ICQGQKNLCNNTGDPEMCPENGSCVPDG 
PGLLQCVCADGFH/GIQVYAPGLVLTAY 
VLRD SG SHHS I RLHS ALGD PAPKSQDFM 
NYIGLTIDLRSI 


12918 


26819 


A 


13054 


423 


12 


sfnqknplrqkcslrptrpasqslatwp 
gqstrfweqalpwalwdpqskt?re\mql 
epphahtwthahtcghtga\fcsahtev 
yvhlhspvhahthahthsppvhtcgcrl 

HAHTH\PQTPSPPGLLSLLAVARPQS 


12919 


26820 


A 


13055 


16 


349 


RRSGKNDPWTDQS PRAAASHTRCPHPAV 
AAAAMPKRKTEGDAKGDKAKVKDESQKT 
S /ARMSSKSAASKAYAKPIKAPAMNGEN 
VPKGIQRKLEAFYDLQACRHLTRLIDFA 


12920 


26821 


C 


13056 


716 


594 


MMQTETGVMPPQARQCLEPPGAGTGRKD 
SPLYESEQSWPL* 


12921 


26822 


A 


13057 


333 


2 


IGNPGKTTFNKPPPPKKFIFSKFFMFLF 
FLVKTRSCYVAQAGLQTPG\SSDPSASA 
NQSAGITGVSHHTQSEILKYTVFCLTFQ 
TPEYTIPSTWSIPCCSHFSCYLCFLFAC 
FLRQSF 


12922 


26823 


A 


13058 


864 


529 


EVSTSKTAGCRGCALVISMLCIIFPQGL 
FTRALLRTCSMjLHADWPE I SFWVWF 

frtescsvaqdqvqwcdlrslkppppXg 
ssdspasaspvakitgmhhharhglks 


12923 


26824 


A 


13059 


2714 


1376 


GRYDGLVEQLGGRATPAVGFAMGLERLV 
LLVQAVNPEFKADP WD I YLVASGAD TQ 
SAAMALAERLRDELPGVKLMTNHGGGNF 
KKQFARADKWGARVAWLGE SEVAHGTA 
WKDLRS AIiAVGVI LGVGAL I GWR YWNS 
HQVDSARSASLAYQNAVTAVSEGKPDS I 

paaekfaaenkntygalaslelaqqfvd 
krjelekaraqlqqgladtsdenlkaviw 
lriiaevqvqlkqadaalktud/tpikge 
gwaaivadlrgeallskgdkqglt.svtl 
lsgcslenseedwkmsplptvenqftp 

yaadraglvkainaddgke iwsvsltek 
dgwfskepallscrvtsvwpnnqtekfk 
kqrtegpavaepeqagvrrhylypftgg 
yrrrraacqaerpaarcateehgsrw 


12924 


26825 


A 


13060 


99 


254 


RNHHLVAFKELSALiKVKRVDWAPWLMPV 

ipaf/leakvgrsleprssrpawat 


12925 


26826 


A 


13062 


127 


386 


FSYYYYFFFLFFLCVWIPGEPFLKMKL 
GKPKTIMLKEKKGGESYSELFFETDQME 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etli 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
l^ast^amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplmn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WL/ TPVI PTLWEAE VGGS FEHRS SR 


12926 


26827 


A 


13063 




209 


GKYILLLKVFCCFLRQGLCHLAHCSLRL 
PGSSDSRASASQVAG/ISITGVHHHARL 
NFVFCFFFLVCFCF 


12927 


26828 


A 


13064 


132 


1 


SHAMCCNYLKHFGWARWLMPV/ I PALWE 
AEAGRS PEVRSSRSAL 


12928 


26829 


A 


13063 


269 


134 


TLWSVFASS/WAWWLTPVILAPWEAKAG 
GSPEVRSLRICKQKVLS 


12929 


26830 


A 


13066 


853 


461 


RLWRSVSQFLCACVHMCVPSVCVCGLAH 
AGTYAVSTCVH/MCVCVCMCARITLAKI 
CSETSGNLLFRKI I SLLDLFTLNFS IKY 
KVQLVLKSRSLAWWGRIKLLTLSRPFSD 
VNTFERINYWLINDISIIR 


12930 


26831 


A 


13067 


57 


2066 


AQPTGRQTSTVAPTPATARSRGGRVPIF 
PCPHPAPTLLSTSRWTPASPAASMKAL 
RLSASALFCLLLINGLGAAPPGRPEAQP 
PPLSSEHKEPVAGDAVPGPKDGSAPEVR 
GARNSEPQDEGELFQGVDPRALAAVLLQ 
ALDRPASPPAPS6SQQGPEEEAAEALLT 
ETVRSQTHSLPAAGEP\EPAAPPRPQTP 
ENGPEASDPSEELEALASLLQELRDFSP 
SSAKROQETAAAETETRTHTLTRVNIiES 
PGPERVWEASWGEFQARVPERAPLPPPA 
PSQFQAHMPDSGPLPETHKFGEGVSSPK 
THLGEALAPLSKAYQGVAAPFPKARRPE 
SALLGGSEAGERLLQQGLAQVEAGRRQA 
EATRQAAAQEERLADLASDLLLQYLLQ6 
GARQR6LGGRGLQEAAEERESAREEEEA 
EQBRRGGEERVGEEDEEAAEAAEAEADE 
AERARQNALLFAEEEDGEAGAEDKRSQE 
ETP6HRRKEAE6TEEGGEEEDDEEMDPQ 
TIDSLIELSTKLHLPADDWS I lEEVEE 
KRNRKKKAPPEPVPPPRAAPAPTHVRSP 
QPPPPAPAPARDELPDWNEVLPPWDREE 
DEVYPPGPYHPFPNYIRPRTLQPPSALR 
RRHYHHRLPPSRHYPGREAQARHAQQEE 
AEAEERRLQEQEELENYIEHVLLRRP 


12931 


26832 


A 


13068 


282 


53 


PGFPQGKFRFKKKILPQDYPFQQAPNSK 
ARPGQGVPPGIPA\LWRVKKGGPliRSGG 
PGPPGAKGGTPFLPIPNTKKS 


12932 


26833 


A 


13069 


177 


2 


VSLCRPGWMECSGVISftHOILRLPGSSN 
S /PALASRVAGITGAEHHALLNSLFKVL 
SRA 


12933 


26834 


A 


13070 


789 


302 


yfisshfischlasssssssssssssss 
ssppafhysfhhfisfhlaihy\hfiis 
fhlfhlfs s fhs i sfrdrdllycpgws v 
pphpafhysfhh/vhvmsrhfilrqtrs 

IAC3AGIQWRDLCSLQPEVGSSRSA 


12934 


26835 


A 


13071 


193 


2 


PPYLKSHSQVDLSPGVQDQPGQHRETPS 
/LTKN/TKVSWVWWCTPVI PARWEVEVR 
ESLEPRCSTRA 


12935 


26836 


A 


13072 


122 


1495 


LLSDFFFFETE/SRSIAQACMQWCYLSS 
LQSLPPGFKRFSCLSLPSSWDDRCPPPC 
LANYCIFSRDRVLPCWPGWSRTPDLR 


12936 


26837 


A 


13073 


178 


347 


CCLCNDNSVLLCMYYLIiVCMHACMHALY 
IKTMK/MQLGAVSHACNPSTLGGQGEWI 
T 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO; of 
peptide 
sequence 


M 
cth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Met]iionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12937 


26838 


A 


13074 


165 


34 


FILSFPFETVSYFVAQAGVQW\LIiaHC 
SLDLPRSSDSPTSAP 


12938 


26839 


A 


13075 


203 


105 


GGWVQWFMS I IP /AIWEAKAGGLLEPRN 
SRPVWAT 


12939 


26840 


A 


13076 


3 


357 


HEREYTQAEKRKFIAFFFFKMGSCSVIQ 
AEVQRHDLS3LQPP/RVAGTTGTHYHTW 
LIFVFSRTDSRLPSSPRSPLTSVISLII 
PLSLIPTTYLTTPSRPSSTSPPSHPSFR 
APLTSDP 


12940 


26841 


A 


13077 


3 


477 


FLTELTRLFQKCRTSGSVYITLKKYDGR 
TKPIPKKGTVEGFEPADNKCLLRATDGK 
KKISTWSS KEVNKFQMAYSNLLRANMD 
GLKKRDKKNKTKKTKAGSNSS /ATTAAP 
AAAATAATTAATTAATAAQ 


12941 


26842 


A 


13078 


1077 


1566 


RQVTSLDPLKTFPRTSLEFTSPTGQLDS 
GATAFTWWPSGAPTPRGRESAAHQGN/ 
YPVGQRS\RMAGLSPRSFCWWEVSFSR/ 
SHSC/ICWVQLGKSAVLPQQSRGPGSHE 
GSRRRGGSSPPCPPLQTQLVFSPQELSM 
SAPIHRLSVTLQ6DCISTGGAPSRCRLA 


12942 


26843 


A 


13079 


3 


396 


HERGRKMVSVTNRLIQRLRNWASGHDLQ 
GKLQLRYQEISECRTQPTTKLPEGPSHKL 
FiSnSIYY\CTRDGRRESVPPFIIMSSHKAL 
VSRMPAYSFAEAATDBCKAVTPAPSIYRG 
EQSSDHPYLRHCTLTVTRLL 


12943 


26844 


A 


13080 


467 


109 


SLVPWSPHLKEPSCRYLIENVMLVPKE 
QEFWSKEAGSLFSPPVPFYCWTPSIVFS 

FGTKKALYLLEARGCKCJYKNQKLIWYLT 
FLSLLVK 


12944 


26845 


A 


13081 


458 


51 


VSLFCTVAPLLLLPEGIFFSCSLLEVKA 
PMRDVSGVFLTLFLYSVLFFPLGVFFFL 
FFFLILPPPKIFFFFFVCVALFFFSPAF 
GAVFLF\SLWSSVLPGVGFFTACSFFFF 
FFFFFFFFLRQSFALVAQARMQWR 


12945 


26846 


A 


13082 


23 


327 


RSDSEGGRRGHKYI FI PIYTHTHTYIFI 
YTQIYICLHIYACLS/LICTHIFISIYI 
HTYIYISIYAHIYLYIHTHIYLHIYTHT 
YIYIYIiHIHIYSSWVCKGS 


12946 


26847 


A 


13083 


352 


153 


KKLVEYYSAIKKNSVLIH/VSTWMRLKN 
ITLSLKTQS QRHLYYI I PS VENVQNRQ V 
HORSRLVDSED 


12947 


26848 


A 


13084 


121 


417 


dydyycyVypkiyltkllyfyWgnyiy 
thiyvhhvyicetympvythihtyiciy 

IYIWKETTWLFSCGISKTYRTHDLAYTY 
LVFCLTHVTFHVSST 


12948 


26849 


A 


13085 


242 


12 


lssrlpmlletkeitskilpsrkl/lpy 
gwcfksfkitwvsqihnrlVaqwlttvi 
pqlwea\ sgllkprsrrpawttm 


12949 


26850 


A 


1308S 


274 


340 


tkrgvfppspkkfffppktknfgrgggp 
kipppkkdffsknppgvffspp/ykkkk 
iifpppvklgppkdflkspppfffffyf 
ffffffssqevnlkalsstseiiffylv 


12950 


26851 


A 


13087 


241 


405 


pppskihignfrkmsdvkmatgwawwlt 
pviparweaeag\ vs lrprns rpawaa 


12951 


26852 


A 


13088 


537 


375 


KHSLTLSIiRLECRGTILAHCS / L/ CTS L 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-^Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycinc, H=Histidine, I=Isolcucine, 
K=Lysine, L=Leucine, M=Metli ionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGS S DLSLQPQQ I AGTTGACHHTHLE K I 
F 


12952 


26853 


A 


13089 


448 


186 


GALVFPSSQAPPCPPKMEQSFFFFFFFL 
ETGSRSVSKAGVQLHHGSLQPPPPGFER 
SSHFSVTTI/TYLYHHRLALHVFELHIN 
GIMHR 


12953 


26854 


A 


13090 


365 


191 


GQEREKEREREKRRQKKEEVE/REEEKE 
RVRDLESESNRALERERSALVDRAPLSR 
PG 


12954 


26855 


A 


13091 


148 


316 


DKMESCSLAQTGVQWHD\LG\YLQPSPL 
GFKRFCLSLSSSWDYRWETSTEPFQQAV 
F 


12955 


26856 


A 


13092 


712 


229 


FVAWVKMAEYLASI FGTEKDKVNCSFYF 
KIGVCamODRCBRLHNKPTFSQEVFTEL 
QEKYGE lEEMNVCDNLGDHLVGNVYVKF 
RREEDQERAVAELSNRWFNGQAVH/QEC 
TRGGFCNFMHLRPISQNLQRQLYGRGPR 
RRSPPRFHTGHHPRERNHRVFP 


12956 


26857 


A 


13093 


835 


405 


elierlgpnqkppilmwkpmpspsnmka 
saallcllltaaafspqglaqpvgints 
ttccyrfinkkipkqri^syrrttsshc 
preavifktkldk\eicfldptqkwvqdf 
mkhldkktqStpkllnihdwnlenqamt 

LRKPN 


12957 


26858 


A 


13094 


1137 


424 


LCPSHFAPTTLTQPGAHKNMCCIKSRFK 
RDLGLCRTCLVNKMFTSS ILGKSHRHSL 
VSINQGNlIALWKAaG\PLSWKAGYC\QG 
FSPCDSLKYG\SWDEKDLTVPQPDTHKG 
SVLRWISKRGKPLAVEIEQRATaLPGLA 
PWGTE\CLGYKTPIV\HLFNSEMG\ENR 
PYGGEARHVCSNAaLLFFTPLRCLGGEK 
HKSGLEAHPVIVISLELNYDIDSFAHMF 
FADLLLIITLLSCYIPFC 


12958 


26859 


A 


13095 


1 


324 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERP/RPRVLFFVRGREQSHPEKCEN 
KKEHAPPSHflLRKISIETRGGVSITTEA 
CVFVIGAHRVCRSVYIYI 


12959 


26860 


A 


13096 


508 


840 


dgvslcrpgrtimcsgaisahcklrfpg 
srqspglslpsswdyrrlkpmrpactfff 
ciff\vktgfhlvsq/amgldllns /si 
pprlglpkcwXdyrreatapgqelllkr 

TGM 


12960 


26861 


A 


13097 


53 


254 


WPQTASCLVAQAGVQWSDHSSLQRQTPG 
\SSHPPASASQVAHITGiyiHHCRWLHLI I 
LLRflHGSSPLC 


12961 


26862 


A 


13098 


329 


57 


gpgapllkgekllpkeppprispivsaf 
fawgpppgycflkkggpff\vfppsffg 
kkknspppphinggaktfpnppllfssf 
fff 


12962 


26863 


A 


13099 


253 


444 


EGAFSLGTFYFI FFLVLSNTFLLS I KNQ 

qiwnkktvwXpspflptlialttlllpi 
spfiliil 


12963 


26864 


A 


13100 


341 


590 


pmassqicigrippflylkwyfwpgkw 

HTCNP KTLGGRGAL I TQGQKFENSLAN \ 

makphly 


12964 


26865 


A 


13101 


55 


184 


rlreptplyse/haptkyyrmahhnyph 
S PP ipqprhhyrvfll 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
corrcspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl<nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12965 


26866 


A 


13102 


433 


762 


RNLLNCKRGLERKGCRASWPAVPLFLQR 
RVWPRPTAELNEACMLAQHQNCQNVKME 
YFQLSFTKRQEI/WLGWAHACNPSTLG 
G\QGGGIACSQEFKTIPSSKVEPCFYLK 


12966 


26867 


A 


13103 


349 


250 


DLFFSTKPPPPPPPPKNLGPP\GPFGPP 
FQEG 


12967 


26868 


A 


13104 


3 


236 


YPQHYPS CPTFQYTLKTKNHIirFLI FFE V 
GSCSWQARVQWCHHGSLQPP/NSSDPP 
ASAFQVAQTPQVSHRAPPKNQNF 


12968 


26869 


A 


13106 


154 




/WLGGVAHTCKPSTLaGRSGRIT 


12969 


26870 


A 


13107 


391 


150 


RPPKKPNKKYSQITSKGIKMVYTRKYLT 
QKKKAMSQALWY/V/HAVPATQEAEVGG 
SGEPGSSRPRCTMIlMPjXDSITHCAPAWAT 


12970 


26871 


A 


13108 


90 


252 


KVGDGGRCLVFCFYFKRQGFTVAQAGVQ 
W\ATIAHCSLKLLGSSDPQNYEDSWGT 


12971 


26872 


A 


13109 


197 


414 


lyefikitvli.glgcplgrygcsnqrff 
slemefhtvawarvqwhdldslqplppr 
pnrfsclr/ypqsswdyr 


12972 


26873 


A 


13110 


2 


317 


GRVGANVHKGHRQRTYGSVIPHILPLHV 

lkktfslrdfhfsvslkk\nlvltclhl 
fl/gvrtprndpfvsimmllftafldrps 

T I LGTGLLYTEGLTVAIjRIiAYLR 


12973 


26874 


A 


13111 


413 


162 


L ILLPQPAECLQLQASAATLDCLGLPRC 
RDCRRSLVHSVLNGAQAGVQWRDLGSLQ 
PPPPS \TCLGLPKYRDCSLCPAATPSGK 


12974 


26875 


A 


13112 


40 


296 


BSRLSALYIYHICVCVYNEEHLLYGIHM 
/SIHTYTHTHTHTHIYIHMRAFALEDKF 
I CSLLCBRQnNAFILVSVKLQRKNYFLT 
RR 


12975 


26876 


A 


13113 


2 


288 


FEPRCKtfSARGKVPRGFSPNLPHVTVET 
PEGSKTGFELESR/RRHRQIHSSAQCPL 
TAVPGAGDAIPEDASGWHTWLPLHAQNC 
FLLYIQAPEQPPA 


12976 


26877 


A 


13114 


685 


340 


lsppragsarptgpptdapgqrstwtcg 
alkprrpj^lrdaaenlfqelqehfqalt 
atlnlrmeemgnriedlqk\wviidl\mv 
qagiensikeqi^lktvtflwmsvfgdgas 

YRSH 


12977 


26878 


A 


13115 


65 


3021 


gvltqmgdekdswkvktldeilqekkrr 

KEQEEKAEI KRLKNSDDRDSKRDSLEEG 
ELRDHOTEITIRNSPYRREDSMEDRGEE 
DDSLAIKPPQQMSWKEKVHHRKDEKRKE 
KCRHHSHSAEGGICHARVKEREHERRKRH 
REEQDKARREWERQKRREMAREHSRRBR 
DRLEQLERKRERERKMREQQKEQREQKE 
RERRAEERRKEREARREVSAHHRTMRED 
YSDKVKASHWSRSPPRPPRERFELGDGR 
KPVKEEKMEERDLLSDLQDISDSERKTS 
SAESSSAESGSGSEEEEEEEEEEEEEGS 
TSEESEEEEEEEEEEEEETGSNSEEASE 
QSAEKVSEEEMSEDEERENENHLLVGKN 
LPGDRVPVPESRFDRDSGESEEAEEEVG 
EGTPQ \ S SALTEGDYVPDSLPLS PIELK 
QELPKYLPALQGCRSVDEFQCLNRIEEG 
TYG WYRAKDI<a<TDE IVALKRLKMEKEK 

egfpitslreintilkaqhpnivtvrei 

WGSNMDKIYIVMNYVEHDLKSLMETMK 
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SEQID 
NO: of 

sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 

correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QPFLPGEVKTLMIQLLRGVKHLHDNWIL 
HRDLKTSNLLLSHAGILKVGDFGLAEEY 
GS PLKAYTPVWTQWYRAPELLLGAKE Y 
STAVDMWSVGCIFGELLTQKPLFPGNSE 
IDQINKVFKELGTPSEKIWPGYSELPW 
KKMTFSEHPYNNLRKRFGALLSDQGFDL 
MNKFLTYFPGRRISAEDGLKHEYFRETP 
LPIDPSMFPTWPAKSEQQRV\KRGTSPR 
PP\EGGLGYSQLGDDDLKETG\FHLTTT 
NPGGLLPRGPGFSLRVAEVSELDPVIGR 
NSAGDHRRGYLPAWNCDGTRNSSSYFVL 
PCFVFVFLVCKLSRIKSFSLLWRKELCF 
LHDLPRAERVGI FRGAHVGQHKPHHHP L 
PLSTRTGLGWAVIWKGTGGSRVDCL I FG 
AGDLFLCWDERCPLAPTHSSRPALSTTS 
PRPPTTQPQLLPAGLEQVFYYKWGVLK 
CIKIFFEE 


12978 


26879 


A 


13116 


469 


202 


MI I IVGIiCEYTKSAVLVQHSDPLPPAPG 
VFFIAVLDWDIKIWHPVFCLFVCLYE 
MES CPVAQAGVQWRDLGSLQPPP\PGSS 
TSCF 


12979 


26880 


A 


13117 


193 


383 


AARMASTFFVFWFWFSTF \SLYS I CVCV 
CVCVCVCVCRMHTVQLYFFQTCCQCSSL 
LNPLFPMlSr 


12980 


26881 


A 


13118 


146 


350 


HYLVKFIiLLAHPGSKAPPUSTTLQPPPLA 
TREQPPLTVIFHYPPTSYKMAP\PYLPS 
LTLFGLSPPAPR 


12981 


26882 


A 


13119 


124 


254 


HALLQTLRDLSQAWWHALVIPATW\EAE 
AGGSLEPRSSRPAWAT 


12982 


26883 


A 


13120 


150 


45 


SLPRLECSFTVIAH/C/NLHLLGSSDSP 
ASASQWGGI 


12983 


26884 


A 


13121 


336 


446 


IPLKISSWAQWLTPV/IMPAVWEREAGG 
SLEVRSSGPA 


12984 


26885 


A 


13122 


181 


39 


QNLTAQDGVQWCDLGSLQPP \LPSSWDY 
RRESLCPSSFFIFWKRWGFT 


12985 


26886 


A 


13123 


463 


2 


GPAVPSGLYLKGKPMRRLCaSHSPESHS 
HDKGQGTLPRIPQ/PLLGEGGSR/PLAW 
GAVAGLPSNHPRIVPLPAPTRSGTRVRP 
HTGHQPGMSPGCGCESKWPHSAAAWPEA 
YPHFFLLPFPTQGCEVLGPLYTADPWVI 
CVSLLPSCPNSTAVDVD 


12986 


26887 


A 


13124 


429 


263 


DHFSFLCVTGSQSVAQAGVQW\LIIAHC 
SLKLLASSDPPASVFQSTRITGPFLSFT 


12987 


26888 


A 


13125 


310 


486 


NKKTTHGESSKQRQPRVSHHPGWGTVAP 
SGFTAASAS\RARVILLPRPPE 


12988 


26889 


A 


13125 


124 


461 


GSPLQLLPSPLAALTRDCSEAPMGSCSV 
AQDGVQWRDLGSLQRLPPGFEPFSCLSP 
TPAP\FPSGWDYR 


12989 


26890 




13127 


2 


337 


RGAAPAAMAVTALAARTWLGVWGVRTMQ 
ARGFGSDQSENVDRGAGSIREAGGAFGK 
REQAEEERYFRAQSREQLAALKK\HHDE 
E I VHHKKE I ERLQKE I ERHKQKI KMLKP 


12990 


26891 


A 


13128 


681 


422 


CRSDRWAKEHRGKRGQDSS KDVMARLME 
APKQTAQYFFIFYFFETKSYSVTQAGVQ 
WLDLGSLQRPP PG\ S SDS PASAS CAWPQ 
TAH 


12991 


26892 


A 


13129 


299 


579 


LVMFCKVTKIERLNLWRPGTVAHVLWSQ 
HFGRPRWAVHVGWGVRDP/HLTQHGETP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nnde- 

location 
correspon 

last^mino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamlc Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=pos$ible 
nucleotide insertion 














SLLE \NRNISQAWWCMPWPATREAEAR 
ESLEPGGQRLQ 


12992 


26893 


A 


13130 


268 


438 


KVDKSMKMRKNRPKNAENSKWKKftSSPP 
NDCNSSPARVQ/SWTENEFDKLTEVGFR 
KW 


12993 


26894 


A 


13132 


1108 


1529 


kihssfvllnaklvavaaktpkfykkkt 
dlpqtyhsaqtavpeipvsahstfirsf 
ihsfifetesysvaqagiqwhdlpsXpp 
6s\sdsp\asvsrvaqtigvcqqawlff 
v\flvet\gsfphlspltmirnkkqlfk 

KGH 


12994 


26895 


A 


13133 


497 


370 


RPSTVSPRLECSGLILAHCS \LDPPASA 
SQTAETTYGHHHAS 


12995 


26896 


A 


13134 


217 


373 


CNHTEITYVNKKKHRPGTVAH/SLYNPS 
TLGGQGGLITQAQEFDTSLGHMTLS 


12996 


26897 


A 


13135 


257 


507 


PVSPGFPPL\CPVPHTPHLCPPCRYPES 
PGQPAAQHCGAPRQPQPNPRACSSRGLF 
RCPPACHHAPSCLLCVCPPLPQCCRVQG 


12997 


26898 


A 


13136 


137 


309 


PSFPTHPQFCFLFCFLFVFETE/SSIVA 
QAEVQWCDLGSPDQGSSASPPSASSLPL 
SIW 


12998 


26899 


A 


13137 


297 


419 


GGSAVYYIWEIVQVLWLMPVI PT\WEAY 
AGGLFEPRSSRPA 


12999 


26900 


A 


13138 


213 


492 


QVGINYQPPTWPVGDLAKVQRAVCMLS 
NTTAITEAWGLPGAISSAKCALVHW\YV 
GEGMAVGEFSEAEEDLAftLEDYETCGPW 
NPVETEAERR 


13000 


26901 


A 


13139 


246 


534 


DRVLPLLPRLECSSAILGHCP/APASQV 
TGTTGTHHHTQLIFLLYCWLCPPPIiASE 
ASAESPPLLPRGFLCFSSTTSSITAIPA 
TTRDYVDVSRSGS 


13001 


26902 


A 


13140 


1 


206 


SLEWPFFFSFSETGSHSFTQAIVQWCNH 
S SL \S PTSASQIAGTTGVHNHAQL/ VFV 
FCFAFMSPQLSNSC 


13002 


26903 




13141 


164 


582 


GLVLVNLYLRSYLMVFIYINSRLLQVTD 
ASRERRENESSWFCRSVCRM/PPVGPGP 
SCVHPCSPRGSISPCTPHCPLKTTQAPA 
\PPRPLPRVGPGPHLDQSCCVAARPLYD 
PILVAAHSSWPAPPSPVPVPQEDTEHGA 
G 


13003 . 


26904 


A 


13142 


235 


367 


LALNTFCWLGWAHACNPSTLGGQGEWV 
TRGQEXQANMAKTCLY 


13004 


26905 


A 


13143 


91 


403 


GALQPATAPWEPLSGLAEAGAISPCLQG 
G/SGGRGASENRGCTRPGRVPGGRRLNE 
PRAWS S WPAPPAPGS ERPSAI NCPRAE E 
CGRRVWDWQAALPAAPAWVPV 


13005 


26906 


A 


13144 


558 


147 


ghsfqnwllgcplqlqrhllhppdasee 
cgsppwtplrphihpsagphgsthk\ha 
ggclfplrpspthpcgppvplpwsqXqa 
plcphppihsphdwecdpgksalipppa 
aqspsrlikhaanepecsprnrrpgip 


13006 


26907 


A 


13145 


321 


421 


GCAQWLT PV/ 1 PALWEAKAGE SL LFRS L 
RPAWAI 


13007 


26908 


A 


13146 


3 


398 


I ELLQSHDKTLTDKKLLLLDEQRKWFLE 
MESTPGEDVNNVELTTKDLEYHINLDDX 
KAGFERIDSNFERVSAVG/KMLSNS lAC 
YRE I FHERS VKVANFTVAI LPQPHKLSV 
TINLIIQQPSTLRQDLLLAiCR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 


Predict- 

location 
corrcspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucinc, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13008 


26909 


A 


13147 


228 


261 


SKTQRSSEYYSLSVCVCVCVRVCIYIYI 
FFFFETEFLS\APQAGVQRCDLSSLRLP 
PP\GSSDSPFSTSQSAWDYRREPQRPTS 
LHYFVQLKHNLHRVKRRDLRNTTR 


13009 


26910 


A 


13148 


3 


275 


KHLSDLQILRLGLSSKTRKTB.YSHFAQG 
RSIFFLLRPSVDWIKPACMDGNLLFSKF 
TNLNVWI \TKKHPHRNVQNNVWPNVWAP 
WPKEVDT 


13010 


26911 


A 


13149 


218 


382 


AWDHIKLESFYTAKETT/RMKRQPTEWE 
KI FANYPSDEGLVTRMYKELKQLYRKKI 


13011 


26912 


A 


13150 


391 


29 


FCFLFLFMFLFFFEMQDGVQWCNLSSPQ 
PPPPG\SSHSPASASQWEAENCLNPGG 
GGCSLGDRARLPQRKRERERETERKRKR 
ERAS CAP CGAITFTOMWCQKHHTCNHLP 
WLTVFETNKL 


13012 


26913 


A 


13151 


351 


157 


FGYSRFLCVYINFRTNLSIYNYICIYIN 
LYIiXNIHTYTLIYTYVHTYTHIHIFYiy 
IGILTGIT 


13013 


26914 


A 


13152 


402 


111 


TFFGGEKPPFTLRPGPPRREGGGGPPGP 
REGGFFRVPLGAFGPSRPK/RPEKSGPV 
MAPQI FLWGGKSRAI KESGKGFFFSRIT 
IiLVFATEKKKKTTLF 


13014 


26915 


A 


13153 


310 


141 


rehekqkplsqvrwctplvqaaqeaeag 
geleprssrlqcsimtXtpvnshrpparv 

T 


13015 


26916 


A 


13154 


153 


21 


LNKCVISWLGAVRHACaT/PSTLEGRWIT 
QGQEFKTSLGSIAKPHLY 


13016 


26917 


A 


13155 


429 


275 


SVDIPLEKTLINKDVFKHAAFKCKVS/R 

qarvkfkkrykpvktkwffqklwf 


13017 


26918 


A 


13156 


303 


42 


YRERAPVYPIIINIAFGPHKQ \trahth 
THTHTRMHAHTHTHPAI I PAFAKNGLGT 
FHHKGRTYNDKVKPRLSCEAAI P SNLKS 
REFR 


13018 


26919 


A 


13157 


198 


13 


SHGVAILLFFHFLNKRAFT \ yspamns s 

lceiqepslg\sgsgplsgnrmsgkghr 

KYFAKI 


13019 


26920 


A 


13158 


192 


2 


isnshrkdikdpevllieqkyilsgqmq 
wltpvipal/v/eaeagrslefrslrpa 
watdrdsvsk 


13020 


26921 


A 


13159 


40 


190 


RVDPRVRESRSVAQAGMQWRDLGSLQAP 
P\PGSRHSPASASQVALFLNRK 


13021 


26922 


A 


13160 


1 


178 


PTMVLSPADKTKVKA/AWGKVGAHAGEY 
GAEALERMFLSFPTTKTYFPHFDLSHGV 
SSYL 


13022 


26923 


A 


13161 


411 


41 


ESLQICVGLHAKPSWDACRPWLGQAWFR 
YFLLTFLGPLVFLLVFWFSKIYFLYHLK 
NPSLS S CYNLD\ FNTQYLKYCKKKKRLG 
AVAYACNPS I LGSQGGRI TRGQE FETRL 
GNMAKPDAWVDP 


13023 


26924 


A 


13162 


372 


218 


FWKRSWISKKGLFKRGVSLSIiSLS/CDT 
HTHTHTHTHTHAHSGYLCVLRKVK 


13024 


26925 


A 


13163 


247 


89 


QGLVLSPWLKCRGGITAHCGLLGSSHPT 
TSAXQVAGTTGTQHHAQLKFF ILI I L 


13025 


26926 


A 


13164 


122 


409 


FPSTAIAHSH/CSSPQGCSAYGVAIRLF 
LRSSNKLAL\LYGLAL/NSFFCNFQEPS 

Xlgslsgplsdnksmwlccfkspsswlf 

AI AVLGNY YS LNT WS GRRDPGT PNAC S 


13026 


26927 


A 


13165 


69 


313 


PSVSLFFPPNHNIVIHMGLLKINSIFGE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
l»st amino 

residue of 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














glalssrmecsvtitvhcgldllgsggp 
psaas/rvavtagtchhswlmflsfv 


13027 


26928 


A 


13166 


310 


26 


IKSMTPLPALNVSLSYYLPPLRPCRERG 
MVQDRFFCKLFCCCCFIFSETGSHSAAQ 
AVSQNKKTTGWAWWLTPVI PATHEAEAG 
\ESIiERGRQRLQ 


13028 


26929 


A 


13167 


296 


68 


KNGPLNLATKSLL I LTMKCLGWLGGKP 
NCSKLKKNKIKTKKWGWAPWLTPILPA/ 
LLEAEAGRSLEARNLRPVWT 


13029 


26930 


A 


13168 


2 


235 


GATPCPREPSLFPLAGWAPLSCWAPPPH 

CTQGTTPPGLMGTG\APIAIIGGNARSP 

P^:PFLPLSGTRPJU.SAQECPTF 


13030 


26931 


A 


13169 


248 ■ 406 


TERCGFDLLSLGQVQWLTSVIPALE/EA 
EAGELLESRSWRPAWATVIQNLI SLKK 


13031 


26932 


A 


13170 


33 


308 


NSLQGAEMAAAQSSLFSASEGYRGERGR 
GRGRGRPGRa/RSEGDRGGRGRPGRGRG 
RKVLLLRICVWRRVLRLWNRGRIGVGGR 
VTYLMESEI 


13032 


26933 


A 


13171 


294 


56 


ICKRNYLFFIYFLREGLTLLPRLECSGA 
ITAQCSL\GSSDPPTSASTVSGTTC3ACQ 
CALSSRDALISLCYPGWSQAPGLK 


13033 


26934 


A 


13172 


71 




DSLNYDNHiyQMPLIYINMGIAVTISLL 
GMLVYRSHLISSLLCIiEGI ILSLFIIA 
TLITLNTHSLLSNIVPIAILVFAACEAA 
VGWEECGFRSRPTALILEGPLRSCQPRT 
CVRRPACTRRGPAFVGKNRHS I GG 


13034 


26935 


A 


13173 


86 


318 


MWCNVCVCVCVCVCVCVCIHICIYV/YV 
YICIYTYKYICIYTYKYICICVYIHVYM 
YMCVYVYICICMCVYICIDSKGR 


13035 


26936 


A 


13174 


466 


726 


NCMKFGAVTRIG\DLPWEINPLSSCSLL 
REKDPPTTSGPQTDQPKKHLTNFKSGKR 
PLFTLFSNLPHYPSTSFSFQSWRHTSIS 
PFS 


13036 


26937 


A 


13175 


128 


325 


FLRVILICHKMYGCVCVCM/YACICNGM 
CIYMCVCVKSWKLKPELKCLCI SDEVGW 
VLTQWAGREEN 


13037 


26938 


A 


13176 


134 


311 


INSNSEKMGSHYVAQDGVE/WQGLFTGH 
GIAHCNLKLLGS SNFSAS I SQGAGTTGI 
YANG 


13038 


26939 


A 


13177 


577 


1052 


S PTSTRTGSGVAMWLS QPMGRT \ CRS KV 
ASKSRLHGPLIWAEPPIVRCPMLRHHY\ 
KAQAGRGLS LEE \LRVAGI YKKVAQT IG 
IS\EDARRRNQST\QALQAKV"QRLK\ED 
RSSLILFPRK\PIjAPKKGDSSAEELELD 
TQLTGPEMPIGNVYKEKARVIAD 


13039 


26940 


A 


13178 


496 


678 


TIMFVWAQGLKITRPWKNAPVFPVIWXR 
LRRKNLLTLKGEICSDPKWPYCLPPWKT 
KKNLV 


13040 


26941 


A 


13179 


192 


421 


GIYTFFLSFFLFLRQSLTLSPEAGVQWC 
NLCLPGSSDSPASASRVPG\ATGT\CHH 
GSANFLYFLVGDRVFGYVGPR 


13041 


26942 


A 


13180 


144 


5 


LQLAI KLLKMRLG WAHICNPS / TLGGW 
G\GRSQGQEFETSLANIVKP 


13042 


26943 


A 


13181 


351 


51 


KKKKKIFFGTKKRRFFFLKGVWGPPPPK 
VSPPPPPKISPNPPKNNPLLKR/IKSPQ 
NWLFFFGPPSCKFFPPFFLNPKWSPWEN 
PPSGGEFSPKKKKKKSAI 


13043 


26944 


A 


13182 


2 


218 


IHFGCFVFLFFETEFCSCHPGCRAGVQW 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lf=Leucine,M=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RD\LAHNSLLPPPLG\SSDPPTSASQSA 
E ITGMSHHACPI FKKHS 


13044 


26945 


A 


13183 


33 


304 


EKHGIFRAVKILCMILQRWTYATHL^RP 
TGYTAPRVNPKLWTLGDNDVSM/SGSLI 
VTNTALGWGLLIRRSSACMGQGDMGNLC 
TFLSILL 


13045 


26946 


A 


13184 


277 


13 


FS FFLI VEMGS C Y I AQAGLE LLGSSNP P 
SSASQSAGITGMSHHPAWIATLKRAFVS 
D\NSLSFPSSENAHLSILAGHFCLMGFF 
FFQ 


13046 


26947 


A 


13185 


140 


361 


ITCCCVCIYTHIHTYVYVCVYIYTHTYI 
RVCV/YYIHTHTYIHIYIHVFIYIYIYT 
FMHTYIYPYVCMVLLQLTL 


13047 


26948 


A 


13186 


64 


759 


CLSAESAPTSTMPMTLGYWDIRGLAHAI 
RLLLEYTDSSYVEKKYTIiGDAPDYDRSQ 
WLNEKFKLGLDFPNLPYLIDGAHKITQS 
NAILRYIARKHNLCQETEEEKIRVDILE 
NQVMDHHMELVRLCYDPDFEKLKPKYLE 
ELPEKLKRLTS\EFSGGKRPWF/SQGDK 
ITFVDFIAYDVIiDMKRIFEPKCIinAFIJff 
LKDFISRFEGLKKISAYMKSSQFLRGLL 
FGKSATWNSK 


13048 


26949 


A 


13187 


41 


426 


LFFFFFESGPCWVTQAOQRR/WNDHGSL 
QPGFPGLKGS PCLTLRRSWNYRPGMCRH 
TQLVFAFFFFPREKKFLLGAPTGFKTRG 
LRGPSRFGFQKGGNKGQBPPPRliKGLIF 
LEPERGQAFFGVFFGPTQ 


13049 


26950 


A 


13188 


39 


197 


FFFLFKTRSCSFIQAGVQWCEHSSLQPQ 
TPG\SSAAPASASCKAGTTEAALKL 


13050 


26951 


A 


13189 


221 


396 


GIQLQGAEGLWDVTNSPFCWRLYDQKTV 
YHECRMWfiCTSHCP\PPKQLLRDITPRCW 
AP 


13051 


26952 


A 


13190 






PGSTISSRREGA.CQSRGGHFPSPRG3SG 
VASLERAESWSTEPAKAIKPIDRKSVHQ 
ICSGQWLSLSTAVKKIVENSLDA6ATN 
IDLKLKDYGMDLIEVSGNGCGVEEENFK 
glXtlsalkhhts KIQBFADLTRVETFG 
FRGEALSSLCALSDVTISTCHVSAKVGT 
RliVFDHDQKIIQKXPYPHPRGTTVSVKQ 
LFSTLPVRHKEFQRWIKKKRACFPFAFC 
RDCQFLEGSPAMLPVQP\AKLTPRSTPP 
HPCSLEDNVITVFS S VKNGPGS SR 


13052 


26953 


A 


13191 


235 




S PCARQCCPPNPAGQEPRRRLERGPGKW 
WPRS I KFPLPAV/RARFPLL PSAPLRQL 
VSGRVGGRVGRPGKAVQVSGGLN 


13053 


26954 


A 


13192 


2 


386 


VIYLLLFFETESCCVAQARDGVSPCWPG 
WFQTPG\SSDLPASAPK/VAGITSVSHR 
TRPDVRGFKKC/GLQLEMSHLALLL/TT 
SHRFWYFVLLFSFVSKGFFIFFFFLFFF 
FFLGKGFFFFFPRGGGGGQI 


13054 


26955 


A 


13193 


749 


506 


GQTKAFVLS YCGASPS IKQ/ PCPQAKDH 
PLEPSMHPEGTQLQSCSTMLGPRQLSSE 
KQPLLPPRSHLKSSPMLRACKGLTS 


13055 


26956 


A 


13194 


73 


297 


RMPGFGALGSPFSCQRIECPC\AGS.GSC 
TYASFCKCKEYKCTS CKKSECGAFPGNQ 
GAGQSQRREPRAQQAGAGQ 


13056 


26957 


A 


13195 


670^ 


390 


PRETY IKPFARS IGCQPPKT IHPTSREP 
STRPPARPHaRPPVHTP/APSVHMSDRP 
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nucleotide 
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correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
corrcspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Froline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop eodon, /=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














STC?PSIH\RPPSTDVSRPQYSLSPSTS 
LALPGQSCLGR 


13057 


26958 


A 


13196 


2 


438 


LSFTSIEFCHLLFLNQLICNYVIFTKSS 
TISCEALGRLLWYPWTQRFFESFGDLS 
T PDAVMGNPKVKAHGKKVLGAFS DGLAH 
L\PPLKPTFATVSEIiHCAKLHVDPENFT 
L LGNVLVCV/MKHNFGK 


13058 


26959 


A 


13197 


57 


499 


S PSWPRSSARPCSGHCIiLPGLWDHGKAP 
CLEAAEHSQGHQFRKQRKEQCSLWAaGW 
TKNPVPTWRAG/SRRGPGVTSPPPVPSV 
QNPP\ PPQRRHGGSQATRPVLFTPPPKH 
MEGPNAPILGDQRITflPPPEILYRPPRA 
ATCDALRRG 


13059 


26960 


A 


13198 


34 


465 


GILPYSPCVLRAPNSKRVRKCGCTTPRH 
L/CKATRKCBCTTPRAPUSTETRKCGCTT 
PRHLSRQLGNAGALPRGHLNETRKCGCT 
TPRAPLNATRKCGCSTPRHLSRQLGNaG 
ALPRGHPLKETRKCXSGATPADAFYETKK 
IQGGL 


13060 


26961 


A 


13200 


177 


1075 


PTSSSMAFPKKKLQGLVAaTITPMTENG 
EINFSVIGQYVDYLVKEQGVKWIFVNGT 
TGEGLSLSVSRRRQYREEWVTKGKDKLD 
QVIIHVQALSLKESQEIAQHAAEIGftDG 
lAVIAPFFLKPWTKDILINFLKEVAAAA 
PALPFYYYHIPALTGVKIRAEELLDGIL 
DKIPTFQGLKFSDTDLIiDFGQCVDQNRQ 
QQFAFLFGVDEQLLSALVMQATGAVGST 
YNYLGKKTNQIVXEAFEQKDFSnaLNYQ 
FCIQRFINFWKLQFQVSQTKAIMTLVS 
GDSNGPTPASTCRKPPGEFT 


13061 


26962 


A 


13201 


35 


464 


VQEFKTSLGNVAKPCLYKKKKKKSPLGG 
GVPPPPKKTWVGGTLSPQKIRPAGPLIF 
PPPLPRGQQGEPLS PIKGGGGGGSS \YP 
PPPPPCLTGRA/PPPSPISAKSSPSPPP 
CQYVYHPPLLPLFFFLHALRRATAPLFL 
FLVEA 


13062 


26963 


A 


13202 


1153 


259 


AGGVLRLeWTCSRMASDSdNQGTLCTL 

efavqmtcqscvdavhkslqgvagvqdv 
evhledqmvlvhttlpsqevqall\egt 
grqpvlkgmg\tgqfsesggqpvaipgg 
gligtvqgwrflqltpercliegti \pg 
lepqlhqlhvhqygdlt1sincnscx3nhfn 

PDGAS \HGG?QDSDRH\RGDIiG\NVGaN 
AAGGAFFRMEDEQLKVWDVIGRSL\IID 
EG\EDDPGaREGQSLYPKITRELPGSRL 
SCGI\ IAKSAGIiFPEPQSKICSCDGLT\ 
I WEERGRPIAGKG \RKESAQPPAHL 


13063 


26964 


A 


13203 


281 


420 


VDGSKKYNEMPVIPALWEAKaGGLL\EP 
RKSRTAWATREDSVSTKN 


13064 


26965 


A 


13204 


3 


1125 


SDS PQTPRMRVMAPRTlilLLLSGAIALT 
ETWACSHSMRYFYTAVSRPGRGEPRFIA 
VGYVDDTQFVRFDSDAASPRGEPRAP\W 
VEQEGPEYWDRETQKYKRQAQTDRVSLR 
NLRGYYNQSEAGSHTLQWMYGCDLGPDG 
RLLRG YDQSAYDGKDYIALNEHLRS CTA 
ADTAAQITQRKWEAAEAAEQWRAYLEGT 
CVEWLRRYLENGKETLQRAEHPKTHVTH 
HPVSDHEATLRCWALGFYPAEITLTWQR 



1338 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
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SEQID 
NO: of 
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sequence 
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od 


SEQ ID 
NO: in 
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Predicted 
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nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan,Y=Tyrosine, 
X=Unlcnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DGEDQTQDTELVETRPAGDGT?QKWAAV 
WPSGEEQRYTCHVQHEGLPEPLTLRWE 
PSSQPTI PI VGIVAGLAVLAVLAVLGAV 
MAWMCRRKSSGGKGGS CS QAAS SNSAQ 


13065 


25966 


A 


13205 


417 


13 


ghprvmlqchmgsaslsimtpppirsyq 
paltrssatkpa\tsapsitpmtpvtsn 
is/vwptanttttklsnptss/stlstp 

IWHPSIIjPAPPSTPAPLTTVTNSATTST 
I CFHVQLFQAHHCHHCI AAALEYTS 


13066 


26967 


A 


13206 


1139 


559 


GLSELSIHPSVARVQEGHGATPERSIPW 
PGGKPRRGQRGTGLREWGGGSHRSLGQT 
KKKGAKRKRDHCAHGPIAAFQGPFPRAT 
QLLSPCHGCSWKENQPP*GPPGHACTAW 
EKTVTHGSCRPVTKPSPDPGFVGQAPGT 
KGNPAPGMDGCaGFEISCHPVPITKPLS 
NTGICVLLCYEFGFLIAIV6CWGFK 


13067 


26968 


A 


13207 


306 


332 


ENGEIRIYIYICLYLIKKHWEEAGCGSV 
HL*SQHLKRLKWEDYLRPGVPDQPGQHR 
ETPSQKLNK* INNDF 


13068 


26969 


A 


13208 


48 


233 


GKQNHLKCLWCRLLFLIiSAVISRMVYIL 
T*LSLSLSQCVCVCVCVCVCVCVCVAIL 
ENYPH 


13069 


26970 


A 


13209 


250 


2 


YCEGDLGSSRPGVSKCaiEFSPRRDKHIM 
PWPGAVTYAYNPSTLGGRDGWIA*A*EF 
ETSLGNIVKPHLYTHTHTHTHTHTHT 


13070 


26971 


A 


13210 


615 


120 


SVFWPLLGGVSWSGYTEVRGPLEKAVCP 
LSELEYCaGRSAALFHAIRLFKDALSLL 
KLCPQPPFPLGAPLSP*EaLVCVVPLHV 
STCLS*WCPTPCPPERSWCVLFPSMYPR 
VCPDGLLPPVPLRGPGVCWPLHVSMCL 
LSLFNSLLRLRTCGVWFSVPVLVC 


13071 


26972 


A 


13211 


239 


429 


FILLRNKLQPGTVACGCNPSTLGSQGBCW 
IA*AQGFETSLD1!IMVKPCFYLPQLKKKK 

KIKNLGV 


13072 


26973 


A 


13212 


954 


503 


VQIiLPLCGRDLYLFAFNLPTYRCWQHFL 
SKPEAPIIiAGGSDRWPCPCCPFWLP*WC 
HVSAHAPQPFPCSiKPEAPLPVGCPCSPA 
PSLEAALLILRAGLWLRAPFCFSAHLL 
SCHSQCCQFPCSEKRSCLGGRDRHAASN 
PLSAVWPPGE 


13073 


26974 


A 


13213 


1844 


1147 


LHSQ I YSTAKKASLSMKGSRDKTRAASS 
RPVPSVLGVPPWSTLLQHPQNMWPGPAQ 
QQGQPSGRQAWCTPGEAPGAEAAPQ*QP 
HPEEDHSGGPQASAALALPPSPPSQRDV 
QGETGMQGRSAPRSASSSACCACRWSRL 
PCPQLLQRHPGLRLVSPAHSRPPGPAPS 
SS SGSGLVPGYLPQKGLAGLSAGAVLCP 
PGLLRVAHGAGYGPSAQMLHAHLSSSSL 
RGKRFLRF 


13074 


26975 


A 


13214 


413 


60 


LCSKSCHGSFAMDCETLLTFPRLFYPIL 
HMPTVPGQALPVHTPFL* LKLPLLPGI P 
SCHSPLFLD*AQE3PPPGGLPGHPPGSG 
TOCLWFHS LKHPVLSGS SVSTPGSAS PS 
RSRAL 


13075 


26976 


A 


13215 


67 


196 


GLSLSSPKVTHNATLMGCIFQN*KAFVY 
HSMKQKKIILLFJS1MA*P*YPLDFGEQWP 
njHGSHAYSTIL*LDLFCKKEEE*DEIPY 
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sequence 
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od 
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26 
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location 
correspond 
ing to first 

re™l°e'oi' 

sequ'ete 


Predict- 
ed end 
nucle- 

correspon 

last^amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalaiiine, 
G=Glycine, H=Histidine, ]=Iso leucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=TryptopIian, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














V*CFMLLWKSTTM*KKIILLFNMA 


13076 


26977 


A 


13216 


161 


2 


SVFFLFIDRRLYLIKSVWAAYNSSTLGG 
RGRR IT * AQE FKTSLGNICARPRVYQ 


13077 


26978 


A 


13217 


57 


184 


CLTVCKDKVSLCYP6WSPW*SYLTAAS 
WSWSKAILPPQPPE 


13078 


26979 


A 


13218 


292 


400 


LKVKQGRGHLDLHTQGPUDL* SNSLPLSH 

TPSHRRTV 


13079 


26980 , A 


13219 


186 


394 


YT VT,RF-,FYFT,FTFT,SRVST,?T ■-'FTT.KT 
*LGIVAYTCNPST1.GSQGGRIA*-AQEFD 
TSLGNIVRPPSLI 


13080 


26981 


A 


13220 


315 


448 


KTKDLGVVDHVYNPSTFGG*GGRIT*GQ 
KFETSLDNRARSdYK 


13081 






13221 


3 


13 


LRPLRSFVSVERLWASCCLGSELWDKT 
KHKKHRELSEEHKQEIKimFELFDTDKD 
EVIDYHELKVAMIALGFDVKKADVmiL 
KDYDR*VIGKISFGHSIEIVTDFQL*R 


13082 


26983 


A 


13222 


1 


715 


VDMHSRGSPELTHaSTHASGKMAAPWAS 
LRLVAPMWNGRIRGIHELGAAVAPEGNQ 
KKKRTILQFLTNYFYDVEALRDYLLQRE 
MYKVHEKNRSYTWLEKQHGPYGAGAFFI 
LKQGGAVKFRDKEWIRPDKYGHFSQEFW 
NFCEVPVEAVDA6DCDIISrYE6LnNI.LRL 
KELQSLSLQRCCHVDDWCLSRLYPLADF 
VAGAFRWAGCPRIFRTGAKHGFHHLQ*D 
LSSGWATCPGTSPT 


13083 


26984 




13223 


1019 


1238 


LTGAEYKKSDQLTESVTANIEPEAPKVF 
SRLDDEAPVTV1PARLPLPAP*HTARPD 
PPQRRRSMQLAPARPFL 


13084 


26985 


A 


13224 


1534 


1181 


RWNSEPDYSAGGPRAGRGARLLPVPKSP 
GGGGAVAPGPQTPLRAERTAVRLS I S S * 
NS STAPADPGAAPPLHPQPAEAGTWVSA 
FLRVIFCFPS.LGFGCWEKKIAFVPFPPT 
DEKCL 


13085 


26986 


A 


13225 


328 


22 


CLESPRDRPFQGPAFPKHPPPPLAFLSQ 
PGIECVQYFFESNCKSLSSQEIKKSRSVPT 
SPPAPSPPPPPFLYFSLTAEGGEV*KET 
LDSDEAPPS SREAP I CQV 


13086 


26987 


A 


13226 


180 


5 


SQGRRIT*AKEFETSLGNIVLLIYTIIY 
YI 


13087 


26988 


A 


13227 


237 


528 


vglcqdpllltltdsfcsvlwggshlaf 
hknlywhidl'vmyfsrisfyftlsaai 
clslylgshilspfl*lfycayttlylc 
littlfflipkv 


13088 . 


26989 


A 


13228 


140 


366 


NATCLWHPMS PYAPTY * C*LS PGLLRWV 
lAITPIVIiMILISSYNHVSISYGGTFRS 
NFFLDQI IliTFLI YELHIV 


13089 


26990 


A 


13229 


257 


25 


LPAHSLGRDLSAQPYSMPRPGGELRAEG 
QS3LCSVAR33L*LHVCVCVCVCVCART 
CVNVFACVLLSTKVCLHPLPE 


13090 


26991 


A 


13230 


561 


52 


WPSCSSSGSPiyiLLPPVPPGSLG**SFSP 
LPAAAALPAPCAAPRPAPLHPCGPAPTP 
AAAPAPAPPAAASLSAAHAAAGSPPGSR 
PSARGARRRPSGPCAPPEPRLGPRAPSS 
APRTRTPS*GRARSGGSAGNAPSARRTP 
QGPPEAACSLARSWIWPAGRGGSCGRAL 
GA 


13091 


26992 


A 


13231 


335 


103 


CQKYSTSYcasIKWIiATLVIICSRKI*DGL 
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nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
ocl 


SEQID 
NO: in 
USSN 
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beginning 
nucleotide 

correspond 
ing to first 
nmino acid 

peptide 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
'W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KGQAWWLVPVIPMLWKAEAVGLPEARSIi 
SSPCNMARAPALKKIFLIGQV 


13092 


26993 


A 


13232 


452 


712 


VGLEGED*PPFSPAQSQSLSCWPDCSCP 
PSWPPDQRPGERPQPPDPPESHGAEEEG 
LPERPSDA*ARPQPFSLHPGVFPPQNAS 
TQ 


13093 


, 26994 


A 


13233 


480 


2 


SCFSEDWNPLKFKLQLSTSLSYRKSHTL 
HCaHHFRI3QCHSD*AFLVFSLTCILQVG 
SLRFTflGDVSPSYPEKLRVL*EGRVADF 
HVMARKGQQAALAILLQNaRTQXVWWLT 
PVIPALWKAKASGSVKPRSLRQTWQNGE 
TRLLDRGESASQSAGMTGM 


13094 


26995 


A 


13234 


692 


824 


QIMaLPFPSPPVPPLVM*CKGGKIPPPP 
SKNPRTCVGNTVIIKA 


13095 


26996 


A 


13235 


657 




prgprldviiyacepqlitclelgtplkv 
rfwlevvdht™pstlgdqggrta*gqe 

FKTSVGiSrSKTLSLKKK 


13096 


26997 


A 


13236 


168 


365 


QESLCILQTCTCaHQNLK*KFGGKSRLG 
AVAHACNPSTLGGQGaRII*GQEFETSM 
VNMVKPYLY 


13097 


26998 


A 


13237 


31 




W YI VYI CVCVCVCI YI YE lYMI YI *YI* 
YIYIKHTVQ*H*HLAFSTFTLLCNCHHC 
PSLELFHYPTLTYHTH* I ITPHCSSC* Y 
QPLFYFHLYVFDYSRYIjV 


13098 


26999 


A 


13238 


714 


1001 


SSGVSCLNRAGWITGVNHHIHSLVLLKR 
EK*TQNIITEVCLMFLLLPHLAIEATVT 
WPGMVAHACNPSTLGSQGRRSV*AREFE 
FSLDNIARPCL 


13099 


27000 


A 


13239 


250 


406 


KLA*HGGVHMWSSASQVAGITGTRCHAQ 
LI FI FLVKMGFHHVGQDGLNLLTL 


13100 


27001 


A 


13240 


256 


193 


WINFISLPLSSLT*TLSLCVCACVYVCV 
CVCLRPCVHIVGGNLKRYIVFARSVTLG 
CKFTHLHIS L I INNDT 


13101 


27002 


A 


13241 


1 


257 


GMDLWQMCHSLSLPLIFFFFFFSEKTRS 
NFVPQAGFQLRALRDPPA*ASREAGITG 
LTHHVRPGQI FKKI GTLKPTI LLPLLPR 
K 


13102 


27003 


A 


13242 


319 


406 


KRGWT*WLMPVIPTLWEAEAGGSPDWS 
L 


13103 


27004 


A 


13243 


228 


56 


KGAPPVLKPGYPKAAKNPTPFKPPGSKV 
GENPPLF*NPCPGGGNPFPTFFFFFFFF 
F 


13104 


27005 


A 


13244 


1687 


612 


ILGNQCCKFDAYHALANESTMI S IKLNE 
TSSKGTGSTATFHV 


13105 


27006 


A 


13245 


220 


1 


TKDPSSTMPPQPNPLLSFKSQFIFLIFL 
FIKKYVGWMQWLLPVIPAL*EAEAGGSL 
QPRSSRPAWATWENPIF 


13106 


27007 


A 


13246 


204 


375 


ASWPPGLQYELHI IKCLLHCWTRAMIFR 
ERERRERERERERERERERE *VHLKRKG 
A 


13107 


27008 


A 


13247 


158 


2 


IWVFKETFFFQNFFFFFGKEHNFNGFFF 
FFFFFFFFF*DRVSLTLHPSLGYR 


13108 


27009 


A 


13248 


221 


3 


NPSQGFPLLKIFFFFLRWSLALSPRLE* 
NS I *KN* KI *KI SWAWWHTP WPGAWEA 
EPGESLEPGRQRLQNE 


13109 


27010 


A 


13249 


81 


341 


GELiqDTIHVKYLG*YIKVL*RNRT]!IRVC 
VCVCVCVCVCVYACTQKDLF *GTGS CNC 
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nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
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od 


SEQID 
NO: in 
USSN 
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Predicted 
beginning 
nucleotide 

lorr!s7ond 

peptide 
sequence 


Predict- 
ed end 

otide 
location 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ]Vl=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GALVEPKSARQASRLKTQKTWI RSLKA 
VC 


13110 


27011 


A 


13250 


299 


1 


ATARLRFLHGAWWYASWPPTQEAEARG 
FLDPRSSRME*JiMILPRCTPASVTDKIP 
SLASQIHHPSRVACVILLSSHAPTEAPS 
RWPPPSPLCPAGSRA 


13111 


27012 


A 


13251 


106 


146 


LCVLKQVKQATWTKYS SSQ I FAI LDCMT 

TLSSHPALKYCNYVLISNQSPNHT,RT,*C 
SGM I SAHCNLC LPG * SNS PASAS RVAVC 
FKASQASYLD 


13112 


27013 


A 


13252 


217 


158 


LFEIFCIPFFFFFFNFFFETRSIISVSQA 
GVQWCKQGSLQP*IPINFFWGKEGVIDK 
LILISYGNAQGFKVAVTPVCTTALQPGR 
QSETLSQKRN 


13113 


27014 


A 


13253 


150 


56 


TILILLLLYETESQSVAQDGVQWCDLGS 
IQSPPRSSCHSPA*AGV]SIRLRQENGMSF 
EAEIVLSPDRTTALHPGLQIETLSHIII 
IILILSTISEHQLLH 


13114 


27015 


A 


13254 


174 


145 


PCLKMTTYDFKSALEIAFFCFTAILRYK 
LLIKVTHFKWLRVAAHDCNPNTLGSGGG 
RITWGHEFKTSIjANMA*PTTRSYSQRVG 
IMGITIPDEVWSRTQPSHIRSGPSKVYP 


13115 


27016 


A 


13255 


179 


273 


GQWLTPVISAL*EVEVGGLLETRSLKQA 
WAT 


13116 


27017 


A 


13256 


264 


382 


GLKIGKGCSWWFrPVIPTL*EVKARGLL 


13117 


27018 


A 


13257 


1199 


1563 


YLWGPAVPPLDLEtGQAGATCGPVSHY 
KIFAKLWAVGS ILDLMPGPE*GS FLSVF 
TSLCFIYGVTYPFHSEQPHPASPSASGL 
LLLKGHAAWLVSTLCIGSPAPMLTGIFP 
HKVYNQLFI 


13118 


27019 


A 


13258 


194 


242 


ILYVETGSHYVAQAGLEIiLGSGNPPAST 
SEIAGIEA*ADEFIYVGVSWQDECIKSM 
KQVAHASIPAISEVEAGGLPEPRSSRPA 


13119 


27020 


A 


13259 


322 


120 


GPTPVLDAFQKLCLS FTSGQQADGI IPA 
FP IRKQGGTQWPKPVI PTLWEAKAGRFL 
*PRSFRPAWAT 


13120 


27021 


A 


13260 


237 




TSKKISLGWWGKQGIPGTWEG*AKRPFE 
PGKPRVQWTQVPALDFSLGGKAELCLKK 
KKKKQKQKTLKLCTHSRITYSRA 


13121 


27022 


A 


13261 


52 


318 


SAVGIHRCDDGSH*P*TPEHKQLSFLSL 
PSSWDYRGITRELFQRFPWIFLQLITAV 
ISSESTVLKNLELAAVRGSHVRVIMMAV 
PINPF 


13122 


27023 


A 


13262 


274 


1 


KKKKNSPVWWWTPVIPGSPG*AGELLEP 
WRQKVQLAQWPLAKWPQTKWQGCPL 
ALQPGEHQGIFVSQKKKNPKTKEKYWAL 
FCSLPSC 


13123 


27024 


A 


13263 


717 


877 


NSTKEMAHWPGWAHTCNPSTFGG* GGW 
IT*AHEFETSLPNMVKPHLYKKYKK 


13124 


27025 


A 


13264 


288 


131 


SLYIWHSKRLITITNNKISGVWWCLPW 
S STWEVEAGGSLEPRR* RPAWAT K 


13125 


27026 


A 


13265 


80 


287 


FMNGEAS*KTSILSQAWWCAPIVPAAQ* 
NEAGELLEPRSEWLVWATRALRISSRGL 
RFRFRLRRFTSTR 


13126 


27027 




13266 


3 


241 


VGLFLFFFETESCSVTQAGVQWCGLS*L 
*PPPPGIRDSPASASQVAGTTGTHHHTW 
LIFLYF**SSGFHYVGLGHSSNS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

pjtidr^ 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13127 


27028 


A 


13267 


103 


359 


ICVYVYVCIYVYEMRATSKDFLNQVWFS 
FLRTVAVLFEEYLIVL*MWLSFHIHIYI 
HTHIHIFIYVYTHVLIYVYIHTYSYMCI 
HIYI 


13128 


27029 


A 


13268 


172 


12 


SARSSQAIQ IQKS VTSGQVKPQW* QS * P 
TPFLPTLIALTTLLLPISPFILIIL 


13129 


27030 


A 


13269 


2 


203 


RIENIRSYKNLAALFMTAKKMGTIQMPH 
NFBWINKMWYIHTLEH*SDKKKKKKKVH 
RAEIPKLIIQK 


13130 


27031 


A 


13270 


236 


365 


KIYRQGAVAH¥aJPlrt)LGGQDGWIT*GQ 
KFETRIANMVKPHLY 


13131 


27032 


A 


13271 


371 


15 


QNQINFALILIINTLLaLLLIIITF*LP 
QLNGYIEKSTPYECGFDPISPARVPFSI 
KFFLVAITFLLFDLEIALLLPLP*ALQT 
TNLPLIVMSSLLLIIILALSLAYE*LQK 
GLD*AE 


13132 


27033 


A 


13272 


2 


16 


PRVRTSSRSRAALELIFFFFLGPPNLPV 
YNQPLGRTKPGTGELDTGGSPILCVQQG 
RHPYWKGGAKPLAPFGKGGGP*GTRPIiA 
GPIVPPKAGLQSEMPGMaPFCGSFG*?! 
RPD 


13133 


27034 


A 


13273 


329 


290 


SSALVPRLECKGIFSAHCMlCLMGSgNS 
PTSASRVAGITGVAGQATDKTPQTLS *R 
SQPSHMDPVRWSP 


13134 


27035 


A 


13274 


175 


13 


APCDHRPCPPENNPL*L*FS ITFPNP I K 
RPHPYLPLLTLFSDSAHLHPGEIEQR 


13135 


27036 


A 


13275 


355 


861 


PLTTTPAAPRAPCPPSRLSGQPLTGPTE 
GSRSRLSPNISEQGEPPLALTVGHPLST 
QPGPTVPSELEPIQGPRG*GDCPTPSQS 
A*GGVLSCTPESHTEFKPPPTGGGRRWA 
RLGLNGAT*GREEPLQTRLPAEYPGPGP 
IDPLQPPPISTASMATAFSDFLLLGRDP 
A 


13136 


27037 


A 


13276 


865 


667 


KFLCILLDFLFFIF*EMRSQHRLECSCA 

IIAHCSLKLLRSTDPPTSAS* lAGFTGM 

CH'CAQLFLTF 


13137 


27038 


A 


13277 


103 


375 


WSRRTiPWRRGTiGYIELFQGLEIRHHFLF 
GPHYLRRTQCQGPVIPSELDGQGWDYMS 
PGV*DQPGQHDETPSLQKI *KTSSAWWH 
APWPAT 


13138 


27039 


A 


13278 


177 


3 


QQTfiGSRAHSNSHRRPGAWHASNPNTL ' 
GGRGG* N I * SQKFKTSLVNMVKP CL YGR 


13139 


27040 


A 


13279 


379 


372 


SR*WVCMVAHaCDPSILGGQGGRIT*AQ 
EFETSLRNMVRPCLCLGNTNIYIYIHTH 

TYIFIQN 


13140 


27041 


A 


13280 


83 


1184 


PPAHAARASPPSYTWLCYEVKIKRGRSN 
LLWDTGVFRGPVL PKRQSNHRQEVS SWE 
CRKIIISKMSGGGLSTVYFFHRRFQITWF 
VSWNPCLPCWKVTKFLAEHPNVTLTI S 
AARLYYYRDRDWRWVLLRLHKAGARVKI 
MDYEGERCRGQGSMTGRNSLRDGWICNA 
II!JYASLHRTLKEILR*GSPSGLIVSLLS 
PPAHPPEDS PGNESWLCFTMEVTKHHSA 
VFRKRGVFRWQVAPKS YLHPK* ELS SWE 
RRKIINT*HTNYEVTWYTSWSPCPECAGE 
VAEFLARHSHVNLTIFTARLCYFWDTDY 
QEGLCSLSQEGASVKIMGYKDFVSCWKN 
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SEQED 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Hlstidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FVYSDDEP FKPWKGLQTNFRLLKRRLRE 
ILQ 


13141 


27042 


A 


13282 


1558 


1824 


SFPYLASFPFCLGLPLETLFHPLGL* KD 
♦SDTHTHTHnrrHTHSHTHAHFPSFPDP 
LFQSSPFSSGFIDEYKYPHLWPVMSVTC 
CRFCV 


13142 


27043 


A 


13283 


1 


519 


PHFFLPQGFWGFFSPFPL*KSSSPLKaL 

IFLGGFSPFFPPPKKRFFSKTURGVFFP 


13143 


27044 


A 


13284 


349 


159 


CPLEPKKNYPFCFFF*ETGSCYVA*AGL 
VLVASSNPPASASQ lAGIQGTSRHfiQPC 
RIFLSKT 


13144 


27045 


A 


13285 


748 


869 


WLGWAHACNPSSLGGRGGQIV*PQEFE 
TSRGNMARPQSL 


13145 


27046 


A 


13286 


315 


410 


VQWFTPVNSALWESEAGGLLEIRS *RTT 
WAT 


13146 


27047 


A 


13287 


355 


499 


ARCGGMHLYSQIFGRLRWEHCIJl*KVLS 
QVWWHAPVFAr)IWEVEVGALI.EARSLRL 
* CftMLLPVNSHQPPTWAMQ 


13147 


27048 


A 


13288 


880 


701 


KSYFSSfitLHFEGKKCVSSILD*SILVLM 
CSWLMNYTHTHTYI YI YTRTHI CVHVYV 


13148 


27049 




13289 


379 


434 


KRGPPPPKKRGCFFSEKFF*GPPKTPFF 
FFFPKKKKKKKKKKKECKKKKKKGGRSRS 
RLADAWftDAW 


13149 


27050 


A 


13290 


253 


344 


GLFYNLQKIKIFYVKNLFFFFSNS ITEA 
GVQWPNLGSLQPPPSGSNDSPASE*RIV 
V 


13150 


27051 


A 


13291 


561 


745 


AWEPSLVGETNVNSFNQKYINWPGAVAH 
TYWAGTLGGQGGWI T*GQEFETTLANMV 
KPSPY 


13151 


27052 


A 


13292 


289 


2 


TLPQGEDFNKFVFGSIKQKLQINL*NTD 
FIGN\'FQSWAQQYTPWPPSWVAEVEGS 
LAARSSRPLCTIITPIiNSHCSLAWATQQ 
DPAGRVGRPRV 


13152 


27053 


A 


13293 


281 


229 


LKNKNVNKEKLRQGWJiHACNPSILGGR 

GGWIT*GEM 


13153 


27054 


A 


13294 


276 


441 


GLFPKFL I QKINQNWPDAWKAYNPSTL 
GVQGRW 1 T * GQ * FKTS LAKRGRKS ENK 


13154 


27055 


A 


13295 


258 


389 


NMVEKRLGQflHAYNLSTLRGQGERIT*A 
WEFETSLGNWRPCI 


13155 


27056 


A 


13296 


358 


353 


SFLAFFNGAFLPLRGFFWKTFFFGGVCW 
RRPPL*RKKKKKKKKKKKKKKKKKKKKE 
MFKRNIGGEERGGGG 


13156 


27057 


A 


13297 


196 


1 


LMGKAPFLGGFIPPAFFFFLKKKKGGPG 
AVAYTCNLSTLGGRGRWIT*GQEFETSL 
ANIAKPCSC 


13157 


27058 


A 


13298 


182 


436 


GVTILNVRHRHRKKVTLYRDLKKVHKCP 
KLHGNLRKVFQAEGIASSKTLKGHVWWL 
MPVIPAL*EAKMEGLLEARSLRSAWATQ 


13158 


27059 


A 


13299 


334 


440 


RHYWLNF * AFYHS SLAPAPQGGGHWPPP 
G I T PLNS LHVPLLNT SA*LASGV* LT * A 
HHRLVPNNRALI IQALRITRILGLYSTL 
LPA* KNFEAPFTI SDGVyGSAFF*ATGC 
HGLNVIIGSTFLTICFIRQLIFHCTSKQ 
*LWLEPPAWDW 


13159 


27060 


A 


13300 


359 


372 


KKKKICGGKKNPPNKKKVKPRGEKTPLK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

residue of 

peptide 

sequence 


Predict- 

otide 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Tlireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \r^possible 
nucleotide insertion 














GVPFFCLEGFFSPHKFFFFFIFSPNIC 
SPKKSFFLKFPPALFFFNLPFLKNIPSP 
PFLKKASF*EIFFCGPPPFFFFFQNTLF 
FFPPFFFFFFFFFFFFFFLDKFAVY 


13160 


27061 


A 


13301 


233 


406 


VTVEGMEEVLAGSSEGMGSRPTS*V*FL 
KYLLGAVAHAHKPSTLGGRGGRIT*AQE 
LKTSLCNKVRPRLYKINK 


13161 


27062 


A 


13302 


92 


429 


GSHFNSELIEMKFYIIKRNIFIILNCFY 
LIKHQHFHYTKFLLVSVINKF*LATVAH 
AYNtNTLGGQGGRSA*THEFKTSLGNTV 
RSCLYKNLKMSWWWCTAIWEAEVRGLP 


13162 


27063 


A 


13303 


302 


140 


GGAVNSVQSQAWWHMPWPVPWKAETGL 
LKPRSLSL*CPMIVHVNSHCTPAWAT 


13163 


27064 


A 


13304 


115 


312 


LRSPSFVLSPRLKCGGTIMVHCSFDLLG 
FF*KWWLFF 


13164 


27065 


A 


13305 


90 


246 


PGWAHACNPSTLGGRGGRIT*GQEFKS 
SLCKIAPPPERKEGRKEGGKKKERG 


13165 


27066 


A 


13306 


19 


310 


DSSFSPIMCFGMSVNKTGLWLGKKKKKK 
KKKKGGPP*KKPLGGP1SILPGGGKKKFFP 
* RGAKKNPPGDFWKKTLFLGGEKLGPPP 
PQK*NPFFGGGKIGPTPPPKIKPFGEKK 
KF 


13166 


27067 


A 


13307 


179 


56 


WVLVYKKQSILG*AQWVMSVISALWEAE 
AGGSLVSTSLRLA 


13167 


27068 


A 


13308 


425 


290 


KKNIYLAPPGYFWPPQRFFLSPPPPPNV 
VIFFFFLFFFFFLFFFFFFLIQYAEGIG 
VMRGRGEGKMGR*W*KFFGRRVIJSriRSI 
LLANFKHWPYW 


13168 


27069 


A 


13309 


214 


371 


I Y F * * RWI S QAWWLMP VI PKLWEAEAGG 
♦LEPMSLPGQYNETSPL 


13169 


27070 


A 


13310 


975 


572 


AWGMWGVGKHSLVSLGIEECQASTALS 
LDKSGWWEEAARREDVLCEDRRCLLCHV 
PAGVHGSLKPELGSRKGKNGQQSGSKPS 
VPSLGPRPQGPGNPALSKGTRPN*AICF 
LCQTPADHSAKKQAPHTLIPIR 


13170 


27071 


A 


13311 


270 


10 


TPKKSLI *RLQFSNIKLflNLGLGTMAHS 
CNLNTLGGQGARITSGEBFICrGLGNIDP 

I P I E S KVI KTLAGMVS PDAWVDAI KRTG 
TSG 


13171 


27072 


A 


13312 


103 


290 


LTE WSGVI L *NCI YLLPQYTSTRVHAY 
IHTYIHTHTHTSGLSSTSVGSTNHKSKI 
FGGKKG 


13172 


27073 


A 


13313 


136 


n 


SNTSSSLVTQAGGQWCELGSL*PLPPKF 
KRFSCLSLPSSWDYRRL 


13173 


27074 


A 


13314 


88 


183 


RVSTLLKKSCFFFFFFFFFFFFFFFFFF 
FFFFSSQKKKNLKNKKRE*KKSCFFFFF 
FFFFFFFFFFFFFFFFF 


13174 


27075 


A 


13315 


481 


517 


LLSP*RPKGKPKGKKRKPNLKEKKIGEA 
RGGKERQRKTPQRTGKQKKRRKYRRIGR 
RNYR 


13175 


27076 


A 


13316 


307 


196 


GMHLYLPRTW* WVRVI PGS RE I E VHAPL 
EPGSQRLLVAEMTPMHSSLDHMNKSPFP 
QPPKTEKQTKWPPKPHI lA 


13176 


27077 


A 


13317 


2 


209 


PARALDLKGSPYMESCSDAQAGVQDSIY 
GDHL* LRAPAV* *GQTVFVASPSKVGGI 
TGASHHPGLFFLF 


13177 


27078 


A 


13318 


3 


178 


SFCFSVLREIIGQSLIMKTLP*ITHTHT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

sequence 


Predict- 

loSion 
correspon 

last^amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl£nown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HTHTHTHTHTQREIERERDRERGESQVL 
GC 


13178 


27079 


A 


13319 


40 


382 


DSVPSLLLKLLiyiETGPHHWQASTTTPS 
* FFILEGFHHVftHRGLKLPSSSRPPHLA 
SQSAGIYGCEPLRSARLIINYIGSRLWG 
LRNICLSQDHEECLLCFILQAISMYEFC 
LG 


13179 






13320 


107 




PPL IMflWSGWAQWLTPWPTLWEAKAGG 


13180 


27081 


A 


13321 


111 


382 


ASSQLLELAIQVFPGIWIPPTTKQKGVP 
SGKTLVTSKI lAGLKMPKDSRSRPRTVA 
HTYNSSTLRGQDRRIT*AQEFKTSLGKI 
GRPCLF 


13181 


27082 


A 


13322 


49 


108 


CGLY* FFF*ATRSWHVaQAGLEFLGLSS 
FPASASSVAGTTGMCHHTQIHY*F*V 


13182 


27083 


A 


13323 


1 


257 . 


ARGERERERERERERERERERERERERE 














RERARERE*ECV*EGETVSDRERDFEEA 
PPRVLGAVAIYIEWOiaSLIQ 


13183 


27084 


A 


13324 


1 


248 


EFQYILCSALIFGRSKILK*ATKEVKKS 
KHIPRISCQDTRKTOWSflKAKRRNTTGT 
GRMRHLKIVYRRFRHGLHEENLFLKH 


13184 


27085 


A 


13326 


375 


1 


PPICRRGKTHCSPPKVFPPPQIIN*TTPPP 
QLI ICYL*RGGVFFSLPLII *APPAFFF 
FFQFFFFFFFFFFFFFFLQGQYWQFilDT 
DAEGHLASSRPSGAVFPSQGMHTTGSTR 
RWGQCVPFPPTRP 


13185 


27086 


A 


13327 


382 


15 


NWPKUCPLHFQNWFLRICrPQIFFCLKKF 
FFFFPKKVFYPPKKPFLEKPWPPVLN* K 
KPPPPFFFGWGPGGPQYCPFKKAPPLFI 
RGKERFPLWGKGDFFQIPCPPGPLKKKK 
RAAARDLGTS 


13186 


27087 


A 


13328 


2 


220 


GRVGSSRftRAVALFFFFFFFFCFFFFFF 
QNPPLKRGEKKKNTPPLKKKRPLRGGFK 
KQKEC*EKKKKLSGPK 


13187 


27088 


A 


13329 


427 


289 


IQLWSSL*K*LHTHTHTHTHTHTNTCFL 
TVMKELSTHPGNK 


13188 


27089 


A 


13330 


203 


269 


INFGPPREPGGFNQKPQFPSAPGFEPWN 
PPQGP KP * KKKKRKKPFKVWAPGGGS KV 
QNPGLRETGVFG 


13189 


27090 


A 


13331 


216 


418 


DVPVHYCRLYKPCTLRLHS I YKKQI LLW 
PGAVAHACNPTTLGGRGLWIT*GQEFEA 
SLANWKPHLY 


13190 


27091 


A 


13332 


160 


325 


WERQLFKIAQSGLARWLTPVIPALWEAE 
EGGLFEYTSLR*LWATQQDPISTKMFK 


13191 


27092 


A 


13333 


339 


62 


GML PLFVPPQKRG S PP YPCYGVYNS PPL 
KKQRFFSSLGIVLPPIVFITPPPPAFFF 
FFFCFFFFFFFFFFFFFLI *RRLMRRRM 
FSCFLHCP 


13192 


27093 


A 


13334 


45 


384 


DPSVRINTLLALLLIRITF*LPQLNGYI 
EKSTPYECEFDPISPARVPFSIKFFLGA 
ITFLLFDLEIALLLPLP*aLQTTNLPLI 
GMflSLLLII ILaLSLAYE*LQKGLH*AE 


13193 


27094 


A 


13335 


270 


660 


AGSRRPLRVPGFSLSMMSPSLCRPVCVA 
QCIRVSFLLTVGAVLRPGFQCLDWPCLV 
YARLSGWTGFPPCRYGKGCCRYDEGCCR 
FGEGCCRCDDRCCRCGEGCCRCDDGCCR 
YDEG*CRCDDGCCHYGE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 


Predict- 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GlLtamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptoplian, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possibIe 
nucleotide insertion 


13194 


27095 


A 


13336 


298 


349 


KTTPFYGLRGSKIGTL*CSGVI lAH* SL 
KLLGSSDPLSLVAQAARTTGACHHDQLT 
NAFFFFPLERVPTPLKNNPFLWVKGVKN 
WDLD 


13195 


27095 


A 


13337 


865 


518 


KSATMRMVEDTHKMPAWGGYSGSLQQER 
GELSSLPCP*PESPPPLLIFLKLPSNPL 
PCSPLPCTQGPGQPAPyVSIHTSLKFPN 
AKLSVCLSRAIEFS 


13196 


27097 


A 


13338 


154 


354 


FNLQVKPDGYFVLPICHILPTDSKSGKA 
PGAPCHSRNFHIRKTLGWVQWLTPVILA 
LWET*FSHQENSGLGAVMTCNPSTLGN 
QGGWIA 


13197 


27098 


A 


13339 


262 


11 


QFHFSRKLTP * KM* LSAHS SNTHEAYVA 
FPKRHKPGLNSKTAF*LGTEAHACNPNI 
LGGQGGQITRGQAFKTSIiftMMAKPRliy 


13198 


27099 


A 


13340 


358 


164 


TKKGKPRFFLKNSKISRGWWWAPVISGP 
WEGEGGAFV*TGRPKIWLTEVGPLPFNL 
GKKKENPF 


13199 


27100 


A 


13341 


147 




FRPQKVSVRVGVLftHACNPTILGAQGGH 
1 1 * GQE F*ASMA]!IMVKPSS C 


13200 


27101 


A 


13342 


367 


17 


TSRSWNLFHVLVRFPTADKDIREIGSFT 
KKRSLIDLTIPYAWASFTLMMEGKKEQV 
TSS INGSRQKVRSQSGELLSLTPSDFSW 
AIIWLMPVFPALW*SETGSLFEVRPSRPA 
WPTW 


13201 


27102 


A 


13343 


166 


492 


EGTQETLCGCIICLVRGDALNLFHLKCS 
WVGWRGAICCMELRETAKQKLSVWKLYP 
FEIVFSFSHNlSrPRPGAVfiHiasrPNTLGG 
♦GGRIAEAQEFETNLOIIVRPHLFR 


13202 


27103 


A 


13344 


804 


1124 


TFFFCFFL*DRVWDVAPGWESSDMIMGP 
LQALTSWGSTDPPTLGLQSGLGDLQDTW 
PPYPTSFYYFFLQGRGLTMLPRLVLNSC 
TQAILPPQPGQRSKTPISLKTN 


13203 


27104 


A 


13345 


280 


448 


GGGGKKKKTPKPPtiEKNNFSPPP+FtPP 
KKQIKPPPPF*GGGGKKKKTPKPPLEKN 
NFSPPPFKPRKGIFFLFPPLSWVKKKGD 
PPGGTRPIiAPL 


13204 


27105 


A 


13346 


232 


400 


AELLRYCSCGHFL * SMARYKTKPGVRWL 
TPVIPVFWEAEAR*LHEPRSSRPASATQ 


13205 


27106 


A 


13347 


138 


332 


RAWKPSFEIFYFFF*RAHTF**SCVCL 
CVCVCVCVCV* THTHTHTHTHKHTHD Y 


13206 


27107 


A 


13348 


314 


395 


WLGIVAHAYNPSTLGT*GaWIT*GPEV 


13207 


27108 


A 


13349 


388 


371 


IMNGWQDKELVSRVIQTGIKK*KPRNRP 
EFRAPP*WHERNGRKERERMEGRKERER 
KKGKGKERKRKKEACIYKVKIEISNNLA 
NL I INVER* DRRRPDILSWVSLS SRVS S 
VFIALVTCLTSPQFFLSLHLNQILFPLG 


13208 


27109 


A 


13350 


412 


40 


LVFRFWMCLFSRELFGCFSHLLTKSMSQ 
M*WAFGDIMYTFDLLFIKKTKin\ICKLW 
QGCKEEGSL IHCWRECKLVQPL* RP I *R 
SLKKLQMGLPYDPAISLLETYPKERKSV 
Y*HNICTSGRVG 


13209 


27110 


A 


13351 


372 


148 


FFFFFFFFFFFFFFFFFFATGSCRVA*P 
GVKWL ISGTVPLL I STGVLTCS I SDLGQ 
PVHTSLGNLWPYSQELPY 


13210 


27111 


A 


13352 


209 


206 


KKKKKNFFSFHGKNPGNGGPFGGPPPPP 
F*TFFKKGGGLPKGPPPKGFFWNPPQNG 
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nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
25 


Predicted 

location 
correspond 
ingto first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGGP* PGGEKKTPKPSLGVLFFFFFKFF 
KKRPTAFFFLKKKI FF 


13211 


27112 


A 


13353 


43 


334 


KKKKKKKKKKKKKKKKKRGGGPFIKISG 
GAQFFRGEKKFFFFFLGGGKKTPRGFFE 
KKPFFWGGNFWPPPPQKNFPFGEKKKFL 
GGGGGKNRVFFWGEKIFSLGFFLKKFF* 

KKPEGKNFFPPKKNPVFSPPPP*KFFFF 
PKGEIFLGGGGPKIPPPKKRFFFKKPPG 
GFFSPP*KKKKKFFLPPEKLGPPRNFYK 


13212 


27113 


A 


13354 


264 


496 


KPTILRVTHFLEGCEKyEFLLPVAFSNF 
SEPLKRYYYWLGaEAHVCNPSTLEGRGG 
QIT*GQEFETSLVNMVKICLY 


13213 


27114 


A 


13355 


183 


9 


IRSPKIPEIRVFFFSRWSYALVAQVGVQ 
WGNLS*MHPPEflEFQPLSSLSVPST*KY 
R 


13214 


27115 , 


A 


13356 


208 


378 


E ICLEFKI IDEMDKCGKLWLGTGAHT*N 
PITLGGQGGQTT*GRAFETSLDIMEKRy 
L 


13215 


27116 


A 


13357 


256 


341 


KGQTQWLTPVIPTLWEA*AGGLLESRSS 


13216 


27117 


A 


1335S 


78 


344 


IVGLFEfiESKKdQHIVSGWIYllJiLLIYft 
FLFPLFSCLCKFFFFFFSPETEFCFVPQ 
AGVQ*HNLG*LKPPPPC3IjKQFSCLTLPS 
SWNYR 


13217 


27118 


A 


13359 


259 


291 


AQ*LMPWPALYE2\EJiGGSSLEPRSLLS 
VWAT 


13218 


27119 


A 


13360 


448 


121 


RLFDLGNKKTSSFVQNSENAKYEDSLQF 
YCRIYIYVYIYVYICIHICVYM*TYMCM 
YICIYMYTYTHVYTHVYMCIHTYMCI *A 
YICVYIYIF*KACLPriAISTLSSGH 


13219 


27120 


A 


13361 


125 


262 


FSFQASVEFTSKTVWSMVQWLTPI ISTT 
* EAQAGGSLEARNSRPD 


13220 


27121 


A 


13362 


362 


46 


ARAKGPKKIGFSGKMGPP*GAPPPKMGK 
KI*ITPPPKFLFFFLGKTKIKNPPWGFW 
PFGFPKKKRGGGARR*KTPSPFREKPPP 
QKLKRFKTPFPPLFFFKNPRP 


13221 


27122 


A 


13363 


378 


48 


FKKAAREKVSTRKFRGFVCLFWDFFWRQ 
SHSVTQAGVQ*CDLWLHPPGSSDSPflAA 
SQVAWTTGTHHHTQL I FIF I FCRNKI S P 
SLLKKYKKNLPGWAGACNPSHLGG 


13222 


27123 


A 


13364 


174 


396 


SLIFILEKRECYYPRSLFTVAUFTVAKR 
* QTLNCPSTDKWINKMCHIHTVE YYSD I 
KRNE I PMRATCRQTLKA 


13223 


27124 


A 


13365 


76 


280 


PLL PFICAWQWVGCHYW*KLD * INCYFR 
ALLAFSEYMYIYIYIRTHTHT* lYI YI Y 
lYKLTYSHLKVG 


13224 


27125 


A 


13366 


133 


324 


YLL * I LVYCSHLI F SLLCLEGI I LWVF I 
lATLITUMTHSLLINIVPIAILDLAAGQ 
AAVGLALLVSKKKKKGPPLKKPPLGAQ I 
SPANQRKKFPPKEKPIKTRRGTF 


13225 


27126 


A 


13367 


488 


190 


FSYAACFLPQCVFVCTFIYLHADGFLLL 
IIFFENSVFILCHSCWVFCFLLQWFLLM 
NLFSLFLLMLMHPVS*MECSIYSFSaLI 
IFHFSLMLLFFCFMI 


13226 


27127 


A 


13368 


229 


372 


YYATKDFSRPRAVAHTCNPSTLGGRGRQ 
I T* GRQFETSLfflWVKPQL Y 


13227 


27128 


A 


13369 


122 


2 


KEE CVGSGTWWVM P VI PAL * EAEGRGS L 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 

residue of 
peptide 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutaniic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


13228 


27129 


A 


13370 


43 


412 


RPTRPKRQNYGDSENISGCQGLGKKGMN 
GKSTEDFEGSETTLYDAITVGTCHNK7I 
RSHIVyNTKSGP*HKI,WTWGGHDVSV*V 
PQLRKTYJIPGERCL* *GRLSMyRGREYI 
GNLCIFCSLLL 


13229 


27130 


A 


13371 


136 





KSPTWPGiWIRHTa!ILSTLEGQGGWIT*G 
QGFETSLAKMVKPSDAW 


13230 


27131 


A 


13372 


228 


385 


GLLIHKLBFKHFWLGTVAHTCMPSTLGG 
QGRHIT*GQGFETSLaNMaEPCLY 


13231 


27132 


A 


13373 


277 


429 


LETTTECAGLPTI IFTSSGQMSIWFFQNK 
PWDSKKLFNLLSISSP**SSKNY*PWQS 
TVAHTCNPSTLGGQGRWIT*GQEFEISL 
ANMVKPARVGEHVIRGLQVS 


13232 


27133 


A 


13374 


194 


3 


NQENYIYIYTHTHTYTHTYIYIL*LLLQ 
WVYLAYFLCLSYPSLFAQGFTHTTYTHT 
HMHTLIL 


13233 


27134 


A 


13375 


377 


228 


DRVLLLLPRLECRGI IMAHCRLPRLVSN 
S *APALSFQSAESTGVPNVPS 


13234 


27135 


A 


13376 


255 


465 


NAWKCPFIYKITFVIFLIHVSCKKFRNY 
RQREWKLP TVRPLS PS *AI I FPVTCTYT 
SRWPEATKDPQKK 


13235 


27136 


A 


13377 


336 


38 


VWWCTPWPATQEAEVGGSLESGRLRLQ 
*AVITLVNEHRESALASRRGPEETSSVK 
PPLPTILAHTCFSLPRTGQDITSRFLAQ 
RNTEENLELQMEARA 


13236 


27137 


A 


13378 


376 


293 


YI 


13237 


27138 


A 


13379 


215 


16 


HLTWS FTTATEGS KTVTQHS VYRKAKLG 
LGAVTHTCNPSTLGGLGRWIT*GQEYKG 

IPPHGLEDVQ 


13238 


27139 


A 


13380 


184 


64 


VDESLEGWMMDEWMSGWRGGCINRYMHA 

WMnG*GDGWIGG 


13239 


27140 


A 


13381 


233 


417 


LKPTTKGRKFRGFFT.PFKPKQKKYFWGF 
KKKKKKKKKKKKKKKKKKASRAPF* KKG 
PQKTP 


13240 


27141 


A 


13382 


34 


360 


RWNTTNADHDLKDNILSPPQINLYIRQN 
S S RLFCRH * QMDS KS P VKI PAG FVL Y I * 
I YRYTHTHTHTHTHTDS ELYMETHTRMA 
NTIVRKKNSLEHSYYLMLRMTIKLP 


13241 


27142 


A 


13383 


214 


71 


QTLWTDSGPGWIHACNPSNLGGHGGKI 
I *GQGFETSLANMVKRCLY 


13242 


27143 


A 


13384 


352 


312 


DKQLTmRTDSLYIFYTY*PPQSCEPIS 
YMHFP CLS Q YLYLC P S LHTHTHTHTHTH 
THTHTHTHTHTHVSVGRRSLFFSGPHTE 
APRSRVSVYT 


13243 


27144 


A 


13385 


393 


271 


lEGQIQYTSTIGNKFKDFYLILCKEGIM 
SREISPPSS CHLRQQ *RVRLRERDRERK 
RQRERQRERE* GRS VLHPHVT 


13244 


27145 


A 


13386 


180 


440 


PVEERTLCEDILCFP FS VLLC IQFHLL I 
QHACfKYPNPNSRFGSWPGAVAHICNPS 
TLGGGGRWIT*NQEFEARLSNMVKPRLY 
KNI 


13245 


27146 


A 


13387 


190 


47 


EGEKGVPSTILKMETLLGTVAYPCHPST 
LGGQGGRIAEAQEF *DHLE 


13246 


27147 


A 


13388 


188 


470 


ARPPCKGRDSSAEGPPGPPFPSWSSLGC 
WTREPPGRGEPIQVAVRREESAQDWARP 
ELI IKEWWPGLVAHTCNPSTLGGRGGW I 
A*A*EFENSQ " 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

residue of 


Predict- 
ed end 

correspon 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutainic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=VaIlne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


13247 


27148 


A 


13389 


454 


367 


NKKXPPP3PF*FPP?PLGKTPFFKKKNP 
NFPRGGGNSFGSPSFGGGKGKDPPPGGF 
FPPPPYPRGEGGNPFFKKKKKKKRftSLD 
P*WSSQPCPAYEKELNFAPVLATVTLPP 
GHSS 


T3248 


27149 


A 


13390 


34 


354 


HTLLALLLIIITF*LPQLNGYIEKSTPY 
ECGFDPISPARVPFSIKFFLVAITFLLF 
ELEIALLLPLP*ALQTTNLPLIVMSSLL 
LIIILALSIAYE*LQKGLD*TEY 


13249 


27150 


A 


13391 


1089 


792 


THGEXSPQVGSPLDLKQPQKftMPRGPSLG 
NPPPGCQQHSHGAGGP*VPPGPRQPVM2V 
SRNLCLPLALPRTHPPSPGQTRDKPRTQ 
DRS CWRDPRNGPGV 


13250 


27131 


A 


13392 


299 


428 


WIRCRGRGRSCLPVPHSftPAlLSPLWD 
GTRRREAGGGTCQGGSGCMGAHWGLGGG 
EGSGCRSQALPREAAEAEOiEFKHGGIPR 
H*LTESLVSWEFRHa 


13251 


27152 


A 


13393 


382 


413 


HGKTHLYKKFKNKKFKATMPA*WLTPVI 
PTLQEAEAEASLKPKSSRPAWVTWQDPS 
LQKI 


13252 


27153 


A 


13394 


64 


401 


GGIPKEDSQAGAFTGIGERQQGLVIPLL 
ICNGQLSTDACVPPWGRVGVERAGPPNQP 
AGEGGGQGQECSLGLLPEANRCHPCNPS 
TLGGRSGQIA*GQEFEZSLG1ITVRSCPY 


13253 


27154 




13395 


146 


18 


FSPGWAHACNPSTLQGQGGQIT*GQAF 
KTSLSNTVKPKRPQ 


13254 


27155 


A 


13396 


1056 


679 


DIQVPER?L*NVPEPEAKGEPPDRAVGE 
HIDRDCRSDPAQQKRKIFTNKCERAGCR 
QREMMKLTCERCSRISTFCIKHRHPLDHDC 
SGEGHPTSRAGLAAISRAQAVASTSTVP 
SPSQTMPSCTSPSR 


13255 


27156 


A 


13397 


440 


423 


IHS PPTQRRV*QRERERERERERERERH 
ALAERNRTREGISGTTGERGNLMLVGHH 
LPGPLSSRRLGFCPDGFGGQHCLAQGDQ 
VRL 


13256 


27157 


A 


13398 


2 


256 


IETLGSAVEFIPYENTYC3TYI.KKKNRNS 
HMQSQPNRKGHIRLKYIAAWA*WLTPVI 
PALWEAEAGRSLESRSRNPVSTKKKIQK 


13257 


27158 


A 


13399 


102 


922 


LSFFLFSETGSCSVAQPEVQWCNHSSLQ 
P*TPGGPSMQFQLPQEQGQRTAVWHGGR 
A 


13258 


27159 


A 


13400 


198 


55 


KDYSAVVHTCNPSTLGGGGGWIT*GQEL 
KTSQAHMVKPHLYKNPKIEF 


13259 


27160 


A 


13401 


19 


447 


E S ALNL P SAGI I GVSHRTQ P I FC I LNAL 
ALGVLILERSPLQSPQVPPSHSHTKPGS 
LPSVTPGEGPGRPRftAEPTAQGRRYNSN 
PVLIS PGSVHPAS FALPAEPP *TGAPST 
PRPCVLRPLLLGNAMDLLCTKLSSRLPP 
YRT 


13260 


27161 


A 


13402 


93 


410 


DSNLNYSLFFHGEADLGTNQVLTHPSTT 
AMYFEHYCQPP*IVHGTINT*PPWHKN 
PIHI ITPS PCLRASTAINLLLSHINCYS 
*ATPHPLGYQQTYLPLTVHST 


13261 


27162 


A 


13403 


55 


387 


SNSHTYSLKKSAGITKFQNWHIVTTCICI 
CQWVPLEMTORHTVILCQYPVKPRILY 
QHHTAILVTILTFTLRPGWSHACNPST 
LGGQGGRII*AQEFKISLGNIVRPCLY 


13262 


27163 


A 


13404 


39 


365 


SGDRRVRLLLKI ITF *LSQLNGYIEKS T 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
TISSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartii: Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EYECGFDPISPARVPFSIKFFIiAAITFL 
LFDLEIALLLPL? *MjQTTNLPL IGMS S 
LLLIIILALSLAYE*LQKGLD*AE 


13263 


27164 


A 


13405 


32 


350 


HEFATSTSEMTKGTS*IVKRRIKTHTLC 
RRCGSKAYHLQKSTCGKCGYPAKEKRKY 
NWSATAKRRNTTGTGRHKHLKIVYRRFR 
HGFREGTTPKPKRAAVAASSSS 


13264 


27165 


A 


13406 


326 


668 


LQFKIFCMPG1FTISKVSSIVISN*RTAYS 
SQSFHKIVLI*LATFGQALSC*IL*IHL 
CYLKNKTKHRALVAHACNPNTLGS PRGR 
IA*GQEFETSLGSIVRLLLSTKNLKKKK 
NM 


13265 


27166 


A 


13407 


170 


472 


KLISVWSEESSYCVEHGEPWPQEKEYS 
WLEIGKAEMERSTLESLLGATLEALSSN 
PTRKMQGLKLKLQRASAFKEEPWLGWA 
HaCNPSTLGG*GGRIA 


13266 


27167 


A 


13408 


178 


422 


FYNRYF*yTKTYQTENIYCKIMSFRLLV 
*KI * SLPGAMAHACNPSTLGGRGGRI I * 
GREFETRNS 


13267 


27168 


A 


13409 


274 


444 


LFSQYIVFVHYSSFVLILFV* *LFSQYI 
VFVHYSSFVLILFVILHNYTYIYIHTHT 
HTYTLIFLLIFSQISLGWMKLIL 


13268 


27169 


A 


13410 


205 


23 


KEFLKFHRKCICQQVKKIWPRAVAHACN 
PSTLQGQGKRIT* GQELETS LANVEKPH 
FSNS 


13269 


27170 


A 


13411 


25 


426 


SVWWHSLETRSSRLRLLKKI *]!ILQNKKR 
KFEHRHAQREYDLETQGEnaiYKLRRER 
RKEEE*EKEEKKRRRRKEKEKKKKKPTL 
LCLKKQHNQHEiDLRLPASRTVKKLISVV 
*ATHSWPCYGSPSKLIQQSIH 


13270 


27171 


A 


13412 


227 


52 


EIGSHSCHPGWSTVAQS*LTLTSNS*TQ 
LILPPQPPRELHYNCVPPCPAFCRQALA 
MF 


13271 


27172 


A 


13413 


345 


67 


SKCCYWAHDCILLDLICLYFLELVGCV 
DCFSSSLGHY*PLFV*IFFLSPSLLFWY 
SHYEHICVPHFSKAPFIFLHFFPSPEFP 
FLCIEMVYV 


13272 


27173 


A 


13414 


345 


33 


KVFFFLIIC KMRMQNKQKNKHG I CLLNS 
AMNTCIS FPLRQQI MTTWWLEVTQIWGV 
LFYLFHFIEMGSCPGWSAVA*SQPSATP 
VSWSQVILLPQPPEDLGNS 


13273 


27174 


A 


13415 


187 


415 


FNQGWAFLFPFCVI FS YPWKDRNRTIPS 
IKITS FQESVLKLFCIFEVS * ILKIGLG 
QVRWLTPVI PALRRPRQADH 


13274 


27175 


A 


13416 


143 


50 


CMKLQSS I VMLGWAHACNPSTLGGQGG 
WIT*GQGGWIT 


13275 


27175 


A 


13417 


170 


17 


KQMREKEGNGLLTISKVWKQPKSPSQS* 
*IKKM*YICTMECYSaLRPEFPE 


13276 


27177 


A 


13418 


7 


163 


IK* IWYI *TMEYYSAMKRKKIMSSAAIW 
MKLEAI I LGBVMQB WKTKNLMFS F I S GS 


13277 


27178 


A 


13419 


901 


709 


SGRKTSLGNIARSCLYTKSFKIHCLPW 
PATQEAEAGGS LKPRSLRLQ * ARSHHCS 
PVWVTTRP 


13278 


27179 


A 


13420 


236 


4 


VIRAKTGSHNSLFP PHTFFFQGKNLCFF 
QKKKKKKIYIQSKPGAMflHACNPSTLGG 
QIT *DQEFETSLATMVKPNLY 


13279 


27180 


A 


13421 


46 


141 


MS VISAHCNLCL PGS S DS PS S AS RVAG I 
TGWC*HCNLCLPGSSDSPSSASRVAGIT 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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SEQID 
NO: in 


Predicted 
beainnin? 


Predict- 
ed end 


Amino acid sequence (A=Aliinine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methjonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovi'n, *=Stop codon, /=possible 
nucleotide deletion, \-possiblc 
nucleotide insertion 


USSN 

09/515,1 

26 






od 


location 
correspond 
ing to first 
amino acid 

sequence 


nucle- 
otide 




correspon 
ding to 

residue of 

peptide 

sequence 














GWC 


13280 


27181 


A 


13422 


604 


1121 


LGSGDLP*EINPLSSCSLLREKDPPTTS 
GPQTDQPKKHLTNFKSAPYKTITDAELR 

HTDAHEIWCRDSDLKLVPTLPLIPLEAA 
LRNITHS LS I PPPKNFRRPNTS TLFCVI 
FLINIRRQEYQASEPIO'SHRIPCDLHVY 
AQMA 


13281 


27182 


A 


13423 


214 


22 


IRESSSTPSSPTLPESCNSIPFPF*KFI 
WSGWAHAYNLNTIiGG*GRKIARAQEFE ' 
NSTRAGLW 


13282 


27183 


A 


13424 


397 


330 


SEYNSECS*VQWHTLVVSATQEAEAEGS 
LEARSSRLLCALIIPVNCHCTPAWAFPM 


13283 


27184 


A 


13425 


729 


885 


AKCYPVRNISLWPGAVjyEACNPSTLGGQ 
GGRIT*GQEFRTSLANMVKPHLY 


13284 


27185 


A 


13426 


231 


2 


ASVFFKATI.VDLHW*IYIYTYIYTYIHI 
YTHTYTHIYIYIHTYIYLNVNQYICLYV 
HINAWQQGyiYTGRTPICFF 


13285 


27186 


A 


13427 


184 


171 


LS*YIYIYTHTHTHTHTHTYILYIYTYE 
QPSSQSS 


13286 


27187 


A 


13428 


372 


166 


CKFRPRKINTTFSHICGS*KVDIlEflA* 
HWLGTVAHACNPSTLGGQGRWT I * VWEF 
KASLANLLTPLLQ 


13287 


27188 


A 


13429 


391 


437 


MYSLGVGFFHSTI * LLKDIFWSGWBHT 
CNPSTL*GQDQRIT*QRKLETRPGNRAK 
LYLYKKNSN 


13288 


27189 


A 


13430 


39 


254 


EFIPHAQDLETSQGNRVRPRLYRKFKNK 
* VWWCAS WPAMWEAEAGGLS B PGRLRL 
HSNMDNGERSCLKKK 


13289 


27190 




13431 


336 


488 


DIWPGMVAHACNPS TLGGQGGR I A* A* E 
FETSLSHMARPWLKKKLAEHSGA 


13290 


27191 


A 


13432 


154 


481 


FFFFFFFQRQKNGFF*KGIFFFFKLEGR 
GAI FV* PNLTLPQ* GKS PAS P L * KGGKK 
GGAPPP*LIFFFLKKKKLPLLGQGGLKL 
RALGKPPPFPSQRGGIWGVKTPFGL 


13291 


27192 


A 


13433 


388 


280 


KSKLKNGCCVALCLLGLI * S VLLA* FKT 
*LFCVSLDMYIDTTCSLS IS lYLS I YLS 
IYLSIYLSIV*SVS 


13292 


27193 


A 


13434 


187 


48 


SISSTKGPGAVAHAYNPNTWGGGGGRIT 
*GQEFETSLANMVKPRNS 


13293 


27194 


A 


13435 


71 


309 


DSVSEEEEEEEVEIiAHPLAAERYHCERA 
ET*VKALLWLELCDERVSSRSHTEEDCT 
EELFDFLHARDHCVDHKLFSNLK 


13294 


27195 


A 


13436 


18 


375 


RPAVPGRPTRPINTLLGLLLIIITF*LP 
QLSGYIEKSTPYECGFDPISPAEVPFSI 
KFCLLAMTFLLFDLEIALLLPLP*ALQT 
TNLPLIVMASLLLIIILALSLAYE*LQK 
GLD*AE 


13295 


27196 


A 


13437 


373 


27 


THPYYSHQE YQS P * PLTGALSALLMTSG 
LAM*FHFHS ITLLILGLLTNTLTIYQ*W 
RDVTRESTYQGHHTPPVQKGLRYGIILF 
ITSEVFFFAGFF*AFDHSSLAPTPQLGG 
HWPPTGITPUSrPLEVPLLHTSVLLASGV 
SIT*AHDP 


13296 


27197 


A 


13438 


575 


699 


LGTVAHT CNSSS LGGRGGW IT*GQEFET 
SLANMVKTCPQKK 


13297 


27198 


A 


13439 


247 


168 


LEMLIYTRVLERHREAKVHFPFSNISYS 
DKRTDTFVLTKTH*HTHTHTHTHTHTHT 
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peptide 
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Amino acid sequence (A=Alanine 
C=C ysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysinc, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=StDp codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














HTRGSHFFY*HTHTHTHTHTHTHTHTRG 
SHFFY 


13298 


27199 


A 


13440 


391 


168 


TKKKKHPSPRGAREVFF*QPPPPPFCVL 
* FF I VFFLS QGGGGGGGEFPP FF FP PP T 
SPPSPPPRGVWKKTPPFS 


13299 


27200 


A 


13441 


403 


246 


LFPTaiIGGPPGFPPPPF*WPAPEFFFW 
GPKKKKKLSPPPAIKMVFFKGPPP 


13300 


27201 


A 


13442 


305 


^14 


KNNFHPS ILGG*C3GWIT*GQALETTSLG 
NMVKPHIY 


13301 


27202 


A 


13443 


269 


409 


PG1TOUIAY1ILSTLEG*GTMITQGQEF*I 
SLVNTVIPHLYQKKKKKGR 


13302 


27203 


A 


13444 


295 


37 


DPSYLAWHMVTPHQVSTQHTLPRPEPCL 
PEMNFLVSLKIWQCWPGTVAHAC3SIPSTL 
RGRGGWIT*GQEFETS]:iANMTKTRTRGS 
AR 


13303 


27204 


A 


13445 


256 


388 


FCGFWPGAEAHACCPCTLGGRGGWIT*G 
QKFRTSLAHMVKPRVC 


13304 


27205 


A 


13446 


277 


374 


SRLTATSSRGGWIT*GQEFVTTLMJMVK 
PLLY 


13305 


27206 


A 


13447 


14 


387 


PQVIHSPWPPKVLGLQA*TTSQAWWLTP 
VIPALWEAKANGSLEVRSS 


13306 


27207 


A 


13448 


185 


379 


HFGRPRPADHLRPGATNHPGPQGETPSL 
L*NTKKKKRGRPFKKKKLQSLVWQISrKI F 
FliAELKLLW 


13307 


27208 


A 


13449 


168 


346 


ISILRTNHMISIKINLRLFIDELKKRDP 
FIFPYFFFFFFFFFFGGNHSL* SLIVIF 
Lie 


13308 


27209 


A 


13450 


57 


176 


VAESRPGPGGTATELVPPSTRLLTRAPR 
DLLTGKRKPPPLGRMSRARVQWHNHb'* T 
ESCSVAQARVQQRMLGSL*TLPPKIKQF 
SCLSLLSSWEDCLSQRVRDCSEK 


13309 


27210 


A 


13451 


411 


241 


PPLFFFFFFKQKFPSVP*GGGQGWDFGS 
LQPPPPRVKQIFCPKIPPPWPPKKGGVP 

G 


13310 


27211 


A 


13452 


307 


298 


PTHSIVENNKSL* IN*TSSMCSGLKVMR 
R*SQKQEDP*SGWAHACNLSTLGGRGG 
WIT*GQEFSTFLCPSA 


13311 


27212 


A 


13454 


191 


108 


GWS QT PGLGRS SLLS LPQWWDYRMQCRG 
VILAPHCGLDLLGSGDPPASASQVAGTA 
AFENCATSFWLP* FLNTQCFKI YKTKN 


13312 


27213 


A 


13455 


242 


400 


lATLLDYLQIPNTGPGAVAHACNPSTLG 
GQGRW 1 1 * GQE FENS LANTVKS CL 


13313 


27214 


A 


13456 




346 


YVTTARCSCWAGTESHMGAKDFYRQEKH 
PFHADSGIGWA*DRLRQRYGRFGRRM* F 
SWARGAGEKHSLA*AQWLMPWTTFWEV 
KEGRSLEPRSLRPAWATWRNVSTKIKKK 


13314 


27215 


A 


13457 


166 


289 


VRS S KLNTWVRWLVP 1 1 LTLWEA* AGG F 
LEPRISRLVWATQ 


13315 


27216 


A 


13458 


220 


76 


TASLKLSKSWLDWAHACNPSTLGGPGR 
WIT*GQEIKTSLaNMVKPSP 


13316 


27217 


A 


13459 


219 


46 


SPPLKKKHNFFPPGVMGAPPRFFLKGPP 
QNFFFFFFFFFFFFFFFFFLKQ*EQTFL 
lilY 


13317 


27218 


A 


13460 


126 


I 


WSLGAVARSCSPSTLGVQGG* IT*gQE 
FETSLANMVRTLTS 


13318 


27219 


A 


13461 


330 


420 


NICSWAQWLTPVTPAL*EAEflSGSPEVT 
SS 
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Predict- 
ed end 

location 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonlne,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13319 


27220 


A 


13462 


577 


766 


AQWHTSIVLAfl.*EAAAAGLLEPRS*SRL 
KCTMIIPINSHCTPAWQHMRPCLLKKNF 

SNSFISK 


13320 


27221 


A 


13463 


206 


387 


VTVGETEAEAGRGHGLLTCCVLKTQKWL 
GTVAHAYNPSTLG6*GGRIT*DQEFETS 
FLGS 


13321 


27222 


A 


13464 


167 




DFWLGAVTHACNPNSVGG*GGQIT*GQE 
FETSLANMVKPMNS 


13322 


27223 


A 


13465 


480 


79 


SRVISPLLILNSVISAKSLLPVGSQGKS 
KGQVWSTQEERLEPGVMGEFTPGPCPCL 
VSQGPSLCPSPPPYS KRWSRLSQAPGS I 
RVLI *RPAAATSAPPPAQLGLPDTAFPP 
PLLSRDGQWAAGRQGDEDKGLP 


13323 


27224 


A 


13466 


3 


373 


DAWVAHASAHASVCLIIDTLLALLLIII 

T F * LPQLNGYIEES TPYECGLEP IS PAR 

VPLYIKFFLVAITFLLLDLEP_ALLLPLP 
*ALLTTNLPLIW/SSLLLIIIL7iLSLGY 
3*LRKGLD*AE 


13324 


27225 


A 


13467 


214 




TRQKIFNLFSS*KKFLIPPARAKDFLFF 
IFFFFFFFFFFFFFFFSCSRILUm 


13325 


27226 




13468 


111 


358 


VMKVFYI*IGELVTQAYIFVKSHQIAHI 
LFHSMYVLP* IKMFWPGTVAYTYNPSTL 
GGHSRRIS *AQEFETSLGNIVSPHLY 


13326 


27227 


A 


13469 


199 


607 


RIRGTSQKLLLLKKLISLIPKSQPTRGT 
P* TTFPPPNTTHFPPPPVRATAPDPSPA 
HFVSSPYNPDLGSP*PECPSPGRLQREI 
EQCKKDIQNFPFPTTSRICSNDLSLKGS 
ASRRRGHLFCERPVNQFRSPKPKK 


13327 


27228 


A 


13470 


128 


256 


EDWMMWLTPVIPTPQEVEVRGSLEPRSS 
ELQ*AVIVPCTPSE 


13328 


27229 


A 


13471 


94 


388 


KGEGKTEELWSWRSHRGTHLCLIiADISA 
LPLHYYYYYYYFWQRWSLPDQPGQHGET 
PCLPKIQIINRaYWHVPWPTDQEADAR 
*LPEPNRQRSQRPE 


13329 


27230 


A 


13472 


359 


73 


ATEPGQLFYF*ESESHNVTQAGVQWLDH 
GSLQPPTPGLKPSFPLIHTGITAASHYT 
WLNQLLPMHVSLLTFYESPQLKIQRSHI 
LCNSIHIKFLE 


13330 


27231 


A 


13473 


332 


10 


ILSKRGFPKPGRYCEVNPFLCVQGAK*V 
RP*EDTSLHCEIYHTHTHTHTHTHTHTN 
LKP*TSKKLIPQKTESLFFSLCIKKSLL 
RFQTFPDAGSTOSrSVGCACFISK 


13331 


27232 


A 


13474 


355 


385 


IMVCIKQS CVH*KHN* FWPGTVARVCNP 
RTLGGQDRQTA*AQELKTSLDNMAKPCL 
YKKFF 


13332 


27233 


A 


13475 


297 


425 


RKSAWWT,MPWSPI.*KAKAGGFT.EPGST. 
RSAWATWGDPHLYKK 


13333 


27234 


A 


13476 


160 




KSTGRLGMVAHICIPSTLGSQGGWIA*A 
QEFETSLGKILSLPEIQKSASLITW 


13334 


27235 


A 


13477 


334 


79 


GMKEQINFLYQKRGI KPNKLQAENWL I 
HYN *KLLSNSFK* LQDTAS I * YFFGNYL 
NRPGMVAHACNPS TLGGRGGW I T * GQE F 
L 


13335 


27236 


A 


13478 


234 


385 


YAKFHVS INEKKPDVAHACNPGTLGGQG 
KWIT*GQEFKTSLANMTKPRIY 


13336 


27237 


A 


13479 


408 


129 


KRSINRKKGGELSSSHFLLLLLPPLDEE 
PSPPPFCPPLPPFPPPCPRLPQSG*GAC 
GRLHPC*MDLGPCKCPARKVFSHLSCSL 
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Predict- 
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sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamicAcid,F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l^Leucine, ]VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 














VLQVCRSP 


13337 


27238 


A 


13480 


251 


384 


TITPWRGAVAHVCNPSTLGGQSRQMT*G 
QE FKTSLSNMVKPRLY 


13338 


27239 


A 


13481 


164 


300 


AGDDSDNSMGLLGRIHEIKFFKCLGPCM 
TTKF IVPflLWEAEVGGWLEPTSLRLR*A 
LIALLQCTPA*GTVYDYKVYRPSTLGSR 
GRRMA 


13339 


27240 


A 


13482 


2 


30 


TGCHQFRFLNNPFSPHPVLQLVSPGPDC 
LIFRYFKF*LHQLM*INPNaHFCGAESF 
LCKFRVWIS*LRYPSLVALSVLFVFK*S 
VQI*VKKKKKKKKKKKKKKKKKKK*K 


13340 


27241 


A 


13483 


39 


199 


LDVGTCSPIVPASQEAEAGRIiHEPTSLR 
LW*AMIVQLYYSLGDRaRPCLYLKK 


13341 


27242 


A 


13484 


18 


72 


TRTRSYPGSTGCQLHTFGLVSLYNCVSQ 
FLITI!JLYISIYRDIY*YIYIYLYREIYL 
YIYIEIYTYIERYVYIHI*ILGLQVANC 
IPLD 


13342 


27243 


A 


13485 


266 


496 


LDaaSDITQAGVQWCMHSFL*PQTPGLK 
*ASTSAS*V2VEITGLSQHVWVKIIMSVS 
LSSYTFIQLPSILLYGNTAP 


13343 


27244 


A 


13486 


286 


72 


HILSHCTVLWFLTPHHGTTHYKNTTGQP 
SIVAHTHNLSTLGG*GRWTT*GQEFKTS 
LANTVKCSDAWVDRD 


13344 


27245 


A 


13487 


309 


6 


RFFFFFPPGKKGVFFQRFFFGFPRGFPP 
PRFFKTGPEIFFFGPLKKKKNFPPPGGK 
IVFF* RGAPPFFFFFFFFLFFFFFFFLK 
KNYLLLEKVQSPEKEG 


13345 


27246 




13488 


183 


459 


RRVKKICKCI IWQNNYL* KGINICSMPQ 
KKKKKKKKKKKKKGAPFKKKKKKKIS *G 
GGGPFF*KTKIKPPGGRGFFFFFGEKIM 
LFPAVFI 


13346 


27247 


A 


13489 


348 


1 


CEGKDGLPIQFASWFKYAGFHSLESIFQ 
SFCQTKKLEAPKELSPCSQLYRYHWQLT 
CRRMKLDPHLLPYTKINSRPGAAaHA*N 
PSTLGG*GGWII*GQEFETSLTNMGETP 
PTRP 


13347 


27248 


A 


13490 


313 


320 


KQGGPPLAQARVQWYNLSSGETSLPLPG 
SSISrPPPSATGVAGTTGSPPLA*LIFFFL 
♦DREWGDQRKL 


13348 


27249 


A 


13491 


187 


350 


GNPVKK* IKNTTSSSGWVWWLMPI IPAL 
WDAEVGGLLEPRSLRPAWATERDSVS 


13349 


27250 


A 


13492 


211 


83 


EYAKKNGKLLSWRKYL*SVCVCVCVTTC 
VCVCVCVTTCIKm 


13350 


27251 


A 


13493 


286 


407 


LGVEAYACNLWTLGG*GGRIT*GQEFET 
GLGNTARPCLYF 


13351 


27252 


A 


13494 


897 


1033 


KFIFKQMQWDDHSSLQP*TPELKQSSCL 
RLPSTWDYRREPLHLAM 


13352 


27253 


A 


13495 


265 


3 


HLILLEEDLTWQRGKWLKGRVSLCSLGS 
* TQSLTVLKKGRGLGWAVAPSTLGGRG 
RWVALAQELETSLGNMVKLSLSKIQKLA 
GMV 


13353 


27254 


A 


13496 


319 




CFKLWDTCAQCAGLLHRYTCVMWCYTH 
HPEFCYNIMGLPKYMWSYLWPKCCHAVY 
HFRYIKNLLPGCVAEIACNPSTLGGRGGW 
IT*GKEFKRRMKTKV*IGIYFI 


13354 


27255 


A 


13497 


265 


11 


VLGEVSNQHLLGAPSNRTPSEVPLAQAA 
QGRPSLVPPSAIVFPSYFTICYAFLSIS 
MPIIP*VWEAEVGELLEPRSLRQA*ATW 
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peptide 


Predict- 
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correspon 
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residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13355 


27256 


A 


13498 


164 


22 


GIKRKLIKSGWAHAENSSSLAGRGGWL 
T*GQEFETSLAKMMEIPSV 


13356 


27257 


A 


13499 


277 


363 


TYLC YI ENI CLFVGE lEKQR* LI * E-HRS 
TLNGHILYVEPVSPNPSQAVASPVTCTY 
MPTWPEWEESQKK 


13357 


27258 


A 


13500 


■ 


270 


YTCSVLLLVTVWLLLCLIYKLNFIIGML 
RKKTVYIYIERIQYYLKFQASTGCLGVY 
PTV*?YMYTYTYMYMYMYIYIYIY1YIT. 
OiCLDG 


13358 


27259 


A 


13501 


170 


383 


ILYSSKLTYPSPQIVIFFCGKST*GWVW 
(•niMPVIPAI *EAEVGGSLESTSLGQAWA 


13359 


27260 


A 


13502 


31 


402 


GDYLYLREGWQEPARREQHAQKLEPNRT 
BTPHLICLTNYRLFSJJflWSECEERRNET 
TDKNQTVKDLALDMKRVLPFS*GHQKVI 
KVGWAYACNPSTLGGQGQWITRGEEFK 
TSLINMAKPYLY 


13360 


27261 


A 


13503 


232 


446 


RNRPGAVAHACNPSTFRGRGGRIMRLGD 
*RS 


13361 


27262 


A 


13504 


196 


390 


VNKAEKKHLYFVARKE ILRPGVWHTCN 
PSTLGGRGGWII *GQEFETSLPLPKTQK 
LAECGRGCL 


13362 


27263 


A 


13505 


146 


7 


LKYVLYWLGAMAYTCWTSTLGGQGGWI S 
*GQEFETSIiAHMVKPCLF 


13363 


27264 


A 


13506 




18 


SQNFGRPRQQDHFRLDGRHSETPSLQEK 
VKKLTRHGGACVQSQLLERLRREDHLGP 
GGGGCSVP*SV1III 


13364 


27265 


A 


13507 


117 


338 


NKILKKKKGGGRFKESKFTSPGLQGNSF 
FMGPPKLNSRAGV*QRREGKNLGVPQLK 
PFEAWPLFARGPNTKNP 


13365 


27266 


A 


13508 


306 


443 


LARYEPAVRTRAC*AQWLTAVIPVL*EA 
EAGGPLEARNLRPAWAT 


13366 


27267 


A 


13509 


146 


390 


KMFKGHEQAAHRKKKKRGGRFKGSKFTS 
ACLQRNIFFLGPPKLNSRAGV*QRGDWK 
NPGVTQFNRFEENPLFARGPNTQKP 


13367 


27268 


A 


13510 


119 


356 


NEDRNLRGGCPGR* LLRTEGVCSNPAGW 
SGIRWCEESGGLFWRWEGTESVLFSGL 
FVCCVFAQEKftKftTGRAKVSLCP 


13368 


27269 


— 


13511 


221 


21 


EDLQRDKPLGSCYSTCGWAEQWYLQHPG 
GAGSKCRIWLGAVAHAYNPSTLGGQGRW 
IT*GQEFLSL 


13369 


27270 


A 


13512 


163 


418 


TFPDDQCLMLQDHACVKRSIQSA*YLTP 
AVLAHWE AE AGGLIiE PTS LRPAWAT 


13370 


27271 


A 


13513 


239 


21 


FGNLGGPGGRTACI QFSLGNI VRPYFYK 
TKPKPKPNQNKTKI S*AWI^CMPIVPATR 
KAEVGESLEPRRSRLQ 


13371 


27272 


A 


13514 


135 


11 


KFFFWPGVLflHaYNPSSLGAQGGRIT*G 
QEYETSLANMVKS 


13372 


27273 




13515 


162 


57 


EGTLRSRRPPLGGWVT*GQEFETSPflNM 
VKRCLY 


13373 


27274 


A 


13516 


424 


54 


PKRGFFPTPFIWVPPVFPLPPFFKPPPR 
IFFLGPPKKKFFSPPPGLKI FFF* KGPP 














FWARPFSFLSLFFFKETIPLYYNIWMHK 
DSCKAVIHHYH 


13374 


27275 


A 


13517 


310 


349 


GGGPLKKKLFFSRGGERFFFF*GAPPFF 
FFFFL*SSKI FI FFLKSFFFFFFFFFFC 
FFFFFSQILFFISSPCFVFFFLNYTSRT 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unk;nown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CSYVCALILFKGNTYSRKARHILW 


13375 


27276 


A 


13518 


273 


393 


IKCLSYRKKATTLGGQGRWIT*SQEFET 
SLSNM*KPQLy 


13376 


27277 


A 


13519 


450 


139 


IPPIALVSPFAKHTWQQQMGLVRRSSKG 
NNMSKGWEIIHPSRIFCSNHTLLRIALW 
LGAVVHTCNPSTLVGRGGWIT*6WEFKP 
SKTSMERFCILSRHKRTR6S 


13377 


27278 


A 


13520 


47 


427 


KMKQLFEMLVFQNWTIFLSHSIFRTIR* 
LTLEPLMFRYYFKQ*RTLLPPKF*DRVW 
LCtffiQGCL* CSGTVSAHCNLRLLQSSDP 
STSAS*VAGTVGAGHHIWLILNNRELPK 
QKFFINSSRNLPFT 


13378 


27279 


A 


13521 


199 


98 


SLPSVKLSICCEFPEDIVSKDPIPWLP 
WKRESCSLCALPNLLPK*EKTNIQVP* 
VFQSTIKAKaQLGMVAHACWPSTLGGRG 
GWIT*GLR 


13379 


27280 


A 


13522 


314 


18 


QTLNFYILKITmiSLVHC*IFEflRFQI 
LV*V* *NYPF*SLI CFTFSWHETQNT* * 
KNLIIMAGAVAHTCNCHTIjGDQQRRIAL 
AQEIEPSLGNIAIQ 


13380 


27281 


A 


13523 




231 


CPPAVFGTSIEQIQLKYDIKD*KIGQJiW 
WCAPVIPGTQAEVGGSLEIRSSRL*CAM 
STSMNRHCVPAWAT 


13381 


27282 


A 


13524 


239 


465 


RVQVSECLLHREKCLYFQIiGLS*LIPQT 
GWLKQQKLPGTVAHTCNP6TLRGRGKRI 
T*GQEFETSLANIVKPRLY 


13382 


27283 




13525 


388 


468 


SIiRIiGTVAHTCNPSTLaG*GGWITRSG 


13383 


27284 


A 


13526 


342 


369 


CTLNTLLLYSISDFGVSAFLATGGDITR 
NKVRKT*LRLGTVAHTYNPNTLGGQGRR 
LT*AQEFKNSLSNIGRPHLYia'KKTKKT 


13384 


27285 




13527 


241 


231 


EYVCIWSHSSLYFSSSLYIHM*VCVCVY 
I YVHTNSH I YTHM Y I HTHTHTH I C I YKL 


13385 


27286 




13528 


241 


231 


EYVCIWSHSSLYFSSSLYIHM*VCVCVY 
I YVHTNSH I YTHMY IHTHTHTHICI YKL 
EEKYKEL 


13386 


27287 


A 


13529 


34 


299 


SAPASASPWGIGVLYRPLACPSDRFSPG 
PEANCGEIETTELRVTLSPRLECSGTIT 
APCSLDLQGSRDPPPSAS*VSETTGftSH 
HAQL 


13387 


27288 


A 


13530 


158 


45 


SGHRSRARWL*PVISALWEASAGGSPE I 

KSLRNRRPG 


13388 


27289 


A 


13531 


342 




CFFLKGAPPFFFFYFFNFFFFFFLVGTD 
KLILKVTWKWKGPKIDKAVPKKNWQARG 
LP* PD IKI FYKVWLGAVANAYNP SNLGG 
*GGWIA*E*EFETSLA]MMTKPQYKKYKK 
LA 


13389 


27290 


A 


13532 


589 


742 


RIMKMLRIKICGTGPGMVAHAVUPSSLG 
DGW I T * D QE FKAS LRNM I KPHLY 


13390 


27291 


A 


13533 




446 


HVGIPSRLTSVSHPRHCKHPRTESEHVY 
CAIEARHYQCKVSRPSDAAGSEGRPRDM 
TQDTEVADTPPPNIS S I TATRTMATIGV 
GWAHAYNPNTLGGQGRRIA*AQEFKTS 
LGNLAKPCLYKH 


13391 


27292 


A 


13534 


1941 


1040 


AFHLLPYLSSGFYCSLGPCFGLGSSHMA 
AWHKEPLHNACSDSRRSAPIRQGLGSPS 
ATHDTHTRTLAHI CKEI FKERLHE I KER 
EIDS*RG*NTQR*VEREEaENKYKPTEI 



1357 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
Dcid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NTE lENHKQQEMLRHAGEERENSPREAE 
RKCRNEKCEIKSEUSHKEARLRKKVKFK 
TNEKGKG*QIITVHKEKREVGRERRGRQ 
NKKRTKSTQHVTQKGDMR* SVKEEPHLR 
RAKRNSNKMKNERYVLRQDTDHSCEQKE 
KMRQRHTRKKLAEETWKWVEHTLNREQR 
AKPRKKETKEIHIHSAILKC 


13392 


27293 


A 


13535 


1192 


1476 


E I KRKWGPGASCHACNPITGRPRRVDHL 
RSGVQGQPGQRGETPSLLKNIQI SWAWW 
CAPVI *EIKRKWGPGASCHACNPITGRP 
RRVDHLRSGVQGQPGQRGETPSLLKNIQ 
I SWAWWCAPVIPATWEAEAGGSLETGSG 
GCTEPRSHHCTQAWVTE 


13393 


27294 


A 


13536 


55 


132 


RAESCSLAEDGVQ*GNLSSLQPPPPK 


13394 


27295 


A 


13537 




325 


S P VEFRLGEPTKGTS SFGKRRNKTHTIjC 

rrcgsi-cayhlqkstcgk^gypakrkrky 
nws?ib:akrrnttgtg:?mriilkivyrtfk 
hgfrkgttpkpkpjvzvvaassss 


13395 


27296 




13538 


259 


274 


KFEAAVCYHCTTLSL* * CHTVCFLS FFL 
LEIGYHYITQAQSGAMIAHCSLKLIiGSS 
NPPVPAF*VAETRSACYNAWLVFVKFLM 
GL*ELWKDMS 


13396 


27297 


A 


13539 


1 




QASVLPQKQLMSWQKLSESTFVKFWNLV 
KTGRLGEEISCCLWREYYSISDCIiATI 
KLPASHLSMRKPRHKDFKSTFLI STKNQ 

YNMRPGAVAHTCNPSTLGG*DWWIT*GK 
NCQNQLS 


13397 


27298 


A 


13540 


268 


468 


YRRLISKKFFKHRWKSVSFFKFSFFWLG 
EVflHACNPSTLGGQGGKIA*AQEFLTSL 
GNIVGPCLYKK 


13398 


27299 


A 


13541 


308 


437 


FSWAFEIGSQAEVQWHDLHSL*PLPPRF 
BCRFSHLSLPSSWDYR 


13399 


27300 


A 


13542 


7 


244 


AKTAPLFF*FETVSCFVAQARGQWHDPG 
SLQPRPPGQETSMIKTSSDPPPPASQVA 
GATGMGHHAQKI * FLVETGSPHVSQGGL 
ALLK 


13400 


27301 




13543 


50 


175 


ALPAIHTAVGQCSVEGFCVLSDPEGDGA 
ARMKLVRFLMKLSHETVTIELKNGTQVH 
GTITGVDVSMNTHIiKAVKMTLKNREPVQ 
LETLS ILGNNIRYFI LPDSLPLDTLLAD 
AEPKIVKSMNREaVAG*R* G*ARE I FDE I 
ES 


13401 


27302 




13544 


47 


362 


VAFQGLQVPATNPANFFFFWKGGFFFVP 
QGGRQGEKIGLRDPWPPGVRKIPGPTLS 
GTREKGAPPPPPIYF*FFWKKGGQKGGP 
GGF*TWDPKGPPSPTPPKGGD 


13402 i 27303 


A 


13545 


97 


351 


NKKKKGPPPFF*KKRVGKKKKKGARAGG 
PPPKPPPFGGPRGGGPRGQNSKTPRPKG 
GNPPFKKKKKKENIFPAGPMGLKGGGHL 


13403 


27304 


A 


13546 


214 


32 


NYIKISSVDGGAKIK*WT*NLKCLVLSV 
KNI TRWVDCLSSGVQDQPGQHGETP SLL 
KTRKTSWA* WRLGLT QRWRKMPWPSR 


13404 


27305 


A 


13547 


350 


159 


QTGTLSQKTKQKKQAQWLMTVILGr,*EA 
KAGGLLEPKSLRPAWATWQDP IYKKKFF 
LERKTWR 


13405 


27306 


A 


13548 


367 


149 


FFFFFFFF*TGSHYVAQNWTQIPGLNLP 
ASASQVTATTGMHHHTRHTPFPSLTSHT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 
peptide 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Ptienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QSVSTASWTYLQNSGRH 


13406 


27307 


A 


13549 


321 


2 


GLVKVTHHGYDRNLGYPD SQAS VL S S F Y 
TA*MVQ*LQLPDKVQIYPAHPLFINDMP 
P*WLLKIPSFGPGTVAHPYNPNTLGGRG 
RRTTRDREFETSLANWKPHLH ' 


13407 


27308 


A 


13550 


216 


13 


GYSGKETVFSMRKGQQSGPCLQGPSIPM 
GKAGINQIIPKTTLGGRGGWIA*GQEFK 
TSLAHTVKPCLY 


13408 


27309 


A 


13551 


199 


391 


KDFSIFIACFKRPFITGSQGQEQWLIFP 
HAYNPSILGGRGRQIT*GQEFLTPAELG 
NMVKPCLY 


13409 


27310 


A 


13552 


70 


239 


ATCVKN*RTSWAWWCVPWPATHEAAaa 
GLLEPRSSRLQCSVIASVERHRTSVWTT 


13410 


27311 


A 


13553 


105 


257 


GQAQWLMPVIPALWEAEflGGSFAPRSLR 
IiA*ATVPGPNNFLKRAFHSLU!II,FLP 


13411 


27312 


A 


13554 


499 


165 


Y*MESHFVTQGGLQWCSLGSLQAPPPEV 
GGWLELGRQRLPMKPKIAPIWTPSPG 


13412 


27313 


A 


13555 


413 


719 


YTDKSIRSISLLPVKGSTHSMCPVKFIQ 
LQEAGGIHMRKLSGQRVDREWRLGTVAH 
TCNRSHLEGKGGQIT*AQVFKTSLGNMa 
KSCLCKKYQKLSWIWQH 


13413 


27314 


A 


13556 


224 


523 


DKVSLCHPGWECGWVQSQLTTASISQG* 
TILFPQSPE 


13414 


27315 


A 


13558 


355 


1 


QVSLQLPYCVLFQPHIYNKQAALLQRCY 
VSATKLLINAIWQYVNSLKMCEENQNVS 
LQNMPF*HRNYFGRAGHGGSCIi*SQLLG 
RLRH*NCLNLGGGGCSELRLRHCTPAWA 
TKASPS 


13415 


27316 


A 


13559 


73 


259 


KYVPHKGKISERRLCSVAQAGVQWCDHG 
SRSLEVLGSKDLLASAS*VAGITGCWGS 

SDPHTT 


13416 


27317 


A 


13560 


205 


21 


WKIS EDFLLGDAKKWAYQEDIKRUILG 
RGAVSHTCWPSTWRGRGGWIT*DQEFET 

SLPKC 


13417 


27318 


A 


13561 


254 


481 


LMALLPGSSDVDLSEYGWEHFNFVILIS 
KEIFILLFF*DGVSHLLPRLECSGMISA 
YCHLCLQGSSDRRDFLKKQ 


13418 


27319 


A 


13562 


174 


369 


TFFSPSMFVEPGPYYIAQAGVRWLFTGA 
I IVHCSLKLLALSDSPTSAS *VTATTGM 


13419 


27320 


A 


13563 


371 


109 


LFSKAGRCILQNLLCCMVSGKCWPMNGl 
DVTREYPTFIAHRVLTYAATSGSDFLGQ 
AQWLMPVI PAI * EANAGES LEPKS LRLA 
*ATVPGPRSLSQTLLHRSELCGL 


13420 


27321 


A 


13564 


206 


486 


RQLAAVHILVTPLPDHVTWAHYVTSLSL 
LSS S ENGVNNDNTH FI GLLLRD YK * GWV 
*WLTPVIPALWEAKVGRIAEGQELENSL 
GSKARPRFV 


13421 


27322 


A 


13565 


212 


398 


SRVRGC FPFNLPKSSCI*E CGQGAVftHA 
CNPSTLGGRGGQIA*AQEFKTSQGNVAK 
PHLYKI 


13422 


27323 


A 


13566 


26 


395 


Y3PVHTDKCSGVRKLGLFFFFFFEKEFH 
FFSPAWKARAKIGLNGTPLSQGKGNFRP 
KPPKKRGKRGRAPQPGKILNFKKKRGFS 
MGAKV1IPNFGPKGNPPP*PPKGAGKKGG 
TPRPGPFFFFF 


13423 


27324 


A 


13567 


400 


195 


ATRVSLAPTKNKINWGWKPTPVI PPPQE 
GEAGEFL*PQKWRVRLNKIVPLPSRPRD 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 

od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 
nucleotide 

location 

ing to first 

residue of 
peptide 


Predict- 

otide 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KTRLCFKKKKKN 




27325 




13568 


316 


154 


SQPFPGLRWEDRLSRGVQNQPKQ*LiyiPV 
IPAFSGAEVGGSLEPRSSKPA*ATNKLK 
SKVI IGGQICDKASKYSIVESSWWGWGC 


13425 


27326 


A 


13569 


297 


475 


FDYFFIY*ETKSHSAAQAGVQ*CKHSSL 
QP*PPGLKQSLLPWPPKSAGLTGVSYGT 

WPI 


13426 


27327 


A 


13570 


116 


332 


NRVLLLPKGEYRGTISAHCSLKIiLGSKR 
PPTSAA*GDDETTGVCHHTPV*FLKTFF 
lEIRPGYVPRVGHWRF 


13427 


27328 


A 


13571 


233 


405 


TVPELRASCRLKKKKKKKKGEKNLKKIK 
GFPGPGPRV*GPBAKNPGGGGPGIPKGP 


13428 


27329 


A 


13572 


408 


3 


TFCDYFKTPPPGKFFFINGPPSFFLNRG 

PSFFFFPGFPPPFLKNFFFFPLWFKGG 

GYKRGVFPPKKKGPPQPLFFFFFFFLI* 
KKTRPGTVAHPCI;IPSTLGGQGRWIT*GQ 
EFLSSLAKMftGRVURPGNCESV 


13429 


27330 


A 


13573 


273 


406 


IKEFKAERGGHS *NTS a'FGGQGRWXA*S 
QEFETSLANMVKPHLY 


13430 


27331 


A 


13574 


385 


38 


KKMSPPPPGKNFFFF*GPPPSPFFFFFF 
FFFFFFFLGGiCKTFFFPPPSRFFFFLKP 
FPPKNQGPKNFFPPKKKIGD*ARPPPLT 
GGAKGAPLFKKKKKKKKLKVRPGAVMT 
CNS 


13431 


27332 


A 


13575 


230 


53 


HWKVLKENKQIFTKISFKMNAIKDSVGQ 
AQWLMPVIPAL*EPDMGGPVEPKSLRPA 
WAT 


13432 


27333 


A 


13576 


406 


33 


VNSIVWVSS PFRVSTFI SFFELKSCSVT 
QAGV*WHDLSIiLQLQTPGLKRSPAPSRG 
LSFYLSS IFYPHSSQTELFVGTIiNAASF 
TNSRLLYLSLLLCGKYHPLVPSLPLSTI 
WYFSLRVSHHFP 


13433 


27334 


A 


13577 


426 


294 


GGFPS PPVFKPPPQFFFFGPLKKKI S P P 
PPGEKKIFF*KAPPP 


13434 


27335 


A 


13578 


268 


3 


LQAKGPAMGEAGARCSSEVWGWPRKGFD 
L* VSR* PS FDSHVRIGRVQRLMPWPVL 
WETEADISFEPCSSRPAWASWRKTYIQK 
PQRI 


13435 


27336 


A 


13579 


395 


49 


EKKKFFVCFPGFPGNPPNPS PQFF * KGL 
ISPSGSLRTRRGFFP*RVFFFFLKKQFP 
LSPRVEClirGlIPDP*PPFGEVTSTPQVA 
GAIGPPPPPRVNFYFFVKKGFIiNVGPMF 
PFF 


13436 


27337 


A 


13580 


68 


466 


GASPAQGSTLHLVILPRYFFKI PTVRTE 
SFSGSLVTSPPPLHFLPLMKERGGPGIP 
LPCPQLLQVLGGTSLPPVPSLPDTSQDK 
WPLHGVPPGHVCS*PLAGDGAWPPSPHW 
I PLNPGTSKSLQPAPPWNSA 


"13437 












ATAPGLLCS YKI FS CQLQWCVPVTQLAW 
ETEAGGSLEARSSRL* *TMITPVNRHCI 


13438 


27339 


A 


13582 


359 


3 


KQDSQWVAAAPASVAVSTCWRGLPLPWS 
GGQNRGKGSLPLQVHRGA*GTENK]SQGG 
TPRPGGGPGPSAPRGSSLGAHRKLPVHH 
TTLSSSRSALPPTPRHPAPSSPPCTEEF 
HRTRPI 


13439 


27340 


A 


13583 


174 


1 


FFFFFVETESHSVAQARVLE YSGAI PAQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methion!ne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CNLDNLSSSELPASAS*AS *VAAXTGT* 
KR 


13440 


27341 


A 


13584 


192 


206 


liFFLEVGSCYVAQAGFEFPGSGHLPLSA 
S * WGSTGVPHHTQLV*SFSSLYNLERN 


13441 


27342 


A 


13585 


244 


28 


KKPENQEINYIFYHLFNVHEQ IRISLGP 
GAVAYNCWPSTLRD*GGR1T*AQEFKTN 
LGNMARPRTRGSTRTF 


13442 


27343 


A 


13586 


312 


701 


RGPEET*PRTRKSSPRPTRSCESVSKLA 
RPPRRCSAAPAQVPRLSIiRSPKPDPPRV 
SSREKCSVPLPLLPECSETGALPRNSFLC 
QNASS PLLSLGLPPS PTVQALQPRALHQ 
HLGSTNKEDAHVAPAIOCK 


13443 


27344 


A 


13587 


410 


292 


GYMIKQQTVYHECRM*ANSHLHLPPEGL 
LRAVTLWRRAP 


13444 


27345 


A 


13588 


428 


36 


TPPPFFFFWAQKKKKIFPPPGGKIFFFL 
KGPPPLFFFFFFFFFFFFFFLKKFFGLK 
GPFFFVQD*QGLENPKEGSSLGWEKLGR 
ALLPQQRSFVPGLGPTRLFWGLVFWFFG 
FFFLPPGLCFFFALIFNC 


13445 


27346 


A 


13589 


3 


364 


TAPDDQGRPYQAGEPAAHVRADSTHGLG 
A*RRVAGHLPPRVGAPHHPGASSREGAP 
PLHPPPNL*HRRPNGSCRFLPGPAAPPQ 
GAEGGRGRDRRGTHSVAQTGGPGGFGSG 
VTPSWRSS 


13446 


27347 


A 


13590 


778 


910 


DRIRRSGAVAHTCNPSTLGGQGERTA*G 
QEFKTSLGDMVRFCLY 


13447 


27348 


A 


13591 


230 


3 


KLVNI LVEKSLVAMKSTVSEGKMVI HS P 
ASLGIWAHTHSPSTLGGQGRQIA*AQEF 
ETILGNMAEPCLYKKHSVY 


13448 


27349 


A 


13592 


223 


340 


LTPIIPEPWESEWGSLE*LTPGAVTHA 
YNPRTLGIRSGRIT*GHEFQTTLSHTMK 
TRLYHELL 


13449 


27350 


A 


13593 


35 


220 


DRASLSPRLECSGMIVAHCSINLSGSSD 
PLTPAS*GDGTTDTC3IHARLIFTGADFC 

IDASS 


13450 


27351 


A 


13594 


87 


369 


NVQKTECEISGKMQIWQKGSVKTKCSNG 
LFNFP IFSKKIASCKVKKLTRE* KYKSQ 
ARWCTPVIPATEAEVRGQLEPRSLRPAW 
ATYQPHFKS 


13451 


27352 


A 


13595 


261 


489 


SWVNERMWG**KHETSQAQWLMPI IPA 
LCEABVGGSLEARSWETSLCNIVRLHLS 
KKEOiGKGRKRGGNQ I AP SRE 


13452 


27353 


A 


13596 


238 


2 


ND CLWWLFR I PANVS TFGLLHMS LKVNT 
PGNNRFKSELGTRCL IHTCNPS I LGGQG 
GW1T*GQEFVTSMQKGSAVKNQ 


13453 


27354 


A 


13597 


36 


687 


RD VHRS T YQAGS KQDWGPGEAERLS SS R 
HGAYS CPVP I TFAEGKTRMGRDKMRLIL 
GLSWGPSLTLLLPITNAPRRPGM*EPAL 
PGNSTS*SGAVRFPGGGRK*EDSAESWS 
CFCSHPS PS SGPNPPSPPSAVCMLPGLS 
GLQR*PKQLSPPPALSLPVSSLLVRLSP 
WPPTSSHLLPQALPQSLHPQGSHRAVND 
S FSM* GMVLGSRRNRGCMEA 


13454 


27355 


A 


13598 


367 


468 


KKQRRGREPWLTPAI PALWKAKAGGSP * 
VQSSRP 


13455 


27356 


A 


13599 


183 


380 


LTSMLAVDNSSVEKTCPQAMRISRPGAV 
AHTCnSTPS I LGGQGGL lA* VQE YE TNLGS 
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SEQBD 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, !VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MAKTHLYKN 


13456 


27357 


A 


13600 


161 


657 


LIQGCWCSLFFSTRVFLaAIGMKSQWCR 
PVAMDLGVYQLRHFSISFLSSLLGTENA 
SVRLDNSSSGASWAIDNKIEQAMDLVK 
SHLMYAV*EEVEVLKEQIKELIEKNSQL 
EQENNLLKTLASPEQLAQFQAQLQTGSP 
PATTQPQGTTQPPAQPASQGSGPTA 


13457 


273 58 


A 


13602 


121 


423 


MIKVNSFGLYLYSQVIYIYIMKYYTAIY 
LKDFFFKDGVLTRHPRWNTVTTQCSLEI 
LGSTDPPASAS *AAGTTNTSHHGQLYS Y 
FKKRPTYAEMEQYKI KK 


13458 


27359 


A 


13603 


1889 


1717 


VPRIRPLSWTPPSS I *RPKPPSSTFSHP 
GKPSMGPPGLNRCPVQRMiLPGWYQHCG 
R 


13459 


27360 


A 


13604 




267 


ETCCIKNNTLK*I*SQMLWCa.PIVPATW 
QAEVGGSIiDPRSSSL*CflMITSVNKHCT 
PAWAR 


13460 


27361 


A 


13605 


267 


418 


THASGMVTHTCNPGTLRVSAGHI T*GQE 
FKTSLGNIARPHLYKI IINKY I 


13461 


27362 


A 


13606 


428 


24 


CPESNPRKREPTLKSEPILTLPITGLIL 
DDGNSPTAAAGSVFAETLQPQCSSLCC* 
VKSSPWPTASLPQPFGSAPQTLPVRYAL 
IPGPSLVSTSLGPRLAPGLPGSPSPPSP 
QVTLDSERYPSPTPCIAPALED 


13462 


27363 


A 


13607 


267 


442 


RPPPTLKVPWLGVEVHACNPRTLGGQGG 
SIS*AQKLENSLSNILRPHLYQKQQKQQ 
KK 


13463 


27364 


A 


13608 


3 


280 


yrlSviCedpmrKrKqtCfppppsfppn 
rycbtrdggmegsqgdfrkgfpa*apcr 
sgggwrgccspagapgsplcgssqggpv 
spwgqkrs 


13464 


27365 


A 


13609 


291 


248 


EKRAHGSQPMKGRERNSMCKS S E FRWA 
WSLAGVKGDGGR*GSGAGGSPKLG*AV* 
LDPEGHAGQAVFCGEAGAETQ IGGKG* R 
RDQAPLPLS PNLCSGSCLPTKHSLAHMP 
LRVKSYSSAQLGRTTCTRPSPPTITFDS 
S 


13465 


27366 


A 


13610 


267 


3 


HTDLLYNTPTHPISPRCDPKHTAIPDKQ 
SLLFFFFEMESCSVARAGVRWHDLRSLR 
PLPPRF*KFFCCSIPSLYCGRSRGSSQT 
YTS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predlct- 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=ljysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=lInknown, "=Stop codon,y=possible 
nucleotide deletion, V'^possible 
nucleotide insertion 


13470 


27371 


A 


13615 


197 


428 


KTffi/GLKTSASFIIQDFIQIPLNHNLSVP 
* TMLGSQPRFLFPPGTVSHI CWPSTLGG 
RGG1«IT*GQEFETSLATMVKP 


13471 


27372 


A 


13616 


445 


550 


ISITRGRSKWLTPVILVLWEAEAGGSP* 
GRSSRLP 


13472 


27373 


A 


13617 


3 


435 


TMVLSPADKraVKAAWRKVGAHAGSYGA 
EflLERIMFLSFPTTKTYFPHFDLSHGSAQ 
VKGHGKKEADALTKAVMIVDDiyiPIIMiSV 
LSDLHAHKLRVDPVNFKLLSHCLLVTLD 
RHLPAEFTPAVHASLDKFLASVSTVLTS 
KXR 


13473 


27374 


- 


13618 


719 


918 


CEGRREKWKIGRERKGRGRRKGEQMGRE 
GKGREDGGRKRPCTSRPRSS SRDRSNI I 
ISEHASAVEA 


13474 


27375 


A 


13619 


847 


938 


WLiyiPAVPALLEAEVGISLEAQSCRPPMA 
TQ 


13475 


27376 


A 


13620 


349 


409 


NLWSWPQaVAHVCNPSTLGQ 


13476 


27377 


B 


13621 


25 


189 


MVLSPADETNVKaAWGKVGAHAGEYQAE 
ALEHIlFLCFPTTriTYFPDFDLIHGSAQ* 


13477 


27378 


A 


13622 


142 


356 


LDQTWPILLQGLAVLAKERPPNPIEFL 
ASYLLKNKAQFEDRN 


13478 


27379 


A 


13623 


18 


1353 


AGAAQCEWSAGEAGARTMSEADGLRQR 
RPLRPQWTDDDGQAPEAKDGSSFSGRV 
FRVTFBMLAVSLTVPLLGAMMLLESP ID 
POPLRQISGIALFCSFKEPPLLLCSViaP 
NTKLRQAERLFENQLVGPES lAHIGDVM 
FTGTADGRWKLENGEIETIARFGSGPC 
KTRDDEPVCGRPIiGIRAGPNGTLFVADA 
YKGLFEVITPWKREVKLLLSSETPIEGKN 
MS FVHDLTVTQDGRKI YFTD SS SKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
LDQLRFPNGVQLSPAEDFVLVAETTMAR 
IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPSSSGGYIWVGMSTIRPNPGFSMLDFLS 
ERPWIKEMIFKLFSQETVMKFVPRYSLV 
LELSDSGAFRRSLHDPDGLVATYISEVH 
EHDGHLYLGS FRSPFLCRLS LQAV 


13479 


27380 


A 


13624 




1333 


AGAAQCEWSAGEAGARTMS EADGLRQR 
RPLRPQWTDDDGQAPEAKDGS SFSGRV 
FRVTFLMLAVSLTVPLLGAMMLLESPID 
PQPLRQISGIALFCSFKEPPLLLGVLHP 
NTKLRQAERLFENQLVGPES lAHIGDVM 
FTGTADGRWKLENGEIETIARFGSGPC 
KTRDDEPVCGRPLGIRAGPNGTLFVADA 
YKGLFEVNPWKREVKLLLSSETPIEGKN 
MSFVNDLTVTQDGRKIYFTDSSSKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
LDQLRFPNGVQLS PAEDFVLVAETTMAR 
IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPSSSGGYWVGMSTIRPNPGFSMLDFLS 
ERPWIKEMIFKLFSQETVTMKFVPRYSLV 
LELSDSGAFRRSLHDPDGLVATYISEVH 
EHDGHLYLG SFRSPFLCRLS LQAV 


13480 


27381 


A 


13625 


1 


384 


QSFRGTGRKHERERKHMSLSDWHLAAKL 
ADQPLTPKS ILRLPETELGEYSLGGYSl 
SFLKQLIAGKLQESVPDPELIDLIYCGR 
KLLDDQTLDFYGIQPGSTVHVLRKSWPE 



1363 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PDQKPEPVDKEAAMRD 


13481 


27382 


A 


13626 


980 


1089 


KHIRIQLTGGLYPVPPPHPPPQSPPIFP 
RPTSPTRT 


13482 


27383 


A 


13627 


116 


587 


VCGELRADSWPVPSQPEQASGPQKQAFI 
WPEAPSRSARLPITYTDYWDSRLQTQES 
QMLGSMARKKPRNTSRLPLALNPLKSKD 
VLAVLRERNEAIVPVGAWVEPASPGSSE 
IPAYTSAYLIEEELECEQLRKKQEALKHF 
QKQVKYRVNQQITLRKQ 


13483 


27384 


A 


13628 


150 


484 


VAFPQASRGVRKTEVGEQQGQGTACGGC 
GHQCPPPTWHIQRAHCVPSTCX3AGGWHL 
QGEASCRRPPSQSPQVYQTQVGRQDPHF 
GVGRDSRGELAWSSFIHPYLLSVCNPL 


13484 


^738r— 


A 


13629 


330 


397 


ARCEWLTPVI PALWSAKAGGSH 


13485 




A 


13630 


10 


137 


SFTGAVILIIAHGLTSSLLFCLANSNYE 


13486 


27387 


A 


13631 


380 


98 


FQWLVS FTATLWFEEHHRKNKI EREERR 
AKGERGDRKEERREEGGEERGRRGGERS 
DJCREPKKKSKEESNHPKWKFS FICS FC 
FLTPFFPVFF 


13487 












PLFEAFTACGFVHDCGLLIHPEETCGLQ 
PISSDYIEAILQSELKRCPSGDMKGQWI 
VPCLS CSDNRTCDWREITWQPHNCQYGV 
LTKPQLQQCLGGRKILFIGDSTNRGIMY 














YL lERLNETLQEWQKVHGTKFYHNVNGG 














KTLIS YS YYPQFWI SPSLRPTFENALEH 
LLQRSRPLENTGQTVLWGGVQWUSISNH 
LQI IHKVLKSPFTTLNQPVTKSCLQAIY 
FPRLSPmHSNCLDLVYSFTKS FNIYFV 
VQFLN 


13488 


27389 


A 


13634 


3 


2718 


SGPCRTTVAPLLRAAPVEHCYAALRPTD 
STMLKKFDKKDEESGGGSNPFQHLEKSA 
VLQEARVFNETP INPRKCAH ILTKILYL 
INQGEHLGTTEATEAFFAMTKLFQSNDP 
TLRRMCYLTIKEMSCIAEDVIIVTSSLT 
KDMTGKEDNYRGP AVRAL CQ I TDS TMLQ 
AIERYMKQAIVDKVPSVSSSALVSSLHL 
LKCSFDWKRWVNEAQEAASSDNIMVQY 
HALGLLYHVRKNDRLAVNKMISKVTRHG 
LKSPFAYCMMIRVASKQLEEEDGSRDSP 
LFDF I ES CLRNKHEMVVYEAASAIWLP 
GCSAKELAPAVSVLQLFCS S PKAALRYA 

DSNRS lATLAITTLLKTGSESS IDRLMK 
QISSFMSEISDEFKWWQAISALCQKY 
PRKHAVLMNFLFTMLREEGGFEYKRAIV 
DCIISIIEENSESKETGLSHLCEFIEDC 
KFTVLATRILHLLGQEGPKTTNPSKYIR 
FIYNRWLEHEEVRAGAVSALAKFGAQN 
EEMLPSILVLLKRCVMDDDNEVRDRATF 
YLNVLEQKQKALNAGYILNGLTVSIPGL 
ERALQQYTLEPSEKPFDLKSVPLATAPM 
AEQRTESTPITAVKQPEKVAATRQEIFQ 
EQLAAVPEFRGLGPLFKSSPEPVALTES 
ETEYVIRCTKHTFTNHMVFQFDCTNTLN 
DQTLEWTVQMEPTEAYEVLCYVPAHSL 
PYNQPGTCYTLVALPKEDPTAVACTFSC 
MMKFTVKDCDPTTGETDDEGYEDEYVLE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
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Predicted 

nucleotide 
location 
correspond 
ing to tirst 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
laslnmino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Vallne, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DLEVTVADHIQKVMKLNFEAAWDEVGDE 
FEKEETFTLSTIKTLEEAVGNIVKFLGM 
HPCERSDKVPDNKNTHTLLLAGVFRGGH 
DILVRSRLLLLDTVTMQVTAKSLEELPV 
DIILASVG 


13489 


27390 


A 


13635 


118 


629 


LWALQLHPEPELPSRRGTGAAGVWTGVA 
MIRFILIQNRAGKTRLAKWYMQFDDDEK 
QKLIEEVHAWTVRDAKHTNFVEFRNFK 
IIYRRYAGLYFCICVDVNDMIILAYLEAI 
HNFVEVLNE YFHM VCELDLVFN F Y K V YT 
WDEMFLAGEIRETSQTKVLKQLLMLQS 
LE 


13490 


27391 


A 


13636 


57 


221 


LHHCTPPWAEVEETLKRLQSQKGVQGI I 
WNTEGGWEPLGHCGDRSRPPAQGCP 


13491 


27392 


A 


13639 


344 


544 


LSGGHAGALSSLWVHLyCLLSSQQLLGN 
VLVTVLAIHFGKBFTPEVQASWQKMVTG 
VCSALCFHYH 


13492 


27393 


A 


13640 


319 


623 


DMEEASEGGGNDRVENLQSEVEGVKNIM 
TQNVERILAHGENLEHIiiaSrKTEDLEATS 
EHFKTTSQKVARKFWWKNVKMIVLICVI 
VFI IILFIVLFATGAFS 


13493 


27394 


A 


13641 


2099 


769 


TRLAGRVSVASRPCRGPAVGGLLVERSK 
ARRPLLESRVAMAAVPELLQQQEEDRSK 
LRSVSVDIiNVDPSLQIDIPDALSERDKV 
KFTVHTKTTLPTFQS PEFS VTRQHEDFV 
WLHDTLIETTDYAGLIIPPAPTKPDFDG 
PREKMQKLGEGEGSMTKEEFAKMKQELE 
AEYLAVFKKTVSSHEVFLQRLSSHPVLS 
KDRNFHVFLEYDQDLSVRRKNTKEMFGG 
FFKSWKSADEVLFTGVKEVDDFFEQEK 
NFLINYYNRIKDSCVKADKMTRSHKNVA 
DDY IHTAACIiHSLALEEPTVI KKYLLKV 
AELFEKLRKVEGRVSSDEDLKLTELLRY 
YMLNIEAAKDLLYRRTKALIDYENSNKA 
LDKARLKSKDVKLAEAHQQECCQKFEQL 
SESAKEELINFKRKRVAAFRKNLIEMSE 
LEIKHaRMNVSLLQSCIDLFIMlSr 


13494 


27395 


A 


13642 


210 


772 


SVKMVRYSLDPEMPTKSCKSRGSNLRVH 
FKNTRETAQAIKGMHIRKaTKYLKDVTL 
QKQCVPFRRYNGGVGRCAQAKQWGWTQG 
RWPKKSAEFLIiHMLKNAESNAELKGLDV 
DSLVIEHIQVNKaPKMRRRTYRftHGRIN 
PYMSSPCHIEMILTEKEQIVPKPEEEVA 
QKKKISQKKLKKQKLMARE 


13495 


27396 


A 


13643 


168 


2172 


SPLCEVSVPSFCFRVFCKKHKKHKSDKH 
LYEEYVEKPLKLVLKVGGNEVTELSTGS 
SGHDSSLFEDKNDHDKHKDRKRKKRKKG 
EKQIPGEEKGRKRRRVKEDKKKRDRDRV 
ENEAEKDLQCHAPVRLDLPPEKPLTSSL 
AKQEEVEQTPLQEALNQLMRQLQRKDPS 
AFFSFPVTDFIAPGYSMIIKHPMDFSTM 
KEKIKHMDYQS lEELKDUFKLMCTNAMI 
YNKPETIYYKAAKKLLHSGMKILSQERI 
QSLKQSIDFMRDLQKTRKQKDGTDTSQS 
GEDGGCWQREREDSGDAEAHAFKSPSKE 
NKKKDKDMLEDKFKSNNLEREQEQLDRI 
VKESGGKLTRRLVNSQCEFERRKPDGTT 
TLGLLHPVDPIVGEPGYCPVRLGMTTGR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequen ce 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

correspond 
ing to first 
amino acid 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leuciiie, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQSGWTLQGFKEDKENKVTPVLYLNYG 
PYSSYAPHYDSTFANISKDDSDLIYSTY 
GEDSDLPSDFSIHEFLATCQDYPYVMAD 
SLLDVLTKGGHSRTLQEMENSLPEDEGH 
TRTLDTAKEMEQITEVEPPGRLDSSTQD 
RLIALKAVTNFGVPVEVFDSEEAEIFQK 
KLDETTRLIJJELQEAQNERLSTRPPPNM 
ICLLGPSYREMHLAEQVTNWLKELAQQV 
TPGD I VST YG VRKAMG ISIPSPVMEMNF 


13496 


27397 


A 


13644 


1032 


207 


PADVTPKPATEAVQSEHSDASPMSTNEV 
ILSASGACKLIDSLHSYCFSSRQNKSQV 
CCLREQVEKKNGELKSLRQRVSRSDSQV 
RKLQEKU^ELRRVSVPYPSSLLSPSREP 
PKMNPWEPLSWMLGTWLSDPPGAGTYP 
TLQPFQYLEEVHISHVGQPMLMFSFNSF 
HPDTRKPMHRECGF IRLKPDTNKVAFVS 
AQNTGWEVE EG EVNGQELC I ASHS I AR 
ISFAKEPHVEQITRKFRLNSEGKLEQTV 
SMATTTQPMTQHLHVTYKKVTP 


13497 


27398 


A 


13646 


148 


380 


RGSWREVPESASLPSRGAKGKWRGLCCC 
CCCCCCCCCCCCYHCHQEQGDLKHQADL 
MRSGRTQNQAG I WQEHQTLEG 


13498 


27399 


A 


13647 


2099 


769 


TRLAGRVSVASRPCRGPAVGGLLVERSK 
ARRPLLESRVAMAAVPKLI,QQQEEDRSK 
LRSVSVDIjNVDPSLQIDI PDALSERDKV 
KFTVHTKTTLPTFQSPEFSVTRQHEDFV 
WLHDTLIETTDyAGLIIPPAPTKPDFDG 
PREKMQKLGEGEGSMTKEEFAKMKQELE 
AEYIAVFKKTVSSHEVFLQRLSSHPVLS 
KDRNFHVFLEYDQDLSVRRKNTKEMFGG 
FFKSWKSADEVLFTGVKEVDDFFEQEK 
NFL I NY YNRI KD SC VKADKMTRSHKNVA 

AELFEKLRKVEGRVSSDEDLKLTELLRY 
YMLNIEAAKDLLYRRTKALIDYENSNKA 
LDKARLKSKDVKIAEaHQQECCQKFEQL 
SESAKEELINFKRKRVAAFRKNLIEMSE 
LE IKHARiSNVSLLQSCIDLFKNN 


13499 


27400 


A 


13648 


1 


1206 


MSTSQSPCESICDYVTSHDKSNFTDMIK 
LNILRCEVrLDYPAPGGGSLQAKHCCSC 
YTVSSGVTEGERNAGEKGVKLNADGARI 
RGTPGRGRRAEAEASSPAPAAVARACW 
AAaAASRQLASGNRTRVSSGVPAPAFIiG 
TMNPNCARCGKIVYPTEKVNCLDKFWHK 
ACFHCETCKMTIiNMKNYKGYEKKPYCNA 
HYPKQSFTMVADTPENLRLKQQSELQSQ 
VHYKEEFEKNKGKGFSWADTPELQRIK 
KTQDQISNIKYHEEFEKSRMGPSGGEGM 
EPEHEDSQDGSSYHRPLEQQQPHHIPTS 
APVYQQPQQQPVAQSYGGYKEPAAPVSI 
QRSAPGQGGKRYRAVYDYSAADEDEVSF 
QDGDTIVNVQQIDDGWMYGTVERT6DTG 
MLPAireVEAI 


13500 


27401 


A 


13649 


3 


394 


GDGGGHLGSGENGGGSMNAPPAFESFLL 
FEGE KI TINKDTKVPNACLFTMNKEDHT 
LGWIIKSQLLKDPQVLFAGYKVPHPLEH 
KIIIEVQTTPDYSPQEAFTNAITDLISE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, ' 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Slop codon, /=possible 
nucleotide deletion, \=po$$ible 
nucleotide insertion 














LSLLEERFRVRAGPGGAD 


13501 


27402 


A 


13650 


1 


471 


SEPTGLREAAGSGPREAPRRSGCKSPGL 
GTVGMLRPKALTQVLSQflNTGGVQSTLL 
LNKEGSLLAYSGYGDTDARVTAAI ASN I 
WAAYDENGNQAFMEDNLKF ILMDCMEGE 
VAITRVANLLLCMYAKETVGFGMLKAKA 
QALVQYLKEPLTQVAAS 


13502 


27403 


A 


13651 


1334 


82 


CYTGGTQSLWPGSS CASSVARPSST..FRS 
AWSCEWSVRCAEACTMSVPAFIDISEED 
QAAELRAYLKSKGAEI SEENSEGGIiHVD 
LAQI lEACDVCLKEDDKDVESVMNSWS 
LLLILEPDKQERLIESLCEKLVKFREGE 
RPSi:iRLQLLS]Sn:.FHGMDKWTPVRYTVYC 
SLIKVAASCGAIQYIPTELDQVRKWISD 
WNLTTEKKHTLLRLLYEMiVDCKKSDAA 
SKVMVELLGSYTEDNASOaEVDAHRCIV 
RALKDPNAFLFDHLLTLKPVKFLEGELI 
HDLLTI FVSAKLAS YVKFYQKNKDPIDS 
LGLLHEQNiyiAKMRLLTFMGMAVENKEIS 
FDT>;QQE LQ I GADDi/E AFVIDAVRTKMV 
YCKIDQ"QRK\'\^/S"^STHRTFGKQQWQQ 
LYDTLNAWKONLNKVKWSLLSLSDT 


13503 


27404 


A 


13652 




377 


TTASGRSGVKGSTMSAEVPEAASAEEQK 
EMEDKVTS PEKAEEAKLKARYPHLGQKP 
GGSDFLRKRLQKGQKYFDSGDYHMAKAK 
MKNKQLPTAAPDKTEVTGDHI PTPQDLP 
QRKPSLVASKLAG 


13504 


27405 


A 


13653 


424 


596 


SLKNIYGLSCRKKKKGAVKKIILVQAWW 
LMPVITVLWEAEVGGLLBARGLRPTRAT 

W 


13505 


27406 


A 


13654 


190 


448 


LRSYPAPHLGSPELRIRKGRGHSHCLAG 
AAGPQRTALCGLSAPLCPPSPTPPGAGA 
PRYCSGSDAPPCLLRGMPP IPGMGDPE 
TS 


13506 


27407 


A 


13657 


1612 


563 


SMPGWRLLTQVGAQVLGRLGDGLGAliliG 
PGNRTHIWLFVRGLHGKSGTWWDEHLSE 
ENVPFIKQLVSDEDKAQLASKLCPLKDE 
PWPmPWEPGSFRVGLIALKLGMMPLWT 
KDGQKHWTLLQVQDCHVLKYTSKENCN 
GKMATLSVGGKTVSRFRKATSILEFYRE 
LGLPPKQTVKIFDTITDNRAIKPGTPLYA 
AHFRPGQYVDVTAKTIGKGFQGVMKRWG 
FKGQPATHGQTKTHRRPGAVATGDIGRV 
WPGTKMPGKMGNIYRTEYGLKVWRINTK 
HNI lYVNGSVPGHKNCLVKVKDSKLPAY 
KDLGKNLPFPTYFPDGDEEELPEDLYDE 
NVCQPGAPSITFA 


13507 


27408 


A 


13658 


128 


2626 


NSHRWVYVEARRWRRRGKQREQPEDRGV 
PMKRAAMALHSPQYIFGDFSPDEFNQFF 
VTPRSSVELPPYSGTVLCGTQAVDKLPD 
GQEYQRIEFGVDEVIEPSDTLPRTPSYS 
ISSTLNPQAPEFILGCTASKITPDGITK 
EAS YGS IDCQYPGSALALDGS SNVEAEV 
LENDGVSGGLGQRERKKKKKRPPGYYSY 
LKDGGDDS I STEAL VNGHANSAVPNSVS 
AEDAEFMGDMPPSVTPRTCNSPQNSTDS 
VSDIVPDSPFPGALGSDTRTAGQPEGGP 
GADFGQS CFPAEAGRDTLSRTAGAQ P CV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Fredict- 

otide 
location 

last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leucine, M=Metliionine, 
N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlinown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide Insertion 














GTDTTENLGVANGQILESSGEGTATNGV 
ELHTTES IDLDPTKPESAS PPADGTGSA 
SGTLPVSQPKSWAS L FHDS KP S S S S P VA 
YVETKYSPPAISPLVSEKQVEVKEGLVP 
VSEDPVAIKIAELLENVTLIHKPVSLQP 
RGLINKGNWCYINATLQALVACPPMYHL 
MKFIPLYSICVQRPCTSTPMIDSFVRLMN 
EFTNMPVPPKPRQALGDKIVRDIRPGAA 
FEPTYIYRLLTVNKSSLSEKGRQEDAEE 
YLG F I LNGLHEEiyiLNL KKLL S P SNE KL T 
ISNGPKWHSVNEEEQEEQGEGSEDEWEQ 
VGPRNKTSVTRQADFVQTP ITGI FGGHI 
RSWYQQSSKESATLQPFFTLQLDIQSD 
KIRTVQDALESLVARESVQGYTTKTKQE 
VEISRRVTLEKLPPVLVIiHLKRFVYEKT 
GGCQKLIKNIEYPVDLEISKELLSPGVK 
NKNFKCHRTYRLFAVi/YHHGNSATGGHY 
TTDVFQIGLNGWLRIDDQTVKVINQYQV 
VKPTAERTAYLLYYRRVDLL 


13508 


27409 


A 


13659 


42 


382 


TLWLKTIQIYLTISLGLGSDYGLAGFSA 
LGCHC3AAVKMLTAFILIQGLDLGRSHFQ 
AHSGCWQDSFPCODSRIHGGLLLHVQQES 
PWFQSAERVSCITKCNHRSDTHHLCSIL 
L 


13509 


27410 




13660 


178 


349 


DMGPCYVAQWLKLLGSSDPPTSASQEA 
GTTCHHAHLLSHPFLMIYRNFSFIQCLE 
T 


13510 


27411 


A 


13661 


268 


2 


RQQKVILSSSGSLMSEMGRTVLPLKSPG 
MNPSSPLLASGGCWKSLACGHIIPFSSF 
ILIMWPSPLCVSVSSSLPMRKPLLLHLRS 
TLFQ 


13511 


27412 




13662 


112 


483 


AGVGAIiRMVQRLTYRRRLSYNTASNKTR 
LSRTPGNRIVYLYTKKVGKAPKSACGVC 
PGRIiRGVRAVRPKVLMRLSKTKKHVSRA 
YGGSIMCAKCVRDRIKRAPLIEEQKIWK 
VLKAQAQSQKAK 


13512 


27413 


A 


13663 


2 


873 


SVEEFDRGCTGRGCGADARAGAAHVKIS 
FQPAVAGIKGDKADKASflSAPAPASATE 
ILLTPAREEQPPQHRSKRGSSVGGVCYL 
SMGMWLLMGLVFASVYIYRYFFLAQIjA 
RDNFFRCGVLYEDSLSSQVRTQMELEED 
VKIYLDENYERINVPVPQFGGGDPADI I 
HDFQRGLTAYHDISLDKCYVIEUJTTIV 
LPPRNFWELLMNVKRGTYLPQTYIIQEE 














LREKATRRRINKRGAKNCNAIRHFENTF 


13513 


27414 




13664 


118 


3 


AWSLI PVI P WREAKAGGLLEPRSLRPT 
WATWQDPVST 


13514 


27415 


A 


13665 


1 


2876 


I RQRINFSRKNKWDSRKLDQAGVSELAT 
NQKLILVCGRYEGIDERVIQTEIDEEWS 
IGDYVISGGELPAMTLIDSVSRFIPGVL 
GHEASATEDSFAEGLLDCPHYTRPEVLE 
GMEVPPVLLSGNHAEIRRWRLKQSLGRT 
WLRRPELLENlALTEEQARLIJlEFKTEH 
AQQQHKHDGMAADEAGRTFLRADFNMIE 
EGDRIMVCLSGGKDSYTMLEILRNLQQS 
APINFSLVAVWLDQKQPGFPEHVLPEYL 
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SEQID 
NO: (If 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 

location 

corrcspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=Valine, 
W=Try ptophan, Y=Tyrosine, 
X=Unlcnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EKLGVEYKI1/EENTYGIVKEKIPEGKTT 
CSLCSRLRRGILYRTATELGATKIALGH 
HRDDILQTLFLMMFYGGKMKGMPPKLMS 
DDGKHIVIRPLAYCREKDIQRFADAKAF 
PI I PCNLCGSQPNLQRQVIADMLRDWDK 
RYPGRIETMFSAMQNWPSHLCDTNLFD 
PKGITHGSEWNGGDLAFDREEIPLQPA 
CWQPEEDENQLDELELNWEPDGPRHRH 
APDARRFCQYLRHQARYLLSWPNLDVA 
VTEPLANGDGLNVL I KKEV VGFRAMTVE 
KTGENQYRVWPNEMPADLHKIRPHHPLN 
RNLDHFFPLDLTNSTFSGGYVHVLK6VL 
SDDLLLKSFQKMGYVRRDSHRLMVTALP 
PACQLVQVALGCFALRLECEILGEVLAQ 
LGTSVLPAEELLQAERASGDVASCQRLA 
QDEEPPPLPPRGSPAAYRAPLDLYRDLQ 














PPLWEQSAKLWGTGGRAWEPPAEELPQA 
SSPPYGAT.REGLRPKPSAFSFLSTiRREL 
SRPGDIaATPESSAAASPRRIRAEGVPAS 
AYRS VS EP PGYQGTQLPVPWRPAHPLLR 
HLSPAACCPLCSPARLPSRPLAACTiAWR 
RPAALVATACTDGHSALQQPRGPALGQL 
GPRSRALGRQLPWCFSLLGRGLGLWLPC 
PWGSPGHSRGLESPGQPLLEVGLSVYGR 
LQGQQA 


13515 


27416 


A 


13667 


76 


289 


SGTPASPCLEMDPNCSCSPVGSCACAGS 
CKCKECKCTSCKKSCCSCCPVGCAKCAQ 
GCICKGTSDKCSCCA 


13516 


27417 


A 


13669 


2 


657 


GRVDAEQSRLQATEEAAftaftNNPEYDYL 
FKLLLIGDSGVGKSCLLUIFADDTYTES 
YISTIGVDFKIRTIEIJDQKTIKLQIWDT 
AGQERFRTITSSYYRQAHQIIWYDVTD 
QESYMTVKQWLQE IDRYASENVNKLLVG 
NICSDLTTKKWDNTTAKEFADSLGI PFL 
ETSAKNATNVEQAFMTMAfiEIKKRMGPG 
AASGGERPNLKIDSTPVKPAGGGCC 


13517 


27418 


A 


13670 


176 


398 


RII.KTQLQEHNQPTTTTKNRQKTRTDTL 
PKGIYRRHISLWKLQTWLTPVIPALCEA 
NAGGRLESRSSRPVWATY 


13518 


27419 


A 


13671 


149 


247 


RKGLALSPRLDCSGAITVNCSLNLQGLS 
NIPI 


13519 


27420 


A 


13672 


257 


359 


VASLHGWAQWLTPVI PTLWKTKVGEFFE 
PRCSRP 


13520 


27421 




13673 


142 


1 


HLKGSGLAKHQWLMPVI PALWEAEVGGS 
LEPRSSRPVWATLQDPISS 


13521 


27422 


A 


13674 


619 


168 


MFIESCCTPKIYTSFVLRQSHSFAQAGV 
QWLDLGSLQPLSLGSSNSPASASCIQVL 
FVIYTSVKLGVGGRLGNKECVMSSNKKFV 
CECRGWIILPFSVPSLRFQDGGWDHGVS 
SAEKALQAGRTAGHRVGVPPLGAPCGGA 
PCTTPPAWRS 


13522 


27423 


A 


13675 


3 


370 


ARALLHPRLHQWQCLCHLQGTGGCPQQS 
PSSEGEADLLSPALEEGYPPPASGCFIP 
HFLCGC S I FNCLE P I MRHAHKS PDALLE 
TLGPCRAWRLMPVI PTLWEAKVDRLLES 
RSSRPSWPTW 


13523 


27424 


A 


13676 


3 


281 


IGUIPELSVSLGRQRLTAIRLQPLLPAR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 

sequence 


Predict- 

otide 

correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iVI=]VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlii eonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TPCGFLEARVftLALMADETDLPLLPRLE 
YSGMITAYCSLDFLGSSDPPTLC3SGVAG 

TTENCCED 


13524 


27425 


A 


13677 


272 


389 


YCRQ I EKSKKKRCWLGTVAHACNPSTWG 
GRGGRITRSRD 


13525 


27426 


A 


13678 


261 


363 


NSSWGHARWLTSVI PMiWEAEVGQSPEV 

RSLRPV 


13526 


27427 


A 


13679 


149 


44 


SCVMSVSVLKKMYWLGAVAHACNPSTLG 
GQGGRTA 


13527 


27428 




13680 


118 


267 


MSHCTWLMCVCVCVCVCGVYAYMYTYSF 
LRWDLPMLSRLECSGYSUAQ* * 


13528 


27429 


A 


13682 


16 


346 


NHHLTQPHPHLDINDFFFFFFFFFFFFF 
FFFFFFFFFFFLKKSPRPAGKKKKKKKK 
KKKPPRFFFLGGAPPPLFFFLINFFFFF 
LCKIISLVYNRRGGPCLSKRFFLKERKS 
PL 


13529 


27430 


A 


13683 


244 


140 


GRVDVFHHVAQAQLKLLSSGDLPAWPPK 
EURLQV 


13530 


27431 


A 


13684 


191 


777 ■ 


NSDEHVYRCLYGHKLSFLLSKYLKSGIA 
GVLLPLDAAVDMEKIEEQFANLHIVKCS 
LGTKEPTYLLGIDTSKTVQAGKENIiVAV 
LCSNGSIRIYDKKRLNVLRKFSGYPGLL 
NGVRFANSCDSVYSACTDGTVKCWDARV 
AREKPVQLFKGYPSNI FISFDINOTOHI 
I CAGTEKVDDDALPGGFGDaRMEFLQN 


13531 


27432 


A 


13685 


150 


253 


VGWGLWLVPVIPAFWEAKMGESLDPRNL 
LPCWAT 


13532 


27433 


A 


13686 


500 


321 


lEMGSHYVAQAGLELLSSSNPPTSVSQS 
AGI VGMSHHALftAI SKSSDASITSHHPM 


13533 


27434 


A 


13687 


6 


424 


MSLLQCGGIRCFKMPEPAKSAPAPKKGS 
KKAVTKAQKKDGKKRKRSRKESYSVYVY 
KVLKQVHPDTGISSKRMGIMNSFVNDIF 
ERIAGEASRLAHYNKRSTITSREIQTAV 
RLLLPGELAKHAVSEGTKAVTKYTSSK 


13534 


27435 


A 


13688 


178 


293 


TGYSSQAQWLMTVLTATWEAEAGGSLQP 
RSLRPAWATQ 


13535 


27436 


A 


13689 


663 


144 


KELSAVSAGIPHSCGSQGCGGGSVAACV 
PAAPAAAGLCSGEAQKVPPPPSLAGWPP 
GVNAPPPPVCSSVRLHVCQSDRLWVELA 
ARRGILALLRSALKAATLAGCQSVRWSV 
RPSESLRPTSWAASLFRSSVPTVLSHSV 
PLAASLGKRRACGGREHASVAVYLSVCL 
SLPT 


13536 


27437 


A 


13690 


125 


3 


FAHQGHAPGQAWWLRPVIPVLREAELGG 
RPEPRSSRTARA 


13537 


27438 


A 


13691 


136 


24 


GGPPFFFFFFFFFFFFFFFFFFFFFFFF 
FFPGKLQQDFQYYVK 


13538 


27439 


A 


13692 


144 


1 


MLKIVQSGECLTLKFE<NFCLLSLCTLFP 
TLIALTTLLLPISPFILIIL 


13539 


27440 


A 


13693 


1476 


380 


NSWSTLASELTLWAATSADPERKSQAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
VNSLEKFHFKSWMSKHRKTYSTEEYHHR 
LQTFASWWRKINAHHNGNHTFKMaLNQF 
SDMSFAEIKHKYLWSEPQNCSATKSNYL 
RGTGPYPPSVDWRKKGNFVSPVKNQGAC 
GS CWTFS TTGALE S AI AI ATGKMLS LAE 
QQLVDCAQDFHNHGCQGGLPSQAFEYIL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515.1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino add 

residue of 

peptide 

sequence 


Predict- 

nncle- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=iVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














YNKGIMGEDTYPYQGKDGYCKFQPGKAI 
GFVKDVAN IT I YDEEAMVEAVAL YNP VS 
FAFEVTQDFMMYRTGIYSSTSCHKTPDK 
VNHAVLAVGY6EKHGIPYWIVKNSWGPQ 
WGMNGYFLIERGKNMCGLAACASYPIPL 
V 


13540 


27441 


A 


13694 


87 


2 


PGWAWWFTPVILALWETEAGGSPEFKSL 


13541 


27442 


A 


13696 


130 


2 


CNILI 1 1 IQVCWAWWLMPVIPALWHAEP 
GGSPEVESSRPALV 


13542 


27443 


A 


13697 


282 


107 


GLTKCPGTVCRSLKWIS YQLS I ICLSM 
MEHSGTILAHCSLKLLGSSDPPTSVSRV 
AS 


13543 


27444 


A 


13698 


198 


3 


CSITVNKVKVMTLFRSSMSLEIVCDIDL 
AVLRRDLGSLQDPPPRFTPLSCLSLPSS 
WDYRRPLV 


13544 


27445 


A 


13699 


2 


120 


ARGDGVSLWLPRLKCN6AISAHCNLRPT 
GSSDKNFKTRF 


13545 


27446 


A 


13700 


60 


341 


PDMGLEDERKMLTESGDPEEEEEEQEEL 
VDPLTTVREQCEQLQKCVKARERLELCD 
ERVSSRSHTDDDCPEELFDFUiaKDHCV 
ALKLFNNLQ 


13546 


27447 


A 


13701 


6 


540 


KNSRTLGQCGGIRGSLCRPRKPGVGGTQ 
TRAVRPAVCSADSAHPHLPLRRADMKDS 
LVLLGRVPAHPDSRCWFLAWNPAGTLLA 
SCGGDRRIRIWGTEGDSWICKSVLSEGH 
QRTVRKVAWSPCGNYlftSASFDATTCIW 
KKLTUIIYNILRKLEACIKPLCCALKYG 
CLEEKQIflS 


13547 


27448 


A 


13702 


239 


346 


AKRGWLMPVIPALWEAKVGKSLEPRISR 
PLWATWQN 


13548 


27449 


A 


13703 


210 


300 


LMPVISALWEAEARGLLEPRTLRPAWAT 
W 


13549 


27450 


A 


13704 


141 


440 


PSAFEHFEKINLYFLKFCISQGFPERQN 
NRDREIHRERYERDRERHRERYERGLRE 
LAHVIVEAEKPHHRPFITWETLGCWWSG 
SVQVQKPWEPGKLMV 


13550 


27451 


A 


13705 


204 


438 


LLNVI.r.TQLFFI,FLSLRQSCSVAQARGQ 
WYNHGSLQPSTHGASNPPTSASQSVGTT 
GMSHDHGLFLHFSTLDIFSSL 


13551 


27452 


A 


13706 


1707 


1821 


AKAGGSQHLEILANAVKPCLYWECKMAG 
PWWCAPVGG 


13552 


27453 


A 


13707 


167 


334 


I RRANFKI LTE I GWTQWLTPVI PALWEA 
EGGGLLEPGSPSLVFLFFILTTILMRKK 


13553 


27454 


A 


13708 


224 


344 


KDTAMEEEIKDTEKTGKPWLGAVAHACN 
PSTLGDQGRQIA 


13554 


27455 




13709 


1476 


380 


NSWSTLASELTLWAATSADPERKSQAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
VNSLEKFHFKSWMSKHRKTYSTEEYHHR 
LQTFASNWRKINAHNNGNHTFKMALNQF 
SDMS FAEI KHKYLWSEPQNCSATKSNYL 
RGTGPYPPSVDWRKKGNFVSPVKNQGAC 
GSCWTFSTTGALESAIAIATGKWLSLAE 
QQLVDCAQDFNNHGCQGGLPSQAFEYIL 
YNKGIMGEDTYPYQGKDGYCKFQPGKAI 
GFVKDVANIT lYDEEAMVEAVALYNPVS 
FAFEVTQDFMMYRTGIYSSTSCHKTPDK 
VNHAVLAVGYGE KNGIPYWI VKNSWGPQ 
WGMNGYFLIERGKWMCGLAACASYPl PL 



1371 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

sequence 


Predict- 
otide 

correspon 
ding to 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkiiown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















13555 


27456 


A 


13710 


753 


257 


YEKILKLTADAKFESGDVKATVAVLSFI 
LSSAAKHSVDGESLSSELQQLGLPKEHA 
ASLCRCYEEKQSPLQKHLRVCSLRMNRL 
AGVGWRVDYTLSSSLLQIRGKSPWCHCG 
WKVATCPQGPQAPACLPCPFSSKTKFPR 
SSLAETESRPKTLMKLPWAEEKGVPRPV 


13556 


27457 


A 


13711 


133 


401 


VLSKSGNPYWGLSTNHEIEGQEGCAWQL 
TPVIPALWEAEAGGSLEPRSLRPAWATS 
WLPRE IKLDLLKNGCS3PSKITQGGPMVA 
GSLKL 


13557 


27458 


A 


13712 


212 


74 


RRINLAPPRVFLKGPPFFFFFFFFFFFF 
FFAWGNPLFLFNSTFGRT 


13558 


27459 


A 


13713 


324 


141 


DRVLPRLECSGVITARYRLNLLGSRDPP 
TSASLIAGSTGTCHHAWLSFSYLFTYVF 
FIIL 


13559 


27460 


A 


13714 


299 


431 


IRLIWLSAVDHACNSSTLGGHGVQMTKG 
QDFETNLANTANPHPY 


13560 


27461 


A 


13715 


133 


1 


ILVWAWWDMPAIPVLWEAETEGLLEPRS 
liRPAWATWKNTISTKN 


13561 


27462 


A 


13716 


18 


165 


I PLHTHTHTHTHTHTHGlf LIiALSG'i'E FL 
SVLYMASERECRTYHLMUIK 


13562 


27463 


A 


13717 


119 




KPNVSQAQWLAPVIPALWEARVGVLFKP 
RSLRPTWATQQ 


13563 


27464 


A 


13718 


271 


64 


VLRSISTLNEILAKTRNPPLQILMKPRK 
RWGTVAHLRAPVGDQPDQHSETPSLLKI 
TKISWVWWHTPVI 


13564 


27465 


A 


13719 


21 


209 


KDGPKGTGGSETNESVEMTECCSVTQAG 
VQWHDLGSLQPPPPGFKRFSCLGLPSSW 
DYSRND 


13565 


27466 


A 


13720 


391 


151 


RGFFLKGEKKKNFFPIILGKKILGSPGK 
KGEKKKKKRGKIFFFLKNPLGFFPKKKV 
LGEKKNPYSGVWGGKKKRGPPPHP 


13566 


27467 


A 


13721 


109 


2 


RRLGWAHACNPSTLGGEVGGSQGPGNC 
GQVTKVS 


13567 


27468 


A 


13722 


69 


369 


RDILQADGAHSLFPTTPSLNAYIIFSSP 
IGPHTHRPYHATPTYLPFYTNNLIKKKK 
KKKKKKKKKKKKKKKKKKKKKKKGGGP F 
KKTPGGAPI FGGGGK 


13568 


27469 


A 


13723 


58 


1208 


FWNENSPASELAPNGGGSVTSVPRLEDY 
LTRPQLYKLTGVEGP3RANSRDS FHDRK 
TYKTPSANMMVLKVEELVTGKKNGNGEA 
GEFLPEDFRDGEYEAAVTLEKQEDLKTL 
LAHPVTLGEQQWKSEKQREAELKKKKLE 
QRSKLENLEDLEIIIQLKKEKKYRKTKV 
P WKEPE PE I ITE PVDVPTFLKAALENK 
L P WE KFLS DKNNPDVCDE YKRTALHRA 
CLEGHLAIVEKLMEAGAQIEFRDMLEST 
AIHWASRGGNLDVLKLLLNKGAKISARD 
KLLSTALHVAVRTGHYECAEHLIACEAD 
LNAKDREGDTPLHDAVRLNRYKMIRLLI 
MYGADLNIKNCAGKTPMDLVLHWQNGTK 
AIFDSLRENSYKTSRIATF 


13569 


27470 


A 


13724 


400 


135 


KWNGKELNGVDGI GWDWNEMYWKGLE FH 
ETEWNELERNGLEWNGMSWDGIKCNGI K 
WNGIECSGMLWNGMECIGMECPLMEWIR 
VEWN 


13570 


27471 


A 


13726 


186 


344 


SASLGLWRCRDCRRSLVHSVLNVAQAGV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


nucleotide 
location 
correspond 
ing to first 

peptide 
sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ]VI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QWEDLSSLQPPPPSRLPWPPKVLR 


13571 


27472 


A 


13727 


380 


168 


APPQKKVFS SKFPQGFFFAPPNRKKLFF 
S PPRKI VPPPKFFIiWAPPLFFFFFFFFF 
FFFFFFWYSRTDYK 


13572 


27473 


A 


13728 


324 


219 


RIMFFIFFYFFFFFFFFVFFLFFFFFKR 
PPPFFY 


13573 


27474 


A 


13729 


2 


270 


RLQEASLSPVPEILGGftLPSAKRTSYLL 
LSTLCLLLSETASRGNSLTGLtSHRSDHY 
NSVRSGGQSLYSACPIVTTIHGTCYGGK 
AKCCK 


13574 


27475 


A 


13730 


134 


3 


AVHRCKKWGRGWGRWLTPVI PTLWEAEV 
GGPLEPRSLRPAIAT 


13575 


27476 


A 


13731 


429 


641 


RKKAVCFMNDLICFLDNTFKHlSrVLSQAW 
WCVHIiVPTIWEAEAGGSLEPRSLKLQCP 
WAPVI3NCTPAWAT 


13576 


27477 


A 


13732 


188 


32 


IFFFFLFFYFLFFFFFFFPRSRHCTLQP 
GQQSETRSQNKKQKTKNKQTKNLP 


13577 


27478 


A 


13733 


341 


3 


GTVTPEEPKBCMGDQTPRGIiPRPERTLGR 
LSTSESRVDPTPRVGGGSSGGRTAPWGP 
PRERGRVAGSSLHRACNFSTSNTI ISVG 
LWLTPVIPTLGEAEAGGSPEVRSSTHAS 


13578 


27479 


A 


13734 


208 


2 


KLYKNFFFRFTKNPPPLLKKGFYLYNFF 














FKKKKKIVKKKKKKKKKKICKQKKKKKKK 














GRPliAIKNYPRV 


13579 


27480 




13735 


16 


212 


EGVSLLLPRLECRGTISftHCNLRHRGRR 
CTPASASQILRRLKQENHLNLGGGGYRD 
PIKLQRTAS 


13580 


27481 


A 


13736 


22 


214 


PDRPALPTRPGGWITWGRfiVRDQPGQCG 
ETPSLLKIEKLAGCDGTCLEQNCLQNNN 
SNNKVGIR 


13581 


27482 




13737 


386 


51 


WGNS I TQQALFEHLCFSGHTSLPAVP C P 
HQACFPTLPLICRGIIIAHCNLGLLGSS 
DSPISASPIDGNTGLHPHARLFSFLLWY 
RWGLTMMPKRSLSPGLKPSSRLADAWV 


13582 


27483 


A 


13738 


204 


431 


S QHFG IS RRADHLSSNVGDQPEQHGKNS 
LYKYKKINQGWAWWLTPVIPPLWEGQVG 
GSPEGKELKTPLRNLGKPHL 


13583 


27484 


A 


13739 


69 


159 


I KKEALSWAWWLTPVIPflLWKjyMAGGSP 
E 


13584 


27485 


A 


13740 


32 


285 


FFLSFFLFFFFFFGGGGKKKTKKFFSQD 
FLGPPKILGGGGFFSFFFGGPFFLKKGK 
KSPQNLFFLKKNPPPFFFPPPKKPLWGF 


13585 


27486 


A 


13741 


372 


213 


YRSCVLLQQNPPLFYQSWWNFSLCVCVC 
VCVCVCVCETGSLKKFLLTQKKKH 


13586 


27487 


A 


13742 


29 


237 


'TKRKGYKTNKGYWAWAWWLTPVI PALF I 
AEGNLILQMQKQAQPEAGCGCVCVCVCA 
CMCVCVCVFKVFF 


13387 


27488 


A 


13743 


322 


183 


HTHTHTHTHTHTHTH'mPIHVFWLLLKY 
PKIQSHQFHISAFQDPLL 


13588 


27489 


A 


13744 


305 


373 


VGWGWWLTPVIPALWEAKVGGS 


13589 


27490 


A 


13745 


172 


1 


KNKIEEQMKWEMRQEAVftHSCWPSTLGG 
QGEFETSLVNMMKPCLYKKMQKISQVCS 
C 


13590 


27491 


A 


13746 


285 


1045 


FQHPFGLSQSEMAAVKASTSKATRPWYS 
HPVYARYWQHYHQAMAWMQSHHNAYRKA 
VES CFWLPWYLPSALLPQS SYDNEAAYP 
QS F YDHHVAWQDYP CSS SHFRRS GQHP R 
YSSRIQASTKEDQALSKEEEMETESDAE 
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S£QID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 

ingt^first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PlienylaIanine, 
G=GIycine, H=Histidine, Msoleucine, 
K=Lysine, I^Leucine, M=Metliionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl4nown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possibIe 
nucleotide insertion 














VECDLSNMEITEELRQYFAETERHREER 
REQQQLDAERLDSYVNADHDLYCNTRRS 
VEAPTERPGSRRQAEMKRLYGDSAAKIQ 
AMEAAVQLSFDKHCDRKQPKYWPVIPLK 
F 


13591 


27492 


A 


13747 


2 


305 


GRVGSFSVRDVBLSDPAEERGEMPVAVG 
PYGQSQPSCFDRVKMGFVMGCAVGMAAG 
ALFGTFSCLRIGMRGRELMGGIGKTMMQ 
SGGTFGTFMAIGMGIRC 


13592 


27493 


A 


13748 


2 


305 


GRVGSffSVRDVELSDPAEERGfiMfeVAVG 
PYGQSQPSCFDRVKMGFVMGCAVGMAAG 
ALFGTFSCLRIGMRGRELMGGIGKTMMQ 
SGGTFGTFMAIGMGIRC 


13593 


27494 


A 


13750 


238 


423 


AVSWDQLTWGTGVQEKKIQAQAWGLMPV 
IPTLWEGEVGGSPEVKSSRPACPHGKTP 
FLLKMQ 


13594 


27495 


A 


13751 


455 


248 


I S VGPGI FQLNF CFLEPYGTFLSEKKFF 
IMETGSRFVIQSGVQWGSYSSPQPQFFG 
FESSLKPPHSGVK 


13595 


27496 


A 


13752 


2 


98 


I PTPTIHTHTHSHTHTHTHTHTSFLYMP 
PDLK 


13596 


27497 


A 


13753 


196 


2 


IWAPPKIFLKKPPLFFFFFFFFFFFFFF 
FFFFPFFIISFI PWPLTRKQKLERSWVF 
FKDSACSA 


13597 


27498 


A 


13754 


917 


379 


KYKKCSLQKNLLLVGCKKYSLCYRRHKL 
HLVTHGERKKPAAVNSFFLFFSFFFFIJf 
LVHNTEITKKRVNLHETKflDAESCNQDC 
TTSSKTASELEQIRSGKHNSGKWAGEGA 
AG6RREGERMEWTEMRRGARGRRGGRER 
EMERARVRGGEEREKEIDLYKKVTSKIE 
ETKLGNLLKLG 


13598 


27499 


A 


13755 


175 


21 


KKKNFFFPPRVILGPPKVFFKRAPLFFF 


13599 


27500 


A 


13757 


234 


341 


E IGWVQWLMPVIPAVWEAEVGGIiLEPRS 
LRAflRAI 


13600 


27501 


A 


13758 




365 


PAPNRRGGHIQDRATNSTELGANQCFFF 
SPRPPSLEKKTPEINKEPRPAPQPSNPG 
NLGTREGGDSWAGTTRCLRRTDEGDTYR 
TEPPTALSWGQTRAFFFPPALPAGKKRH 
RNLLKTQFF 


13601 


27502 


A 


13759 


87 


181 


SHTHTHTHTHTMTHTPftlHWlHPDMfil 

LGL 


13602 


27503 


A 


13760 


1 


228 


ARGERERERERERERERERERERERERE 
RYREGGRLLLTMGEGETERATDLYHTPP 
PSIKAWRLCEPRPRAGILCGRNIF 


13603 


27504 


A 


13761 


383 


197 


RCDPPAWSQSARITGVSYRAQPATSVL 


13604 


27505 


A 


13762 


276 


57 


YAVLGGGGGRKNLFVFVFFLFFFFFFFF 
FKKQGFGLMPLGEDRGPNMGFFYFPMVG 
DKTKLPFKKKKKKRPR 


13605 


27506 


A 


13763 


377 


209 


PRPAYGPAFLFTEGFSWNPFFFFQMESR 
SVAQAGVQWCYLGSLQPPPPWFPGTLLN 


13606 


27507 


A 


13764 


160 


202 


MEKYNVHPHSGILHSHEKEQAALFTIAK 
RWRQPSYPSIDEWKNIMSIHTVEYYTAM 
KRNKQLYSQ 


13607 


2750S 


A 


13765 


390 


284 


ESGGHFLSLSLSIYIYIHIYIYVCVYIY 
THTSHTI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 

residue of 

sequence 


Predict- 

otide 
location 
correspon 
dingto 
last amino 

residue of 


Amino acid sequence (A=Alanlne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13608 


27509 


A 


13766 


350 


41 


YWILQENRMTQ1JRKLLPKSTEL.CMLLQ 
MLLFILNTTNYPLPLWQCYHRFCSFFLK 
TGS F YVTQAKVRWLFTGM I TVHCSLKLL 
DSSHPFKHLVLQELATVAS 


13609 


27510 


A 


13767 


330 


156 


KITQAWWCVHWPDNREAEVTGVIRLGR 
MRLPLEGACTNCVPSWEKEQDPVSHNEK 

K 


13610 


27511 


A 


1376S 


25 


292 


KCFFLSWRGGSRLQSQHFGRPBJRADHIjR 
PGVPDQPGQHGFLLVKMSDKPDIiSEVEK 
FDRSKLKKTNTEEKNTLPSKEIPFSLVG 
VNIQD 


13611 


27512 


A 


13759 


585 


680 


KCLGSRTRWLTPVIPTLWEAEAGGSLRP 
RSS 


13612 


27513 


A 


13770 


488 


359 


PSPRERKFLPFFKKKKGWGPPPPPKKNR 
GRGPQQKEGPQKPKP 


13613 


27514 


A 


13771 


794 


515 


PDMGLEDEQKMLTESGDPKEEEEEEEEL 
VDPLTTVRBQCEQLBKCVKaRERLELCD 
ERVSSRSHTEBDCTBBLFDFLHaRDHCV 
AHKLFNNLK 


13614 


27515 


A 


13772 


130 


397 


WGLTPLCFCMRASLPRHSKPQARKrKK 
KKKKKKKKKKKKRGGSUKKKIPGGAKKS 
GATKKKNFPKKRGQNKKNPGFFKNEIFF 
GGGAI 


13615 


27516 




13773 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNKIPLELFPSNQTFFHFVRNWKKRGNN 
WLSNYSI SSTVTRHFPTLPQP 


13616 


27517 


A 


13774 


1 


247 


GLQSLADLLSGLLQKVCLPLSNGIMDLY 
LLYYLFTFE'iGSCSVAQARVQWCEHGSL 
QPHTSASSDPPTSATTTCHHARLI IW 


13617 


27518 


A 


13775 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNKIPLELFPSNQTFFHFVRNWKKRGNN 
WLSNYSISSTVTRHFPTLFQF 


13618 


27519 


A 


13775 


107 


468 


EKARS PERftRDQEGGTERDRNS E RE KI L 
PKLREBLPWVSGGWRCPWPHQGWSHWED 
NPGWGIPTGPSVGWGEKKGPGEGRSHKY 
GTGRTKCELGVSIGNSAFTFLLHFYFKH 
RKRERKQI 


13619 


27520 


A 


13777 


125 


2 


NIFFPLFELYYFKCHWARWLTPVIPALW 
EAKEGGSPEVRSS 


13620 


27521 


A 


13778 


166 


74 


GRICYSHSLSKHTHTHTHTHTHTSHISF 

IP 


13621 


27522 


A 


13779 




341 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERLSFSLGGGGALKKKRIFLCVTLE 
GKHLPPLTHTGFFGENTHTLSMKTREGG 
F I CAGGD ERRS AS VMRAYI YRE C E PHTH 


13622 


27523 


A 


13780 


'614 


337 


RRCSSAICYRRHGNHKVKIRSKQGASVS 
PHEHLRI LSLLEVRNVGLGHS IFPHSLR 
MYVayiYVCMYVC ISIYLSIYLSIYLSIY 
LSISISSR 


13623 


27524 


A 


13781 


390 


158 


VLIRLSWGRICFQAHSGCWQNSGSCSHK 
TEGLCLLLAGSCSWLLGGALKGWASKEE 
FYTMQHS CQSDTLS PLPYS IH 


13624 


27525 


A 


13782 


308 


163 


TKFYLLFFFFFFFLFIFFFFFFFLKKKF 
LFF IQGGGKIKTPGTLTFLG 


13625 


27526 


A 


13783 


45 


277 


lASGRPFFFFLHLPPFQKAFVFGGGGGT 
PFEYQNFVAYIKGKQGNPFLFGCGDLFN 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predi 


ted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 




™ing 


ed end 


OCysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 




nucle- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 






otide 


G=Glycine, H==Histidine, I^Isoleucine, 








26 






location 


K=Lysine, L=Leucine, M=Methionine, 










ing to 


first 


correspon 


N=Asparagine, P=Proline, 














ding to 


Q=Glutamine, R=Arginine, S=Serine, 












eof 


last amino 


T=Threonine, V=Valine, 












acid 


W=Tryptophan, Y=Tyrosine, 










sequence 


residue of 


X=Unknown, *=Stop codon, /=possible 














peptide 


nucleotide deletion, \=possible 














sequence 


nucleotide insertion 














APQL I KQLS QLGQKYPKKPLL 


"13^6 


"27527 


— 


^3784 


413 




~2T8 


PNFPPFLFFSGKFFFFAQNLPFGFFPPP 
















GEGKAFPGQGCPVGGGGFAGPTFFFGPP 
















AFPEILYFF 










1011 






IDWENDLTPMVLCHGPSIYNKYSFSQNI 
















FPISANFFFIAHINLTLLQLVSELSHKI 
















ILRHFYIKPIKDILNLYLILPGVWLFDL 
















CKSTAICRYLCVCIYIHLYIHTHTHIHI 
















YTHIRL 


13628 


27529 


A 


13786 


161 




425 


RWGSCYCPGYPHLKIFKLLLLLVLVLEM 
















GVLPLLPRLE CS GMI TVHCNLE FLGTS Y 
















PPISVSQTKKKKRPFTLPPLPKTLQIGLP 
LRGE 


13629 


27530 


A 


13787 


70 




311 


DLISITALVISPNKFYIYLSIYLSIYLS 
















lYLSIIYLIFISTNLLWVLSLWETLNKT 
















RLLNLFLTIQNNIWNPKLLVSALI 


13630 


27531 


A 


13788 


25 




219 


RNMAAATLTSKLYSLLFRRTSTFALTII 
















VGVMFFERAFDQGiUJAIYDHVNEGKLWK 
















HIKHKYENK 


13631 


27532 


A 


13789 


279 






RRSLJOiS PRKDC6LQWRNLGSLQALLPG 
















FTPFSCLSLPSSWDYRPF 


13632 


27533 


A 


13790 


138 


28 


GGWVRWLTPVI PALWEARVGGLIiE PGSS 
KKERGKKEV 


13633 


27534 


A 


13791 


133 




7 


YSCQRCLTHGQAQWLTTVISAPWEAKftfi 
















RLLEPRSVRSAWAT 


13634 


27535 


A 


13792 


86 




290 


EHVHRTLIEASVSGWGWWLTPVISAAWE 
















AEVGGSLEGQEFEISLGKCSETPSLTTT 
















TTESLGSFAATS 


13635 


27536 


A 


13793 


150 




37 


KI I YPSQARWLMPVI PALCEAEVGRLLE 
















PSSSRLAWAV 


13636 


27537 


A 


13794 


216 




77 


PTQHPTEGYMIKQQTVYHECRMWaNSHL 
















CLLPEGLLRAVTPWCHAP 


13637 


27538 


A 


13795 


251 




347 


VAEAWGLAPVIPALWEAEVGGSPEGRSL 
















KPAL 


13638 


27539 


A 


13797 


177 




274 


VQWHMSWPAIQEAEVGGSPEPRSLRPA 
















WATY 


13639 


27540 


A 


13798 


307 




406 


KFKKGWAWWLMPVIPALWEAKAGGSPES 
















RSLRP 


13640 


27541 


A 


13799 


194 




3 


FFFFFFFFFEMGSHYIVQARVQWLFTDA 
















NIVHCSLQLLASSDPPVSTSQVGLQACA 
















DDAQNPE 


13641 


"27542 





j3gQQ 


333 




198 


LPSVFFFFFQTESCSVTQAGVEWCDLGS 
















LHSSLC-NRARLCLKIK 


13642 


^27543 





"13801 






^35 


ETEHGRERETERGRERETERGRERETER 
















GRERERERERERERERDREEGRERQRGE 
















GKORDREENRRRETERVGDWCVCPHP 
















NLI LNCS PY I SHVLREGLGKT YLNYGGS 
















FPTLCSG 


13643 


27544 


A 


13803 


14 




454 


RVF FDRS RYRS RT LGSTHASALLG I LVY 
















RSHLISSLLCLEGIILSLFIIATLITLN 
















THSLLMTIVPIAILVFAACEAAVGLALL 
















VSKKKKKGAPVLKNPWGAQSLRGQARKY 
















FFPYREPNKNLPGNLGKEPFLGGGDILG 
















QPPYKN 


13644 


27545 


A 


13804 


1474 




1661 


TDFHNSHLLKCCKNCIFSTLNYYIRREH 














FSIVFIFCYICWKVHYIRRELFRCLYF 
















LLYMCC 


13645 


27546 


A 


13805 


360 


3 


LWGKRGGPFSPPPFFFFLELFFFLKKMG 



1376 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Ij=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GGGFFGGFPWGPPKGEIFFFVFFWAPQK 
KKKKKPPPFIFWGGGPFFFFFFPPPKQK 
KKKNKKKKKKKQKKKKKKKKKHAAHWPA 
RSRSSA 


13646 




A 


13806 




425 


RLGGVALRSAADGAFVSGEFCGGKLLRW 
CliVTDFPPDSCTTCSYSRRSTPGCSPGG 
SRGLSEGEGSSVSLQESRVLSAMKHVLN 
LYLLGWLTLLSI FVRVMESLEGLLESP 
SPGTSWTTRSQUaiTEPTKGLPDHPSRS 
M 


13647 


27548 


A 


13807 


5 


313 


EKPYLQGITFTS KTTHLI YE FTP YT I IV 
KEYSTNYVLLIGNTQITKIYHIVFRKFS 
MTPIYKFQDLAQWLTPVIPTLWEAKPGG 
LLEPRSLQSSYSLIILIL 


13648 


27549 


A 


13 SOS 




314 


VITQLDKTERAQWIilklPVILVLYEAEVGG 
LLEflMSSRPAWATKTW 


13649 


27550 


A 


13809 


219 


1812 


LPPPESGAMSGFNFGGTGAPTGGFTFGT 
AKTATTTPATGPSPSTSGTGGPNFGAPP 
QPATSTPSTGLFSLATQTPATQTTGFTP 
GTATLASGGTGFFLGIGaSKUSTLSNTAA 
TPAMANPSGFGLGSSNITNAISSTVTSS 
QGTAPTGFVFGPSTTSVAPATTSGGFS F 
TGGSTAQPSGFNIGSAGNSAQPTAPATL 
PLTPATPAATTAGATQPAAPTPTATITS 
TGPSLFAS lATAPTSSATTGLSLCTPVT 
TAGAPTAGTQGFSLKAPGAASGTSTTTS 
TAATATATTTTSSSTTGFALNLKPLAPA 
GIPSNTAftAVTAPPGPGAAAGAAASSAM 
TYAQLESIiINKWSLELEDQERHFLQCJAT 
QVNAWDRTLIENGEKITSLHREVEKVKL 
DQKRLDQEIiDFIIiSQQKEIiEDLLSPLEE 
LVKEQSGTIYLQHADEEREKTYKLAENI 
DAQLKRMAQDLKDI lEHUSTTSGAPADTS 
DPLQQI CKIIJTAHMDSLQWIDQNSALLQ 
RKVEEVTKVCEGRRKEQERS FRITFD 


13650 


27551 


A 


13810 


134 


263 


KNSLFKKKNNGRPWPLMPVIPALWEAEA 
GGLLEPRGLRPTWVT 


13651 


27552 


A 


13811 


210 


51 


TLSHRKPISFEAYFARVRGFILEVSETR 
NPPIGWTRWLTPVI PALWEAEAGG 


13652 


27553 


A 


13812 


241 


139 


LHEAGLAPMI PALWEVKVGGLLEPRS PR 
PAWASW 


13653- 


27554 


A 


13813 


539 


256 


RTTQiyiSTJUiGFTEAPNWKQPRCPWTEEQ 
TNKMWSLHAMECASAMKWNEVLIQPAVR 
MARENSRRKPGDMHDIWSVWSAGEWLPG 
LGRSTGKGSD 


13654 


27555 


A 


13815 


63 


369 


VRETPLKTHYLEEISSPASPTAIPQSLL 
FSFFISPPSSLATGSGHSGHPVHSLHHP 
PETE PS PS VCLWAG PKVPPGAAGKGS PD 
SNPLVIRSLLAPPPASL 


13655 


27556 


A 


13817 


258. 


3 


AESAPPFSTNSLFFTHTHTHTHTHTHTH 
THSLRTTRWAPSQVCRPQTSQRGGECVG 
P TAFAP S PTLLKPHHPSSHVHLPS QPRR 


13656 


27557 


A 


13818 


516 


665 


WFKSGS FWLGMVAHTCNPSTLGGQGGWI 
TSGRSRTSVTTTTTSQTCAPA 


13657 


27558 


A 


13819 


295 


361 


WLTPVI PALLEAVTGGSLEPRS 


13658 


27559 


A 


13820 


166 


368 


GWWATQHSAQYWS SSS IGALCLGES CS 
VTQAGVQWCDLSSPQPSHPGFKRFLCLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26- 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13659 


27560 


A 


13821 


133 


530 


RKYCGQCLNMALNNVSLSSGDQRSRVAY 
RSSHGDLRPRASALAMVSGDGFLVSRPE 
AIHLGPRQAVRPSVRAESRRVDGGGRSP 
REPDGRGRSRQARFSPYPIPAVEPDLLR 
S VLQQRLIALGGVI AARl SV 


13660 


27561 


A 


13822 


76 


1958 


RQELIWPLCSPPQGDRFLQKSWIFFRPV 
MADKLTRIAIVNHDKCKPKKCRQECKKS 
CPWRMGKLCIEVTPQSKIAWI SETLCI 
GCGICIKKCPFGALS IVNLPSNLEKETT 
HRYCANAFKLHRLPIPRPGEVLGLVGTN 
GIGKSTALKILAGKQKPNLGKYDDPPDW 
QEILTYFRGSELQNYFTKILEDDLKAII 
KPQYVDQIPKAAKGTVGS I LDRKDETKT 
QAIVCQQLDLTHLKERNVEDLSGQELQR 
FACAWCIQKADIFMFDEPSSYLDVKQR 
LKAAITIRSLINPDRYIIWEHDLSVLD 
YLSDFICCLYGVPSAYGVVTMPFSVREG 
INI FLDGYVBTENLRFRBAS LVFKVAET 
ANEEEVKKMCMYKYPGMKKKMGEFELAI 
VAGEFTDSEIMVMLGENGTGKTTFIRML 
AGRLKPDEGGEVPVLNVSYKPQKISPKS 
TGSVEQLLHEKIRDAYTHPQFVTDVMKP 
LQIENI IDQEVQTLSGGELQRVALALCL 
GKPADVYLIDEPSAYLDSEQRLMAARW 
KB F I LHAKKTAF WKHDF IMAT YT , ADRV 
IVFDGVPSKKTVANSPQTLLAGMNKFLS 
QLEITFRRDPNNYRPRINKLNS IKDVEQ 
KKSGNYFFLDD 


13661 


27562 


A 


13823 


3 


292 


KCWDYVHEPPRPAPMQTYQVDIiRCCFVS 
TINLLVCGERASLPSVESLVFSGLAEVW 

MKAVLAPSRESGKLVLQQVPEYAEAIWG 

AHPACHGLVFSA 


13662 


27563 


A 


13824 


245 


91 


"ykwfgpprgsffrpppffffevfhfff^ 
iifffffffffffsfflgcfmyv 


13663 


27564 


A 


13825 


276 


100 


PGQKKKTLSPKKKKSIYLYIHTHIYIYT 
HTHYIYKYIFVCVYICVS lYIHTHIVPR 
IFS 


13664 


27565 


A 


13826 


202 


375 


lYKTYIHTHTHTHTHTHNVYLYIQSGDC 
lYAVLLGELSFIYI FVmSSLISILFHL 
TY 


13665 


27566 


A 


13827 


278 


34 


KPRGLSPAGVSAQRAWGPHSCPRLPDPR 
TELQVRQIPALVGSSAPQGRRGRKGPVG 
GSGPSRPRAPPQPRPGCCQCTLLQI 


13666 


27567 


A 


13828 


62 


307 


GGARAIFLNVPHLKRPMFFLELCtSTKC 
PVGTEFALDSLLRLYNSNTFCMVQWLMP 
TIPALCKAEMGGLLEARSLIPAWAT 


13667 


27568 


A 


13829 


386 


478 


NRIGMVSHACNPSTLGGQGRRIMRSGDQ 
DHS 


13668 


27569 


A 


13830 


364 


484 


EVRGSGPAANMV 


13669 


27570 


A 


13831 


33 


486 


PARSAEFGTRRERERERERERERERERE 
RERERERERERERERERERERERERERG 
VICVCPCALLCVRESDPLSQKKYCVCVF 
WRYTESLALSPFCLGRCLALLIFLEGM 
WRKIWAP S FLS L CVGALFFS QHPP PFWC 
LCVLSLVRPLPLSLSGGVFVSQRALI 


13670 


27571 


A 


13832 


6 


718 


YSAVE FAMAGVGAGPLRAMGRQALLLLA 
LCATGAQGLYFHIGETEKRCFIEEIPDE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ingto first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutan)ine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unl<;nown, *=Stop codon, /=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














TMVIGNYRTQMWDKQKEVFLPSTPGLGM 
HVEVKDPDGKMLQWLSRQYGSEGRFTF 
TSHTPGDHQICLHSNSTRMAIFAGGKLR 
VHLDIQVGEHAOTTYPEIftAKDKLTELQL 
RAEQLLDQVEQ IQKEQDYQRYREERFRL 
TSESTNQRVLWWS lAQTVILILTGIWQM 
RHLKSFFEAKKLV 


13671 


27372 


A 


13833 


17 


130 


RLQEFGTRRERERERERERERERERERE 
RDTHSFLHG 


13672 


27573 


A 


13834 


283 


460 


STLGGQGGRTARGQEFKTSLDNIARPPS 
LQK 


13673 


27574 


A 


13835 


3 


435 


WPRFCTALQEFGTRRERERERERERERE 
RERERERERERERERERATSLSRARPSF 
LFPSACVSHTPYVGERENLS LHTLKHTH 
TREKHSLSYTNASARDTRARFSEGRAPP 
lYSVYLTHGVFFFFFFSLCLSVRHAEGL 
AHV 


13674 


27575 


A 


13837 


351 


57 


RTLVHISKFTGNLSDLYYVQSEWQDLGT 
ADLS IMTS VIHCLPSLES IKNTDSWPSS 
YftHTCNPSTLGGRGRCRDQHGQHSKQPS 
LLKIQKIARHCGG 


13675 


27576 


A 


13838 


336 


150 


TFCYKKYLWGAHCFIFRTTGLPIKNFYF 
GQAWCLLPVISARWEAETGGSLEPRSLR 
VAWAS 


13676 


27577 


A 


13839 


310 


454 


HCSLGLPQVLLIILYKILLGWARWLTPV 
I PALWEVEAGGLLEARSSRP 


13677 


27578 


A 


13840 


153 


297 


I IFFLLDLKLQQAMTI YVS I YLS I YLS I 
YLS lYLYTHTLMNTLQL 


13678 


27579 


A 


13841 


33 


316 


LDQHPTPRSPLLCHSLRKTSSSQGGKSE 
LVKQSLKKPBCLPEQRFDAPEDSHLEKEP 
LEKFPDDVNPVTKEKQQPRGPEPTRYaD 
WERKGRCIDF 


13679 


27580 


A 


13842 


4474 


2586 


DGGSGCVKMEFPdeNDNYLTITGPSHPF 
LSGAETFHTPSLGDEEFEIPPISLDSDP 
SLAVSDWGHFDDLADPSSSQDGSFSAQ 
YGVQTLDMPVGMTHGLMEQGGGLLSGGL 
TMDLDHSIGTQYSANPPVTIDVPMTDMT 
SGLMGHSQLTTIDQSELSSQLGLSLG6G 
TILPPAQSPEDRLSTTPSPTSSLHEDGV 
EDFRRQLPSQKTVWEAGKKQKAPKKEK 
KKDPNEPQKPVSAYALFFRDTQAAIKGQ 
NPNATFGEVSKIVASMWDSLGEEQKQIY 
KRKTEAAKKEYLKALAAYKDNQECQATV 
ETVELDPAPPSQTPSPPPMATVDPASPA 
PASIEPPALSPSIWNSTLSSYVANQAS 
SGAGGQPNITKLIITKQMLPSSITMSQG 
GMVTVIPATWTSRGLQLGQTSTATIQP 
SQQAQIVTRSVLQAAAAAAAAASMQLPP 
PRLQPPPLQQMPQPPTQQQVTILQQPPP 
LQAMQQPPPQKVRINLQQQPPPLQIKSV 
PLPTLKMQTTLVPPTVESSPEEPMNNSP 
EAHTVEAPSPETICEMITDWPEVESPS 
QMDVELVSGSPVALSPQPRCVRSGCENP 
PIVSKDWDNEYCSNECWKHCRDVFLAW 
VASKNSNTWFVK 


13680 


27581 


A 


13843 


54 


251 


EFYRMNSPSPLWLCVCVCVCACLFMCLC 
ALTCMCVWYKSLLHSNLTQVLSSKFYDS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 

iTptidr^ 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ISPAENPGK 


13681 


27582 


A 


13844 


154 


26 


DPHVPPTERSEGSQARWLPPVI PTLWEA 
EACGSPEVKSSNST 


13682 


27583 


A 


13845 


475 


225 


GHTELYGGALFQLLQAAIPEGGGTSCLF 
KPSSDGGSSAAGGWGGAGDLARSSRVP 
SPTSSRPVPSPDPSPPSLYLTPWKETH 


13683 


27584 


A 


13846 


'!79 


319 


RDCRRSLTHSVLSGAQAGVQWRDLGSLQ 
PPPPSRLPWPPKAPRWQPLPGHHPI 


13684 


27585 


A 


13847 


182 


499 


LLCVKLCDSCNENKALREQTEAQVC2GGH 
SRAQVQQVQGGHSAHTWQIRSAfiSRRK 
AFSTCSSHLGMVLLFYQTGSSTYMRPTT 
RYSPLEGRLAAVFYS ILIPTLN 


13685 


27586 


A 


13848 


388 


494 


rdswerwleplipvlweaCjagrlleprs 

LRLAWAT 


13686 


27587 


A 


13849 


198 


63 


HHATTWMNLEDIMFRKVSQTQKNKYYMI 

pliwktkkvlshreqk 


13687 


27588 


A 


13850 


555 


353 


RCPPGAQAPLPAPSPAPPVHIALSPLSC. 
GVSWPRELTAHPPEPPPPPFFKIHPHPQ 
LSVGKRGQMTP 


13688 


27589 


A 


13851 


169 


66 


LLAQSTGQARWLTPVIPELWKAEVGGS L 
EIRSSR 


13689 


27590 


A 


13853 


12 


105 


I ASGLHDFFKKKKI KKKKKKKKKKKKGG 
AL 


13690 


27591 


A 


13854 


256 


363 


ICLFIKDSDRGQTQWLTPVILTLWKAEA 
GGSLEPKS 


13691 


27592 


A 


13856 


23 


436 


IDRLSDSHEERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERES SLLCVEHTCFAPPT YI FLWETLR 
VCPTPCVWGALSVFSHTERARIFFYTCQ 
GSVARAPVCAHTRIFSSRARRRALCARP 
PLSRERRAHTRVFFI 


13692 


27593 


A 


13857 


91 


428 


PDPDTSPLTTDRGETPLGATLPSCYCGG 
QQSFEPKTKKTTNQKKKKKGKKFTPQKK 
NPKIFPPKPFCPKGPRGENQPKKTQNQP 
KKKKKILGGGPAPNS PLGGGKKNSWAGF 


13693 


27594 


A 


13859 


465 


43 


KRVTTENPOHICSAETAIPWSLSQNCNF 
HILWWGHPNHPIYITTSIYIYIYIPHIY 
GMVYIYTRICMCVCIYVHICIHTHTHIY 
I YIHTHTHI YMERERETVSCSVAQAGVQ 
WCDLGSLQPPNKRLKRFSCLSLPSSWDY 


13694 


27595 


A 


13860 


290 


52 


WYKYLPGTRGFLIWLNYLKCIGRIELLV 
GYPSFIYLSIYLSIYLSIYLSSRSIAQP 
GVKRCNHSSLQPRTPGLMHLSNS 


13695 


27596 


A 


13861 


108 


22 


I FPLFSAVKTRFS YT^PKEFPYRMNHVS 

I 


13696 


27597 


A 


13862 


167 


33 


VFTVPGTYMCVSVRDLGQAWWLTPVIPA 
LWEAEAGGSIBEFIiRT 


13697 


27598 


A 


13863 


448 


355 


FFFFFFFFFFFIFFFFFYFFFFFFFAKN 

PLN 


13698 


27599 


A 


13864 


321 


445 


S QS TQAWCLMPVI PALWETKSS ESFEPR 
SLRPAWATGHNPI 


13699 


27600 


A 


13865 


380 


80 


MILSPSPKFPSFLFSLFSALLLWHCLEL 
PFLSNVGKGMQNDPVGI.SPTSTPSDI.SR 
lYLFFCFFVCLFVCFETGFPSVAQAGVQ 
WCDPL6LKWFLPPQS P 


13700 


27601 


A 


13866 


372 


264 


KMENPEDKNFFIFFMFFIVFLLFFFFFL 
FCLTQFP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


■ M 
eth 
od 


SEQ ro 

NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 

ding to*" 
last amino 
acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine,V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13701 


27602 


A 


13867 


161 


304 


LFIAEMTSHYIAQAGLELLASSNPPTSA 
SERTGITDGATEILPQLRL 


13702 


27603 


A 


13868 


230 


348 


VSFIQNLVFCVKRVYRVPDFGVWERGSK 
YUNGSTELHSK 


13703 


27604 


A 


13869 


168 


260 


EVIAECYNGCFQTLSLCVCVCVCVCVCV 

CY 


13704 


27605 


- 


13870 


140 


16 


LKYGQWRDLGSLQPPPPRFKQFSCLSLL 

SSRIICTTMTRDHE 


13705 


27606 


A 


13871 


143 


30 


KI LKWi/SQAWWL T PVI PALWEAEVCGS 
LEPRISRLA 


13706 


27607 


A 


13872 


158 


59 


HLHGCIHLLKLRIjDAVSHACasrSSTLGGQ 
GGRIA 


13707 


27608 


A 


13873 


232 


115 


GKPGNDVGVEG 


13708 


27609 


A 


13874 


1619 


1375 


KTTWRHHLEKLQDMSLLRHRVHVDSSQG 
LFPFFQRQGLLPRLKCNGDIIPYCNLKL 
LGWSNSPASAFRVARTTSLCRHTWL 


13709 


27610 


A 


13875 


219 


84 


NPLQSTST IC3IRAWDRVTCLWSQLLRRL 
RWEDHLSPGVPSCSAL 


13710 


27611 


A 


13876 


166 


496 


KKKKKKKKKGGGPLKKTPGQRKINR6RK 
KKIFFQKGGQKKTPGGILKKKPFLGGGK 
KGPNPPKKIKGLREKKKFKRGKGAKPAQ 
NPWGKKISPPGFFIiKKFFPRGRGFFI 


13711 


27612 


A 


13877 


132 


348 


PSKKKKRGGPFKRTPGGPKFNGGGKGNS 
FPLMGGGIKPNLGILGKNPYLGGGTNGN 
NPPTEIKGFGEKKKF 


13712 


27613 


A 


13878 


287 


134 


QQGPPISPGPKEENRMGPPSPGWGNPGI 

RSPPGAGPNPGAFQDGMPPSKKQ 


13713 


27614 


A 


13879 


333 


480 


VHPLRSAEGPRPESNMLHLTLKSKQQWL 
TPVIPALWEAKAGGSQGQEIE 


13714 


27615 


A 


13880 


I 


227 


PKIHCKIQHIWLVSFFPSFB'LSFFQGE 
SCSAAQAGVQWHDLGSLQAPPPGFKWFR 
FIELLGLGFHSRRMRGKDS 


13715 


27616 


A 


13881 


276 


378 


GWPWWLTPVIPALLEAEVGGLLEPRSLR 
LALAT 


13716 


27617 


A 


13882 


126 


389 


GSVLGTGCCGSVLGTVCRGS VLGTVL lA 
SATPDRYLRLSSGSSLTEITNSDLVPWL 
TPVIPALWEAEAGVSLEPRIWRPARETW 
EDPH 


13717 


27618 


A 


13883 


414 


150 


ARQAPKWGNPTGSPPPGFLWPPPKNFFL 
GPQKKKIFFPTPPPPHFFFFKGPPPFFF 
FFFFFFFSPHASGILLCHPGRIAVAQSV/ 
LTE 


13718 


27619 


A 


13884 


197 


363 


DLGWWNTFMLKFKCLNVSRGHVQWLI 
IPVIPILWKAEAGRSLKPRSLRPAWAT 


13719 


27620 


A 


13885 


101 


236 


KIHYANFPYFGLACPEYIVRALQKKYTQ 
THTHTHTHTHTRTTHSL 


13720 


27621 


A 


13887 


17 


140 


TVHLKMVKIGLHMVAHACNPSTLGGRGG 
PIMRSGALGNFCL 


13721 


27622 


A 


13888 


135 


1 


VLHAGL KLLGS S T S PTSAS KGAGT VGIH 
YHTQLAFQFFLFCFLIR 


13722 


27623 


A 


13889 


411 


276 


TLYFKHGEt1GFHHVGQ7\GLELLASSDPP 
ALEFETSLHSKVSPRL 


13723 


27624 


A 


13890 


216 


54 


EFKLCYTLFDHSYFPLEELVSGQVWRLR 
PAIPSFWKAKEGGSLEPRSSKHEIV 


13724 


27625 


A 


13891 


3 


149 


SSDPRPSSSWLAWVGLWSMVWRVPPFLL 
PILFLASHVGKSPLSPSPHF 
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SEQID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predlct- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13725 


27626 


A 


13892 


166 


1 


LFSKIFWWLLPVIPALREAKVGELLEPG 
SSRPAWATWKNPVYTKNTKISQAWQCT • 


13726 


27627 


A 


13893 


128 


2 


FSTTYPNPIKLPHPYLPLLTLFLDSAHL 

HPSEINSLV/iQKKK 


13727 


27628 A 


13894 


225 


391 


HLSIHILTt'PFSLPEQSWLVPIlPVLKE 
1/JMGGLLEPRSSRPAWATQQDPQVSRK 


13728 


27629 


A 


13895 


192 


1 


TLVCVSSSVPKWYNNICLACWFMPI IPA 
LWEVESGSLLELKSSPPAWPTWWVPLKS 
KNTKISLA 


13729 


27630 


A 


13896 


152 


.245 


DQSIADVLMYYTHTHTHTHTHTHTLQNQ 
VP3 


13730 


27631 


A 


13897 


270 


406 


ISTRFFLYRDRASLCHPGWSAGVQSWLP 
GSSNSQAQAILLPQPPE 


13731 




A 


13898 


232 


73 


EAEGPRDVTSSTFHWELYDQTTtJYHECR 
MWANSRLRLTPEGLLKVITPWCHAH 


13732 


27633 


A 


13899 


107 


7 


LGQAQWLMPI IPALWE?iKAGGSLEPRTS 
RPAWA 


13733 


27634 


A 


13900 


62 


224 


P'i'RPIIiAHCNLCVl.GSDDPPASASQSAG 
VTGMSHCaWSNLI IHLPSRSTAICTL 


13734 


27635 


A 


13901 


450 


194 


KRQVKSHSEVKTTHVQMPGI DPGS PAWE 
ARILPLNHQRLSGTTPMRITEKSNHKDL 
ETVLSGFFKCRLKANKDIQTKCVFIGNF 
Y 


13735 


27636 


A 


13902 


300 


404 


AAPGRAPQKKKKKKKKKKKKKKKKKKAP 
FIKRGR 


13736 


27637 


A 


13903 


131 


29 


ALGSLQPLSPRFKRFSCLSLPSSWNSTA 


13737 


27638 


A 


13904 


267 


74 


VWVLVPPFTSYGVLDKPILIVHNIYVHV 
CVCLCMCVCVCVCVCCTVIRRHCWHNMI 

IIISQHDQ 


13738 


27639 


A 


13905 


193 


64 


IRSHQIHKLHFKKFRILGWTWWLTPVI P 
APWEAEEGRSPEPSS 


13739 


27640 


A 


13906 


309 


424 


QSTDFLEKRRVGWAWWLTPVIPALWEAE 

VGGLPEIKNS 


13740 


27641 


A 


13907 


404 


251 


SCLISLRSWDYRHVPPQPAWFVFFFFEM 
ESCSVAQSGVQWQEEnSISKINK 


13741 


27642 


A 


13908 


471 


59 


GPPQAKKGXGFPTPGPPQKKGGGQPKSG 
VLGFSPPPLGPCPGLGPPKGWGFWGGSF 
FFFFFF3KWEHICNELFPFSKTEKPREAG 
EARKGSQSTKYAGLSRGERQASPTLRPG 
HLAKASAGGWLQY-RWQGKLPEPGN 


13742 


27643 


A 


13909 


479 


231 


QEYCMLIFCEHFGDFDLWIVFCLEKKMV 
GVIADFYLLSSFTGRPQWLMPVIPTLWE 
ABVGGSLEAKSFTPAWATWQDPISMF 


13743 


27644 


A 


13910 


167 


19 


CIVLGIMLDSLWEPRQWVPSWVRWFTPV 
IPALWEAEAGGSPEVRSSNSS 


13744 


27645 


A 


13911 


328 


446 


GSFSKITDLSQTWWYVPWLALQESEVG 
RSLEPRSSRLQ 


13745 


27646 


A 


13912 


145 


460 


HAFMSTGCEVLSYSELRQRMLVWWFLPV 
IPATPQAEftGRLLSPRSSRPPWATKGVR 
ALSHKQTKTKTNiCLSHSTGVSFVFTHHI 
HSPGILWRLSQWLLaMNLATN 


13746 


27647 


A 


13913 


319 


210 


KLGFFKtePPLFFFFFFFFFYFPFFFGFL 
TVSCNFTL 


13747 


27648 


A 


13914 


372 


10 


PKTTCTPITYGKKCCPREISYYLCHEFI 
RIRILTFFDGESCYIAQAGLKLLGSRSP 
PTSASGVAGNTDVCHHTQPILFYILETG 
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SEQH) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutainic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LALLPRLILNSWSQAILLPPQPLRSYFN 
FWLFFCLYN 


13748 


27649 


A 


13915 


594 


471 


PSYIAYMCICMCVCVYVYINICVCIYTH 
VHTPYNPVISKtr 


13749 


27650 


A 


13916 


235 


3 


EAKNLIRKTYTSGWFQRTSKRPPAVRRM 
KVLFFETESCSVAPVGVEWHDHSLLQPR 
PPGLHRSSHLSFPSSSDYRWA 


13750 


27651 




13917 


272 


184 


KTTftWAQWLTPVISTLWEAKVGGSLEAG 


13751 


27652 


A 


13918 


46 


321 


SRVQC3TC3C3HRC3RC3CIQft3VGEQKAKRET 
EGPPWRWLTPVISKLWEAKAGQSLBPVS 
SRRATWNLLKRIYYTDBRBYLIPLMiKT 
SQGSTFSF 


13752 


27653 


A 


13919 


387 


484 


F lEI IGWARWLMPVIPTLlteAEAfiGSPE 
VRSS 


13753 


27654 


A 


13920 


142 


321 


FLFS IHFEILS CWGLFSNVWQSLSIFS YL 
KVRLYGQAWWFTPIILALWEDKftGGSLD 
PRSL 


13754 


27655 


A 


13921 


270 


412 


IEDKKIYSILHTETI.ERGQaRWLTPVIP 
AFWEVKAGGSFEPRSLRPV 


13755 


27656 


A 


13922 


304 


396 


LRS 


13756 


27657 


A 


13923 


305 


387 


VMWLTPVIPALWEAEAGGSPEPRSSRP 


13757 


27658 


A 


13924 


103 


3 


NKSPELGWAWVJLAPVIPALWEAEAGGSP 
EVRSS 


13758 


27659 


A 


13925 


195 


400 


CLLILSGLVASCI lERHGKGQAWWLMPV 
I P TLWEAKVGGS LE LRWLRS AWGTWENS 
ISIKYTEQLAQG 


13759 


27660 


A 


13926 


294 


489 


KWAKMQTLHKGRYACVCICVCVCVCVYT 
YTDIYIFYIHIWPMNTKCSMLLVIGEMK 
ILSRMWSNQ 


13760 


27661 


A 


13927 


363 


2 


NWGPPGFPPPPFLKTGPVFFIFGAPKKK 
FFLS TPRALKFVLLKGGPLFFFFFFLWV 
FLVKTGSHCVAQPDLELLSSSNPVLVSQ 
GAGITGMSPQVQPLDLFFFFSFFETGSR 
SYAWADAW 


13761 


27662 


A 


13928 


1 


218 


LRFKKRKTNKPKMWSVHTMEYHSALKRK 
EIRTQTTRWINLEDIVLSEIRVAVTKGR 
I L YDS TQMRNLEESNL 


13762 


27663 


A 


13929 


219 


408 


TFCHFIAFLHVYKMFWFFFSFIMFFFP 
LNCNKAFERKKKKNiOKNNKKKKKRKKIS 
LSPPAPP 


13763 


27664 


A 


13930 


161 


22 


WHPPASASHVAGTTGACHHERLRQEDHF 
SLRVQGCSEPRLCHHTPP 


13764 


27665 


A 


13931 


116 


T 1 


INTGWARWLTPVIPAFWEAAVGGSLEPR 
STSQAWATWK 


13765 


27666 


A 


13932 


181 


38 


PWPQRATQPKISPLPPAISALWEAEAGG 
ALEPRSSRPAWAKSDAWVD 


13766 


27667 


A 


13933 


267 


397 


TVFDRTLFFIKKI IHSWAPWLTP WPAL 
WEAEVCGSPEVRSSG 


13767 


27668 


A 


13934 


129 


12 


NIGVGWALWLTSVIPALWEAEAGGPPKV 
RSSRTARPMS 


13768 


27669 


A 


13935 


179 


1 


KTYYGGIPPPGPPKKGEKKPPFFFKKFF 
LFFISFFIFFFFFFLRQSLAKMQSASW 

KVS 


13769 


27670 


A 


13936 


245 


616 


DDKKKKEAAQKKATEQKIKVPEQIKPSV 
SQPQPANSNNGTSTATSTNNIIAKRATAN 
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SEQD) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanin6 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, F=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TIireonine, V=Valine, 
W=TryptopIian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NQQPQQQQQQQQPQQQQPQQQPQPQPQQ 
QQPQQQPQALPRYPREVPPRFEHQEHKQ 
LLKRGQHFPVEF 


13770 


27671 


A 


13937 


121 


292 


NTDLDLftLALIVRVLTSDDNFKKWGWVQ 
WLMPVFPAPWEAEVGGSSEATNLRPAWA 
T 


13771 


27672 


A 


13938 


421 


202 


GPREAPLFPSTQGISRPLFFFFFFETES 
RTVTRARAQWHirLGSPQPPLCSTjDILKT 
SQSVCMPEAAILVSQIT 


13772 


27673 


A 


13939 


205 


543 


ILESHRRLGWSEALPENQLLCRASSWES 
LGPPDPPCLKPPSPAFWGftAGGWRAKST 
SVSEPVPVGTLLVALFLEMAVGIiLLGLP 
?C?HSLGAPHLLCEPDGPSVLPAEARQS 


13773 


27674 


A 


13940 


140 


2 


LSQHSEEYTLPPPLYLEKESSRPPPPQH 
TPHPPPPWPMHPDIftDIQ 


13774 


27675 


A 


13941 


47 


303 


GLIFLIIWTCIVKTSTDFPRMEDCSQCI 
HQVTEESNKRMGFLSYYANPHHGSSRLL 
WPQHftAPWDDGRRGKPVFSLGFVSFPFP 
Q 


13775 


27676 


A 


13942 


I48S 


1719 


KLHVLSSNLSSPAEVTWaSINLWSEQ 
SFQNSQLPAMLCLIVHLQMS 


13776 


27677 




13943 


127 


402 


QPVTHPPNRTMCSSMIFLFLYLCFLSLS 














KKIHFPPGGGRI IFFY 


13777 


27678 


A 


13944 


852 


209 


EQTVYCTRIDLQQKLVFLRLEAFCLLSQ 
GDPGSPMMCQLQQFDLWVLRGVINFGGE 
TCPGLFLYTKVEDYSKWITSKAERAGPP 
LS S LHHWEKLIS FSHHGPNATMTQKT YS 
DSELGHVGSYLQGQRRTITHSRLGNSSR 
DSLDVREKDVKESGRSPEASVQPLYYDY 
YGGEVGEGRI FAGQNRLYQPEE I ILGSL 
RACFLFAAVSSPGATPPN 


13778 


27679 


A 


13945 


68 


260 


INIFWHCVLLYVILYAISQFLLFPCVKN 
lASWLGSVAHACNPSTLGGQAGESIiEPR 
NLRPAWAT 


13779 


27680 


A 


13946 


420 


156 


FLPPGVKYGSFKRAPPFFFFFFFFLVET 
LSCYIAHTGLELPDSSNYPTSASQSAVI 
TGMSHHTWPLEGACLAIPQALRILLVLV 

PLH 


13780 


27681 


A 


13947 


317 


17 


AGHGGSTCNPSQPIVTAVPINLWGRDLL 
QQWGAQAFIPEQLYSPQSQHTMQEMGYV 
PGMGIiEKNLKGLKKPLQAEGQNSHQGLG 
YNFSWQPLLSLQNLYL 


13781 


27682 


A 


13948 


55 


224 


NFRLTSS FS YLKMS FWLGMWLTPVI PAL 
WEAKTGGLFEARSLRLVWWQSKKAGSHL 


13782 


27683 


A 


13949 


1464 


805 


RATS VRGAGRERS CGAAWS PAS I GP FLR 
RSVLPNAHFRSSREGGMAASTDMAGLEE 
SFRKFAIHGDPKASGQEMNGKNWAKLCK 
DCKVADGKSVTGTDVDIVFSKVKGKSAH 
VINYEEFKKALEELATKHFKGKSKEEAF 
DAI CQLVAGKEPAWVGVTKAKTGGAVDR 
LTDTSRYTGSHKERFDESGKGKGIAGRQ 
D I LDDSGYVSAYKNAGTYDAKVKK 


13783 


27684 


A 


13950 


356 


98 


NAGPGI YFWGP I KKI LP CP P AGVKLGSL 
KEAPLFFFFVEMGFHHVAQAGLKLLSSR 



1384 



wo 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

TeTuete 


Predict- 
ed end 
nucle- 
otide 
location 

ding to'' 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=lVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thr6onine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DSPIPVSQSIRITSTCEPPCLAKIKSSL 
RL 


13784 


27685 


A 


13951 


194 


48 


NFLEGWPGQRLTPVIPALWKAEAGGSLE 
PKTFRPVWAMWRDELEKPQH 


13785 


27685 


A 


13952 


341 


418 


DQPGQHGETPS ILKIQKLAGEGSTCL 


13786 


27687 


A 


13953 


158 


323 


TWLKKYFQSTNQLPPftHLLNVQPLLPFG 
LPLLPLKKKKKKKKKKKKKKKKKKIKK 


13787 


27688 


A 


13954 


231 


355 


VFLPQKQKQNQCPLTGKWVHNSWNIHTL 
EYYSAVKGMEQLI 


13788 


27689 


A 


13955 


493 


39 


PAAFSSWGWAPGSSLLKKTTPASWPPGL 
LPPAPVSSMAftPDLDSFPLPLCWTFLRV 
LCRVFSSSTLAFLSCVSTSFGALTPAAS 
SNI FS ISLPHWASPPLPTARI^rSAVSPE 
PLPPLPSRLQRQPRGAPSHPGMSMGP IL 
DATSNSTTRSR 


13789 


27690 


A 


13956 


497 


372 


NTKISWASWYAPVIPATWEVSNSRPQAI 
HSHQPPKRIGLQA 


13790 


27691 


A 


13957 


438 


301 


GTTYLDIDMTCDYVSVCLCIYICTRQQS 
YIYTYVYTHTHTHTHMYI 


13791 


27692 


A 


13958 


84 


466 


QPLGRSGKVPQLHLWEMQRLPAAFERSF 
AAQGL3ESVHINTPTSLSLR3RPFISSS 
GPGARARVTCAPCLHAflRILCCFWRGLH 
RLKCSNVISAHCNLRFSGSSDSPASASQ 
VAGITGRSSLEQLLE 


13792 


27693 


A 


13959 


216 


28 


KYVKNYVLSS IIFWMQEIFHNFKKSSRT 
QWLTPVIPALWEAKAGGSPEPRSSRPAS 
SIQLENS 


13793 


27694 


A 


13960 


320 


943 


VLSFSLSDRAQRCNCRMGKQNSKLAPEV 
MEDLVKS TEFNEHELKQWYKGFLKDCPS 
GRLNLEEFQQLYVKFFPYGDASKFAQHA 
FRTFDKHGDGTIDFREFICALS ITSRGS 
FEQKLNWAFNMYDLDGDGKITRVEMLEI 
I EAI YKMVGTVIMMKMNEDGLTPEQRVD 
KIFSKMDKNKDDQITLDEFKEAAKSDPS 
IVLLLQCDIQK 


13794 


27695 


A 


13961 


191 


8 


GLFRFRRLSEEVKHCCCCCCCCCCCCCC 
CCCCCCCCCCCYLRLAPQSPAAPAPELH 


13795 


27696 


A 


13962 


203 


28 


VWGNQHFCFVPVFRKKVFCFFPLNKLLR 
GS 


13796 


27697 


A 


13963 


18 


171 


GIRHEERERERERERERERERERERERH 
PAREIDMCVSKRDTRGALCALF 


13797 


27698 


A 


13964 


25 


299 


HDFCTRRERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERERERERERERETHTLSLYIYMCV 
FLYIPLYFS FLYIHTQREAPPPLSATLS 
LSIL 


13798 


27699 


A 


13965 


246 


410 


AYNGQGSRGVTPVIPSLWEAKTGGSLEP 
RSWRSAWATWHTPPVPGPSKEISLYI 


13799 


27700 


A 


13966 


91 


26 


FCCCGCCCCCCFYCKKEKTLV 


13800 


27701 


A 


13967 


158 


24 


PFCHIYIYFLESCFVAMLECSGKILftHC 
YLRLLGSSNSLVPSLA 


13801 


27702 


A 


13968 


129 


7 


SWAQWLMPIIPTLWEHEAGHSLEAMSSR 
PAWATQQDPVSKK 


13802 


27703 


A 


13969 


445 


268 


KIQLTJCLKNASESLPSRINQAEEKISGW 
AWCLMPVIPVLWEAKAGGSLEPESLRRG 
QVT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 

otlde 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13803 


27704 


A 


13970 


1628 


1309 


HRRKKCAGYSSLSSALLRERERERERER 
ERERES VCVCVLGT ITCFRRQYCVIQG I 
SGVRQNCVQIPALSDPDLFLFLSLSVLH 
SNTWWRPTLRVTERNKC3AMCV 


13804 


27705 


A 


13971 


314 


140 


QCHNCYILTQLVKIKGDQAWWLMPIIPA 
LGAKRGGSLEAGSLRSAWATWRAPMSTK 

IF 


13805 


27706 


A 


13972 


292 


482 


YKSVRTQDSCAHGVDRLVGKIRYVNLFV 
YICQHSDGRTWWLTPVIPPLWETKGGGS 
SEVGRSG 


13806 


27707 


A 


13973 


502 


303 


STAIGPSFSPFLCTHGSFHGFLFFFFCF 
CFFLVEMGSHYVAPAGLKLLGSSDPPAS 
ASPSHLGLQV 


13807 


27708 


A 


13974 


493 


289 


IPPTALGTSFSPFLCTHGSFHGFLFFFF 
CFCFFLVEMGSHYVAPAGLKLLGSSDPP 
ASASPSHLGLQV 


13808 


27709 




13975 


405 


121 


IREAAQSESLFVRAAPHTGLISGPCITE 
lYAVKDTRCIMQRRILVEELECAaHGLTE 

CLS IDTHTHTHTHTHTHTHTHTHTHTHT 

HGTRWEAGQSLKGES 


13809 


27710 




13976 


135 




WTCTN3SWSLT?TSFLFLMPGWIQKGR 
RVCVCVCVCVC 


13810 


27711 


A 


13977 


179 


58 


VPLSINSPQEQCIFCCCCCCCCCFLVLQ 
KIISEESWAEIG 


13811 


27712 


A 


13978 


97 


186 


KI IFWLGWAHACNPSTLGGQGGRIMRG 
RD 


13812 


27713 


A 


13979 


289 


420 


elgi.kihgfpsqafwfmpiipalwe:aka 
ggllesrssriawamw 


13813 


27714 


A 


13980 


421 


106 


SGGENFFFFLGGGFYKKKFPPPPFLRGK 
KKKPPLPKKKNPTPIFFPPOaaPPFSPP 
PKMGGGKIFFFFGGKNFIMQNFaGFPPL 
WGKKKKFLPKKKNPLFFFFFF 


13814 


27715 


A 


13981 


309 


135 


KH I YI YLS lYLS I YLS lYLS lYLPAYLP 
TYUJAHPLIHANAYKNIHIDYPFKKGFE 
Y 


13815 


27716 


A 


13982 


231 


21 


QHMYCLKSIWHKKLSRKPYCYIISEYSS 
MVKIRVSISQVQWLMPVIPALWEAKAGG 
LLKARSSRPMWAI 


13816 


27717 


A 


13983 


209 


359 


HLHTHTHTHTPNGVKQIMTQIPILPLTG 
PLILEKFLNFSKPLSHPGKRDP 


13817 


27718 


A 


13984 


391 


249 


IWP ILPPQKI FFFFFETESCSVTQARVQ 
WCNLGSLQPPNPVFPSLAK 


13818 


27719 


A 


13985 


181 


403 


YLVLDSPESTLGILSQPKDYITKDQTLT 
FRPVWAHACNPSTLGSQGKWITSVQEV 
ETSLCNTARPCLYQKYKN 


13819 


27720 


A 


13986 


441 


358 


ETGSRCVILAGAQWRDLSSVQPLPPRFK 


13820 


27721 


A 


13987 


7 


233 


flSIPCIYQKTKIPNTKQTIOMKNTKITPA 
WWCTPIAPATQKAEIPPLHSSLGDFLRL 
SPKKKKGGPNWAPTPSKFF 


13821 


27722 


A 


13988 


152 


44 


PKAPLFFFFFFFFFFYETRSHSVAQAKV 
QRHDHGSL 


13822 


27723 


A 


13989 


188 


379 


ms CHAAETRVWWEKERPLPVPFGAYLL 
DGMPLVWLLSASSILSIWLSIYLSIYLS 

lYLSIYF 


13823 


27724 


A 


13991 


296 


53 


SS ILFHASKYHFSGRLFSLLSCKRRNIL 
QLVPHMYSPYLYMYIHTHKHIYEYMYVH 
lYIYMNLRNCRIYLHSCTPKENQAR 


13S24 


27725 


A 


13992 


37 


290 


RLGSPASEHQQIQCLIHACLKDGTLGWV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 

reJfdueof 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cjsteine, D=Aspartic Add, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WWLIPVIPALWEAKVGGSLEPRSSTPAW 
ATQGDLILKKDGTLLQCPHVAKGHSSHS 


13825 


27726 


A 


13993 


189 


370 


SRNAIFKKRWESSCQSSFNSLGVFPQAA 
SRPA 


13826 


27727 


A 


13994 


179 


25 


S KHNKKNIQ I CLL CS KRSMFVHVC I HI H 
ICVYVYRYAYIYIGIVICIHIYIHS 


13827 


27728 


A 


13995 


159 


2 


EDPLSPGVQDQHGHHRKSLSGRGGKISL 
GQEFKSSMAAMYF6RSGGCKLEER 


13828 


27729 


A 


13996 


307 


99 


FIWEHFVNYTMFFTYKKPTSQAWWLMPV 
VPALWETEAGGSPEVRSPRPPGQHDDVR 
PLGPSLHVYLQMD 


13829 


27730 


A 


13997 


218 


21 


FQHFGRPHRVDHLSLGVQDQPGQHGETP 
SPLASSILDLIPPPFPIDEPVDLAFQSV 
TLGVQDTTS 


13830 


27731 


A 


13998 


227 


404 


GYSVTHLLGLTQLTQRIAHGTRKGERER 
ERERERERERVCSCLMHELflSHLLFFFL 
FRE 


13831 


27732 


A 


13999 


400 


668 


PVGRRWELQGMGLGVRTWQSSLPCQGPT 
SSSVLSLGRGTPGISNTCLPPRSATELG 
LPAQVPFPNVQSQDQDLSFSLGCFPRTY 
NRQTL 


13832 


27733 


A 


14000 


225 


414 


NRVIiCHTGWSTVAILAYCSFKLLGSRD 
SPTLASRAARTIVKCRNVWLIKKKKKKK 
KGGRRL 


13833 


27734 


A 


14001 


397 


118 


LFSGPFFFDKSECFQEPTVPFCLLLYNQ 
MQPYLLRSAFVDSYYMYDSFLRDRVSFC 
QSGWSTVAPSRLTAALNSLAQVILLPQP 
LNVLCGISR 


13834 


27735 


A 


14002 


3 


173 


YVFYRQKQMVCFYKIRI KNMFNRNRUJIL 
GQWCVPIVPATQVAEAGGSPEPTSLRP I 


13835 


27736 


A 


14003 


440 


290 


YTHTLYIYIYIHTHTHTHTYIYLCTYWG 
QFDLLVLDNKLYKEIYYLSVFI 


13836 


27737 


A 


14004 


2 


116 


ARLVAMPFKYEEIKDFLLTARRKDAKSV 
KIKKNKSAAV 


13837 


27738 


A 


14005 


279 


422 


SAKAPLSCLRTNSLLLIOJGLRGWAWWIT 
PVIPTLWEAEVGGSPDVTSP 


13838 


27739 


A 


14006 


3 


497 


GGIGDSRCGSTRASSSPQLflGRSSSVLP 
AAAQPCTPTMDVFKKGFSIAKEGWGAV 
EKTKQGVTEAAEKTKEGVMYVGAKTKEN 
WQS VTS VAEKTKEQANAVSEAWS SVN 
TVATKTVEEAENIAVTSGWRKEDLRPS 
APQQEGEASKEKEEVAEEAQSGGD 


13839 


27740 


A 


14007 


1 


292 


SLGGGGCNDPRSYHCTPAWATEQDTVSL 
SLSIYMSVRVCIYMYMCVCVCIRIYTYI 
HTHIHIRVYIYTIPYIWGIYIYIYVDLV 
lYMGWCGCPPPSI 


13840 


27741 


A 


14008 


57 


250 


KI FFFLMKHTWGGVPWLATWEAEVGGS 
LEPRSLGLQETMMTSLLSRLPRQQGEIL 
SLILKRNY 


13841 


27742 


A 


14009 


184 


2 


AVPGRWPCQQVPGLLPSIQDTLKQGRTG 
QWLTPVTPMPWEAEAGGSLDARSLRPAW 
ATGK 


13842 


27743 


A 


14010 


123 


3 


KEP SRPGRWLTS 1 1 PACWEAEAGGPPEA 
RSSTREFRTAK 


13843 


27744 


A 


14011 


403 


168 


GTGLRASRCISQPVMSGLLSPSEFPFPE 
lESHLHVSQRETAIUiLRDRERERARERE 
RERERGKNASSGKPPGSASCVP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQIO 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

; location 
j correspon 

lasf^amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V--Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Un!inown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13844 


27745 


A 


14012 


207 


422 


EIRRGSLAGPLSLNFILSVMGSHSKYLR 
KVIGRAQWLMPWPALWEAEVGKLHELR 
RLGHTWAAW 


13845 












WLCVLGWSLLftliSPRLECYGPISAHCNL 
HLLRSSDS PASASQALQEAWYQHLLLMM 
PQAflSTQVGRKK 


13846 


27747 


A 


14014 


319 


176 


KPSTITFLFLHLPISSRESRPVTQTEVK " 
WLNHSSPQPPTPRFSLPSS 


13847 




A" 








EQENKNTLNSPGWDGVHIHVLRFLNTRM 
PLGHLSFSEGVSGSGSSGRRCQAGAGGR 
SATAAARPCQCLGLKGLPPAVPSCGLEG 
KAPERAGYSASHPPHGCETGNGWVLYVN 


13848 


27749 


A 


14016 


416 


63 


VSKVYPKINHTQRTVRHETFKIAMPKYY 
SNLCSIljRFNKDRIiAFMHHYTKQCSRTS 
ITEYIAMPFFVCLFVLKIGAHCHPGWira; 
WLVCSQVTAASTSEAQAILPPQPPKIL 


13849 


27750 


A 


14017 


196 


425 


slmfhpplfsfccfcirypfifvffsll" " 

CFAFSWCSLLLFFFHYLKLFLliFLLFF 
LPVLLdiD SFLVHDFFLLS S 


13850 


27751 


A 


14018 


3 


233 


SSRMGRGTKTIAFDMRWSCEVDRARHRD 
REREREREREREHEKGAEPQEFPVI SPW 
VDRPCLAGILHLKCGCGHRL 


13851 


27752 


A 


14019 


163 


1 


AtlRWHDQICVEREIWGWVQWLMPVIPVli 
WEAKAKNLLEPKNLKLQHNRRPGNSR 


13852 


27753 


A 


14020 


203 


83 


LKKKKNKSQAQWLTSWLASQEABAGGS ' 
LEPRNSRPAWAT 


13S53 


27754 


A 


14021 


179 


3 


VTWQRELRLQMELRLLICRAWWLTPVIP 
VPWEAKAGGSPEPRSSRPAWETRNRRPG 
NS 


13854 


27755 


A 


14022 


358 


431 


WHKKKNQNWLGTGAHACNPSTLGG 


13855 


27756 


A 


14023 


98 


3 


GRPAGWLTPVAPTLWEAEMGGSLEPRSL ■ 


"13856 


27757 


A 


14024 


254 


385 


RENWAEPSSHSLLNIFLQLGTVARACNP 
STLC3GRGGWITRSGD 


13857 


27758 


A 


14025 


140 


1 


KGVLDGPRGLCVFCFCFFFLCVCVCVCV 
CVCVWRRSLTLSPRLQCI 


1385S 


27759 


A 


14026 


579 


852 


QREWVGWAGKEGEGWVSHVPASAQLPAN 
GQRGQPHPSPLGGTGSTWAQEGAYCGLS 
SCSHCCCCSSCCCCCPPGFCLFPSLGAI 
CHLYHL 


13859 


27760 


A 


14027 


30 


663 


LRIRALRELPASHIPGSLTICCVPRPPL 
PCSSTKPDAKGYKPLAQQKLGSQRTGPC 
LGHAPCYS PLWELRGHGCSPYGP S PSGS 
VSLWQEAMRLPKWTPEEKDRRTAALQEG 
LRRAVSVPLTLAETVASLWPALQELARC 
GNLACRSDLQVAAKALEMGVFGAYFIWL 
INLRDITDEAFKDQIHHRVSSLLQEAKT 
QAALVLDCLETRQE 


13860 


27761 


A 


14028 






AVGVHHAFHLPHCFFASLLESPVSPRLA 
MDPNCSCAAGVSCTCAGSCKCKECKCTS 
CKKSCCSCCPVGCSKCAQGCVCKGASEK 
CSCCD 


13861 


27762 


A 


14029 


46 


1746 


PAAGAATMEFRQEEFRKLAGRALGKLHR 
rjLEKRQEGAETLELSADGRPVTTQTRDP 
PWDCTCFGLPRRYI lAIMSGLGFCISF 
GIRCNLGVAIVSMVMSrSTTHRGGHVWQ 
KAQFSWDPSITVGLIHGSFFWGYIVTQIP 
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SEQID 
NO: of 
nutleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 
beginning 

nucleotide 

correspond 

residue of 
sequence 


Predict- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plien)'lalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine,lVI=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unltnown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGFICQKFAflHRVFGFAIVATSTLNMLI 
P SAARVHYGCVI FVRILQGLVEGVTYPA 
CHGIWSKWAPPLERSRLATTAFCGSYAG 
AWAMPIAGVLVQYSGWSSVFYVYGSFG 
I FW YLFWLLVSYES PALHPS I SEEERKY 
lEDAIGESAKLMNPLTKFSTPWRRFFTS 
MPVYAI IVANFCRSWTFYLLLISQPAYF 
EEVFGFEISKVGLVSALPHLVMTIIVPI 
GGQIADFLRSRRIMSTTMVRKLMNCGGF 
GME ATLLLWGYSHS KGVAI S FLVLAVG 
FSGFAISGFNVNHLDIAPRYASILMGIS 
NGVGTLSGMVCP I I VGAMTKHKTREEWQ 
yVFLIASLVHYGGVIFYGVFASGEKQPW 
AEPEEMSEEKCGFVGHDQLAGSDDSEiyiE 
DEAEPPGAPPAPPPSYGATHSTFQPPRP 
PPPVRDY 


13862 


27763 


A 


14030 


240 


63 


VFTAQSNGGVELNRFISEGPLSAVLKGRV 
AWAQWLTPVI PTIiWEVKflGGIiLEARSLS 
PA 


13863 


27764 


A 


14031 


289 


414 


HaEMGRVQWLTPAlPALCKAEACSGPPEP 
RSLRPAWATY 


13864 


27765 


A 


1403?. 


386 


1765 


LGDARAPEKMSAIQAAWPSGTECIAKYN 
FHGTAEQDLPFCKGDVLTIVAVTKDPNW 
YKAKNKVGREGIIPANYVQKREGVKAGT 
KLSLMPWFHGKITREQAERLLYPPETGL 
FLVRESTNYPGDYTLCVSCDGKVEHyRI 
MYHASKLSIDEEVYFENLMQLVEHYTSD 
ADGLCTRLIKPKVMEGTVAAQDEFYRSG 
WALNMKELKLLQTIGKGEFGDVMLGDYR 
GNKVAVKCIKNDATAQAFLAEASVMTQL 
RHSNLVQLLGVIVEEKGGLYIVTEYMAK 
GSLVDYLRSRGRSVLGGDCLLKFSLDVC 
EAMEYLEGNNFVHRDLftaRNVLVSEDNV 
AKVSDFGLTKEASSTQDTGKLPVKWTAP 
EALREKKFSTKSDVWSFGILLWEIYSFG 
RVP YPRI PLKDWPRVEKGYKMDAPDGC 
PPAVYEVMKNCWHLDAfllMEPSFLQLREQ 
LEHIKTHELHL 


13863 


27766 


A 


14033 


619 


414 


EQARCLEQHACTSPRQPRPQLQCSELKG 
HSLGMEHQGHVPCLHPCSPCPPVKASPA 
CWSSNPAPLCPH 


13866 


27767 


A 


14034 


2 


614 


LESRPDGRPSSTHPASPSAFSAPGKPHP 
PEAKMSSKRAKAKTTKKRPQRATSNVFA 
MFDQ SQ IQE FKEAFKMIDQNRDGF IDKE 
DLHDMIASLGKNPTDEYLEGMMSEAPGP 
INFTMFLTMFGEKUSTGTDPEDVIRNAFA 
CFDEEASGFIHEDHLRELLTTMGDRFTD 
EEVDEMYREAPIDKKGNFKYVEFTRILK 
HGAKDKDD 


13867 


27768 


A 


14035 


1 


254 


AVEFGPAGPGSLGRAAflMIIPVRCFTCG 
KIVGNKWEAYLGLLQAEYTEGDALDALG 
LKRYCCRRMLMHVDLIEKLLNYAPLEK 


13868 


27769 


A 


14036 


494 


148 


FRFLSDCGVFAEGHIELQVESGVPLGFS 
TMAEDMETKIKNYKTAPFDSRFPNQNQT 
RNCWQNYLDFHRCQKaMTAKGGDISVCE 
WYQRVYQSLCPTSWVTDWDEQRAEGTFP 
GKI 


13869 


27770 


A 


14037 


130 


3969 


IMGDVKWFLYAWCGKRKMTPSYEIRAVG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


otidc 
location 
correspon 
ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prol!ne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Val!ne, 
W=Tryptophan, Y=Tyrosine, 
X=€nknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NKMRQKFMCEVQVEGYNYTGMGNSTNKK 
DAQSNAARDFVNYLVRINEIKSEEVPAF 
GVASPPPLTDTPDTTANAEGDLPTTMGG 
PLPPHLALKAENNSEVGASGYGVPGPTW 
DRGANLKDYYSRKEEQEVQATLESEEVD 
LNAGLHGNWTLENAKARLNQYFQKEKIQ 
GEYKYTQVGPDHNRSFIAEMTIYIKQLG 
RRIFAREHGSNKKLAAQSCALSLVRQLY 
HLGWEAYSGLTKKKEGETVEPYKVNLS 
QDLEHQLQNIIQEUSTLEILPPPEDPSVP 
VALNTGKLAQFEPSQRQNQVGWPWSPP 
QSNWNPWTSSNIDEGPLAFATPEQISMD 
LKNELMYQLEQDHDLQAILQERELLPVK 
KFESEILEAISQNSWIIRGATGCGKTT 
QVPQFILDDFIQNDRAAECNIWTQPRR 
ISAVSVAERVAFERGEEPGKSCGYSVRF 
ESVLPRPHASIMFCTVGVLLRKLEAGIR 
GISHVIVDEIHERDINTSFLLWLRDW 
QAYPEVRIVFMSATIDTSMFCEYFFNCP 
SLKLWRTYPVQEYFLEDCIQMTHFVPPP 
KDKKKKDKDDDGGEDDDMICNLICGDEY 
GPETRLSMSQLNEKETPFELIEALLKYI 
ETLNVPGAVLVFLPGWMLIYTMQKHLEM 
NPHFGSHRYQILPLHSQIPREEQRKVFD 
PVP VGVTKVILSTNIAETS IT INDWYV 
IDSCKQKVKLFTAHIJNMINYSTWASKT 
NLEQRKGRAGRSTAGFCFHLCSRARFER 
LETHMTPEMFRTPLHE lALS I KLLRLGG 
IGQFLAKAIEPPPLDAVIEAEHTLRELD 
AUJAIMDELTPLGRILAKLPIEPRFGKMM 

imgcifyvgdaictiaaatcfpepfvne 
gkqlgyihrnfagnrfsdhvallsvfqa 
wddarmgge3as irfcehkrlnmatlrm 
tweakvq:,ksii,insgfpedclltqvft 
ntgpdnnldwisllafgvypkvcyhke 
krkilttegrnalihkssvncpfssqdm 
nypspffvfgeki rtraisakgmtlvp p 

LQLLL FASKKVQSDGQ I VLVDDW I KLQ I 

sheaaacitglraamealwevtkqpai 
isqldpvnermlnmirqisepsaaginl 
migstrygdgprppkmarydngsgyrrg 

GSSYSGGGYGaaYSSGGYaSaGYaaSAT 
PSGRICAGVGGGYEGVSRGGFRGNSGGD 
YRGPSGGYRGSGGFQRGGGRGAYGTGYL 
DIBEEVAAIKLGYVSSVCRQ 


13870 


27771 


A 


14038 


431 


542 


EGITPGWARWLTPVIPGLWEEEAGGSPG 
REIDIGLAN 


13871 


27772 


A 


140'! 0 


238 


362 


RRCTFLSFRMLAPRVYSTAVGKRAFSPSV 
CVRAHGKCBYSYS 


13872 


27773 


A 


14041 


6290 


3514 


feaagssstnsgricplpsgallyqseg 
llarphgkgsfqvgrqqhqavrvgsaha 
ssagyvcvsafpfvfiirfcvgggpgisr 
vyalfyqecatptrewavsselspsfqeq 
nkmnkveqksqesvsfkdvtvgftqeew 
qhldpsqralyrdvmlenysnlvsvgyc 
vhkpevifrlqqgeepwkqeeefpsqsf 
pevwtadhlkersqknqskhlwewfim 
nemltkeqgdvigipfnvdvssfpsrkm 
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SEQID 
NO; of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first • 

residue of 


Predict- 
ed end 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=nireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FCQCDSCGMS FNTVSELVI S KINYLC3KK 
SDEFNACGKLLLNI KHDETHTREKNEVL 
KNRNTLSHRENTLQHEKIQTLDHNFEYS 
I CQETLLEKAVFNTRKRENAEEWWCDYN 
EFGRTFCDSSSLLFHQISPSRDNHYEFS 
DCEKFLCVKSTLSKPHGVSMKHYDCGES 
GNNFRRKLCLSHLQKGDKGEKHFECNEC 
G KAFWE KSHLTRHQRVHTGQ KP FQ CNE C 
EKAFWDKSNLTKHQRSHTGEKPFECNEC 
GKAFSHKSALTLHQRTHTGE KPYQCNAC 
GETFYQKSDLTKHQRTHTGQKPYECYEC 
GKSFCMNSHLTVHQRTHTGEKPFECLEC 
GKSFCQKSHLTQHQRTHIGDKPYECNAC 
GKTFYHKSVLTRHQIIHTGLKPYECYEC 
GKTFCLKSDLTIHQRTHTGEKPFACPEC 
GKFFSHKSTLSQHYRTHTGEKPYECHEC 
GK I F YNKS YLTKHNRTHTGE KP YE CNE C 
GKTFCQKSQLTQHQRIHIGEKPYECNEC 
GKAFCHKSALIVHQRTHTQEKPYKCNEC 
GKSFCVKSGLIFHERKHTGEKPYECNEC 
GKF FRHKS SLT VHHRAHTGE KS CQ CNE C 
GKIFYRKSELAQHQRSHTGEKPYECNTC 
RKTFSQKSNLIVHQRRHIGENI.MNEMDI 
RNFQPQVSLHNASE YSHCGE SPDD I LNV 
Q 


13873 


27774 


A 


14043 


187 


422 


LTIPKLHSAQEPAGPVSPPPPPSPTFQI 
GGRSRAEPRNQYCPQVAPVPALRGCLPL 
SPGEPPHDTSSPEHLTCGGIVF 


13874 


27775 


A 


14044 


340 


442 


NKFLSWVRLLMPVIPTFWEAKVGGWLEA 
RSLRMQ 


13875 


27776 


A 


14045 




1240 


LVEGAAGQGVSDGARLRKCGTRSFPGSE 
EVLSSMaRGSAFPAAAIiWLWS ILLCLLA 
LRAEAGPPQEESLYLWIDAHCJARVLIGF 
EEDILIVSEGKMAPFTHDFRKAQQRMPA 
I SVKIHSMNFTWQAAGQAEYFYEFLSLR 
SLDKGIMADPTVNVPLLGTVPHKASWQ 
VGFPCLGKQDGVAAFEVDVIVMNSEGNT 
ILQTPQMAI FFKTCQQRECPGGCHNGGF 
CNERRICECPDGFHGPHCEKALCTPRCM 
NGGLCVTPGFCICPPGFYGVNCDKANCS 
TTCFNGGTGFYPGKCICPPGLEGEQCEI 
SKCPQPCRNGGKCIGKSKCKCSKGYQGD 
IiCSKPVCEPGCGftHGTCHEPNKCQCQEG 
WHGRHCMKRYEftSLIHALRPAGAQLRQH 
TPSLKKAEEREDPPESNYIW 


13876 


27777 


A 


14046 


18 


274 


YSQEVLCEMNRSGWSNIKRGWAWWQLT 
TTIPALWEAEVGGLLEPRSLRLASPQPP 
QHEDYSMSYCaQPRFSLCIRGFI FYSLW 
F 


13877 


27778 


A 


14047 


177 


441 


GGSHYAWPNITFIFTKLIAQFFCFFEMQ 
SCS VARLECRGAI SAHCNLHLPGSSNSP 
VMRQILKNQRKSNFDEDATLLGLSYTLL 

GEI 


13878 


27779 


A 


14048 


136 


319 


DVFLLLFHSNYFLFFETGSHSVSQAGVK 
WCDLSSLQPPPLGSSDPPTSASRVPALF 
FFLEF 


13879 


27780 


A 


14049 


388 


128 


ELECLPSYKGQSPQLSURRYFADLIAIVS 
NRFTLCPSAEHLAVYLLDLFMDR YD I S I 



1391 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 

NO: of 

sequence 


M 
eth 
od 


SEQ 10 
NO: in 
USSN 
09/515,1 
25 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 

last amino 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QQLHLVALSCLLLASKYESDLHDWKFLM 
FI 


13880 


27781 


A 


14050 


262 


3 


VIYNYISVRPFPNIYHTFYTLSLFLYLS 
THTHTHTHTHTHTQRETGHVCVCI YTYIVI 
YMCMYAKWWYKFTFLISGKRRIHTHLHP 


13881 


27782 


A 


14051 


149 


423 


YPHLCAFLLLPSISSFRAGVFFFSYLSL 

TAYPLTWKPPEPPATPQPTPNPPPGHPA 
TRGPKSALLDHSYACASAPPLGTEGTAP 
SPSFALI 


13882 


27783 


A 


14052 


2 


1440 


FVAVTAMAAPCLLRQGRAGALKTMLQEA 
QVFRGIASTVSLSAESGKSEKGQPQNSK 
KQSPPKNWEPKERGKLLATQTAAELSK 
NLSSPSSYPPAVKKGRKVaSPSPSGSVL 
FTDEGVPKFLSRKTLVEFPQKVIiSPFRK 
QGSDSEARQVGRKVTSPSSSSSSSSSDS 
ESDDEADVSEVTPRWSKGRGGLRKPEA 
SHSFENEAFRVTVSAKEKTLLQKPHVDI 
TDPEKPHQPKKKGSPAKPSEGRENARPK 
TTMPRSQVDEEFLKQSLKEKQLQKTFRL 
NEIDKESQKPFEVKGPLPVHTKSGLSAP 
PKGSPAPAVLAEEARAEGQLQASPPGAA 
EGHLEKPVPEPQRKAAPPLPRKETSGTQ 
GIEGHLKGGQAIVEDQIPPSNLETVPVE 
NNHGFHEKTAALKLEAEGEAMEDAAAPG 
NDRGGTQEPAPVPAEPFDMTTYKNLQHH 
DYSTYTFLDLNLELSKFRMPQPSSGRES 
PRH 


13883 


27784 


A 


14053 


177 


1253 


EKKRTLPVRSVTGELQGKSLSDLAAGTM 
DSEKKRFTEEATKYFRERVSPVHLQILL 
TNNEAWKRFVTAAELPRDEADALYEALK 
KLRTYAAIEDEYVQQKDEQFREWFLKEF 
PQVKRKIQESIEKLRALSNGIEEVHRGC 
TISNWSSSTGftASGIMSLAGLVLAPFT 
AGTSLALTAAGVGLGAASAVTGITTSIV 
EHSYTSSAEAEASRLTATSIDRLKVFKE 
VMRDITPNLIiSLLiiUaYYEATQTIGSEIR 
AI RQARARARLPVTTWRI SAGSGGQAER 
TIASTTRAVSROARILSATTSGIFIALD 
WNLVYESKHLHEGABCSASAEELRRQAQ 
ELEENLMELTQIYQRLNPCHTH 


13884 


27785 


A 


14054 


1585 


2099 


ICVKTFPPLALQVRMAAftEHHHSSGLPy 
WPYLTAETLKNRMGHQPPPPTQQHSITD 
NSLSLKTPSERIiLYPLPPSAPPSADDNL 
KTPPECLLTPLPPSALPSMDNLKTPAE 
CLLYPLPPSADDNLKTPPECLFTPLPPS 
ftPPSVDDNLKTPPECVCSLPFHPQRMIl 
SRN 


13885 


27786 


A 


14055 


2 


2865 


ALPDGGASVASDRAEGRPAKPSKTAARE 
KTEGAVAAVGGGPSSFRCCYGCCHEARL 
GRTSLPRGVTMLTEASLSIWGWGSLGIV 
LFLITFGPFVIFYLTFYILCFVGGGLW 
TLLFGKTWSEKYLEQCEHSFLPPTSPGV 
PKCLEEMKREARTIKlDRRLTQftNlIDE 
PLQQVIQFSLRDYVQYMYTTLSDDESFL 
LEIRQTLQNALIQFATRSKEIDWQPYFT 
TRIVDDFGTHLRVFRKAQQKI TEKDDQV 
KGTAEDLVDTFFEVEVEMEKEVCRDLVC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, l>=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TSPKDEEGFLRDLCEVLLYLLLPPGDFQ 
NKIMRYFVREILARGILLPLINQLSDPD 
YIWQYVIWMIRDSNCMYEAFMNIIKLSD 
NIGELEAVRDKAAEELQYLRSLDTAGDD 
INTIKETQINSLLFVKKVCDSRI QRLQSG 
KE INT VKLAaNFGKLCTVP LDS I LVDNV 
ALQFFMDYMQQTGGQAHLFFWMTVEGYR 
VTAQQQLEVLLSRQRDGKHQTNQTKGLL 
RAAAVGIYEQYLSEKASPRVTVDDYLVA 
KLADTLNHED PT PE I FDD I QHKVYE LML 
RDERFYPSFRQNALYVRMLAELDMLKDP 
SFRGSDDGDGESFNGSPTGSINLSLDDL 
SNVSSDDSVQLHAYISDTVYADYDPYAV 
AGVCNDHGKT YALYAI TVHRRNLNSEEM 
WKT YRR YSDFHD FHMR ITEQFESLSSIL 
KL P E KKTFNNMDRD FLEKRKKDLNAYLQ 

KGKGD FARKMDT FVNPLRNS MRNVSNAV 
KSLPDSLAEGMTKMSDNMGKMSERLGQD 
IKQSFFKVPPLIPKTDSDPEHRRVSAQL 
DDNVDDNIPLRVMLLLMDEVFDLKEENQ 
WLRRNIKNLLQQLIRATYGDTINRKIVD 
HVDWMTSPEQVADSVKRFRDAFWPNGIL 
AEAVPCRDKS IRMRTRVAGKTKLLAIMP 
GE 


13886 


27787 


A 


14057 


311 


150 


FLCFKYRRGFATLHSGLKLLGSSDPPSS 
TSQIPGITGTSHCVQPTHLFFLALS 


13887 


27788 


A 


14058 


53 


211 


RQHITCLDIFFIHMNYKYKYINIYLSIY 
LSIYLSIYLIYLSISHSSYITERA 


13888 


27789 




14059 




1642 


RDGRKMATATIALQWGQQGGGSEPAAA 
AAWAAGDKWKPPQGTDS IKMENGQSTA 
AKLGLPPLTPEQQEALQKAKKYAMEQSI 
KSVLVKQTIAHQQQQLTNLQMRAQRQRA 
LAIMCRVYVGS lYYELGEDTIRQAFAPF 
GPIKS IDMSWDSVTMKHKGFAFVEYEVP 
EAAQLMiEQMKSVMLGGRNIKVGRPSNI 
GQAQPIIDQEAEEARAFKRIYVASVHQD 
LSDDDIKSVFEAFGKIKSCTLflRDPTTG 
KHKGYGFIEYEKAQSSQDAVSSMNLFDL 
GGQYLRVGKAVTPPMPLLTPATPGGLPP 
AAAVAAAAATAKITAQEAVAGAAVLGTL 
GTPGLVS PALTLAQPLGTLPQAVMjyiQA 
PGVITGVTPARPPIPVTIPSVGWNPIL 
ASPPTLGLLEPKKEKEEEELFPESERPE 
iMIiSEQEHMS ISGSSAHHMVNQKLLRKQE 
STVMVLRMMVDPKDIDDDLEGEVTEECG 
KPGAVNRVIIYQEKQGEEEDAEIIVKIF 
VEFS lASETHKAIQALNGRWFAGRKWA 
EVYDQERFDNSDLSA 


13889 


27790 


A 


14060 


1238 


1474 


VIALQEGRPWRREPASIDACRLNFORLR 
RGKFSNVLFPGLAQEALYSGGYHLKFAD 
ELMGGNLKKSTADASGSRGHQL 


13890 


27791 


A 


14061 


266 


400 


GQWARPVIPALMEAKflGGSLEPSMLRPA 
GQHIKTPSVLITSKKT 


13891 


27792 


A 


14062 


2178 


1881 


VLQAPSILLDAPRTDGGDMGRAMVARLG 
LWLLLLALLLPTQIYSSETTTGTSSNSS 
QSTSNTGLAPNPTNATTKAAGGALQSTA 
SLFWSLSLLHLYS 



1393 



w o 01/64835 



PCT/llSOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13892 


27793 


A 


14063 


261 




NSVLTLALLFSLHVHLPPPPSGVTDTAG 
ALSTGGPPSPTSVTGRGPAHSHASQLPP 
APGEFAPLMESCRGWAGEAFLERFPDAW 
VDP 


13893 


27794 


A 


14064 


2384 


1115 


QHFSRRGLCWEQRSSVTSSWTSGAWSP 
PCPPSNASCNTLHTRDWASPDPGGQGSL 
GESPGPAPPGQLHTLDTDLHSIiAQIGGK 
SPVAGVGNGGSLWPRESPGTANGHSPEH 
TPPGPGPPGPCPTKRRLLPAGEAPDVSS 
EEEGPAPRRRRGSLGHPTASNSSDAKAT 
PFWSHLLPGPKEPVLDPTDCGPMGRRLK 
GAERLBCLSPLRSLRKGPGLLS PPRASPV 
PTPAVSRTLLGNFEESLLRGRFAPSGHI 
EGFTAEIGASGSYCPQHVTLPVTVTFFD 
VSEQNAPAPFLGIVDHNPLGRKGYSVPK 
VGTVQVTLFNPNQTWKMFLVTFDFSDM 
PAAHMTFLRHRLFLVPVGEEGNANPTHR 
LLCYLLHLRFRSSRSGRLSLHGDIRLLF 
SRRSLELDTGLPYELQAVTEAPHNPRYS 
PLP 


13894 


27795 


A 


14065 


232 


416 


GWAVQLGIGTEGKEGTSSSERQREGRGD 
GNRKTGRRREGRRSTKTnAPS PYPAIOIP 
RSKLI 


13895 


27796 ; A 


14067 


238 


45 


iqtcgshsfcipsIweAkAgglleprsl 
spacptqkdpishlgselpqsfrllirl 

VPLFLSAC 


13896 


27797 




14068 


3 


1705 


SCESKATPWRAVSASQELQHPQGGQRSP 
LPGDLTPARPNPAYPLTVECQRSCSRPV 
PAPLPPHTHPPGSSCFY3SFSFITKATA 
PGAQRRAVTQAERGRMGFLGTGTWILVL 
VLP IQAFPKPGGSQDKSLHNRELSAERP 
LNEQIAEAEEDKIKKTYPPENKPGQSNY 

PLDNKIiNVEDVDS TKNRKLIDDYDS TKS 
GLDHKFQDDPDGLHQLDGTPLTAEDIVH 
KIAARIYEENDRAVFDKIVSKLLNLGLI 
TESQAHTLKUEVAEVLQKL IS KEANNYE 
EDPNKPTSWTENQAGKIPEKVTPMAAIQ 
DGLAKGENDETVSNTLTLTNGI.ERRTKT 
YSEDNFEELQYFPNFYALLKS IDSEKEA 
KEKETLITIMKTLIDFVKMMVKYGTISP 
EEGVSYLElilLDEMIALQTKNKIEKNATD 
NISKLFPAPSEKSHEETDSTKEEAAKME 
KEYGSLKDSTKDDNSNPGGKTDEPKGKT 
EAYLEAIRKMIEWLKICECDKKGNKEDYDL 
SKMRDFINKQADAYVEKGILDKEEAEAI 
KRIYSSL 


13897 


27798 


A 


14069 


1 


140 


NTSVAIQTI EYYSAFKRKETLTHVPLWM 
NLKDTMLRE I SQSQKDTV 


13898 


27799 


A 


14071 


212 


104 


HTNHQCQTVLPAGQALATPQGLAPS PVF 
PQSCLRMV 


13899 


27800 


A 


14072 


358 


112 


FPLTVMLNVFLWWQNLKHFCWFIQEQLM 
FFFFFFLRTGSHYWQAGLELLGSTNLP 
ASVSRVAGTTGIHNCTQFNPLHTLHL 


13900 


27801 


A 


14073 


3 


151 


YMGFHHVGYAGLELLTSSDLPPWPPKCW 
DYHHEPSCLAMFFYFALIAPE 


13901 


27802 


A 


14074 


8 


1493 


VTIHHLFVGQAVEADTLKKKSPSCLLCI 
VFIPVPRIDLTNIDQVAVI FKHHFPVGR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQJD 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lyslne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop corton, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GDAVLKTWAPAQCLCSRMGPAWLWLLGT 
GILASVHCQPLLAHGDKSLQGPQPPRHQ 
LSEPAPAYHRITPTI TNFALRLYKELAA 
DAPGNIFFSPVSISTTLALLSLGAQANT 
SALILEGLGFNLTETPEADIHQGFRSLL 
HTLALPSPKLELKVGNSLFLDKRLKPRQ 
HYLDS IKELYGAFAFSMFTDSVTTGRQ 
INDYLRRQTYGQWDCLPEFSQDTFMVL, 
ANY IFFKAKWKHP FSRYQTQKQES FFVD 
BRTSLQVPMHOHQKEMHRFLYDQDLACTV 
LQIEYRGNALALLVLPDPGKMKQVEAAL 
QPQTLRKWGQLLLPSLLDLHLPRFSISG 
TYNLEDILPQIGLTNILNLEADFSGVTG 
QLNKT ISKVSHKAMVDMSEKGTEAGAAS 
GLLSQPPSLNTMSDPHAHENRPFLLLLW 
EVTTQSLLFLGKWNPVAG 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected firom the group 
consisting ofSEQ ID NO: 1-13901 , a mature protein coding portion ofSEQ ID NO: M3901 , 
an active domain of SEQ ID NO: 1-13901 , and complementary sequences thereof. 

5 

2. An isolated polynucleotide encoding a polj^jeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 

conditions. 

10 3 . An isolated polynucleotide encoding a polj^eptide with biological activity, wherein said 
polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

15 5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 

complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 
20 7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the pol5Tiucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the pol5mucleotide of claim 1 operatively 
25 associated with a regulatory sequence that modulates ejqjression of the polynucleotide in the host 

cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 
30 (b) a polypeptide encoded by a polynucleotide hybridizing under stringent 

conditions with any one of SEQ IDNOS: 1-13901. 

11. A composition comprising the polypeptide of claim 1 0 and a carrier. 
35 12. An antibody directed against the polypeptide of claim 1 0. 
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13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polynucleotide of claim 1 for a period sufficient to form the complex; and 
5 b) detecting the complex, so that if a complex is detected, the polynucleotide 

of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
10 nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the polynucleotide of 

claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

15 sample. 

1 5 . The method of claim 14, wherein the polynucleotide is an RNA molecule and the method 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. 

20 16. A method for detecting the polypeptide of claim 1 0 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to foim the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 

25 

17. A method for identifying a compotind that binds to the polypeptide of claim 1 0, 
comprising: 

a) contacting the compound with the polypeptide of claim 10 under 
conditions sufficient to form a polypeptide/compound complex; and 
30 b) detecting the complex, so that if the polypeptide/compound complex is 

detected, a compound that binds to the polypeptide of claim 10 is identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 
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a) contacting the compound with the polypeptide of claim 10, in a cell, under 
conditions sufficient to form a polypeptide/compound complex, wherein the complex drives 
expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
5 that if the pol5^eptide/compound complex is detected, a compound that binds to the polypeptide 

of claim 10 is identified. 



19. A method of producing the polypeptide of claim 10, comprising, 

a) cultiu-ing a host cell comprising a polynucleotide sequence selected fi'om 
10 the group consisting of a polynucleotide sequence of SEQ ID NO: 1-13901 , a mature protein 
coding portion of SEQ ID NO: 1-13901 , an active domain of SEQ ID NO: 1-13901 , 
complementary sequences thereof and a poljTiucleotide sequence hybridizing under stringent 
conditions to SEQ ID NO: 1-13901 , under conditions sufficient to express the polypeptide in 
said cell; and 

15 b) isolating the polypeptide firom the cell culture or cells of step (a). 



20. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 13902-27802, the mature protein portion 
thereof, or the active domain thereof. 



21 . The pol3fpeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 



A collection of polynucleotides, wherein the collection comprises the 
sequence information of at least one of SEQ ID NOS: 1-13901. 



The collection of claim 22, wherein the collection is provided on a nucleic 
acid anay. 



The collection of claim 23, wherein the array detects full-matches to any 
one of the polynucleotides in the collection. 

The collection of claim 23, wherein the array detects mismatches to any 
one of the polynucleotides in the collection. 
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26. The collection of claim 22, wherein the collection is provided in a 
computer-readable format. 

27. A method of treatment comprising administering to a mammalian subj ect in need 

thereof a therapeutic amount of a composition comprising a polypeptide of claim 
10 or 20 and a pharmaceutically acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subj ect in need 

thereof a therapeutic amount of a composition comprising an antibody that 
specifically binds to a polypeptide of claim 10 or 20 and a pharmaceutically 
acceptable carrier. 
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